Volume 22, Issue 1, 2014, Pages 138-150

Multi-pitch streaming of harmonic sound mixtures

Author keywords

Cochannel speech; Constrained clustering; Multi pitch analysis; Pitch streaming; Timbre tracking

Indexed keywords

ACOUSTIC GENERATORS; CLUSTERING ALGORITHMS; ESTIMATION; HARMONIC ANALYSIS; CONSTRAINED OPTIMIZATION;

EID: 84897936477     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2013.2285484     Document Type: Article
Times cited : (48)

References (51)
  • 2
    • 64849117345
    • Unsupervised single-channel music source separation by average harmonic structure modeling
    • May
    • Z. Duan, Y. Zhang, C. Zhang, and Z. Shi, "Unsupervised single-channel music source separation by average harmonic structure modeling," IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 4, pp. 766-778, May 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.4 , pp. 766-778
    • Duan, Z.1    Zhang, Y.2    Zhang, C.3    Shi, Z.4
  • 4
    • 69249202377
    • Monaural speech separation and recognition challenge
    • M. Cooke, J. R. Hershey, and S. Rennie, "Monaural speech separation and recognition challenge," Comput. Speech Lang., vol. 24, pp. 1-15, 2010.
    • (2010) Comput. Speech Lang. , vol.24 , pp. 1-15
    • Cooke, M.1    Hershey, J.R.2    Rennie, S.3
  • 6
    • 80052339383
    • Some experiments on the recognition of speech, with one and two ears
    • E. C. Cherry, "Some experiments on the recognition of speech, with one and two ears," J. Acoust. Soc. Amer., vol. 25, pp. 975-979, 1953.
    • (1953) J. Acoust. Soc. Amer. , vol.25 , pp. 975-979
    • Cherry, E.C.1
  • 7
    • 0034319894
    • A computationally efficient multipitch analysis model
    • Nov.
    • T. Tolonen and M. Karjalainen, "A computationally efficient multipitch analysis model," IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 708-716, Nov. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.6 , pp. 708-716
    • Tolonen, T.1    Karjalainen, M.2
  • 8
    • 0032663192
    • Multiple period estimation and pitch perception model
    • A. de Cheveigné and H. Kawahara, "Multiple period estimation and pitch perception model," Speech Commun., vol. 27, pp. 175-185, 1999.
    • (1999) Speech Commun. , vol.27 , pp. 175-185
    • De Cheveigné, A.1    Kawahara, H.2
  • 9
    • 33645360635
    • Bayesian analysis of polyphonic western tonal music
    • M. Davy, S. J. Godsill, and J. Idier, "Bayesian analysis of polyphonic western tonal music," J. Acoust. Soc. Amer., vol. 119, pp. 2498-2517, 2006.
    • (2006) J. Acoust. Soc. Amer. , vol.119 , pp. 2498-2517
    • Davy, M.1    Godsill, S.J.2    Idier, J.3
  • 10
    • 0028210066
    • Fundamental frequency estimation of musical signals using a two-way mismatch procedure
    • R. C. Maher and J. W. Beauchamp, "Fundamental frequency estimation of musical signals using a two-way mismatch procedure," J. Acoust. Soc. Amer., vol. 95, no. 4, pp. 2254-2263, 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.95 , Issue.4 , pp. 2254-2263
    • Maher, R.C.1    Beauchamp, J.W.2
  • 11
    • 4644242508
    • A real-time music-scene-description system: Predominant-f0 estimation for detecting melody and bass lines in real-world audio signals
    • M. Goto, "A real-time music-scene-description system: Predominant-f0 estimation for detecting melody and bass lines in real-world audio signals," Speech Commun., vol. 43, no. 4, pp. 311-329, 2004.
    • (2004) Speech Commun. , vol.43 , Issue.4 , pp. 311-329
    • Goto, M.1
  • 14
    • 84873444806
    • Multiple fundamental frequency estimation by summing harmonic amplitudes
    • A. Klapuri, "Multiple fundamental frequency estimation by summing harmonic amplitudes," in Proc. ISMIR, 2006, pp. 216-221.
    • Proc. ISMIR, 2006 , pp. 216-221
    • Klapuri, A.1
  • 17
    • 77956540787
    • Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions
    • Nov.
    • Z. Duan, B. Pardo, and C. Zhang, "Multiple fundamental frequency estimation by modeling spectral peaks and non-peak regions," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2121-2133, Nov. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 2121-2133
    • Duan, Z.1    Pardo, B.2    Zhang, C.3
  • 18
    • 0037767686
    • A multipitch tracking algorithm for noisy speech
    • May
    • M. Wu, D. Wang, and G. J. Brown, "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 229-241, May 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.3 , pp. 229-241
    • Wu, M.1    Wang, D.2    Brown, G.J.3
  • 19
    • 84899027288
    • Real-time pitch determination of one or more voices by nonnegative matrix factorization
    • F. Sha and L. Saul, "Real-time pitch determination of one or more voices by nonnegative matrix factorization," in Proc. Adv. Neural Inf. Process. Syst. (NIPS), 2005, pp. 1233-1240.
    • Proc. Adv. Neural Inf. Process. Syst. (NIPS), 2005 , pp. 1233-1240
    • Sha, F.1    Saul, L.2
  • 21
    • 85008056718
    • HMM-based multipitch tracking for noisy and reverberant speech
    • Jul.
    • Z. Jin and D. Wang, "HMM-based multipitch tracking for noisy and reverberant speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 5, pp. 1091-1102, Jul. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.5 , pp. 1091-1102
    • Jin, Z.1    Wang, D.2
  • 23
    • 33846199251
    • A discriminative model for polyphonic piano transcription
    • DOI:10.1155/2007/48317
    • G. E. Poliner and D. P. W. Ellis, "A discriminative model for polyphonic piano transcription," EURASIP J. Adv. Signal Process., 2007, DOI:10.1155/2007/48317.
    • (2007) EURASIP J. Adv. Signal Process.
    • Poliner, G.E.1    Ellis, D.P.W.2
  • 24
    • 50249173884
    • A multipitch analyzer based on harmonic temporal structured clustering
    • Mar.
    • H. Kameoka, T. Nishimoto, and S. Sagayama, "A multipitch analyzer based on harmonic temporal structured clustering," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 982-994, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 982-994
    • Kameoka, H.1    Nishimoto, T.2    Sagayama, S.3
  • 26
    • 50249167077
    • Single and multiple f0 contour estimation through parametric spectrogram modeling of speech in noisy environments
    • Jul.
    • J. Le Roux, H. Kameoka, N. Ono, A. de Cheveigné, and S. Sagayama, "Single and multiple f0 contour estimation through parametric spectrogram modeling of speech in noisy environments," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1135-1145, Jul. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1135-1145
    • Le Roux, J.1    Kameoka, H.2    Ono, N.3    De Cheveigné, A.4    Sagayama, S.5
  • 28
    • 0032678384
    • A sound source identification system for ensemble music based on template adaptation and music stream extraction
    • K. Kashino and H. Murase, "A sound source identification system for ensemble music based on template adaptation and music stream extraction," Speech Commun., pp. 337-349, 1999.
    • (1999) Speech Commun. , pp. 337-349
    • Kashino, K.1    Murase, H.2
  • 29
    • 33744978751
    • Musical source separation using time-frequency source priors
    • DOI 10.1109/TSA.2005.860342
    • E. Vincent, "Musical source separation using time-frequency source priors," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 91-98, 2006. (Pubitemid 43863456)
    • (2006) IEEE Transactions on Audio, Speech and Language Processing , vol.14 , Issue.1 , pp. 91-98
    • Vincent, E.1
  • 31
    • 79951599228
    • A probabilistic interaction model for multipitch tracking with factorial hidden Markov models
    • May
    • M. Wohlmayr, M. Stark, and F. Pernkopf, "A probabilistic interaction model for multipitch tracking with factorial hidden Markov models," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 4, pp. 799-810, May 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.4 , pp. 799-810
    • Wohlmayr, M.1    Stark, M.2    Pernkopf, F.3
  • 32
    • 84867946385
    • An unsupervised approach to cochannel speech separation
    • Jan.
    • K. Hu and D. Wang, "An unsupervised approach to cochannel speech separation," IEEE Trans. Audio, Speech, Lang. Process., vol. 21, no. 1, pp. 122-131, Jan. 2013.
    • (2013) IEEE Trans. Audio, Speech, Lang. Process. , vol.21 , Issue.1 , pp. 122-131
    • Hu, K.1    Wang, D.2
  • 34
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • Aug.
    • R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744-754, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , Issue.4 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 41
    • 76949083398
    • Dynamic spectral envelope-modeling for timbre analysis of musical instrument sounds
    • Mar.
    • J. J. Burred, A. Röbel, and T. Sikora, "Dynamic spectral envelope-modeling for timbre analysis of musical instrument sounds," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 3, pp. 663-674, Mar. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.3 , pp. 663-674
    • Burred, J.J.1    Röbel, A.2    Sikora, T.3
  • 43
    • 80052999230
    • Soundprism: An online system for score-informed source separation of music audio
    • Dec.
    • Z. Duan and B. Pardo, "Soundprism: An online system for score-informed source separation of music audio," IEEE J. Sel. Topics Signal Process., vol. 5, no. 6, pp. 1205-1215, Dec. 2011.
    • (2011) IEEE J. Sel. Topics Signal Process. , vol.5 , Issue.6 , pp. 1205-1215
    • Duan, Z.1    Pardo, B.2
  • 44
    • 85137465653
    • An improved cepstral method for deconvolution of source-filter systems with discrete spectra: Application to musical sounds
    • T. Galas and X. Rodet, "An improved cepstral method for deconvolution of source-filter systems with discrete spectra: Application to musical sounds," in Proc. Int. Comput. Music Conf. (ICMC), 1990, pp. 82-84.
    • Proc. Int. Comput. Music Conf. (ICMC), 1990 , pp. 82-84
    • Galas, T.1    Rodet, X.2
  • 47
    • 0036214787
    • Yin, a fundamental frequency estimator for speech and music
    • A. de Cheveigné and H. Kawahara, "Yin, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol. 111, pp. 1917-1930, 2002.
    • (2002) J. Acoust. Soc. Amer. , vol.111 , pp. 1917-1930
    • De Cheveigné, A.1    Kawahara, H.2
  • 48
    • 84865703367
    • A pitch tracking corpus with evaluation on multipitch tracking scenario
    • G. Pirker, M. Wohlmayr, S. Petrik, and F. Pernkopf, "A pitch tracking corpus with evaluation on multipitch tracking scenario," in Proc. Interspeech, 2011, pp. 1509-1512.
    • Proc. Interspeech, 2011 , pp. 1509-1512
    • Pirker, G.1    Wohlmayr, M.2    Petrik, S.3    Pernkopf, F.4
  • 50
    • 4444257069
    • Praat, a system for doing phonetics by computer
    • P. Boersma, "Praat, a system for doing phonetics by computer," Glot Int., vol. 5, no. 9/10, pp. 341-345, 2001.
    • (2001) Glot Int. , vol.5 , Issue.9-10 , pp. 341-345
    • Boersma, P.1
  • 51
    • 77955695149
    • A tandem algorithm for pitch estimation and voiced speech segregation
    • Nov.
    • G. Hu and D. Wang, "A tandem algorithm for pitch estimation and voiced speech segregation," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 8, pp. 2067-2079, Nov. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.8 , pp. 2067-2079
    • Hu, G.1    Wang, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.