메뉴 건너뛰기




Volumn 19, Issue 7, 2011, Pages 2210-2221

Maximum A Posteriori Probability Multiple-Pitch Tracking Using the Harmonic Model

Author keywords

F0 estimation; harmonic model; maximum a posteriori probability (MAP); multipitch estimation; multiple pitch estimation; pitch tracking

Indexed keywords


EID: 85008559665     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2011.2125952     Document Type: Article
Times cited : (12)

References (43)
  • 1
    • 0017367712 scopus 로고
    • On the use of autocorrelation analysis for pitch detection
    • Feb.
    • L. Rabiner “On the use of autocorrelation analysis for pitch detection,” IEEE Trans. Acoust., Speech, Signal Process., vol. 25, no. 1, pp. 24–33, Feb. 1977.
    • (1977) IEEE Trans. Acoust., Speech, Signal Process. , vol.25 , Issue.1 , pp. 24-33
    • Rabiner, L.1
  • 2
    • 0036214787 scopus 로고    scopus 로고
    • Yin, a fundamental frequency estimator for speech and music
    • Jan.
    • A. de Cheveigné and H. Kawahara “Yin, a fundamental frequency estimator for speech and music,” J. Acoust. Soc. Amer., vol. 111, no. 4, pp. 1917–1930, Jan. 2002.
    • (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4 , pp. 1917-1930
    • de Cheveigné, A.1    Kawahara, H.2
  • 3
    • 0742290126 scopus 로고    scopus 로고
    • Maximum a-posteriori probability pitch tracking in noisy environments using harmonic model
    • Jan.
    • J. Tabrikian, S. Dubnov, and Y. Dickalov “Maximum a-posteriori probability pitch tracking in noisy environments using harmonic model,” IEEE Trans. Speech Audio Process., vol. 12, no. 1, pp. 76–87, Jan. 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.1 , pp. 76-87
    • Tabrikian, J.1    Dubnov, S.2    Dickalov, Y.3
  • 6
    • 0030371135 scopus 로고    scopus 로고
    • Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency
    • Oct.
    • T. Abe, T. Kobayashi, and S. Imai, “Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency,” in Proc. 4th Int. Conf. Spoken Lang. ICSLP 96, Oct. 1996, pp. 1277–1280.
    • (1996) Proc. 4th Int. Conf. Spoken Lang. ICSLP 96 , pp. 1277-1280
    • Abe, T.1    Kobayashi, T.2    Imai, S.3
  • 7
    • 0035440178 scopus 로고    scopus 로고
    • Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure
    • Jun.
    • D. J. Liu and C. T. Lin, “Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure,” IEEE Trans. Audio, Speech, Lang. Process., vol. 9, no. 6, pp. 609–621, Jun. 2001.
    • (2001) IEEE Trans. Audio, Speech, Lang. Process. , vol.9 , Issue.6 , pp. 609-621
    • Liu, D.J.1    Lin, C.T.2
  • 8
    • 85008561872 scopus 로고    scopus 로고
    • High-pitch formant estimation by exploiting temporal change of pitch
    • Jan.
    • T. T. Wang and T. F. Quatieri “High-pitch formant estimation by exploiting temporal change of pitch,” IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 1, pp. 171–186, Jan. 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.1 , pp. 171-186
    • Wang, T.T.1    Quatieri, T.F.2
  • 10
    • 0346891525 scopus 로고
    • On the transcription of musical sound by computer
    • J. A. Moorer, “On the transcription of musical sound by computer,” Comput. Music J., pp. 32–38, 1977.
    • (1977) Comput. Music J. , pp. 32-38
    • Moorer, J.A.1
  • 11
    • 0022906102 scopus 로고
    • Source separation and note identification in polyphonic music
    • Apr.
    • C. Chafe and D. Jaffe, “Source separation and note identification in polyphonic music,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 1986, vol. 11, pp. 1289–1292.
    • (1986) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.11 , pp. 1289-1292
    • Chafe, C.1    Jaffe, D.2
  • 12
    • 0017004953 scopus 로고
    • Separation of speech from interfering speech by means of harmonic selection
    • T. W. Parsons, “Separation of speech from interfering speech by means of harmonic selection,” J. Acoust. Soc. Amer., pp. 911–918, 1976.
    • (1976) J. Acoust. Soc. Amer. , pp. 911-918
    • Parsons, T.W.1
  • 13
    • 0027298253 scopus 로고
    • Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing
    • Jun.
    • A. de Cheveigné “Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing,” J. Acoust. Soc. Amer., vol. 93, no. 6, pp. 3271–3290, Jun. 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.6 , pp. 3271-3290
    • de Cheveigné, A.1
  • 14
    • 0347337997 scopus 로고    scopus 로고
    • Multiple fundamental frequency estimation based on harmonicity and spectral smoothness
    • Nov.
    • A. Klapuri “Multiple fundamental frequency estimation based on harmonicity and spectral smoothness,” IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 804–816, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 804-816
    • Klapuri, A.1
  • 15
    • 0030846123 scopus 로고    scopus 로고
    • A unitary model of pitch perception
    • Jun.
    • R. Meddis and L. O'Mard “A unitary model of pitch perception,” J. Acoust. Soc. Amer., vol. 102, no. 3, pp. 1811–1820, Jun. 1997.
    • (1997) J. Acoust. Soc. Amer. , vol.102 , Issue.3 , pp. 1811-1820
    • Meddis, R.1    O'Mard, L.2
  • 16
    • 0003964243 scopus 로고    scopus 로고
    • Automatic transcription of simple polyphonic music: Robust front end processing
    • K. D. Martin, “Automatic transcription of simple polyphonic music: Robust front end processing,” Tech. Rep. MIT Media Lab., 1996.
    • (1996) Tech. Rep. MIT Media Lab.
    • Martin, K.D.1
  • 18
    • 84873444806 scopus 로고    scopus 로고
    • Multiple fundamental frequency estimation by summing harmonic amplitudes
    • A. Klapuri, “Multiple fundamental frequency estimation by summing harmonic amplitudes,” in Proc. 7th Int. Symp. Music Inf. Retrieval (ISMIR'06), 2006, pp. 216–221.
    • (2006) Proc. 7th Int. Symp. Music Inf. Retrieval (ISMIR'06) , pp. 216-221
    • Klapuri, A.1
  • 19
    • 39649094860 scopus 로고    scopus 로고
    • Multipitch analysis of polyphonic music and speech signals using an auditory model
    • Feb.
    • A. Klapuri “Multipitch analysis of polyphonic music and speech signals using an auditory model,” IEEE Trans. Audio, Speech, Lang. Process., vol. 16, no. 2, pp. 255–266, Feb. 2008.
    • (2008) IEEE Trans. Audio, Speech, Lang. Process. , vol.16 , Issue.2 , pp. 255-266
    • Klapuri, A.1
  • 20
    • 4043129651 scopus 로고    scopus 로고
    • Graphical models
    • J. A. Moorer, “Graphical models,” Statist. Sci., pp. 140–155, 2004.
    • (2004) Statist. Sci. , pp. 140-155
    • Moorer, J.A.1
  • 21
    • 33646773610 scopus 로고    scopus 로고
    • Discriminative training of hidden Markov models for multiple pitch tracking
    • F. R. Bach and M. I. Jordan, “Discriminative training of hidden Markov models for multiple pitch tracking,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'05), 2005, pp. 489–492.
    • (2005) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 489-492
    • Bach, F.R.1    Jordan, M.I.2
  • 22
    • 0028210066 scopus 로고
    • Fundamental frequency estimation of musical signals using a two-way mismatch procedure
    • R. C. Maher and J. W. Beauchamp, “Fundamental frequency estimation of musical signals using a two-way mismatch procedure,” J. Acoust. Soc. Amer., pp. 2254–2263, 1994.
    • (1994) J. Acoust. Soc. Amer. , pp. 2254-2263
    • Maher, R.C.1    Beauchamp, J.W.2
  • 23
    • 0034319894 scopus 로고    scopus 로고
    • A computationally efficient multipitch analysis model
    • Nov.
    • T. Tolonen and M. Karjalainen “A computationally efficient multipitch analysis model,” IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 708–716, Nov. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.6 , pp. 708-716
    • Tolonen, T.1    Karjalainen, M.2
  • 24
    • 0037767686 scopus 로고    scopus 로고
    • A multipitch tracking algorithm for noisy speech
    • May
    • M. Wu, D. Wang, and G. J. Brown, “A multipitch tracking algorithm for noisy speech,” IEEE Trans. Speech Audio Process., vol. 11, no. 3, pp. 229–241, May 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.3 , pp. 229-241
    • Wu, M.1    Wang, D.2    Brown, G.J.3
  • 25
    • 84863772450 scopus 로고
    • Speech analysis/synthesis based on a sinusoidal representation
    • Aug.
    • R. McAulay and T. Quatieri “Speech analysis/synthesis based on a sinusoidal representation,” IEEE Trans. Acoust., Speech, Signal Process., vol. 34, no. 4, pp. 744–754, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , Issue.4 , pp. 744-754
    • McAulay, R.1    Quatieri, T.2
  • 28
    • 50249154775 scopus 로고    scopus 로고
    • Joint detection and tracking of time-varying harmonic components: A flexible Bayesian approach
    • May
    • C. Dubois and M. Davy “Joint detection and tracking of time-varying harmonic components: A flexible Bayesian approach,” IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1283–1295, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1283-1295
    • Dubois, C.1    Davy, M.2
  • 30
    • 84899027288 scopus 로고    scopus 로고
    • Real-time pitch determination of one or more voices by nonnegative matrix factorization
    • F. Sha and L. K. Saul, “Real-time pitch determination of one or more voices by nonnegative matrix factorization,” Adv. Neural Inf. Process. Syst., pp. 1233–1240, 2005.
    • (2005) Adv. Neural Inf. Process. Syst. , pp. 1233-1240
    • Sha, F.1    Saul, L.K.2
  • 31
    • 70449658600 scopus 로고    scopus 로고
    • Realtime multiple pitch observation using sparse non-negative constraints
    • A. Cont, “Realtime multiple pitch observation using sparse non-negative constraints,” in Proc. 7th Int. Symp. Music Inf. Retrieval (ISMIR'06), 2006.
    • (2006) Proc. 7th Int. Symp. Music Inf. Retrieval (ISMIR'06)
    • Cont, A.1
  • 34
    • 85008537526 scopus 로고    scopus 로고
    • Multiple F0 estimation
    • D. L. Wang and G. J. Brown, Eds. New York: Wiley/IEEE Press
    • A. de Cheveigne, “Multiple F0 estimation,” in Auditory Scene Analysis, Algorithms and Applications, D. L. Wang and G. J. Brown, Eds. New York: Wiley/IEEE Press, 2006.
    • (2006) Auditory Scene Analysis
    • de Cheveigne, A.1
  • 36
    • 84863772450 scopus 로고
    • Speech analysis-synthesis based on a sinusoidal representation
    • Aug.
    • R. J. McAulay and T. F. Quatieri “Speech analysis-synthesis based on a sinusoidal representation,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744–754, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-34 , Issue.4 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 37
    • 0033908965 scopus 로고    scopus 로고
    • Relationships between adaptive minimum variance beamforming and optimal source localization
    • Jan.
    • K. Harmanci, J. Tabrikian, and J. Krolik “Relationships between adaptive minimum variance beamforming and optimal source localization,” IEEE Trans. Signal Process., vol. 48, no. 1, pp. 1–12, Jan. 2000.
    • (2000) IEEE Trans. Signal Process. , vol.48 , Issue.1 , pp. 1-12
    • Harmanci, K.1    Tabrikian, J.2    Krolik, J.3
  • 41
    • 33847129521 scopus 로고    scopus 로고
    • Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model
    • Mar.
    • E. Fisher, J. Tabrikian, and S. Dubnov “Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model,” IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 2, pp. 502–510, Mar. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.2 , pp. 502-510
    • Fisher, E.1    Tabrikian, J.2    Dubnov, S.3
  • 43
    • 33744996003 scopus 로고    scopus 로고
    • Model-based sequential organization in cochannel speech
    • Jan.
    • Y. Shao and D. Wang “Model-based sequential organization in cochannel speech,” IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 1, pp. 289–298, Jan. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.1 , pp. 289-298
    • Shao, Y.1    Wang, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.