Volume 11, Issue 3, 2003, Pages 229-241

A multipitch tracking algorithm for noisy speech

Author keywords

Channel selection; Correlogram; Hidden Markov model (HMM); Multipitch tracking; Noisy speech; Pitch detection

Indexed keywords

ACOUSTIC NOISE; ALGORITHMS; COMMUNICATION CHANNELS (INFORMATION THEORY); DATABASE SYSTEMS; MARKOV PROCESSES; MATHEMATICAL MODELS; SIGNAL INTERFERENCE; SPEECH RECOGNITION; STATISTICAL METHODS;

EID: 0037767686     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2003.811539     Document Type: Article
Times cited : (230)

References (42)
  • 1
    • 0032654850
    • Cepstrum-based pitch detection using a new statistical V/UV classification algorithm
    • May
    • S. Ahmadi and A. S. Spanias, "Cepstrum-based pitch detection using a new statistical V/UV classification algorithm," IEEE Trans. Speech Audio Processing, vol. 7, pp. 333-338, May 1999.
    • (1999) IEEE Trans. Speech Audio Processing , vol.7 , pp. 333-338
    • Ahmadi, S.1    Spanias, A.S.2
  • 2
    • 0028531926
    • Computational auditory scene analysis
    • G. J. Brown and M. P. Cooke, "Computational auditory scene analysis," Comput. Speech Lang., vol. 8, pp. 297-336, 1994.
    • (1994) Comput. Speech Lang. , vol.8 , pp. 297-336
    • Brown, G.J.1    Cooke, M.P.2
  • 3
    • 0030710671
    • Robust pitch detection of speech signals using steerable filters
    • J. Cai and Z.-Q. Liu, "Robust pitch detection of speech signals using steerable filters," in Proc. IEEE ICASSP, vol. 2, 1997, pp. 1427-1430.
    • (1997) Proc. IEEE ICASSP , vol.2 , pp. 1427-1430
    • Cai, J.1    Liu, Z.-Q.2
  • 4
    • 0027307718
    • Optimal multi-pitch estimation using the EM algorithm for co-channel speech separation
    • D. Chazan, Y. Stettiner, and D. Malah, "Optimal multi-pitch estimation using the EM algorithm for co-channel speech separation," in Proc. IEEE ICASSP, 1993, pp. II-728-II-731.
    • Proc. IEEE ICASSP, 1993
    • Chazan, D.1    Stettiner, Y.2    Malah, D.3
  • 6
    • 0027298253
    • Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing
    • A. de Cheveigné, "Separation of concurrent harmonic sounds: Fundamental frequency estimation and a time-domain cancellation model of auditory processing," J. Acoust. Soc. Amer., vol. 93, pp. 3271-3290, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 3271-3290
    • De Cheveigné, A.1
  • 7
    • 0032663192
    • Multiple period estimation and pitch perception model
    • A. de Cheveigné and H. Kawahara, "Multiple period estimation and pitch perception model," Speech Commun., vol. 27, pp. 175-185, 1999.
    • (1999) Speech Commun. , vol.27 , pp. 175-185
    • De Cheveigné, A.1    Kawahara, H.2
  • 8
    • 0037750051
    • Sound source separation via computational auditory scene analysis (CASA)-Enhanced beamforming
    • Ph.D. dissertation, Dept. Elect. Eng., Northwestern Univ., Evanston, IL
    • L. A. Drake, "Sound Source Separation via Computational Auditory Scene Analysis (CASA)-Enhanced Beamforming," Ph.D. dissertation, Dept. Elect. Eng., Northwestern Univ., Evanston, IL, 2001.
    • (2001)
    • Drake, L.A.1
  • 10
    • 4243522694
    • Linear and nonlinear adaptive filtering and their application to speech intelligibility enhancement
    • Ph.D. dissertation, Dept. Elect. Eng., Eindhoven Univ. Technol., Eindhoven, The Netherlands
    • Y. H. Gu, "Linear and Nonlinear Adaptive Filtering and Their Application to Speech Intelligibility Enhancement," Ph.D. dissertation, Dept. Elect. Eng., Eindhoven Univ. Technol., Eindhoven, The Netherlands, 1992.
    • (1992)
    • Gu, Y.H.1
  • 11
    • 0026400654
    • Co-channel speech separation using frequency bin nonlinear adaptive filter
    • Y. H. Gu and W. M. G. van Bokhoven, "Co-channel speech separation using frequency bin nonlinear adaptive filter," in Proc. IEEE ICASSP, 1991, pp. 949-952.
    • Proc. IEEE ICASSP, 1991 , pp. 949-952
    • Gu, Y.H.1    Van Bokhoven, W.M.G.2
  • 12
    • 0035528674
    • Idiot's Bayes - Not so stupid after all?
    • D. J. Hand and K. Yu, "Idiot's Bayes - Not so stupid after all?," Int. Statist. Rev., vol. 69, no. 3, pp. 385-398, 2001.
    • (2001) Int. Statist. Rev. , vol.69 , Issue.3 , pp. 385-398
    • Hand, D.J.1    Yu, K.2
  • 15
    • 0023867341
    • Speaker dependent and independent speech recognition experiments with an auditory model
    • M. J. Hunt and C. Lefèbvre, "Speaker dependent and independent speech recognition experiments with an auditory model," in Proc. IEEE ICASSP, 1988, pp. 215-218.
    • Proc. IEEE ICASSP, 1988 , pp. 215-218
    • Hunt, M.J.1    Lefèbvre, C.2
  • 17
    • 0025635254
    • On a simple algorithm to calculate the 'energy' of a signal
    • J. F. Kaiser, "On a simple algorithm to calculate the 'energy' of a signal," in Proc. IEEE ICASSP, 1990, pp. 381-384.
    • Proc. IEEE ICASSP, 1990 , pp. 381-384
    • Kaiser, J.F.1
  • 19
    • 0026103222
    • An autocorrelation pitch detector and voicing decision with confidence measures developed for noise-corrupted speech
    • Feb.
    • D. A. Krubsack and R. J. Niederjohn, "An autocorrelation pitch detector and voicing decision with confidence measures developed for noise-corrupted speech," IEEE Trans. Signal Processing, vol. 39, pp. 319-329, Feb. 1991.
    • (1991) IEEE Trans. Signal Processing , vol.39 , pp. 319-329
    • Krubsack, D.A.1    Niederjohn, R.J.2
  • 20
    • 0033627218
    • Pitch extraction by using autocorrelation function on the log spectrum
    • N. Kunieda, T. Shimamura, and J. Suzuki, "Pitch extraction by using autocorrelation function on the log spectrum," Electron. Commun. Jpn., pt. 3, vol. 83, no. 1, pp. 90-98, 2000.
    • (2000) Electron. Commun. Jpn., Pt. 3 , vol.83 , Issue.1 , pp. 90-98
    • Kunieda, N.1    Shimamura, T.2    Suzuki, J.3
  • 21
    • 0033725389
    • Simplified pitch detection algorithm of mixed speech signals
    • Y.-H. Kwon, D.-J. Park, and B.-C. Ihm, "Simplified pitch detection algorithm of mixed speech signals," in Proc. IEEE ISCAS, 2000, pp. III-722-III-725.
    • Proc. IEEE ISCAS, 2000
    • Kwon, Y.-H.1    Park, D.-J.2    Ihm, B.-C.3
  • 22
    • 0021226391
    • A database for speaker-independent digit recognition
    • R. G. Leonard, "A database for speaker-independent digit recognition," in Proc. IEEE ICASSP, 1984, pp. 111-114.
    • Proc. IEEE ICASSP, 1984 , pp. 111-114
    • Leonard, R.G.1
  • 23
    • 0001463644
    • A duplex theory of pitch perception
    • J. D. R. Licklider, "A duplex theory of pitch perception," Experientia, vol. 7, pp. 128-134, 1951.
    • (1951) Experientia , vol.7 , pp. 128-134
    • Licklider, J.D.R.1
  • 24
    • 0035440178
    • Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure
    • Sept.
    • D. J. Liu and C. T. Lin, "Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure," IEEE Trans. Speech Audio Processing, vol. 9, pp. 609-621, Sept. 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , pp. 609-621
    • Liu, D.J.1    Lin, C.T.2
  • 25
    • 0025807353
    • Super resolution pitch determination of speech signals
    • Jan.
    • Y. Medan, E. Yair, and D. Chazan, "Super resolution pitch determination of speech signals," IEEE Trans. Signal Processing, vol. 39, pp. 40-48, Jan. 1991.
    • (1991) IEEE Trans. Signal Processing , vol.39 , pp. 40-48
    • Medan, Y.1    Yair, E.2    Chazan, D.3
  • 26
    • 0026654967
    • Modeling the identification of concurrent vowels with different fundamental frequencies
    • R. Meddis and M. J. Hewitt, "Modeling the identification of concurrent vowels with different fundamental frequencies," J. Acoust. Soc. Amer., vol. 91, no. 1, pp. 233-244, 1992.
    • (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.1 , pp. 233-244
    • Meddis, R.1    Hewitt, M.J.2
  • 28
    • 0003257037
    • The prosody of speech: Melody and rhythm
    • W. J. Hardcastle and J. Laver, Eds. Cambridge, MA: Blackwell
    • S. Nooteboom, "The prosody of speech: Melody and rhythm," in The Handbook of Phonetic Science, W. J. Hardcastle and J. Laver, Eds. Cambridge, MA: Blackwell, 1997, pp. 640-673.
    • (1997) The Handbook of Phonetic Science , pp. 640-673
    • Nooteboom, S.1
  • 29
    • 0142056390
    • APU Rep. 2341: An efficient auditory filterbank based on the gammatone function
    • Cambridge, U.K.
    • R. D. Patterson, I. Nimmo-Smith, J. Holdsworth, and P. Rice, "APU Rep. 2341: An Efficient Auditory Filterbank Based on the Gammatone Function," Appl. Psychol. Unit, Cambridge, U.K., 1988.
    • (1988) Appl. Psychol. Unit
    • Patterson, R.D.1    Nimmo-Smith, I.2    Holdsworth, J.3    Rice, P.4
  • 32
    • 0031124228
    • A pitch determination and voiced/unvoiced decision algorithm for noisy speech
    • J. Rouat, Y. C. Liu, and D. Morissette, "A pitch determination and voiced/unvoiced decision algorithm for noisy speech," Speech Commun., vol. 21, pp. 191-207, 1997.
    • (1997) Speech Commun. , vol.21 , pp. 191-207
    • Rouat, J.1    Liu, Y.C.2    Morissette, D.3
  • 34
    • 0035472923
    • Weighted autocorrelation for pitch extraction of noisy speech
    • Oct.
    • T. Shimamura and J. Kobayashi, "Weighted autocorrelation for pitch extraction of noisy speech," IEEE Trans. Speech Audio Processing, vol. 9, pp. 727-730, Oct. 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , pp. 727-730
    • Shimamura, T.1    Kobayashi, J.2
  • 35
    • 0033909254
    • A method for pitch extraction of speech signals using autocorrelation functions through multiple window lengths
    • T. Takagi, N. Seiyama, and E. Miyasaka, "A method for pitch extraction of speech signals using autocorrelation functions through multiple window lengths," Electron. Commun. Jpn., pt. 3, vol. 83, no. 2, pp. 67-79, 2000.
    • (2000) Electron. Commun. Jpn. , vol.83 , Issue.2 PART 3 , pp. 67-79
    • Takagi, T.1    Seiyama, N.2    Miyasaka, E.3
  • 36
    • 0032678076
    • Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
    • K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling," in Proc. IEEE ICASSP, vol. 1, 1999, pp. 229-232.
    • (1999) Proc. IEEE ICASSP , vol.1 , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 37
    • 0034319894
    • A computationally efficient multipitch analysis model
    • Nov.
    • T. Tolonen and M. Karjalainen, "A computationally efficient multipitch analysis model," IEEE Trans. Speech Audio Processing, vol. 8, pp. 708-716, Nov. 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 708-716
    • Tolonen, T.1    Karjalainen, M.2
  • 38
    • 0026635515
    • Pitch and voiced/unvoiced determination with an auditory model
    • L. M. Van Immerseel and J.-P. Martens, "Pitch and voiced/unvoiced determination with an auditory model," J. Acoust. Soc. Amer., vol. 91, no. 6, pp. 3511-3526, 1992.
    • (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.6 , pp. 3511-3526
    • Van Immerseel, L.M.1    Martens, J.-P.2
  • 39
    • 0032682770
    • Separation of speech from interfering sounds based on oscillatory correlation
    • May
    • D. L. Wang and G. J. Brown, "Separation of speech from interfering sounds based on oscillatory correlation," IEEE Trans. Neural Networks, vol. 10, pp. 684-697, May 1999.
    • (1999) IEEE Trans. Neural Networks , vol.10 , pp. 684-697
    • Wang, D.L.1    Brown, G.J.2
  • 40
    • 0033693009
    • Robust pitch tracking for prosodic modeling in telephone speech
    • C. Wang and S. Seneff, "Robust pitch tracking for prosodic modeling in telephone speech," in Proc. IEEE ICASSP, 2000, pp. 1343-1346.
    • Proc. IEEE ICASSP, 2000 , pp. 1343-1346
    • Wang, C.1    Seneff, S.2
  • 41
    • 0022907820
    • A computational model for separating two simultaneous talkers
    • M. Weintraub, "A computational model for separating two simultaneous talkers," in Proc. IEEE ICASSP, 1986, pp. 81-84.
    • Proc. IEEE ICASSP, 1986 , pp. 81-84
    • Weintraub, M.1
  • 42
    • 0034869339
    • Pitch tracking based on statistical anticipation
    • M. Wu, D. L. Wang, and G. J. Brown, "Pitch tracking based on statistical anticipation," in Proc. IJCNN, vol. 2, 2001, pp. 866-871.
    • (2001) Proc. IJCNN , vol.2 , pp. 866-871
    • Wu, M.1    Wang, D.L.2    Brown, G.J.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.