Volume 15, Issue 5, 2004, Pages 1112-1124

A temporal-analysis-based pitch estimation system for noisy speech with a comparative study of performance of recent systems

Author keywords

[No Author keywords available]

Indexed keywords

DAMPING; ESTIMATION; HARMONIC GENERATION; MATHEMATICAL MODELS; NATURAL FREQUENCIES; OSCILLATORS (ELECTRONIC); SPEECH PROCESSING; SPURIOUS SIGNAL NOISE;

EID: 4644223415     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/TNN.2004.832818     Document Type: Article
Times cited: 12

References (30)
  • 1
    • 0001463644
    • A duplex theory of pitch perception
    • J. C. R. Licklider, "A duplex theory of pitch perception," Experientia, vol. 7, pp. 128-134, 1951.
    • (1951) Experientia , vol.7 , pp. 128-134
    • Licklider, J.C.R.1
  • 2
    • 0025740746
    • Virtual pitch and phase sensitivity of a computer model of the auditory periphery. I. Pitch identification
    • R. Meddis and M. J. Hewitt, "Virtual pitch and phase sensitivity of a computer model of the auditory periphery. I. Pitch identification," J. Acoust. Soc. Amer., vol. 89, pp. 2866-2882, 1991.
    • (1991) J. Acoust. Soc. Amer. , vol.89 , pp. 2866-2882
    • Meddis, R.1    Hewitt, M.J.2
  • 3
    • 0029745579
    • Neural correlates of the pitch of complex tones. I. Pitch and pitch salience
    • P. A. Cariani and B. Delgutte, "Neural correlates of the pitch of complex tones. I. Pitch and pitch salience," J. Neurophysiol., vol. 76, pp. 1698-1716, 1996.
    • (1996) J. Neurophysiol. , vol.76 , pp. 1698-1716
    • Cariani, P.A.1    Delgutte, B.2
  • 4
    • 0015756315
    • An optimum processor theory for the central formation of pitch of complex tones
    • J. L. Goldstein, "An optimum processor theory for the central formation of pitch of complex tones," J. Acoust. Soc. Amer., vol. 54, pp. 1496-1516, 1973.
    • (1973) J. Acoust. Soc. Amer. , vol.54 , pp. 1496-1516
    • Goldstein, J.L.1
  • 5
    • 0020141502
    • Measurement of pitch in speech: An implementation of Goldstein's theory of pitch perception
    • H. Duifhuis, L. F. Willems, and R. J. Sluyter, "Measurement of pitch in speech: an implementation of Goldstein's theory of pitch perception," J. Acoust. Soc. Amer., vol. 71, pp. 1568-1580, 1982.
    • (1982) J. Acoust. Soc. Amer. , vol.71 , pp. 1568-1580
    • Duifhuis, H.1    Willems, L.F.2    Sluyter, R.J.3
  • 6
    • 0000618817
    • New methods of pitch extraction
    • June
    • M. M. Sondhi, "New methods of pitch extraction," IEEE Trans. Audio Electroacoust., vol. AU-16, pp. 262-266, June 1968.
    • (1968) IEEE Trans. Audio Electroacoust. , vol.AU-16 , pp. 262-266
    • Sondhi, M.M.1
  • 8
    • 0008806588
    • Pitch analysis
    • M. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley
    • D. J. Hermes, "Pitch analysis," in Visual Representation of Speech Signals, M. Cooke, S. Beet, and M. Crawford, Eds. New York: Wiley, 1993.
    • (1993) Visual Representation of Speech Signals
    • Hermes, D.J.1
  • 9
    • 0036214787
    • YIN, a fundamental frequency estimator for speech and music
    • Apr
    • A. de Cheveigné and H. Kawahara, "YIN, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol. 111, no. 4, Apr. 2002.
    • (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4
    • de Cheveigné, A.1    Kawahara, H.2
  • 13
    • 0001835850
    • Accurate short-term analysis of the fundamental frequency and the harmonics to noise ratio of a sampled sound
    • P. Boersma, "Accurate short-term analysis of the fundamental frequency and the harmonics to noise ratio of a sampled sound," Proc. Institute Phonetic Sciences, vol. 17, pp. 97-110, 1993.
    • (1993) Proc. Institute Phonetic Sciences , vol.17 , pp. 97-110
    • Boersma, P.1
  • 14
    • 0025003184
    • Modeling the perception of concurrent vowels: Vowels with different fundamental frequencies
    • P. F. Assmann and Q. Summerfield, "Modeling the perception of concurrent vowels: vowels with different fundamental frequencies," J. Acoust. Soc. Amer., vol. 88, pp. 680-697, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.88 , pp. 680-697
    • Assmann, P.F.1    Summerfield, Q.2
  • 15
    • 0031658055
    • Psychophysical evidence against the autocorrelation theory of auditory temporal processing
    • C. Kaernbach and L. Demany, "Psychophysical evidence against the autocorrelation theory of auditory temporal processing," J. Acoust. Soc. Amer., vol. 104, no. 4, pp. 2298-2306, 1998.
    • (1998) J. Acoust. Soc. Amer. , vol.104 , Issue.4 , pp. 2298-2306
    • Kaernbach, C.1    Demany, L.2
  • 16
    • 0030068327
    • Encoding the fundamental frequency of a complex tone in the presence of a spectrally overlapping masker
    • R. P. Carlyon, "Encoding the fundamental frequency of a complex tone in the presence of a spectrally overlapping masker," J. Acoust. Soc. Amer., vol. 99, pp. 517-524, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , pp. 517-524
    • Carlyon, R.P.1
  • 17
    • 0027298253
    • Separation of concurrent harmonic sounds: Fundamental frequency estimation and time domain cancellation model of auditory processing
    • A. de Cheveigné, "Separation of concurrent harmonic sounds: fundamental frequency estimation and time domain cancellation model of auditory processing," J. Acoust. Soc. Amer., vol. 93, pp. 3271-3290, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , pp. 3271-3290
    • de Cheveigné, A.1
  • 19
    • 0037767686
    • A multipitch tracking algorithm for noisy speech
    • May
    • M. Wu, D. L. Wang, and G. J. Brown, "A multipitch tracking algorithm for noisy speech," IEEE Trans. Speech Audio Processing, vol. 11, pp. 229-241, May 2003.
    • (2003) IEEE Trans. Speech Audio Processing , vol.11 , pp. 229-241
    • Wu, M.1    Wang, D.L.2    Brown, G.J.3
  • 22
    • 0013385898
    • Frequency selectivity and the tonal residue
    • E. Zwicker and E. Terhardt, Eds. Berlin, Germany: Springer-Verlag
    • R. J. Ritsma and A. Hoekstra, "Frequency selectivity and the tonal residue," in Facts and Models in Hearing, E. Zwicker and E. Terhardt, Eds. Berlin, Germany: Springer-Verlag, 1974, pp. 156-163.
    • (1974) Facts and Models in Hearing , pp. 156-163
    • Ritsma, R.J.1    Hoekstra, A.2
  • 23
    • 0031971032
    • Temporal processing of the pitch of complex tones
    • L. J. White and C. J. Plack, "Temporal processing of the pitch of complex tones," J. Acoust. Soc. Amer., vol. 103, pp. 2051-2063, 1998.
    • (1998) J. Acoust. Soc. Amer. , vol.103 , pp. 2051-2063
    • White, L.J.1    Plack, C.J.2
  • 24
    • 0000030810
    • Auditory nerve representation as a basis for speech processing
    • S. Furui and M. Sondhi, Eds. New York: Marcel Dekker
    • O. Ghitza, "Auditory nerve representation as a basis for speech processing," in Advances in Speech Processing, S. Furui and M. Sondhi, Eds. New York: Marcel Dekker, 1991, pp. 453-485.
    • (1991) Advances in Speech Processing , pp. 453-485
    • Ghitza, O.1
  • 25
    • 0027460571
    • Adequacy of auditory models to predict human internal representation of speech sounds
    • O. Ghitza, "Adequacy of auditory models to predict human internal representation of speech sounds," J. Acoust. Soc. Amer., vol. 93, no. 4, pp. 2160-2171, 1993.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.4 , pp. 2160-2171
    • Ghitza, O.1
  • 26
    • 0030683381
    • The weft: A representation for periodic sounds
    • Munich, Germany
    • D. Ellis, "The weft: a representation for periodic sounds," in Proc. ICASSP, vol. 2, Munich, Germany, 1997, pp. 1307-1310.
    • (1997) Proc. ICASSP , vol.2 , pp. 1307-1310
    • Ellis, D.1
  • 27
    • 84928837806
    • A joint synchrony/mean-rate model of auditory speech processing
    • S. Seneff, "A joint synchrony/mean-rate model of auditory speech processing," J. Phonetics, vol. 16, pp. 55-76, 1988.
    • (1988) J. Phonetics , vol.16 , pp. 55-76
    • Seneff, S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.