메뉴 건너뛰기




Volumn 13, Issue 6, 2005, Pages 1161-1172

Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR

Author keywords

Feature compensation; Noise robust speech recognition; Polynomial regression; Signal to noise ratio (SNR) estimation

Indexed keywords

FEATURE COMPENSATION; NOISE ROBUST SPEECH RECOGNITION; POLYNOMIAL REGRESSION; SIGNAL-TO-NOISE RATIO (SIR) ESTIMATION;

EID: 27744539597     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.853002     Document Type: Article
Times cited : (53)

References (26)
  • 1
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech recognition in noisy environments: a survey," Speech Commun., vol. 16, pp. 261-291, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.1
  • 2
    • 0018455310 scopus 로고
    • Suppression of acoutic noise in speech using spectral subtraction
    • S. Boll, "Suppression of acoutic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, no. 2, pp. 113-120, 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 3
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, no. 6, pp. 1304-1312, 1974.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.1
  • 5
    • 2442551863 scopus 로고    scopus 로고
    • Estimating cepstrum of speech under the presentee of noise using a joint prior of static and dynamic features
    • May
    • L. Deng, J. Droppo, and A. Acero, "Estimating cepstrum of speech under the presentee of noise using a joint prior of static and dynamic features," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 218-233, May 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.3 , pp. 218-233
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 7
    • 0025041264 scopus 로고
    • Perceptual linear prediction (PLP) analysis of speech
    • H. Hermansky, "Perceptual linear prediction (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 10
    • 0031238095 scopus 로고    scopus 로고
    • A model of dynamic auditory perception and its application to robust word recognition
    • B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, pp. 451-464, 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , pp. 451-464
    • Strope, B.1    Alwan, A.2
  • 11
    • 85009110489 scopus 로고    scopus 로고
    • Amplitude demodulation of speech and its application to noise robust speech recognition
    • Q. Zhu and A. Alwan, "Amplitude demodulation of speech and its application to noise robust speech recognition," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 341-344.
    • (2000) Proc. Int. Conf. Spoken Language Processing , pp. 341-344
    • Zhu, Q.1    Alwan, A.2
  • 12
    • 0033690878 scopus 로고    scopus 로고
    • On the use of variable frame rate analysis in speech recognition
    • _, "On the use of variable frame rate analysis in speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2000, pp. 1783-1786.
    • (2000) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 1783-1786
  • 13
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • M. Gales and S. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, pp. 352-359, 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 352-359
    • Gales, M.1    Young, S.2
  • 15
    • 0030365580 scopus 로고    scopus 로고
    • Cepstral compensation by polynomial approximation for environment-independent speech recognition
    • B. Raj, E. Gouvea, P. Moreno, and R. Stern, "Cepstral compensation by polynomial approximation for environment-independent speech recognition," in Proc. Int. Conf. Spoken Language Processing, 1996, pp. 2340-2343.
    • (1996) Proc. Int. Conf. Spoken Language Processing , pp. 2340-2343
    • Raj, B.1    Gouvea, E.2    Moreno, P.3    Stern, R.4
  • 17
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 18
    • 0347899508 scopus 로고    scopus 로고
    • Piecewise-linear transformation-based HMM adaptation for noisy speech
    • Z. Zhang and S. Furui, "Piecewise-linear transformation-based HMM adaptation for noisy speech," Speech Commun., vol. 42, pp. 43-58, 2004.
    • (2004) Speech Commun. , vol.42 , pp. 43-58
    • Zhang, Z.1    Furui, S.2
  • 19
    • 0141480132 scopus 로고    scopus 로고
    • Variable parameter Gaussian mixture hidden Markov modeling for speech recognition
    • X. Cui and Y. Gong, "Variable parameter Gaussian mixture hidden Markov modeling for speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, 2003, pp. 12-15.
    • (2003) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 12-15
    • Cui, X.1    Gong, Y.2
  • 20
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the em algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 21
    • 0036476654 scopus 로고    scopus 로고
    • Noise-dependent Gaussian mixture classifers for robust rejection decision
    • Mar.
    • Y. Gong, "Noise-dependent Gaussian mixture classifers for robust rejection decision," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 57-64, Mar. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.2 , pp. 57-64
    • Gong, Y.1
  • 22
    • 2142756950 scopus 로고    scopus 로고
    • Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise
    • May
    • L. Deng, J. Droppo, and A. Acero, "Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 133-143, May 2004.
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.3 , pp. 133-143
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 23
    • 0036755378 scopus 로고    scopus 로고
    • The effect of additive noise on speech amplitude spectra: A quantitative analysis
    • Sep.
    • Q. Zhu and A. Alwan, "The effect of additive noise on speech amplitude spectra: A quantitative analysis," IEEE Signal Process. Lett., vol. 9, no. 9, pp. 275-277, Sep. 2002.
    • (2002) IEEE Signal Process. Lett. , vol.9 , Issue.9 , pp. 275-277
    • Zhu, Q.1    Alwan, A.2
  • 24
    • 85135379452 scopus 로고
    • An efficient algorithm to estimate instantanous SNR of speech signals
    • R. Martin, "An efficient algorithm to estimate instantanous SNR of speech signals," in Proc. Eur. Conf. Speech Communication Technology, 1993, pp. 1093-1096.
    • (1993) Proc. Eur. Conf. Speech Communication Technology , pp. 1093-1096
    • Martin, R.1
  • 25
    • 0038669544 scopus 로고    scopus 로고
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • H. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ASR2000 Int. Workshop on Automatic Speech Recognition, 2000, pp. 181-188.
    • (2000) Proc. ASR2000 Int. Workshop on Automatic Speech Recognition , pp. 181-188
    • Hirsch, H.1    Pearce, D.2
  • 26
    • 4544219816 scopus 로고    scopus 로고
    • Cambridge, U.K.: Cambridge Univ. Press
    • The HTK Book (Version 3.1). Cambridge, U.K.: Cambridge Univ. Press, 2001.
    • (2001) The HTK Book (Version 3.1)


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.