SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 13, Issue 6, 2005, Pages 1161-1172

Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR

(2) Cui, Xiaodong a,b Alwan, Abeer a,b,c,d,e

a IEEE (United States)

b UNIVERSITY OF CALIFORNIA (United States)

c Eta Kappa Nu ^* (United States)

d NEW YORK ACADEMY OF SCIENCES (United States)

e ACOUSTICAL SOCIETY OF AMERICA (United States)

Author keywords

Feature compensation; Noise robust speech recognition; Polynomial regression; Signal to noise ratio (SNR) estimation

Indexed keywords

FEATURE COMPENSATION; NOISE ROBUST SPEECH RECOGNITION; POLYNOMIAL REGRESSION; SIGNAL-TO-NOISE RATIO (SIR) ESTIMATION;

ALGORITHMS; AUTOMATION; NOISE ABATEMENT; POLYNOMIAL APPROXIMATION; REGRESSION ANALYSIS; SIGNAL PROCESSING; SIGNAL TO NOISE RATIO;

SPEECH RECOGNITION;

EID: 27744539597 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.853002 Document Type: Article

Times cited : (53)

References (26)

1
- 0029288202
- Speech recognition in noisy environments: A survey
- Y. Gong, "Speech recognition in noisy environments: a survey," Speech Commun., vol. 16, pp. 261-291, 1995.
- (1995) Speech Commun. , vol.16 , pp. 261-291
- Gong, Y.¹

2
- 0018455310
- Suppression of acoutic noise in speech using spectral subtraction
- S. Boll, "Suppression of acoutic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, no. 2, pp. 113-120, 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.¹

3
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Amer., vol. 55, no. 6, pp. 1304-1312, 1974.
- (1974) J. Acoust. Soc. Amer. , vol.55 , Issue.6 , pp. 1304-1312
- Atal, B.¹

4
- 0004319970
- Norwell, MA: Kluwer
- A. Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition. Norwell, MA: Kluwer, 1992.
- (1992) Acoustical and Environmental Robustness in Automatic Speech Recognition
- Acero, A.¹

5
- 2442551863
- Estimating cepstrum of speech under the presentee of noise using a joint prior of static and dynamic features
- May
- L. Deng, J. Droppo, and A. Acero, "Estimating cepstrum of speech under the presentee of noise using a joint prior of static and dynamic features," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 218-233, May 2004.
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.3 , pp. 218-233
- Deng, L.¹ Droppo, J.² Acero, A.³

6
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, pp. 578-589, 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

7
- 0025041264
- Perceptual linear prediction (PLP) analysis of speech
- H. Hermansky, "Perceptual linear prediction (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

8
- 85009242725
- Evaluation of a noise-robust DSR front-end on aurora databases
- D. Macho, L. Mauuary, B. Noe, Y. Cheng, D. Ealey, D. Jouvet, H. Kelleher, D. Perace, and F. Saadoun, "Evaluation of a noise-robust DSR front-end on aurora databases," in Proc. Int. Conf. Spoken Language Processing, 2002, pp. 17-20.
- (2002) Proc. Int. Conf. Spoken Language Processing , pp. 17-20
- Macho, D.¹ Mauuary, L.² Noe, B.³ Cheng, Y.⁴ Ealey, D.⁵ Jouvet, D.⁶ Kelleher, H.⁷ Perace, D.⁸ Saadoun, F.⁹

9
- 4544267485
- Evaluation of noise robust features on the Aurora databases
- X. Cui, M. Iseli, Q. Zhu, and A. Alwan, "Evaluation of noise robust features on the Aurora databases," in Proc. Int. Conf. Spoken Language Processing, 2002, pp. 481-484.
- (2002) Proc. Int. Conf. Spoken Language Processing , pp. 481-484
- Cui, X.¹ Iseli, M.² Zhu, Q.³ Alwan, A.⁴

10
- 0031238095
- A model of dynamic auditory perception and its application to robust word recognition
- B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, pp. 451-464, 1997.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , pp. 451-464
- Strope, B.¹ Alwan, A.²

11
- 85009110489
- Amplitude demodulation of speech and its application to noise robust speech recognition
- Q. Zhu and A. Alwan, "Amplitude demodulation of speech and its application to noise robust speech recognition," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 341-344.
- (2000) Proc. Int. Conf. Spoken Language Processing , pp. 341-344
- Zhu, Q.¹ Alwan, A.²

12
- 0033690878
- On the use of variable frame rate analysis in speech recognition
- _, "On the use of variable frame rate analysis in speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2000, pp. 1783-1786.
- (2000) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 1783-1786

13
- 0030245128
- Robust continuous speech recognition using parallel model combination
- M. Gales and S. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, pp. 352-359, 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 352-359
- Gales, M.¹ Young, S.²

14
- 65549153550
- Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA
- P. Moreno, "Speech Recognition in Noisy Environments," Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA, 1996.
- (1996) Speech Recognition in Noisy Environments
- Moreno, P.¹

15
- 0030365580
- Cepstral compensation by polynomial approximation for environment-independent speech recognition
- B. Raj, E. Gouvea, P. Moreno, and R. Stern, "Cepstral compensation by polynomial approximation for environment-independent speech recognition," in Proc. Int. Conf. Spoken Language Processing, 1996, pp. 2340-2343.
- (1996) Proc. Int. Conf. Spoken Language Processing , pp. 2340-2343
- Raj, B.¹ Gouvea, E.² Moreno, P.³ Stern, R.⁴

16
- 0030649027
- Jacobian approach to fast acoustic model adaptation
- S. Sagayama, Y. Yamaguchi, S. Takahashi, and J. Takahashi, "Jacobian approach to fast acoustic model adaptation," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1997, pp. 835-838.
- (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 835-838
- Sagayama, S.¹ Yamaguchi, Y.² Takahashi, S.³ Takahashi, J.⁴

17
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

18
- 0347899508
- Piecewise-linear transformation-based HMM adaptation for noisy speech
- Z. Zhang and S. Furui, "Piecewise-linear transformation-based HMM adaptation for noisy speech," Speech Commun., vol. 42, pp. 43-58, 2004.
- (2004) Speech Commun. , vol.42 , pp. 43-58
- Zhang, Z.¹ Furui, S.²

19
- 0141480132
- Variable parameter Gaussian mixture hidden Markov modeling for speech recognition
- X. Cui and Y. Gong, "Variable parameter Gaussian mixture hidden Markov modeling for speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, 2003, pp. 12-15.
- (2003) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 12-15
- Cui, X.¹ Gong, Y.²

20
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

21
- 0036476654
- Noise-dependent Gaussian mixture classifers for robust rejection decision
- Mar.
- Y. Gong, "Noise-dependent Gaussian mixture classifers for robust rejection decision," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 57-64, Mar. 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.2 , pp. 57-64
- Gong, Y.¹

22
- 2142756950
- Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise
- May
- L. Deng, J. Droppo, and A. Acero, "Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 133-143, May 2004.
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.3 , pp. 133-143
- Deng, L.¹ Droppo, J.² Acero, A.³

23
- 0036755378
- The effect of additive noise on speech amplitude spectra: A quantitative analysis
- Sep.
- Q. Zhu and A. Alwan, "The effect of additive noise on speech amplitude spectra: A quantitative analysis," IEEE Signal Process. Lett., vol. 9, no. 9, pp. 275-277, Sep. 2002.
- (2002) IEEE Signal Process. Lett. , vol.9 , Issue.9 , pp. 275-277
- Zhu, Q.¹ Alwan, A.²

24
- 85135379452
- An efficient algorithm to estimate instantanous SNR of speech signals
- R. Martin, "An efficient algorithm to estimate instantanous SNR of speech signals," in Proc. Eur. Conf. Speech Communication Technology, 1993, pp. 1093-1096.
- (1993) Proc. Eur. Conf. Speech Communication Technology , pp. 1093-1096
- Martin, R.¹

25
- 0038669544
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- H. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ASR2000 Int. Workshop on Automatic Speech Recognition, 2000, pp. 181-188.
- (2000) Proc. ASR2000 Int. Workshop on Automatic Speech Recognition , pp. 181-188
- Hirsch, H.¹ Pearce, D.²

26
- 4544219816
- Cambridge, U.K.: Cambridge Univ. Press
- The HTK Book (Version 3.1). Cambridge, U.K.: Cambridge Univ. Press, 2001.
- (2001) The HTK Book (Version 3.1)

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.