SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 18, Issue 8, 2010, Pages 1889-1901

Noise adaptive training for robust automatic speech recognition

(4) Kalinli, Ozlem (a); Seltzer, Michael L. (b); Droppo, Jasha (b); Acero, Alex (b)

a Sony Computer Entertainment Inc (United States)

b Microsoft Research (United States)

Author keywords

Model adaptation; noise adaptive training; robust speech recognition; vector Taylor series (VTS)

Indexed keywords

ACOUSTIC MODEL; ADAPTIVE TRAINING; AURORA 3; AUTOMATIC SPEECH RECOGNITION; CLEAN SPEECH; FEATURE ENHANCEMENT; MODEL ADAPTATION; MODEL PARAMETERS; MODEL TRAINING; NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION; POINT ESTIMATE; ROBUST SPEECH RECOGNITION; TEST TIME; TRAINING DATA; VECTOR TAYLOR SERIES; VECTOR TAYLOR SERIES (VTS);

DECODING; TAYLOR SERIES;

SPEECH RECOGNITION;

EID: 77956296425 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2040522 Document Type: Article

Times cited : (60)

References (30)

1
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr.
- S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.¹

2
- 51449089990
- A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition
- Las Vegas, NV
- D. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero, "A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition," in Proc. ICASSP, Las Vegas, NV, 2008, pp. 4041-4044.
- (2008) Proc. ICASSP , pp. 4041-4044
- Yu, D.¹ Deng, L.² Droppo, J.³ Wu, J.⁴ Gong, Y.⁵ Acero, A.⁶

3
- 85009070292
- Large-vocabulary speech recognition under adverse acoustic environments
- Beijing, China
- L. Deng, A. Acero, M. Plumpe, and X. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, Beijing, China, 2000, pp. 806-809.
- (2000) Proc. ICSLP , pp. 806-809
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.⁴

4
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

5
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
- Gauvain, J.-L.¹ Lee, C.-H.²

6
- 0029375590
- Speaker adaptation using constrained estimation of Gaussian mixtures
- Sep.
- V. Digalakis, D. Rtischev, L. Neumeyer, and E. Sa, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
- Digalakis, V.¹ Rtischev, D.² Neumeyer, L.³ Sa, E.⁴

7
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
- (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
- Gales, M.J.F.¹

8
- 77956283772
- Regularized feature-based maximum likelihood linear regression for speech recognition
- Antwerp, Belgium
- M. K. Omar, "Regularized feature-based maximum likelihood linear regression for speech recognition," in Proc. Interspeech, Antwerp, Belgium, 2007, pp. 1561-1564.
- (2007) Proc. Interspeech , pp. 1561-1564
- Omar, M.K.¹

9
- 85009088984
- Robust digit recognition in noisy environments: The IBM Aurora 2 system
- Aalborg, Denmark
- G. Saon, J. M. Huerta, and E. E. Jan, "Robust digit recognition in noisy environments: The IBM Aurora 2 system," in Proc. Interspeech, Aalborg, Denmark, 2001, pp. 629-632.
- (2001) Proc. Interspeech , pp. 629-632
- Saon, G.¹ Huerta, J.M.² Jan, E.E.³

10
- 27744539597
- Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
- Nov.
- X. Cui and A. Alwan, "Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR," IEEE Trans. Speech Audio Processing, vol. 13, no. 6, pp. 1161-1172, Nov. 2005.
- (2005) IEEE Trans. Speech Audio Processing , vol.13 , Issue.6 , pp. 1161-1172
- Cui, X.¹ Alwan, A.²

11
- 0030245128
- Robust continuous speech recognition using parallel model combination
- Sep.
- M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, Sep. 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

12
- 0029725301
- A vector Taylor series approach for environment-independent speech recognition
- P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. ICASSP, 1996, pp. 733-736.
- (1996) Proc. ICASSP , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

13
- 0032048385
- Speech recognition in noisy environments using first-order vector Taylor series
- D. Y. Kim, C. K. Un, and N. S. Kim, "Speech recognition in noisy environments using first-order vector Taylor series," Speech Commun., vol. 24, no. 1, pp. 39-49, 1998.
- (1998) Speech Commun. , vol.24 , Issue.1 , pp. 39-49
- Kim, D.Y.¹ Un, C.K.² Kim, N.S.³

14
- 85009113852
- HMM adaptation using vector Taylor series for noisy speech recognition
- Beijing, China
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, Beijing, China, 2000, pp. 869-872.
- (2000) Proc. ICSLP , pp. 869-872
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

15
- 44849125798
- High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
- Kyoto, Japan
- J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU, Kyoto, Japan, 2007, pp. 65-70.
- (2007) Proc. ASRU , pp. 65-70
- Li, J.¹ Deng, L.² Yu, D.³ Gong, Y.⁴ Acero, A.⁵

16
- 0030362995
- A compact model for speaker-adaptive training
- Philadelphia, PA
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. ICSLP, Philadelphia, PA, 1996, pp. 1137-1140.
- (1996) Proc. ICSLP , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

17
- 44849122740
- Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions
- Antwerp, Belgium
- Y. Hu and Q. Huo, "Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions," in Proc. Interspeech, Antwerp, Belgium, 2007, pp. 1042-1045.
- (2007) Proc. Interspeech , pp. 1042-1045
- Hu, Y.¹ Huo, Q.²

18
- 34547528168
- Adaptive training with joint uncertainty decoding for robust recognition of noisy data
- Honolulu, HI
- H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP, Honolulu, HI, 2007, pp. 389-392.
- (2007) Proc. ICASSP , pp. 389-392
- Liao, H.¹ Gales, M.J.F.²

19
- 33745202806
- Joint uncertainty decoding for noise robust speech recognition
- Lisbon, Portugal
- H. Liao and M. J. F. Gales, "Joint uncertainty decoding for noise robust speech recognition," in Proc. Interspeech, Lisbon, Portugal, 2005, pp. 3129-3132.
- (2005) Proc. Interspeech , pp. 3129-3132
- Liao, H.¹ Gales, M.J.F.²

20
- 0004319970
- Norwell MA: Kluwer
- A. Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition. Norwell, MA: Kluwer, 1993.
- (1993) Acoustical and Environmental Robustness in Automatic Speech Recognition
- Acero, A.¹

21
- 0346126988
- Robust speech recognition in noise: Performance of the IBM continuous speech recognizer on the ARPA noise spoke task
- R. A. Gopinath, M. Gales, P. S. Gopalakrishnan, S. Balakrishnan-Aiyer, and M. A. Picheny, "Robust speech recognition in noise: Performance of the IBM continuous speech recognizer on the ARPA noise spoke task," in Proc. ARPA Workshop Spoken Lang. Syst. Technol., 1995.
- (1995) Proc. ARPA Workshop Spoken Lang. Syst. Technol.
- Gopinath, R.A.¹ Gales, M.² Gopalakrishnan, P.S.³ Balakrishnan-Aiyer, S.⁴ Picheny, M.A.⁵

22
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

23
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

24
- 0004041275
- Englewood Cliffs, NJ: Prentice-Hall
- J. E. Dennis and R. B. Schnabel, Numerical Methods for Unconstrained Optimization and Nonlinear Equations. Englewood Cliffs, NJ: Prentice-Hall, 1983.
- (1983) Numerical Methods for Unconstrained Optimization and Nonlinear Equations
- Dennis, J.E.¹ Schnabel, R.B.²

25
- 34547537573
- Cambridge Univ., Tech. Rep. CUED/F-INFENG/TR-522
- H. Liao and M. J. F. Gales, "Joint uncertainty decoding for robust large vocabulary speech recognition," Cambridge Univ., 2006, Tech. Rep. CUED/F-INFENG/TR-522.
- (2006) Joint Uncertainty Decoding for Robust Large Vocabulary Speech Recognition
- Liao, H.¹ Gales, M.J.F.²

26
- 0003483593
- Cambridge, U.K.: Univ. of Cambridge, Dept. of Eng.
- S. J. Young, The HTK Hidden Markov Model Toolkit: Design and Philosophy. Cambridge, U.K.: Univ. of Cambridge, Dept. of Eng., 1994.
- (1994) The HTK Hidden Markov Model Toolkit: Design and Philosophy
- Young, S.J.¹

27
- 0038669544
- The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Paris, France Sep.
- H. G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR, Paris, France, Sep. 2000, pp. 181-188.
- (2000) Proc. ISCA ITRW ASR , pp. 181-188
- Hirsch, H.G.¹ Pearce, D.²

28
- 77949413917
- Online, Available
- D. Pierce and A. Gunawardana, "Aurora 2.0 Speech Recognition in Noise: Update 2. Complex Backend Definition for Aurora 2.0," 2002 [Online]. Available: http://icslp2002.colorado.edu/special-sessions/aurora
- (2002) Aurora 2.0 Speech Recognition in Noise: Update 2. Complex Backend Definition for Aurora 2.0
- Pierce, D.¹ Gunawardana, A.²

29
- 85009223874
- Speechdat-car: A large speech database for automotive environments
- Athens, Greece
- A. Moreno, B. Lindberg, C. Draxler, G. Richard, K. Choukri, J. Allen, and S. Euler, "Speechdat-car: A large speech database for automotive environments," in Proc. LREC, Athens, Greece, 2000.
- (2000) Proc. LREC
- Moreno, A.¹ Lindberg, B.² Draxler, C.³ Richard, G.⁴ Choukri, K.⁵ Allen, J.⁶ Euler, S.⁷

30
- 85009242725
- Evaluation of a noise-robust DSR front-end on Aurora databases
- Denver, CO
- D. Macho, L. Mauuary, B. Noé, Y. M. Cheng, D. Ealey, D. Jouvet, H. Kelleher, D. Pearce, and F. Saadoun, "Evaluation of a noise-robust DSR front-end on Aurora databases," in Proc. ICSLP, Denver, CO, 2002, pp. 17-20.
- (2002) Proc. ICSLP , pp. 17-20
- Macho, D.¹ Mauuary, L.² Noé, B.³ Cheng, Y.M.⁴ Ealey, D.⁵ Jouvet, D.⁶ Kelleher, H.⁷ Pearce, D.⁸ Saadoun, F.⁹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.