SCOPUS 정보 검색 플랫폼

Techniques for Noise Robustness in Automatic Speech Recognition

Volumn , Issue , 2012, Pages 229-250

Feature Compensation

Author keywords

Acoustic and signals using microphone, linear nonlinear effects; AURORA digital speech recognition, and system evaluation tasks; Discriminative SPLICE and MMI, Aurora 4 and Rprop iterations; Discriminative SPLICE in shortcomings, noise into features; Feature compensation, ideal from observed noisy speech features; Feature enhancement, information and distortion removal; Joint distribution of clean noisy speech model, GMM; MBFE utterance by utterance, SLDM and speech or noise cepstra; MMSE SPLICE in noisy speech, word error rate on Aurora 2; VTS feature enhancement, for processing power and distortion

Indexed keywords

EID: 84886120743 PISSN: None EISSN: None Source Type: Book
DOI: 10.1002/9781118392683.ch9 Document Type: Chapter

Times cited : (3)

References (24)

1
- 0026385284
- Robust speech recognition by normalization of the acoustic space
- A. Acero and R. M. Stern, "Robust speech recognition by normalization of the acoustic space," in Proceedings of the IEEE ICASSP, vol. 2, pp. 893-896, 1991.
- (1991) Proceedings of the IEEE ICASSP. , vol.2 , pp. 893-896
- Acero, A.¹ Stern, R.M.²

2
- 84886205387
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- International Conference on Acoustics
- L. R. Bahl, P. F. Brown, P. V. D. Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition," in International Conference on Acoustics, Speech and Signal Processing, 1997.
- (1997) Speech and Signal Processing
- Bahl, L.R.¹ Brown, P.F.² Souza, P.V.D.³ Mercer, R.L.⁴

3
- 0036226165
- Noise estimation by minima controlled recursive averaging for robust speech enhancement
- I. Cohen and B. Berdugo, "Noise estimation by minima controlled recursive averaging for robust speech enhancement," IEEE Signal Processing Letters, vol. 9, no. 1, pp. 12-15, 2002.
- (2002) IEEE Signal Processing Letters , vol.9 , Issue.1 , pp. 12-15
- Cohen, I.¹ Berdugo, B.²

4
- 85009284508
- "Log-domain speech feature enhancement using sequential map noise estimate and a nonlinear model of acoustic environment,"
- Denver, CO, September
- L. Deng, J. Droppo, and A. Acero, "Log-domain speech feature enhancement using sequential map noise estimate and a nonlinear model of acoustic environment," in Proceedings of the ICSLP, Denver, CO, September 2002.
- (2002) Proceedings of the ICSLP.
- Deng, L.¹ Droppo, J.² Acero, A.³

5
- 2442551863
- "Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features,"
- May
- L. Deng, J. Droppo, and A. Acero, "Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features," IEEE Transactions on Speech and Audio Processing, vol. 12, no. 3, pp. 218-233, May 2004.
- (2004) IEEE Transactions on Speech and Audio Processing , vol.12 , Issue.3 , pp. 218-233
- Deng, L.¹ Droppo, J.² Acero, A.³

6
- 0029375590
- "Speaker adaptation using constrained estimation of gaussian mixtures,"
- September
- V. V. Digalakis, D. Rtischev, and L. G. Neumeyer, "Speaker adaptation using constrained estimation of gaussian mixtures," IEEE Transactions on Speech and Audio Processing, vol. 3, pp. 357-366, September 1995.
- (1995) IEEE Transactions on Speech and Audio Processing , vol.3 , pp. 357-366
- Digalakis, V.V.¹ Rtischev, D.² Neumeyer, L.G.³

7
- 33947702149
- "Joint discriminative front end and back end training for improved speech recognition accuracy,"
- Speech and Signal Processing, Toulouse, France, May
- J. Droppo and A. Acero, "Joint discriminative front end and back end training for improved speech recognition accuracy," in International Conference on Acoustics, Speech and Signal Processing, Toulouse, France, May 2006.
- (2006) International Conference on Acoustics
- Droppo, J.¹ Acero, A.²

8
- 4544236840
- "Noise robust speech recognition with a switching linear dynamic model,"
- Speech and Signal Processing, Montreal, Canada, May
- J. Droppo and A. Acero, "Noise robust speech recognition with a switching linear dynamic model," in International Conference on Acoustics, Speech and Signal Processing, Montreal, Canada, May 2004.
- (2004) International Conference on Acoustics
- Droppo, J.¹ Acero, A.²

9
- 0036291376
- "Uncertainty decoding with SPLICE for noise robust speech recognition,"
- Orlando, Florida, May
- J. Droppo, A. Acero, and L. Deng, "Uncertainty decoding with SPLICE for noise robust speech recognition," in Proceedings of the 2002 ICASSP, Orlando, Florida, May 2002.
- (2002) Proceedings of the 2002 ICASSP.
- Droppo, J.¹ Acero, A.² Deng, L.³

10
- 85009265626
- Evaluation of SPLICE on the Aurora 2 and 3 tasks
- J. Droppo, L. Deng, and A. Acero, "Evaluation of SPLICE on the Aurora 2 and 3 tasks," in Proceedings of the ICSLP, pp. 29-32, 2002.
- (2002) Proceedings of the ICSLP. , pp. 29-32
- Droppo, J.¹ Deng, L.² Acero, A.³

11
- 54349123450
- "A comparison of three non-linear observation models for noisy speech features,"
- Geneva, Switzerland, September
- J. Droppo, L. Deng, and A. Acero, "A comparison of three non-linear observation models for noisy speech features," in Proceedings of the 2003 Eurospeech, Geneva, Switzerland, September 2003, pp. 681-684.
- (2003) Proceedings of the 2003 Eurospeech , pp. 681-684
- Droppo, J.¹ Deng, L.² Acero, A.³

12
- 33846265376
- "How to train a discriminative front end with stochastic gradient descent and maximum mutual information,"
- J. Droppo, M. Mahajan, A. Gunawardana, and A. Acero, "How to train a discriminative front end with stochastic gradient descent and maximum mutual information," in Proceedings of the IEEE ASRU, 2005.
- (2005) Proceedings of the IEEE ASRU.
- Droppo, J.¹ Mahajan, M.² Gunawardana, A.³ Acero, A.⁴

13
- 85009135386
- "Investigations into tandem acoustic modeling for the Aurora task,"
- D. Ellis and M. Gomez, "Investigations into tandem acoustic modeling for the Aurora task," in Proceedings of the Eurospeech, 2001, pp. 189-192.
- (2001) Proceedings of the Eurospeech , pp. 189-192
- Ellis, D.¹ Gomez, M.²

14
- 85009074657
- "ALGONQUIN: Iterating Laplace'smethod to remove multiple types of acoustic distortion for robust speech recognition,"
- Aalbork, Denmark, September
- B. Frey, L. Deng, A. Acero, and T. Kristjansson, "ALGONQUIN: Iterating Laplace'smethod to remove multiple types of acoustic distortion for robust speech recognition," in Proceedings of the 2001 Eurospeech, Aalbork, Denmark, September 2001.
- (2001) Proceedings of the 2001 Eurospeech
- Frey, B.¹ Deng, L.² Acero, A.³ Kristjansson, T.⁴

15
- 0031139839
- Minimum classification error rate methods for speech recognition
- B.-H. Juang,W. Hou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 5, pp. 257-265, 1997.
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , pp. 257-265
- Juang, B.H.¹ Hou, W.² Lee, C.H.³

16
- 0028531044
- Prototype-based minimum classification error/generalized probabilistic descent training for various speech units
- E. McDermott and S. Katagiri, "Prototype-based minimum classification error/generalized probabilistic descent training for various speech units," Computer Speech & Language, vol. 8, pp. 351-368, 1994.
- (1994) Computer Speech & Language , vol.8 , pp. 351-368
- McDermott, E.¹ Katagiri, S.²

17
- 65549153550
- Speech recognition in noisy environments
- Carnegie Mellon University
- P. Moreno, "Speech recognition in noisy environments," PhD dissertation, Carnegie Mellon University, 1996.
- (1996) PhD dissertation
- Moreno, P.¹

18
- 0028996861
- "Multivariate-gaussian-based cepstral normalization for robust speech recognition,"
- Speech and Signal Processing
- P. J. Moreno, B. Raj, E. Gouvea, and R. M. Stern, "Multivariate-gaussian-based cepstral normalization for robust speech recognition," in International Conference on Acoustics, Speech and Signal Processing, 1995, pp. 137-140.
- (1995) International Conference on Acoustics , pp. 137-140
- Moreno, P.J.¹ Raj, B.² Gouvea, E.³ Stern, R.M.⁴

19
- 0003459132
- Hidden Markov models, maximum mutual information estimation and the speech recognition problem
- McGill University
- Y. Normandin, "Hidden Markov models, maximum mutual information estimation and the speech recognition problem," PhD dissertation, McGill University, 1991.
- (1991) PhD dissertation
- Normandin, Y.¹

20
- 4544265717
- Discriminative training for large vocabulary speech recognition
- Cambridge University
- D. Povey, "Discriminative training for large vocabulary speech recognition," PhD dissertation, Cambridge University, 2003.
- (2003) PhD dissertation
- Povey, D.¹

21
- 4544365937
- "On tracking noise with linear dynamical system models,"
- Speech and Signal Processing, Montreal, Canada, May
- B. Raj, R. Singh, and R. Stern, "On tracking noise with linear dynamical system models," in International Conference on Acoustics, Speech and Signal Processing, Montreal, Canada, May 2004.
- (2004) International Conference on Acoustics
- Raj, B.¹ Singh, R.² Stern, R.³

22
- 84943274699
- "A direct adaptive method for faster backpropagation learning: The RPROP algorithm,"
- M. Riedmiller and H. Braun, "A direct adaptive method for faster backpropagation learning: The RPROP algorithm," in IEEE International Conference on Neural Networks, vol. 1, 1993, pp. 586-91.
- (1993) IEEE International Conference on Neural Networks , vol.1 , pp. 586-91
- Riedmiller, M.¹ Braun, H.²

23
- 67650135931
- Recognition of noisy speech: A comparative survey of robust model architecture and feature enhancement
- Speech and Music Processing
- B. Schuller, M.Wollmer, T. Moosmayr, and G. Rigoll, "Recognition of noisy speech: A comparative survey of robust model architecture and feature enhancement," EURASIP Journal on Audio, Speech and Music Processing, 2009.
- (2009) EURASIP Journal on Audio
- Schuller, B.¹ Wollmer, M.² Moosmayr, T.³ Rigoll, G.⁴

24
- 85009228863
- "Robust speech recognition using model-based feature enhancement,"
- Geneva, Switzerland, September
- V. Stouten, H. Van hamme, K. Demuynck, and P. Wambacq, "Robust speech recognition using model-based feature enhancement," in Proceedings of the 2003 Eurospeech,Geneva, Switzerland, September 2003, pp. 17-20.
- (2003) Proceedings of the 2003 Eurospeech , pp. 17-20
- Stouten, V.¹ Van Hamme, H.² Demuynck, K.³ Wambacq, P.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.