SCOPUS 정보 검색 플랫폼

Springer Handbooks

Volumn , Issue , 2008, Pages 653-680

Environmental Robustness

(2) Droppo, Jasha a Acero, Alex a

a MICROSOFT RESEARCH (United States)

Author keywords

Acoustic Model; Clean Speech; Noisy Speech; Speech Enhancement; Speech Recognition System

Indexed keywords

EID: 84901773892 PISSN: 25228692 EISSN: 25228706 Source Type: Book Series
DOI: 10.1007/978-3-540-49127-9_33 Document Type: Chapter

Times cited : (63)

References (56)

1
- 0038669544
- The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
- H.G. Hirsch, D. Pearce: The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions, ISCA ITRW ASR2000 “Automatic Speech Recognition: Challenges for the Next Millennium” (2000)
- (2000) ISCA ITRW ASR2000 “Automatic Speech Recognition: Challenges for the Next Millennium”
- Hirsch, H.G.¹ Pearce, D.²

2
- 0005908591
- Linguistic Data Consortium, Philadelphia
- R.G. Leonard, G. Doddington: Tidigits (Linguistic Data Consortium, Philadelphia 1993)
- (1993) Tidigits
- Leonard, R.G.¹ Doddington, G.²

3
- 77949413917
- D. Pierce, A. Gunawardana: Aurora 2.0 speech recognition in noise: Update 2. Complex backend definition for Aurora 2.0, http://icslp2002.colorado. edu/special_sessions/aurora (2002)
- (2002) Aurora 2.0 Speech Recognition in Noise: Update 2. Complex Backend Definition for Aurora 2.0
- Pierce, D.¹ Gunawardana, A.²

4
- 85009223874
- SpeechDat-Car: A large speech database for automotive environments
- A. Moreno, B. Lindberg, C. Draxler, G. Richard, K. Choukri, J. Allen, S. Euler: SpeechDat-Car: A large speech database for automotive environments, Proc. 2nd Int. Conf. Language Resources and Evaluation (2000)
- (2000) Proc. 2Nd Int. Conf. Language Resources and Evaluation
- Moreno, A.¹ Lindberg, B.² Draxler, C.³ Richard, G.⁴ Choukri, K.⁵ Allen, J.⁶ Euler, S.⁷

5
- 70450143974
- Linguistic Data Consortium, Philadelphia
- J. Garofalo, D. Graff, D. Paul, D. Pallett: CSR-I (WSJ0) Complete (Linguistic Data Consortium, Philadelphia 1993)
- (1993) CSR-I (WSJ0) Complete
- Garofalo, J.¹ Graff, D.² Paul, D.³ Pallett, D.⁴

6
- 0004319968
- Tech. Rep. Defence Evaluation and Research Agency (DERA) (Speech Research Unit, Malvern
- A. Varga, H.J.M. Steeneken, M. Tomlinson, D. Jones: The NOISEX-92 study on the effect of additive noise on automatic speech recognition. Tech. Rep. Defence Evaluation and Research Agency (DERA) (Speech Research Unit, Malvern 1992)
- (1992) The NOISEX-92 Study on the Effect of Additive Noise on Automatic Speech Recognition
- Varga, A.¹ Steeneken, H.J.M.² Tomlinson, M.³ Jones, D.⁴

7
- 84981722479
- Linguistic Data Consortium, Philadelphia
- A. Schmidt-Nielsen: Speech in Noisy Environments (SPINE) Evaluation Audio (Linguistic Data Consortium, Philadelphia 2000)
- (2000) Speech in Noisy Environments (SPINE) Evaluation Audio
- Schmidt-Nielsen, A.¹

8
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- pp
- R.P. Lippmann, E.A. Martin, D.P. Paul: Multi-style training for robust isolated-word speech recognition, Proc. IEEE ICASSP (1987) pp. 709– 712
- (1987) Proc. IEEE ICASSP , pp. 709-712
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.P.³

9
- 0033693211
- Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation
- pp
- M. Matassoni, M. Omologo, D. Giuliani: Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation, Proc. IEEE ICASSP (2000) pp. 1407–1410
- (2000) Proc. IEEE ICASSP , pp. 1407-1410
- Matassoni, M.¹ Omologo, M.² Giuliani, D.³

10
- 85009088984
- Robust digit recognition in noisy environments: The Aurora 2 system
- G. Saon, J.M. Huerta, E.-E. Jan: Robust digit recognition in noisy environments: The Aurora 2 system, Proc. Eurospeech 2001 (2001)
- (2001) Proc. Eurospeech 2001
- Saon, G.¹ Huerta, J.M.² Jan, E.-E.³

11
- 85009265586
- Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases
- C.-P. Chen, K. Filali, J.A. Bilmes: Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases, Int. Conf. Spoken Language Process. (2002)
- (2002) Int. Conf. Spoken Language Process
- Chen, C.-P.¹ Filali, K.² Bilmes, J.A.³

12
- 0016067897
- Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
- B.S. Atal: Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification, J. Acoust. Soc. Am. 55(6), 1304–1312 (1974)
- (1974) J. Acoust. Soc. Am. , vol.55 , Issue.6 , pp. 1304-1312
- Atal, B.S.¹

13
- 0036289676
- Acoustic diversity for improved speech recognition in reverberant environments
- B.W. Gillespie, L.E. Atlas: Acoustic diversity for improved speech recognition in reverberant environments, Proc. IEEE ICASSP I, 557–560 (2002)
- (2002) Proc. IEEE ICASSP I , pp. 557-560
- Gillespie, B.W.¹ Atlas, L.E.²

14
- 18744371585
- Histogram equalization of speech representation for robust speech recognition
- A. de la Torre, A.M. Peinado, J.C. Segura, J.L. Perez-Cordoba, M.C. Benítez, A.J. Rubio: Histogram equalization of speech representation for robust speech recognition, IEEE Trans. Speech Audio Process. 13(3), 355–366 (2005)
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.3 , pp. 355-366
- de la Torre, A.¹ Peinado, A.M.² Segura, J.C.³ Perez-Cordoba, J.L.⁴ Benítez, M.C.⁵ Rubio, A.J.⁶

15
- 0029769867
- Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
- M.G. Rahim, B.H. Juang: Signal bias removal by maximum likelihood estimation for robust telephone speech recognition, IEEE Trans. Speech Audio Process. 4(1), 19–30 (1996)
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
- Rahim, M.G.¹ Juang, B.H.²

16
- 0009902452
- Augmented cepstral normalization for robust speech recognition
- A. Acero, X.D. Huang: Augmented cepstral normalization for robust speech recognition, Proc. IEEE Workshop on Automatic Speech Recognition (1995)
- (1995) Proc. IEEE Workshop on Automatic Speech Recognition
- Acero, A.¹ Huang, X.D.²

17
- 23344452899
- Statistical voice activity detection using a multiple observation likelihood ratio test
- J. Ramírez, J.C. Segura, C. Benítez, L. García, A. Ru-bio: Statistical voice activity detection using a multiple observation likelihood ratio test, IEEE Signal Proc. Lett. 12(10), 689–692 (2005)
- (2005) IEEE Signal Proc. Lett. , vol.12 , Issue.10 , pp. 689-692
- Ramírez, J.¹ Segura, J.C.² Benítez, C.³ García, L.⁴ Ru-Bio, A.⁵

18
- 0028517164
- RASTA processing of speech
- H. Hermansky, N. Morgan: RASTA processing of speech, IEEE Trans. Speech Audio Process. 2(4), 578–589 (1994)
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

19
- 85009070292
- Large-vocabulary speech recognition under adverse acoustic environments
- L. Deng, A. Acero, M. Plumpe, X.D. Huang: Large-vocabulary speech recognition under adverse acoustic environments, Int. Conf. Spoken Language Process. (2000)
- (2000) Int. Conf. Spoken Language Process
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.D.⁴

20
- 0004319970
- Kluwer Academic, Boston
- A. Acero: Acoustical and Environmental Robustness in Automatic Speech Recognition (Kluwer Academic, Boston 1993)
- (1993) Acoustical and Environmental Robustness in Automatic Speech Recognition
- Acero, A.¹

21
- 65549153550
- Ph.D. Thesis, Carnegie Mellon University, Pittsburgh
- P. Moreno: Speech Recognition in Noisy Environments, Ph.D. Thesis (Carnegie Mellon University, Pittsburgh 1996)
- (1996) Speech Recognition in Noisy Environments
- Moreno, P.¹

22
- 0025628728
- Environmental robustness in automatic speech recognition
- pp
- A. Acero, R.M. Stern: Environmental robustness in automatic speech recognition, Proc. IEEE ICASSP (1990) pp. 849–852
- (1990) Proc. IEEE ICASSP , pp. 849-852
- Acero, A.¹ Stern, R.M.²

23
- 33745216251
- Maximum mutual information SPLICE transform for seen and unseen conditions
- J. Droppo, A. Acero: Maximum mutual information SPLICE transform for seen and unseen conditions, Proc. Interspeech Conf. (2005)
- (2005) Proc. Interspeech Conf.
- Droppo, J.¹ Acero, A.²

24
- 85009257847
- An environment compensated minimum classification error training approach and its evaluation on Aurora 2 database
- J. Wu, Q. Huo: An environment compensated minimum classification error training approach and its evaluation on Aurora 2 database, Proc. ICSLP 1, 453–456 (2002)
- (2002) Proc. ICSLP , vol.1 , pp. 453-456
- Wu, J.¹ Huo, Q.²

25
- 33646788786
- FMPE: Discriminatively trained features for speech recognition
- D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau, G. Zweig: fMPE: Discriminatively trained features for speech recognition, Proc. IEEE ICASSP (2005)
- (2005) Proc. IEEE ICASSP
- Povey, D.¹ Kingsbury, B.² Mangu, L.³ Saon, G.⁴ Soltau, H.⁵ Zweig, G.⁶

26
- 0023739472
- Noise reduction using connectionist models
- pp
- S. Tamura, A. Waibel: Noise reduction using connectionist models, Proc. IEEE ICASSP (1988) pp. 553–556
- (1988) Proc. IEEE ICASSP , pp. 553-556
- Tamura, S.¹ Waibel, A.²

27
- 0002127129
- Probabilistic optimum filtering for robust speech recognition
- L. Neumeyer, M. Weintraub: Probabilistic optimum filtering for robust speech recognition, Proc. IEEE ICASSP 1, 417–420 (1994)
- (1994) Proc. IEEE ICASSP , vol.1 , pp. 417-420
- Neumeyer, L.¹ Weintraub, M.²

28
- 0026385284
- Robust speech recognition by normalization of the acoustic space
- A. Acero, R.M. Stern: Robust speech recognition by normalization of the acoustic space, Proc. IEEE ICASSP 2, 893–896 (1991)
- (1991) Proc. IEEE ICASSP , vol.2 , pp. 893-896
- Acero, A.¹ Stern, R.M.²

29
- 0003671941
- Ph.D. Thesis, Cambridge University, Cambridge
- M.J. Gales: Model Based Techniques for Noise Robust Speech Recognition, Ph.D. Thesis (Cambridge University, Cambridge 1995)
- (1995) Model Based Techniques for Noise Robust Speech Recognition
- Gales, M.J.¹

30
- 84899031901
- Dual estimation and the unscented transformation
- ed. by S.A. Solla, T.K. Leen, K.R. Muller (MIT Press, Cambridge,) pp
- E.A. Wan, R.V.D. Merwe, A.T. Nelson: Dual estimation and the unscented transformation. In: Advances in Neural Information Processing Systems, ed. by S.A. Solla, T.K. Leen, K.R. Muller (MIT Press, Cambridge 2000) pp. 666–672
- (2000) Advances in Neural Information Processing Systems , pp. 666-672
- Wan, E.A.¹ Merwe, R.V.D.² Nelson, A.T.³

31
- 0029725301
- A vector taylor series approach for environment indepen- dent speech recognition
- pp
- P.J. Moreno, B. Raj, R.M. Stern: A vector taylor series approach for environment indepen- dent speech recognition, Proc. IEEE ICASSP (1996) pp. 733–736
- (1996) Proc. IEEE ICASSP , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

32
- 85009074657
- AL-GONQUIN: Iterating Laplace’s method to remove multiple types of acoustic distortion for robust speech recognition
- B.J. Frey, L. Deng, A. Acero, T. Kristjansson: AL-GONQUIN: Iterating Laplace’s method to remove multiple types of acoustic distortion for robust speech recognition, Proc. Eurospeech (2001)
- (2001) Proc. Eurospeech
- Frey, B.J.¹ Deng, L.² Acero, A.³ Kristjansson, T.⁴

33
- 85009211607
- A nonlinear observation model for removing noise from corrupted speech log mel-spectral energies
- J. Droppo, A. Acero, L. Deng: A nonlinear observation model for removing noise from corrupted speech log mel-spectral energies, Proc. Int. Conf. Spoken Language Process. (2002)
- (2002) Proc. Int. Conf. Spoken Language Process.
- Droppo, J.¹ Acero, A.² Deng, L.³

34
- 0033708118
- Model-based feature enhancement for noisy speech recognition
- C. Couvreur, H. Van Hamme: Model-based feature enhancement for noisy speech recognition, Proc. IEEE ICASSP 3, 1719–1722 (2000)
- (2000) Proc. IEEE ICASSP , vol.3 , pp. 1719-1722
- Couvreur, C.¹ van Hamme, H.²

35
- 4544236840
- Noise robust speech recognition with a switching linear dynamic model
- J. Droppo, A. Acero: Noise robust speech recognition with a switching linear dynamic model, Proc. IEEE ICASSP (2004)
- (2004) Proc. IEEE ICASSP
- Droppo, J.¹ Acero, A.²

36
- 4544365937
- On tracking noise with linear dynamical system models
- B. Raj, R. Singh, R. Stern: On tracking noise with linear dynamical system models, Proc. IEEE ICASSP 1, 965–968 (2004)
- (2004) Proc. IEEE ICASSP , vol.1 , pp. 965-968
- Raj, B.¹ Singh, R.² Stern, R.³

37
- 0036296866
- Jacobian joint adaptation to noise, channel and vocal tract length
- H. Shimodaira, N. Sakai, M. Nakai, S. Sagayama: Jacobian joint adaptation to noise, channel and vocal tract length, Proc. IEEE ICASSP 1, 197–200 (2002)
- (2002) Proc. IEEE ICASSP , vol.1 , pp. 197-200
- Shimodaira, H.¹ Sakai, N.² Nakai, M.³ Sagayama, S.⁴

38
- 54349123450
- A comparison of three non-linear observation models for noisy speech features
- J. Droppo, L. Deng, A. Acero: A comparison of three non-linear observation models for noisy speech features, Proc. Eurospeech Conf. (2003)
- (2003) Proc. Eurospeech Conf.
- Droppo, J.¹ Deng, L.² Acero, A.³

39
- 0346126988
- Robust speech recognition in noise – performance of the IBM continuous speech recognizer on the ARPA noise spoke task
- pp
- R.A. Gopinath, M.J.F. Gales, P.S. Gopalakrishnan, S. Balakrishnan-Aiyer, M.A. Picheny: Robust speech recognition in noise – performance of the IBM continuous speech recognizer on the ARPA noise spoke task, Proc. ARPA Workshop on Spoken Language Systems Technology (1995) pp. 127–133
- (1995) Proc. ARPA Workshop on Spoken Language Systems Technology , pp. 127-133
- Gopinath, R.A.¹ Gales, M.J.F.² Gopalakrishnan, P.S.³ Balakrishnan-Aiyer, S.⁴ Picheny, M.A.⁵

40
- 0025681008
- Hidden markov model decomposition of speech and noise
- pp
- A.P. Varga, R.K. Moore: Hidden markov model decomposition of speech and noise, Proc. IEEE ICASSP (1990) pp. 845–848
- (1990) Proc. IEEE ICASSP , pp. 845-848
- Varga, A.P.¹ Moore, R.K.²

41
- 85009113852
- HMM adaptation using vector taylor series for noisy speech recognition
- A. Acero, L. Deng, T. Kristjansson, J. Zhang: HMM adaptation using vector taylor series for noisy speech recognition, Int. Conf. Spoken Language Processing (2000)
- (2000) Int. Conf. Spoken Language Processing
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

42
- 84895879051
- Modeling non-verbal sounds for speech recognition
- pp
- W. Ward: Modeling non-verbal sounds for speech recognition, Proc. Speech and Natural Language Workshop (1989) pp. 311–318
- (1989) Proc. Speech and Natural Language Workshop , pp. 311-318
- Ward, W.¹

43
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- S.F. Boll: Suppression of acoustic noise in speech using spectral subtraction, IEEE T. Acoust. Speech 24(April), 113–120 (1979)
- (1979) IEEE T. Acoust. Speech , vol.24 April , pp. 113-120
- Boll, S.F.¹

44
- 0021892216
- Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
- Vol.,) pp
- Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, IEEE Trans. Acoust. Speech Signal Process., Vol. ASSP-33 (1985) pp. 443–445
- (1985) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-33 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

45
- 50449097354
- Ph.D. Thesis (K. U. Leuven, Leuven
- V. Stouten: Robust Automatic Speech Recognition in Time-varying Environments, Ph.D. Thesis (K. U. Leuven, Leuven 2006)
- (2006) Robust Automatic Speech Recognition in Time-Varying Environments
- Stouten, V.¹

46
- 0018320733
- Enhancement of speech corrupted by acoustic noise
- pp
- M. Berouti, R. Schwartz, J. Makhoul: Enhancement of speech corrupted by acoustic noise, Proc. IEEE ICASSP (1979) pp. 208–211
- (1979) Proc. IEEE ICASSP , pp. 208-211
- Berouti, M.¹ Schwartz, R.² Makhoul, J.³

47
- 38849170676
- distributed speech recognition; advanced front-end feature extraction algorithm
- ETSI ES 2002 050 Recommendation: Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm (2002)
- (2002) Speech Processing, Transmission and Quality Aspects (STQ)

48
- 85009242725
- Evaluation of a noise-robust DSR front-end on Aurora databases
- pp
- D. Macho, L. Mauuary, B. Noê, Y.M. Cheng, D. Ealey, D. Jouvet, H. Kelleher, D. Pearce, F. Saadoun: Evaluation of a noise-robust DSR front-end on Aurora databases, Proc. ICSLP (2002) pp. 17–20
- (2002) Proc. ICSLP , pp. 17-20
- Macho, D.¹ Mauuary, L.² Noê, B.³ Cheng, Y.M.⁴ Ealey, D.⁵ Jouvet, D.⁶ Kelleher, H.⁷ Pearce, D.⁸ Saadoun, F.⁹

49
- 4544245839
- Two-stage mel-warped Wiener filter for robust speech recognition
- A. Agarwal, Y.M. Cheng: Two-stage mel-warped Wiener filter for robust speech recognition, Proc. ASRU (1999)
- (1999) Proc. ASRU
- Agarwal, A.¹ Cheng, Y.M.²

50
- 85075935004
- Noise reduction for noise robust feature extraction for distributed speech recognition
- pp
- B. Noê, J. Sienel, D. Jouvet, L. Mauuary, L. Boves, J. de Veth, F. de Wet: Noise reduction for noise robust feature extraction for distributed speech recognition, Proc. Eurospeech (2001) pp. 201– 204
- (2001) Proc. Eurospeech , pp. 201-204
- Noê, B.¹ Sienel, J.² Jouvet, D.³ Mauuary, L.⁴ Boves, L.⁵ de Veth, J.⁶ de Wet, F.⁷

51
- 0034848706
- SNR-Dependent waveform processing for improving the robustness of ASR front-end
- pp
- D. Macho, Y.M. Cheng: SNR-Dependent waveform processing for improving the robustness of ASR front-end, Proc. IEEE ICASSP (2001) pp. 305– 308
- (2001) Proc. IEEE ICASSP , pp. 305-308
- Macho, D.¹ Cheng, Y.M.²

52
- 4544222091
- Blind equalization in the cepstral domain for robust telephone based speech recognition
- L. Mauuary: Blind equalization in the cepstral domain for robust telephone based speech recognition, Proc. EUSPICO 1, 359–363 (1998)
- (1998) Proc. EUSPICO , vol.1 , pp. 359-363
- Mauuary, L.¹

53
- 0742324997
- Sequential estimation with optimal forgetting for robust speech recognition
- M. Afify, O. Siohan: Sequential estimation with optimal forgetting for robust speech recognition, IEEE Trans. Speech Audio Process. 12(1), 19–26 (2004)
- (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.1 , pp. 19-26
- Afify, M.¹ Siohan, O.²

54
- 0036291376
- Uncertainty decoding with SPLICE for noise robust speech recognition
- J. Droppo, A. Acero, L. Deng: Uncertainty decoding with SPLICE for noise robust speech recognition, Proc. IEEE ICASSP (2002)
- (2002) Proc. IEEE ICASSP
- Droppo, J.¹ Acero, A.² Deng, L.³

55
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- M. Cooke, P. Green, L. Josifovski, A. Vizinho: Robust automatic speech recognition with missing and unreliable acoustic data, Speech Commun. 34(3), 267–285 (2001)
- (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

56
- 85009106519
- Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
- J.P. Barker, M. Cooke, P. Green: Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise, Proc. Eurospeech 2001, 213–216 (2001)
- (2001) Proc. Eurospeech , vol.2001 , pp. 213-216
- Barker, J.P.¹ Cooke, M.² Green, P.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.