



2008, Pages 653-680

Environmental Robustness

Author keywords

Acoustic Model; Clean Speech; Noisy Speech; Speech Enhancement; Speech Recognition System

EID: 84901773892     PISSN: 25228692     EISSN: 25228706     Source Type: Book Series    
DOI: 10.1007/978-3-540-49127-9_33     Document Type: Chapter
Times cited : (63)

References (56)
  • 2
    • 0005908591
    • Linguistic Data Consortium, Philadelphia
    • R.G. Leonard, G. Doddington: Tidigits (Linguistic Data Consortium, Philadelphia 1993)
    • (1993) Tidigits
    • Leonard, R.G.1    Doddington, G.2
  • 8
    • 0023263708
    • Multi-style training for robust isolated-word speech recognition
    • R.P. Lippmann, E.A. Martin, D.P. Paul: Multi-style training for robust isolated-word speech recognition, Proc. IEEE ICASSP (1987) pp. 709–712
    • (1987) Proc. IEEE ICASSP , pp. 709-712
    • Lippmann, R.P.1    Martin, E.A.2    Paul, D.P.3
  • 9
    • 0033693211
    • Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation
    • M. Matassoni, M. Omologo, D. Giuliani: Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation, Proc. IEEE ICASSP (2000) pp. 1407–1410
    • (2000) Proc. IEEE ICASSP , pp. 1407-1410
    • Matassoni, M.1    Omologo, M.2    Giuliani, D.3
  • 10
    • 85009088984
    • Robust digit recognition in noisy environments: The Aurora 2 system
    • G. Saon, J.M. Huerta, E.-E. Jan: Robust digit recognition in noisy environments: The Aurora 2 system, Proc. Eurospeech 2001 (2001)
    • (2001) Proc. Eurospeech 2001
    • Saon, G.1    Huerta, J.M.2    Jan, E.-E.3
  • 11
    • 85009265586
    • Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases
    • C.-P. Chen, K. Filali, J.A. Bilmes: Frontend post-processing and backend model enhancement on the Aurora 2.0/3.0 databases, Int. Conf. Spoken Language Process. (2002)
    • (2002) Int. Conf. Spoken Language Process
    • Chen, C.-P.1    Filali, K.2    Bilmes, J.A.3
  • 12
    • 0016067897
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B.S. Atal: Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification, J. Acoust. Soc. Am. 55(6), 1304–1312 (1974)
    • (1974) J. Acoust. Soc. Am. , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.S.1
  • 13
    • 0036289676
    • Acoustic diversity for improved speech recognition in reverberant environments
    • B.W. Gillespie, L.E. Atlas: Acoustic diversity for improved speech recognition in reverberant environments, Proc. IEEE ICASSP I, 557–560 (2002)
    • (2002) Proc. IEEE ICASSP I , pp. 557-560
    • Gillespie, B.W.1    Atlas, L.E.2
  • 15
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • M.G. Rahim, B.H. Juang: Signal bias removal by maximum likelihood estimation for robust telephone speech recognition, IEEE Trans. Speech Audio Process. 4(1), 19–30 (1996)
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.1 , pp. 19-30
    • Rahim, M.G.1    Juang, B.H.2
  • 17
    • 23344452899
    • Statistical voice activity detection using a multiple observation likelihood ratio test
    • J. Ramírez, J.C. Segura, C. Benítez, L. García, A. Rubio: Statistical voice activity detection using a multiple observation likelihood ratio test, IEEE Signal Proc. Lett. 12(10), 689–692 (2005)
    • (2005) IEEE Signal Proc. Lett. , vol.12 , Issue.10 , pp. 689-692
    • Ramírez, J.1    Segura, J.C.2    Benítez, C.3    García, L.4    Rubio, A.5
  • 22
    • 0025628728
    • Environmental robustness in automatic speech recognition
    • A. Acero, R.M. Stern: Environmental robustness in automatic speech recognition, Proc. IEEE ICASSP (1990) pp. 849–852
    • (1990) Proc. IEEE ICASSP , pp. 849-852
    • Acero, A.1    Stern, R.M.2
  • 23
    • 33745216251
    • Maximum mutual information SPLICE transform for seen and unseen conditions
    • J. Droppo, A. Acero: Maximum mutual information SPLICE transform for seen and unseen conditions, Proc. Interspeech Conf. (2005)
    • (2005) Proc. Interspeech Conf.
    • Droppo, J.1    Acero, A.2
  • 24
    • 85009257847
    • An environment compensated minimum classification error training approach and its evaluation on Aurora 2 database
    • J. Wu, Q. Huo: An environment compensated minimum classification error training approach and its evaluation on Aurora 2 database, Proc. ICSLP 1, 453–456 (2002)
    • (2002) Proc. ICSLP , vol.1 , pp. 453-456
    • Wu, J.1    Huo, Q.2
  • 26
    • 0023739472
    • Noise reduction using connectionist models
    • S. Tamura, A. Waibel: Noise reduction using connectionist models, Proc. IEEE ICASSP (1988) pp. 553–556
    • (1988) Proc. IEEE ICASSP , pp. 553-556
    • Tamura, S.1    Waibel, A.2
  • 27
    • 0002127129
    • Probabilistic optimum filtering for robust speech recognition
    • L. Neumeyer, M. Weintraub: Probabilistic optimum filtering for robust speech recognition, Proc. IEEE ICASSP 1, 417–420 (1994)
    • (1994) Proc. IEEE ICASSP , vol.1 , pp. 417-420
    • Neumeyer, L.1    Weintraub, M.2
  • 28
    • 0026385284
    • Robust speech recognition by normalization of the acoustic space
    • A. Acero, R.M. Stern: Robust speech recognition by normalization of the acoustic space, Proc. IEEE ICASSP 2, 893–896 (1991)
    • (1991) Proc. IEEE ICASSP , vol.2 , pp. 893-896
    • Acero, A.1    Stern, R.M.2
  • 30
    • 84899031901
    • Dual estimation and the unscented transformation
    • E.A. Wan, R.V.D. Merwe, A.T. Nelson: Dual estimation and the unscented transformation. In: Advances in Neural Information Processing Systems, ed. by S.A. Solla, T.K. Leen, K.R. Muller (MIT Press, Cambridge 2000) pp. 666–672
    • (2000) Advances in Neural Information Processing Systems , pp. 666-672
    • Wan, E.A.1    Merwe, R.V.D.2    Nelson, A.T.3
  • 31
    • 0029725301
    • A vector Taylor series approach for environment independent speech recognition
    • P.J. Moreno, B. Raj, R.M. Stern: A vector Taylor series approach for environment independent speech recognition, Proc. IEEE ICASSP (1996) pp. 733–736
    • (1996) Proc. IEEE ICASSP , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 32
    • 85009074657
    • ALGONQUIN: Iterating Laplace’s method to remove multiple types of acoustic distortion for robust speech recognition
    • B.J. Frey, L. Deng, A. Acero, T. Kristjansson: ALGONQUIN: Iterating Laplace’s method to remove multiple types of acoustic distortion for robust speech recognition, Proc. Eurospeech (2001)
    • (2001) Proc. Eurospeech
    • Frey, B.J.1    Deng, L.2    Acero, A.3    Kristjansson, T.4
  • 33
    • 85009211607
    • A nonlinear observation model for removing noise from corrupted speech log mel-spectral energies
    • J. Droppo, A. Acero, L. Deng: A nonlinear observation model for removing noise from corrupted speech log mel-spectral energies, Proc. Int. Conf. Spoken Language Process. (2002)
    • (2002) Proc. Int. Conf. Spoken Language Process.
    • Droppo, J.1    Acero, A.2    Deng, L.3
  • 34
    • 0033708118
    • Model-based feature enhancement for noisy speech recognition
    • C. Couvreur, H. Van Hamme: Model-based feature enhancement for noisy speech recognition, Proc. IEEE ICASSP 3, 1719–1722 (2000)
    • (2000) Proc. IEEE ICASSP , vol.3 , pp. 1719-1722
    • Couvreur, C.1    van Hamme, H.2
  • 35
    • 4544236840
    • Noise robust speech recognition with a switching linear dynamic model
    • J. Droppo, A. Acero: Noise robust speech recognition with a switching linear dynamic model, Proc. IEEE ICASSP (2004)
    • (2004) Proc. IEEE ICASSP
    • Droppo, J.1    Acero, A.2
  • 36
    • 4544365937
    • On tracking noise with linear dynamical system models
    • B. Raj, R. Singh, R. Stern: On tracking noise with linear dynamical system models, Proc. IEEE ICASSP 1, 965–968 (2004)
    • (2004) Proc. IEEE ICASSP , vol.1 , pp. 965-968
    • Raj, B.1    Singh, R.2    Stern, R.3
  • 37
    • 0036296866
    • Jacobian joint adaptation to noise, channel and vocal tract length
    • H. Shimodaira, N. Sakai, M. Nakai, S. Sagayama: Jacobian joint adaptation to noise, channel and vocal tract length, Proc. IEEE ICASSP 1, 197–200 (2002)
    • (2002) Proc. IEEE ICASSP , vol.1 , pp. 197-200
    • Shimodaira, H.1    Sakai, N.2    Nakai, M.3    Sagayama, S.4
  • 38
    • 54349123450
    • A comparison of three non-linear observation models for noisy speech features
    • J. Droppo, L. Deng, A. Acero: A comparison of three non-linear observation models for noisy speech features, Proc. Eurospeech Conf. (2003)
    • (2003) Proc. Eurospeech Conf.
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 40
    • 0025681008
    • Hidden Markov model decomposition of speech and noise
    • A.P. Varga, R.K. Moore: Hidden Markov model decomposition of speech and noise, Proc. IEEE ICASSP (1990) pp. 845–848
    • (1990) Proc. IEEE ICASSP , pp. 845-848
    • Varga, A.P.1    Moore, R.K.2
  • 42
    • 84895879051
    • Modeling non-verbal sounds for speech recognition
    • W. Ward: Modeling non-verbal sounds for speech recognition, Proc. Speech and Natural Language Workshop (1989) pp. 311–318
    • (1989) Proc. Speech and Natural Language Workshop , pp. 311-318
    • Ward, W.1
  • 43
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • S.F. Boll: Suppression of acoustic noise in speech using spectral subtraction, IEEE T. Acoust. Speech 24(April), 113–120 (1979)
    • (1979) IEEE T. Acoust. Speech , vol.24 April , pp. 113-120
    • Boll, S.F.1
  • 44
    • 0021892216
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim, D. Malah: Speech enhancement using a minimum mean-square error log-spectral amplitude estimator, IEEE Trans. Acoust. Speech Signal Process., Vol. ASSP-33 (1985) pp. 443–445
    • (1985) IEEE Trans. Acoust. Speech Signal Process. , vol.ASSP-33 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2
  • 46
    • 0018320733
    • Enhancement of speech corrupted by acoustic noise
    • M. Berouti, R. Schwartz, J. Makhoul: Enhancement of speech corrupted by acoustic noise, Proc. IEEE ICASSP (1979) pp. 208–211
    • (1979) Proc. IEEE ICASSP , pp. 208-211
    • Berouti, M.1    Schwartz, R.2    Makhoul, J.3
  • 47
    • 38849170676
    • distributed speech recognition; advanced front-end feature extraction algorithm
    • ETSI ES 202 050 Recommendation: Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced front-end feature extraction algorithm (2002)
    • (2002) Speech Processing, Transmission and Quality Aspects (STQ)
  • 49
    • 4544245839
    • Two-stage mel-warped Wiener filter for robust speech recognition
    • A. Agarwal, Y.M. Cheng: Two-stage mel-warped Wiener filter for robust speech recognition, Proc. ASRU (1999)
    • (1999) Proc. ASRU
    • Agarwal, A.1    Cheng, Y.M.2
  • 51
    • 0034848706
    • SNR-Dependent waveform processing for improving the robustness of ASR front-end
    • D. Macho, Y.M. Cheng: SNR-Dependent waveform processing for improving the robustness of ASR front-end, Proc. IEEE ICASSP (2001) pp. 305–308
    • (2001) Proc. IEEE ICASSP , pp. 305-308
    • Macho, D.1    Cheng, Y.M.2
  • 52
    • 4544222091
    • Blind equalization in the cepstral domain for robust telephone based speech recognition
    • L. Mauuary: Blind equalization in the cepstral domain for robust telephone based speech recognition, Proc. EUSIPCO 1, 359–363 (1998)
    • (1998) Proc. EUSIPCO , vol.1 , pp. 359-363
    • Mauuary, L.1
  • 53
    • 0742324997
    • Sequential estimation with optimal forgetting for robust speech recognition
    • M. Afify, O. Siohan: Sequential estimation with optimal forgetting for robust speech recognition, IEEE Trans. Speech Audio Process. 12(1), 19–26 (2004)
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.1 , pp. 19-26
    • Afify, M.1    Siohan, O.2
  • 54
    • 0036291376
    • Uncertainty decoding with SPLICE for noise robust speech recognition
    • J. Droppo, A. Acero, L. Deng: Uncertainty decoding with SPLICE for noise robust speech recognition, Proc. IEEE ICASSP (2002)
    • (2002) Proc. IEEE ICASSP
    • Droppo, J.1    Acero, A.2    Deng, L.3
  • 55
    • 0035342414
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke, P. Green, L. Josifovski, A. Vizinho: Robust automatic speech recognition with missing and unreliable acoustic data, Speech Commun. 34(3), 267–285 (2001)
    • (2001) Speech Commun , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 56
    • 85009106519
    • Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
    • J.P. Barker, M. Cooke, P. Green: Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise, Proc. Eurospeech 2001, 213–216 (2001)
    • (2001) Proc. Eurospeech 2001 , pp. 213-216
    • Barker, J.P.1    Cooke, M.2    Green, P.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.