메뉴 건너뛰기




Volumn , Issue , 2008, Pages 461-496

Automatic Speech Recognition in Adverse Acoustic Conditions

Author keywords

Automatic speech recognition; Background noise and reverberation; Feature extraction or front end processing; Frequency weighting function; Hands free speech input and reverberant room environment; Hands free speech input influence; Room impulse response (RIR); Speech encoding and decoding; Statistically independent acoustic features; Zeroth cepstral coefficient

Indexed keywords


EID: 77955688063     PISSN: None     EISSN: None     Source Type: Book    
DOI: 10.1002/9780470727188.ch16     Document Type: Chapter
Times cited : (4)

References (31)
  • 2
    • 0035342414 scopus 로고    scopus 로고
    • Robust Automatic Speech Recognition with Missing and Unreliable Acoustic Data
    • Cooke, M.; Green, P.; Josifowski, L.; Vizinho, A. (2001). Robust Automatic Speech Recognition with Missing and Unreliable Acoustic Data, Speech Communication, vol. 34, pp. 267-285.
    • (2001) Speech Communication , vol.34 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifowski, L.3    Vizinho, A.4
  • 3
    • 27744560527 scopus 로고    scopus 로고
    • GSM 06.90: Digital Cellular Telecommunications Systems (Phase2+); Adaptive Multi Rate (AMR) Speech Transcoding
    • ETSI, ETSI standard document EN 301 712 v7.2.1
    • ETSI (2000). GSM 06.90: Digital Cellular Telecommunications Systems (Phase2+); Adaptive Multi Rate (AMR) Speech Transcoding, ETSI standard document EN 301 712 v7.2.1.
    • (2000)
  • 4
    • 84889370997 scopus 로고    scopus 로고
    • ETSI (2003a). Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithm, ETSI standard document ES 202 050 v1.1.3
    • ETSI (2003a). Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithm, ETSI standard document ES 202 050 v1.1.3.
  • 5
    • 84889361716 scopus 로고    scopus 로고
    • ETSI (2003b). Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithm, ETSI standard document ES 201 108 v1.1.3
    • ETSI (2003b). Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithm, ETSI standard document ES 201 108 v1.1.3.
  • 6
    • 84889330900 scopus 로고    scopus 로고
    • Web Interface to Experience the Simulation of Acoustic Scenarios
    • Finster, H. (2005). Web Interface to Experience the Simulation of Acoustic Scenarios, http://dnt.kr.hsnr.de/sireac.html.
    • (2005)
    • Finster, H.1
  • 7
    • 0003671941 scopus 로고
    • Model Based Techniques for Noise Robust Speech Recognition
    • PhD thesis, University of Cambridge, Great Britain
    • Gales, M. J. F. (1995). Model Based Techniques for Noise Robust Speech Recognition, PhD thesis, University of Cambridge, Great Britain.
    • (1995)
    • Gales, M.J.F.1
  • 8
  • 9
    • 0028419019 scopus 로고
    • Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains
    • Gauvain, J. L.; Lee, C. H. (1994). Maximum a Posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains, IEEE Transactions on Speech and Audio Processing, vol. 2, pp. 291-298.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 10
    • 0034825470 scopus 로고    scopus 로고
    • HMM Adaptation for Applications in Telecommunication
    • Hirsch, H. G. (2001). HMM Adaptation for Applications in Telecommunication, Speech Communication, vol. 34, pp. 127-139.
    • (2001) Speech Communication , vol.34 , pp. 127-139
    • Hirsch, H.G.1
  • 11
    • 85009253189 scopus 로고    scopus 로고
    • The Influence of Speech Coding on Recognition Performance in Telecommunication Networks
    • Proceedings of the International Conference on Spoken Language Processing (ICSLP)
    • Hirsch, H. G. (2002). The Influence of Speech Coding on Recognition Performance in Telecommunication Networks, Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp. 1877-1880.
    • (2002) , pp. 1877-1880
    • Hirsch, H.G.1
  • 12
    • 0028996871 scopus 로고
    • Noise Estimation Techniques for Robust Speech Recognition
    • Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Hirsch, H. G.; Ehrlicher, C. (1995). Noise Estimation Techniques for Robust Speech Recognition, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 153-156.
    • (1995) , pp. 153-156
    • Hirsch, H.G.1    Ehrlicher, C.2
  • 13
    • 4544279104 scopus 로고    scopus 로고
    • The Aurora Experimental Framework for the Performance Evaluation of Speech Recognition Systems under Noisy Conditions
    • Proceedings of the ISCA workshop ASR2000, Paris, France
    • Hirsch, H. G.; Pearce, D. (2000). The Aurora Experimental Framework for the Performance Evaluation of Speech Recognition Systems under Noisy Conditions, Proceedings of the ISCA workshop ASR2000, Paris, France.
    • (2000)
    • Hirsch, H.G.1    Pearce, D.2
  • 14
    • 84889459669 scopus 로고    scopus 로고
    • The Aurora Project
    • Hirsch, H. G.; Pearce, D. (2006). The Aurora Project, http://aurora.hsnr.de.
    • (2006)
    • Hirsch, H.G.1    Pearce, D.2
  • 15
    • 0019060580 scopus 로고
    • Predicting Speech Intelligibility in Rooms from the Modulation Transfer Function
    • Houtgast, T.; Steeneken, H. J. M.; Plomp, R. (1980). Predicting Speech Intelligibility in Rooms from the Modulation Transfer Function, I. General Room Acoustics, Acustica, vol. 46, pp. 60-72.
    • (1980) I. General Room Acoustics, Acustica , vol.46 , pp. 60-72
    • Houtgast, T.1    Steeneken, H.J.M.2    Plomp, R.3
  • 16
    • 0012468071 scopus 로고
    • Telephone Transmission Quality Objective Measuring Apparatus: Objective Measurement of Active Speech Level
    • ITU, ITU-T recommendation P.56
    • ITU (1993). Telephone Transmission Quality Objective Measuring Apparatus: Objective Measurement of Active Speech Level, ITU-T recommendation P.56.
    • (1993)
  • 17
    • 0003786003 scopus 로고    scopus 로고
    • Statistical Methods for Speech Recognition
    • MIT Press
    • Jelinek, F. (1998). Statistical Methods for Speech Recognition, MIT Press.
    • (1998)
    • Jelinek, F.1
  • 18
    • 0042482799 scopus 로고    scopus 로고
    • Robust Speech Recognition in Embedded Systems and PC Applications
    • Kluwer Academic Publisher
    • Junqua, J. (2000). Robust Speech Recognition in Embedded Systems and PC Applications, Kluwer Academic Publisher.
    • (2000)
    • Junqua, J.1
  • 19
    • 0003870155 scopus 로고    scopus 로고
    • Room Acoustics
    • 4th edn, Spon Press
    • Kuttruff, H. (2000). Room Acoustics, 4th edn, Spon Press.
    • (2000)
    • Kuttruff, H.1
  • 20
    • 0029288633 scopus 로고
    • Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models
    • Leggeter, C.; Woodland, P. (1995). Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models, Computer Speech and Language, vol. 9, pp. 171-185.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggeter, C.1    Woodland, P.2
  • 21
    • 0021226391 scopus 로고
    • A Database for Speaker-Independent Digit Recognition
    • Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Leonard, R. (1984). A Database for Speaker-Independent Digit Recognition, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 3.
    • (1984) , vol.3
    • Leonard, R.1
  • 22
    • 85009242725 scopus 로고    scopus 로고
    • Evaluation of a Noise Robust DSR Front-end on Aurora Databases
    • Proceedings of the International Conference on Spoken Language Processing (ICSLP)
    • Machoa, D.; Mauuary, L.; Pearce, D.; et. al. (2002). Evaluation of a Noise Robust DSR Front-end on Aurora Databases, Proceedings of the International Conference on Spoken Language Processing (ICSLP), pp. 17-20.
    • (2002) , pp. 17-20
    • Machoa, D.1    Mauuary, L.2    Pearce, D.3
  • 23
    • 85009071768 scopus 로고    scopus 로고
    • Blind Equalization in the Cepstral Domain for Robust Telephone Based Speech Recognition
    • Proceedings of the European Signal Processing Conference (EUSIPCO)
    • Mauuary, L. (1998). Blind Equalization in the Cepstral Domain for Robust Telephone Based Speech Recognition, Proceedings of the European Signal Processing Conference (EUSIPCO), pp. 359-362.
    • (1998) , pp. 359-362
    • Mauuary, L.1
  • 24
    • 0029745435 scopus 로고    scopus 로고
    • Adaptation Method Based on HMM Composition and EM Algorithm
    • Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
    • Minami, Y.; Furui, S. (1996). Adaptation Method Based on HMM Composition and EM Algorithm, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 327-330.
    • (1996) , pp. 327-330
    • Minami, Y.1    Furui, S.2
  • 25
    • 44849089531 scopus 로고    scopus 로고
    • Speech Recognition over Digital Channels; Robustness and Standards
    • John Wiley & Sons, Ltd
    • Peinado, A.; Segura, J. (2006). Speech Recognition over Digital Channels; Robustness and Standards, John Wiley & Sons, Ltd.
    • (2006)
    • Peinado, A.1    Segura, J.2
  • 26
    • 0004244302 scopus 로고
    • Fundamentals of Speech Recognition
    • Prentice Hall
    • Rabiner, L.; Juang, B. H. (1993). Fundamentals of Speech Recognition, Prentice Hall.
    • (1993)
    • Rabiner, L.1    Juang, B.H.2
  • 27
    • 0030149866 scopus 로고    scopus 로고
    • A Maximum Likelihood Approach to Stochastic Matching for Robust Speech Recognition
    • Sankar, A. J.; Lee, C. H. (1996). A Maximum Likelihood Approach to Stochastic Matching for Robust Speech Recognition, IEEE Transactions on Speech and Audio Processing, pp. 190-201.
    • (1996) IEEE Transactions on Speech and Audio Processing , pp. 190-201
    • Sankar, A.J.1    Lee, C.H.2
  • 28
    • 84889302357 scopus 로고    scopus 로고
    • Digital Speech Transmission. Enhancement, Coding and Error Concealment
    • John Wiley & Sons, Ltd
    • Vary, P.; Martin, R. (2006). Digital Speech Transmission. Enhancement, Coding and Error Concealment, John Wiley & Sons, Ltd.
    • (2006)
    • Vary, P.1    Martin, R.2
  • 29
    • 0346528936 scopus 로고    scopus 로고
    • Speaker Adaptation for Continuous Density HMMs: A Review
    • Proceedings of the Int. Workshop on Adaptation Methods for Speech Recognition, Sophia Antipolis, France
    • Woodland, P. C. (2001). Speaker Adaptation for Continuous Density HMMs: A Review, Proceedings of the Int. Workshop on Adaptation Methods for Speech Recognition, Sophia Antipolis, France.
    • (2001)
    • Woodland, P.C.1
  • 30
    • 38649133773 scopus 로고    scopus 로고
    • The HTK Book (Version 3.3)
    • Cambridge University Engineering Department
    • Young, S.; et. al. (2005). The HTK Book (Version 3.3), Cambridge University Engineering Department, http://htk.eng.cam.ac.uk.
    • (2005)
    • Young, S.1
  • 31
    • 0003553472 scopus 로고    scopus 로고
    • Psychoacoustics
    • 2nd edn, Springer Verlag, Berlin
    • Zwicker, E.; Fastl, H. (1999). Psychoacoustics, 2nd edn, Springer Verlag, Berlin.
    • (1999)
    • Zwicker, E.1    Fastl, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.