Volume 50, Issue 3, 2008, Pages 244-263

A new approach for the adaptation of HMMs to reverberation and background noise

Author keywords

Hands free speech input; HMM adaptation; Reverberation; Robust speech recognition

Indexed keywords

PARAMETER ESTIMATION; SIGNAL PROCESSING; SPEECH PROCESSING; SPEECH RECOGNITION

EID: 38649115063     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.09.004     Document Type: Article
Times cited: 73

References (42)
  • 1
    • 38649083377
    • Aurora project, 2006. .
  • 2
    • 84994251727
    • Au Yeung, S.-K., Siu, M.-H., 2004. Improved performance of Aurora-4 using HTK and unsupervised MLLR adaptation. In: Proc. ICSLP.
  • 3
    • 0030362988
    • Avendano, C., Hermansky, H., 1996. Study on the dereverberation of speech based on temporal filtering. In: Proc. ICSLP, pp. 889-892.
  • 4
    • 38649083073
    • Bitzer, J., Simmer, K.U., Kammeyer, K.D., 1999. Multi microphone noise reduction techniques for hands-free speech recognition - a comparative study. In: Proc. Internat. Workshop on Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, pp. 171-175.
  • 6
    • 38649133273
    • Couvreur, L., Dupont, S., Ris, C., Boite, J.M., Couvreur, C., 2001. Fast adaptation for robust speech recognition in reverberant environments. In: Proc. Internat. Workshop on Adaptation Methods for Speech Recognition, Sophia Antipolis, France.
  • 7
    • 38649122014
    • ETSI Standard Document, 2003. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Advanced Front-end feature extraction algorithm; Compression algorithm. ETSI document ES 202 050 v1.1.3 (2003-11).
  • 8
    • 38649140438
    • Finster, H., 2005. Web interface to experience the simulation of acoustic scenarios. .
  • 9
    • 38649137956
    • Garudadri, H., Hermansky, H., Morgan, N., et al., 2002. Qualcomm-ICSI-OGI features for ASR. In: Proc. ICSLP, pp. 21-24.
  • 10
    • 38649107003
    • Gales, M.J.F., 1995. Model based techniques for noise robust speech recognition. Dissertation at the University of Cambridge, Great Britain.
  • 11
    • 38649104773
    • Gales, M.J.F., 1997. Nice model-based compensation schemes for robust speech recognition. In: Proc. ESCA Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-a-Mousson, France, pp. 55-64.
  • 12
    • 0030245128
    • Gales, M.J.F., Young, S.J., 1996. Robust continuous speech recognition using parallel model combination. IEEE Trans. Speech Audio Proc. 4, 352-359.
  • 13
    • 0028419019
    • Gauvain, J.L., Lee, C.H., 1994. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech Audio Proc. 2, 291-298.
  • 14
    • 85009252959
    • Gelbart, D., Morgan, N., 2002. Double the trouble: handling noise and reverberation in far-field automatic speech recognition. In: Proc. ICSLP, pp. 2185-2188.
  • 15
    • 38649108094
    • Hirsch, H.G., 1999. HMM adaptation for telephone applications. In: Proc. European Conf. on Speech Communication and Technology, Vol. 1, pp. 9-12.
  • 16
    • 0034825470
    • Hirsch, H.G., 2001. HMM adaptation for applications in telecommunication. Speech Comm. 34, 127-139.
  • 17
    • 0028996871
    • Hirsch, H.G., Ehrlicher, C., 1995. Noise estimation techniques for robust speech recognition. In: Proc. ICASSP, pp. 153-156.
  • 18
    • 33745206705
    • Hirsch, H.G., Finster, H., 2005. The simulation of realistic acoustic input scenarios for speech recognition systems. In: Proc. Interspeech Conf., pp. 2697-2700.
  • 19
    • 38649096880
    • Hirsch, H.G., Pearce, D., 2000. The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA Workshop ASR2000, Paris, France.
  • 20
    • 84872689395
    • Hirsch, H.G., Hellwig, K., Dobler, S., 2001b. Speech recognition at multiple sampling rates. In: Proc. European Conf. on Speech Communication and Technology, pp. 1837-1840.
  • 21
    • 0019060580
    • Houtgast, T., Steeneken, H.J.M., Plomp, R., 1980. Predicting speech intelligibility in rooms from the modulation transfer function, I. General room acoustics. Acustica 46, 60-72.
  • 22
    • 38649092254
    • Janin, A. et al., 2003. The ICSI meeting corpus. In: Proc. ICASSP.
  • 23
    • 38649084656
    • Kingsbury, B., 1998. Perceptually inspired signal processing strategies for robust speech recognition in reverberant environments. Dissertation at UC Berkeley, USA.
  • 24
    • 33745195661
    • Kinoshita, K., Nakatani, T., Miyoshi, M., 2005. Efficient blind dereverberation framework for automatic speech recognition. In: Proc. Interspeech Conf., Lisbon, Portugal, pp. 3145-3148.
  • 26
    • 38649107330
    • LDC, 1993. Speech Data Base CSR-I (WSJ0), Wall Street Journal. http://www.ldc.upenn.edu
  • 27
    • 0029288633
    • Leggetter, C.J., Woodland, P.C., 1995. Maximum likelihood linear regression for speaker adaptation of continuous density Hidden Markov Models. Comput. Speech Lang. 9, 171-185.
  • 28
    • 0021226391
    • Leonard, R.G., 1984. A database for speaker-independent digit recognition. In: Proc. ICASSP, Vol. 3, p. 42.11.
  • 29
    • 0034852185
    • Liu, J., Malvar, H.S., 2001. Blind deconvolution of reverberated signals. In: Proc. ICASSP, Vol. 5, pp. 3037-3040.
  • 30
    • 85009242725
    • Macho, D., Mauuary, L., Pearce, D., et al., 2002. Evaluation of a noise robust DSR front-end on Aurora databases. In: Proc. ICSLP, pp. 17-20.
  • 31
    • 0029745435
    • Minami, Y., Furui, S., 1996. Adaptation method based on HMM composition and EM algorithm. In: Proc. ICASSP, pp. 327-330.
  • 32
    • 0032142014
    • Omologo, M., Svaizer, P., Matassoni, M., 1998. Environmental conditions and acoustic transduction in hands-free speech recognition. Speech Comm. 25, 75-95.
  • 33
    • 0036298106
    • Palomäki, K.J., Brown, G.J., Barker, J., 2002. Missing data speech recognition in reverberant conditions. In: Proc. ICASSP, pp. 65-68.
  • 34
    • 38649102436
    • Picone, J., Parihar, N., Hirsch, H.G., Pearce, D., 2004. Performance analysis of the Aurora large vocabulary experiment. In: Proc. European Signal Processing Conference, Vienna, Austria.
  • 35
    • 33745260725
    • Raut, C.K., Nishimoto, T., Sagayama, S., 2005. Model adaptation by state splitting of HMM for long reverberation. In: Proc. Interspeech Conf., Lisbon, Portugal, pp. 277-280.
  • 36
    • 0030149866
    • Sankar, A., Lee, C.H., 1996. A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Trans. Speech Audio Proc., 190-201.
  • 37
    • 4344607755
    • Seltzer, M.L., Raj, B., Stern, R.M., 2004. Likelihood-maximizing beamforming for robust hands-free speech recognition. IEEE Trans. Speech Audio Proc. 12 (5), 489-498.
  • 38
    • 38649101822
    • Tashev, I., Allred, D., 2005. Reverberation reduction for improved speech recognition. In: Proc. Workshop on Hands-free Speech Communication, Rutgers, USA.
  • 39
    • 38649125527
    • Woodland, P.C., 2001. Speaker adaptation for continuous density HMMs: a review. In: Proc. Internat. Workshop on Adaptation Methods for Speech Recognition, Sophia Antipolis, France.
  • 40
    • 33646809023
    • Wu, M., Wang, D., 2005. A two-stage algorithm for enhancement of reverberant speech. In: Proc. ICASSP, Vol. I, pp. 1085-1088.
  • 41
    • 0001379957
    • Yegnanarayana, B., Murthy, P.S., 2000. Enhancement of reverberant speech using LP residual signals. IEEE Trans. Speech Audio Proc. 8, 267-281.
  • 42
    • 38649133773
    • Young, S., et al., 2005. The HTK Book (version 3.3). Cambridge University Engineering Department. http://htk.eng.cam.ac.uk


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.