SCOPUS Information Search Platform - View Article

Speech Communication

Volume 50, Issue 3, 2008, Pages 244-263

A new approach for the adaptation of HMMs to reverberation and background noise

Authors (2): Hirsch, Hans Günter (a); Finster, Harald (a)

(a) Niederrhein University of Applied Sciences (Germany)

Author keywords

Hands free speech input; HMM adaptation; Reverberation; Robust speech recognition

Indexed keywords

PARAMETER ESTIMATION; SIGNAL PROCESSING; SPEECH PROCESSING; SPEECH RECOGNITION; HANDS-FREE SPEECH INPUT; ROBUST SPEECH RECOGNITION; STATIONARY BACKGROUND NOISES; ACOUSTIC NOISE

EID: 38649115063 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2007.09.004 Document Type: Article

Times cited: 73

References (42)

1
- 38649083377
- Aurora project, 2006. .

2
- 84994251727
- Au Yeung, S.-K., Siu, M.-H., 2004. Improved performance of Aurora-4 using HTK and unsupervised MLLR adaptation. In: Proc. ICSLP.

3
- 0030362988
- Avendano, C., Hermansky, H., 1996. Study on the dereverberation of speech based on temporal filtering. In: Proc. ICSLP, pp. 889-892.

4
- 38649083073
- Bitzer, J., Simmer, K.U., Kammeyer, K.D., 1999. Multi microphone noise reduction techniques for hands-free speech recognition - a comparative study. In: Proc. Internat. Workshop on Robust Methods for Speech Recognition in Adverse Conditions, Tampere, Finland, pp. 171-175.

5
- 0032672940
- Campos-Neto, S., 1999. The ITU-T software library. Internat. J. Speech Technol., pp. 259-272.

6
- 38649133273
- Couvreur, L., Dupont, S., Ris, C., Boite, J.M., Couvreur, C., 2001. Fast adaptation for robust speech recognition in reverberant environments. In: Proc. Internat. Workshop on Adaptation Methods for Speech Recognition, Sophia Antipolis, France.

7
- 38649122014
- ETSI Standard Document, 2003. Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Advanced Front-end feature extraction algorithm; Compression algorithm. ETSI document ES 202 050 v1.1.3 (2003-11).

8
- 38649140438
- Finster, H., 2005. Web interface to experience the simulation of acoustic scenarios. .

9
- 38649137956
- Garudadri, H., Hermansky, H., Morgan, N., et al., 2002. Qualcomm-ICSI-OGI features for ASR. In: Proc. ICSLP, pp. 21-24.

10
- 38649107003
- Gales, M.J.F., 1995. Model based techniques for noise robust speech recognition. Dissertation at the University of Cambridge, Great Britain.

11
- 38649104773
- Gales, M.J.F., 1997. Nice model-based compensation schemes for robust speech recognition. In: Proc. ESCA Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-a-Mousson, France, pp. 55-64.

12
- 0030245128
- Gales, M.J.F., Young, S.J., 1996. Robust continuous speech recognition using parallel model combination. IEEE Trans. Speech Audio Proc. 4, 352-359.

13
- 0028419019
- Gauvain, J.L., Lee, C.H., 1994. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. Speech Audio Proc. 2, 291-298.

14
- 85009252959
- Gelbart, D., Morgan, N., 2002. Double the trouble: handling noise and reverberation in far-field automatic speech recognition. In: Proc. ICSLP, pp. 2185-2188.

15
- 38649108094
- Hirsch, H.G., 1999. HMM adaptation for telephone applications. In: Proc. European Conf. on Speech Communication and Technology, Vol. 1, pp. 9-12.

16
- 0034825470
- Hirsch, H.G., 2001. HMM adaptation for applications in telecommunication. Speech Comm. 34, 127-139.

17
- 0028996871
- Hirsch, H.G., Ehrlicher, C., 1995. Noise estimation techniques for robust speech recognition. In: Proc. ICASSP, pp. 153-156.

18
- 33745206705
- Hirsch, H.G., Finster, H., 2005. The simulation of realistic acoustic input scenarios for speech recognition systems. In: Proc. Interspeech Conf., pp. 2697-2700.

19
- 38649096880
- Hirsch, H.G., Pearce, D., 2000. The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA Workshop ASR2000, Paris, France.

20
- 84872689395
- Hirsch, H.G., Hellwig, K., Dobler, S., 2001b. Speech recognition at multiple sampling rates. In: Proc. European Conf. on Speech Communication and Technology, pp. 1837-1840.

21
- 0019060580
- Houtgast, T., Steeneken, H.J.M., Plomp, R., 1980. Predicting speech intelligibility in rooms from the modulation transfer function, I. General room acoustics. Acustica 46, 60-72.

22
- 38649092254
- Janin, A. et al., 2003. The ICSI meeting corpus. In: Proc. ICASSP.

23
- 38649084656
- Kingsbury, B., 1998. Perceptually inspired signal processing strategies for robust speech recognition in reverberant environments. Dissertation at UC Berkeley, USA.

24
- 33745195661
- Kinoshita, K., Nakatani, T., Miyoshi, M., 2005. Efficient blind dereverberation framework for automatic speech recognition. In: Proc. Interspeech Conf., Lisbon, Portugal, pp. 3145-3148.

25
- 0003870155
- Kuttruff, H., 2000. Room Acoustics. Spon Press.

26
- 38649107330
- LDC, 1993. Speech Data Base CSR-I (WSJ0). Wall Street Journal. http://www.ldc.upenn.edu

27
- 0029288633
- Leggetter, C.J., Woodland, P.C., 1995. Maximum likelihood linear regression for speaker adaptation of continuous density Hidden Markov Models. Comput. Speech Lang. 9, 171-185.

28
- 0021226391
- Leonard, R.G., 1984. A database for speaker-independent digit recognition. In: Proc. ICASSP, Vol. 3, p. 42.11.

29
- 0034852185
- Liu, J., Malvar, H.S., 2001. Blind deconvolution of reverberated signals. In: Proc. ICASSP, Vol. 5, pp. 3037-3040.

30
- 85009242725
- Macho, D., Mauuary, L., Pearce, D., et al., 2002. Evaluation of a noise robust DSR front-end on Aurora databases. In: Proc. ICSLP, pp. 17-20.

31
- 0029745435
- Minami, Y., Furui, S., 1996. Adaptation method based on HMM composition and EM algorithm. In: Proc. ICASSP, pp. 327-330.

32
- 0032142014
- Omologo, M., Svaizer, P., Matassoni, M., 1998. Environmental conditions and acoustic transduction in hands-free speech recognition. Speech Comm. 25, 75-95.

33
- 0036298106
- Palomäki, K.J., Brown, G.J., Barker, J., 2002. Missing data speech recognition in reverberant conditions. In: Proc. ICASSP, pp. 65-68.

34
- 38649102436
- Picone, J., Parihar, N., Hirsch, H.G., Pearce, D., 2004. Performance analysis of the Aurora large vocabulary experiment. In: Proc. European Signal Processing Conference, Vienna, Austria.

35
- 33745260725
- Raut, C.K., Nishimoto, T., Sagayama, S., 2005. Model adaptation by state splitting of HMM for long reverberation. In: Proc. Interspeech Conf., Lisbon, Portugal, pp. 277-280.

36
- 0030149866
- Sankar, A., Lee, C.H., 1996. A maximum-likelihood approach to stochastic matching for robust speech recognition. IEEE Trans. Speech Audio Proc., pp. 190-201.

37
- 4344607755
- Seltzer, M.L., Raj, B., Stern, R.M., 2004. Likelihood-maximizing beamforming for robust hands-free speech recognition. IEEE Trans. Speech Audio Proc. 12 (5), 489-498.

38
- 38649101822
- Tashev, I., Allred, D., 2005. Reverberation reduction for improved speech recognition. In: Proc. Workshop on Hands-free Speech Communication, Rutgers, USA.

39
- 38649125527
- Woodland, P.C., 2001. Speaker adaptation for continuous density HMMs: a review. In: Proc. Internat. Workshop on Adaptation Methods for Speech Recognition, Sophia Antipolis, France.

40
- 33646809023
- Wu, M., Wang, D., 2005. A two-stage algorithm for enhancement of reverberant speech. In: Proc. ICASSP, Vol. I, pp. 1085-1088.

41
- 0001379957
- Yegnanarayana, B., Murthy, P.S., 2000. Enhancement of reverberant speech using LP residual signals. IEEE Trans. Speech Audio Proc. 8, 267-281.

42
- 38649133773
- Young, S., et al., 2005. The HTK Book (version 3.3). Cambridge University Engineering Department. http://htk.eng.cam.ac.uk

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.