SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 18, Issue 7, 2010, Pages 1692-1707

Model-based feature enhancement for reverberant speech recognition

(2) Krueger, Alexander a Haeb Umbach, Reinhold a

a UNIVERSITY OF PADERBORN (Germany)

Author keywords

Automatic speech recognition (ASR); feature enhancement; reverberant speech recognition

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; BAYESIAN INFERENCE; ENHANCEMENT TECHNIQUES; FEATURE ENHANCEMENT; FEATURE VECTORS; INTERMEDIATE STAGE; LINEAR DYNAMICAL MODELS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MICROPHONE SIGNALS; MINIMUM MEAN SQUARE ERROR ESTIMATE; MODEL-BASED; OBSERVATION MODEL; POWER SPECTRAL; PRIORI MODEL; REAL-TIME APPLICATION; REVERBERANT ENVIRONMENT; REVERBERANT SPEECH RECOGNITION; REVERBERATION TIME; ROOM IMPULSE RESPONSE; SIMPLIFIED MODELS; TWO PARAMETER;

ARCHITECTURAL ACOUSTICS; BAYESIAN NETWORKS; COMPUTATIONAL COMPLEXITY; IMPULSE RESPONSE; INFERENCE ENGINES; MAXIMUM LIKELIHOOD; REVERBERATION; SPECTROSCOPY; STOCHASTIC MODELS;

SPEECH RECOGNITION;

EID: 77955673019 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2049684 Document Type: Article

Times cited : (55)

References (46)

1
- 51449084820
- Ph.D. dissertation, Technische Univ. Eindhoven, Eindhoven, The Netherlands, Jun. 25
- E. Habets, "Single- and multi-microphone speech dereverberation using spectral enhancement," Ph.D. dissertation, Technische Univ. Eindhoven, Eindhoven, The Netherlands, Jun. 25, 2007.
- (2007) Single- And Multi-microphone Speech Dereverberation Using Spectral Enhancement
- Habets, E.¹

2
- 0029185029
- Evam: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals
- Jan.
- M. Gürelli and C. Nikias, "Evam: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals," IEEE Trans. Signal Process., vol.43, no.1, pp. 134-149, Jan. 1995.
- (1995) IEEE Trans. Signal Process. , vol.43 , Issue.1 , pp. 134-149
- Gürelli, M.¹ Nikias, C.²

3
- 33750041818
- On the use of lime dereverberation algorithm in an acoustic environment with a noise source
- May
- M. Delcroix, T. Hikichi, and M. Miyoshi, "On the use of lime dereverberation algorithm in an acoustic environment with a noise source," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06), May 2006, vol.1, pp. I-I.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'06) , vol.1
- Delcroix, M.¹ Hikichi, T.² Miyoshi, M.³

4
- 0242271432
- Subspace methods for multimicrophone speech dereverberation
- S. Gannot and M. Moonen, "Subspace methods for multimicrophone speech dereverberation," EURASIP J. Appl. Signal Process., vol.11, pp. 1074-1090, 2003.
- (2003) EURASIP J. Appl. Signal Process. , vol.11 , pp. 1074-1090
- Gannot, S.¹ Moonen, M.²

5
- 33745761716
- A two-stage algorithm for one-microphone reverberant speech enhancement
- May
- M. Wu and D. Wang, "A two-stage algorithm for one-microphone reverberant speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.3, pp. 774-784, May 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 774-784
- Wu, M.¹ Wang, D.²

6
- 14344274593
- A new method based on spectral subtraction for speech dereverberation
- K. Lebart, J. Boucher, and P. Denbigh, "A new method based on spectral subtraction for speech dereverberation," Acta Acust. United with Acust., vol.87, no.8, pp. 359-366, 2001.
- (2001) Acta Acust. United with Acust. , vol.87 , Issue.8 , pp. 359-366
- Lebart, K.¹ Boucher, J.² Denbigh, P.³

7
- 67749120610
- Low delay noise reduction and dereverberation for hearing aids
- H. W. Löllmann and P. Vary, "Low delay noise reduction and dereverberation for hearing aids," EURASIP J. Adv. Signal Process., 2009.
- (2009) EURASIP J. Adv. Signal Process.
- Löllmann, H.W.¹ Vary, P.²

8
- 65249167097
- Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction
- May
- K. Kinoshita, M. Delcroix, T. Nakatani, and M. Miyoshi, "Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.4, pp. 534-545, May 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 534-545
- Kinoshita, K.¹ Delcroix, M.² Nakatani, T.³ Miyoshi, M.⁴

9
- 70350435249
- Integrated speech enhancement method using noise suppression and dereverberation
- Feb.
- T. Yoshioka, T. Nakatani, and M. Miyoshi, "Integrated speech enhancement method using noise suppression and dereverberation," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 231-246, Feb. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 231-246
- Yoshioka, T.¹ Nakatani, T.² Miyoshi, M.³

10
- 17544388524
- Speech enhancement using excitation source information
- vol.1
- B. Yegnanarayana, S. Mahadeva Prasanna, and K. Sreenivasa Rao, "Speech enhancement using excitation source information," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'02), 2002, vol.1, pp. I-541-I-544, vol.1.
- (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'02) , vol.1
- Yegnanarayana, B.¹ Mahadeva Prasanna, S.² Sreenivasa Rao, K.³

11
- 0001379957
- Enhancement of reverberant speech using LP residual signal
- May
- B. Yegnanarayana and P. Murthy, "Enhancement of reverberant speech using LP residual signal," IEEE Trans. Speech Audio Process., vol.8, no.3, pp. 267-281, May 2000.
- (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 267-281
- Yegnanarayana, B.¹ Murthy, P.²

12
- 4344573437
- A speech dereverberation method based on the MTF concept in power envelope restoration
- M. Unoki, K. Sakata, M. Furukawa, and M. Akagi, "A speech dereverberation method based on the MTF concept in power envelope restoration," Acoust. Sci. Technol., vol.25, no.4, pp. 243-254, 2004.
- (2004) Acoust. Sci. Technol. , vol.25 , Issue.4 , pp. 243-254
- Unoki, M.¹ Sakata, K.² Furukawa, M.³ Akagi, M.⁴

13
- 84866496026
- Speech enhancement by nonlinear multiband envelope filtering
- May
- T. Langhans and H. Strube, "Speech enhancement by nonlinear multiband envelope filtering," in Proc. IEEE Int. Conf. . Acoust., Speech, Signal Process., May 1982, vol.7, pp. 156-159.
- (1982) Proc. IEEE Int. Conf. . Acoust., Speech, Signal Process. , vol.7 , pp. 156-159
- Langhans, T.¹ Strube, H.²

14
- 0030247605
- Cepstrum-based deconvolution for speech dereverberation
- Sep.
- S. Subramaniam, A. Petropulu, and C. Wendt, "Cepstrum-based deconvolution for speech dereverberation," IEEE Trans. Speech Audio Process., vol.4, no.5, pp. 392-396, Sep. 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 392-396
- Subramaniam, S.¹ Petropulu, A.² Wendt, C.³

15
- 33746622710
- Iterative cepstrum-based approach for speech dereverberation
- vol.1
- R. Kennedy and B. Radlovic, "Iterative cepstrum-based approach for speech dereverberation," in Proc. 5th Int. Symp. Signal Process. and Its Applicat. ISSPA'99, 1999, vol.1, pp. 55-58, vol.1.
- (1999) Proc. 5th Int. Symp. Signal Process. and Its Applicat. ISSPA'99 , vol.1 , pp. 55-58
- Kennedy, R.¹ Radlovic, B.²

16
- 1542677825
- Blind model selection for automatic speech recognition in reverberant environments
- L. Couvreur and C. Couvreur, "Blind model selection for automatic speech recognition in reverberant environments," J. VLSI Signal Process. Syst., vol. 36, no. 2/3, pp. 189-203, 2004.
- (2004) J. VLSI Signal Process. Syst. , vol.36 , Issue.2-3 , pp. 189-203
- Couvreur, L.¹ Couvreur, C.²

17
- 38649115063
- A new approach for the adaptation of HMMs to reverberation and background noise
- H.-G. Hirsch and H. Finster, "A new approach for the adaptation of HMMs to reverberation and background noise," Speech Commun., vol.50, no.3, pp. 244-263, 2008.
- (2008) Speech Commun. , vol.50 , Issue.3 , pp. 244-263
- Hirsch, H.-G.¹ Finster, H.²

18
- 70350450398
- Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
- Feb.
- M. Delcroix, T. Nakatani, and S. Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 324-334, Feb. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 324-334
- Delcroix, M.¹ Nakatani, T.² Watanabe, S.³

19
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, no.2, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

20
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech Lang., vol.10, no.4, pp. 249-264, 1996.
- (1996) Comput. Speech Lang. , vol.10 , Issue.4 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

21
- 0032050110
- Maximum likelihood linear transformations for HMMbased speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMMbased speech recognition," Comput. Speech Lang., vol.12, no.2, pp. 75-98, 1998.
- (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

22
- 33846229072
- Model adaptation by state splitting of HMM for long reverberation
- Sep.
- C. K. Raut, T. Nishimoto, and S. Sagayama, "Model adaptation by state splitting of HMM for long reverberation," in Proc. Interspeech, Sep. 2005.
- (2005) Proc. Interspeech
- Raut, C.K.¹ Nishimoto, T.² Sagayama, S.³

23
- 34547517494
- A new concept for feature-domain dereverberation for robust distant-talking asr
- A. Sehr and W. Kellerman, "A new concept for feature-domain dereverberation for robust distant-talking asr," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. ICASSP'07, 2007, vol.4, pp. IV-369-IV-372.
- (2007) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. ICASSP'07 , vol.4
- Sehr, A.¹ Kellerman, W.²

24
- 0028517164
- RASTA processing of speech
- Oct.
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol.2, no.4, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

25
- 0030682292
- Recognizing reverberant speech with rasta-plp
- Apr. 21-24
- B. E. D. Kingsbury and N. Morgan, "Recognizing reverberant speech with rasta-plp," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. ICASSP'97, Apr. 21-24, 1997, vol.2, pp. 1259-1262.
- (1997) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. ICASSP'97 , vol.2 , pp. 1259-1262
- Kingsbury, B.E.D.¹ Morgan, N.²

26
- 84881675408
- Cepstral channel normalization techniques for HMM-based speaker verification
- A. E. Rosenberg, C.-H. Lee, and F. K. Soong, "Cepstral channel normalization techniques for HMM-based speaker verification," in ICSLP'94, 1994, pp. 1835-1838.
- (1994) ICSLP'94 , pp. 1835-1838
- Rosenberg, A.E.¹ Lee, C.-H.² Soong, F.K.³

27
- 34247217970
- On multiplicative transfer function approximation in the short-time fourier transform domain
- DOI 10.1109/LSP.2006.888292
- Y. Avargel and I. Cohen, "On multiplicative transfer function approximation in the short-time Fourier transform domain," IEEE Signal Process. Lett., vol.14, no.5, pp. 337-340, May 2007. (Pubitemid 46614474)
- (2007) IEEE Signal Processing Letters , vol.14 , Issue.5 , pp. 337-340
- Avargel, Y.¹ Cohen, I.²

28
- 70450180986
- Model based feature enhancement for automatic speech recognition in reverberant environments
- A. Krueger and R. Haeb-Umbach, "Model based feature enhancement for automatic speech recognition in reverberant environments," in Proc. Interspeech'09, 2009, pp. 1231-1234.
- (2009) Proc. Interspeech'09 , pp. 1231-1234
- Krueger, A.¹ Haeb-Umbach, R.²

29
- 70350439261
- Enhanced speech features by single-channel joint compensation of noise and reverberation
- Feb.
- M. Wölfel, "Enhanced speech features by single-channel joint compensation of noise and reverberation," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 312-323, Feb. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 312-323
- Wölfel, M.¹

30
- 0009589650
- ETSI Standard Document, ETSI ES 201 108 v1.1.2, (2000-04)
- ETSI Standard Document, 2000. Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 201 108 v1.1.2, (2000-04).
- (2000) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms

31
- 70349707037
- Towards robust distant-talking automatic speech recognition in reverberant environments
- Berlin, Heidelberg, Germany: Springer
- A. Sehr and W. Kellermann, "Towards robust distant-talking automatic speech recognition in reverberant environments," in Speech and Audio Processing in Adverse Environments. Berlin, Heidelberg, Germany: Springer, 2008.
- (2008) Speech and Audio Processing in Adverse Environments
- Sehr, A.¹ Kellermann, W.²

32
- 0027629367
- Discrete Gabor transform
- Jul.
- S. Qian and D. Chen, "Discrete Gabor transform," IEEE Trans. Signal Process., vol.41, no.7, pp. 2429-2438, Jul. 1993.
- (1993) IEEE Trans. Signal Process. , vol.41 , Issue.7 , pp. 2429-2438
- Qian, S.¹ Chen, D.²

33
- 0028386321
- Linear systems in Gabor time-frequency space
- Mar.
- S. Farkash and S. Raz, "Linear systems in Gabor time-frequency space," IEEE Trans. Signal Process., vol.42, no.3, pp. 611-617, Mar. 1994.
- (1994) IEEE Trans. Signal Process. , vol.42 , Issue.3 , pp. 611-617
- Farkash, S.¹ Raz, S.²

34
- 50449087796
- System identification in the short-time Fourier transform domain with crossband filtering
- May
- Y. Avargel and I. Cohen, "System identification in the short-time Fourier transform domain with crossband filtering," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.4, pp. 1305-1319, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1305-1319
- Avargel, Y.¹ Cohen, I.²

35
- 0003870155
- 4th ed. London, U.K.: Spon
- H. Kuttruff, Room Acoustics, 4th ed. London, U.K.: Spon, 2000.
- (2000) Room Acoustics
- Kuttruff, H.¹

36
- 0242609086
- Blind estimation of reverberation time
- Nov.
- R. Ratnam, D. L. Jones, B. C. Wheeler,W. D. O'Brien, C. R. Lansing, and A. S. Feng, "Blind estimation of reverberation time," J. Acoust. Soc. Amer., vol.114, no.5, pp. 2877-2892, Nov. 2003.
- (2003) J. Acoust. Soc. Amer. , vol.114 , Issue.5 , pp. 2877-2892
- Ratnam, R.¹ Jones, D.L.² Wheeler, B.C.³ O'Brien, W.D.⁴ Lansing, C.R.⁵ Feng, A.S.⁶

37
- 0242460462
- U.C. Berkeley, Berkeley, CA, Tech. Rep.
- K. Murphy, "Switching Kalman Filters," U.C. Berkeley, Berkeley, CA, 1998, Tech. Rep..
- (1998) Switching Kalman Filters
- Murphy, K.¹

38
- 4544236840
- Noise robust speech recognition with a switching linear dynamic model
- vol.1
- J. Droppo and A. Acero, "Noise robust speech recognition with a switching linear dynamic model," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04), 2004, vol.1, pp. I-953-I-956, vol.1.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'04) , vol.1
- Droppo, J.¹ Acero, A.²

39
- 0003826365
- New York: Wiley
- Y. Bar-Shalom, X. R. Li, and T. Kirubarajan, Estimation with Applications to Tracking and Navigation: Theory, Algorithms, and Software. New York: Wiley, 2001.
- (2001) Estimation with Applications to Tracking and Navigation: Theory, Algorithms, and Software
- Bar-Shalom, Y.¹ Li, X.R.² Kirubarajan, T.³

40
- 66149116001
- A novel uncertainty decoding rule with applications to transmission error robust speech recognition
- V. Ion and R. Haeb-Umbach, "A novel uncertainty decoding rule with applications to transmission error robust speech recognition," IEEE Trans. Audio Speech Lang. Process., vol.16, no.5, pp. 1047-1060, 2008.
- (2008) IEEE Trans. Audio Speech Lang. Process. , vol.16 , Issue.5 , pp. 1047-1060
- Ion, V.¹ Haeb-Umbach, R.²

41
- 33750291256
- Uncertainty decoding for distributed speech recognition over error-prone networks
- V. Ion and R. Haeb-Umbach, "Uncertainty decoding for distributed speech recognition over error-prone networks," Speech Commun., vol.48, no.11, pp. 1435-1446, 2006.
- (2006) Speech Commun. , vol.48 , Issue.11 , pp. 1435-1446
- Ion, V.¹ Haeb-Umbach, R.²

42
- 70450167188
- Niederrhein Univ. of Applied Sciences, Tech. Rep.
- H. Hirsch, "Aurora-5 experimental framework for the performance evaluation of speech recognition in case of a hands-free speech input in noisy environments," Niederrhein Univ. of Applied Sciences, 2007, Tech. Rep..
- (2007) Aurora-5 Experimental Framework for the Performance Evaluation of Speech Recognition in Case of A Hands-free Speech Input in Noisy Environments
- Hirsch, H.¹

43
- 0003822743
- Version 3.4.. Cambridge, U.K.: Cambridge Univ. Eng. Dept.
- S. J. Young, G. Evermann, M. J. F. Gales, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK Book, Version 3.4.. Cambridge, U.K.: Cambridge Univ. Eng. Dept., 2006.
- (2006) The HTK Book
- Young, S.J.¹ Evermann, G.² Gales, M.J.F.³ Hain, T.⁴ Kershaw, D.⁵ Moore, G.⁶ Odell, J.⁷ Ollason, D.⁸ Povey, D.⁹ Valtchev, V.¹⁰ Woodland, P.C.¹¹

44
- 33745206705
- The simulation of realistic acoustic input scenarios for speech recognition systems
- H. G. Hirsch and H. Finster, "The simulation of realistic acoustic input scenarios for speech recognition systems," in Proc. Interspeech'05, 2005, pp. 2697-2700.
- (2005) Proc. Interspeech'05 , pp. 2697-2700
- Hirsch, H.G.¹ Finster, H.²

45
- 0018455820
- Image method for efficiently simulating small-room acoustics
- J. B. Allen, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol.65, no.4, pp. 943-950, 1979.
- (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
- Allen, J.B.¹

46
- 77955688063
- Automatic speech recognition in adverse acoustic conditions
- R. Martin, U. Heute, and C. Antweiler, Eds. New York:Wiley
- H.-G. Hirsch, "Automatic speech recognition in adverse acoustic conditions," in Advances in Digital Speech Transmission, R. Martin, U. Heute, and C. Antweiler, Eds. New York:Wiley , 2007, pp. 461-496.
- (2007) Advances in Digital Speech Transmission , pp. 461-496
- Hirsch, H.-G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.