SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 17, Issue 2, 2009, Pages 312-323

Enhanced speech features by single-channel joint compensation of noise and reverberation

Author keywords

Automatic speech recognition (ASR); Joint removal of additive and reverberant distortions; Multistep linear prediction (MSLP); Particle filter; Speech feature enhancement

Indexed keywords

AUTOMATIC SPEECH RECOGNITION (ASR); JOINT REMOVAL OF ADDITIVE AND REVERBERANT DISTORTIONS; MULTISTEP LINEAR PREDICTION (MSLP); PARTICLE FILTER; SPEECH FEATURE ENHANCEMENT;

AIR FILTERS; DISTRIBUTED COMPUTER SYSTEMS; MICROPHONES; NONLINEAR FILTERING; REMELTING; REVERBERATION; SPEECH COMMUNICATION; TARGET TRACKING;

SPEECH RECOGNITION;

EID: 70350439261 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2008.2009161 Document Type: Article

Times cited : (42)

References (31)

1
- 0018455310
- Suppression of acoustic noise in speech using spectral subtraction
- Apr
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoustics, Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
- (1979) IEEE Trans. Acoustics, Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
- Boll, S.F.¹

2
- 0029725301
- A vector taylor series approach for environment-independent speech recognition
- P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition, " in Proc. ICASSP, 1996, pp. 733-736.
- (1996) Proc. ICASSP , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

3
- 0032050110
- Maximum likelihood linear transformations for hmmbased speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMMbased speech recognition, " Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
- (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
- Gales, M.J.F.¹

4
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, " Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

5
- 0032099596
- Imm-based estimation for slowly evolving environments
- Jun.
- N. S. Kim, "IMM-Based estimation for slowly evolving environments, " IEEE Signal Process. Lett., vol. 5, no. 6, pp. 146-149, Jun. 1998.
- (1998) IEEE Signal Process. Lett. , vol.5 , Issue.6 , pp. 146-149
- Kim, N.S.¹

6
- 84898993440
- Sequential noise compensation by sequential monte carlo methods
- Sep.
- K. Yao and S. Nakamura, "Sequential noise compensation by sequential monte carlo methods, " Adv. Neural Inf. Process. Syst., vol. 14, pp. 1213-1220, Sep. 2002.
- (2002) Adv. Neural Inf. Process. Syst. , vol.14 , pp. 1213-1220
- Yao, K.¹ Nakamura, S.²

7
- 0141591493
- Tracking noise via dynamical systems with a continuum of states
- R. Singh and B. Raj, "Tracking noise via dynamical systems with a continuum of states, " in Proc. ICASSP, 2003, pp. 396-399.
- (2003) Proc. ICASSP , pp. 396-399
- Singh, R.¹ Raj, B.²

8
- 33645785839
- Particle filter based non-stationary noise tracking for robust speech feature enhancement
- M. Fujimoto and S. Nakamura, "Particle filter based non-stationary noise tracking for robust speech feature enhancement, " in Proc. ICASSP, 2005, pp. 257-260.
- (2005) Proc. ICASSP , pp. 257-260
- Fujimoto, M.¹ Nakamura, S.²

9
- 44949224968
- Coupling particle filters with automatic speech recognition for speech feature enhancement
- Sep.
- F. Faubel and M. Wölfel, "Coupling particle filters with automatic speech recognition for speech feature enhancement, " in Proc. Interspeech, Sep. 2006, pp. 37-40.
- (2006) Proc. Interspeech , pp. 37-40
- Faubel, F.¹ Wölfel, M.²

10
- 50449083999
- New York: Wiley
- M. Wölfel and J. W. McDonough, Distant Speech Recognition. New York: Wiley, 2009.
- (2009) Distant Speech Recognition
- Wölfel, M.¹ Mcdonough, J.W.²

11
- 50449102864
- The harming part of room acoustics in automatic speech recognition
- R. Petrick, K. Lohde, M.Wolff, and R. Hoffmann, "The harming part of room acoustics in automatic speech recognition, " in Proc. Interspeech, 2007, pp. 1094-1097.
- (2007) Proc. Interspeech , pp. 1094-1097
- Petrick, R.¹ Lohde, K.² Wolff, M.³ Hoffmann, R.⁴

12
- 33745761716
- A two-stage algorithm for one-microphone reverberant speech enhancement
- May
- M. Wu and D. Wang, "A two-stage algorithm for one-microphone reverberant speech enhancement, " IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 774-784, May 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 774-784
- Wu, M.¹ Wang, D.²

13
- 70349707037
- Towards robust distant-talking automatic speech recognition in reverberant environments
- A. Sehr and W. Kellermann, E. Hänsler and G. Schmidt, Eds.,New York: Springer
- A. Sehr and W. Kellermann, E. Hänsler and G. Schmidt, Eds., "Towards robust distant-Talking automatic speech recognition in reverberant environments, " in Topics in Speech and Audio Processing in Adverse Environments. New York: Springer, 2008.
- (2008) Topics in Speech and Audio Processing in Adverse Environments

14
- 14344274593
- A new method based on spectral subtraction for speech dereverberation
- May/Jun
- K. Lebart, J. M. Boucher, and P. N. Denbigh, "A new method based on spectral subtraction for speech dereverberation, " Acta Acustica United With Acustica, vol. 87, no. 3, pp. 359-366, May/Jun. 2001.
- (2001) Acta Acustica United with Acustica , vol.87 , Issue.3 , pp. 359-366
- Lebart, K.¹ Boucher, J.M.² Denbigh, P.N.³

15
- 38649115063
- A new approach for the adaptation of HMMs to reverberation and background noise
- H. G. Hirsch and H. Finster, "A new approach for the adaptation of HMMs to reverberation and background noise, " Speech Commun., vol. 50, pp. 244-263, 2008.
- (2008) Speech Commun. , vol.50 , pp. 244-263
- Hirsch, H.G.¹ Finster, H.²

16
- 33947694356
- Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation
- K. Kinoshita, T. Nakatani, and M. Miyoshi, "Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation, " in Proc. ICASSP, 2006, pp. 817-820.
- (2006) Proc. ICASSP , pp. 817-820
- Kinoshita, K.¹ Nakatani, T.² Miyoshi, M.³

17
- 0003663467
- 4nd ed. Upper Saddle River, NJ: McGraw-Hill
- A. Papoulis and S. U. Pillar, Probability, Random Variables, and Stochastic Processes, 4nd ed. Upper Saddle River, NJ: McGraw-Hill, 2002.
- (2002) Probability, Random Variables, and Stochastic Processes
- Papoulis, A.¹ Pillar, S.U.²

18
- 3242789232
- Boston, MA: Artech House
- B. Ristic, S. Arulampalam, and N. Gordon, Beyond the Kalman Filter: Particle Filters for Tracking Applications. Boston, MA: Artech House, 2004.
- (2004) Beyond the Kalman Filter: Particle Filters for Tracking Applications
- Ristic, B.¹ Arulampalam, S.² Gordon, N.³

19
- 85032772258
- Minimum variance distortionless response spectral estimation, review and refinements
- Sep.
- M. Wölfel and J. W. McDonough, "Minimum variance distortionless response spectral estimation, review and refinements, " IEEE Signal Process. Mag., vol. 22, no. 5, pp. 117-126, Sep. 2005.
- (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 117-126
- Wölfel, M.¹ Mcdonough, J.W.²

20
- 0036299277
- A bayesian approach to speech feature enhancement using the dynamic cepstral prior
- L. Deng, J. Droppo, and A. Acero, "A Bayesian approach to speech feature enhancement using the dynamic cepstral prior, " in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Deng, L.¹ Droppo, J.² Acero, A.³

21
- 34547552191
- Overcoming the vector tailor series approximation in speech feature enhancement-a particle filter approach
- F. Faubel and M. Wölfel, "Overcoming the vector tailor series approximation in speech feature enhancement-A particle filter approach, " in Proc. ICASSP, 2007, pp. 829-832.
- (2007) Proc. ICASSP , pp. 829-832
- Faubel, F.¹ Wölfel, M.²

22
- 4544365937
- On tracking noise with linear dynamical system models
- B. Raj, R. Singh, and R. Stern, "On tracking noise with linear dynamical system models, " in Proc. ICASSP, 2004, pp. 965-968.
- (2004) Proc. ICASSP , pp. 965-968
- Raj, B.¹ Singh, R.² Stern, R.³

23
- 34547494146
- Diploma Thesis, Universität Karlsruhe (TH), Germany, Aug.
- F. Faubel, "Speech Feature Enhancement for Speech Recognition by Sequential Monte Carlo Methods, " Diploma Thesis, Universität Karlsruhe (TH), Germany, Aug. 2006.
- (2006) Speech Feature Enhancement for Speech Recognition by Sequential Monte Carlo Methods
- Faubel, F.¹

24
- 51449110424
- Integration of the predicted walk model estimate into the particle filter framework
- M. Wölfel, "Integration of the predicted walk model estimate into the particle filter framework, " in Proc. ICASSP, 2008, pp. 4725-4728.
- (2008) Proc. ICASSP , pp. 4725-4728
- Wölfel, M.¹

25
- 84902053085
- The effects of room acoustics on MFCC speech parameter
- Y. Pan and A.Waibel, "The effects of room acoustics on MFCC speech parameter, " in Proc. ICSLP, 2000, pp. 129-132.
- (2000) Proc. ICSLP , pp. 129-132
- Pan, Y.¹ Waibel, A.²

26
- 0003870155
- New York: Elsevier
- H. Kuttruff, Room Acoustics. New York: Elsevier, 2000.
- (2000) Room Acoustics
- Kuttruff, H.¹

27
- 0030696446
- Robust blind channel identification and equalization based on multi-step predictors
- D. Gespert and P. Duhamel, "Robust blind channel identification and equalization based on multi-step predictors, " in Proc. ICASSP, 1997, pp. 3621-3624.
- (1997) Proc. ICASSP , pp. 3621-3624
- Gespert, D.¹ Duhamel, P.²

28
- 33745195661
- Efficient dereverberation framework for automatic speech recognition
- K. Kinoshita, T. Nakatani, and M. Miyoshi, "Efficient dereverberation framework for automatic speech recognition, " in Proc. Interspeech, 2005, pp. 3145-3148.
- (2005) Proc. Interspeech , pp. 3145-3148
- Kinoshita, K.¹ Nakatani, T.² Miyoshi, M.³

29
- 77249114287
- The rich transcription 2006 spring meeting recognition evaluation
- J. G. Fiscus, J. Ajot, M. Michel, and J. S. Garofolo, S. Renals, S. Bengio, and J. G. Fiscus, Eds., LNCS, Springer
- J. G. Fiscus, J. Ajot, M. Michel, and J. S. Garofolo, S. Renals, S. Bengio, and J. G. Fiscus, Eds., "The rich transcription 2006 spring meeting recognition evaluation, " in Proc. Mach. Learn. Multimodal Interaction, 2006, vol. 4299, pp. 309-322, LNCS, Springer.
- (2006) Proc. Mach. Learn. Multimodal Interaction , vol.4299 , pp. 309-322

30
- 47749117638
- The ISL RT-07 speech-to-text system
- R. Stiefelhagen, R. Bowers, and J. G. Fiscus, Eds., LNCS, Springer
- M. Wölfel, S. Stüker, and F. Kraft, "The ISL RT-07 speech-to-text system, " in Proc. Multimodal Technologies for Perception of Humans, R. Stiefelhagen, R. Bowers, and J. G. Fiscus, Eds., 2007, vol. 4625, pp. 464-474, LNCS, Springer.
- (2007) Proc. Multimodal Technologies for Perception of Humans , vol.4625 , pp. 464-474
- Wölfel, M.¹ Stüker, S.² Kraft, F.³

31
- 0032638856
- Semi-tied covariance matrices for hidden markov models
- May
- M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models, " IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 272-281
- Gales, M.J.F.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.