메뉴 건너뛰기




Volumn 17, Issue 2, 2009, Pages 312-323

Enhanced speech features by single-channel joint compensation of noise and reverberation

Author keywords

Automatic speech recognition (ASR); Joint removal of additive and reverberant distortions; Multistep linear prediction (MSLP); Particle filter; Speech feature enhancement

Indexed keywords

AUTOMATIC SPEECH RECOGNITION (ASR); JOINT REMOVAL OF ADDITIVE AND REVERBERANT DISTORTIONS; MULTISTEP LINEAR PREDICTION (MSLP); PARTICLE FILTER; SPEECH FEATURE ENHANCEMENT;

EID: 70350439261     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2009161     Document Type: Article
Times cited : (42)

References (31)
  • 1
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoustics, Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoustics, Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 2
    • 0029725301 scopus 로고    scopus 로고
    • A vector taylor series approach for environment-independent speech recognition
    • P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition, " in Proc. ICASSP, 1996, pp. 733-736.
    • (1996) Proc. ICASSP , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 3
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for hmmbased speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMMbased speech recognition, " Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 4
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models, " Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 5
    • 0032099596 scopus 로고    scopus 로고
    • Imm-based estimation for slowly evolving environments
    • Jun.
    • N. S. Kim, "IMM-Based estimation for slowly evolving environments, " IEEE Signal Process. Lett., vol. 5, no. 6, pp. 146-149, Jun. 1998.
    • (1998) IEEE Signal Process. Lett. , vol.5 , Issue.6 , pp. 146-149
    • Kim, N.S.1
  • 6
    • 84898993440 scopus 로고    scopus 로고
    • Sequential noise compensation by sequential monte carlo methods
    • Sep.
    • K. Yao and S. Nakamura, "Sequential noise compensation by sequential monte carlo methods, " Adv. Neural Inf. Process. Syst., vol. 14, pp. 1213-1220, Sep. 2002.
    • (2002) Adv. Neural Inf. Process. Syst. , vol.14 , pp. 1213-1220
    • Yao, K.1    Nakamura, S.2
  • 7
    • 0141591493 scopus 로고    scopus 로고
    • Tracking noise via dynamical systems with a continuum of states
    • R. Singh and B. Raj, "Tracking noise via dynamical systems with a continuum of states, " in Proc. ICASSP, 2003, pp. 396-399.
    • (2003) Proc. ICASSP , pp. 396-399
    • Singh, R.1    Raj, B.2
  • 8
    • 33645785839 scopus 로고    scopus 로고
    • Particle filter based non-stationary noise tracking for robust speech feature enhancement
    • M. Fujimoto and S. Nakamura, "Particle filter based non-stationary noise tracking for robust speech feature enhancement, " in Proc. ICASSP, 2005, pp. 257-260.
    • (2005) Proc. ICASSP , pp. 257-260
    • Fujimoto, M.1    Nakamura, S.2
  • 9
    • 44949224968 scopus 로고    scopus 로고
    • Coupling particle filters with automatic speech recognition for speech feature enhancement
    • Sep.
    • F. Faubel and M. Wölfel, "Coupling particle filters with automatic speech recognition for speech feature enhancement, " in Proc. Interspeech, Sep. 2006, pp. 37-40.
    • (2006) Proc. Interspeech , pp. 37-40
    • Faubel, F.1    Wölfel, M.2
  • 11
    • 50449102864 scopus 로고    scopus 로고
    • The harming part of room acoustics in automatic speech recognition
    • R. Petrick, K. Lohde, M.Wolff, and R. Hoffmann, "The harming part of room acoustics in automatic speech recognition, " in Proc. Interspeech, 2007, pp. 1094-1097.
    • (2007) Proc. Interspeech , pp. 1094-1097
    • Petrick, R.1    Lohde, K.2    Wolff, M.3    Hoffmann, R.4
  • 12
    • 33745761716 scopus 로고    scopus 로고
    • A two-stage algorithm for one-microphone reverberant speech enhancement
    • May
    • M. Wu and D. Wang, "A two-stage algorithm for one-microphone reverberant speech enhancement, " IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 774-784, May 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 774-784
    • Wu, M.1    Wang, D.2
  • 13
    • 70349707037 scopus 로고    scopus 로고
    • Towards robust distant-talking automatic speech recognition in reverberant environments
    • A. Sehr and W. Kellermann, E. Hänsler and G. Schmidt, Eds.,New York: Springer
    • A. Sehr and W. Kellermann, E. Hänsler and G. Schmidt, Eds., "Towards robust distant-Talking automatic speech recognition in reverberant environments, " in Topics in Speech and Audio Processing in Adverse Environments. New York: Springer, 2008.
    • (2008) Topics in Speech and Audio Processing in Adverse Environments
  • 14
    • 14344274593 scopus 로고    scopus 로고
    • A new method based on spectral subtraction for speech dereverberation
    • May/Jun
    • K. Lebart, J. M. Boucher, and P. N. Denbigh, "A new method based on spectral subtraction for speech dereverberation, " Acta Acustica United With Acustica, vol. 87, no. 3, pp. 359-366, May/Jun. 2001.
    • (2001) Acta Acustica United with Acustica , vol.87 , Issue.3 , pp. 359-366
    • Lebart, K.1    Boucher, J.M.2    Denbigh, P.N.3
  • 15
    • 38649115063 scopus 로고    scopus 로고
    • A new approach for the adaptation of HMMs to reverberation and background noise
    • H. G. Hirsch and H. Finster, "A new approach for the adaptation of HMMs to reverberation and background noise, " Speech Commun., vol. 50, pp. 244-263, 2008.
    • (2008) Speech Commun. , vol.50 , pp. 244-263
    • Hirsch, H.G.1    Finster, H.2
  • 16
    • 33947694356 scopus 로고    scopus 로고
    • Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation
    • K. Kinoshita, T. Nakatani, and M. Miyoshi, "Spectral subtraction steered by multi-step forward linear prediction for single channel speech dereverberation, " in Proc. ICASSP, 2006, pp. 817-820.
    • (2006) Proc. ICASSP , pp. 817-820
    • Kinoshita, K.1    Nakatani, T.2    Miyoshi, M.3
  • 19
    • 85032772258 scopus 로고    scopus 로고
    • Minimum variance distortionless response spectral estimation, review and refinements
    • Sep.
    • M. Wölfel and J. W. McDonough, "Minimum variance distortionless response spectral estimation, review and refinements, " IEEE Signal Process. Mag., vol. 22, no. 5, pp. 117-126, Sep. 2005.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 117-126
    • Wölfel, M.1    Mcdonough, J.W.2
  • 20
    • 0036299277 scopus 로고    scopus 로고
    • A bayesian approach to speech feature enhancement using the dynamic cepstral prior
    • L. Deng, J. Droppo, and A. Acero, "A Bayesian approach to speech feature enhancement using the dynamic cepstral prior, " in Proc. ICASSP, 2002.
    • (2002) Proc. ICASSP
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 21
    • 34547552191 scopus 로고    scopus 로고
    • Overcoming the vector tailor series approximation in speech feature enhancement-a particle filter approach
    • F. Faubel and M. Wölfel, "Overcoming the vector tailor series approximation in speech feature enhancement-A particle filter approach, " in Proc. ICASSP, 2007, pp. 829-832.
    • (2007) Proc. ICASSP , pp. 829-832
    • Faubel, F.1    Wölfel, M.2
  • 22
    • 4544365937 scopus 로고    scopus 로고
    • On tracking noise with linear dynamical system models
    • B. Raj, R. Singh, and R. Stern, "On tracking noise with linear dynamical system models, " in Proc. ICASSP, 2004, pp. 965-968.
    • (2004) Proc. ICASSP , pp. 965-968
    • Raj, B.1    Singh, R.2    Stern, R.3
  • 24
    • 51449110424 scopus 로고    scopus 로고
    • Integration of the predicted walk model estimate into the particle filter framework
    • M. Wölfel, "Integration of the predicted walk model estimate into the particle filter framework, " in Proc. ICASSP, 2008, pp. 4725-4728.
    • (2008) Proc. ICASSP , pp. 4725-4728
    • Wölfel, M.1
  • 25
    • 84902053085 scopus 로고    scopus 로고
    • The effects of room acoustics on MFCC speech parameter
    • Y. Pan and A.Waibel, "The effects of room acoustics on MFCC speech parameter, " in Proc. ICSLP, 2000, pp. 129-132.
    • (2000) Proc. ICSLP , pp. 129-132
    • Pan, Y.1    Waibel, A.2
  • 27
    • 0030696446 scopus 로고    scopus 로고
    • Robust blind channel identification and equalization based on multi-step predictors
    • D. Gespert and P. Duhamel, "Robust blind channel identification and equalization based on multi-step predictors, " in Proc. ICASSP, 1997, pp. 3621-3624.
    • (1997) Proc. ICASSP , pp. 3621-3624
    • Gespert, D.1    Duhamel, P.2
  • 28
    • 33745195661 scopus 로고    scopus 로고
    • Efficient dereverberation framework for automatic speech recognition
    • K. Kinoshita, T. Nakatani, and M. Miyoshi, "Efficient dereverberation framework for automatic speech recognition, " in Proc. Interspeech, 2005, pp. 3145-3148.
    • (2005) Proc. Interspeech , pp. 3145-3148
    • Kinoshita, K.1    Nakatani, T.2    Miyoshi, M.3
  • 29
    • 77249114287 scopus 로고    scopus 로고
    • The rich transcription 2006 spring meeting recognition evaluation
    • J. G. Fiscus, J. Ajot, M. Michel, and J. S. Garofolo, S. Renals, S. Bengio, and J. G. Fiscus, Eds., LNCS, Springer
    • J. G. Fiscus, J. Ajot, M. Michel, and J. S. Garofolo, S. Renals, S. Bengio, and J. G. Fiscus, Eds., "The rich transcription 2006 spring meeting recognition evaluation, " in Proc. Mach. Learn. Multimodal Interaction, 2006, vol. 4299, pp. 309-322, LNCS, Springer.
    • (2006) Proc. Mach. Learn. Multimodal Interaction , vol.4299 , pp. 309-322
  • 30
    • 47749117638 scopus 로고    scopus 로고
    • The ISL RT-07 speech-to-text system
    • R. Stiefelhagen, R. Bowers, and J. G. Fiscus, Eds., LNCS, Springer
    • M. Wölfel, S. Stüker, and F. Kraft, "The ISL RT-07 speech-to-text system, " in Proc. Multimodal Technologies for Perception of Humans, R. Stiefelhagen, R. Bowers, and J. G. Fiscus, Eds., 2007, vol. 4625, pp. 464-474, LNCS, Springer.
    • (2007) Proc. Multimodal Technologies for Perception of Humans , vol.4625 , pp. 464-474
    • Wölfel, M.1    Stüker, S.2    Kraft, F.3
  • 31
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden markov models
    • May
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models, " IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.J.F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.