메뉴 건너뛰기




Volumn 18, Issue 7, 2010, Pages 1692-1707

Model-based feature enhancement for reverberant speech recognition

Author keywords

Automatic speech recognition (ASR); feature enhancement; reverberant speech recognition

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; BAYESIAN INFERENCE; ENHANCEMENT TECHNIQUES; FEATURE ENHANCEMENT; FEATURE VECTORS; INTERMEDIATE STAGE; LINEAR DYNAMICAL MODELS; MEL-FREQUENCY CEPSTRAL COEFFICIENTS; MICROPHONE SIGNALS; MINIMUM MEAN SQUARE ERROR ESTIMATE; MODEL-BASED; OBSERVATION MODEL; POWER SPECTRAL; PRIORI MODEL; REAL-TIME APPLICATION; REVERBERANT ENVIRONMENT; REVERBERANT SPEECH RECOGNITION; REVERBERATION TIME; ROOM IMPULSE RESPONSE; SIMPLIFIED MODELS; TWO PARAMETER;

EID: 77955673019     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2049684     Document Type: Article
Times cited : (55)

References (46)
  • 2
    • 0029185029 scopus 로고
    • Evam: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals
    • Jan.
    • M. Gürelli and C. Nikias, "Evam: An eigenvector-based algorithm for multichannel blind deconvolution of input colored signals," IEEE Trans. Signal Process., vol.43, no.1, pp. 134-149, Jan. 1995.
    • (1995) IEEE Trans. Signal Process. , vol.43 , Issue.1 , pp. 134-149
    • Gürelli, M.1    Nikias, C.2
  • 4
    • 0242271432 scopus 로고    scopus 로고
    • Subspace methods for multimicrophone speech dereverberation
    • S. Gannot and M. Moonen, "Subspace methods for multimicrophone speech dereverberation," EURASIP J. Appl. Signal Process., vol.11, pp. 1074-1090, 2003.
    • (2003) EURASIP J. Appl. Signal Process. , vol.11 , pp. 1074-1090
    • Gannot, S.1    Moonen, M.2
  • 5
    • 33745761716 scopus 로고    scopus 로고
    • A two-stage algorithm for one-microphone reverberant speech enhancement
    • May
    • M. Wu and D. Wang, "A two-stage algorithm for one-microphone reverberant speech enhancement," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.3, pp. 774-784, May 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 774-784
    • Wu, M.1    Wang, D.2
  • 6
    • 14344274593 scopus 로고    scopus 로고
    • A new method based on spectral subtraction for speech dereverberation
    • K. Lebart, J. Boucher, and P. Denbigh, "A new method based on spectral subtraction for speech dereverberation," Acta Acust. United with Acust., vol.87, no.8, pp. 359-366, 2001.
    • (2001) Acta Acust. United with Acust. , vol.87 , Issue.8 , pp. 359-366
    • Lebart, K.1    Boucher, J.2    Denbigh, P.3
  • 8
    • 65249167097 scopus 로고    scopus 로고
    • Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction
    • May
    • K. Kinoshita, M. Delcroix, T. Nakatani, and M. Miyoshi, "Suppression of late reverberation effect on speech signal using long-term multiplestep linear prediction," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.4, pp. 534-545, May 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 534-545
    • Kinoshita, K.1    Delcroix, M.2    Nakatani, T.3    Miyoshi, M.4
  • 9
    • 70350435249 scopus 로고    scopus 로고
    • Integrated speech enhancement method using noise suppression and dereverberation
    • Feb.
    • T. Yoshioka, T. Nakatani, and M. Miyoshi, "Integrated speech enhancement method using noise suppression and dereverberation," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 231-246, Feb. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 231-246
    • Yoshioka, T.1    Nakatani, T.2    Miyoshi, M.3
  • 11
    • 0001379957 scopus 로고    scopus 로고
    • Enhancement of reverberant speech using LP residual signal
    • May
    • B. Yegnanarayana and P. Murthy, "Enhancement of reverberant speech using LP residual signal," IEEE Trans. Speech Audio Process., vol.8, no.3, pp. 267-281, May 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 267-281
    • Yegnanarayana, B.1    Murthy, P.2
  • 12
    • 4344573437 scopus 로고    scopus 로고
    • A speech dereverberation method based on the MTF concept in power envelope restoration
    • M. Unoki, K. Sakata, M. Furukawa, and M. Akagi, "A speech dereverberation method based on the MTF concept in power envelope restoration," Acoust. Sci. Technol., vol.25, no.4, pp. 243-254, 2004.
    • (2004) Acoust. Sci. Technol. , vol.25 , Issue.4 , pp. 243-254
    • Unoki, M.1    Sakata, K.2    Furukawa, M.3    Akagi, M.4
  • 14
    • 0030247605 scopus 로고    scopus 로고
    • Cepstrum-based deconvolution for speech dereverberation
    • Sep.
    • S. Subramaniam, A. Petropulu, and C. Wendt, "Cepstrum-based deconvolution for speech dereverberation," IEEE Trans. Speech Audio Process., vol.4, no.5, pp. 392-396, Sep. 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 392-396
    • Subramaniam, S.1    Petropulu, A.2    Wendt, C.3
  • 16
    • 1542677825 scopus 로고    scopus 로고
    • Blind model selection for automatic speech recognition in reverberant environments
    • L. Couvreur and C. Couvreur, "Blind model selection for automatic speech recognition in reverberant environments," J. VLSI Signal Process. Syst., vol. 36, no. 2/3, pp. 189-203, 2004.
    • (2004) J. VLSI Signal Process. Syst. , vol.36 , Issue.2-3 , pp. 189-203
    • Couvreur, L.1    Couvreur, C.2
  • 17
    • 38649115063 scopus 로고    scopus 로고
    • A new approach for the adaptation of HMMs to reverberation and background noise
    • H.-G. Hirsch and H. Finster, "A new approach for the adaptation of HMMs to reverberation and background noise," Speech Commun., vol.50, no.3, pp. 244-263, 2008.
    • (2008) Speech Commun. , vol.50 , Issue.3 , pp. 244-263
    • Hirsch, H.-G.1    Finster, H.2
  • 18
    • 70350450398 scopus 로고    scopus 로고
    • Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
    • Feb.
    • M. Delcroix, T. Nakatani, and S. Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 324-334, Feb. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 324-334
    • Delcroix, M.1    Nakatani, T.2    Watanabe, S.3
  • 19
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol.9, no.2, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 20
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech Lang., vol.10, no.4, pp. 249-264, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , Issue.4 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 21
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMMbased speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMMbased speech recognition," Comput. Speech Lang., vol.12, no.2, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 22
    • 33846229072 scopus 로고    scopus 로고
    • Model adaptation by state splitting of HMM for long reverberation
    • Sep.
    • C. K. Raut, T. Nishimoto, and S. Sagayama, "Model adaptation by state splitting of HMM for long reverberation," in Proc. Interspeech, Sep. 2005.
    • (2005) Proc. Interspeech
    • Raut, C.K.1    Nishimoto, T.2    Sagayama, S.3
  • 26
    • 84881675408 scopus 로고
    • Cepstral channel normalization techniques for HMM-based speaker verification
    • A. E. Rosenberg, C.-H. Lee, and F. K. Soong, "Cepstral channel normalization techniques for HMM-based speaker verification," in ICSLP'94, 1994, pp. 1835-1838.
    • (1994) ICSLP'94 , pp. 1835-1838
    • Rosenberg, A.E.1    Lee, C.-H.2    Soong, F.K.3
  • 27
    • 34247217970 scopus 로고    scopus 로고
    • On multiplicative transfer function approximation in the short-time fourier transform domain
    • DOI 10.1109/LSP.2006.888292
    • Y. Avargel and I. Cohen, "On multiplicative transfer function approximation in the short-time Fourier transform domain," IEEE Signal Process. Lett., vol.14, no.5, pp. 337-340, May 2007. (Pubitemid 46614474)
    • (2007) IEEE Signal Processing Letters , vol.14 , Issue.5 , pp. 337-340
    • Avargel, Y.1    Cohen, I.2
  • 28
    • 70450180986 scopus 로고    scopus 로고
    • Model based feature enhancement for automatic speech recognition in reverberant environments
    • A. Krueger and R. Haeb-Umbach, "Model based feature enhancement for automatic speech recognition in reverberant environments," in Proc. Interspeech'09, 2009, pp. 1231-1234.
    • (2009) Proc. Interspeech'09 , pp. 1231-1234
    • Krueger, A.1    Haeb-Umbach, R.2
  • 29
    • 70350439261 scopus 로고    scopus 로고
    • Enhanced speech features by single-channel joint compensation of noise and reverberation
    • Feb.
    • M. Wölfel, "Enhanced speech features by single-channel joint compensation of noise and reverberation," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.2, pp. 312-323, Feb. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.2 , pp. 312-323
    • Wölfel, M.1
  • 31
    • 70349707037 scopus 로고    scopus 로고
    • Towards robust distant-talking automatic speech recognition in reverberant environments
    • Berlin, Heidelberg, Germany: Springer
    • A. Sehr and W. Kellermann, "Towards robust distant-talking automatic speech recognition in reverberant environments," in Speech and Audio Processing in Adverse Environments. Berlin, Heidelberg, Germany: Springer, 2008.
    • (2008) Speech and Audio Processing in Adverse Environments
    • Sehr, A.1    Kellermann, W.2
  • 32
    • 0027629367 scopus 로고
    • Discrete Gabor transform
    • Jul.
    • S. Qian and D. Chen, "Discrete Gabor transform," IEEE Trans. Signal Process., vol.41, no.7, pp. 2429-2438, Jul. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.7 , pp. 2429-2438
    • Qian, S.1    Chen, D.2
  • 33
    • 0028386321 scopus 로고
    • Linear systems in Gabor time-frequency space
    • Mar.
    • S. Farkash and S. Raz, "Linear systems in Gabor time-frequency space," IEEE Trans. Signal Process., vol.42, no.3, pp. 611-617, Mar. 1994.
    • (1994) IEEE Trans. Signal Process. , vol.42 , Issue.3 , pp. 611-617
    • Farkash, S.1    Raz, S.2
  • 34
    • 50449087796 scopus 로고    scopus 로고
    • System identification in the short-time Fourier transform domain with crossband filtering
    • May
    • Y. Avargel and I. Cohen, "System identification in the short-time Fourier transform domain with crossband filtering," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.4, pp. 1305-1319, May 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1305-1319
    • Avargel, Y.1    Cohen, I.2
  • 35
  • 37
    • 0242460462 scopus 로고    scopus 로고
    • U.C. Berkeley, Berkeley, CA, Tech. Rep.
    • K. Murphy, "Switching Kalman Filters," U.C. Berkeley, Berkeley, CA, 1998, Tech. Rep..
    • (1998) Switching Kalman Filters
    • Murphy, K.1
  • 40
    • 66149116001 scopus 로고    scopus 로고
    • A novel uncertainty decoding rule with applications to transmission error robust speech recognition
    • V. Ion and R. Haeb-Umbach, "A novel uncertainty decoding rule with applications to transmission error robust speech recognition," IEEE Trans. Audio Speech Lang. Process., vol.16, no.5, pp. 1047-1060, 2008.
    • (2008) IEEE Trans. Audio Speech Lang. Process. , vol.16 , Issue.5 , pp. 1047-1060
    • Ion, V.1    Haeb-Umbach, R.2
  • 41
    • 33750291256 scopus 로고    scopus 로고
    • Uncertainty decoding for distributed speech recognition over error-prone networks
    • V. Ion and R. Haeb-Umbach, "Uncertainty decoding for distributed speech recognition over error-prone networks," Speech Commun., vol.48, no.11, pp. 1435-1446, 2006.
    • (2006) Speech Commun. , vol.48 , Issue.11 , pp. 1435-1446
    • Ion, V.1    Haeb-Umbach, R.2
  • 44
    • 33745206705 scopus 로고    scopus 로고
    • The simulation of realistic acoustic input scenarios for speech recognition systems
    • H. G. Hirsch and H. Finster, "The simulation of realistic acoustic input scenarios for speech recognition systems," in Proc. Interspeech'05, 2005, pp. 2697-2700.
    • (2005) Proc. Interspeech'05 , pp. 2697-2700
    • Hirsch, H.G.1    Finster, H.2
  • 45
    • 0018455820 scopus 로고
    • Image method for efficiently simulating small-room acoustics
    • J. B. Allen, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Amer., vol.65, no.4, pp. 943-950, 1979.
    • (1979) J. Acoust. Soc. Amer. , vol.65 , Issue.4 , pp. 943-950
    • Allen, J.B.1
  • 46
    • 77955688063 scopus 로고    scopus 로고
    • Automatic speech recognition in adverse acoustic conditions
    • R. Martin, U. Heute, and C. Antweiler, Eds. New York:Wiley
    • H.-G. Hirsch, "Automatic speech recognition in adverse acoustic conditions," in Advances in Digital Speech Transmission, R. Martin, U. Heute, and C. Antweiler, Eds. New York:Wiley , 2007, pp. 461-496.
    • (2007) Advances in Digital Speech Transmission , pp. 461-496
    • Hirsch, H.-G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.