메뉴 건너뛰기




Volumn 17, Issue 2, 2009, Pages 324-334

Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing

Author keywords

Dereverberation; Model adaptation; Robust automatic speech recognition (ASR); Variance compensation

Indexed keywords

ACOUSTIC MODEL; ADAPTATION SCHEME; ADAPTIVE TRAINING; AUTOMATIC SPEECH RECOGNITION; CONVENTIONAL MODELS; DEREVERBERATION; ERROR RATE; EXPECTATION-MAXIMIZATION ALGORITHMS; MODEL ADAPTATION; MODEL PARAMETERS; NOISE ROBUSTNESS; PARAMETRIC MODELS; PREPROCESSORS; RELATIVE ERROR RATES; REVERBERATION EFFECTS; REVERBERATION TIME; ROBUST AUTOMATIC SPEECH RECOGNITION (ASR); SPEECH FEATURES; SPEECH RECOGNIZER; STATIC AND DYNAMIC; VARIANCE COMPENSATION; WORD ERROR RATE;

EID: 70350450398     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2010214     Document Type: Article
Times cited : (50)

References (37)
  • 2
    • 85032752225 scopus 로고    scopus 로고
    • Missing-feature approaches in speech recognition
    • Sep.
    • B. Raj and R. M. Stern, "Missing-feature approaches in speech recognition, " IEEE Signal Process. Mag., vol. 22, no. 5, pp. 101-116, Sep. 2005.
    • (2005) IEEE Signal Process. Mag. , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.M.2
  • 3
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the mllr framework
    • M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework, " Comput. Speech Lang., vol. 10, pp. 249-264, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 4
    • 0032685060 scopus 로고    scopus 로고
    • Robust speech recognition based on a bayesian prediction approach
    • Jul.
    • H. Jiang, K. Hirose, and Q. Huo, "Robust speech recognition based on a Bayesian prediction approach, " IEEE Trans. Speech Audio Process., vol. 7, no. 4, pp. 426-440, Jul. 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.4 , pp. 426-440
    • Jiang, H.1    Hirose, K.2    Huo, Q.3
  • 6
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speech recognition using parallel model combination
    • Sep.
    • M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination, " IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, Sep. 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 8
    • 1542677825 scopus 로고    scopus 로고
    • Blind model selection for automatic speech recognition in reverberant environments
    • L. Couvreur and C. Couvreur, "Blind model selection for automatic speech recognition in reverberant environments, " J. VLSI Signal Process. Syst., vol. 36, no. 2-3, pp. 189-203, 2004.
    • (2004) J. VLSI Signal Process. Syst. , vol.36 , Issue.2-3 , pp. 189-203
    • Couvreur, L.1    Couvreur, C.2
  • 14
    • 34548571735 scopus 로고    scopus 로고
    • Harmonicity-based blind dereverberation for single-channel speech signals
    • Jan
    • T. Nakatani, K. Kinoshita, and M. Miyoshi, "Harmonicity-based blind dereverberation for single-channel speech signals, " IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 80-95, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 80-95
    • Nakatani, T.1    Kinoshita, K.2    Miyoshi, M.3
  • 15
    • 29744447074 scopus 로고    scopus 로고
    • Speech dereverberation algorithm using transfer function estimates with overestimated order
    • T. Hikichi, M. Delcroix, and M. Miyoshi, "Speech dereverberation algorithm using transfer function estimates with overestimated order, " Acoust. Sci. Technol., vol. 27, no. 1, pp. 28-35, 2006.
    • (2006) Acoust. Sci. Technol. , vol.27 , Issue.1 , pp. 28-35
    • Hikichi, T.1    Delcroix, M.2    Miyoshi, M.3
  • 16
  • 17
  • 18
    • 33745761716 scopus 로고    scopus 로고
    • A two-stage algorithm for one-microphone reverberant speech enhancement
    • May
    • M. Wu and D. Wang, "A two-stage algorithm for one-microphone reverberant speech enhancement, " IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 3, pp. 774-784, May 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.3 , pp. 774-784
    • Wu, M.1    Wang, D.2
  • 19
    • 18744401086 scopus 로고    scopus 로고
    • Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
    • May
    • L. Deng, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion, " IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 412-421, May 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.3 , pp. 412-421
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 24
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and uncertain acoustic data
    • M. P. Cooke, P. D. Green, L. B. Josifovski, and A.Vizinho, "Robust automatic speech recognition with missing and uncertain acoustic data, " Speech Commun., vol. 34, pp. 267-285, 2001.
    • (2001) Speech Commun. , vol.34 , pp. 267-285
    • Cooke, M.P.1    Green, P.D.2    Josifovski, L.B.3    Vizinho, A.4
  • 25
    • 51449102822 scopus 로고    scopus 로고
    • Combined static and dynamic variance adaptation for efficient interconnection of a speech enhancement pre-processor with speech recognizer
    • M. Delcroix, T. Nakatani, and S. Watanabe, "Combined static and dynamic variance adaptation for efficient interconnection of a speech enhancement pre-processor with speech recognizer, " in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08), 2008, pp. 4073-4076.
    • (2008) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP'08) , pp. 4073-4076
    • Delcroix, M.1    Nakatani, T.2    Watanabe, S.3
  • 26
    • 0030149866 scopus 로고    scopus 로고
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • May
    • A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition, " IEEE Trans. Speech Audio Process., vol. 4, no. 3, pp. 190-202, May 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.3 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 27
    • 0003870155 scopus 로고
    • 3rd ed. London, U.K.: Elsevier Science
    • H.Kuttruff, Room Acoustics, 3rd ed. London, U.K.: Elsevier Science, 1991.
    • (1991) Room Acoustics
    • Kuttruff, H.1
  • 29
    • 0018455310 scopus 로고
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction, " IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.F.1
  • 30
    • 0028420014 scopus 로고
    • Integrated models of signal and background with application to speaker identification in noise
    • May
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise, " IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 245-257, May 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.3 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 31
    • 0000251971 scopus 로고
    • Maximum likelihood estimation via the ECM algorithm: A general framework
    • X.-L. Meng and D. B. Rubin, "Maximum likelihood estimation via the ECM algorithm: A general framework, " Biometrika, vol. 80, pp. 267-278, 1993.
    • (1993) Biometrika , vol.80 , pp. 267-278
    • Meng, X.-L.1    Rubin, D.B.2
  • 37
    • 0000914334 scopus 로고    scopus 로고
    • Convolutive blind separation of non-stationary sources
    • May
    • L. Parra and C. Spence, "Convolutive blind separation of non-stationary sources, " IEEE Trans. Speech Audio Process., vol. 8, no. 3, pp. 320-327, May 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 320-327
    • Parra, L.1    Spence, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.