메뉴 건너뛰기




Volumn 12, Issue 2, 2004, Pages 133-143

Enhancement of Log Mel Power Spectra of Speech Using a Phase-Sensitive Model of the Acoustic Environment and Sequential Estimation of the Corrupting Noise

Author keywords

Noise estimate; Noise robust ASR; Phase sensitive acoustic environment model; Sequential algorithm; Speech feature enhancement

Indexed keywords

ACOUSTIC DISTORTION; ACOUSTIC NOISE; ALGORITHMS; LINEAR SYSTEMS; MATHEMATICAL MODELS; PROBABILITY;

EID: 2142756950     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2003.820201     Document Type: Article
Times cited : (101)

References (25)
  • 2
    • 85009113852 scopus 로고    scopus 로고
    • HMM adaptation using vector Taylor series for noisy speech recognition
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, vol. 3, 2000, pp. 869-872.
    • (2000) Proc. ICSLP , vol.3 , pp. 869-872
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 3
    • 0034854958 scopus 로고    scopus 로고
    • Sequential noise estimation with optimal forgetting for robust speech recognition
    • M. Afify and O. Siohan, "Sequential noise estimation with optimal forgetting for robust speech recognition," in Proc. ICASSP, vol. 1, 2001, pp. 229-232.
    • (2001) Proc. ICASSP , vol.1 , pp. 229-232
    • Afify, M.1    Siohan, O.2
  • 4
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. B-39, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.B-39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 5
    • 85009070292 scopus 로고    scopus 로고
    • Large-vocabulary speech recognition under adverse acoustic environments
    • L. Deng, A. Acero, M. Plumpe, and X. D. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, vol. 3, 2000, pp. 806-809.
    • (2000) Proc. ICSLP , vol.3 , pp. 806-809
    • Deng, L.1    Acero, A.2    Plumpe, M.3    Huang, X.D.4
  • 6
    • 0034855352 scopus 로고    scopus 로고
    • High-performance robust speech recognition using stereo training data
    • L. Deng, A. Acero, L. Jiang, J. Drpppo, and X. D. Huang, "High-performance robust speech recognition using stereo training data," in Proc. ICASSP, vol. 1, 2001, pp. 301-304.
    • (2001) Proc. ICASSP , vol.1 , pp. 301-304
    • Deng, L.1    Acero, A.2    Jiang, L.3    Drpppo, J.4    Huang, X.D.5
  • 7
    • 0036299277 scopus 로고    scopus 로고
    • A Bayesian approach to speech feature enhancement using the dynamic cepstral prior
    • May
    • L. Deng, J. Droppo, and A. Acero, "A Bayesian approach to speech feature enhancement using the dynamic cepstral prior," in Proc. ICASSP, vol. 1, May 2002, pp. 829-832.
    • (2002) Proc. ICASSP , vol.1 , pp. 829-832
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 8
    • 2142718298 scopus 로고    scopus 로고
    • Recursive noise estimation using iterative stochastic approximation for stereo-based robust speech recognition
    • Trento, Italy, Dec.
    • _, "Recursive noise estimation using iterative stochastic approximation for stereo-based robust speech recognition," in Proc. Automatic Speech Recognition and Understanding, Trento, Italy, Dec. 2001.
    • (2001) Proc. Automatic Speech Recognition and Understanding
  • 9
    • 0347968277 scopus 로고    scopus 로고
    • Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
    • Nov.
    • _, "Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 11, pp. 568-580, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Processing , vol.11 , pp. 568-580
  • 10
    • 0033623527 scopus 로고    scopus 로고
    • Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics
    • Dec.
    • L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, no. 6, pp. 3036-3048, Dec. 2000.
    • (2000) J. Acoust. Soc. Amer. , vol.108 , Issue.6 , pp. 3036-3048
    • Deng, L.1    Ma, J.2
  • 11
    • 85006734596 scopus 로고    scopus 로고
    • Evaluation of the SPLICE algorithm on the Aurora2 database
    • Sept.
    • J. Droppo, L. Deng, and A. Acero, "Evaluation of the SPLICE algorithm on the Aurora2 database," in Proc. Eurospeech, vol. 1, Sept. 2001, pp. 217-220.
    • (2001) Proc. Eurospeech , vol.1 , pp. 217-220
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 12
    • 0021892216 scopus 로고
    • Speech enhancement using a minimum mean-square error log-spectral amplitude estimator
    • Y. Ephraim, "Speech enhancement using a minimum mean-square error log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. 33, pp. 443-445, 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.33 , pp. 443-445
    • Ephraim, Y.1
  • 13
    • 84948598244 scopus 로고
    • Statistical-model-based speech enhancement systems
    • Oct.
    • _, "Statistical-model-based speech enhancement systems," Proc. IEEE, vol. 80, pp. 1526-1555, Oct. 1992.
    • (1992) Proc. IEEE , vol.80 , pp. 1526-1555
  • 14
    • 85009074657 scopus 로고    scopus 로고
    • ALGONQUIN: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition
    • Sept.
    • B. Frey, L. Deng, A. Acero, and T. Kristjansson, "ALGONQUIN: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition," in Proc. Eurospeech, vol. 2, Sept. 2001, pp. 901-904.
    • (2001) Proc. Eurospeech , vol.2 , pp. 901-904
    • Frey, B.1    Deng, L.2    Acero, A.3    Kristjansson, T.4
  • 16
    • 0032027527 scopus 로고    scopus 로고
    • Nonstationary environment compensation based on sequential estimation
    • N. S. Kim, "Nonstationary environment compensation based on sequential estimation," IEEE Signal Processing Lett., vol. 5, pp. 57-60, 1998.
    • (1998) IEEE Signal Processing Lett. , vol.5 , pp. 57-60
    • Kim, N.S.1
  • 17
    • 0003491370 scopus 로고
    • Prentice-Hall, London, U.K.
    • Speech Enhancement, J. S. Lim, Ed., Prentice-Hall, London, U.K., 1983.
    • (1983) Speech Enhancement
    • Lim, J.S.1
  • 18
    • 0026882842 scopus 로고
    • Experiments with a nonlinear spectral subtraction (NSS), hidden Markov models and the projection for robust speech recognition in cars
    • P. Lockwood and J. Boudy, "Experiments with a nonlinear spectral subtraction (NSS), hidden Markov models and the projection for robust speech recognition in cars," Speech Commun., vol. 11, pp. 215-228, 1992.
    • (1992) Speech Commun. , vol.11 , pp. 215-228
    • Lockwood, P.1    Boudy, J.2
  • 20
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment-independent speech recognition
    • P. Moreno, B. Raj, and R. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. ICASSP, vol. 2, 1996, pp. 733-736.
    • (1996) Proc. ICASSP , vol.2 , pp. 733-736
    • Moreno, P.1    Raj, B.2    Stern, R.3
  • 22
    • 85009142179 scopus 로고    scopus 로고
    • Model-based compensation of the additive noise for continuous speech recognition: Experiments using the AURORA2 database and tasks
    • Aalborg, Denmark, Sept.
    • J. Segura, A. Torre, M. Benitez, and A. Peinado, "Model-based compensation of the additive noise for continuous speech recognition: Experiments using the AURORA2 database and tasks," in Proc. Eurospeech, Aalborg, Denmark, Sept. 2001.
    • (2001) Proc. Eurospeech
    • Segura, J.1    Torre, A.2    Benitez, M.3    Peinado, A.4
  • 23
    • 0001593436 scopus 로고
    • Recursive parameter estimation using incomplete data
    • D. M. Titterington, "Recursive parameter estimation using incomplete data," J. R. Statist. Soc. B, vol. 46, pp. 257-267, 1984.
    • (1984) J. R. Statist. Soc. B , vol.46 , pp. 257-267
    • Titterington, D.M.1
  • 24
    • 2142718297 scopus 로고    scopus 로고
    • O. Viikki, Ed.
    • Speech Commun., vol. 34, O. Viikki, Ed., 2001.
    • (2001) Speech Commun. , vol.34
  • 25
    • 0036755378 scopus 로고    scopus 로고
    • The effect of additive noise on speech amplitude spectra: A quantitative analysis
    • Sept.
    • Q. Zhu and A. Alwan, "The effect of additive noise on speech amplitude spectra: A quantitative analysis," IEEE Signal Processing Lett., vol. 9, pp. 275-277, Sept. 2002.
    • (2002) IEEE Signal Processing Lett. , vol.9 , pp. 275-277
    • Zhu, Q.1    Alwan, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.