Volume 8, Issue 3, 2000, Pages 255-266

Frequency-domain maximum likelihood estimation for automatic speech recognition in additive and convolutive noises

Author keywords

Distortion; Gaussian noise; Map estimation; Maximum likelihood; Spectral analysis; Speech recognition

Indexed keywords


EID: 0001459635     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/89.841208     Document Type: Article
Times cited : (28)

References (28)
  • 2
    • 3643095347
    • Speech recognition in noisy environments: A survey
    • June
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 12, pp. 231-239, June 1995.
    • (1995) Speech Commun. , vol.12 , pp. 231-239
    • Gong, Y.1
  • 3
    • 0029345417
    • A signal subspace approach for speech enhancement
    • July
    • Y. Ephraim and H. L. Van Trees, "A signal subspace approach for speech enhancement," IEEE Trans. Speech Audio Processing, vol. 3, pp. 251-266, July 1995.
    • (1995) IEEE Trans. Speech Audio Processing , vol.3 , pp. 251-266
    • Ephraim, Y.1    Van Trees, H.L.2
  • 4
    • 0031257127
    • An energy constrained signal subspace method for speech enhancement and recognition
    • J. Huang and Y. Zhao, "An energy constrained signal subspace method for speech enhancement and recognition," IEEE Signal Processing Lett., vol. 10, pp. 283-285, 1997.
    • (1997) IEEE Signal Processing Lett. , vol.10 , pp. 283-285
    • Huang, J.1    Zhao, Y.2
  • 5
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-27, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-27 , pp. 113-120
    • Boll, S.F.1
  • 7
    • 0025681008
    • Hidden Markov model decomposition of speech and noise
    • Apr.
    • A. Varga and R. K. Moore, "Hidden Markov model decomposition of speech and noise," in Proc. ICASSP, Albuquerque, NM, Apr. 1990.
    • (1990) Proc. ICASSP, Albuquerque, NM
    • Varga, A.1    Moore, R.K.2
  • 8
    • 0028420014
    • Integrated models of signal and background with application to speaker identification in noise
    • Apr.
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 245-258, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 245-258
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 9
    • 0030245128
    • Robust continuous speech recognition using parallel model combination
    • Sept.
    • M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Processing, vol. 4, pp. 352-359, Sept. 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 352-359
    • Gales, M.J.F.1    Young, S.J.2
  • 10
    • 0028460810
    • An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition
    • July
    • Y. Zhao, "An acoustic-phonetic-based speaker adaptation technique for improving speaker-independent continuous speech recognition," IEEE Trans. Speech Audio Processing, vol. 2, pp. 380-394, July 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 380-394
    • Zhao, Y.1
  • 11
    • 0029770844
    • Self-learning speaker/channel adaptation based on spectral variation source decomposition
    • Jan.
    • Y. Zhao, "Self-learning speaker/channel adaptation based on spectral variation source decomposition," Speech Commun., vol. 18, pp. 65-78, Jan. 1996.
    • (1996) Speech Commun. , vol.18 , pp. 65-78
  • 12
    • 0029769867
    • Signal bias removal by maximum likelihood estimation for robust telephone speech recognition
    • Jan.
    • M. G. Rahim and B.-J. Juang, "Signal bias removal by maximum likelihood estimation for robust telephone speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 19-30, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 19-30
    • Rahim, M.G.1    Juang, B.-J.2
  • 13
    • 0030149866
    • A maximum likelihood approach to stochastic matching for robust speech recognition
    • May
    • A. Sankar and C.-H. Lee, "A maximum likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, pp. 190-202, May 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 14
    • 0024940640
    • Unsupervised speaker adaptation by probabilistic spectral fitting
    • Glasgow, U.K., May
    • S. J. Cox and J. S. Bridle, "Unsupervised speaker adaptation by probabilistic spectral fitting," in Proc. ICASSP, Glasgow, U.K., May 1989, pp. 294-297.
    • (1989) Proc. ICASSP , pp. 294-297
    • Cox, S.J.1    Bridle, J.S.2
  • 15
    • 0024899244
    • Unsupervised speaker adaptation method based on hierarchical spectral clustering
    • Glasgow, U.K., May
    • S. Furui, "Unsupervised speaker adaptation method based on hierarchical spectral clustering," in Proc. ICASSP, Glasgow, U.K., May 1989, pp. 286-289.
    • (1989) Proc. ICASSP , pp. 286-289
    • Furui, S.1
  • 17
    • 0027166410
    • Recognition of speech in additive and convolutional noise based on RASTA spectral processing
    • Apr.
    • H. Hermansky, N. Morgan, and H. Hirsch, "Recognition of speech in additive and convolutional noise based on RASTA spectral processing," in Proc. ICASSP, Minneapolis, MN, Apr. 1993, pp. II.83-86.
    • (1993) Proc. ICASSP, Minneapolis, MN , pp. 83-86
    • Hermansky, H.1    Morgan, N.2    Hirsch, H.3
  • 18
    • 0030674098
    • A unified approach to acoustic mismatch compensation: Application to noisy Lombard speech recognition
    • Apr.
    • M. Afify, Y. Gong, and J.-P. Haton, "A unified approach to acoustic mismatch compensation: Application to noisy Lombard speech recognition," in Proc. ICASSP, Munich, Germany, Apr. 1997, pp. 839-842.
    • (1997) Proc. ICASSP, Munich, Germany , pp. 839-842
    • Afify, M.1    Gong, Y.2    Haton, J.-P.3
  • 19
    • 0029725301
    • A vector Taylor series approach for environment independent speech recognition
    • Atlanta, GA, May
    • P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment independent speech recognition," in Proc. ICASSP, Atlanta, GA, May 1996, pp. 733-736.
    • (1996) Proc. ICASSP , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 20
    • 0029390135
    • Robust speech recognition in additive and convolutional noise using parallel model combination
    • M. J. F. Gales, "Robust speech recognition in additive and convolutional noise using parallel model combination," Comput. Speech Lang., vol. 9, pp. 289-308, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 289-308
    • Gales, M.J.F.1
  • 21
    • 0029726509
    • Improving environmental robustness in large vocabulary speech recognition
    • P. C. Woodland, M. J. F. Gales, and D. Pye, "Improving environmental robustness in large vocabulary speech recognition," in Proc. ICASSP, Atlanta, GA, May 1996, pp. 65-68.
    • (1996) Proc. ICASSP, Atlanta, GA, May , pp. 65-68
    • Woodland, P.C.1    Gales, M.J.F.2    Pye, D.3
  • 22
    • 0032635304
    • An EM algorithm for linear distortion channel estimation based on observations from a mixture of Gaussian sources
    • July
    • Y. Zhao, "An EM algorithm for linear distortion channel estimation based on observations from a mixture of Gaussian sources," IEEE Trans. Speech Audio Processing, vol. 7, pp. 400-413, July 1999.
    • (1999) IEEE Trans. Speech Audio Processing , vol.7 , pp. 400-413
    • Zhao, Y.1
  • 24
  • 25
    • 0022667694
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 52-59, 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 52-59
    • Furui, S.1
  • 26
    • 0002629270
    • Maximum likelihood estimation from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood estimation from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 28
    • 0027625639
    • A speaker-independent continuous speech recognition system using continuous mixture Gaussian density HMM of phoneme-sized units
    • July
    • Y. Zhao, "A speaker-independent continuous speech recognition system using continuous mixture Gaussian density HMM of phoneme-sized units," IEEE Trans. Speech Audio Processing, vol. 1, pp. 345-361, July 1993.
    • (1993) IEEE Trans. Speech Audio Processing , vol.1 , pp. 345-361
    • Zhao, Y.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.