Volume 18, Issue 8, 2010, Pages 1889-1901

Noise adaptive training for robust automatic speech recognition

Author keywords

Model adaptation; noise adaptive training; robust speech recognition; vector Taylor series (VTS)

Indexed keywords

ACOUSTIC MODEL; ADAPTIVE TRAINING; AURORA 3; AUTOMATIC SPEECH RECOGNITION; CLEAN SPEECH; FEATURE ENHANCEMENT; MODEL ADAPTATION; MODEL PARAMETERS; MODEL TRAINING; NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION; POINT ESTIMATE; ROBUST SPEECH RECOGNITION; TEST TIME; TRAINING DATA; VECTOR TAYLOR SERIES; VECTOR TAYLOR SERIES (VTS)

EID: 77956296425     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2040522     Document Type: Article
Times cited: 60

References (30)
  • 1
    • 0018455310
    • Suppression of acoustic noise in speech using spectral subtraction
    • Apr.
    • S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 2, pp. 113-120, Apr. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.2 , pp. 113-120
    • Boll, S.1
  • 2
    • 51449089990
    • A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition
    • Las Vegas, NV
    • D. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero, "A minimum-mean-square-error noise reduction algorithm on mel-frequency cepstra for robust speech recognition," in Proc. ICASSP, Las Vegas, NV, 2008, pp. 4041-4044.
    • (2008) Proc. ICASSP , pp. 4041-4044
    • Yu, D.1    Deng, L.2    Droppo, J.3    Wu, J.4    Gong, Y.5    Acero, A.6
  • 3
    • 85009070292
    • Large-vocabulary speech recognition under adverse acoustic environments
    • Beijing, China
    • L. Deng, A. Acero, M. Plumpe, and X. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, Beijing, China, 2000, pp. 806-809.
    • (2000) Proc. ICSLP , pp. 806-809
    • Deng, L.1    Acero, A.2    Plumpe, M.3    Huang, X.4
  • 4
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , Issue.2 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 5
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 6
    • 0029375590
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • Sep.
    • V. Digalakis, D. Rtischev, L. Neumeyer, and E. Sa, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.1    Rtischev, D.2    Neumeyer, L.3    Sa, E.4
  • 7
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang. , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 8
    • 77956283772
    • Regularized feature-based maximum likelihood linear regression for speech recognition
    • Antwerp, Belgium
    • M. K. Omar, "Regularized feature-based maximum likelihood linear regression for speech recognition," in Proc. Interspeech, Antwerp, Belgium, 2007, pp. 1561-1564.
    • (2007) Proc. Interspeech , pp. 1561-1564
    • Omar, M.K.1
  • 9
    • 85009088984
    • Robust digit recognition in noisy environments: The IBM Aurora 2 system
    • Aalborg, Denmark
    • G. Saon, J. M. Huerta, and E. E. Jan, "Robust digit recognition in noisy environments: The IBM Aurora 2 system," in Proc. Interspeech, Aalborg, Denmark, 2001, pp. 629-632.
    • (2001) Proc. Interspeech , pp. 629-632
    • Saon, G.1    Huerta, J.M.2    Jan, E.E.3
  • 10
    • 27744539597
    • Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
    • Nov.
    • X. Cui and A. Alwan, "Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR," IEEE Trans. Speech Audio Processing, vol. 13, no. 6, pp. 1161-1172, Nov. 2005.
    • (2005) IEEE Trans. Speech Audio Processing , vol.13 , Issue.6 , pp. 1161-1172
    • Cui, X.1    Alwan, A.2
  • 11
    • 0030245128
    • Robust continuous speech recognition using parallel model combination
    • Sep.
    • M. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, Sep. 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.1    Young, S.J.2
  • 12
    • 0029725301
    • A vector Taylor series approach for environment-independent speech recognition
    • P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. ICASSP, 1996, pp. 733-736.
    • (1996) Proc. ICASSP , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 13
    • 0032048385
    • Speech recognition in noisy environments using first-order vector Taylor series
    • D. Y. Kim, C. K. Un, and N. S. Kim, "Speech recognition in noisy environments using first-order vector Taylor series," Speech Commun., vol. 24, no. 1, pp. 39-49, 1998.
    • (1998) Speech Commun. , vol.24 , Issue.1 , pp. 39-49
    • Kim, D.Y.1    Un, C.K.2    Kim, N.S.3
  • 14
    • 85009113852
    • HMM adaptation using vector Taylor series for noisy speech recognition
    • Beijing, China
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP, Beijing, China, 2000, pp. 869-872.
    • (2000) Proc. ICSLP , pp. 869-872
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 15
    • 44849125798
    • High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
    • Kyoto, Japan
    • J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU, Kyoto, Japan, 2007, pp. 65-70.
    • (2007) Proc. ASRU , pp. 65-70
    • Li, J.1    Deng, L.2    Yu, D.3    Gong, Y.4    Acero, A.5
  • 16
    • 0030362995
    • A compact model for speaker-adaptive training
    • Philadelphia, PA
    • T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. ICSLP, Philadelphia, PA, 1996, pp. 1137-1140.
    • (1996) Proc. ICSLP , pp. 1137-1140
    • Anastasakos, T.1    McDonough, J.2    Schwartz, R.3    Makhoul, J.4
  • 17
    • 44849122740
    • Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions
    • Antwerp, Belgium
    • Y. Hu and Q. Huo, "Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions," in Proc. Interspeech, Antwerp, Belgium, 2007, pp. 1042-1045.
    • (2007) Proc. Interspeech , pp. 1042-1045
    • Hu, Y.1    Huo, Q.2
  • 18
    • 34547528168
    • Adaptive training with joint uncertainty decoding for robust recognition of noisy data
    • Honolulu, HI
    • H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP, Honolulu, HI, 2007, pp. 389-392.
    • (2007) Proc. ICASSP , pp. 389-392
    • Liao, H.1    Gales, M.J.F.2
  • 19
    • 33745202806
    • Joint uncertainty decoding for noise robust speech recognition
    • Lisbon, Portugal
    • H. Liao and M. J. F. Gales, "Joint uncertainty decoding for noise robust speech recognition," in Proc. Interspeech, Lisbon, Portugal, 2005, pp. 3129-3132.
    • (2005) Proc. Interspeech , pp. 3129-3132
    • Liao, H.1    Gales, M.J.F.2
  • 22
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 23
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 27
    • 0038669544
    • The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France, Sep.
    • H. G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. ISCA ITRW ASR, Paris, France, Sep. 2000, pp. 181-188.
    • (2000) Proc. ISCA ITRW ASR , pp. 181-188
    • Hirsch, H.G.1    Pearce, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.