Volume 19, Issue 2, 2011, Pages 315-325

Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition

Author keywords

Adaptive training; noise robustness; speaker adaptation; speech recognition

Indexed keywords

ADAPTIVE TRAINING; ADAPTIVE TRAINING SCHEME; BACKGROUND NOISE; CLEAN SPEECH; EXPECTATION-MAXIMIZATION APPROACHES; FACTOR ANALYSIS; GENERATIVE MODEL; LINEAR TRANSFORM; MINIMUM PHONE ERROR; MODEL-BASED; NEW APPROACHES; NEW FORMS; NOISE ROBUST SPEECH RECOGNITION; NOISE ROBUSTNESS; NOISY OBSERVATIONS; NON-HOMOGENEOUS; RESOURCE MANAGEMENT; SPEAKER ADAPTATION; SPEECH RECOGNITION SYSTEMS; TRAINING DATA;

EID: 78049302682     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2047756     Document Type: Article
Times cited : (28)

References (36)
  • 1
    • 70450163444
    • Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition
    • D. Kim and M. J. F. Gales, "Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition", in Proc. Interspeech, 2009, pp. 2383-2386.
    • (2009) Proc. Interspeech , pp. 2383-2386
    • Kim, D.1    Gales, M.J.F.2
  • 2
    • 0023263708
    • Multi-style training for robust isolated-word speech recognition
    • R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition", in Proc. ICASSP, 1987, pp. 705-708.
    • (1987) Proc. ICASSP , pp. 705-708
    • Lippmann, R.P.1    Martin, E.A.2    Paul, D.B.3
  • 4
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition", Comput. Speech Lang., vol. 12, Jan. 1998.
    • (1998) Comput. Speech Lang. , vol.12
    • Gales, M.J.F.1
  • 5
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs", Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-186
    • Leggetter, C.J.1    Woodland, P.C.2
  • 6
    • 70450179002
    • Transforming features to compensate speech recogniser models for noise
    • R. van Dalen and M. J. F. Gales, "Transforming features to compensate speech recogniser models for noise", in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Van Dalen, R.1    Gales, M.J.F.2
  • 7
    • 85009070292
    • Large vocabulary speech recognition under adverse acoustic environments
    • L. Deng, A. Acero, M. Plumpe, and X. D. Huang, "Large vocabulary speech recognition under adverse acoustic environments", in Proc. ICSLP, Beijing, China, Oct. 2000, pp. 806-809.
    • (2000) Proc. ICSLP , pp. 806-809
    • Deng, L.1    Acero, A.2    Plumpe, M.3    Huang, X.D.4
  • 8
    • 0033888153
    • A robust training algorithm for adverse speech recognition
    • W.-T. Hong and S.-H. Chen, "A robust training algorithm for adverse speech recognition", Speech Commun., vol. 30, pp. 273-293, 2000.
    • (2000) Speech Commun. , vol.30 , pp. 273-293
    • Hong, W.-T.1    Chen, S.-H.2
  • 9
    • 0141480138
    • A discriminative and robust training algorithm for noisy speech recognition
    • W.-T. Hong, "A discriminative and robust training algorithm for noisy speech recognition", in Proc. ICASSP, 2003, pp. 8-11.
    • (2003) Proc. ICASSP , pp. 8-11
    • Hong, W.-T.1
  • 10
    • 56149112485
    • Discriminative noise adaptive training approach for an environment migration
    • B.-O. Kang, H.-Y. Jung, and Y.-K. Lee, "Discriminative noise adaptive training approach for an environment migration", in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Kang, B.-O.1    Jung, H.-Y.2    Lee, Y.-K.3
  • 12
    • 85009113852
    • HMM adaptation using vector Taylor series for noisy speech recognition
    • A. Acero, L. Deng, T. T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition", in Proc. ICSLP, Beijing, China, Oct. 2000.
    • (2000) Proc. ICSLP
    • Acero, A.1    Deng, L.2    Kristjansson, T.T.3    Zhang, J.4
  • 13
    • 40249103761
    • Issues with uncertainty decoding for noise robust speech recognition
    • H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust speech recognition", Speech Commun., Apr. 2008.
    • (2008) Speech Commun.
    • Liao, H.1    Gales, M.J.F.2
  • 15
    • 34547528168
    • Adaptive training with joint uncertainty decoding for robust recognition of noisy data
    • H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data", in Proc. ICASSP, 2007, pp. 389-392.
    • (2007) Proc. ICASSP , pp. 389-392
    • Liao, H.1    Gales, M.J.F.2
  • 16
    • 68549095140
    • HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
    • J. Li, L. Deng, Y. Gong, and A. Acero, "HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series", in Proc. ASRU, Kyoto, Japan, 2007.
    • (2007) Proc. ASRU
    • Li, J.1    Deng, L.2    Gong, Y.3    Acero, A.4
  • 17
    • 44849122740
    • Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions
    • Y. Hu and Q. Huo, "Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions", in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Hu, Y.1    Huo, Q.2
  • 18
    • 70349194599
    • Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
    • O. Kalinli, M. L. Seltzer, and A. Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition", in Proc. ICASSP, 2009, pp. 3825-3828.
    • (2009) Proc. ICASSP , pp. 3825-3828
    • Kalinli, O.1    Seltzer, M.L.2    Acero, A.3
  • 19
    • 0031139839
    • Minimum classification error rate methods for speech recognition
    • B.-H. Juang, W. Hou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition", IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997.
    • (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
    • Juang, B.-H.1    Hou, W.2    Lee, C.-H.3
  • 20
    • 0022890536
    • Maximum mutual information estimation of hidden Markov model parameters for speech recognition
    • L. R. Bahl, P. F. Brown, P. V. De Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition", in Proc. ICASSP, 1986, vol. 1, pp. 49-52.
    • (1986) Proc. ICASSP , vol.1 , pp. 49-52
    • Bahl, L.R.1    Brown, P.F.2    De Souza, P.V.3    Mercer, R.L.4
  • 21
    • 0034849080
    • Improved discriminative training techniques for large vocabulary continuous speech recognition
    • D. Povey and P. C. Woodland, "Improved discriminative training techniques for large vocabulary continuous speech recognition", in Proc. ICASSP, 2001, pp. 45-48.
    • (2001) Proc. ICASSP , pp. 45-48
    • Povey, D.1    Woodland, P.C.2
  • 22
    • 0036294871
    • On maximum mutual information speaker-adapted training
    • J. McDonough, T. Schaaf, and A. Waibel, "On maximum mutual information speaker-adapted training", in Proc. ICASSP, 2002, pp. 601-604.
    • (2002) Proc. ICASSP , pp. 601-604
    • McDonough, J.1    Schaaf, T.2    Waibel, A.3
  • 24
    • 34047260093
    • Discriminative cluster adaptive training
    • K. Yu and M. J. F. Gales, "Discriminative cluster adaptive training", IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1694-1703, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1694-1703
    • Yu, K.1    Gales, M.J.F.2
  • 25
    • 0030149866
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition", IEEE Trans. Speech Audio Process., vol. 4, pp. 190-202, May 1996.
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 26
    • 34250232348
    • EM algorithms for ML factor analysis
    • D. B. Rubin and D. T. Thayer, "EM algorithms for ML factor analysis", Psychometrika, vol. 47, no. 1, pp. 69-76, 1982.
    • (1982) Psychometrika , vol.47 , Issue.1 , pp. 69-76
    • Rubin, D.B.1    Thayer, D.T.2
  • 27
    • 84906270956
    • Factor analysis invariant to linear transformations of data
    • R. A. Gopinath, B. Ramabhadran, and S. Dharanipragada, "Factor analysis invariant to linear transformations of data", in Proc. ICSLP, 1998, pp. 397-400.
    • (1998) Proc. ICSLP , pp. 397-400
    • Gopinath, R.A.1    Ramabhadran, B.2    Dharanipragada, S.3
  • 28
    • 1642377925
    • Factor analysed hidden Markov models for speech recognition
    • A.-V. I. Rosti and M. J. F. Gales, "Factor analysed hidden Markov models for speech recognition", Comput. Speech Lang., vol. 18, no. 3, pp. 181-200, 2004.
    • (2004) Comput. Speech Lang. , vol.18 , Issue.3 , pp. 181-200
    • Rosti, A.-V.I.1    Gales, M.J.F.2
  • 29
    • 0028420014
    • Integrated models of signal and background with application to speaker identification in noise
    • R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise", IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 245-257, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 245-257
    • Rose, R.C.1    Hofstetter, E.M.2    Reynolds, D.A.3
  • 31
    • 44849113871
    • Predictive linear transforms for noise robust speech recognition
    • M. J. F. Gales and R. C. van Dalen, "Predictive linear transforms for noise robust speech recognition", in Proc. ASRU, 2007, pp. 59-64.
    • (2007) Proc. ASRU , pp. 59-64
    • Gales, M.J.F.1    Van Dalen, R.C.2
  • 33
    • 0031624958
    • Comparison of discriminative training criteria
    • R. Schluter and W. Macherey, "Comparison of discriminative training criteria", in Proc. ICASSP, 1998, pp. 493-496.
    • (1998) Proc. ICASSP , pp. 493-496
    • Schluter, R.1    Macherey, W.2
  • 34
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J. L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains", IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.-H.2
  • 35
    • 44949193111
    • Feature and model space speaker adaptation with full covariance Gaussians
    • G. Saon and D. Povey, "Feature and model space speaker adaptation with full covariance Gaussians", in Proc. Interspeech, 2006.
    • (2006) Proc. Interspeech
    • Saon, G.1    Povey, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.