SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 19, Issue 2, 2011, Pages 315-325

Noisy constrained maximum-likelihood linear regression for noise-robust speech recognition

(2) Kim, D. K. (a); Gales, M. J. F. (b)

a Chonnam National University (South Korea)

b University of Cambridge (United Kingdom)

Author keywords

Adaptive training; noise robustness; speaker adaptation; speech recognition

Indexed keywords

ADAPTIVE TRAINING; ADAPTIVE TRAINING SCHEME; BACKGROUND NOISE; CLEAN SPEECH; EXPECTATION-MAXIMIZATION APPROACHES; FACTOR ANALYSIS; GENERATIVE MODEL; LINEAR TRANSFORM; MINIMUM PHONE ERROR; MODEL-BASED; NEW APPROACHES; NEW FORMS; NOISE ROBUST SPEECH RECOGNITION; NOISE ROBUSTNESS; NOISY OBSERVATIONS; NON-HOMOGENEOUS; RESOURCE MANAGEMENT; SPEAKER ADAPTATION; SPEECH RECOGNITION SYSTEMS; TRAINING DATA;

ACOUSTIC NOISE; COVARIANCE MATRIX; MATHEMATICAL TRANSFORMATIONS; MAXIMUM LIKELIHOOD;

SPEECH RECOGNITION;

EID: 78049302682; PISSN: 15587916; EISSN: None; Source Type: Journal
DOI: 10.1109/TASL.2010.2047756; Document Type: Article

Times cited: 28

References (36)

1
- 70450163444
- Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition
- D. Kim and M. J. F. Gales, "Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition", in Proc. Interspeech, 2009, pp. 2383-2386.
- (2009) Proc. Interspeech , pp. 2383-2386
- Kim, D.¹ Gales, M.J.F.²

2
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition", in Proc. ICASSP, 1987, pp. 705-708.
- (1987) Proc. ICASSP , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.B.³

3
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training", in Proc. ICSLP, 1996.
- (1996) Proc. ICSLP
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

4
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition", Comput. Speech Lang., vol. 12, Jan. 1998.
- (1998) Comput. Speech Lang. , vol.12
- Gales, M.J.F.¹

5
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density HMMs
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density HMMs", Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-186
- Leggetter, C.J.¹ Woodland, P.C.²

6
- 70450179002
- Transforming features to compensate speech recogniser models for noise
- R. van Dalen and M. J. F. Gales, "Transforming features to compensate speech recogniser models for noise", in Proc. Interspeech, 2009.
- (2009) Proc. Interspeech
- Van Dalen, R.¹ Gales, M.J.F.²

7
- 85009070292
- Large vocabulary speech recognition under adverse acoustic environments
- L. Deng, A. Acero, M. Plumpe, and X. D. Huang, "Large vocabulary speech recognition under adverse acoustic environments", in Proc. ICSLP, Beijing, China, Oct. 2000, pp. 806-809.
- (2000) Proc. ICSLP , pp. 806-809
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.D.⁴

8
- 0033888153
- A robust training algorithm for adverse speech recognition
- W.-T. Hong and S.-H. Chen, "A robust training algorithm for adverse speech recognition", Speech Commun., vol. 30, pp. 273-293, 2000.
- (2000) Speech Commun. , vol.30 , pp. 273-293
- Hong, W.-T.¹ Chen, S.-H.²

9
- 0141480138
- A discriminative and robust training algorithm for noisy speech recognition
- W.-T. Hong, "A discriminative and robust training algorithm for noisy speech recognition", in Proc. ICASSP, 2003, pp. 8-11.
- (2003) Proc. ICASSP , pp. 8-11
- Hong, W.-T.¹

10
- 56149112485
- Discriminative noise adaptive training approach for an environment migration
- B.-O. Kang, H.-Y. Jung, and Y.-K. Lee, "Discriminative noise adaptive training approach for an environment migration", in Proc. Interspeech, 2007.
- (2007) Proc. Interspeech
- Kang, B.-O.¹ Jung, H.-Y.² Lee, Y.-K.³

11
- 0003671941
- Model-based techniques for noise robust speech recognition
- M. J. F. Gales, "Model-based techniques for noise robust speech recognition", Ph. D. dissertation, Cambridge Univ., Cambridge, U. K., 1995.
- (1995) Model-based Techniques for Noise Robust Speech Recognition
- Gales, M.J.F.¹

12
- 85009113852
- HMM adaptation using vector Taylor series for noisy speech recognition
- A. Acero, L. Deng, T. T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition", in Proc. ICSLP, Beijing, China, Oct. 2000.
- (2000) Proc. ICSLP
- Acero, A.¹ Deng, L.² Kristjansson, T.T.³ Zhang, J.⁴

13
- 40249103761
- Issues with uncertainty decoding for noise robust speech recognition
- H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust speech recognition", Speech Commun., Apr. 2008.
- (2008) Speech Commun.
- Liao, H.¹ Gales, M.J.F.²

14
- 65549153550
- Speech recognition in noisy environments
- P. J. Moreno, "Speech recognition in noisy environments", Ph. D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA, 1996.
- (1996) Speech Recognition in Noisy Environments
- Moreno, P.J.¹

15
- 34547528168
- Adaptive training with joint uncertainty decoding for robust recognition of noisy data
- H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data", in Proc. ICASSP, 2007, pp. 389-392.
- (2007) Proc. ICASSP , pp. 389-392
- Liao, H.¹ Gales, M.J.F.²

16
- 68549095140
- HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
- J. Li, L. Deng, Y. Gong, and A. Acero, "HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series", in Proc. ASRU, Kyoto, Japan, 2007.
- (2007) Proc. ASRU
- Li, J.¹ Deng, L.² Gong, Y.³ Acero, A.⁴

17
- 44849122740
- Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions
- Y. Hu and Q. Huo, "Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions", in Proc. Interspeech, 2007.
- (2007) Proc. Interspeech
- Hu, Y.¹ Huo, Q.²

18
- 70349194599
- Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
- O. Kalinli, M. L. Seltzer, and A. Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition", in Proc. ICASSP, 2009, pp. 3825-3828.
- (2009) Proc. ICASSP , pp. 3825-3828
- Kalinli, O.¹ Seltzer, M.L.² Acero, A.³

19
- 0031139839
- Minimum classification error rate methods for speech recognition
- B.-H. Juang, W. Hou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition", IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997.
- (1997) IEEE Trans. Speech Audio Process. , vol.5 , Issue.3 , pp. 257-265
- Juang, B.-H.¹ Hou, W.² Lee, C.-H.³

20
- 0022890536
- Maximum mutual information estimation of hidden Markov model parameters for speech recognition
- L. R. Bahl, P. F. Brown, P. V. De Souza, and R. L. Mercer, "Maximum mutual information estimation of hidden Markov model parameters for speech recognition", in Proc. ICASSP, 1986, vol. 1, pp. 49-52.
- (1986) Proc. ICASSP , vol.1 , pp. 49-52
- Bahl, L.R.¹ Brown, P.F.² De Souza, P.V.³ Mercer, R.L.⁴

21
- 0034849080
- Improved discriminative training techniques for large vocabulary continuous speech recognition
- D. Povey and P. C. Woodland, "Improved discriminative training techniques for large vocabulary continuous speech recognition", in Proc. ICASSP, 2001, pp. 45-48.
- (2001) Proc. ICASSP , pp. 45-48
- Povey, D.¹ Woodland, P.C.²

22
- 0036294871
- On maximum mutual information speaker-adapted training
- J. McDonough, T. Schaaf, and A. Waibel, "On maximum mutual information speaker-adapted training", in Proc. ICASSP, 2002, pp. 601-604.
- (2002) Proc. ICASSP , pp. 601-604
- McDonough, J.¹ Schaaf, T.² Waibel, A.³

23
- 51449089561
- Discriminative linear transforms for adaptation and adaptive training
- L. Wang, "Discriminative linear transforms for adaptation and adaptive training", Ph. D. dissertation, Cambridge Univ., Cambridge, U. K., 2006.
- (2006) Discriminative Linear Transforms for Adaptation and Adaptive Training
- Wang, L.¹

24
- 34047260093
- Discriminative cluster adaptive training
- K. Yu and M. J. F. Gales, "Discriminative cluster adaptive training", IEEE Trans. Speech Audio Process., vol. 14, no. 5, pp. 1694-1703, Sep. 2006.
- (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.5 , pp. 1694-1703
- Yu, K.¹ Gales, M.J.F.²

25
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition", IEEE Trans. Speech Audio Process., vol. 4, pp. 190-202, May 1996.
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 190-202
- Sankar, A.¹ Lee, C.-H.²

26
- 34250232348
- EM algorithms for ML factor analysis
- D. B. Rubin and D. T. Thayer, "EM algorithms for ML factor analysis", Psychometrika, vol. 47, no. 1, pp. 69-76, 1982.
- (1982) Psychometrika , vol.47 , Issue.1 , pp. 69-76
- Rubin, D.B.¹ Thayer, D.T.²

27
- 84906270956
- Factor analysis invariant to linear transformations of data
- R. A. Gopinath, B. Ramabhadran, and S. Dharanipragada, "Factor analysis invariant to linear transformations of data", in Proc. ICSLP, 1998, pp. 397-400.
- (1998) Proc. ICSLP , pp. 397-400
- Gopinath, R.A.¹ Ramabhadran, B.² Dharanipragada, S.³

28
- 1642377925
- Factor analysed hidden Markov models for speech recognition
- A.-V. I. Rosti and M. J. F. Gales, "Factor analysed hidden Markov models for speech recognition", Comput. Speech Lang., vol. 18, no. 3, pp. 181-200, 2004.
- (2004) Comput. Speech Lang. , vol.18 , Issue.3 , pp. 181-200
- Rosti, A.-V.I.¹ Gales, M.J.F.²

29
- 0028420014
- Integrated models of signal and background with application to speaker identification in noise
- R. C. Rose, E. M. Hofstetter, and D. A. Reynolds, "Integrated models of signal and background with application to speaker identification in noise", IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 245-257, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 245-257
- Rose, R.C.¹ Hofstetter, E.M.² Reynolds, D.A.³

30
- 34547553730
- Uncertainty decoding for noise robust speech recognition
- H. Liao, "Uncertainty decoding for noise robust speech recognition", Ph. D. dissertation, Cambridge Univ., Cambridge, U. K., 2007.
- (2007) Uncertainty Decoding for Noise Robust Speech Recognition
- Liao, H.¹

31
- 44849113871
- Predictive linear transforms for noise robust speech recognition
- M. J. F. Gales and R. C. van Dalen, "Predictive linear transforms for noise robust speech recognition", in Proc. ASRU, 2007, pp. 59-64.
- (2007) Proc. ASRU , pp. 59-64
- Gales, M.J.F.¹ Van Dalen, R.C.²

32
- 4544265717
- Discriminative training for large vocabulary speech recognition
- D. Povey, "Discriminative training for large vocabulary speech recognition", Ph. D. dissertation, Cambridge Univ., Cambridge, U. K., 2003.
- (2003) Discriminative Training for Large Vocabulary Speech Recognition
- Povey, D.¹

33
- 0031624958
- Comparison of discriminative training criteria
- R. Schluter and W. Macherey, "Comparison of discriminative training criteria", in Proc. ICASSP, 1998, pp. 493-496.
- (1998) Proc. ICASSP , pp. 493-496
- Schluter, R.¹ Macherey, W.²

34
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- J. L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains", IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.-H.²

35
- 44949193111
- Feature and model space speaker adaptation with full covariance Gaussians
- G. Saon and D. Povey, "Feature and model space speaker adaptation with full covariance Gaussians", in Proc. Interspeech, 2006.
- (2006) Proc. Interspeech
- Saon, G.¹ Povey, D.²

36
- 0003571976
- The HTK Book (for HTK Version 3.4)
- S. Young, G. Evermann, M. J. F. Gales, T. Hain, D. Kershaw, X. A. Liu, G. Moore, J. J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK Book (for HTK Version 3.4). Cambridge, U. K.: Univ. of Cambridge, Dec. 2006.
- (2006) The HTK Book (For HTK Version 3.4)
- Young, S.¹ Evermann, G.² Gales, M.J.F.³ Hain, T.⁴ Kershaw, D.⁵ Liu, X.A.⁶ Moore, G.⁷ Odell, J.J.⁸ Ollason, D.⁹ Povey, D.¹⁰ Valtchev, V.¹¹ Woodland, P.C.¹²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.