SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 3, 2006, Pages 882-889

Minimum phone error training of precision matrix models

(2) Sim, Khe Chai a,b Gales, Mark J F a,b

b UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

Discriminative training; Large vocabulary continuous speech recognition (LVCSR); Minimum phone error; Precision matrix modeling

Indexed keywords

DISCRIMINATIVE TRAINING; GAUSSIAN MIXTURE MODELS (GMM); LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (LVCSR); MINIMUM PHONE ERROR; PRECISION MATRIX MODELING;

CORRELATION METHODS; ERROR ANALYSIS; PROBLEM SOLVING; VECTOR QUANTIZATION;

APPROXIMATION THEORY;

EID: 34047275940 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.858062 Document Type: Article

Times cited : (17)

References (32)

1
- 84965063004
- An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology
- L. E. Baum and J. A. Eagon, "An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology," Bull. Amer. Math. Soc., vol. 73, pp. 360-363, 1967.
- (1967) Bull. Amer. Math. Soc , vol.73 , pp. 360-363
- Baum, L.E.¹ Eagon, J.A.²

2
- 0040230293
- Large vocabulary continuous speech recognition: A review
- Snowbird, UT, Dec
- S. J. Young, "Large vocabulary continuous speech recognition: a review," in Proc. IEEE Workshop Automatic Speech Recognition Understanding, Snowbird, UT, Dec. 1995, pp. 3-28.
- (1995) Proc. IEEE Workshop Automatic Speech Recognition Understanding , pp. 3-28
- Young, S.J.¹

3
- 0019053271
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
- Aug
- S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoustic, Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
- (1980) IEEE Trans. Acoustic, Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
- Davis, S.B.¹ Mermelstein, P.²

4
- 0025041264
- Perceptual Linear Predictive (PLP) analysis of speech
- H. Hermansky, "Perceptual Linear Predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
- (1990) J. Acoust. Soc. Amer , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

5
- 0033677121
- Maximum likelihood discriminant feature spaces
- G. Saon, M. Padmanabhan, R. Gopinath, and S. Chen, "Maximum likelihood discriminant feature spaces," in Proc. ICASSP, 2000, pp. 1129-1130.
- (2000) Proc. ICASSP , pp. 1129-1130
- Saon, G.¹ Padmanabhan, M.² Gopinath, R.³ Chen, S.⁴

6
- 0003871508
- Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,
- Ph.D. dissertation, Dept. Elect. Comp. Eng, Johns Hopkins Univ, Baltimore, MD
- N. Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, Dept. Elect. Comp. Eng., Johns Hopkins Univ., Baltimore, MD, 1997.
- (1997)
- Kumar, N.¹

7
- 0032289099
- Heteroscedastic discriminant analysis and reduced-rank HMMs for improved speech recognition
- N. K. Goel and A. G. Andreou, "Heteroscedastic discriminant analysis and reduced-rank HMMs for improved speech recognition," Speech Commun., vol. 26, pp. 283-297, 1998.
- (1998) Speech Commun , vol.26 , pp. 283-297
- Goel, N.K.¹ Andreou, A.G.²

8
- 34047273845
- A.-V. I. Rosti and M. J. F. Gales, Factor analyzed hidden Markov models for speech recognition, Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR453 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2003.
- A.-V. I. Rosti and M. J. F. Gales, "Factor analyzed hidden Markov models for speech recognition," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR453 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2003.

9
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- May
- M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
- (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.3 , pp. 272-281
- Gales, M.J.F.¹

10
- 0036295941
- Modeling inverse covariance matrices by basis expansion
- P. Olsen and R. A. Gopinalh, "Modeling inverse covariance matrices by basis expansion," in Proc. ICASSP, 2002, pp. 945-948.
- (2002) Proc. ICASSP , pp. 945-948
- Olsen, P.¹ Gopinalh, R.A.²

11
- 85009289957
- Modeling with a subspace constraint on inverse covariance matrices
- S. Axelrod, R. Gopinath, and P. Olsen, "Modeling with a subspace constraint on inverse covariance matrices," in Proc. ICSLP, 2002, pp. 2177-2180.
- (2002) Proc. ICSLP , pp. 2177-2180
- Axelrod, S.¹ Gopinath, R.² Olsen, P.³

12
- 85009288286
- Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model
- J. Huang, V. Goel, R. A. Gopinath, B. Kingsbury, P. Olsen, and K. Visweswariah, "Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model," in Proc. ICSLP, 2002, pp. 2597-2600.
- (2002) Proc. ICSLP , pp. 2597-2600
- Huang, J.¹ Goel, V.² Gopinath, R.A.³ Kingsbury, B.⁴ Olsen, P.⁵ Visweswariah, K.⁶

13
- 44949140997
- Large vocabulary conversational speech recognition with a subspace constraint on inverse covariance matrices
- S. Axelrod, V. Goel, B. Kingsbury, K. Visweswariah, and R. A. Gopinath, "Large vocabulary conversational speech recognition with a subspace constraint on inverse covariance matrices," in Proc. Eurospeech, 2003, pp. 1613-1616.
- (2003) Proc. Eurospeech , pp. 1613-1616
- Axelrod, S.¹ Goel, V.² Kingsbury, B.³ Visweswariah, K.⁴ Gopinath, R.A.⁵

14
- 0036461035
- Large scale discriminative training of hidden Markov models in speech recognition
- Jan
- P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models in speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-48, Jan. 2002.
- (2002) Comput. Speech Lang , vol.16 , Issue.1 , pp. 25-48
- Woodland, P.C.¹ Povey, D.²

15
- 0036296863
- Minimum phone error and I-smoothing for improved discriminative training
- D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, 2002, pp. 105-108.
- (2002) Proc. ICASSP , pp. 105-108
- Povey, D.¹ Woodland, P.C.²

16
- 4544236272
- Development of the 2003 CU-HTK conversational telephone speech transcription system
- G. Evermann, H. Y. Chan, M. J. F. Gales, T. Hain, X. Liu, D. Mrva, L. Wang, and P. C. Woodland, "Development of the 2003 CU-HTK conversational telephone speech transcription system," in Proc. ICASSP, 2004, pp. 249-252.
- (2004) Proc. ICASSP , pp. 249-252
- Evermann, G.¹ Chan, H.Y.² Gales, M.J.F.³ Hain, T.⁴ Liu, X.⁵ Mrva, D.⁶ Wang, L.⁷ Woodland, P.C.⁸

17
- 84946740232
- Recent advances in broadcast news transcription
- D. Y. Kim, G. Evermann, T. Hain, D. Mrva, S. E. Tranter, L. Wang, and P. C. Woodland, "Recent advances in broadcast news transcription," in Proc. ASRU, 2003, pp. 105-110.
- (2003) Proc. ASRU , pp. 105-110
- Kim, D.Y.¹ Evermann, G.² Hain, T.³ Mrva, D.⁴ Tranter, S.E.⁵ Wang, L.⁶ Woodland, P.C.⁷

18
- 0141703323
- Maximum mutual information speaker adapted training with semi-tied covariance matrices
- J. McDonough and A. Waibel, "Maximum mutual information speaker adapted training with semi-tied covariance matrices," in Proc. ICASSP, 2003, pp. 128-131.
- (2003) Proc. ICASSP , pp. 128-131
- McDonough, J.¹ Waibel, A.²

19
- 85009216283
- Discriminative estimation of Subspace Precision and Mean (SPAM) models
- V. Goel, S. Axelrod, R. Gopinath, P. Olsen, and K. Visweswariah, "Discriminative estimation of Subspace Precision and Mean (SPAM) models," in P roc. EUROSPEECH, 2003, pp. 2617-2620.
- (2003) P roc. EUROSPEECH , pp. 2617-2620
- Goel, V.¹ Axelrod, S.² Gopinath, R.³ Olsen, P.⁴ Visweswariah, K.⁵

20
- 4544373872
- Basis superposition precision matrix modeling for large vocabulary continuous speech recognition
- K. C. Sim and M. J. F. Gales, "Basis superposition precision matrix modeling for large vocabulary continuous speech recognition," in Proc. ICASSP, 2004, pp. 801-804.
- (2004) Proc. ICASSP , pp. 801-804
- Sim, K.C.¹ Gales, M.J.F.²

21
- 34047253688
- _, Precision matrix modeling for large vocabulary continuous speech recognition, Cambridge Univ., Tech. Rep. CUED/F-IN-FENG/TR485 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2004.
- _, "Precision matrix modeling for large vocabulary continuous speech recognition," Cambridge Univ., Tech. Rep. CUED/F-IN-FENG/TR485 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2004.

22
- 0742284722
- Maximum likelihood training of subspaces for inverse covariance modeling
- K. Visweswariah, P. Olsen, R. Gopinath, and S. Axelrod, "Maximum likelihood training of subspaces for inverse covariance modeling," in Proc. ICASSP, 2003, pp. 896-899.
- (2003) Proc. ICASSP , pp. 896-899
- Visweswariah, K.¹ Olsen, P.² Gopinath, R.³ Axelrod, S.⁴

23
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Royal Statist. Soc., vol. 39, pp. 1-39, 1977.
- (1977) J. Royal Statist. Soc , vol.39 , pp. 1-39
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

24
- 0141477730
- Discriminative linear transforms for feature normalization and speaker adaptation in hmm estimation
- S. Tsakalidis, V. Doumpiotis, and W. Byrne, "Discriminative linear transforms for feature normalization and speaker adaptation in hmm estimation," in Proc. ICSLP, 2002, pp. 2585-2588.
- (2002) Proc. ICSLP , pp. 2585-2588
- Tsakalidis, S.¹ Doumpiotis, V.² Byrne, W.³

25
- 0141480019
- Discriminative MAP for acoustic model adaptation
- D. Povey, P. C. Woodland, and M. J. F. Gales, "Discriminative MAP for acoustic model adaptation," in Proc. ICASSP, 2003, pp. 312-315.
- (2003) Proc. ICASSP , pp. 312-315
- Povey, D.¹ Woodland, P.C.² Gales, M.J.F.³

26
- 0025952278
- An inequality for rational functions with applications to some statistical estimation problems
- Jan
- P. Gopalakrishnan, D. Kanevsky, A. Nadas, and D. Nahamoo, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inform. Theory, no. 1, pp. 107-113, Jan. 1991.
- (1991) IEEE Trans. Inform. Theory , Issue.1 , pp. 107-113
- Gopalakrishnan, P.¹ Kanevsky, D.² Nadas, A.³ Nahamoo, D.⁴

27
- 0003459132
- Hidden Markov models, maximum mutual information estimation and the speech recognition problem,
- Ph.D. dissertation, Dept. Elect. Comp. Eng, McGill Univ, Montreal, QC, Canada
- Y. Normandin, "Hidden Markov models, maximum mutual information estimation and the speech recognition problem," Ph.D. dissertation, Dept. Elect. Comp. Eng., McGill Univ., Montreal, QC, Canada, 1991.
- (1991)
- Normandin, Y.¹

28
- 0003555845
- New York: Academic
- E. Polak, Computational Methods in Optimization: A Unified Approach. New York: Academic, 1971.
- (1971) Computational Methods in Optimization: A Unified Approach
- Polak, E.¹

29
- 34047249342
- S. J. Young, D. Kershaw, J. J. Odell, D. Ollason, V. Valtchev, and P. C. Woodland, The HTK Book for HTK Version 3.0, Cambridge, U.K, Cambridge Univ. Press, 1997
- S. J. Young, D. Kershaw, J. J. Odell, D. Ollason, V. Valtchev, and P. C. Woodland, The HTK Book (for HTK Version 3.0). Cambridge, U.K.: Cambridge Univ. Press, 1997.

30
- 34047260667
- M. J. F. Gales, Maximum Likelihood Multiple Projection Schemes for Hidden Markov Models, Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR365 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam. ac.uk, 1999.
- M. J. F. Gales, "Maximum Likelihood Multiple Projection Schemes for Hidden Markov Models," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR365 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam. ac.uk, 1999.

31
- 84946723273
- Automatic model complexity control using marginalized discriminative growth functions
- X. Liu and M. J. F. Gales, "Automatic model complexity control using marginalized discriminative growth functions," in Proc. IEEE Workshop Automatic Speech Recognition and Understanding (ASRU), 2003, pp. 37-42.
- (2003) Proc. IEEE Workshop Automatic Speech Recognition and Understanding (ASRU) , pp. 37-42
- Liu, X.¹ Gales, M.J.F.²

32
- 34047246754
- M. J. F. Gales, The Generation and the Use of Regression Class Trees for MLLR Adaptation, Cambridge Univ.., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR263 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam. ac.uk, 1996.
- M. J. F. Gales, "The Generation and the Use of Regression Class Trees for MLLR Adaptation," Cambridge Univ.., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR263 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam. ac.uk, 1996.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.