Volume 14, Issue 3, 2006, Pages 882-889

Minimum phone error training of precision matrix models

Author keywords

Discriminative training; Large vocabulary continuous speech recognition (LVCSR); Minimum phone error; Precision matrix modeling

Indexed keywords

DISCRIMINATIVE TRAINING; GAUSSIAN MIXTURE MODELS (GMM); LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (LVCSR); MINIMUM PHONE ERROR; PRECISION MATRIX MODELING;

EID: 34047275940     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.858062     Document Type: Article
Times cited: 17

References (32)
  • 1
    • 84965063004
    • An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology
    • L. E. Baum and J. A. Eagon, "An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology," Bull. Amer. Math. Soc., vol. 73, pp. 360-363, 1967.
    • (1967) Bull. Amer. Math. Soc , vol.73 , pp. 360-363
    • Baum, L.E.1    Eagon, J.A.2
  • 3
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 4
    • 0025041264
    • Perceptual Linear Predictive (PLP) analysis of speech
    • H. Hermansky, "Perceptual Linear Predictive (PLP) analysis of speech," J. Acoust. Soc. Amer., vol. 87, no. 4, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 5
    • 0033677121
    • Maximum likelihood discriminant feature spaces
    • G. Saon, M. Padmanabhan, R. Gopinath, and S. Chen, "Maximum likelihood discriminant feature spaces," in Proc. ICASSP, 2000, pp. 1129-1130.
    • (2000) Proc. ICASSP , pp. 1129-1130
    • Saon, G.1    Padmanabhan, M.2    Gopinath, R.3    Chen, S.4
  • 6
    • 0003871508
    • Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition
    • Ph.D. dissertation, Dept. Elect. Comp. Eng, Johns Hopkins Univ, Baltimore, MD
    • N. Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, Dept. Elect. Comp. Eng., Johns Hopkins Univ., Baltimore, MD, 1997.
    • (1997)
    • Kumar, N.1
  • 7
    • 0032289099
    • Heteroscedastic discriminant analysis and reduced-rank HMMs for improved speech recognition
    • N. K. Goel and A. G. Andreou, "Heteroscedastic discriminant analysis and reduced-rank HMMs for improved speech recognition," Speech Commun., vol. 26, pp. 283-297, 1998.
    • (1998) Speech Commun , vol.26 , pp. 283-297
    • Goel, N.K.1    Andreou, A.G.2
  • 8
    • 34047273845
    • A.-V. I. Rosti and M. J. F. Gales, Factor analyzed hidden Markov models for speech recognition, Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR453 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2003.
    • A.-V. I. Rosti and M. J. F. Gales, "Factor analyzed hidden Markov models for speech recognition," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR453 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2003.
  • 9
    • 0032638856
    • Semi-tied covariance matrices for hidden Markov models
    • May
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.J.F.1
  • 10
    • 0036295941
    • Modeling inverse covariance matrices by basis expansion
    • P. Olsen and R. A. Gopinath, "Modeling inverse covariance matrices by basis expansion," in Proc. ICASSP, 2002, pp. 945-948.
    • (2002) Proc. ICASSP , pp. 945-948
    • Olsen, P.1    Gopinath, R.A.2
  • 11
    • 85009289957
    • Modeling with a subspace constraint on inverse covariance matrices
    • S. Axelrod, R. Gopinath, and P. Olsen, "Modeling with a subspace constraint on inverse covariance matrices," in Proc. ICSLP, 2002, pp. 2177-2180.
    • (2002) Proc. ICSLP , pp. 2177-2180
    • Axelrod, S.1    Gopinath, R.2    Olsen, P.3
  • 12
    • 85009288286
    • Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model
    • J. Huang, V. Goel, R. A. Gopinath, B. Kingsbury, P. Olsen, and K. Visweswariah, "Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model," in Proc. ICSLP, 2002, pp. 2597-2600.
    • (2002) Proc. ICSLP , pp. 2597-2600
    • Huang, J.1    Goel, V.2    Gopinath, R.A.3    Kingsbury, B.4    Olsen, P.5    Visweswariah, K.6
  • 13
    • 44949140997
    • Large vocabulary conversational speech recognition with a subspace constraint on inverse covariance matrices
    • S. Axelrod, V. Goel, B. Kingsbury, K. Visweswariah, and R. A. Gopinath, "Large vocabulary conversational speech recognition with a subspace constraint on inverse covariance matrices," in Proc. Eurospeech, 2003, pp. 1613-1616.
    • (2003) Proc. Eurospeech , pp. 1613-1616
    • Axelrod, S.1    Goel, V.2    Kingsbury, B.3    Visweswariah, K.4    Gopinath, R.A.5
  • 14
    • 0036461035
    • Large scale discriminative training of hidden Markov models in speech recognition
    • Jan
    • P. C. Woodland and D. Povey, "Large scale discriminative training of hidden Markov models in speech recognition," Comput. Speech Lang., vol. 16, no. 1, pp. 25-48, Jan. 2002.
    • (2002) Comput. Speech Lang , vol.16 , Issue.1 , pp. 25-48
    • Woodland, P.C.1    Povey, D.2
  • 15
    • 0036296863
    • Minimum phone error and I-smoothing for improved discriminative training
    • D. Povey and P. C. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, 2002, pp. 105-108.
    • (2002) Proc. ICASSP , pp. 105-108
    • Povey, D.1    Woodland, P.C.2
  • 18
    • 0141703323
    • Maximum mutual information speaker adapted training with semi-tied covariance matrices
    • J. McDonough and A. Waibel, "Maximum mutual information speaker adapted training with semi-tied covariance matrices," in Proc. ICASSP, 2003, pp. 128-131.
    • (2003) Proc. ICASSP , pp. 128-131
    • McDonough, J.1    Waibel, A.2
  • 19
  • 20
    • 4544373872
    • Basis superposition precision matrix modeling for large vocabulary continuous speech recognition
    • K. C. Sim and M. J. F. Gales, "Basis superposition precision matrix modeling for large vocabulary continuous speech recognition," in Proc. ICASSP, 2004, pp. 801-804.
    • (2004) Proc. ICASSP , pp. 801-804
    • Sim, K.C.1    Gales, M.J.F.2
  • 21
    • 34047253688
    • K. C. Sim and M. J. F. Gales, Precision matrix modeling for large vocabulary continuous speech recognition, Cambridge Univ., Tech. Rep. CUED/F-INFENG/TR485 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2004.
    • K. C. Sim and M. J. F. Gales, "Precision matrix modeling for large vocabulary continuous speech recognition," Cambridge Univ., Tech. Rep. CUED/F-INFENG/TR485 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 2004.
  • 22
    • 0742284722
    • Maximum likelihood training of subspaces for inverse covariance modeling
    • K. Visweswariah, P. Olsen, R. Gopinath, and S. Axelrod, "Maximum likelihood training of subspaces for inverse covariance modeling," in Proc. ICASSP, 2003, pp. 896-899.
    • (2003) Proc. ICASSP , pp. 896-899
    • Visweswariah, K.1    Olsen, P.2    Gopinath, R.3    Axelrod, S.4
  • 23
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Royal Statist. Soc., vol. 39, pp. 1-39, 1977.
    • (1977) J. Royal Statist. Soc , vol.39 , pp. 1-39
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 24
    • 0141477730
    • Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation
    • S. Tsakalidis, V. Doumpiotis, and W. Byrne, "Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation," in Proc. ICSLP, 2002, pp. 2585-2588.
    • (2002) Proc. ICSLP , pp. 2585-2588
    • Tsakalidis, S.1    Doumpiotis, V.2    Byrne, W.3
  • 25
    • 0141480019
    • Discriminative MAP for acoustic model adaptation
    • D. Povey, P. C. Woodland, and M. J. F. Gales, "Discriminative MAP for acoustic model adaptation," in Proc. ICASSP, 2003, pp. 312-315.
    • (2003) Proc. ICASSP , pp. 312-315
    • Povey, D.1    Woodland, P.C.2    Gales, M.J.F.3
  • 26
    • 0025952278
    • An inequality for rational functions with applications to some statistical estimation problems
    • Jan
    • P. Gopalakrishnan, D. Kanevsky, A. Nadas, and D. Nahamoo, "An inequality for rational functions with applications to some statistical estimation problems," IEEE Trans. Inform. Theory, no. 1, pp. 107-113, Jan. 1991.
    • (1991) IEEE Trans. Inform. Theory , Issue.1 , pp. 107-113
    • Gopalakrishnan, P.1    Kanevsky, D.2    Nadas, A.3    Nahamoo, D.4
  • 27
    • 0003459132
    • Hidden Markov models, maximum mutual information estimation and the speech recognition problem
    • Ph.D. dissertation, Dept. Elect. Comp. Eng, McGill Univ, Montreal, QC, Canada
    • Y. Normandin, "Hidden Markov models, maximum mutual information estimation and the speech recognition problem," Ph.D. dissertation, Dept. Elect. Comp. Eng., McGill Univ., Montreal, QC, Canada, 1991.
    • (1991)
    • Normandin, Y.1
  • 29
    • 34047249342
    • S. J. Young, D. Kershaw, J. J. Odell, D. Ollason, V. Valtchev, and P. C. Woodland, The HTK Book for HTK Version 3.0, Cambridge, U.K, Cambridge Univ. Press, 1997
    • S. J. Young, D. Kershaw, J. J. Odell, D. Ollason, V. Valtchev, and P. C. Woodland, The HTK Book (for HTK Version 3.0). Cambridge, U.K.: Cambridge Univ. Press, 1997.
  • 30
    • 34047260667
    • M. J. F. Gales, Maximum Likelihood Multiple Projection Schemes for Hidden Markov Models, Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR365 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 1999.
    • M. J. F. Gales, "Maximum Likelihood Multiple Projection Schemes for Hidden Markov Models," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR365 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 1999.
  • 32
    • 34047246754
    • M. J. F. Gales, The Generation and the Use of Regression Class Trees for MLLR Adaptation, Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR263 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 1996.
    • M. J. F. Gales, "The Generation and the Use of Regression Class Trees for MLLR Adaptation," Cambridge Univ., Cambridge, U.K., Tech. Rep. CUED/F-INFENG/TR263 [Online]. Available: (via anonymous) ftp://svr-www.eng.cam.ac.uk, 1996.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.