메뉴 건너뛰기




Volumn 14, Issue 6, 2006, Pages 2134-2146

Tree-based covariance modeling of hidden markov models

Author keywords

Automatic speech recognition; Covariance modeling; Gaussian mixture models; Tree modeling

Indexed keywords

AUTOMATIC SPEECH RECOGNITION; COVARIANCE MATRICES; COVARIANCE MODELING; DATA SPARSENESS PROBLEMS; E-M ALGORITHMS; FULL COVARIANCE MODELING; GAUSSIAN; GAUSSIAN MIXTURE MODELS; HETEROSCEDASTIC; HIDDEN MARKOV MODELING; HIERARCHICAL STRUCTURES; INTERPOLATION COEFFICIENTS; INVERSE COVARIANCES; KULLBACK-LEIBLER DIVERGENCES; LINEAR DISCRIMINANT ANALYSIS; MATRIXES; MULTI LAYERS; MULTI-LAYERED; PARAMETRIC FORMS; RESOURCE MANAGEMENTS; ROOT NODES; TREE MODELING; TREE-BASED;

EID: 44449103265     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.863210     Document Type: Article
Times cited : (8)

References (31)
  • 1
    • 0028466072 scopus 로고
    • The importance of cepstral parameter correlation in speech recognition
    • A. Ljolje, "The importance of cepstral parameter correlation in speech recognition," Comput. Speech Lang., vol. 8, pp. 223-232, 1994.
    • (1994) Comput. Speech Lang , vol.8 , pp. 223-232
    • Ljolje, A.1
  • 2
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug
    • S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, p. 357, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech, Signal Process , vol.ASSP-28 , Issue.4 , pp. 357
    • Davis, S.B.1    Mermelstein, P.2
  • 4
    • 85017287487 scopus 로고
    • Linear discriminant analysis for improved large vocabulary continuous speech recognition
    • R. Haeb-Umbach and H. Ney, "Linear discriminant analysis for improved large vocabulary continuous speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1992, vol. 1, pp. 13-16.
    • (1992) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 13-16
    • Haeb-Umbach, R.1    Ney, H.2
  • 5
    • 17344383223 scopus 로고
    • Continuous mixture densities and linear discriminant analysis for improved context-dependent acoustic models
    • X. Aubert, R. Haeb-Umbach, and H. Ney, "Continuous mixture densities and linear discriminant analysis for improved context-dependent acoustic models," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1993, vol. 2, pp. 27-30.
    • (1993) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 27-30
    • Aubert, X.1    Haeb-Umbach, R.2    Ney, H.3
  • 7
    • 84892187452 scopus 로고    scopus 로고
    • Maximum likelihood modeling with Gaussian distributions for classification
    • R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1998, vol. 2, pp. 661-664.
    • (1998) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 661-664
    • Gopinath, R.A.1
  • 8
    • 0003871508 scopus 로고    scopus 로고
    • Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,
    • Ph.D. dissertation, Johns Hopkins Univ, Baltimore, MD
    • N.Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, , Johns Hopkins Univ., Baltimore, MD, 1997.
    • (1997)
    • Kumar, N.1
  • 12
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • May
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.J.F.1
  • 13
    • 0036475982 scopus 로고    scopus 로고
    • Maximum likelihood multiple subspace projections for hidden Markov models
    • Feb
    • -, "Maximum likelihood multiple subspace projections for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 37-47, Feb. 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.2 , pp. 37-47
    • Gales, M.J.F.1
  • 14
  • 15
    • 0742272654 scopus 로고    scopus 로고
    • Modeling inverse covariance matrices by basis expansion
    • Jan
    • P. A. Olsen and R. A. Gopinath, "Modeling inverse covariance matrices by basis expansion," IEEE Trans. Acoust., Speech, Signal Process., vol. 12, no. 1, pp. 37-46, Jan. 2004.
    • (2004) IEEE Trans. Acoust., Speech, Signal Process , vol.12 , Issue.1 , pp. 37-46
    • Olsen, P.A.1    Gopinath, R.A.2
  • 17
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 7, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.7 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 18
    • 0035279111 scopus 로고    scopus 로고
    • A structural Bayes approach to speaker adaptation
    • Mar
    • K. Shinoda and C. H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276-287, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.3 , pp. 276-287
    • Shinoda, K.1    Lee, C.H.2
  • 19
    • 85009064348 scopus 로고    scopus 로고
    • Constrained maximum likelihood linear regression for speaker adaptation
    • M. Afify and O. Siohan, "Constrained maximum likelihood linear regression for speaker adaptation," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 861-864.
    • (2000) Proc. Int. Conf. Spoken Language Processing , pp. 861-864
    • Afify, M.1    Siohan, O.2
  • 20
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 22
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, pp. 461-464, 1973.
    • (1973) Ann. Statist , vol.6 , pp. 461-464
    • Schwarz, G.1
  • 25
    • 64549152628 scopus 로고    scopus 로고
    • S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P.Woodland, The HTK Book for HTK Version 3.0, 2000 [Online, Available
    • S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P.Woodland, The HTK Book (for HTK Version 3.0). : , 2000 [Online]. Available: http://htk.eng.cam.ac.uk/
  • 26
    • 64549145995 scopus 로고    scopus 로고
    • Speech lab in a box: AMandarin speech toolbox to jump start speech related research toolbox
    • E. Chang, Y. Shi, J. Zhou, and C. Huang, "Speech lab in a box: aMandarin speech toolbox to jump start speech related research toolbox," in Proc. Eur. Conf. Speech Communication and Technology, 2001, pp. 2782-2799.
    • (2001) Proc. Eur. Conf. Speech Communication and Technology , pp. 2782-2799
    • Chang, E.1    Shi, Y.2    Zhou, J.3    Huang, C.4
  • 27
    • 85009126501 scopus 로고    scopus 로고
    • Large vocabulary Mandarin speech recognition with different approaches in modeling tones
    • E. Chang, J. Zhou, C. Huang, and K. F. Lee, "Large vocabulary Mandarin speech recognition with different approaches in modeling tones," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 983-986.
    • (2000) Proc. Int. Conf. Spoken Language Processing , pp. 983-986
    • Chang, E.1    Zhou, J.2    Huang, C.3    Lee, K.F.4
  • 29
    • 1642377925 scopus 로고    scopus 로고
    • Factor analyzed hidden Markov models for speech recognition
    • A.-V. I. Rosti and M. J. F. Gales, "Factor analyzed hidden Markov models for speech recognition," Comput. Speech Lang., vol. 18, no. 2, pp. 181-200, 2003.
    • (2003) Comput. Speech Lang , vol.18 , Issue.2 , pp. 181-200
    • Rosti, A.-V.I.1    Gales, M.J.F.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.