메뉴 건너뛰기




Volumn 13, Issue 3, 2005, Pages 367-376

Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation

Author keywords

Adaptive training; Correlation modeling; Discrim inative training

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; ALGORITHMS; CORRELATION METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; MATRIX ALGEBRA; MAXIMUM LIKELIHOOD ESTIMATION; SPEECH PROCESSING; VECTORS;

EID: 18744406714     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.845806     Document Type: Article
Times cited : (19)

References (23)
  • 1
    • 84892187452 scopus 로고    scopus 로고
    • Maximum likelihood modeling with Gaussian distributions for classification
    • May
    • R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, May 1998, pp. 661-664.
    • (1998) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 661-664
    • Gopinath, R.A.1
  • 2
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • May
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 272-281, May 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.5 , pp. 272-281
    • Gales, M.J.F.1
  • 4
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Apr.
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, Apr. 1998.
    • (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 5
    • 0002756136 scopus 로고    scopus 로고
    • Maximum mutual information estimation of hidden Markov models
    • C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds. Norwell, MA: Kluwer, ch. 3
    • Y. Normandin, "Maximum mutual information estimation of hidden Markov models," in Automatic Speech and Speaker Recognition: Advanced Topics, C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds. Norwell, MA: Kluwer, 1996, ch. 3, pp. 57-81.
    • (1996) Automatic Speech and Speaker Recognition: Advanced Topics , pp. 57-81
    • Normandin, Y.1
  • 8
    • 0034855183 scopus 로고    scopus 로고
    • Improvements in linear transforms based speaker adaptation
    • May
    • _, "Improvements in linear transforms based speaker adaptation," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, May 2001, pp. 49-52.
    • (2001) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 49-52
  • 9
    • 85009119467 scopus 로고    scopus 로고
    • Discriminative speaker adaptation with conditional maximum likelihood linear regression
    • A. Gunawardana and W. Byrne, "Discriminative speaker adaptation with conditional maximum likelihood linear regression," in Proc. Eur. Conf. Speech Communication and Technology, 2001, pp. 1203-1206.
    • (2001) Proc. Eur. Conf. Speech Communication and Technology , pp. 1203-1206
    • Gunawardana, A.1    Byrne, W.2
  • 11
    • 18744393254 scopus 로고    scopus 로고
    • The AT&T LVCSR-2001 system
    • A. Ljolje, "The AT&T LVCSR-2001 system," presented at the NIST LVCSR Workshop, 2001.
    • (2001) NIST LVCSR Workshop
    • Ljolje, A.1
  • 12
    • 0024905238 scopus 로고
    • A comparison of several acoustic representations for speech recognition with degraded and undegraded speech
    • May
    • M. Hunt and C. Lefèbvre, "A comparison of several acoustic representations for speech recognition with degraded and undegraded speech," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, May 1989, pp. 262-265.
    • (1989) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 262-265
    • Hunt, M.1    Lefèbvre, C.2
  • 14
    • 4244119525 scopus 로고    scopus 로고
    • Maximum mutual information estimation of acoustic HMM emission densities
    • Johns Hopkins Univerisity, CLSP, Baltimore, MD
    • A. Gunawardana, "Maximum Mutual Information Estimation of Acoustic HMM Emission Densities," Johns Hopkins Univerisity, CLSP, Baltimore, MD, Tech. Rep. CLSP Reasearch Note no. 40, 2001.
    • (2001) Tech. Rep. CLSP Reasearch Note No. 40 , vol.40
    • Gunawardana, A.1
  • 15
    • 1642372928 scopus 로고    scopus 로고
    • Variance compensation within the MLLR framework
    • Eng. Dept., Univ. Cambridge, Cambridge, U.K.
    • M. J. F. Gales and P. Woodland, "Variance Compensation Within the MLLR Framework," Eng. Dept., Univ. Cambridge, Cambridge, U.K., Tech. Rep. CUED/F-INFENT/TR242, 1996.
    • (1996) Tech. Rep. , vol.CUED-F-INFENT-TR242
    • Gales, M.J.F.1    Woodland, P.2
  • 17
    • 2442597230 scopus 로고    scopus 로고
    • The JHU march 2001 Hub-5 conversational speech transcription system
    • W. Byrne, "The JHU march 2001 Hub-5 conversational speech transcription system," presented at the NIST LVCSR Workshop, 2001.
    • (2001) NIST LVCSR Workshop
    • Byrne, W.1
  • 18
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • Apr.
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. America, vol. 87, no. 4, pp. 1738-1752, Apr. 1990.
    • (1990) J. Acoust. Soc. America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 19
    • 85135259135 scopus 로고    scopus 로고
    • Integrated context-dependent networks in very large vocabulary speech recognition
    • M. Mohri and M. Riley, "Integrated context-dependent networks in very large vocabulary speech recognition," in Proc. Eur. Conf. Speech Commun. Technol., 1999, pp. 811-814.
    • (1999) Proc. Eur. Conf. Speech Commun. Technol. , pp. 811-814
    • Mohri, M.1    Riley, M.2
  • 21
    • 18744376446 scopus 로고    scopus 로고
    • The 2000 NIST evaluation for recognition of conversational speech over the telephone
    • A. Martin, M. Przybocki, J. Fiscus, and D. Pallett, "The 2000 NIST evaluation for recognition of conversational speech over the telephone," presented at the Speech Transcription Workshop, 2000.
    • (2000) Speech Transcription Workshop
    • Martin, A.1    Przybocki, M.2    Fiscus, J.3    Pallett, D.4
  • 22
    • 18744389914 scopus 로고    scopus 로고
    • The evaluation: Word error rates and confidence analysis
    • Linthicum Heights, MD, [Online]
    • A. Martin, J. Fiscus, M. Przybocki, and B. Fisher, "The evaluation: Word error rates and confidence analysis," presented at the Hub-5 Workshop, Linthicum Heights, MD, 1998. [Online]. Available: http://www.nist.gov/speech/ tests/ctr/hub5e_98/hub5e_98.htm.
    • (1998) Hub-5 Workshop
    • Martin, A.1    Fiscus, J.2    Przybocki, M.3    Fisher, B.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.