메뉴 건너뛰기




Volumn 13, Issue 4, 2005, Pages 554-563

Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation

Author keywords

Discriminative acoustic model; Eigenspace mapping; Hidden markov models; Rapid speaker adaptation; Speech recognition

Indexed keywords

ADAPTIVE ALGORITHMS; CORRELATION METHODS; EIGENVALUES AND EIGENFUNCTIONS; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH RECOGNITION;

EID: 22544443963     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.845808     Document Type: Article
Times cited : (21)

References (33)
  • 1
    • 0031177213 scopus 로고    scopus 로고
    • Combined bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
    • S. M. Ahadi and P. C. Woodland, "Combined bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 11, pp. 187-206, 1997.
    • (1997) Comput. Speech Lang , vol.11 , pp. 187-206
    • Ahadi, S.M.1    Woodland, P.C.2
  • 2
    • 11844281179 scopus 로고    scopus 로고
    • Within-utterance correlation for speech recognition
    • Budapest, Hungary
    • M. Blomberg, "Within-utterance correlation for speech recognition," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 2479-2482.
    • (1999) Proc. Eurospeech , pp. 2479-2482
    • Blomberg, M.1
  • 3
    • 84871620008 scopus 로고    scopus 로고
    • Discounted likelihood linear regression for rapid speaker adaptation
    • Budapest, Hungary
    • W. Byrne and A. Gunawardana, "Discounted likelihood linear regression for rapid speaker adaptation," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 203-206.
    • (1999) Proc. Eurospeech , pp. 203-206
    • Byrne, W.1    Gunawardana, A.2
  • 4
    • 0002595416 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the Bayesian information criterion
    • Feb.
    • S. Chen and P. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in Proc. Broadcast News Transcription Understanding Workshop, Feb. 1998, pp. 127-132.
    • (1998) Proc. Broadcast News Transcription Understanding Workshop , pp. 127-132
    • Chen, S.1    Gopalakrishnan, P.2
  • 5
    • 85135272864 scopus 로고    scopus 로고
    • Maximum a posterior linear regression for hidden Markov model adaptation
    • Budapest, Hungary
    • C. Chesta, O. Siohan, and C. H. Lee, "Maximum a posterior linear regression for hidden Markov model adaptation," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 203-206.
    • (1999) Proc. Eurospeech , pp. 203-206
    • Chesta, C.1    Siohan, O.2    Lee, C.H.3
  • 6
    • 84874875877 scopus 로고    scopus 로고
    • Maximum a posterior linear regression with elliptically symmetric matrix priors
    • Budapest, Hungary
    • W. Chou, "Maximum a posterior linear regression with elliptically symmetric matrix priors," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 1-4.
    • (1999) Proc. Eurospeech , pp. 1-4
    • Chou, W.1
  • 7
    • 0002629270 scopus 로고
    • Maximum likelihood estimation from incomplete data via the em algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood estimation from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. B39, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.B39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 9
    • 0022667694 scopus 로고
    • Speaker-independent isolated word recognition using dynamic features of speech spectrum
    • S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust., Speech, Signal Process., vol. 34, no. 1, pp. 52-59, 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , Issue.1 , pp. 52-59
    • Furui, S.1
  • 10
  • 11
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr.
    • J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 13
    • 84892187452 scopus 로고    scopus 로고
    • Maximum likelihood modeling with Gaussian distributions for classification
    • Seattle, WA
    • R. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. ICASSP, Seattle, WA, 1998.
    • (1998) Proc. ICASSP
    • Gopinath, R.1
  • 16
    • 0141588436 scopus 로고    scopus 로고
    • M.S. thesis, Univ. Cambridge, Cambridge, U.K.
    • S. Johnson, "Speaker Tracking," M.S. thesis, Univ. Cambridge, Cambridge, U.K., 1997.
    • (1997) Speaker Tracking
    • Johnson, S.1
  • 18
    • 0034857758 scopus 로고    scopus 로고
    • Very fast adaptation with a compact context-dependent eigenvoice model
    • Salt Lake City, UT, May
    • R. Kuhn, F. Perronnin, P. Nguyen, J.-C. Junqua, and L. Rigazio, "Very fast adaptation with a compact context-dependent eigenvoice model," in Proc. ICASSP, Salt Lake City, UT, May 2001.
    • (2001) Proc. ICASSP
    • Kuhn, R.1    Perronnin, F.2    Nguyen, P.3    Junqua, J.-C.4    Rigazio, L.5
  • 20
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 21
    • 85135155427 scopus 로고
    • A comparative study of speaker adaptation techniques
    • L. R. Neumeyer, A. Sankar, and V. V. Digalakis, "A comparative study of speaker adaptation techniques," in Proc. Eurospeech, 1995, pp. 1127-1130.
    • (1995) Proc. Eurospeech , pp. 1127-1130
    • Neumeyer, L.R.1    Sankar, A.2    Digalakis, V.V.3
  • 24
    • 0033677121 scopus 로고    scopus 로고
    • Maximum likelihood discriminant feature spaces
    • Istanbul, Turkey, Jun.
    • G. Saon, M. Padmanabhan, R. Gopinath, and S. Chen, "Maximum likelihood discriminant feature spaces," in Proc. ICASSP, Istanbul, Turkey, Jun. 2000.
    • (2000) Proc. ICASSP
    • Saon, G.1    Padmanabhan, M.2    Gopinath, R.3    Chen, S.4
  • 26
    • 0036461005 scopus 로고    scopus 로고
    • Structural maximum a posteriori linear regression for fast HMM adaptation
    • Jan.
    • O. Siohan, T. A. Myrvoll, and C. H. Lee, "Structural maximum a posteriori linear regression for fast HMM adaptation," Comput. Speech Lang., vol. 16, no. 1, pp. 5-24, Jan. 2002.
    • (2002) Comput. Speech Lang , vol.16 , Issue.1 , pp. 5-24
    • Siohan, O.1    Myrvoll, T.A.2    Lee, C.H.3
  • 28
    • 21444449963 scopus 로고    scopus 로고
    • Rapid speaker adaptation using MLLR and subspace regression classes
    • Aalborg, Denmark, Sep.
    • K. Wong and B. Mak, "Rapid speaker adaptation using MLLR and subspace regression classes," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001.
    • (2001) Proc. Eurospeech
    • Wong, K.1    Mak, B.2
  • 30
    • 85009084294 scopus 로고    scopus 로고
    • A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping
    • Aalborg, Denmark, Sep.
    • B. Zhou and J. H. L. Hansen, "A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping," in Proc. Eurospeech, vol. 2, Aalborg, Denmark, Sep. 2001, pp. 1215-1218.
    • (2001) Proc. Eurospeech , vol.2 , pp. 1215-1218
    • Zhou, B.1    Hansen, J.H.L.2
  • 31
    • 85009288404 scopus 로고    scopus 로고
    • Improved structural maximum likelihood eigenspace mapping for speaker adaptation
    • Denver, CO
    • _, "Improved structural maximum likelihood eigenspace mapping for speaker adaptation," in Proc. ICSLP'2002, Denver, CO, 2002, pp. 1433-14367.
    • (2002) Proc. ICSLP'2002 , pp. 1433-14367
  • 32
    • 85009275098 scopus 로고    scopus 로고
    • SpeechFind: An experimental on-line spoken document retrieval system for historical audio archives
    • Denver, CO
    • _, "SpeechFind: An experimental on-line spoken document retrieval system for historical audio archives," in Proc. ICSLP'2002, vol. 3, Denver, CO, 2002, pp. 1969-1972.
    • (2002) Proc. ICSLP'2002 , vol.3 , pp. 1969-1972
  • 33
    • 85009089453 scopus 로고    scopus 로고
    • Unsupervised audio stream segmentation and clustering via the Bayesian information criterion
    • Beijing, China, Oct.
    • _, "Unsupervised audio stream segmentation and clustering via the Bayesian information criterion," in Proc. ICSLP'2000, Beijing, China, Oct. 2000, pp. 714-717.
    • (2000) Proc. ICSLP'2000 , pp. 714-717


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.