메뉴 건너뛰기




Volumn 17, Issue 7, 2009, Pages 1372-1381

Maximum penalized likelihood Kernel regression for fast adaptation

Author keywords

Kernel regression; Maximum likelihood linear regression (MLLR); Reference speaker weighting; Speaker adaptation

Indexed keywords

ADAPTATION ALGORITHMS; ADAPTATION METHODS; AFFINE MODEL; BAYESIAN PERSPECTIVE; EIGENVOICES; FAST ADAPTATIONS; GAUSSIANS; HIGH-DIMENSIONAL FEATURE SPACE; KERNEL METHODS; KERNEL REGRESSION; MAXIMUM A POSTERIORI; MAXIMUM PENALIZED LIKELIHOOD; MAXIMUM-LIKELIHOOD LINEAR REGRESSION (MLLR); NONLINEAR GENERALIZATIONS; REFERENCE SPEAKER WEIGHTING; RESOURCE MANAGEMENT; SPEAKER ADAPTATION; SYSTEM OF LINEAR EQUATIONS; UNSUPERVISED SPEAKER ADAPTATION; WALL STREET JOURNAL; WORD ERROR RATE REDUCTIONS;

EID: 68549133255     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2019920     Document Type: Article
Times cited : (14)

References (30)
  • 2
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMMbased speech recognition
    • Apr
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMMbased speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, Apr. 1998.
    • (1998) Comput. Speech Lang , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 3
    • 0029375590 scopus 로고
    • Speaker adaptation using constrained estimation of Gaussian mixtures
    • Sep
    • V. V. Digalakis, D. Rtischev, and L. G. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.5 , pp. 357-366
    • Digalakis, V.V.1    Rtischev, D.2    Neumeyer, L.G.3
  • 4
    • 0029735634 scopus 로고    scopus 로고
    • Speaker-independent speech recognition based on tree-structured speaker clustering
    • T. Kosaka, S. Matsunaga, and S. Sagayama, "Speaker-independent speech recognition based on tree-structured speaker clustering," Comput. Speech Lang., vol. 10, pp. 55-74, 1996.
    • (1996) Comput. Speech Lang , vol.10 , pp. 55-74
    • Kosaka, T.1    Matsunaga, S.2    Sagayama, S.3
  • 5
    • 0034227757 scopus 로고    scopus 로고
    • Cluster adaptive training of hidden Markov models
    • Jul
    • M. F. J. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.F.J.1
  • 6
    • 0009625231 scopus 로고    scopus 로고
    • A comparison of novel techniques for rapid speaker adaptation
    • May
    • T. J. Hazen, "A comparison of novel techniques for rapid speaker adaptation," Speech Commun., vol. 31, pp. 15-33, May 2000.
    • (2000) Speech Commun , vol.31 , pp. 15-33
    • Hazen, T.J.1
  • 8
    • 85009097035 scopus 로고    scopus 로고
    • Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
    • K. T. Chen, W. W. Liau, H. M. Wang, and L. S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Lang. Process., 2000, vol. 3, pp. 742-745.
    • (2000) Proc. Int. Conf. Spoken Lang. Process , vol.3 , pp. 742-745
    • Chen, K.T.1    Liau, W.W.2    Wang, H.M.3    Lee, L.S.4
  • 9
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr
    • J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 10
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) Comput. Speech Lang , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 11
    • 33947681802 scopus 로고    scopus 로고
    • Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
    • Toulouse, France, May 14-19
    • B. Mak, T.-C. Lai, and R. Hsiao, "Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, May 14-19, 2006, vol. 1, pp. 229-232.
    • (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 229-232
    • Mak, B.1    Lai, T.-C.2    Hsiao, R.3
  • 12
    • 85135272864 scopus 로고    scopus 로고
    • Maximum a posteriori linear regression for hidden Markov model adaptation
    • C. Chesta, O. Siohan, and C. H. Lee, "Maximum a posteriori linear regression for hidden Markov model adaptation," in Proc. Eur. Conf. Speech Commun. Technol., 1999, vol. 1, pp. 211-214.
    • (1999) Proc. Eur. Conf. Speech Commun. Technol , vol.1 , pp. 211-214
    • Chesta, C.1    Siohan, O.2    Lee, C.H.3
  • 13
    • 0036461005 scopus 로고    scopus 로고
    • Structural maximum a posteriori linear regression for fast HMM adaptation
    • Jan
    • O. Siohan, T. A. Myrvoll, and C. H. Lee, "Structural maximum a posteriori linear regression for fast HMM adaptation," Comput. Speech Lang., vol. 16, pp. 5-24, Jan. 2002.
    • (2002) Comput. Speech Lang , vol.16 , pp. 5-24
    • Siohan, O.1    Myrvoll, T.A.2    Lee, C.H.3
  • 16
    • 0035054375 scopus 로고    scopus 로고
    • Discounted likelihood linear regression for rapid speaker adaptation
    • Jan
    • A. Gunawardana and W. Byrne, "Discounted likelihood linear regression for rapid speaker adaptation," Comput. Speech Lang., vol. 15, pp. 15-38, Jan. 2001.
    • (2001) Comput. Speech Lang , vol.15 , pp. 15-38
    • Gunawardana, A.1    Byrne, W.2
  • 17
    • 33947674762 scopus 로고    scopus 로고
    • Fast speaker adaptation via maximum penalized likelihood kernel regression
    • Toulouse, France, May 14-19
    • I. W. Tsang, J. T. Kwok, B. Mak, K. Zhang, and J. J. Pan, "Fast speaker adaptation via maximum penalized likelihood kernel regression," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, May 14-19, 2006, vol. 1, pp. 997-1000.
    • (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 997-1000
    • Tsang, I.W.1    Kwok, J.T.2    Mak, B.3    Zhang, K.4    Pan, J.J.5
  • 21
    • 33947673495 scopus 로고    scopus 로고
    • A non-linear speaker adaptation technique using kernel ridge regression
    • G. Saon, "A non-linear speaker adaptation technique using kernel ridge regression," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2006, vol. I, pp. 225-228.
    • (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 225-228
    • Saon, G.1
  • 22
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • Apr
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, Apr. 1998.
    • (1998) Comput. Speech Lang , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 23
    • 30244545086 scopus 로고
    • Training and Speaker Adaptation in Template-Based Speech Recognition,
    • Ph.D. dissertation, Cambridge Univ, Cambridge, U.K
    • A. J. Hewett, "Training and Speaker Adaptation in Template-Based Speech Recognition," Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 1989.
    • (1989)
    • Hewett, A.J.1
  • 26
    • 27644511614 scopus 로고    scopus 로고
    • Kernel eigenvoice speaker adaptation
    • Sep
    • B. Mak, J. T. Kwok, and S.Ho, "Kernel eigenvoice speaker adaptation," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 984-992, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 984-992
    • Mak, B.1    Kwok, J.T.2    Ho, S.3
  • 27
    • 34047246852 scopus 로고    scopus 로고
    • B. Mak, R. Hsiao,S. Ho, andJ.T. Kwok, Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting, IEEE Trans. Speech Audio Process., 14, no. 4, pp. 1267-1280, Jul. 2006.
    • B. Mak, R. Hsiao,S. Ho, andJ.T. Kwok, "Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting," IEEE Trans. Speech Audio Process., vol. 14, no. 4, pp. 1267-1280, Jul. 2006.
  • 28
    • 56149122221 scopus 로고    scopus 로고
    • Kernel eigenspace-based MLLR adaptation
    • Mar
    • B. Mak and R. Hsiao, "Kernel eigenspace-based MLLR adaptation," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 784-795, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 784-795
    • Mak, B.1    Hsiao, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.