Volume 14, Issue 4, 2006, Pages 1267-1279

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting

Author keywords

Composite kernels; Eigenvoice speaker adaptation; Kernel eigenvoice speaker adaptation; Kernel principal component analysis (PCA); Pre-image problem; Reference speaker weighting

Indexed keywords

COMPOSITE KERNELS; EIGENVOICE SPEAKER ADAPTATION; KERNEL EIGENVOICE SPEAKER ADAPTATION; KERNEL PRINCIPAL COMPONENT ANALYSIS (PCA); PRE-IMAGE PROBLEMS; REFERENCE SPEAKER WEIGHTING; REFERENCE SPEAKER WEIGHTING (RSW);

EID: 34047246852     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.860836     Document Type: Article
Times cited: 19

References (35)
  • 1
    • 0029735634
    • Speaker-independent speech recognition based on tree-structured speaker clustering
    • T. Kosaka, S. Matsunaga, and S. Sagayama, "Speaker-independent speech recognition based on tree-structured speaker clustering," J. Comput. Speech Lang., vol. 10, pp. 55-74, 1996.
    • (1996) J. Comput. Speech Lang , vol.10 , pp. 55-74
    • Kosaka, T.1    Matsunaga, S.2    Sagayama, S.3
  • 2
    • 0009625231
    • A comparison of novel techniques for rapid speaker adaptation
    • May
    • T. J. Hazen, "A comparison of novel techniques for rapid speaker adaptation," Speech Commun., vol. 31, pp. 15-33, May 2000.
    • (2000) Speech Commun , vol.31 , pp. 15-33
    • Hazen, T.J.1
  • 3
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr
    • J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 4
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," J. Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
    • (1995) J. Comput. Speech Lang , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 8
    • 85009080436
    • Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices
    • H. Botterweck, "Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices," in Proc. Int. Conf. Spoken Language Processing, vol. 4, 2000, pp. 354-357.
    • (2000) Proc. Int. Conf. Spoken Language Processing , vol.4 , pp. 354-357
    • Botterweck, H.1
  • 9
    • 85009097035
    • Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
    • K. T. Chen, W. W. Liau, H. M. Wang, and L. S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Language Processing, vol. 3, 2000, pp. 742-745.
    • (2000) Proc. Int. Conf. Spoken Language Processing , vol.3 , pp. 742-745
    • Chen, K.T.1    Liau, W.W.2    Wang, H.M.3    Lee, L.S.4
  • 11
    • 85009106031
    • Bayesian speaker adaptation based on probabilistic principal component analysis
    • D. K. Kim and N. S. Kim, "Bayesian speaker adaptation based on probabilistic principal component analysis," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 734-737.
    • (2000) Proc. Int. Conf. Spoken Language Processing , pp. 734-737
    • Kim, D.K.1    Kim, N.S.2
  • 13
    • 0034842307
    • Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition
    • H. Botterweck, "Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 1, 2001, pp. 353-356.
    • (2001) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 353-356
    • Botterweck, H.1
  • 14
    • 85135280100
    • Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments
    • P. Nguyen and C. Wellekens, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments," in Proc. Eur. Conf. Speech Communication and Technology, 1999, pp. 2519-2522.
    • (1999) Proc. Eur. Conf. Speech Communication and Technology , pp. 2519-2522
    • Nguyen, P.1    Wellekens, C.2
  • 15
    • 0034227757
    • Cluster adaptive training of hidden Markov models
    • Jul
    • M. F. J. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.F.J.1
  • 19
    • 0347243182
    • Nonlinear component analysis as a kernel eigenvalue problem
    • B. Schölkopf, A. Smola, and K. R. Müller, "Nonlinear component analysis as a kernel eigenvalue problem," Neural Comput., vol. 10, pp. 1299-1319, 1998.
    • (1998) Neural Comput , vol.10 , pp. 1299-1319
    • Schölkopf, B.1    Smola, A.2    Müller, K.R.3
  • 21
    • 0011812771
    • Kernel independent component analysis
    • F. R. Bach and M. I. Jordan, "Kernel independent component analysis," J. Mach. Learn. Res., vol. 3, pp. 1-48, 2002.
    • (2002) J. Mach. Learn. Res , vol.3 , pp. 1-48
    • Bach, F.R.1    Jordan, M.I.2
  • 22
    • 27644511614
    • Kernel eigenvoice speaker adaptation
    • Sep
    • B. Mak, J. T. Kwok, and S. Ho, "Kernel eigenvoice speaker adaptation," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 984-992, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 984-992
    • Mak, B.1    Kwok, J.T.2    Ho, S.3
  • 24
    • 9244258603
    • The pre-image problem in kernel methods
    • Nov
    • J. T. Kwok and I. W. Tsang, "The pre-image problem in kernel methods," IEEE Trans. Neural Netw., vol. 15, no. 6, pp. 1517-1525, Nov. 2004.
    • (2004) IEEE Trans. Neural Netw , vol.15 , Issue.6 , pp. 1517-1525
    • Kwok, J.T.1    Tsang, I.W.2
  • 26
    • 4544261737
    • A study of various composite kernels for kernel eigenvoice speaker adaptation
    • Montreal, QC, Canada, May
    • B. Mak, J. T. Kwok, and S. Ho, "A study of various composite kernels for kernel eigenvoice speaker adaptation," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., vol. I, Montreal, QC, Canada, May 2004, pp. 325-328.
    • (2004) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process , vol.1 , pp. 325-328
    • Mak, B.1    Kwok, J.T.2    Ho, S.3
  • 27
    • 34047258966
    • Eigenvoice speaker adaptation via composite kernel PCA
    • S. Thrun, L. Saul, and B. Schölkopf, Eds. Cambridge, MA: MIT Press
    • J. T. Kwok, B. Mak, and S. Ho, "Eigenvoice speaker adaptation via composite kernel PCA," in Advances in Neural Information Processing Systems 16, S. Thrun, L. Saul, and B. Schölkopf, Eds. Cambridge, MA: MIT Press, 2004.
    • (2004) Advances in Neural Information Processing Systems 16
    • Kwok, J.T.1    Mak, B.2    Ho, S.3
  • 28
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 29
    • 85009124446
    • Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA
    • Jeju Island, South Korea, Oct. 14-18
    • B. Mak, S. Ho, and J. T. Kwok, "Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA," in Proc. Int. Conf. Spoken Language Processing, vol. IV, Jeju Island, South Korea, Oct. 14-18, 2004, pp. 2913-2916.
    • (2004) Proc. Int. Conf. Spoken Language Processing , vol.4 , pp. 2913-2916
    • Mak, B.1    Ho, S.2    Kwok, J.T.3
  • 30
    • 33646794428
    • Various reference speakers determination methods for embedded kernel eigenvoice speaker adaptation
    • Philadelphia, PA, Mar. 18-23
    • B. Mak and S. Ho, "Various reference speakers determination methods for embedded kernel eigenvoice speaker adaptation," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 1, Philadelphia, PA, Mar. 18-23, 2005, pp. 981-984.
    • (2005) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 981-984
    • Mak, B.1    Ho, S.2
  • 33
    • 34047258241
    • N. Parihar and J. Picone. (2002) DSR Front End LVCSR Evaluation. AU/384/02, Aurora Working Group. [Online]. Available: http://www.isip.msstate.edu/projects/aurora.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.