SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 4, 2006, Pages 1267-1279

Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting

(4) Mak, Brian Kan Wing a,b Hsiao, Roger Wend Huu a,b Ho, Simon Ka Lung b Kwok, James T a,b

b HONG KONG UNIVERSITY OF SCIENCE AND TECHNOLOGY (Hong Kong)

Author keywords

Composite kernels; Eigenvoice speaker adaptation; Kernel eigenvoice speaker adaptation; Kernel principal component analysis (PCA); Pre image problem; Reference speaker weighting

Indexed keywords

COMPOSITE KERNELS; EIGENVOICE SPEAKER ADAPTATION; KERNEL EIGENVOICE SPEAKER ADAPTATION; KERNEL PRINCIPAL COMPONENT ANALYSIS (PCA); PRE-IMAGE PROBLEMS; REFERENCE SPEAKER WEIGHTING; REFERENCE SPEAKER WEIGHTING (RSW);

EIGENVALUES AND EIGENFUNCTIONS; EMBEDDED SYSTEMS; PRINCIPAL COMPONENT ANALYSIS; PROBLEM SOLVING;

SPEECH RECOGNITION;

EID: 34047246852 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.860836 Document Type: Article

Times cited : (19)

References (35)

1
- 0029735634
- Speaker-independent speech recognition based on tree-structured speaker clustering
- T. Kosaka, S. Matsunaga, and S. Sagayama, "Speaker-independent speech recognition based on tree-structured speaker clustering," J. Comput. Speech Lang., vol. 10, pp. 55-74, 1996.
- (1996) J. Comput. Speech Lang , vol.10 , pp. 55-74
- Kosaka, T.¹ Matsunaga, S.² Sagayama, S.³

2
- 0009625231
- A comparison of novel techniques for rapid speaker adaptation
- May
- T. J. Hazen, "A comparison of novel techniques for rapid speaker adaptation," Speech Commun., vol. 31, pp. 15-33, May 2000.
- (2000) Speech Commun , vol.31 , pp. 15-33
- Hazen, T.J.¹

3
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.H.²

4
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," J. Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
- (1995) J. Comput. Speech Lang , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

5
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- Nov
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space," IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 695-707, Nov. 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

6
- 0026384289
- Face recognition using eigenfaces
- M. Turk and A. Pentland, "Face recognition using eigenfaces," in Proc. Int. Conf. Computer Vision and Pattern Recognition, 1991, pp. 586-591.
- (1991) Proc. Int. Conf. Computer Vision and Pattern Recognition , pp. 586-591
- Turk, M.¹ Pentland, A.²

7
- 0034857758
- Very fast adaptation with a compact context-dependent eigenvoice model
- May
- R. Kuhn, F. Perronnin, P. Nguyen, J. C. Junqua, and L. Rigazio, "Very fast adaptation with a compact context-dependent eigenvoice model," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, May 2001, pp. 373-376.
- (2001) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 373-376
- Kuhn, R.¹ Perronnin, F.² Nguyen, P.³ Junqua, J.C.⁴ Rigazio, L.⁵

8
- 85009080436
- Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices
- H. Botterweck, "Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices," in Proc. Int. Conf. Spoken Language Processing, vol. 4, 2000, pp. 354-357.
- (2000) Proc. Int. Conf. Spoken Language Processing , vol.4 , pp. 354-357
- Botterweck, H.¹

9
- 85009097035
- Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
- K. T. Chen, W. W. Liau, H. M. Wang, and L. S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Language Processing, vol. 3, 2000, pp. 742-745.
- (2000) Proc. Int. Conf. Spoken Language Processing , vol.3 , pp. 742-745
- Chen, K.T.¹ Liau, W.W.² Wang, H.M.³ Lee, L.S.⁴

10
- 0034843060
- Rapid speaker adaptation using a priori knowledge by eigenspace analysis of MLLR parameters
- N. Wang, S. Lee, F. Seide, and L. S. Lee, "Rapid speaker adaptation using a priori knowledge by eigenspace analysis of MLLR parameters," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Process., 2001, pp. 345-348.
- (2001) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Process , pp. 345-348
- Wang, N.¹ Lee, S.² Seide, F.³ Lee, L.S.⁴

11
- 85009106031
- Bayesian speaker adaptation based on probabilistic principal component analysis
- D. K. Kim and N. S. Kim, "Bayesian speaker adaptation based on probabilistic principal component analysis," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 734-737.
- (2000) Proc. Int. Conf. Spoken Language Processing , pp. 734-737
- Kim, D.K.¹ Kim, N.S.²

12
- 0034841728
- EMAP-based speaker adaptation with robust correlation estimation
- E. Jon, D. K. Kim, and N. S. Kim, "EMAP-based speaker adaptation with robust correlation estimation," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 2001, pp. 321-324.
- (2001) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , pp. 321-324
- Jon, E.¹ Kim, D.K.² Kim, N.S.³

13
- 0034842307
- Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition
- H. Botterweck, "Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 1, 2001, pp. 353-356.
- (2001) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 353-356
- Botterweck, H.¹

14
- 85135280100
- Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments
- P. Nguyen and C. Wellekens, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments," in Proc. Eur. Conf. Speech Communication and Technology, 1999, pp. 2519-2522.
- (1999) Proc. Eur. Conf. Speech Communication and Technology , pp. 2519-2522
- Nguyen, P.¹ Wellekens, C.²

15
- 0034227757
- Cluster adaptive training of hidden Markov models
- Jul
- M. F. J. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, Jul. 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.4 , pp. 417-428
- Gales, M.F.J.¹

16
- 0003991806
- New York: Wiley
- V. Vapnik, Statistical Learning Theory. New York: Wiley, 1998.
- (1998) Statistical Learning Theory
- Vapnik, V.¹

17
- 0003798635
- Cambridge, U.K, Cambridge Univ. Press
- N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines. Cambridge, U.K.: Cambridge Univ. Press, 2000.
- (2000) An Introduction to Support Vector Machines
- Cristianini, N.¹ Shawe-Taylor, J.²

18
- 0004094721
- Cambridge, MA: MIT Press
- B. Schölkopf and A. J. Smola, Learning with Kernels. Cambridge, MA: MIT Press, 2002.
- (2002) Learning with Kernels
- Schölkopf, B.¹ Smola, A.J.²

19
- 0347243182
- Nonlinear component analysis as a kernel eigenvalue problem
- B. Schölkopf, A. Smola, and K. R. Müller, "Nonlinear component analysis as a kernel eigenvalue problem," Neural Comput., vol. 10, pp. 1299-1319, 1998.
- (1998) Neural Comput , vol.10 , pp. 1299-1319
- Schölkopf, B.¹ Smola, A.² Müller, K.R.³

20
- 0001089823
- Support vector clustering
- A. Ben-Hur, D. Horn, H. T. Siegelmann, and V. Vapnik, "Support vector clustering," J. Mach. Learn. Res., vol. 2, pp. 125-137, 2001.
- (2001) J. Mach. Learn. Res , vol.2 , pp. 125-137
- Ben-Hur, A.¹ Horn, D.² Siegelmann, H.T.³ Vapnik, V.⁴

21
- 0011812771
- Kernel independent component analysis
- F. R. Bach and M. I. Jordan, "Kernel independent component analysis," J. Mach. Learn. Res., vol. 3, pp. 1-48, 2002.
- (2002) J. Mach. Learn. Res , vol.3 , pp. 1-48
- Bach, F.R.¹ Jordan, M.I.²

22
- 27644511614
- Kernel eigenvoice speaker adaptation
- Sep
- B. Mak, J. T. Kwok, and S. Ho, "Kernel eigenvoice speaker adaptation," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 984-992, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 984-992
- Mak, B.¹ Kwok, J.T.² Ho, S.³

23
- 0000156598
- Kernel PCA and de-noising in feature spaces
- M. S. Kearns, S. A. Solla, and D. A. Cohn, Eds. San Mateo, CA: Morgan Kaufmann
- S. Mika, B. Schölkopf, A. Smola, K. R. Müller, M. Scholz, and G. Rätsch, "Kernel PCA and de-noising in feature spaces," in Advances in Neural Information Processing Systems II, M. S. Kearns, S. A. Solla, and D. A. Cohn, Eds. San Mateo, CA: Morgan Kaufmann, 1998.
- (1998) Advances in Neural Information Processing Systems II
- Mika, S.¹ Schölkopf, B.² Smola, A.³ Müller, K.R.⁴ Scholz, M.⁵ Rätsch, G.⁶

24
- 9244258603
- The pre-imace problem in kernel methods
- Nov
- J. T. Kwok and I. W. Tsang, "The pre-imace problem in kernel methods," IEEE Trans. Neural Netw., vol. 15, no. 6, pp. 1517-1525, Nov. 2004.
- (2004) IEEE Trans. Neural Netw , vol.15 , Issue.6 , pp. 1517-1525
- Kwok, J.T.¹ Tsang, I.W.²

25
- 84898987558
- Learning to find pre-images
- S. Thrun, L. Saul, and B. Schölkopf, Eds. Cambridge, MA: MIT Press
- G. H. Bakir, J. Weston, and B. Schölkopf, "Learning to find pre-images," in Advances in Neural Information Processing Systems 16, S. Thrun, L. Saul, and B. Schölkopf, Eds. Cambridge, MA: MIT Press, 2004.
- (2004) Advances in Neural Information Processing Systems 16
- Bakir, G.H.¹ Weston, J.² Schölkopf, B.³

26
- 4544261737
- A study of various composite kernels for kernel eigenvoice speaker adaptation
- Montreal, QC, Canada, May
- B. Mak, J. T. Kwok, and S. Ho, "A study of various composite kernels for kernel eigenvoice speaker adaptation," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process., vol. I, Montreal, QC, Canada, May 2004, pp. 325-328.
- (2004) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process , vol.1 , pp. 325-328
- Mak, B.¹ Kwok, J.T.² Ho, S.³

27
- 34047258966
- Eigenvoice speaker adaptation via composite kernel PCA
- S. Thrun, L. Saul, and B. Schölkopf, Eds. Cambridge, MA: MIT Press
- J. T. Kwok, B. Mak, and S. Ho, "Eigenvoice speaker adaptation via composite kernel PCA," in Advances in Neural Information Processing Systems 16, S. Thrun, L. Saul, and B. Schölkopf, Eds. Cambridge, MA: MIT Press, 2004.
- (2004) Advances in Neural Information Processing Systems 16
- Kwok, J.T.¹ Mak, B.² Ho, S.³

28
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc. B, vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

29
- 85009124446
- Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA
- Jeju Island, South Korea, Oct. 14-18
- B. Mak, S. Ho, and J. T. Kwok, "Speedup of kernel eigenvoice speaker adaptation by embedded kernel PCA," in Proc. Int. Conf. Spoken Language Processing, vol. IV, Jeju Island, South Korea, Oct. 14-18, 2004, pp. 2913-2916.
- (2004) Proc. Int. Conf. Spoken Language Processing , vol.4 , pp. 2913-2916
- Mak, B.¹ Ho, S.² Kwok, J.T.³

30
- 33646794428
- Various reference speakers determination methods for embedded kernel eigenvoice speaker adaptation
- Philadelphia, PA, Mar. 18-23
- B. Mak and S. Ho, "Various reference speakers determination methods for embedded kernel eigenvoice speaker adaptation," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 1, Philadelphia, PA, Mar. 18-23, 2005, pp. 981-984.
- (2005) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 981-984
- Mak, B.¹ Ho, S.²

31
- 0021226391
- A database for speaker-independent digit recognition
- R. G. Leonard, "A database for speaker-independent digit recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 3, 1984, pp. 4211-4214.
- (1984) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.3 , pp. 4211-4214
- Leonard, R.G.¹

32
- 85079095310
- The design of the wall street journal-based CSR corpus
- Feb
- D. B. Paul and J. M. Baker, "The design of the wall street journal-based CSR corpus," in Proc. DARPA Speech and Natural Language Workshop, Feb. 1992.
- (1992) Proc. DARPA Speech and Natural Language Workshop
- Paul, D.B.¹ Baker, J.M.²

33
- 34047258241
- N. Parihar and J. Picone. (2002) DSR Front End LVCSR Evaluation. AU/384/02, Aurora Working Group. [Online]. Available: http://www.isip.msstate. edu/projecls/aurora.
- N. Parihar and J. Picone. (2002) DSR Front End LVCSR Evaluation. AU/384/02, Aurora Working Group. [Online]. Available: http://www.isip.msstate. edu/projecls/aurora.

34
- 0347548264
- Berlin, Germany: Springer-Verlag
- J. F. Bonnans, J. C. Gilbert, C. Lemaréchal, and C. A. Sagastiádbal, Numerical Optimization: Theoretical and Practical Aspects. Berlin, Germany: Springer-Verlag, 2003.
- (2003) Numerical Optimization: Theoretical and Practical Aspects
- Bonnans, J.F.¹ Gilbert, J.C.² Lemaréchal, C.³ Sagastiádbal, C.A.⁴

35
- 0023776398
- The DARPA 1000-word resource management database for continuous speech recognition
- P. Price, W. M. Fisher, J. Bernstein, and D. S. Pallett, "The DARPA 1000-word resource management database for continuous speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. 1, 1988, pp. 651-654.
- (1988) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 651-654
- Price, P.¹ Fisher, W.M.² Bernstein, J.³ Pallett, D.S.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.