SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 16, Issue 3, 2008, Pages 607-616

Rapid speaker adaptation using clustered maximum-likelihood linear basis with sparse training data

(2) Tang, Yun a Rose, Richard a

a MCGILL UNIVERSITY (Canada)

Author keywords

Cluster adaptive training; Eigenvoices; Parameter tying; Speaker adaptation; Speech recognition

Indexed keywords

ADAPTATION METHODS; AUTOMATIC SPEECH RECOGNITION; BASIS VECTORS; CLUSTER ADAPTIVE TRAINING; COMPUTER MEMORIES; EIGENVOICES; GENERAL CLASS; LINEAR COMBINATIONS; LOW COMPLEXITY; MAXIMUM-LIKELIHOOD; MAXIMUM-LIKELIHOOD ESTIMATIONS; PARAMETER TYING; PERFORMANCE IMPROVEMENTS; RAPID SPEAKER ADAPTATIONS; RESOURCE MANAGEMENTS; SPACE-BASED; SPEAKER ADAPTATION; STORAGE REQUIREMENTS; TASK DOMAINS; TRAINING DATUM; WALL STREET JOURNALS; WORD ERROR RATE REDUCTIONS; WORD ERROR RATES;

MAXIMUM LIKELIHOOD ESTIMATION; RESOURCE ALLOCATION; SPEECH ANALYSIS;

SPEECH RECOGNITION;

EID: 64949158419 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2008.916530 Document Type: Article

Times cited : (10)

References (29)

1
- 0346528936
- Speaker adaptation for continuous density HMMs: A review
- Sophia Antipolis, France
- P. C. Woodland, "Speaker adaptation for continuous density HMMs: A review," in Proc. ITRW Adaptation Methods for Speech Recognition, Sophia Antipolis, France, 2001, pp. 11-19.
- (2001) Proc. ITRW Adaptation Methods for Speech Recognition , pp. 11-19
- Woodland, P.C.¹

2
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr
- J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.H.²

3
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-186, 1995.
- (1995) Comput. Speech Lang , vol.9 , Issue.2 , pp. 171-186
- Leggetter, C.J.¹ Woodland, P.²

4
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- Nov
- R. Kuhn, J.-C. Junqua, P. Nguyen, and N. Niedzielski, "Rapid speaker adaptation in eigenvoice space," IEEE Trans. Speech Audio Process., vol. 8, no. 6, pp. 695-707, Nov. 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.6 , pp. 695-707
- Kuhn, R.¹ Junqua, J.-C.² Nguyen, P.³ Niedzielski, N.⁴

5
- 85135280100
- Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments
- Budapest, Hungary
- P. Nguyen, C. Wellekens, and J. C. Junqua, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments," in Proc. Eur. Conf. Speech Commun. Technol., Budapest, Hungary, 1999, pp. 2519-2522.
- (1999) Proc. Eur. Conf. Speech Commun. Technol , pp. 2519-2522
- Nguyen, P.¹ Wellekens, C.² Junqua, J.C.³

6
- 85009080436
- Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices
- Beijing, China
- H. Botterweck, "Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China, 2000, pp. 354-357.
- (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 354-357
- Botterweck, H.¹

7
- 0009625231
- A comparison of noval techniques for rapid speaker adaptation
- T. J. Hazen, "A comparison of noval techniques for rapid speaker adaptation," Speech Commun., vol. 31, pp. 15-33, 2000.
- (2000) Speech Commun , vol.31 , pp. 15-33
- Hazen, T.J.¹

8
- 33947681802
- Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
- Toulouse, France
- B. Mak, T. C. Lai, and R. Hsiao, "Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, 2006, pp. 229-232.
- (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 229-232
- Mak, B.¹ Lai, T.C.² Hsiao, R.³

9
- 64949197536
- T. C. Lai and B. Bak, Unsupervised speaker adaptation using reference speaker weighting, in Proc. Int. Symp. Chinese Spoken Lang. Process., Singapore, Dec. 2006, pp. 380-389. [10] M. J. F. Gales, Cluster adaptive training of hidden Markov models, IEEE Trans. Speech Audio Process., 8, no. 4, pp. 417-428, 2000.
- T. C. Lai and B. Bak, "Unsupervised speaker adaptation using reference speaker weighting," in Proc. Int. Symp. Chinese Spoken Lang. Process., Singapore, Dec. 2006, pp. 380-389. [10] M. J. F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, 2000.

10
- 0034842307
- Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition
- Salt Lake, UT
- H. Botterweck, "Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition," in IEEE Proc. Int. Conf. Acoust., Speech, Signal Process., Salt Lake, UT, 2001, pp. 353-356.
- (2001) IEEE Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 353-356
- Botterweck, H.¹

11
- 18744386134
- Eigenvoiee modeling with sparse training data
- May
- P. Kenny, G. Boulianne, and P. Dumouchel, "Eigenvoiee modeling with sparse training data," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 345-354, May 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 345-354
- Kenny, P.¹ Boulianne, G.² Dumouchel, P.³

12
- 85009097035
- Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
- Beijing, China
- K. T. Chen, W. W. Liau, H. M. Wang, and L. S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China, 2000, pp. 742-745.
- (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 742-745
- Chen, K.T.¹ Liau, W.W.² Wang, H.M.³ Lee, L.S.⁴

13
- 64949152565
- IEEE
- B. Mak and R. Hsiao, "Kernel eigenvoiee speaker adaptation," IEEE
- Kernel eigenvoiee speaker adaptation
- Mak, B.¹ Hsiao, R.²

14
- 27644511614
- Sep
- Trans. Speech Audio Process., vol. 13, no. 5, pp. 984-992, Sep. 2005.
- (2005) Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 984-992

15
- 56149122221
- Kernel eigenspace-based MLLR adaptation
- Mar
- B. Mak and R. Hsiao, "Kernel eigenspace-based MLLR adaptation," IEEE Trans. Audio, Speech, Language Process., vol. 15, no. 3, pp. 784-795, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Language Process , vol.15 , Issue.3 , pp. 784-795
- Mak, B.¹ Hsiao, R.²

16
- 34047260093
- Discriminative cluster adaptive training
- Sep
- K. Yu and M. J. F. Gales, "Discriminative cluster adaptive training," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1694-1703, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1694-1703
- Yu, K.¹ Gales, M.J.F.²

17
- 22544443963
- Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation
- Jul
- B. Zhou and J. Hansen, "Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation," IEEE Trans. Speech Audio Process., vol. 13, no. 4, pp. 554-564, Jul. 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.4 , pp. 554-564
- Zhou, B.¹ Hansen, J.²

18
- 0003946510
- New York: Springer-Verlag
- I. T. Jolliffe, Principle Component Analysis. New York: Springer-Verlag, 1986.
- (1986) Principle Component Analysis
- Jolliffe, I.T.¹

19
- 0033556788
- Mixtures of probabilistic principal component analyzers
- M. E. Tipping and C. M. Bishop, "Mixtures of probabilistic principal component analyzers," Neural Comput., vol. 11, no. 2, pp. 443-482, 1999.
- (1999) Neural Comput , vol.11 , Issue.2 , pp. 443-482
- Tipping, M.E.¹ Bishop, C.M.²

20
- 85009106031
- Bayesian speaker adaptation based on probabilistic pricipal component analysis
- Beijing, China
- D. K. Kim and N. S. Kim, "Bayesian speaker adaptation based on probabilistic pricipal component analysis," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China. 2000, pp. 734-737.
- (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 734-737
- Kim, D.K.¹ Kim, N.S.²

21
- 0023776398
- The DARPA 1000-word resource management database for continuous speech recognition
- New York, Apr
- P. Price, W. M. Fisher, J. Bernstein, and D. S. Pallett, "The DARPA 1000-word resource management database for continuous speech recognition," in Proc. IEEE Proc. Int. Conf. Acoust., Speech, Signal Process., New York, Apr. 1988, pp. 651-654.
- (1988) Proc. IEEE Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 651-654
- Price, P.¹ Fisher, W.M.² Bernstein, J.³ Pallett, D.S.⁴

22
- 0012330750
- The design of the wall street journal-based CSR corpus
- Austin, TX, Feb
- D. B. Paul and J. M. Baker, "The design of the wall street journal-based CSR corpus," in Proc. IEEE DARPA Speech Natural Lang. Workshop, Austin, TX, Feb. 1992, pp. 357-360.
- (1992) Proc. IEEE DARPA Speech Natural Lang. Workshop , pp. 357-360
- Paul, D.B.¹ Baker, J.M.²

23
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M. Gales and P. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech, Lang., vol. 10, no. 4, pp. 249-264, 1996.
- (1996) Comput. Speech, Lang , vol.10 , Issue.4 , pp. 249-264
- Gales, M.¹ Woodland, P.²

24
- 33947625663
- Study of intra-speaker's speech variability over long and short time periods for speech recognition
- Toulouse, France, May
- S. Tsuge, M. Shishibori, K. Kita, F. Ren, and S. Kuroiwa, "Study of intra-speaker's speech variability over long and short time periods for speech recognition," in IEEE Pwc. Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, May 2006, pp. I-397-I-400.
- (2006) IEEE Pwc. Int. Conf. Acoust., Speech, Signal Process
- Tsuge, S.¹ Shishibori, M.² Kita, K.³ Ren, F.⁴ Kuroiwa, S.⁵

25
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc, ser. B, vol. 39, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc, ser. B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

26
- 0030362995
- A compact model for speaker-adaptive training
- Philadelphia, PA
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. Int. Conf Spoken Lang. Process., Philadelphia, PA, 1996, pp. 1137-1140.
- (1996) Proc. Int. Conf Spoken Lang. Process , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

27
- 0032050110
- Maximum likelihood linear teansformations for HMM-based speech recognition
- M. Gales, "Maximum likelihood linear teansformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
- (1998) Comput. Speech Lang , vol.12 , pp. 75-98
- Gales, M.¹

28
- 0034704222
- Nonlinear dimensionality reduction by locally linear embedding
- S. Roweis and L. K. Saul, "Nonlinear dimensionality reduction by locally linear embedding," Science, vol. 290, pp. 2323-2326, 2000.
- (2000) Science , vol.290 , pp. 2323-2326
- Roweis, S.¹ Saul, L.K.²

29
- 84880203756
- Laplacian eigenmaps and spectral techniques for embedding and clustering
- M. Belkin and P. Niyogi, "Laplacian eigenmaps and spectral techniques for embedding and clustering," in Proc. Adv. Neural Inf. Process. Syst. 14, 2001, pp. 585-591.
- (2001) Proc. Adv. Neural Inf. Process. Syst. 14 , pp. 585-591
- Belkin, M.¹ Niyogi, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.