메뉴 건너뛰기




Volumn 16, Issue 3, 2008, Pages 607-616

Rapid speaker adaptation using clustered maximum-likelihood linear basis with sparse training data

Author keywords

Cluster adaptive training; Eigenvoices; Parameter tying; Speaker adaptation; Speech recognition

Indexed keywords

ADAPTATION METHODS; AUTOMATIC SPEECH RECOGNITION; BASIS VECTORS; CLUSTER ADAPTIVE TRAINING; COMPUTER MEMORIES; EIGENVOICES; GENERAL CLASS; LINEAR COMBINATIONS; LOW COMPLEXITY; MAXIMUM-LIKELIHOOD; MAXIMUM-LIKELIHOOD ESTIMATIONS; PARAMETER TYING; PERFORMANCE IMPROVEMENTS; RAPID SPEAKER ADAPTATIONS; RESOURCE MANAGEMENTS; SPACE-BASED; SPEAKER ADAPTATION; STORAGE REQUIREMENTS; TASK DOMAINS; TRAINING DATUM; WALL STREET JOURNALS; WORD ERROR RATE REDUCTIONS; WORD ERROR RATES;

EID: 64949158419     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.916530     Document Type: Article
Times cited : (10)

References (29)
  • 1
    • 0346528936 scopus 로고    scopus 로고
    • Speaker adaptation for continuous density HMMs: A review
    • Sophia Antipolis, France
    • P. C. Woodland, "Speaker adaptation for continuous density HMMs: A review," in Proc. ITRW Adaptation Methods for Speech Recognition, Sophia Antipolis, France, 2001, pp. 11-19.
    • (2001) Proc. ITRW Adaptation Methods for Speech Recognition , pp. 11-19
    • Woodland, P.C.1
  • 2
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • Apr
    • J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 3
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, no. 2, pp. 171-186, 1995.
    • (1995) Comput. Speech Lang , vol.9 , Issue.2 , pp. 171-186
    • Leggetter, C.J.1    Woodland, P.2
  • 5
    • 85135280100 scopus 로고    scopus 로고
    • Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments
    • Budapest, Hungary
    • P. Nguyen, C. Wellekens, and J. C. Junqua, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments," in Proc. Eur. Conf. Speech Commun. Technol., Budapest, Hungary, 1999, pp. 2519-2522.
    • (1999) Proc. Eur. Conf. Speech Commun. Technol , pp. 2519-2522
    • Nguyen, P.1    Wellekens, C.2    Junqua, J.C.3
  • 6
    • 85009080436 scopus 로고    scopus 로고
    • Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices
    • Beijing, China
    • H. Botterweck, "Very fast adaptation for large vocabulary continuous speech recognition using eigenvoices," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China, 2000, pp. 354-357.
    • (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 354-357
    • Botterweck, H.1
  • 7
    • 0009625231 scopus 로고    scopus 로고
    • A comparison of noval techniques for rapid speaker adaptation
    • T. J. Hazen, "A comparison of noval techniques for rapid speaker adaptation," Speech Commun., vol. 31, pp. 15-33, 2000.
    • (2000) Speech Commun , vol.31 , pp. 15-33
    • Hazen, T.J.1
  • 8
    • 33947681802 scopus 로고    scopus 로고
    • Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers
    • Toulouse, France
    • B. Mak, T. C. Lai, and R. Hsiao, "Improving reference speaker weighting adaptation by the use of maximum-likelihood reference speakers," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, 2006, pp. 229-232.
    • (2006) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 229-232
    • Mak, B.1    Lai, T.C.2    Hsiao, R.3
  • 9
    • 64949197536 scopus 로고    scopus 로고
    • T. C. Lai and B. Bak, Unsupervised speaker adaptation using reference speaker weighting, in Proc. Int. Symp. Chinese Spoken Lang. Process., Singapore, Dec. 2006, pp. 380-389. [10] M. J. F. Gales, Cluster adaptive training of hidden Markov models, IEEE Trans. Speech Audio Process., 8, no. 4, pp. 417-428, 2000.
    • T. C. Lai and B. Bak, "Unsupervised speaker adaptation using reference speaker weighting," in Proc. Int. Symp. Chinese Spoken Lang. Process., Singapore, Dec. 2006, pp. 380-389. [10] M. J. F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, 2000.
  • 10
    • 0034842307 scopus 로고    scopus 로고
    • Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition
    • Salt Lake, UT
    • H. Botterweck, "Anisotropic MAP defined by eigenvoices for large vocabulary continuous speech recognition," in IEEE Proc. Int. Conf. Acoust., Speech, Signal Process., Salt Lake, UT, 2001, pp. 353-356.
    • (2001) IEEE Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 353-356
    • Botterweck, H.1
  • 11
  • 12
    • 85009097035 scopus 로고    scopus 로고
    • Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
    • Beijing, China
    • K. T. Chen, W. W. Liau, H. M. Wang, and L. S. Lee, "Fast speaker adaptation using eigenspace-based maximum likelihood linear regression," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China, 2000, pp. 742-745.
    • (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 742-745
    • Chen, K.T.1    Liau, W.W.2    Wang, H.M.3    Lee, L.S.4
  • 14
    • 27644511614 scopus 로고    scopus 로고
    • Sep
    • Trans. Speech Audio Process., vol. 13, no. 5, pp. 984-992, Sep. 2005.
    • (2005) Trans. Speech Audio Process , vol.13 , Issue.5 , pp. 984-992
  • 15
    • 56149122221 scopus 로고    scopus 로고
    • Kernel eigenspace-based MLLR adaptation
    • Mar
    • B. Mak and R. Hsiao, "Kernel eigenspace-based MLLR adaptation," IEEE Trans. Audio, Speech, Language Process., vol. 15, no. 3, pp. 784-795, Mar. 2007.
    • (2007) IEEE Trans. Audio, Speech, Language Process , vol.15 , Issue.3 , pp. 784-795
    • Mak, B.1    Hsiao, R.2
  • 16
    • 34047260093 scopus 로고    scopus 로고
    • Discriminative cluster adaptive training
    • Sep
    • K. Yu and M. J. F. Gales, "Discriminative cluster adaptive training," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1694-1703, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1694-1703
    • Yu, K.1    Gales, M.J.F.2
  • 17
    • 22544443963 scopus 로고    scopus 로고
    • Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation
    • Jul
    • B. Zhou and J. Hansen, "Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation," IEEE Trans. Speech Audio Process., vol. 13, no. 4, pp. 554-564, Jul. 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.4 , pp. 554-564
    • Zhou, B.1    Hansen, J.2
  • 19
    • 0033556788 scopus 로고    scopus 로고
    • Mixtures of probabilistic principal component analyzers
    • M. E. Tipping and C. M. Bishop, "Mixtures of probabilistic principal component analyzers," Neural Comput., vol. 11, no. 2, pp. 443-482, 1999.
    • (1999) Neural Comput , vol.11 , Issue.2 , pp. 443-482
    • Tipping, M.E.1    Bishop, C.M.2
  • 20
    • 85009106031 scopus 로고    scopus 로고
    • Bayesian speaker adaptation based on probabilistic pricipal component analysis
    • Beijing, China
    • D. K. Kim and N. S. Kim, "Bayesian speaker adaptation based on probabilistic pricipal component analysis," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China. 2000, pp. 734-737.
    • (2000) Proc. Int. Conf. Spoken Lang. Process , pp. 734-737
    • Kim, D.K.1    Kim, N.S.2
  • 22
    • 0012330750 scopus 로고
    • The design of the wall street journal-based CSR corpus
    • Austin, TX, Feb
    • D. B. Paul and J. M. Baker, "The design of the wall street journal-based CSR corpus," in Proc. IEEE DARPA Speech Natural Lang. Workshop, Austin, TX, Feb. 1992, pp. 357-360.
    • (1992) Proc. IEEE DARPA Speech Natural Lang. Workshop , pp. 357-360
    • Paul, D.B.1    Baker, J.M.2
  • 23
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M. Gales and P. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech, Lang., vol. 10, no. 4, pp. 249-264, 1996.
    • (1996) Comput. Speech, Lang , vol.10 , Issue.4 , pp. 249-264
    • Gales, M.1    Woodland, P.2
  • 24
    • 33947625663 scopus 로고    scopus 로고
    • Study of intra-speaker's speech variability over long and short time periods for speech recognition
    • Toulouse, France, May
    • S. Tsuge, M. Shishibori, K. Kita, F. Ren, and S. Kuroiwa, "Study of intra-speaker's speech variability over long and short time periods for speech recognition," in IEEE Pwc. Int. Conf. Acoust., Speech, Signal Process., Toulouse, France, May 2006, pp. I-397-I-400.
    • (2006) IEEE Pwc. Int. Conf. Acoust., Speech, Signal Process
    • Tsuge, S.1    Shishibori, M.2    Kita, K.3    Ren, F.4    Kuroiwa, S.5
  • 25
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc, ser. B, vol. 39, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc, ser. B , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 27
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear teansformations for HMM-based speech recognition
    • M. Gales, "Maximum likelihood linear teansformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998.
    • (1998) Comput. Speech Lang , vol.12 , pp. 75-98
    • Gales, M.1
  • 28
    • 0034704222 scopus 로고    scopus 로고
    • Nonlinear dimensionality reduction by locally linear embedding
    • S. Roweis and L. K. Saul, "Nonlinear dimensionality reduction by locally linear embedding," Science, vol. 290, pp. 2323-2326, 2000.
    • (2000) Science , vol.290 , pp. 2323-2326
    • Roweis, S.1    Saul, L.K.2
  • 29
    • 84880203756 scopus 로고    scopus 로고
    • Laplacian eigenmaps and spectral techniques for embedding and clustering
    • M. Belkin and P. Niyogi, "Laplacian eigenmaps and spectral techniques for embedding and clustering," in Proc. Adv. Neural Inf. Process. Syst. 14, 2001, pp. 585-591.
    • (2001) Proc. Adv. Neural Inf. Process. Syst. 14 , pp. 585-591
    • Belkin, M.1    Niyogi, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.