SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 55, Issue 9, 2013, Pages 893-908

Rapid speaker adaptation in latent speaker space with non-negative matrix factorization

(3) Zhang, Xueru a Demuynck, Kris a Van Hamme, Hugo a

a UNIVERSITY OF LEUVEN (Belgium)

Author keywords

Eigenvoice; fMLLR; NMF; SAT; Speaker adaptation

Indexed keywords

EIGENVOICES; FMLLR; NMF; SAT; SPEAKER ADAPTATION;

ALGORITHMS; FACTORIZATION; SPEECH RECOGNITION;

GAUSSIAN DISTRIBUTION;

EID: 84879322629 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2013.05.001 Document Type: Article

Times cited : (5)

References (31)

1
- 0030677475
- Speaker adaptive training: A maximum likelihood approach to speaker normalization
- Anastasakos, T.; McDonough, J.; Makhoul, J.; 1997. Speaker adaptive training: a maximum likelihood approach to speaker normalization. In: Proc. International Conference on Acoustics, Speech and Signal Processing, pp. 1043-1046.
- (1997) Proc. International Conference on Acoustics, Speech and Signal Processing , pp. 1043-1046
- Anastasakos, T.¹ McDonough, J.² Makhoul, J.³

2
- 0030362995
- A compact model for speaker-adaptive training
- Anastasakos, T.; Mcdonough, J.; Schwartz, R.; Makhoul, J.; 1996. A compact model for speaker-adaptive training. In: Proc. International Conference in Spoken Language Processing, pp. 1137-1140.
- (1996) Proc. International Conference in Spoken Language Processing , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

3
- 0001862769
- An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process
- L. Baum An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process Inequalities 3 1972 1 8
- (1972) Inequalities , vol.3 , pp. 1-8
- Baum, L.¹

4
- 85009097035
- Fast speaker adaptation using eigenspace-based maximum likelihood linear regression
- Chen, K.T.; Liau, W.W.; Wang, H.M.; Lee, L.S.; 2000. Fast speaker adaptation using eigenspace-based maximum likelihood linear regression. In: Proc. International Conference on Spoken Language Processing, pp. 742-745.
- (2000) Proc. International Conference on Spoken Language Processing , pp. 742-745
- Chen K., .T.¹ Liau W., .W.² Wang H., .M.³ Lee L., .S.⁴

5
- 84879315200
- Orthogonal nonnegative matrix tri-factorizations for clustering
- (Harvard University). Technical, Report, TR-10-98
- Chen, S.; Goodman, J.; 1998. Orthogonal nonnegative matrix tri-factorizations for clustering. Technical Report. Center for Research in Computing Technology (Harvard University). Technical, Report, TR-10-98.
- (1998) Technical Report. Center for Research in Computing Technology
- Chen, S.¹ Goodman, J.²

6
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- A.P. Dempster, N.M. Laird, and D.B. Rubin Maximum likelihood from incomplete data via the EM algorithm Journal of the Royal Statistical Society, Series B 39 1977 1 38
- (1977) Journal of the Royal Statistical Society, Series B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

7
- 0003589269
- Ph.D. Thesis. Katholieke Universiteit Leuven
- Demuynck, K.; 2001. Extracting, modelling and combining information in speech recognition. Ph.D. Thesis. Katholieke Universiteit Leuven.
- (2001) Extracting, Modelling and Combining Information in Speech Recognition
- Demuynck, K.¹

8
- 51449115117
- Fast speaker adaptation using non-negative matrix factorization
- Duchateau, J.; Leroy, T.; Demuynck, K.; Van hamme, H.; 2008. Fast speaker adaptation using non-negative matrix factorization. In: Proc. International Conference on Acoustics, Speech and Signal Processing, pp. 4269-4272.
- (2008) Proc. International Conference on Acoustics, Speech and Signal Processing , pp. 4269-4272
- Duchateau, J.¹ Leroy, T.² Demuynck, K.³ Van Hamme, H.⁴

9
- 44949142593
- A flexible recogniser architecture in a reading tutor for children
- Duchateau, J.; Wigham, M.; Demuynck, K.; Van hamme, H.; 2006. A flexible recogniser architecture in a reading tutor for children. In: Proc. ISCA Tutorial and Research Workshop (ITRW) on Speech Recognition and Intrinsic Variation, pp. 59-64.
- (2006) Proc. ISCA Tutorial and Research Workshop (ITRW) on Speech Recognition and Intrinsic Variation , pp. 59-64
- Duchateau, J.¹ Wigham, M.² Demuynck, K.³ Van Hamme, H.⁴

10
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M.J.F. Gales Maximum likelihood linear transformations for HMM-based speech recognition Computer Speech and Language 12 1998 75 98
- (1998) Computer Speech and Language , vol.12 , pp. 75-98
- Gales, M.J.F.¹

11
- 84885621082
- Relation between PLSA and NMF and implications
- Gaussier, E.; Goutte, C.; 2005. Relation between PLSA and NMF and implications. In: Proc. ACM SIGIR conference on Research and Development in, Information Retrieval, pp. 601-602.
- (2005) Proc. ACM SIGIR Conference on Research and Development In, Information Retrieval , pp. 601-602
- Gaussier, E.¹ Goutte, C.²

12
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- J.L. Gauvain, and C.H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Transactions on Speech and Audio Processing 2 1994 291 298
- (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.H.²

13
- 0001509519
- Probabilistic latent semantic analysis
- Hofmann, T.; 1999a. Probabilistic latent semantic analysis. In: Proc. 15th Conference on Uncertainty in AI.
- (1999) Proc. 15th Conference on Uncertainty in AI
- Hofmann, T.¹

14
- 85026972772
- Probabilistic latent semantic indexing
- Hofmann, T.; 1999b. Probabilistic latent semantic indexing. In: Proc. ACM SIGIR Special Interest Group on, Information Retrieval, pp. 50-57.
- (1999) Proc. ACM SIGIR Special Interest Group On, Information Retrieval , pp. 50-57
- Hofmann, T.¹

15
- 0004056285
- Prentice Hall, PTR
- X. Huang, A. Acero, and H.W. Hon Spoken Language Processing: A Guide to Theory, Algorithm and System Development 2001 Prentice Hall, PTR
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm and System Development
- Huang, X.¹ Acero, A.² Hon, H.W.³

16
- 0034320005
- Rapid speaker adaptation in eigenvoice space
- R. Kuhn, J.C. Junqua, P. Nguyen, and N. Niedzielski Rapid speaker adaptation in eigenvoice space IEEE Transactions on Speech and Audio Processing 8 2000 695 707
- (2000) IEEE Transactions on Speech and Audio Processing , vol.8 , pp. 695-707
- Kuhn, R.¹ Junqua, J.C.² Nguyen, P.³ Niedzielski, N.⁴

17
- 0033592606
- Learning the parts of objects by non-negative matrix factorization
- D.D. Lee, and H.S. Seung Learning the parts of objects by non-negative matrix factorization Nature 401 1999 788 791
- (1999) Nature , vol.401 , pp. 788-791
- Lee, D.D.¹ Seung, H.S.²

18
- 84898964201
- Algorithms for non-negative matrix factorization
- D.D. Lee, and H.S. Seung Algorithms for non-negative matrix factorization Advances in Neural Information Processing Systems 13 2001 556 562
- (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 556-562
- Lee, D.D.¹ Seung, H.S.²

19
- 0003445971
- Cambridge University Engineering Department
- Leggetter, C.J.; Woodland, P.C.; 1994. Speaker adaptation using linear regression. Technical Report CUED/F-INFENG/TR.181. Cambridge University Engineering Department.
- (1994) Speaker Adaptation Using Linear Regression. Technical Report CUED/F-INFENG/TR.181
- Leggetter C., .J.¹ Woodland P., .C.²

20
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Computer Speech and Language 9 1995 171 185
- (1995) Computer Speech and Language , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

21
- 0141517881
- Fast speaker adaptation
- Institut Eurécom
- Nguyen, P.; 1998. Fast speaker adaptation. Industrial Thesis Report. Institut Eurécom.
- (1998) Industrial Thesis Report
- Nguyen, P.¹

22
- 84945116938
- Non-negative matrix factorization for polyphonic music transcription
- Smaragdis, P.; Brown, J.C.; 2003. Non-negative matrix factorization for polyphonic music transcription. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 177-180.
- (2003) IEEE Workshop on Applications of Signal Processing to Audio and Acoustics , pp. 177-180
- Smaragdis, P.¹ Brown J., .C.²

23
- 84890499149
- A probabilistic latent variable model for acoustic modeling
- Smaragdis, P.; Raj, B.; Shashanka, M.; 2006. A probabilistic latent variable model for acoustic modeling. In: Proc. Advances in Models for Acoustic Processing Workshop.
- (2006) Proc. Advances in Models for Acoustic Processing Workshop
- Smaragdis, P.¹ Raj, B.² Shashanka, M.³

24
- 85114788610
- A new frequency shift function for reducing inter-speaker variance
- Tuerk, C.; Robinson, T.; 1993. A new frequency shift function for reducing inter-speaker variance. In: Proc. Eurospeech, pp. 351-354.
- (1993) Proc. Eurospeech , pp. 351-354
- Tuerk, C.¹ Robinson, T.²

25
- 67651034997
- Integration of asynchronous knowledge sources in a novel speech recognition framework
- Van hamme, H.; 2008. Integration of asynchronous knowledge sources in a novel speech recognition framework. In: Proc. ISCA Tutorial and Research Workshop (ITRW) on Speech Analysis and Processing for Knowledge Discovery.
- (2008) Proc. ISCA Tutorial and Research Workshop (ITRW) on Speech Analysis and Processing for Knowledge Discovery
- Van Hamme, H.¹

26
- 50249152311
- Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
- T. Virtanen Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria IEEE Transactions on Audio, Speech and Language Processing 15 2007 291 298
- (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 291-298
- Virtanen, T.¹

27
- 0346528936
- Speaker adaptation for continuous density HMMs: A review
- Woodland, P.C.; 2001. Speaker adaptation for continuous density HMMs: a review. In: ISCA Tutorial and Research Workshop (ITRW) on Adaptation Methods for Speech Recognition, pp. 11-19.
- (2001) ISCA Tutorial and Research Workshop (ITRW) on Adaptation Methods for Speech Recognition , pp. 11-19
- Woodland, P.C.¹

28
- 1542347778
- Document-clustering based on non-negative matrix factorization
- Xu, W.; Liu, X.; Gong, Y.; 2003. Document-clustering based on non-negative matrix factorization. In: Proc. ACM SIGIR Special Interest Group on, Information Retrieval, pp. 267-273.
- (2003) Proc. ACM SIGIR Special Interest Group On, Information Retrieval , pp. 267-273
- Xu, W.¹ Liu, X.² Gong, Y.³

29
- 0029745232
- Maximum a posteriori adaptation for large scale HMM recognizers
- Zavaliagkos, G.; Schwartz, R.; 1996. Maximum a posteriori adaptation for large scale HMM recognizers. In: Proc. International Conference on Acoustics, Speech and Signal Processing, pp. 725-728.
- (1996) Proc. International Conference on Acoustics, Speech and Signal Processing , pp. 725-728
- Zavaliagkos, G.¹ Schwartz, R.²

30
- 80051603826
- Rapid speaker adaptation with speaker adaptive training and non-negative matrix factorization
- Zhang, X.; Demuynck, K.; Van hamme, H.; 2011. Rapid speaker adaptation with speaker adaptive training and non-negative matrix factorization. In: Proc. International Conference on Acoustics, Speech and Signal Processing, pp. 4456-4459.
- (2011) Proc. International Conference on Acoustics, Speech and Signal Processing , pp. 4456-4459
- Zhang, X.¹ Demuynck, K.² Van Hamme, H.³

31
- 84867600898
- Latent variable speaker adaptation of gaussian mixture weights and means
- Zhang, X.; Demuynck, K.; Van hamme, H.; 2012. Latent variable speaker adaptation of gaussian mixture weights and means. In: Proc. International Conference on Acoustics, Speech and Signal Processing, pp. 4349-4352.
- (2012) Proc. International Conference on Acoustics, Speech and Signal Processing , pp. 4349-4352
- Zhang, X.¹ Demuynck, K.² Van Hamme, H.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.