Volume 55, Issue 9, 2013, Pages 893-908

Rapid speaker adaptation in latent speaker space with non-negative matrix factorization

Author keywords

Eigenvoice; fMLLR; NMF; SAT; Speaker adaptation

Indexed keywords

EIGENVOICES; FMLLR; NMF; SAT; SPEAKER ADAPTATION

EID: 84879322629     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2013.05.001     Document Type: Article
Times cited: 5

References (31)
  • 3
    • 0001862769
    • An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process
    • L. Baum. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process. Inequalities 3 (1972) 1-8.
    • (1972) Inequalities , vol.3 , pp. 1-8
    • Baum, L.1
  • 5
    • 84879315200
    • Orthogonal nonnegative matrix tri-factorizations for clustering
    • Center for Research in Computing Technology (Harvard University), Technical Report TR-10-98
    • Chen, S.; Goodman, J.; 1998. Orthogonal nonnegative matrix tri-factorizations for clustering. Technical Report TR-10-98. Center for Research in Computing Technology (Harvard University).
    • (1998) Technical Report. Center for Research in Computing Technology
    • Chen, S.1    Goodman, J.2
  • 10
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M.J.F. Gales. Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language 12 (1998) 75-98.
    • (1998) Computer Speech and Language , vol.12 , pp. 75-98
    • Gales, M.J.F.1
  • 12
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.L. Gauvain and C.H. Lee. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing 2 (1994) 291-298.
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.H.2
  • 17
    • 0033592606
    • Learning the parts of objects by non-negative matrix factorization
    • D.D. Lee and H.S. Seung. Learning the parts of objects by non-negative matrix factorization. Nature 401 (1999) 788-791.
    • (1999) Nature , vol.401 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 20
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggetter and P.C. Woodland. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language 9 (1995) 171-185.
    • (1995) Computer Speech and Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 21
    • 0141517881
    • Fast speaker adaptation
    • Institut Eurécom
    • Nguyen, P.; 1998. Fast speaker adaptation. Industrial Thesis Report. Institut Eurécom.
    • (1998) Industrial Thesis Report
    • Nguyen, P.1
  • 24
    • 85114788610
    • A new frequency shift function for reducing inter-speaker variance
    • Tuerk, C.; Robinson, T.; 1993. A new frequency shift function for reducing inter-speaker variance. In: Proc. Eurospeech, pp. 351-354.
    • (1993) Proc. Eurospeech , pp. 351-354
    • Tuerk, C.1    Robinson, T.2
  • 26
    • 50249152311
    • Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
    • T. Virtanen. Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria. IEEE Transactions on Audio, Speech and Language Processing 15 (2007) 1066-1074.
    • (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , pp. 1066-1074
    • Virtanen, T.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.