메뉴 건너뛰기




Volumn E90-D, Issue 2, 2007, Pages 554-561

Reducing computation time of the rapid unsupervised speaker adaptation based on HMM-sufficient statistics

Author keywords

HMM sufficient statistics; Rapid adaptation; Speech recognition; Unsupervised

Indexed keywords

ALGORITHMS; MARKOV PROCESSES; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; REGRESSION ANALYSIS; STATISTICS;

EID: 33847209621     PISSN: 09168532     EISSN: 17451361     Source Type: Journal    
DOI: 10.1093/ietisy/e90-d.2.554     Document Type: Article
Times cited : (1)

References (18)
  • 1
    • 85009067411 scopus 로고    scopus 로고
    • Elderly acoustic model for large vocabulary continuous speech recognition
    • A. Baba, S. Yoshizawa, M. Yamada, A. Lee, and K. Shikano, "Elderly acoustic model for large vocabulary continuous speech recognition," Proc. EUROSPEECH, pp.1657-1660, 2001.
    • (2001) Proc. EUROSPEECH , pp. 1657-1660
    • Baba, A.1    Yoshizawa, S.2    Yamada, M.3    Lee, A.4    Shikano, K.5
  • 2
    • 85009113198 scopus 로고    scopus 로고
    • Analysis of speaker variability
    • Sept
    • C. Huang, T. Chen, S. Li, and J.L. Zhou, "Analysis of speaker variability," Proc. Eurospeech, vol.2, pp.1377-1380, Sept. 2001.
    • (2001) Proc. Eurospeech , vol.2 , pp. 1377-1380
    • Huang, C.1    Chen, T.2    Li, S.3    Zhou, J.L.4
  • 4
    • 0030672082 scopus 로고    scopus 로고
    • Experiments in speaker normalisation and adaptation for large vocabulary adaptation
    • April
    • D. Pye and P.C. Woodland, "Experiments in speaker normalisation and adaptation for large vocabulary adaptation," Proc. ICASSP, vol.2, no.1, pp.1047-1051, April 1997.
    • (1997) Proc. ICASSP , vol.2 , Issue.1 , pp. 1047-1051
    • Pye, D.1    Woodland, P.C.2
  • 5
    • 78449240909 scopus 로고    scopus 로고
    • Speaker normalization and speaker adaptation - A combination for conversational speech recognition
    • Sept
    • P. Zhan, M. Westphal, M. Finke, and A. Waibel, "Speaker normalization and speaker adaptation - A combination for conversational speech recognition," Proc. Eurospeech, vol.10, pp.2087-2090, Sept. 1997.
    • (1997) Proc. Eurospeech , vol.10 , pp. 2087-2090
    • Zhan, P.1    Westphal, M.2    Finke, M.3    Waibel, A.4
  • 6
    • 0141702066 scopus 로고    scopus 로고
    • Investigating recognition of children's speech
    • April
    • D. Giuliani and M. Gerosa, "Investigating recognition of children's speech," Proc. ICASSP, vol.2, pp.137-140, April 2003.
    • (2003) Proc. ICASSP , vol.2 , pp. 137-140
    • Giuliani, D.1    Gerosa, M.2
  • 7
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C.J. Leggeter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Proc. Computer Speech and Language, vol.9, pp.171-185, 1995.
    • (1995) Proc. Computer Speech and Language , vol.9 , pp. 171-185
    • Leggeter, C.J.1    Woodland, P.C.2
  • 8
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observation of Markov chains
    • J. Gauvain and C.H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observation of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp.291-298, 1994.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.1    Lee, C.H.2
  • 9
    • 33645759266 scopus 로고    scopus 로고
    • Transformation and combination of hidden Markov models for speaker selection training
    • C. Huang, T. Chen, and E. Chan, "Transformation and combination of hidden Markov models for speaker selection training," Proc. ICSLP, pp.1377-1380, 2004.
    • (2004) Proc. ICSLP , pp. 1377-1380
    • Huang, C.1    Chen, T.2    Chan, E.3
  • 10
    • 0030682285 scopus 로고    scopus 로고
    • Smoothed N-best based speaker adaptation for speech recognition
    • T. Matsui, T. Matsuoka, and S. Furui, "Smoothed N-best based speaker adaptation for speech recognition," Proc. ICASSP, pp.1015-1018, 1997.
    • (1997) Proc. ICASSP , pp. 1015-1018
    • Matsui, T.1    Matsuoka, T.2    Furui, S.3
  • 11
    • 17344384524 scopus 로고    scopus 로고
    • Rapid adaptation with linear combinations of rank-one matrices
    • G. Vaibhava, V. Karthik, and G. Ramesh, "Rapid adaptation with linear combinations of rank-one matrices," Proc. ICASSP, vol.1, pp.581-584, 2002.
    • (2002) Proc. ICASSP , vol.1 , pp. 581-584
    • Vaibhava, G.1    Karthik, V.2    Ramesh, G.3
  • 12
    • 0034857758 scopus 로고    scopus 로고
    • Very fast adaptation with a compact context-dependent eigenvoice model
    • R. Kuhn, F. Perronnin, P. Nguyen, J. Junqua, and L. Rigazio, "Very fast adaptation with a compact context-dependent eigenvoice model," Proc. ICASSP, vol.1, pp.373-376, 2001.
    • (2001) Proc. ICASSP , vol.1 , pp. 373-376
    • Kuhn, R.1    Perronnin, F.2    Nguyen, P.3    Junqua, J.4    Rigazio, L.5
  • 13
    • 0034848875 scopus 로고    scopus 로고
    • Unsupervised speaker adaptation based on sufficent HMM statistics of selected speakers
    • S. Yoshizawa, A. Baba, K. Matsunami, Y. Mera, M. Yamada, and K. Shikano, "Unsupervised speaker adaptation based on sufficent HMM statistics of selected speakers," Proc. ICASSP, pp.341-344, 2001.
    • (2001) Proc. ICASSP , pp. 341-344
    • Yoshizawa, S.1    Baba, A.2    Matsunami, K.3    Mera, Y.4    Yamada, M.5    Shikano, K.6
  • 14
    • 33847213379 scopus 로고    scopus 로고
    • Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments
    • R. Gomez, A. Lee, H. Saruwatari, and K. Shikano, "Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments," Proc. EUROSPEECH, pp.296-301, 2005.
    • (2005) Proc. EUROSPEECH , pp. 296-301
    • Gomez, R.1    Lee, A.2    Saruwatari, H.3    Shikano, K.4
  • 15
    • 33645787305 scopus 로고    scopus 로고
    • Improving rapid unsupervised speaker adaptation based on HMM-sufficient statistics in noisy environments using multi-template models
    • March
    • R. Gomez, A. Lee, T. Toda, H. Saruwatari, and K. Shikano, "Improving rapid unsupervised speaker adaptation based on HMM-sufficient statistics in noisy environments using multi-template models," IEICE Trans. Inf. & Syst., vol.E89-D, no.3, pp.998-1005, March 2006.
    • (2006) IEICE Trans. Inf. & Syst , vol.E89-D , Issue.3 , pp. 998-1005
    • Gomez, R.1    Lee, A.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 16
    • 0033721605 scopus 로고    scopus 로고
    • A new phonetic tied-mixture model for efficient decoding
    • A. Lee, T. Kawahara, K. Takeda, and K. Shikano, "A new phonetic tied-mixture model for efficient decoding," Proc. ICASSP, pp.1269-1272, 2000.
    • (2000) Proc. ICASSP , pp. 1269-1272
    • Lee, A.1    Kawahara, T.2    Takeda, K.3    Shikano, K.4
  • 17
    • 85009257834 scopus 로고    scopus 로고
    • Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics
    • S. Yamade, K. Matsunami, A. Baba, A. Lee, H. Saruwatari, and K. Shikano, "Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics," Proc. ICSLP, pp.I-1045-1048, 2000.
    • (2000) Proc. ICSLP
    • Yamade, S.1    Matsunami, K.2    Baba, A.3    Lee, A.4    Saruwatari, H.5    Shikano, K.6
  • 18
    • 33847177826 scopus 로고    scopus 로고
    • Speaker-class reduction for HMM-sufficient statistics adaptation using multiple acoustic models
    • March
    • R. Gomez, A. Lee, H. Saruwatari, and K. Shikano, "Speaker-class reduction for HMM-sufficient statistics adaptation using multiple acoustic models," Proc. Acoustical Society of Japan, pp.133-134, March 2005.
    • (2005) Proc. Acoustical Society of Japan , pp. 133-134
    • Gomez, R.1    Lee, A.2    Saruwatari, H.3    Shikano, K.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.