메뉴 건너뛰기




Volumn 53, Issue 6, 2011, Pages 914-923

Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis

Author keywords

Context adaptive training; Factorized decision tree; HMM based speech synthesis; State clustering

Indexed keywords

ADAPTIVE TRAINING; COMBINATORIAL EXPLOSION; CONTEXT DEPENDENT; DATA COVERAGE; DATA SPARSITY PROBLEMS; FACTORIZED DECISION TREE; HIGH QUALITY; HMM-BASED SPEECH SYNTHESIS; NATURAL SPEECH; PARAMETER CLUSTERING; STATE CLUSTERING; SYNTHESIZED SPEECH; USE CONTEXT;

EID: 79955538498     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2011.03.003     Document Type: Article
Times cited : (27)

References (28)
  • 2
    • 0032658258 scopus 로고    scopus 로고
    • Decision tree state tying based on penalized Bayesian information criterion
    • Chou, W., Reichl, W., 1999. Decision tree state tying based on penalized Bayesian information criterion. In: Proc. ICASSP, pp. 345-348.
    • (1999) Proc. ICASSP , pp. 345-348
    • Chou, W.1    Reichl, W.2
  • 3
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • Fukada, T., Tokuda, K., Kobayashi, T., Imai, S., 1992. An adaptive algorithm for mel-cepstral analysis of speech. In: Proc. ICASSP, pp. 137-140.
    • (1992) Proc. ICASSP , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 4
    • 0003940203 scopus 로고    scopus 로고
    • The generation and use of regression class trees for MLLR adaptation
    • Cambridge University Engineering Department
    • Gales, M., 1996. The generation and use of regression class trees for MLLR adaptation. Tech. Rep. CUED/F-INFENG/TR263, Cambridge University Engineering Department.
    • (1996) Tech. Rep. CUED/F-INFENG/TR263
    • Gales, M.1
  • 5
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. Gales Maximum likelihood linear transformations for HMM-based speech recognition Comput. Speech Lang. 12 2 1998 75 98 (Pubitemid 128383747)
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 6
    • 0034227757 scopus 로고    scopus 로고
    • Cluster adaptive training of hidden Markov models
    • M. Gales Cluster adaptive training of hidden Markov models IEEE Trans. Speech Audio Process. 8 4 2000 417 428
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.1
  • 7
    • 79959841827 scopus 로고    scopus 로고
    • Canonical state models for automatic speech recognition
    • Gales, M., Yu, K., 2010. Canonical state models for automatic speech recognition. In: Proc. Interspeech, pp. 58-61.
    • (2010) Proc. Interspeech , pp. 58-61
    • Gales, M.1    Yu, K.2
  • 9
    • 33746384049 scopus 로고    scopus 로고
    • Statistical modelling of speech segment duration by constrained tree regression
    • N. Iwahashi, and Y. Sagisaka Statistical modelling of speech segment duration by constrained tree regression Trans. IEICE E83-D 2000 1550 1559
    • (2000) Trans. IEICE , vol.E83-D , pp. 1550-1559
    • Iwahashi, N.1    Sagisaka, Y.2
  • 11
    • 33646773080 scopus 로고    scopus 로고
    • CMU ARCTIC databases for speech synthesis
    • Carnegie Mellon University
    • Kominek, J., Black, A., 2003. CMU ARCTIC databases for speech synthesis. Tech. Rep. CMU-LTI-03-177, Carnegie Mellon University.
    • (2003) Tech. Rep. CMU-LTI-03-177
    • Kominek, J.1    Black, A.2
  • 12
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter, and P. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models Comput. Speech Lang. 9 1995 171 185
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
    • Leggetter, C.1    Woodland, P.2
  • 13
    • 51449118125 scopus 로고    scopus 로고
    • Acoustic modeling with contextual additive structure for HMM-based speech recognition
    • Nankaku, Y., Nakamura, K., Zen, H., Tokuda, K., 2008. Acoustic modeling with contextual additive structure for HMM-based speech recognition. In: Proc. ICASSP, pp. 4469-4472.
    • (2008) Proc. ICASSP , pp. 4469-4472
    • Nankaku, Y.1    Nakamura, K.2    Zen, H.3    Tokuda, K.4
  • 16
    • 85135145174 scopus 로고    scopus 로고
    • Acoustic modeling based on the MDL principle for speech recognition
    • Shinoda, K., Watanabe, T., 1997. Acoustic modeling based on the MDL principle for speech recognition. In: Proc. EUROSPEECH, pp. 99-102.
    • (1997) Proc. EUROSPEECH , pp. 99-102
    • Shinoda, K.1    Watanabe, T.2
  • 17
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inform. Systems E90-D 5 2007 816 824
    • (2007) IEICE Trans. Inform. Systems , vol.E90-D , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 18
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • Tokuda, K., Yoshimura, T., Masuko, T., Kobayashi, T., Kitamura, T., 2000. Speech parameter generation algorithms for HMM-based speech synthesis. In: Proc. ICASSP, pp. 1315-1318.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 20
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Yoshimura, T., Tokuda, K., Masuko, T., Kobayashi, T., Kitamura, T., 1999. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. In: Proc. Eurospeech, pp. 2347-2350.
    • (1999) Proc. Eurospeech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 24
    • 78049376926 scopus 로고    scopus 로고
    • Word-level emphasis modelling in HMM-based speech synthesis
    • Yu, K., Mairesse, F., Young, S., 2010. Word-level emphasis modelling in HMM-based speech synthesis. In: Proc. ICASSP, pp. 4238-4241.
    • (2010) Proc. ICASSP , pp. 4238-4241
    • Yu, K.1    Mairesse, F.2    Young, S.3
  • 25
    • 79959813917 scopus 로고    scopus 로고
    • Speaker and language adaptive training for HMM-based polyglot speech synthesis
    • Zen, H., 2010. Speaker and language adaptive training for HMM-based polyglot speech synthesis. In: Proc. Interspeech, pp. 410-413.
    • (2010) Proc. Interspeech , pp. 410-413
    • Zen, H.1
  • 27
    • 33846405723 scopus 로고    scopus 로고
    • Details of the nitech HMM-based speech synthesis system for the blizzard challenge 2005
    • DOI 10.1093/ietisy/e90-1.1.325
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005 IEICE Trans. Inform. Systems E-90D 1 2007 325 333 (Pubitemid 46145336)
    • (2007) IEICE Transactions on Information and Systems , vol.E90-D , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4
  • 28
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. Black Statistical parametric speech synthesis Speech Comm. 51 11 2009 1039 1064
    • (2009) Speech Comm. , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.