메뉴 건너뛰기




Volumn , Issue , 2009, Pages 4025-4028

Trajectory training considering global variance for HMM-based speech synthesis

Author keywords

Global varian; Hidden Markov models; Speech synthesis; Training criterion; Trajectory likelihood

Indexed keywords

CLOSED FORM SOLUTIONS; GLOBAL VARIAN; HMM-BASED SPEECH SYNTHESIS; NATURAL SPEECH; NOVEL METHODS; OPTIMIZATION CRITERIA; PARAMETER OPTIMIZATION; STATISTICAL MODELING; SYNTHESIS OPTIMIZATION; SYNTHETIC SPEECH; TRAINING CRITERION; TRAINING METHODS; UNIFIED FRAMEWORK;

EID: 67650826181     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2009.4960511     Document Type: Conference Paper
Times cited : (18)

References (12)
  • 1
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Budapest, Hungary, Sep
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. Proc. EUROSPEECH, pp. 2347-2350, Budapest, Hungary, Sep. 1999.
    • (1999) Proc. EUROSPEECH , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 2
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • Istanbul, Turkey, June
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura. Speech parameter generation algorithms for HMM-based speech synthesis. Proc. ICASSP, pp. 1315-1318, Istanbul, Turkey, June 2000.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 3
    • 33749573927 scopus 로고    scopus 로고
    • Reformulating the HMM as a trajetory model by imposing explicit relationships between static and dynamic feature vector sequences
    • H. Zen, K. Tokuda, and T. Kitamura. Reformulating the HMM as a trajetory model by imposing explicit relationships between static and dynamic feature vector sequences. Computer Speech and Language, Vol. 21, pp. 153-173, 2007.
    • (2007) Computer Speech and Language , vol.21 , pp. 153-173
    • Zen, H.1    Tokuda, K.2    Kitamura, T.3
  • 4
    • 33846429403 scopus 로고    scopus 로고
    • Minimum generation error training for HMM-based speech synthesis
    • Toulouse, France, May
    • Y.-J. Wu and R.H. Wang. Minimum generation error training for HMM-based speech synthesis. Proc. ICASSP, pp. 89-92, Toulouse, France, May 2006.
    • (2006) Proc. ICASSP , pp. 89-92
    • Wu, Y.-J.1    Wang, R.H.2
  • 5
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • May
    • T. Toda and K. Tokuda. A speech parameter generation algorithm considering global variance for HMM-based speech synthesis. IEICE Transactions, Vol. E90-D, No. 5, pp. 816-824, May 2007.
    • (2007) IEICE Transactions , vol.E90-D , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 6
    • 51449106803 scopus 로고    scopus 로고
    • Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
    • Las Vegas, USA, Mar
    • Y.-J.Wu, H. Zen, Y. Nankaku and K. Tokuda. Minimum generation error criterion considering global/local variance for HMM-based speech synthesis. Proc. ICASSP, pp. 4621-4624, Las Vegas, USA, Mar. 2008.
    • (2008) Proc. ICASSP , pp. 4621-4624
    • Wu, Y.J.1    Zen, H.2    Nankaku, Y.3    Tokuda, K.4
  • 9
    • 85131821539 scopus 로고
    • Mel-generalized cepstral analysis - a unified approach to speech spectral estimation
    • Yokohama, Japan, Sep
    • K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai. Mel-generalized cepstral analysis - a unified approach to speech spectral estimation. Proc. ICSLP, pp. 1043-1045, Yokohama, Japan, Sep. 1994.
    • (1994) Proc. ICSLP , pp. 1043-1045
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Imai, S.4
  • 10
    • 33646773080 scopus 로고    scopus 로고
    • CMU ARCTIC databases for speech synthesis
    • Technical Report, CMU-LTI-03-177, Language Technologies Institute, Carnegie Mellon University
    • J. Kominek and A. W. Black. CMU ARCTIC databases for speech synthesis. Technical Report, CMU-LTI-03-177, Language Technologies Institute, Carnegie Mellon University, 2003.
    • (2003)
    • Kominek, J.1    Black, A.W.2
  • 11
    • 70349214044 scopus 로고    scopus 로고
    • http://www.speech.cs.cmu.edu/flite/index.html


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.