



Volume E96-D, Issue 11, 2013, Pages 2417-2426

Improving naturalness of HMM-based TTS trained with limited data by temporal decomposition

Author keywords

HMM based TTS; Hybrid TTS; Limited data; Temporal decomposition; Text to speech

Indexed keywords

DECOMPOSITION; LINGUISTICS; MOTIVATION; TRAJECTORIES

EID: 84888617103     PISSN: 09168532     EISSN: 17451361     Source Type: Journal    
DOI: 10.1587/transinf.E96.D.2417     Document Type: Article
Times cited : (2)

References (15)
  • 1
    • 85027201733
    • An HMM-based speech synthesis system applied to English
    • K. Tokuda, H. Zen, and A.W. Black, "An HMM-based speech synthesis system applied to English," Proc. SSW, 2002.
    • (2002) Proc. SSW
    • Tokuda, K.1    Zen, H.2    Black, A.W.3
  • 2
    • 0142007308
    • A training method of average voice model for HMM-based speech synthesis
    • J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "A training method of average voice model for HMM-based speech synthesis," IEICE Trans. Fundamentals, vol.E86-A, no.8, pp.1956-1963, Aug. 2003.
    • (2003) IEICE Trans. Fundamentals , vol.86 , Issue.8 , pp. 1956-1963
    • Yamagishi, J.1    Tamura, M.2    Masuko, T.3    Tokuda, K.4    Kobayashi, T.5
  • 3
  • 4
    • 84883110854
    • Improving HMM Based speech synthesis by reducing over-smoothing problems
    • M. Zhang, J. Tao, H. Jia, and X. Wang, "Improving HMM Based speech synthesis by reducing over-smoothing problems," Proc. ISCSLP, pp.1-4, 2008.
    • (2008) Proc. ISCSLP , pp. 1-4
    • Zhang, M.1    Tao, J.2    Jia, H.3    Wang, X.4
  • 5
    • 38549096029
    • A speech parameter generation algorithm considering Global Variance for HMM-Based speech synthesis
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering Global Variance for HMM-Based speech synthesis," IEICE Trans. Inf. & Syst., vol.E90-D, no.5, pp.816-824, May 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.90 , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 6
    • 85027205844
    • Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2011
    • K. Hashimoto, S. Takaki, K. Oura, and K. Tokuda, "Overview of NIT HMM-based speech synthesis system for Blizzard Challenge 2011," Blizzard Challenge, 2011.
    • (2011) Blizzard Challenge
    • Hashimoto, K.1    Takaki, S.2    Oura, K.3    Tokuda, K.4
  • 7
    • 84871382567
    • A unified trajectory tiling approach to high quality speech rendering
    • Y. Qian, F.K. Soong, and Z. Yan, "A unified trajectory tiling approach to high quality speech rendering," IEEE Trans. Audio Speech Language Process., vol.21, no.2, pp.280-290, 2013.
    • (2013) IEEE Trans. Audio Speech Language Process. , vol.21 , Issue.2 , pp. 280-290
    • Qian, Y.1    Soong, F.K.2    Yan, Z.3
  • 8
    • 0020602364
    • Efficient coding of LPC parameters by temporal decomposition
    • B.S. Atal, "Efficient coding of LPC parameters by temporal decomposition," Proc. ICASSP, pp.81-84, 1983.
    • (1983) Proc. ICASSP , pp. 81-84
    • Atal, B.S.1
  • 9
    • 0038719980
    • Modified restricted temporal decomposition and its application to low rate speech coding
    • P.C. Nguyen, T. Ochi, and M. Akagi, "Modified restricted temporal decomposition and its application to low rate speech coding," IEICE Trans. Inf. & Syst., vol.E86-D, no.3, pp.397-405, March 2003.
    • (2003) IEICE Trans. Inf. & Syst. , vol.86 , Issue.3 , pp. 397-405
    • Nguyen, P.C.1    Ochi, T.2    Akagi, M.3
  • 10
    • 70450207337
    • Efficient modeling of temporal structure of speech for applications in voice transformation
    • P.N. Binh and M. Akagi, "Efficient modeling of temporal structure of speech for applications in voice transformation," Proc. Interspeech, pp.1631-1634, 2009.
    • (2009) Proc. Interspeech , pp. 1631-1634
    • Binh, P.N.1    Akagi, M.2
  • 11
    • 77949913458
    • Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech
    • R.B. Chicote, J. Yamagishi, S. King, J.M. Montero, and J.M. Guarasa, "Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech," Speech Commun., vol.52, pp.394-404, 2010.
    • (2010) Speech Commun. , vol.52 , pp. 394-404
    • Chicote, R.B.1    Yamagishi, J.2    King, S.3    Montero, J.M.4    Guarasa, J.M.5
  • 12
    • 33750915991
    • STRAIGHT, exploration of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds
    • H. Kawahara, "STRAIGHT, exploration of the other aspect of VOCODER: Perceptually isomorphic decomposition of speech sounds," Acoust. Sci. & Tech., vol.27, no.6, pp.349-353, 2006.
    • (2006) Acoust. Sci. & Tech. , vol.27 , Issue.6 , pp. 349-353
    • Kawahara, H.1
  • 14
    • 84870253990
    • Design of Vietnamese speech corpus and current status
    • L.C. Mai and D.N. Duc, "Design of Vietnamese speech corpus and current status," Proc. ISCSLP, pp.748-758, 2006.
    • (2006) Proc. ISCSLP , pp. 748-758
    • Mai, L.C.1    Duc, D.N.2
  • 15
    • 71249108361
    • An HMM-based Vietnamese speech synthesis system
    • T.T. Vu, M.C. Luong, and S. Nakamura, "An HMM-based Vietnamese speech synthesis system," Proc. O-COCOSDA, pp.116-121, 2009.
    • (2009) Proc. O-COCOSDA , pp. 116-121
    • Vu, T.T.1    Luong, M.C.2    Nakamura, S.3


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.