2010, Pages 825-828

Global variance modeling on the log power spectrum of LSPs for HMM-based speech synthesis

Author keywords

Global variance; Hidden Markov model; Power spectrum; Speech synthesis

Indexed keywords

HIDDEN MARKOV MODELS; MAXIMUM LIKELIHOOD; POWER SPECTRUM; SPEECH COMMUNICATION; SPEECH SYNTHESIS

EID: 79959847301     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 14

References (13)
  • 1
    • 85009139544
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Eurospeech, 1999, pp. 2347-2350.
    • (1999) Eurospeech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 2
    • 0033708106
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in ICASSP, vol. 3, 2000, pp. 1315-1318.
    • (2000) ICASSP , vol.3 , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 4
    • 33846405723
    • Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
    • DOI 10.1093/ietisy/e90-1.1.325
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, 2007. (Pubitemid 46145336)
    • (2007) IEICE Transactions on Information and Systems , vol.E90-D , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4
  • 5
    • 34547496747
    • USTC system for Blizzard Challenge 2006: An improved HMM-based speech synthesis method
    • Z. Ling, Y. Wu, Y. Wang, L. Qin, and R. Wang, "USTC system for Blizzard Challenge 2006: an improved HMM-based speech synthesis method," in Blizzard Challenge Workshop, 2006.
    • (2006) Blizzard Challenge Workshop
    • Ling, Z.1    Wu, Y.2    Wang, Y.3    Qin, L.4    Wang, R.5
  • 7
    • 70450161678
    • Rich context modeling for high quality HMM-based TTS
    • Z.-J. Yan, Y. Qian, and F. K. Soong, "Rich context modeling for high quality HMM-based TTS," in Interspeech, 2009, pp. 1755-1758.
    • (2009) Interspeech , pp. 1755-1758
    • Yan, Z.-J.1    Qian, Y.2    Soong, F.K.3
  • 8
    • 34547503417
    • HMM-based unit selection using frame sized speech segments
    • Z.-H. Ling and R.-H. Wang, "HMM-based unit selection using frame sized speech segments," in Interspeech, 2006, pp. 2034-2037.
    • (2006) Interspeech , pp. 2034-2037
    • Ling, Z.-H.1    Wang, R.-H.2
  • 9
    • 33745200051
    • Speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "Speech parameter generation algorithm considering global variance for HMM-based speech synthesis," in Interspeech, 2005, pp. 2801-2804.
    • (2005) Interspeech , pp. 2801-2804
    • Toda, T.1    Tokuda, K.2
  • 10
    • 51449106803
    • Minimum generation error criterion considering global/local variance for HMM-based speech synthesis
    • Y.-J. Wu, H. Zen, Y. Nankaku, and K. Tokuda, "Minimum generation error criterion considering global/local variance for HMM-based speech synthesis," in ICASSP, 2008, pp. 4621-4624.
    • (2008) ICASSP , pp. 4621-4624
    • Wu, Y.-J.1    Zen, H.2    Nankaku, Y.3    Tokuda, K.4
  • 11
    • 0001810975
    • Line spectrum representation of linear predictive coefficients of speech signals
    • F. Itakura, "Line spectrum representation of linear predictive coefficients of speech signals," J. Acoust. Soc. Am., vol. 57, p. S35, 1975.
    • (1975) J. Acoust. Soc. Am. , vol.57
    • Itakura, F.1
  • 12
    • 0032673049
    • Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.