메뉴 건너뛰기




Volumn , Issue , 2008, Pages 3957-3960

Performance evaluation of the speaker-independent HMM-based speech synthesis system "HTS-2007" for the Blizzard Challenge 2007

Author keywords

Blizzard Challenge; HMM; HTS; Speaker adaptation; Speech synthesis

Indexed keywords

BLIZZARD CHALLENGE; HMM; HTS; SPEAKER ADAPTATION;

EID: 51449103919     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2008.4518520     Document Type: Conference Paper
Times cited : (12)

References (17)
  • 1
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Sept
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. EUROSPEECH-99, Sept. 1999, pp. 2374-2350.
    • (1999) Proc. EUROSPEECH-99 , pp. 2374-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 2
    • 51449121285 scopus 로고    scopus 로고
    • K. Tokuda, H. Zen, J. Yamagishi, T. Masuko, S. Sako, A.B. Black, and T. Nose, The HMM-based speech synthesis system (HTS) Version 2.0.1
    • K. Tokuda, H. Zen, J. Yamagishi, T. Masuko, S. Sako, A.B. Black, and T. Nose, The HMM-based speech synthesis system (HTS) Version 2.0.1, http://hts.sp.nitech.ac.jp/.
  • 3
    • 33846405723 scopus 로고    scopus 로고
    • Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
    • Jan
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4
  • 4
    • 51449114385 scopus 로고    scopus 로고
    • The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006
    • Sept
    • H. Zen, T. Toda, and K. Tokuda, "The Nitech-NAIST HMM-based speech synthesis system for the Blizzard Challenge 2006," in Proc. Blizzard Challenge 2006, Sept. 2006.
    • (2006) Proc. Blizzard Challenge 2006
    • Zen, H.1    Toda, T.2    Tokuda, K.3
  • 5
    • 77953693469 scopus 로고    scopus 로고
    • Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007
    • Aug
    • J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, "Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007," in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
    • (2007) Proc. BLZ3-2007 (in Proc. SSW6)
    • Yamagishi, J.1    Zen, H.2    Toda, T.3    Tokuda, K.4
  • 6
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigné, A.3
  • 7
    • 44449177634 scopus 로고    scopus 로고
    • A hidden semi-Markov model-based speech synthesis system
    • May
    • H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis system," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 825-834, May 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 825-834
    • Zen, H.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 8
    • 38549096029 scopus 로고    scopus 로고
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • May
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 9
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • Mar
    • M.J.F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, pp. 272-281, Mar. 1999.
    • (1999) IEEE Trans. Speech Audio Process , vol.7 , pp. 272-281
    • Gales, M.J.F.1
  • 10
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • Feb
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
    • (2007) IEICE Trans. Inf. & Syst , vol.E90-D , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 11
    • 34547529978 scopus 로고    scopus 로고
    • Model adaptation approach to speech synthesis with diverse voices and styles
    • Apr
    • J. Yamagishi, T. Kobayashi, M. Tachibana, K. Ogata, and Y. Nakano, "Model adaptation approach to speech synthesis with diverse voices and styles," in Proc. ICASSP 2007, Apr. 2007, pp. 1233-1236.
    • (2007) Proc. ICASSP 2007 , pp. 1233-1236
    • Yamagishi, J.1    Kobayashi, T.2    Tachibana, M.3    Ogata, K.4    Nakano, Y.5
  • 12
    • 34547525896 scopus 로고    scopus 로고
    • Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis
    • Sept
    • K. Ogata, M. Tachibana, J. Yamagishi, and T. Kobayashi, "Acoustic model training based on linear transformation and MAP modification for HSMM-based speech synthesis," in Proc. ICSLP 2006, Sept. 2006, pp. 1328-1331.
    • (2006) Proc. ICSLP 2006 , pp. 1328-1331
    • Ogata, K.1    Tachibana, M.2    Yamagishi, J.3    Kobayashi, T.4
  • 14
    • 0035279111 scopus 로고    scopus 로고
    • A structural Bayes approach to speaker adaptation
    • Mar
    • K. Shinoda and C.H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, pp. 276-287, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , pp. 276-287
    • Shinoda, K.1    Lee, C.H.2
  • 15
    • 51449120657 scopus 로고    scopus 로고
    • J. Ni, T. Hirai, H. Kawai, T. Toda, K. Tokuda, M. Tsuzaki, R.M∼ aia S.S∼akai, and S. Nakamura, Atrecss - atr english speech corpus for speech synthesis, in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
    • J. Ni, T. Hirai, H. Kawai, T. Toda, K. Tokuda, M. Tsuzaki, R.M∼ aia S.S∼akai, and S. Nakamura, "Atrecss - atr english speech corpus for speech synthesis," in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.