Volume , Issue , 2008, Pages 581-584

Robustness of HMM-based speech synthesis

Author keywords

HMM; HTS; Speech synthesis; Unit selection

Indexed keywords

HIGH QUALITY; HMM; HMM-BASED SPEECH SYNTHESIS; HTS; RESEARCH TOPICS; SPEECH DATA; SYNTHESIS METHOD; SYNTHESIS TECHNIQUES; TEXT TO SPEECH SYNTHESIS; TRAINING METHODS; UNIT SELECTION;

EID: 84867223798     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 38

References (23)
  • 2
    • 33847129573
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 3
    • 51449114529
    • A style control technique for HMM-based expressive speech synthesis
    • T. Nose, J. Yamagishi, and T. Kobayashi, "A style control technique for HMM-based expressive speech synthesis," IEICE Trans. Inf. & Syst., vol. E90-D, no. 9, pp. 1406-1413, Sep. 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.9 , pp. 1406-1413
    • Nose, T.1    Yamagishi, J.2    Kobayashi, T.3
  • 4
    • 78649279703
    • Combining statistical parameteric speech synthesis and unit-selection for automatic voice cloning
    • M. Aylett and J. Yamagishi, "Combining statistical parameteric speech synthesis and unit-selection for automatic voice cloning," in Proc. LangTech 2008, Feb. 2008.
    • Proc. LangTech 2008, Feb. 2008
    • Aylett, M.1    Yamagishi, J.2
  • 5
    • 0029765811
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP-96, May 1996, pp. 373-376.
    • Proc. ICASSP-96, May 1996 , pp. 373-376
    • Hunt, A.1    Black, A.2
  • 6
    • 34547503417
    • HMM-based unit selection using frame sized speech segments
    • Z.-H. Ling and R.-H. Wang, "HMM-based unit selection using frame sized speech segments," in Proc. Interspeech 2006, Sep. 2006, pp. 2034-2037.
    • Proc. Interspeech 2006, Sep. 2006 , pp. 2034-2037
    • Ling, Z.-H.1    Wang, R.-H.2
  • 7
    • 34547612590
    • HMM-based hierarchical unit selection combining Kullback-Leibler divergence with likelihood criterion
    • Z.-H. Ling and R.-H. Wang, "HMM-based hierarchical unit selection combining Kullback-Leibler divergence with likelihood criterion," in Proc. ICASSP 2007, Apr. 2007, pp. 1245-1248.
    • Proc. ICASSP 2007, Apr. 2007 , pp. 1245-1248
    • Ling, Z.-H.1    Wang, R.-H.2
  • 10
    • 34047123652
    • Multisyn: Open-domain unit selection for the Festival speech synthesis system
    • R. A. J. Clark, K. Richmond, and S. King, "Multisyn: Open-domain unit selection for the Festival speech synthesis system," Speech Communication, vol. 49, no. 4, pp. 317-330, 2007.
    • (2007) Speech Communication , vol.49 , Issue.4 , pp. 317-330
    • Clark, R.A.J.1    Richmond, K.2    King, S.3
  • 11
  • 12
    • 51449103919
    • Performance evaluation of the speaker-independent HMM-based speech synthesis system HTS-2007 for the Blizzard Challenge 2007
    • J. Yamagishi, T. Nose, H. Zen, T. Toda, and K. Tokuda, "Performance evaluation of the speaker-independent HMM-based speech synthesis system HTS-2007 for the Blizzard Challenge 2007," in Proc. ICASSP 2008, Apr. 2008.
    • Proc. ICASSP 2008, Apr. 2008
    • Yamagishi, J.1    Nose, T.2    Zen, H.3    Toda, T.4    Tokuda, K.5
  • 13
  • 14
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigné, A.3
  • 15
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 17
    • 33846429403
    • Minimum generation error training for HMM-based speech synthesis
    • Y. Wu and R.-H. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. ICASSP 2006, May 2006, pp. 89-92.
    • Proc. ICASSP 2006, May 2006 , pp. 89-92
    • Wu, Y.1    Wang, R.-H.2
  • 18
    • 11144317887
    • Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency
    • D. Arifianto, T. Tanaka, T. Masuko, and T. Kobayashi, "Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency," IEICE Trans. Inf. & Syst., vol. E87-D, no. 12, pp. 2812-2820, Dec. 2004.
    • (2004) IEICE Trans. Inf. & Syst. , vol.E87-D , Issue.12 , pp. 2812-2820
    • Arifianto, D.1    Tanaka, T.2    Masuko, T.3    Kobayashi, T.4
  • 19
    • 84928118106
    • Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity
    • H. Kawahara, H. Katayose, A. Cheveigné, and R. Patterson, "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity," in Proc. EUROSPEECH 1999, Sep. 1999, pp. 2781-2784.
    • Proc. EUROSPEECH 1999, Sep. 1999 , pp. 2781-2784
    • Kawahara, H.1    Katayose, H.2    Cheveigné, A.3    Patterson, R.4
  • 20
    • 0001455934
    • A robust algorithm for pitch tracking (RAPT)
    • D. Talkin, "A robust algorithm for pitch tracking (RAPT)," in Speech Coding and Synthesis, W. Kleijn and K. Paliwal, Eds. Elsevier, 1995, pp. 495-518.
    • (1995) Speech Coding and Synthesis , pp. 495-518
    • Talkin, D.1
  • 21
    • 85030493378
    • Synthesis of regional English using a keyword lexicon
    • S. Fitt and S. Isard, "Synthesis of regional English using a keyword lexicon," in Proc. Eurospeech 1999, vol. 2, Sep. 1999, pp. 823-826.
    • (1999) Proc. Eurospeech 1999 , vol.2 , pp. 823-826
    • Fitt, S.1    Isard, S.2
  • 22
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.1
  • 23
    • 33846405723
    • Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.