SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2008, Pages 581-584

Robustness of HMM-based speech synthesis

(3) Yamagishi, Junichi a Ling, Zhenhua a,b King, Simon a

a UNIVERSITY OF EDINBURGH (United Kingdom)

b UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA (China)

Author keywords

HMM; HTS; Speech synthesis; Unit selection

Indexed keywords

HIGH QUALITY; HMM; HMM-BASED SPEECH SYNTHESIS; HTS; RESEARCH TOPICS; SPEECH DATA; SYNTHESIS METHOD; SYNTHESIS TECHNIQUES; TEXT TO SPEECH SYNTHESIS; TRAINING METHODS; UNIT SELECTION;

SPEECH SYNTHESIS; STORMS;

SPEECH COMMUNICATION;

EID: 84867223798 PISSN: None EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (38)

References (23)

1
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. EUROSPEECH-99, Sep. 1999, pp. 2374-2350.
- Proc. EUROSPEECH-99, Sep. 1999 , pp. 2374-12350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

2
- 33847129573
- Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
- Feb.
- J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Trans. Inf. & Syst., vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.2 , pp. 533-543
- Yamagishi, J.¹ Kobayashi, T.²

3
- 51449114529
- A style control technique for HMM-based expressive speech synthesis
- Sep.
- T. Nose, J. Yamagishi, and T. Kobayashi, "A style control technique for HMM-based expressive speech synthesis,," IEICE Trans. Inf. & Syst., vol. E90-D, no. 9, pp. 1406-1413, Sep. 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.9 , pp. 1406-1413
- Nose, T.¹ Yamagishi, J.² Kobayashi, T.³

4
- 78649279703
- Combining statistical parameteric speech synthesis and unit-selection for automatic voice cloning
- M. Aylett and J. Yamagishi, "Combining statistical parameteric speech synthesis and unit-selection for automatic voice cloning," in Proc. LangTech 2008, Feb. 2008.
- Proc. LangTech 2008, Feb. 2008
- Aylett, M.¹ Yamagishi, J.²

5
- 0029765811
- Unit selection in a concatenative speech synthesis system using a large speech database, in
- A. Hunt and A. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," in Proc. ICASSP-96, May 1996, pp. 373-376.
- Proc. ICASSP-96, May 1996 , pp. 373-376
- Hunt, A.¹ Black, A.²

6
- 34547503417
- HMM-based unit selection using frame sized speech segments
- Z.-H. Ling and R.-H. Wang, "HMM-based unit selection using frame sized speech segments," in Proc. Interspeech 2006, Sep. 2006, pp. 2034-2037.
- Proc. Interspeech 2006, Sep. 2006 , pp. 2034-2037
- Ling, Z.-H.¹ Wang, R.-H.²

7
- 34547612590
- HMM-based hierarchical unit selection combining Kullback-Leibler divergence with likelihood criterion
- -, "HMM-based hierarchical unit selection combining Kullback-Leibler divergence with likelihood criterion," in Proc. ICASSP 2007, Apr. 2007, pp. 1245-1248.
- Proc. ICASSP 2007, Apr. 2007 , pp. 1245-1248
- Ling, Z.-H.¹ Wang, R.-H.²

8
- 78649302225
- The USTC and iFlytek speech synthesis systems for Blizzard Challenge 2007
- Z.-H. Ling, L. Qin, H. Lu, Y. Gao, L.-R. Dai, R.-H. Wang, Y. Jiang, Z.-W. Zhao, J.-H. Y. J. Chen, and G.-P. Hu, "The USTC and iFlytek speech synthesis systems for Blizzard Challenge 2007," in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
- Proc. BLZ3-2007 (In Proc. SSW6), Aug. 2007
- Ling, Z.-H.¹ Qin, L.² Lu, H.³ Gao, Y.⁴ Dai, L.-R.⁵ Wang, R.-H.⁶ Jiang, Y.⁷ Zhao, Z.-W.⁸ Chen, J.-H.Y.J.⁹ Hu, G.-P.¹⁰

9
- 0003571407
- University of Edinburgh
- A. Black, P. Taylor, and R. Caley, The Festival Speech Synthesis System, University of Edinburgh, 1999.
- (1999) The Festival Speech Synthesis System
- Black, A.¹ Taylor, P.² Caley, R.³

10
- 34047123652
- Multisyn: Opendomain unit selection for the Festival speech synthesis system
- R. A. J. Clark, K. Richmond, and S. King, "Multisyn: Opendomain unit selection for the Festival speech synthesis system," Speech Communication, vol. 49, no. 4, pp. 317-330, 2007.
- (2007) Speech Communication , vol.49 , Issue.4 , pp. 317-330
- Clark, R.A.J.¹ Richmond, K.² King, S.³

11
- 78049372160
- Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007
- J. Yamagishi, H. Zen, T. Toda, and K. Tokuda, "Speaker-independent HMM-based speech synthesis system - HTS-2007 system for the Blizzard Challenge 2007," in Proc. BLZ3-2007 (in Proc. SSW6), Aug. 2007.
- Proc. BLZ3-2007 (In Proc. SSW6), Aug. 2007
- Yamagishi, J.¹ Zen, H.² Toda, T.³ Tokuda, K.⁴

12
- 51449103919
- Performance evaluation of the speaker-independent HMM-based speech synthesis system HTS-2007 for the Blizzard Challenge 2007
- J. Yamagishi, T. Nose, H. Zen, T. Toda, and K. Tokuda, "Performance evaluation of the speaker-independent HMM-based speech synthesis system HTS-2007 for the Blizzard Challenge 2007," in Proc. ICASSP 2008, Apr. 2008.
- Proc. ICASSP 2008, Apr. 2008
- Yamagishi, J.¹ Nose, T.² Zen, H.³ Toda, T.⁴ Tokuda, K.⁵

13
- 78649235947
- Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
- accept
- J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Trans. Speech, Audio & Language Process., 2007, (accept).
- (2007) IEEE Trans. Speech, Audio & Language Process.
- Yamagishi, J.¹ Kobayashi, T.² Nakano, Y.³ Ogata, K.⁴ Isogai, J.⁵

14
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigné, A.³

15
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- May
- T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Trans. Inf. & Syst., vol. E90-D, no. 5, pp. 816-824, May 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

16
- 79952258981
- K. Tokuda, H. Zen, J. Yamagishi, T. Masuko, S. Sako, A. Black, and T. Nose, The HMM-based speech synthesis system (HTS) Version 2.0.1, http://hts.sp.nitech.ac.jp/.
- The HMM-based Speech Synthesis System (HTS) Version 2.0.1
- Tokuda, K.¹ Zen, H.² Yamagishi, J.³ Masuko, T.⁴ Sako, S.⁵ Black, A.⁶ Nose, T.⁷

17
- 33846429403
- Minimum generation error training for HMM-based speech synthesis
- Y. Wu and R.-H. Wang, "Minimum generation error training for HMM-based speech synthesis," in Proc. ICASSP 2006, May 2006, pp. 89-92.
- Proc. ICASSP 2006, May 2006 , pp. 89-92
- Wu, Y.¹ Wang, R.-H.²

18
- 11144317887
- Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency
- Dec.
- D. Arifianto, T. Tanaka, T. Masuko, and T. Kobayashi, "Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency," IEICE Trans. Inf. & Syst., vol. E87-D, no. 12, pp. 2812-2820, Dec. 2004.
- (2004) IEICE Trans. Inf. & Syst. , vol.E87-D , Issue.12 , pp. 2812-2820
- Arifianto, D.¹ Tanaka, T.² Masuko, T.³ Kobayashi, T.⁴

19
- 84928118106
- Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity
- H. Kawahara, H. Katayose, A. Cheveigné, and R. Patterson, "Fixed point analysis of frequency to instantaneous frequency mapping for accurate estimation of F0 and periodicity," in Proc. EUROSPEECH 1999, Sep. 1999, pp. 2781-2784.
- Proc. EUROSPEECH 1999, Sep. 1999 , pp. 2781-2784
- Kawahara, H.¹ Katayose, H.² Cheveigné, A.³ Patterson, R.⁴

20
- 0001455934
- A robust algorithm for pitch tracking (RAPT)
- W. Kleijn and K. Paliwal, Eds. Elsevier
- D. Talkin, "A robust algorithm for pitch tracking (RAPT)," in Speech Coding and Synthesis, W. Kleijn and K. Paliwal, Eds. Elsevier, 1995, pp. 495-518.
- (1995) Speech Coding and Synthesis , pp. 495-518
- Talkin, D.¹

21
- 85030493378
- Synthesis of regional English using a keyword lexicon
- Sep.
- S. Fitt and S. Isard, "Synthesis of regional English using a keyword lexicon," in Proc. Eurospeech 1999, vol. 2, Sep. 1999, pp. 823-826.
- (1999) Proc. Eurospeech 1999 , vol.2 , pp. 823-826
- Fitt, S.¹ Isard, S.²

22
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, 1998.
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.¹

23
- 33846405723
- Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
- Jan.
- H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005," IEICE Trans. Inf. & Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.1 , pp. 325-333
- Zen, H.¹ Toda, T.² Nakamura, M.³ Tokuda, K.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.