SCOPUS 정보 검색 플랫폼

Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010

Volumn , Issue , 2010, Pages 853-856

Conversational spontaneous speech synthesis using average voice model

(3) Koriyama, Tomoki a Nose, Takashi a Kobayashi, Takao a

Author keywords

Average voice model; Conversational speech; HMM based speech synthesis; Speaker adaptation; Spontaneous speech; Style adaptation

Indexed keywords

HIDDEN MARKOV MODELS; SPEECH SYNTHESIS; SPEECH COMMUNICATION;

AVERAGE VOICE MODELS; CONVERSATIONAL SPEECH; HMM-BASED SPEECH SYNTHESIS; SPEAKER ADAPTATION; SPONTANEOUS SPEECH; STYLE ADAPTATION; AVERAGE-VOICE; HIDDEN MARKOV MODEL-BASED SPEECH SYNTHESIS; HIDDEN-MARKOV MODELS; MODEL TRAINING; TRAINING DATA;

SPEECH COMMUNICATION; SPEECH SYNTHESIS;

EID: 79959835828 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (15)

1
- 3042741062
- Toward spontaneous speech synthesis-utilizing language model information in TTS
- S. Werner, M. Eichner, M. Wolff, and R. Hoffmann, "Toward spontaneous speech synthesis-utilizing language model information in TTS," IEEE Trans. Speech Audio Processing, vol. 12, no. 4, pp. 436-445, 2004.
- (2004) IEEE Trans. Speech Audio Processing , vol.12 , Issue.4 , pp. 436-445
- Werner, S.¹ Eichner, M.² Wolff, M.³ Hoffmann, R.⁴

2
- 79959842873
- Toward hidden Markov model-based spontaneous speech synthesis
- T. Akagawa, K. Iwano, and S. Furui, "Toward hidden Markov model-based spontaneous speech synthesis," J. Acoust. Soc. America, vol. 120, pp. 3037-3038, 2006.
- (2006) J. Acoust. Soc. America , vol.120 , pp. 3037-3038
- Akagawa, T.¹ Iwano, K.² Furui, S.³

3
- 70349241164
- Prosody control for HMM-based Japanese TTS
- K. Iwano, M. Yamada, T. Togawa, and S. Furui, "Prosody control for HMM-based Japanese TTS," Text to speech synthesis: new paradigms and advances, p. 155, 2005.
- (2005) Text to Speech Synthesis: New Paradigms and Advances , pp. 155
- Iwano, K.¹ Yamada, M.² Togawa, T.³ Furui, S.⁴

4
- 79959855113
- A study on the statistical models for HMM-based spontaneous speech synthesis
- T. Akagawa, K. Iwano, and S. Furui, "A study on the Statistical models for HMM-based spontaneous speech synthesis," IEICE technical report (in Japanese), vol. 107, no. 77, pp. 13-18, 2007.
- (2007) IEICE Technical Report (in Japanese) , vol.107 , Issue.77 , pp. 13-18
- Akagawa, T.¹ Iwano, K.² Furui, S.³

5
- 79959817255
- Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation
- C. Lee, C. Wu, and J. Guo, "Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation," INTERSPEECH, 2010.
- (2010) INTERSPEECH
- Lee, C.¹ Wu, C.² Guo, J.³

6
- 24144437793
- Developments in corpus-based speech synthesis: Approaching natural conversational speech
- DOI 10.1093/ietisy/e88-d.3.376
- N. Campbell, "Developments in corpus-based speech synthesis: approaching natural conversational speech," IEICE Trans. Inf. & Syst., vol. 88, no. 3, pp. 376-383, 2005. (Pubitemid 41228045)
- (2005) IEICE Transactions on Information and Systems , vol.E88-D , Issue.3 , pp. 376-383
- Campbell, N.¹

7
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- Sept.
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. EUROSPEECH, Sept. 1999, pp. 2347-2350.
- (1999) Proc. EUROSPEECH , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

8
- 67650854725
- Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
- Jan.
- J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Trans. Audio, Speech, and Language Process., vol. 17, no. 1, pp. 66-83, Jan. 2009.
- (2009) IEEE Trans. Audio, Speech, and Language Process. , vol.17 , Issue.1 , pp. 66-83
- Yamagishi, J.¹ Kobayashi, T.² Nakano, Y.³ Ogata, K.⁴ Isogai, J.⁵

9
- 51449098017
- Speaker and style adaptation using average voice model for style control in hmm-based speech synthesis
- M. Tachibana, S. Izawa, T. Nose, and T. Kobayashi, "Speaker and style adaptation using average voice model for style control in hmm-based speech synthesis," in ICASSP, 2008.
- (2008) ICASSP
- Tachibana, M.¹ Izawa, S.² Nose, T.³ Kobayashi, T.⁴

10
- 84865755727
- Coupus of Spontaneous Japanese http://www.kokken.go.jp/katsudo/corpus.
- Coupus of Spontaneous Japanese

11
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- Aug.
- A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR japanese speech database as a tool of speech recognition and synthesis," Speech Communication, vol. 9, no. 4, pp. 357-363, Aug. 1990.
- (1990) Speech Communication , vol.9 , Issue.4 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, S.⁴ Kuwabara, H.⁵ Shikano, K.⁶

12
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- Apr.
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigne, A.³

13
- 44449177634
- A hidden semi-Markov model-based speech synthesis
- H. Zen, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "A hidden semi-Markov model-based speech synthesis," IEICE Trans. Inf. & Syst., vol. 90, no. 5, pp. 825-834, 2007.
- (2007) IEICE Trans. Inf. & Syst. , vol.90 , Issue.5 , pp. 825-834
- Zen, H.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

14
- 0038042801
- A context clustering technique for average voice models
- J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "A context clustering technique for average voice models," IEICE Trans. Inf. & Syst., vol. 86, no. 3, pp. 534-542, 2003.
- (2003) IEICE Trans. Inf. & Syst. , vol.86 , Issue.3 , pp. 534-542
- Yamagishi, J.¹ Tamura, M.² Masuko, T.³ Tokuda, K.⁴ Kobayashi, T.⁵

15
- 0030362995
- A compact model for speaker-adaptive training
- T. Aanastasakos, "A compact model for speaker-adaptive training," ICSLP, vol. 2, 1996.
- (1996) ICSLP , vol.2
- Aanastasakos, T.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.