SCOPUS 정보 검색 플랫폼

Volumn 55, Issue 2, 2013, Pages 347-357

An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model

(2) Nose, Takashi a Kobayashi, Takao a

a TOKYO INSTITUTE OF TECHNOLOGY (Japan)

Author keywords

HMM based expressive speech synthesis; Multiple regression global variance model; Multiple regression HSMM; Style control; Style intensity

Indexed keywords

COMPENSATION METHOD; CONTROL TECHNIQUES; EMOTIONAL EXPRESSIONS; EXPRESSIVE SPEECH SYNTHESIS; HIDDEN SEMI-MARKOV MODELS; MODEL PARAMETERS; MODEL TRAINING; MULTIPLE-REGRESSION HSMM; NATURAL SPEECH; SPEAKING STYLES; SPEECH SYNTHESIS SYSTEM; STYLE INTENSITY; SYNTHETIC SPEECH; TRAINING DATA; VARIANCE MODELS;

HIDDEN MARKOV MODELS; REGRESSION ANALYSIS;

SPEECH SYNTHESIS;

EID: 84870246600 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/j.specom.2012.09.003 Document Type: Article

Times cited : (21)

References (29)

1
- 0030362995
- A compact model for speaker adaptive training
- Anastasakos, T.; McDonough, J.; Schwartz, R.; Makhoul, J.; 1996. A compact model for speaker adaptive training. In: Proc. ICSLP-96, pp. 1137-1140.
- (1996) Proc. ICSLP-96 , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

2
- 0037382510
- Describing the emotional states that are expressed in speech
- R. Cowie, and R.R. Cornelius Describing the emotional states that are expressed in speech Speech Comm. 40 1-2 2003 5 32
- (2003) Speech Comm. , vol.40 , Issue.12 , pp. 5-32
- Cowie, R.¹ Cornelius, R.R.²

3
- 23144458652
- Expressive speech: Production, perception and application to speech synthesis
- D. Erickson Expressive speech: production, perception and application to speech synthesis Acoust. Sci. Tech. 26 4 2005 317 325
- (2005) Acoust. Sci. Tech. , vol.26 , Issue.4 , pp. 317-325
- Erickson, D.¹

4
- 0034855363
- Multiple-regression hidden Markov model
- Fujinaga, K.; Nakai, M.; Shimodaira, H.; Sagayama, S.; 2001. Multiple-regression hidden Markov model. In: Proc. ICASSP 2001, pp. 513-516.
- (2001) Proc. ICASSP 2001 , pp. 513-516
- Fujinaga, K.¹ Nakai, M.² Shimodaira, H.³ Sagayama, S.⁴

5
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. Gales Maximum likelihood linear transformations for HMM-based speech recognition Comput. Speech Language 12 1998 75 98
- (1998) Comput. Speech Language , vol.12 , pp. 75-98
- Gales, M.¹

6
- 0034227757
- Cluster adaptive training of hidden Markov models
- M. Gales Cluster adaptive training of hidden Markov models IEEE Trans. Speech Audio Process 8 4 2000 417 428
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.4 , pp. 417-428
- Gales, M.¹

7
- 0037380318
- A corpus-based speech synthesis system with emotion
- A. Iida, N. Campbell, F. Higuchi, and M. Yasumura A corpus-based speech synthesis system with emotion Speech Comm. 40 1-2 2003 161 187
- (2003) Speech Comm. , vol.40 , Issue.12 , pp. 161-187
- Iida, A.¹ Campbell, N.² Higuchi, F.³ Yasumura, M.⁴

8
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds Speech Comm. 27 3-4 1999 187 207
- (1999) Speech Comm. , vol.27 , Issue.34 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigne, A.³

9
- 84865794815
- On the use of extended context for HMM-based spontaneous conversational speech synthesis
- Koriyama, T.; Nose, T.; Kobayashi, T.; 2011. On the use of extended context for HMM-based spontaneous conversational speech synthesis. In: Proc. INTERSPEECH 2011, pp. 2657-2660.
- (2011) Proc. INTERSPEECH 2011 , pp. 2657-2660
- Koriyama, T.¹ Nose, T.² Kobayashi, T.³

10
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano ATR Japanese speech database as a tool of speech recognition and synthesis Speech Comm. 9 4 1990 357 363
- (1990) Speech Comm. , vol.9 , Issue.4 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, S.⁴ Kuwabara, H.⁵ Shikano, K.⁶

11
- 85009069226
- A style control technique for HMM-based speech synthesis
- Miyanaga, K.; Masuko, T.; Kobayashi, T.; 2004. A style control technique for HMM-based speech synthesis. In: Proc. INTERSPEECH 2004-ICSLP, pp. 1437-1440.
- (2004) Proc. INTERSPEECH 2004-ICSLP , pp. 1437-1440
- Miyanaga, K.¹ Masuko, T.² Kobayashi, T.³

12
- 29144493408
- Human walking motion synthesis with desired pace and stride length based on HSMM
- N. Niwase, J. Yamagishi, and T. Kobayashi Human walking motion synthesis with desired pace and stride length based on HSMM IEICE Trans. Inf. Syst. E88-D 11 2005 2492 2499
- (2005) IEICE Trans. Inf. Syst. , vol.88 , Issue.11 , pp. 2492-2499
- Niwase, N.¹ Yamagishi, J.² Kobayashi, T.³

13
- 67650793657
- HMM-based style control for expressive speech synthesis with arbitrary speaker's voice using model adaptation
- T. Nose, M. Tachibana, and T. Kobayashi HMM-based style control for expressive speech synthesis with arbitrary speaker's voice using model adaptation IEICE Trans. Inf. Syst. E92-D 3 2009 489 497
- (2009) IEICE Trans. Inf. Syst. , vol.92 , Issue.3 , pp. 489-497
- Nose, T.¹ Tachibana, M.² Kobayashi, T.³

14
- 51449114529
- A style control technique for HMM-based expressive speech synthesis
- T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi A style control technique for HMM-based expressive speech synthesis IEICE Trans. Inf. Syst. E90-D 9 2007 1406 1413
- (2007) IEICE Trans. Inf. Syst. , vol.90 , Issue.9 , pp. 1406-1413
- Nose, T.¹ Yamagishi, J.² Masuko, T.³ Kobayashi, T.⁴

15
- 34047275265
- The IBM expressive text-to-speech synthesis system for American English
- J.F. Pitrelli, R. Bakis, E.M. Eide, R. Fernandez, W. Hamza, and M.A. Picheny The IBM expressive text-to-speech synthesis system for American English IEEE Trans. Audio Speech Language Process. 14 4 2006 1099 1108
- (2006) IEEE Trans. Audio Speech Language Process. , vol.14 , Issue.4 , pp. 1099-1108
- Pitrelli, J.F.¹ Bakis, R.² Eide, E.M.³ Fernandez, R.⁴ Hamza, W.⁵ Picheny, M.A.⁶

16
- 0001309343
- Cue utilization in emotion attribution from auditory stimuli
- K. Scherer, and J. Oshinsky Cue utilization in emotion attribution from auditory stimuli Motivat. Emot. 1 4 1977 331 346
- (1977) Motivat. Emot. , vol.1 , Issue.4 , pp. 331-346
- Scherer, K.¹ Oshinsky, J.²

17
- 84971539709
- Emotional speech synthesis: A review
- Schröder, M.; 2001. Emotional speech synthesis: a review. In: Proc. EUROSPEECH 2001, pp. 561-564.
- (2001) Proc. EUROSPEECH 2001 , pp. 561-564
- Schröder, M.¹

18
- 84908477401
- Hidden Markov model-based speech emotion recognition
- Schuller, B.; Rigoll, G.; Lang, M.; 2003. Hidden Markov model-based speech emotion recognition. In: Proc. ICASSP 2003, vol. 1, pp. 401-404.
- (2003) Proc. ICASSP 2003 , vol.1 , pp. 401-404
- Schuller, B.¹ Rigoll, G.² Lang, M.³

19
- 0033906251
- MDL-based context-dependent subword modeling for speech recognition
- K. Shinoda, and T. Watanabe MDL-based context-dependent subword modeling for speech recognition J. Acoust. Soc. Jpn. (E) 21 2 2000 79 86
- (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , Issue.2 , pp. 79-86
- Shinoda, K.¹ Watanabe, T.²

20
- 29144475179
- Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing
- M. Tachibana, J. Yamagishi, T. Masuko, and T. Kobayashi Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing IEICE Trans. Inf. Syst. E88-D 11 2005 2484 2491
- (2005) IEICE Trans. Inf. Syst. , vol.88 , Issue.11 , pp. 2484-2491
- Tachibana, M.¹ Yamagishi, J.² Masuko, T.³ Kobayashi, T.⁴

21
- 38549096029
- A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
- T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inf. Syst. E90-D 5 2007 816 824
- (2007) IEICE Trans. Inf. Syst. , vol.90 , Issue.5 , pp. 816-824
- Toda, T.¹ Tokuda, K.²

22
- 84982961818
- Constructing emotional speech synthesizers with limited speech database
- Tsuzuki, R.; Zen, H.; Tokuda, K.; Kitamura, T.; Bulut, M.; Narayanan, S.; 2004. Constructing emotional speech synthesizers with limited speech database. In: Proc. INTERSPEECH 2004-ICSLP, pp. 1185-1188.
- (2004) Proc. INTERSPEECH 2004-ICSLP , pp. 1185-1188
- Tsuzuki, R.¹ Zen, H.² Tokuda, K.³ Kitamura, T.⁴ Bulut, M.⁵ Narayanan, S.⁶

23
- 85009177437
- Modeling of various speaking styles and emotions for HMM-based speech synthesis
- Yamagishi, J.; Onishi, K.; Masuko, T.; Kobayashi, T.; 2003a. Modeling of various speaking styles and emotions for HMM-based speech synthesis. In: Proc. INTERSPEECH 2003-EUROSPEECH, pp. 2461-2464.
- (2003) Proc. INTERSPEECH 2003-EUROSPEECH , pp. 2461-2464
- Yamagishi, J.¹ Onishi, K.² Masuko, T.³ Kobayashi, T.⁴

24
- 0038042801
- A context clustering technique for average voice models
- J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi A context clustering technique for average voice models IEICE Trans. Inf. Syst. E86-D 3 2003 534 542
- (2003) IEICE Trans. Inf. Syst. , vol.86 , Issue.3 , pp. 534-542
- Yamagishi, J.¹ Tamura, M.² Masuko, T.³ Tokuda, K.⁴ Kobayashi, T.⁵

25
- 0142007308
- A training method of average voice model for HMM-based speech synthesis
- J. Yamagishi, M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi A training method of average voice model for HMM-based speech synthesis IEICE Trans. Fundamentals E86-A 8 2003 1956 1963
- (2003) IEICE Trans. Fundamentals , vol.86 , Issue.8 , pp. 1956-1963
- Yamagishi, J.¹ Tamura, M.² Masuko, T.³ Tokuda, K.⁴ Kobayashi, T.⁵

26
- 67650819492
- Yamagishi, J.; Zen, H.; Wu, Y.; Toda, T.; Tokuda, K.; 2008. The HTS-2008 system: yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 blizzard challenge.
- (2008) The HTS-2008 System: Yet Another Evaluation of the Speaker-adaptive HMM-based Speech Synthesis System in the 2008 Blizzard Challenge.
- Yamagishi, J.¹ Zen, H.² Wu, Y.³ Toda, T.⁴ Tokuda, K.⁵

27
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
- Yoshimura, T.; Tokuda, K.; Masuko, T.; Kobayashi, T.; Kitamura, T.; 1999. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. In: Proc. EUROSPEECH, pp. 2347-2350.
- (1999) Proc. EUROSPEECH , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

28
- 79955538498
- Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis
- K. Yu, H. Zen, F. Mairesse, and S. Young Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis Speech Comm. 53 6 2011 914 923
- (2011) Speech Comm. , vol.53 , Issue.6 , pp. 914-923
- Yu, K.¹ Zen, H.² Mairesse, F.³ Young, S.⁴

29
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. Black Statistical parametric speech synthesis Speech Comm. 51 11 2009 1039 1064
- (2009) Speech Comm. , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.