Volume 55, Issue 2, 2013, Pages 347-357

An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model

Author keywords

HMM based expressive speech synthesis; Multiple regression global variance model; Multiple regression HSMM; Style control; Style intensity

Indexed keywords

COMPENSATION METHOD; CONTROL TECHNIQUES; EMOTIONAL EXPRESSIONS; EXPRESSIVE SPEECH SYNTHESIS; HIDDEN SEMI-MARKOV MODELS; MODEL PARAMETERS; MODEL TRAINING; MULTIPLE-REGRESSION HSMM; NATURAL SPEECH; SPEAKING STYLES; SPEECH SYNTHESIS SYSTEM; STYLE INTENSITY; SYNTHETIC SPEECH; TRAINING DATA; VARIANCE MODELS

EID: 84870246600     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2012.09.003     Document Type: Article
Times cited: 21

References (29)
  • 2
    • 0037382510
    • Describing the emotional states that are expressed in speech
    • R. Cowie, and R.R. Cornelius Describing the emotional states that are expressed in speech Speech Comm. 40 1-2 2003 5 32
    • (2003) Speech Comm. , vol.40 , Issue.1-2 , pp. 5-32
    • Cowie, R.1    Cornelius, R.R.2
  • 3
    • 23144458652
    • Expressive speech: Production, perception and application to speech synthesis
    • D. Erickson Expressive speech: production, perception and application to speech synthesis Acoust. Sci. Tech. 26 4 2005 317 325
    • (2005) Acoust. Sci. Tech. , vol.26 , Issue.4 , pp. 317-325
    • Erickson, D.1
  • 5
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. Gales Maximum likelihood linear transformations for HMM-based speech recognition Comput. Speech Language 12 1998 75 98
    • (1998) Comput. Speech Language , vol.12 , pp. 75-98
    • Gales, M.1
  • 6
    • 0034227757
    • Cluster adaptive training of hidden Markov models
    • M. Gales Cluster adaptive training of hidden Markov models IEEE Trans. Speech Audio Process 8 4 2000 417 428
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.1
  • 7
    • 0037380318
    • A corpus-based speech synthesis system with emotion
    • A. Iida, N. Campbell, F. Higuchi, and M. Yasumura A corpus-based speech synthesis system with emotion Speech Comm. 40 1-2 2003 161 187
    • (2003) Speech Comm. , vol.40 , Issue.1-2 , pp. 161-187
    • Iida, A.1    Campbell, N.2    Higuchi, F.3    Yasumura, M.4
  • 8
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds Speech Comm. 27 3-4 1999 187 207
    • (1999) Speech Comm. , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3
  • 9
    • 84865794815
    • On the use of extended context for HMM-based spontaneous conversational speech synthesis
    • Koriyama, T.; Nose, T.; Kobayashi, T.; 2011. On the use of extended context for HMM-based spontaneous conversational speech synthesis. In: Proc. INTERSPEECH 2011, pp. 2657-2660.
    • (2011) Proc. INTERSPEECH 2011 , pp. 2657-2660
    • Koriyama, T.1    Nose, T.2    Kobayashi, T.3
  • 12
    • 29144493408
    • Human walking motion synthesis with desired pace and stride length based on HSMM
    • N. Niwase, J. Yamagishi, and T. Kobayashi Human walking motion synthesis with desired pace and stride length based on HSMM IEICE Trans. Inf. Syst. E88-D 11 2005 2492 2499
    • (2005) IEICE Trans. Inf. Syst. , vol.E88-D , Issue.11 , pp. 2492-2499
    • Niwase, N.1    Yamagishi, J.2    Kobayashi, T.3
  • 13
    • 67650793657
    • HMM-based style control for expressive speech synthesis with arbitrary speaker's voice using model adaptation
    • T. Nose, M. Tachibana, and T. Kobayashi HMM-based style control for expressive speech synthesis with arbitrary speaker's voice using model adaptation IEICE Trans. Inf. Syst. E92-D 3 2009 489 497
    • (2009) IEICE Trans. Inf. Syst. , vol.E92-D , Issue.3 , pp. 489-497
    • Nose, T.1    Tachibana, M.2    Kobayashi, T.3
  • 14
    • 51449114529
    • A style control technique for HMM-based expressive speech synthesis
    • T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi A style control technique for HMM-based expressive speech synthesis IEICE Trans. Inf. Syst. E90-D 9 2007 1406 1413
    • (2007) IEICE Trans. Inf. Syst. , vol.E90-D , Issue.9 , pp. 1406-1413
    • Nose, T.1    Yamagishi, J.2    Masuko, T.3    Kobayashi, T.4
  • 16
    • 0001309343
    • Cue utilization in emotion attribution from auditory stimuli
    • K. Scherer, and J. Oshinsky Cue utilization in emotion attribution from auditory stimuli Motivat. Emot. 1 4 1977 331 346
    • (1977) Motivat. Emot. , vol.1 , Issue.4 , pp. 331-346
    • Scherer, K.1    Oshinsky, J.2
  • 17
    • 84971539709
    • Emotional speech synthesis: A review
    • Schröder, M.; 2001. Emotional speech synthesis: a review. In: Proc. EUROSPEECH 2001, pp. 561-564.
    • (2001) Proc. EUROSPEECH 2001 , pp. 561-564
    • Schröder, M.1
  • 18
    • 84908477401
    • Hidden Markov model-based speech emotion recognition
    • Schuller, B.; Rigoll, G.; Lang, M.; 2003. Hidden Markov model-based speech emotion recognition. In: Proc. ICASSP 2003, vol. 1, pp. 401-404.
    • (2003) Proc. ICASSP 2003 , vol.1 , pp. 401-404
    • Schuller, B.1    Rigoll, G.2    Lang, M.3
  • 19
    • 0033906251
    • MDL-based context-dependent subword modeling for speech recognition
    • K. Shinoda, and T. Watanabe MDL-based context-dependent subword modeling for speech recognition J. Acoust. Soc. Jpn. (E) 21 2 2000 79 86
    • (2000) J. Acoust. Soc. Jpn. (E) , vol.21 , Issue.2 , pp. 79-86
    • Shinoda, K.1    Watanabe, T.2
  • 20
    • 29144475179
    • Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing
    • M. Tachibana, J. Yamagishi, T. Masuko, and T. Kobayashi Speech synthesis with various emotional expressions and speaking styles by style interpolation and morphing IEICE Trans. Inf. Syst. E88-D 11 2005 2484 2491
    • (2005) IEICE Trans. Inf. Syst. , vol.E88-D , Issue.11 , pp. 2484-2491
    • Tachibana, M.1    Yamagishi, J.2    Masuko, T.3    Kobayashi, T.4
  • 21
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda, and K. Tokuda A speech parameter generation algorithm considering global variance for HMM-based speech synthesis IEICE Trans. Inf. Syst. E90-D 5 2007 816 824
    • (2007) IEICE Trans. Inf. Syst. , vol.E90-D , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 27
    • 85009139544
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Yoshimura, T.; Tokuda, K.; Masuko, T.; Kobayashi, T.; Kitamura, T.; 1999. Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. In: Proc. EUROSPEECH, pp. 2347-2350.
    • (1999) Proc. EUROSPEECH , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 28
    • 79955538498
    • Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis
    • K. Yu, H. Zen, F. Mairesse, and S. Young Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis Speech Comm. 53 6 2011 914 923
    • (2011) Speech Comm. , vol.53 , Issue.6 , pp. 914-923
    • Yu, K.1    Zen, H.2    Mairesse, F.3    Young, S.4
  • 29
    • 67651002140
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. Black Statistical parametric speech synthesis Speech Comm. 51 11 2009 1039 1064
    • (2009) Speech Comm. , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.3


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.