메뉴 건너뛰기




Volumn 14, Issue 4, 2006, Pages 1145-1153

Prosody conversion from neutral speech to emotional speech

Author keywords

Emotional speech; Prosody analysis; Speech synthesis

Indexed keywords

EMOTIONAL SPEECH; GAUSSIAN MIXTURE MODELS (GMM); LINEAR MODIFICATION MODELS (LMM); PROSODY ANALYSIS;

EID: 34047263010     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.876113     Document Type: Article
Times cited : (221)

References (37)
  • 1
    • 84983154011 scopus 로고    scopus 로고
    • Perception of affect in speech - Toward an automatic processing of paralinguistic information in spoken conversation
    • Jeju, Korea, Oct
    • N. Campbell, "Perception of affect in speech - Toward an automatic processing of paralinguistic information in spoken conversation," in Proc. ICSLP, Jeju, Korea, Oct. 2004, pp. 881-884.
    • (2004) Proc. ICSLP , pp. 881-884
    • Campbell, N.1
  • 3
    • 0003762887 scopus 로고    scopus 로고
    • J. P. H. van Santen, R. W. Sproat, J. P. Olive, and J. Hirschberg, Eds, New York: Springer
    • J. P. H. van Santen, R. W. Sproat, J. P. Olive, and J. Hirschberg, Eds., Progress in Speech Synthesis. New York: Springer, 1997.
    • (1997) Progress in Speech Synthesis
  • 4
    • 34047247988 scopus 로고    scopus 로고
    • Emotion control of Chinese speech synthesis in natural environment
    • J. Tao, "Emotion control of Chinese speech synthesis in natural environment," in Proc. Eurospeech, 2003, pp. 2349-2352.
    • (2003) Proc. Eurospeech , pp. 2349-2352
    • Tao, J.1
  • 5
    • 0342561578 scopus 로고    scopus 로고
    • Pitch targets and their realization: Evidence from mandarin Chinese
    • Y. Xu and Q. E. Wang, "Pitch targets and their realization: Evidence from mandarin Chinese," Speech Commun., vol. 33, pp. 319-337, 2001.
    • (2001) Speech Commun , vol.33 , pp. 319-337
    • Xu, Y.1    Wang, Q.E.2
  • 6
    • 85011187169 scopus 로고
    • Analysis of voice fundamental frequency contours for declarative sentence of Japanese
    • H. Fujisaki and K. Hirose, "Analysis of voice fundamental frequency contours for declarative sentence of Japanese," J. Acoust. Soc. Jpn. (E), vol. 5, no. 4, pp. 233-242, 1984.
    • (1984) J. Acoust. Soc. Jpn. (E) , vol.5 , Issue.4 , pp. 233-242
    • Fujisaki, H.1    Hirose, K.2
  • 7
    • 85009061625 scopus 로고    scopus 로고
    • Expression of emotion and attitude through temporal speech variations
    • Beijing, China
    • S. J. L. Mozziconacci and D. J. Hermes, "Expression of emotion and attitude through temporal speech variations," in Proc. ICSLP, Beijing, China, 2000, pp. 373-378.
    • (2000) Proc. ICSLP , pp. 373-378
    • Mozziconacci, S.J.L.1    Hermes, D.J.2
  • 8
    • 0002515370 scopus 로고
    • The generation of affect in synthesized speech
    • Jul
    • J. E. Cahn, "The generation of affect in synthesized speech," J. Amer. Voice I/O Soc., vol. 8, pp. 1-19, Jul. 1990.
    • (1990) J. Amer. Voice I/O Soc , vol.8 , pp. 1-19
    • Cahn, J.E.1
  • 10
    • 85009097029 scopus 로고    scopus 로고
    • XML representation languages as a way of interconnecting TTS modules
    • Jeju, Korea
    • M. Schröder and S. Breuer, "XML representation languages as a way of interconnecting TTS modules," in Proc. ICSLP, Jeju, Korea, 2004, pp. 1889-1892.
    • (2004) Proc. ICSLP , pp. 1889-1892
    • Schröder, M.1    Breuer, S.2
  • 12
    • 85009259780 scopus 로고    scopus 로고
    • Emotion recognition from textual input using an emotional semantic network
    • Denver, CO
    • Z.-J. Chuang and C.-H. Wu, "Emotion recognition from textual input using an emotional semantic network," in Proc. Int. Conf. Spoken Language Processing, ICSLP 2002, Denver, CO, 2002, pp. 2033-2036.
    • (2002) Proc. Int. Conf. Spoken Language Processing, ICSLP 2002 , pp. 2033-2036
    • Chuang, Z.-J.1    Wu, C.-H.2
  • 13
    • 33646817084 scopus 로고    scopus 로고
    • Generating emotional speech with a concatenative synthesizer
    • E. Rank and H. Pirker, "Generating emotional speech with a concatenative synthesizer," in Proc. ICSLP, 1998, pp. 671-674.
    • (1998) Proc. ICSLP , pp. 671-674
    • Rank, E.1    Pirker, H.2
  • 14
    • 34047250574 scopus 로고    scopus 로고
    • Chinese prosody and prosodic labeling of spontaneous speech
    • A. Li, "Chinese prosody and prosodic labeling of spontaneous speech," in Proc. Speech Prosody, 2002, pp. 39-46.
    • (2002) Proc. Speech Prosody , pp. 39-46
    • Li, A.1
  • 15
    • 34047254997 scopus 로고    scopus 로고
    • H. Kawahra and R. Akahane-Yamada, Perceptual effects of spectral envelope and F0 manipulations using STRAIGHT method, J. Acoust. Soc. Amer., pt. 2, 103, no. 5, p. 2776, 1998. 1aSC27.
    • H. Kawahra and R. Akahane-Yamada, "Perceptual effects of spectral envelope and F0 manipulations using STRAIGHT method," J. Acoust. Soc. Amer., pt. 2, vol. 103, no. 5, p. 2776, 1998. 1aSC27.
  • 16
    • 34047264748 scopus 로고    scopus 로고
    • A. B. Kain, High-resolution voice transformation, Ph.D. dissertation, Oregon Health and Sci. Univ., Portland, Oct. 2001.
    • A. B. Kain, "High-resolution voice transformation," Ph.D. dissertation, Oregon Health and Sci. Univ., Portland, Oct. 2001.
  • 17
    • 84984832252 scopus 로고    scopus 로고
    • STEM-ML: Language independent prosody description
    • Beijing, China
    • G. P. Kochanski and C. Shih, "STEM-ML: Language independent prosody description," in Proc. ICSLP, Beijing, China, 2000, pp. 239-242.
    • (2000) Proc. ICSLP , pp. 239-242
    • Kochanski, G.P.1    Shih, C.2
  • 18
    • 0027447292 scopus 로고
    • Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
    • I. Murray and J. L. Arnott, "Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion," J. Acoust. Soc. Amer., pp. 1097-1108, 1993.
    • (1993) J. Acoust. Soc. Amer , pp. 1097-1108
    • Murray, I.1    Arnott, J.L.2
  • 19
    • 34047255680 scopus 로고    scopus 로고
    • R. M. Stibbard, Vocal expression of emotions in non-laboratory speech: An investigation of the reading/leeds emotion in speech project annotation data, Ph.D. dissertation, Univ. Reading, Reading, U.K., 2001.
    • R. M. Stibbard, "Vocal expression of emotions in non-laboratory speech: An investigation of the reading/leeds emotion in speech project annotation data," Ph.D. dissertation, Univ. Reading, Reading, U.K., 2001.
  • 20
    • 21844454654 scopus 로고    scopus 로고
    • The determination, analysis, and synthesis of fundamental frequency,
    • Ph.D. dissertation. Northwestern Univ, Evanston, IL
    • X. Sun, "The determination, analysis, and synthesis of fundamental frequency," Ph.D. dissertation. Northwestern Univ., Evanston, IL, 2002.
    • (2002)
    • Sun, X.1
  • 21
    • 58149209073 scopus 로고
    • Voice conversion: State of the art and perspectives
    • Feb
    • E. Moulines and Y. Sagisaka, "Voice conversion: State of the art and perspectives," Speech Commun., vol. 16, no. 2, pp. 125-126, Feb. 1995.
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 125-126
    • Moulines, E.1    Sagisaka, Y.2
  • 23
    • 84982678054 scopus 로고    scopus 로고
    • Classifying emotions in speech: A comparison of methods
    • Holon, Isreal
    • N. Amir, "Classifying emotions in speech: A comparison of methods," in Proc. Eurospeech. Holon, Isreal, 2001, pp. 127-130.
    • (2001) Proc. Eurospeech , pp. 127-130
    • Amir, N.1
  • 25
    • 0037380186 scopus 로고    scopus 로고
    • C. Gobl and A. N'1Chasaide, The role of voice quality in communicating emotion, mood and attitude, Speech Commun., 40, pp. 189-212, 2003.
    • C. Gobl and A. N'1Chasaide, "The role of voice quality in communicating emotion, mood and attitude," Speech Commun., vol. 40, pp. 189-212, 2003.
  • 26
    • 85009159448 scopus 로고    scopus 로고
    • Emotional space improves emotion recognition
    • Denver, CO, Sep
    • R. Tato, R. Santos, R. Kompe, and J. M. Pardo, "Emotional space improves emotion recognition," in Proc. ICSLP, Denver, CO, Sep. 2002, pp. 2029-2032.
    • (2002) Proc. ICSLP , pp. 2029-2032
    • Tato, R.1    Santos, R.2    Kompe, R.3    Pardo, J.M.4
  • 27
    • 85009080929 scopus 로고    scopus 로고
    • Emotion recognition in speech signal: Experimental study, development and application
    • Beijing, China
    • V. A. Petrushin, "Emotion recognition in speech signal: Experimental study, development and application," in Proc. ICSLP, Beijing, China, 2000, pp. 222-225.
    • (2000) Proc. ICSLP , pp. 222-225
    • Petrushin, V.A.1
  • 29
    • 85009076640 scopus 로고    scopus 로고
    • A novel voice conversion system based on codebook mapping with phoneme-lied weighting
    • Jeju, Korea, Oct
    • Z.-W. Shuang, Z.-X. Wang, Z.-H. Ling, and R.-H. Wang, "A novel voice conversion system based on codebook mapping with phoneme-lied weighting," in Proc. ICSLP, Jeju, Korea, Oct. 2004, pp. 1197-1200.
    • (2004) Proc. ICSLP , pp. 1197-1200
    • Shuang, Z.-W.1    Wang, Z.-X.2    Ling, Z.-H.3    Wang, R.-H.4
  • 30
    • 84863268465 scopus 로고    scopus 로고
    • Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum
    • Rhodes, Greece
    • L. M. Arslan and D. Talkin, "Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 1347-1350.
    • (1997) Proc. Eurospeech , pp. 1347-1350
    • Arslan, L.M.1    Talkin, D.2
  • 31
    • 85009266993 scopus 로고    scopus 로고
    • Transformation of spectral envelope for voice conversion based on radial basis function networks
    • Denver, CO
    • T. Watanabe et al., "Transformation of spectral envelope for voice conversion based on radial basis function networks," in Proc. ICSLP, Denver, CO, 2002, pp. 285-288.
    • (2002) Proc. ICSLP , pp. 285-288
    • Watanabe, T.1
  • 32
    • 85009080468 scopus 로고    scopus 로고
    • Friendly speech analysis and perception in standard Chinese
    • Jeju, Korea
    • A. Li and H. Wang, "Friendly speech analysis and perception in standard Chinese," in Proc. ICSLP, Jeju, Korea, 2004, pp. 897-900.
    • (2004) Proc. ICSLP , pp. 897-900
    • Li, A.1    Wang, H.2
  • 33
    • 33646779506 scopus 로고    scopus 로고
    • Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
    • T. Toda, A. W. Black, and K. Tokuda, "Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter," in Proc. ICASSP, 2005, pp. 9-12.
    • (2005) Proc. ICASSP , pp. 9-12
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 34
    • 84905560807 scopus 로고    scopus 로고
    • Voice conversion with smoothed GMM and map adaptation
    • Geneva, Switzerland
    • Y. Chen et al., "Voice conversion with smoothed GMM and map adaptation," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 2413-2416.
    • (2003) Proc. Eurospeech , pp. 2413-2416
    • Chen, Y.1
  • 35
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Y. Stylianou et al., "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1
  • 36
    • 0029256372 scopus 로고    scopus 로고
    • Voice conversion based on piecewise linear conversions rules of formant frequency and spectrum tilt
    • H Mizuno, H Mizuno, and M Abe, "Voice conversion based on piecewise linear conversions rules of formant frequency and spectrum tilt," Speech Commun. 16, pp. 153-164.
    • Speech Commun , vol.16 , pp. 153-164
    • Mizuno, H.1    Mizuno, H.2    Abe, M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.