SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 14, Issue 4, 2006, Pages 1145-1153

Prosody conversion from neutral speech to emotional speech

(3) Tao, Jianhua a,b Kang, Yongguo b Li, Aijun c

a IEEE (China)

b INSTITUTE OF AUTOMATION (China)

c CHINESE ACADEMY OF SOCIAL SCIENCES (China)

Author keywords

Emotional speech; Prosody analysis; Speech synthesis

Indexed keywords

EMOTIONAL SPEECH; GAUSSIAN MIXTURE MODELS (GMM); LINEAR MODIFICATION MODELS (LMM); PROSODY ANALYSIS;

CLASSIFICATION (OF INFORMATION); COMPUTER SIMULATION; INFORMATION ANALYSIS; MATHEMATICAL MODELS; OPTIMIZATION; REGRESSION ANALYSIS; SEMANTICS;

SPEECH SYNTHESIS;

EID: 34047263010 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.876113 Document Type: Article

Times cited: (221)

References (37)

1
- 84983154011
- Perception of affect in speech - Toward an automatic processing of paralinguistic information in spoken conversation
- Jeju, Korea, Oct
- N. Campbell, "Perception of affect in speech - Toward an automatic processing of paralinguistic information in spoken conversation," in Proc. ICSLP, Jeju, Korea, Oct. 2004, pp. 881-884.
- (2004) Proc. ICSLP , pp. 881-884
- Campbell, N.¹

2
- 0004160296
- Cambridge, U.K, Cambridge Univ. Press
- A. Ortony, G. L. Clore, and A. Collins, The Cognitive Structure of Emotions. Cambridge, U.K.: Cambridge Univ. Press, 1988.
- (1988) The Cognitive Structure of Emotions
- Ortony, A.¹ Clore, G.L.² Collins, A.³

3
- 0003762887
- J. P. H. van Santen, R. W. Sproat, J. P. Olive, and J. Hirschberg, Eds, New York: Springer
- J. P. H. van Santen, R. W. Sproat, J. P. Olive, and J. Hirschberg, Eds., Progress in Speech Synthesis. New York: Springer, 1997.
- (1997) Progress in Speech Synthesis

4
- 34047247988
- Emotion control of Chinese speech synthesis in natural environment
- J. Tao, "Emotion control of Chinese speech synthesis in natural environment," in Proc. Eurospeech, 2003, pp. 2349-2352.
- (2003) Proc. Eurospeech , pp. 2349-2352
- Tao, J.¹

5
- 0342561578
- Pitch targets and their realization: Evidence from Mandarin Chinese
- Y. Xu and Q. E. Wang, "Pitch targets and their realization: Evidence from Mandarin Chinese," Speech Commun., vol. 33, pp. 319-337, 2001.
- (2001) Speech Commun , vol.33 , pp. 319-337
- Xu, Y.¹ Wang, Q.E.²

6
- 85011187169
- Analysis of voice fundamental frequency contours for declarative sentence of Japanese
- H. Fujisaki and K. Hirose, "Analysis of voice fundamental frequency contours for declarative sentence of Japanese," J. Acoust. Soc. Jpn. (E), vol. 5, no. 4, pp. 233-242, 1984.
- (1984) J. Acoust. Soc. Jpn. (E) , vol.5 , Issue.4 , pp. 233-242
- Fujisaki, H.¹ Hirose, K.²

7
- 85009061625
- Expression of emotion and attitude through temporal speech variations
- Beijing, China
- S. J. L. Mozziconacci and D. J. Hermes, "Expression of emotion and attitude through temporal speech variations," in Proc. ICSLP, Beijing, China, 2000, pp. 373-378.
- (2000) Proc. ICSLP , pp. 373-378
- Mozziconacci, S.J.L.¹ Hermes, D.J.²

8
- 0002515370
- The generation of affect in synthesized speech
- Jul
- J. E. Cahn, "The generation of affect in synthesized speech," J. Amer. Voice I/O Soc., vol. 8, pp. 1-19, Jul. 1990.
- (1990) J. Amer. Voice I/O Soc , vol.8 , pp. 1-19
- Cahn, J.E.¹

9
- 34047251408
- N. Campbell, "Synthesis units for conversational speech - Using phrasal segments." [Online]. Available: http://feast.atr.jp/nick/rcfs.html
- Synthesis units for conversational speech - Using phrasal segments
- Campbell, N.

10
- 85009097029
- XML representation languages as a way of interconnecting TTS modules
- Jeju, Korea
- M. Schröder and S. Breuer, "XML representation languages as a way of interconnecting TTS modules," in Proc. ICSLP, Jeju, Korea, 2004, pp. 1889-1892.
- (2004) Proc. ICSLP , pp. 1889-1892
- Schröder, M.¹ Breuer, S.²

11
- 33947635494
- A corpus-based approach to <ahem/> expressive speech synthesis
- Santa Monica, CA
- E. Eide, A. Aaron, R. Bakis, W. Hamza, M. Picheny, and J. Pitrelli, "A corpus-based approach to <ahem/> expressive speech synthesis," in Proc. IEEE Speech Synthesis Workshop, Santa Monica, CA, 2002, pp. 79-84.
- (2002) Proc. IEEE Speech Synthesis Workshop , pp. 79-84
- Eide, E.¹ Aaron, A.² Bakis, R.³ Hamza, W.⁴ Picheny, M.⁵ Pitrelli, J.⁶

12
- 85009259780
- Emotion recognition from textual input using an emotional semantic network
- Denver, CO
- Z.-J. Chuang and C.-H. Wu, "Emotion recognition from textual input using an emotional semantic network," in Proc. Int. Conf. Spoken Language Processing, ICSLP 2002, Denver, CO, 2002, pp. 2033-2036.
- (2002) Proc. Int. Conf. Spoken Language Processing, ICSLP 2002 , pp. 2033-2036
- Chuang, Z.-J.¹ Wu, C.-H.²

13
- 33646817084
- Generating emotional speech with a concatenative synthesizer
- E. Rank and H. Pirker, "Generating emotional speech with a concatenative synthesizer," in Proc. ICSLP, 1998, pp. 671-674.
- (1998) Proc. ICSLP , pp. 671-674
- Rank, E.¹ Pirker, H.²

14
- 34047250574
- Chinese prosody and prosodic labeling of spontaneous speech
- A. Li, "Chinese prosody and prosodic labeling of spontaneous speech," in Proc. Speech Prosody, 2002, pp. 39-46.
- (2002) Proc. Speech Prosody , pp. 39-46
- Li, A.¹

15
- 34047254997
- H. Kawahara and R. Akahane-Yamada, Perceptual effects of spectral envelope and F0 manipulations using STRAIGHT method, J. Acoust. Soc. Amer., pt. 2, 103, no. 5, p. 2776, 1998. 1aSC27.
- H. Kawahara and R. Akahane-Yamada, "Perceptual effects of spectral envelope and F0 manipulations using STRAIGHT method," J. Acoust. Soc. Amer., pt. 2, vol. 103, no. 5, p. 2776, 1998. 1aSC27.

16
- 34047264748
- A. B. Kain, High-resolution voice transformation, Ph.D. dissertation, Oregon Health and Sci. Univ., Portland, Oct. 2001.
- A. B. Kain, "High-resolution voice transformation," Ph.D. dissertation, Oregon Health and Sci. Univ., Portland, Oct. 2001.

17
- 84984832252
- STEM-ML: Language independent prosody description
- Beijing, China
- G. P. Kochanski and C. Shih, "STEM-ML: Language independent prosody description," in Proc. ICSLP, Beijing, China, 2000, pp. 239-242.
- (2000) Proc. ICSLP , pp. 239-242
- Kochanski, G.P.¹ Shih, C.²

18
- 0027447292
- Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
- I. Murray and J. L. Arnott, "Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion," J. Acoust. Soc. Amer., pp. 1097-1108, 1993.
- (1993) J. Acoust. Soc. Amer , pp. 1097-1108
- Murray, I.¹ Arnott, J.L.²

19
- 34047255680
- R. M. Stibbard, Vocal expression of emotions in non-laboratory speech: An investigation of the reading/leeds emotion in speech project annotation data, Ph.D. dissertation, Univ. Reading, Reading, U.K., 2001.
- R. M. Stibbard, "Vocal expression of emotions in non-laboratory speech: An investigation of the reading/leeds emotion in speech project annotation data," Ph.D. dissertation, Univ. Reading, Reading, U.K., 2001.

20
- 21844454654
- The determination, analysis, and synthesis of fundamental frequency
- Ph.D. dissertation, Northwestern Univ., Evanston, IL
- X. Sun, "The determination, analysis, and synthesis of fundamental frequency," Ph.D. dissertation, Northwestern Univ., Evanston, IL, 2002.
- (2002)
- Sun, X.¹

21
- 58149209073
- Voice conversion: State of the art and perspectives
- Feb
- E. Moulines and Y. Sagisaka, "Voice conversion: State of the art and perspectives," Speech Commun., vol. 16, no. 2, pp. 125-126, Feb. 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 125-126
- Moulines, E.¹ Sagisaka, Y.²

22
- 0003135459
- Approaching automatic recognition of emotion from voice: A rough benchmark
- S. McGilloway, R. Cowie, E. Douglas-Cowie, S. Gielen, M. Westerdijk, and S. Stroeve, "Approaching automatic recognition of emotion from voice: A rough benchmark," in Proc. ISCA Workshop Speech Emotion, 2000, pp. 207-212.
- (2000) Proc. ISCA Workshop Speech Emotion , pp. 207-212
- McGilloway, S.¹ Cowie, R.² Douglas-Cowie, E.³ Gielen, S.⁴ Westerdijk, M.⁵ Stroeve, S.⁶

23
- 84982678054
- Classifying emotions in speech: A comparison of methods
- Holon, Israel
- N. Amir, "Classifying emotions in speech: A comparison of methods," in Proc. Eurospeech, Holon, Israel, 2001, pp. 127-130.
- (2001) Proc. Eurospeech , pp. 127-130
- Amir, N.¹

24
- 0004203240
- The EM algorithm and extensions
- New York: Wiley
- G. McLachlan and T. Krishnan, "The EM algorithm and extensions," in Wiley Series in Probability and Statistics, New York: Wiley, 1997.
- (1997) Wiley Series in Probability and Statistics
- McLachlan, G.¹ Krishnan, T.²

25
- 0037380186
- C. Gobl and A. Ní Chasaide, The role of voice quality in communicating emotion, mood and attitude, Speech Commun., 40, pp. 189-212, 2003.
- C. Gobl and A. Ní Chasaide, "The role of voice quality in communicating emotion, mood and attitude," Speech Commun., vol. 40, pp. 189-212, 2003.

26
- 85009159448
- Emotional space improves emotion recognition
- Denver, CO, Sep
- R. Tato, R. Santos, R. Kompe, and J. M. Pardo, "Emotional space improves emotion recognition," in Proc. ICSLP, Denver, CO, Sep. 2002, pp. 2029-2032.
- (2002) Proc. ICSLP , pp. 2029-2032
- Tato, R.¹ Santos, R.² Kompe, R.³ Pardo, J.M.⁴

27
- 85009080929
- Emotion recognition in speech signal: Experimental study, development and application
- Beijing, China
- V. A. Petrushin, "Emotion recognition in speech signal: Experimental study, development and application," in Proc. ICSLP, Beijing, China, 2000, pp. 222-225.
- (2000) Proc. ICSLP , pp. 222-225
- Petrushin, V.A.¹

28
- 0003518592
- Chicago, IL: Univ. Chicago Press
- B. Hayes, Metrical Stress Theory: Principles and Case Studies. Chicago, IL: Univ. Chicago Press, 1995.
- (1995) Metrical Stress Theory: Principles and Case Studies
- Hayes, B.¹

29
- 85009076640
- A novel voice conversion system based on codebook mapping with phoneme-tied weighting
- Jeju, Korea, Oct
- Z.-W. Shuang, Z.-X. Wang, Z.-H. Ling, and R.-H. Wang, "A novel voice conversion system based on codebook mapping with phoneme-tied weighting," in Proc. ICSLP, Jeju, Korea, Oct. 2004, pp. 1197-1200.
- (2004) Proc. ICSLP , pp. 1197-1200
- Shuang, Z.-W.¹ Wang, Z.-X.² Ling, Z.-H.³ Wang, R.-H.⁴

30
- 84863268465
- Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum
- Rhodes, Greece
- L. M. Arslan and D. Talkin, "Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 1347-1350.
- (1997) Proc. Eurospeech , pp. 1347-1350
- Arslan, L.M.¹ Talkin, D.²

31
- 85009266993
- Transformation of spectral envelope for voice conversion based on radial basis function networks
- Denver, CO
- T. Watanabe et al., "Transformation of spectral envelope for voice conversion based on radial basis function networks," in Proc. ICSLP, Denver, CO, 2002, pp. 285-288.
- (2002) Proc. ICSLP , pp. 285-288
- Watanabe, T.¹

32
- 85009080468
- Friendly speech analysis and perception in standard Chinese
- Jeju, Korea
- A. Li and H. Wang, "Friendly speech analysis and perception in standard Chinese," in Proc. ICSLP, Jeju, Korea, 2004, pp. 897-900.
- (2004) Proc. ICSLP , pp. 897-900
- Li, A.¹ Wang, H.²

33
- 33646779506
- Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
- T. Toda, A. W. Black, and K. Tokuda, "Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter," in Proc. ICASSP, 2005, pp. 9-12.
- (2005) Proc. ICASSP , pp. 9-12
- Toda, T.¹ Black, A.W.² Tokuda, K.³

34
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Geneva, Switzerland
- Y. Chen et al., "Voice conversion with smoothed GMM and MAP adaptation," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 2413-2416.
- (2003) Proc. Eurospeech , pp. 2413-2416
- Chen, Y.¹

35
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou et al., "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹

36
- 0029256372
- Voice conversion based on piecewise linear conversions rules of formant frequency and spectrum tilt
- H. Mizuno and M. Abe, "Voice conversion based on piecewise linear conversions rules of formant frequency and spectrum tilt," Speech Commun., vol. 16, pp. 153-164.
- Speech Commun , vol.16 , pp. 153-164
- Mizuno, H.¹ Abe, M.²

37
- 33646785078
- A hybrid GMM and codebook mapping method for spectral conversion
- Y. Kang, Z. Shuang, J. Tao, W. Zhang, and B. Xu, "A hybrid GMM and codebook mapping method for spectral conversion," in Proc. 1st Int. Conf. Affective Comput. Intell. Interaction, 2005, pp. 303-310.
- (2005) Proc. 1st Int. Conf. Affective Comput. Intell. Interaction , pp. 303-310
- Kang, Y.¹ Shuang, Z.² Tao, J.³ Zhang, W.⁴ Xu, B.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.