SCOPUS 정보 검색 플랫폼

Proceedings of the 6th International Conference on Speech Prosody, SP 2012

Volumn 1, Issue , 2012, Pages 163-166

Emotional voice conversion for Mandarin using tone nucleus model - Small corpus and high efficiency

(4) Wang, Miaomiao a Wen, Miaomiao b Hirose, Keikichi c Minematsu, Nobuaki c

a Toshiba China R and D Center (China)

b Language Technologies Institute Carnegie Mellon University ^* (United States)

c UNIVERSITY OF TOKYO (Japan)

Author keywords

Emotional voice conversion; Mandarin; Tone nucleus

Indexed keywords

DYNAMIC PROGRAMMING; METADATA;

CLASSIFICATION AND REGRESSION TREE; DATA SPARSENESS PROBLEM; EMOTIONAL INFORMATION; EMOTIONAL VOICES; MANDARIN; SPECTRAL CONVERSION; SPECTRAL TRANSFORMATIONS; TONE NUCLEUS;

SPEECH PROCESSING;

EID: 84902959938 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (6)

References (15)

1
- 84971539709
- Emotional Speech Synthesis: A Review
- Schröder, M., "Emotional Speech Synthesis: A Review", In Proc. Eurospeech, pp. 561-564, 2001
- (2001) In Proc. Eurospeech , pp. 561-564
- Schröder, M.¹

2
- 85009177437
- Modeling of various speaking styles and emotions for HMMbased speech synthesis
- J. Yamagishi, K. Onishi, T. Masuko and T. Kobayashi. 2003. Modeling of various speaking styles and emotions for HMMbased speech synthesis, Proc. Eurospeech, pp.2461-2464.
- (2003) Proc. Eurospeech , pp. 2461-2464
- Yamagishi, J.¹ Onishi, K.² Masuko, T.³ Kobayashi, T.⁴

3
- 34047263010
- Prosody conversion from neutral speech to emotional speech
- J. Tao, Y. Kang, and A. Li. 2006. Prosody conversion from neutral speech to emotional speech, IEEE Trans. Audio, Speech and Language Processing, vol.14: 1145-1153.
- (2006) IEEE Trans. Audio, Speech and Language Processing , vol.14 , pp. 1145-1153
- Tao, J.¹ Kang, Y.² Li, A.³

4
- 84867192290
- Two-Stage prosody prediction for emotional textto-speech synthesis
- H. Tang, X. Zhou, M. Odisio, M. Hasegawa-Johnson and T. Huang. 2008. Two-Stage prosody prediction for emotional textto-speech synthesis, Proc. Interspeech 2008, pp.2138-2141.
- (2008) Proc. Interspeech 2008 , pp. 2138-2141
- Tang, H.¹ Zhou, X.² Odisio, M.³ Hasegawa-Johnson, M.⁴ Huang, T.⁵

5
- 0029267839
- Tone recognition of continuous Mandarin speech based on Neural Networks
- S.-H. Chen and Y.-R. Wang, Tone recognition of continuous Mandarin speech based on Neural Networks, IEEE Trans. On SAP, Vol. 3, No. 2, 1995, pp.146-150.
- (1995) IEEE Trans. On SAP , vol.3 , Issue.2 , pp. 146-150
- Chen, S.-H.¹ Wang, Y.-R.²

6
- 0030706880
- Contextual tonal variations in Mandarin
- Xu, Y., Contextual tonal variations in Mandarin. J. Phonetics 25, 61-83, 1997.
- (1997) J. Phonetics , vol.25 , pp. 61-83
- Xu, Y.¹

7
- 0000665734
- Explaining phonetic variation: A sketch of the H&H theory
- W.Hardcastle and A. Marchal (ed.), Kluwer Academic Publishers
- B. Lindblom, Explaining phonetic variation: a sketch of the H&H theory, W.Hardcastle and A. Marchal (ed.), Speech Production and Speech Modelling. Kluwer Academic Publishers, 1990, pp.403-439.
- (1990) Speech Production and Speech Modelling , pp. 403-439
- Lindblom, B.¹

8
- 1842475630
- Tone nucleus modeling for Chinese lexical tone recognition
- Zhang, J. and Hirose, K., Tone nucleus modeling for Chinese lexical tone recognition, Speech Communication, Vol. 42, Nos. 3-4, pp. 447-466, 2004
- (2004) Speech Communication , vol.42 , Issue.3-4 , pp. 447-466
- Zhang, J.¹ Hirose, K.²

9
- 0003775162
- University of California Press, Berkeley
- Chao, Y.-R., 1968. A Grammar of Spoken Chinese. University of California Press, Berkeley.
- (1968) A Grammar of Spoken Chinese
- Chao, Y.-R.¹

10
- 84865787512
- Prosody conversion for emotional Mandarin speech synthesis using the tone nucleus model
- M. Wen, M. Wang, K. Hirose, N. Minematsu, Prosody conversion for emotional Mandarin speech synthesis using the tone nucleus model, Proc. INTERSPEECH, pp.2797-2800, 2011
- (2011) Proc. INTERSPEECH , pp. 2797-2800
- Wen, M.¹ Wang, M.² Hirose, K.³ Minematsu, N.⁴

11
- 58149203393
- Data-driven emotion conversion in spoken English
- Z. Inanoglu and S. Young. 2009. Data-driven emotion conversion in spoken English, Speech Communication, 51, pp.268-283.
- (2009) Speech Communication , vol.51 , pp. 268-283
- Inanoglu, Z.¹ Young, S.²

12
- 56149126461
- GMM-based Voice Conversion Applied to Emotional Speech Synthesis
- H. Kawanami, Y. Iwami, T. Toda, H. Saruwatari, and K. Shikamo. 1999. GMM-based Voice Conversion Applied to Emotional Speech Synthesis, IEEE Trans. Speech and Audio Proc., 7(6):697-708.
- (1999) IEEE Trans. Speech and Audio Proc , vol.7 , Issue.6 , pp. 697-708
- Kawanami, H.¹ Iwami, Y.² Toda, T.³ Saruwatari, H.⁴ Shikamo, K.⁵

13
- 0034842552
- Voice Conversion Algorithm based on Gaussian Mixture Model with Dynamic Frequency Warping of STRAIGHT spectrum
- Salt Lake City, USA
- T. Toda, H. Saruwatari, and K. Shikano. 2001. Voice Conversion Algorithm based on Gaussian Mixture Model with Dynamic Frequency Warping of STRAIGHT spectrum. In Proc. ICASSP, pp. 841-844, Salt Lake City, USA.
- (2001) In Proc. ICASSP , pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

14
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou. 1998. Continuous probabilistic transform for voice conversion, IEEE TSAP, no. 6, pp. 131-142.
- (1998) IEEE TSAP , Issue.6 , pp. 131-142
- Stylianou, Y.¹

15
- 0032673049
- Restructuring speech representations using a pitch adaptive time frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. M. Katsuse, and A. D. Cheveigne, "Restructuring speech representations using a pitch adaptive time frequency smoothing and an instantaneous frequency-based F0 extraction: possible role of a repetitive structure in sounds", Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Katsuse, I.M.² Cheveigne, A.D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.