메뉴 건너뛰기




Volumn 1, Issue , 2012, Pages 163-166

Emotional voice conversion for Mandarin using tone nucleus model - Small corpus and high efficiency

Author keywords

Emotional voice conversion; Mandarin; Tone nucleus

Indexed keywords

DYNAMIC PROGRAMMING; METADATA;

EID: 84902959938     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (6)

References (15)
  • 1
    • 84971539709 scopus 로고    scopus 로고
    • Emotional Speech Synthesis: A Review
    • Schröder, M., "Emotional Speech Synthesis: A Review", In Proc. Eurospeech, pp. 561-564, 2001
    • (2001) In Proc. Eurospeech , pp. 561-564
    • Schröder, M.1
  • 2
    • 85009177437 scopus 로고    scopus 로고
    • Modeling of various speaking styles and emotions for HMMbased speech synthesis
    • J. Yamagishi, K. Onishi, T. Masuko and T. Kobayashi. 2003. Modeling of various speaking styles and emotions for HMMbased speech synthesis, Proc. Eurospeech, pp.2461-2464.
    • (2003) Proc. Eurospeech , pp. 2461-2464
    • Yamagishi, J.1    Onishi, K.2    Masuko, T.3    Kobayashi, T.4
  • 5
    • 0029267839 scopus 로고
    • Tone recognition of continuous Mandarin speech based on Neural Networks
    • S.-H. Chen and Y.-R. Wang, Tone recognition of continuous Mandarin speech based on Neural Networks, IEEE Trans. On SAP, Vol. 3, No. 2, 1995, pp.146-150.
    • (1995) IEEE Trans. On SAP , vol.3 , Issue.2 , pp. 146-150
    • Chen, S.-H.1    Wang, Y.-R.2
  • 6
    • 0030706880 scopus 로고    scopus 로고
    • Contextual tonal variations in Mandarin
    • Xu, Y., Contextual tonal variations in Mandarin. J. Phonetics 25, 61-83, 1997.
    • (1997) J. Phonetics , vol.25 , pp. 61-83
    • Xu, Y.1
  • 7
    • 0000665734 scopus 로고
    • Explaining phonetic variation: A sketch of the H&H theory
    • W.Hardcastle and A. Marchal (ed.), Kluwer Academic Publishers
    • B. Lindblom, Explaining phonetic variation: a sketch of the H&H theory, W.Hardcastle and A. Marchal (ed.), Speech Production and Speech Modelling. Kluwer Academic Publishers, 1990, pp.403-439.
    • (1990) Speech Production and Speech Modelling , pp. 403-439
    • Lindblom, B.1
  • 8
    • 1842475630 scopus 로고    scopus 로고
    • Tone nucleus modeling for Chinese lexical tone recognition
    • Zhang, J. and Hirose, K., Tone nucleus modeling for Chinese lexical tone recognition, Speech Communication, Vol. 42, Nos. 3-4, pp. 447-466, 2004
    • (2004) Speech Communication , vol.42 , Issue.3-4 , pp. 447-466
    • Zhang, J.1    Hirose, K.2
  • 10
    • 84865787512 scopus 로고    scopus 로고
    • Prosody conversion for emotional Mandarin speech synthesis using the tone nucleus model
    • M. Wen, M. Wang, K. Hirose, N. Minematsu, Prosody conversion for emotional Mandarin speech synthesis using the tone nucleus model, Proc. INTERSPEECH, pp.2797-2800, 2011
    • (2011) Proc. INTERSPEECH , pp. 2797-2800
    • Wen, M.1    Wang, M.2    Hirose, K.3    Minematsu, N.4
  • 11
    • 58149203393 scopus 로고    scopus 로고
    • Data-driven emotion conversion in spoken English
    • Z. Inanoglu and S. Young. 2009. Data-driven emotion conversion in spoken English, Speech Communication, 51, pp.268-283.
    • (2009) Speech Communication , vol.51 , pp. 268-283
    • Inanoglu, Z.1    Young, S.2
  • 13
    • 0034842552 scopus 로고    scopus 로고
    • Voice Conversion Algorithm based on Gaussian Mixture Model with Dynamic Frequency Warping of STRAIGHT spectrum
    • Salt Lake City, USA
    • T. Toda, H. Saruwatari, and K. Shikano. 2001. Voice Conversion Algorithm based on Gaussian Mixture Model with Dynamic Frequency Warping of STRAIGHT spectrum. In Proc. ICASSP, pp. 841-844, Salt Lake City, USA.
    • (2001) In Proc. ICASSP , pp. 841-844
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 14
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • Y. Stylianou. 1998. Continuous probabilistic transform for voice conversion, IEEE TSAP, no. 6, pp. 131-142.
    • (1998) IEEE TSAP , Issue.6 , pp. 131-142
    • Stylianou, Y.1
  • 15
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch adaptive time frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. M. Katsuse, and A. D. Cheveigne, "Restructuring speech representations using a pitch adaptive time frequency smoothing and an instantaneous frequency-based F0 extraction: possible role of a repetitive structure in sounds", Speech Communication, vol. 27, no. 3-4, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Katsuse, I.M.2    Cheveigne, A.D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.