Volume 1, 2012, Pages 531-535

Emotional speech conversion based on spectrum-prosody dual transformation

Author keywords

Emotional speech conversion; GMM; Prosody rules

Indexed keywords

DUAL TRANSFORMATION; EMOTIONAL SPEECH; GAUSSIAN MIXTURE MODEL; GMM; PROSODIC FEATURES; PROSODY RULES; SPEECH DATABASE; SPEECH EMOTIONS;

EID: 84876489382     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICoSP.2012.6491543     Document Type: Conference Paper
Times cited : (6)

References (15)
  • 3
    • 34547519038
    • A statistical approach for modeling prosody features using pos tags for emotional speech synthesis
    • Bulut, M., S. Lee and S. Narayanan, A Statistical Approach For Modeling Prosody Features Using Pos Tags For Emotional Speech Synthesis, in ICASSP 2007. 2007.
    • (2007) ICASSP 2007
    • Bulut, M.1    Lee, S.2    Narayanan, S.3
  • 4
    • 84867219635
    • A comparison of voice conversion methods for transforming voice quality in emotional speech synthesis
    • Turk, O. and M. Schroder, A Comparison of Voice Conversion Methods for Transforming Voice Quality in Emotional Speech Synthesis, in INTERSPEECH 2008. 2008.
    • (2008) Interspeech 2008
    • Turk, O.1    Schroder, M.2
  • 5
    • 0002515370
    • The generation of affect in synthesized speech
    • July
    • J.E. Cahn, The Generation of Affect in Synthesized Speech, Journal of the American Voice I/O Society, vol. 8, pp. 1-19, July 1990.
    • (1990) Journal of the American Voice I/O Society , vol.8 , pp. 1-19
    • Cahn, J.E.1
  • 6
    • 0029325035
    • Implementation and testing of a system for producing emotion-by-rule in synthetic speech
    • June
    • I. R. Murray and J. L. Arnott, Implementation and testing of a system for producing emotion-by-rule in synthetic speech, Speech Communication, vol. 16, no. 4, pp. 369-390, June 1995.
    • (1995) Speech Communication , vol.16 , Issue.4 , pp. 369-390
    • Murray, I.R.1    Arnott, J.L.2
  • 8
    • 28444474431
    • Documentation of the Danish Emotional Speech database DES
    • Aalborg, Denmark
    • I. S. Engberg and A. V. Hansen, "Documentation of the Danish Emotional Speech database DES," Tech. Rep., Aalborg, Denmark, 1996.
    • (1996) Tech. Rep
    • Engberg, I.S.1    Hansen, A.V.2
  • 10
    • 0032673049
    • Restructuring speech representations using a pitch adaptive time frequency smoothing and an instantaneous frequency based F0 extraction
    • H. Kawahara, Restructuring speech representations using a pitch adaptive time frequency smoothing and an instantaneous frequency based F0 extraction, Speech Communication, 1999
    • (1999) Speech Communication
    • Kawahara, H.1
  • 12
    • 33746410556
    • Emotional speech recognition: resources, features, and methods
    • Sep
    • D. Ververidis and C. Kotropoulos, Emotional speech recognition: resources, features, and methods, Speech Communication, vol. 48, no. 9, pp. 1163-1181, Sep. 2006.
    • (2006) Speech Communication , vol.48 , Issue.9 , pp. 1163-1181
    • Ververidis, D.1    Kotropoulos, C.2
  • 13
    • 0025543906
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines and F. Charpentier. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones. Speech Communication, 1990, (9):453-467.
    • (1990) Speech Communication , Issue.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 14
    • 0026830163
    • Shape invariant time-scale and pitch modification of speech
    • T.F. Quatieri, R.J. McAulay, Shape Invariant Time-Scale and Pitch Modification of Speech. IEEE Trans. Signal Processing, 1992, 40(3):497-510.
    • (1992) IEEE Trans. Signal Processing , vol.40 , Issue.3 , pp. 497-510
    • Quatieri, T.F.1    McAulay, R.J.2
  • 15
    • 0029254163
    • Non-parametric techniques for pitch-scale and time-scale modification of speech
    • E. Moulines and J. Laroche. Non-parametric techniques for pitch-scale and time-scale modification of speech. Speech Communication, 1995, 16(2):195-205.
    • (1995) Speech Communication , vol.16 , Issue.2 , pp. 195-205
    • Moulines, E.1    Laroche, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.