



Volume 5398 LNAI, 2009, Pages 232-241

Spectrum modification for emotional speech synthesis

Author keywords

Emotional speech; Emotional voice conversion; Spectral envelope; Speech synthesis

Indexed keywords

EMOTIONAL SPEECH; EMOTIONAL SPEECH SYNTHESIS; EMOTIONAL STATE; EMOTIONAL VOICE CONVERSION; HIGH FREQUENCY; LISTENING TESTS; NON-LINEAR; SPECTRAL ENVELOPE; SPECTRAL ENVELOPES; SPECTRAL FLATNESS; SPECTRAL MODIFICATIONS; SPECTRAL NOISE; SPECTRUM MODIFICATION; SPEECH SPECTRA

EID: 67650486451     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-00525-1_23     Document Type: Conference Paper
Times cited: 11

References (19)
  • 1
    • 0037384712
    • Vocal Communication of Emotion: A Review of Research Paradigms
    • Scherer, K.R.: Vocal Communication of Emotion: A Review of Research Paradigms. Speech Communication 40, 227-256 (2003)
    • (2003) Speech Communication , vol.40 , pp. 227-256
    • Scherer, K.R.1
  • 2
    • 0242721417
    • Speech Emotion Recognition Using Hidden Markov Models
    • Nwe, T.L., Foo, S.W., De Silva, L.C.: Speech Emotion Recognition Using Hidden Markov Models. Speech Communication 41, 603-623 (2003)
    • (2003) Speech Communication , vol.41 , pp. 603-623
    • Nwe, T.L.1    Foo, S.W.2    De Silva, L.C.3
  • 3
    • 33746410556
    • Emotional Speech Recognition: Resources, Features, and Methods
    • Ververidis, D., Kotropoulos, C.: Emotional Speech Recognition: Resources, Features, and Methods. Speech Communication 48, 1162-1181 (2006)
    • (2006) Speech Communication , vol.48 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2
  • 4
    • 33947164164
    • An Evaluation of the Robustness of Existing Supervised Machine Learning Approaches to the Classification of Emotions in Speech
    • Shami, M., Verhelst, W.: An Evaluation of the Robustness of Existing Supervised Machine Learning Approaches to the Classification of Emotions in Speech. Speech Communication 49, 201-212 (2007)
    • (2007) Speech Communication , vol.49 , pp. 201-212
    • Shami, M.1    Verhelst, W.2
  • 5
    • 58349108289
    • Speech Emotion Perception by Human and Machine
    • Tóth, S.L., Sztahó, D., Vicsi, K.: Speech Emotion Perception by Human and Machine. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds.) Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction. LNCS (LNAI), vol. 5042, pp. 213-224. Springer, Heidelberg (2008)
    • (2008) LNCS (LNAI) , vol.5042 , pp. 213-224
    • Tóth, S.L.1    Sztahó, D.2    Vicsi, K.3
  • 6
    • 35548963442
    • Applying an Analysis of Acted Vocal Emotions to Improve the Simulation of Synthetic Speech
    • Murray, I.R., Arnott, J.L.: Applying an Analysis of Acted Vocal Emotions to Improve the Simulation of Synthetic Speech. Computer Speech and Language 22, 107-129 (2008)
    • (2008) Computer Speech and Language , vol.22 , pp. 107-129
    • Murray, I.R.1    Arnott, J.L.2
  • 7
    • 21844456055
    • The Role of Intonation in Emotional Expressions
    • Bänziger, T., Scherer, K.R.: The Role of Intonation in Emotional Expressions. Speech Communication 46, 252-267 (2005)
    • (2005) Speech Communication , vol.46 , pp. 252-267
    • Bänziger, T.1    Scherer, K.R.2
  • 8
    • 33745433543
    • Cepstral Speech Model, Padé Approximation, Excitation, and Gain Matching in Cepstral Speech Synthesis
    • Vích, R.: Cepstral Speech Model, Padé Approximation, Excitation, and Gain Matching in Cepstral Speech Synthesis. In: Proceedings of Biosignal, Brno, pp. 77-82 (2000)
    • (2000) Proceedings of Biosignal , pp. 77-82
    • Vích, R.1
  • 9
    • 33646614226
    • Acoustical Analysis of Speech
    • Fant, G.: Acoustical Analysis of Speech. In: Crocker, M.J. (ed.) Encyclopedia of Acoustics, pp. 1589-1598. John Wiley & Sons, Chichester (1997)
    • (1997) Encyclopedia of Acoustics , pp. 1589-1598
    • Fant, G.1
  • 11
    • 0003772719
    • Time and Pitch Scale Modification of Audio Signals
    • Laroche, J.: Time and Pitch Scale Modification of Audio Signals. In: Kahrs, M., Brandenburg, K. (eds.) Applications of Digital Signal Processing to Audio and Acoustics, pp. 279-309. Kluwer Academic Publishers, Dordrecht (2001)
    • (2001) Applications of Digital Signal Processing to Audio and Acoustics , pp. 279-309
    • Laroche, J.1
  • 12
    • 33846952503
    • Ensemble Methods for Spoken Emotion Recognition in Call-Centres
    • Morrison, D., Wang, R., De Silva, L.C.: Ensemble Methods for Spoken Emotion Recognition in Call-Centres. Speech Communication 49, 98-112 (2007)
    • (2007) Speech Communication , vol.49 , pp. 98-112
    • Morrison, D.1    Wang, R.2    De Silva, L.C.3
  • 13
    • 33749071464
    • Filters
    • Dutilleux, P., Zölzer, U.: Filters. In: Zölzer, U. (ed.) DAFX - Digital Audio Effects, pp. 31-62. John Wiley & Sons, Chichester (2002)
    • (2002) DAFX - Digital Audio Effects , pp. 31-62
    • Dutilleux, P.1    Zölzer, U.2
  • 14
    • 9444237982
    • Emotions and Voice Quality: Experiments with Sinusoidal Modeling
    • Drioli, C., Tisato, G., Cosi, P., Tesser, F.: Emotions and Voice Quality: Experiments with Sinusoidal Modeling. In: Proceedings of Voice Quality, Geneva, pp. 127-132 (2003)
    • (2003) Proceedings of Voice Quality , pp. 127-132
    • Drioli, C.1    Tisato, G.2    Cosi, P.3    Tesser, F.4
  • 15
    • 21844479845
    • Synthesis of F0 Contours Using Generation Process Model Parameters Predicted from Unlabeled Corpora: Application to Emotional Speech Synthesis
    • Hirose, K., Sato, K., Asano, Y., Minematsu, N.: Synthesis of F0 Contours Using Generation Process Model Parameters Predicted from Unlabeled Corpora: Application to Emotional Speech Synthesis. Speech Communication 46, 385-404 (2005)
    • (2005) Speech Communication , vol.46 , pp. 385-404
    • Hirose, K.1    Sato, K.2    Asano, Y.3    Minematsu, N.4
  • 16
    • 34047248387
    • An Objective and Subjective Study of the Role of Semantics and Prosodic Features in Building Corpora for Emotional TTS
    • Navas, E., Hernáez, I., Luengo, I.: An Objective and Subjective Study of the Role of Semantics and Prosodic Features in Building Corpora for Emotional TTS. IEEE Transactions on Audio, Speech, and Language Processing 14, 1117-1127 (2006)
    • (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , pp. 1117-1127
    • Navas, E.1    Hernáez, I.2    Luengo, I.3
  • 17
    • 41049115081
    • EmoVoice: A System to Generate Emotions in Speech
    • Cabral, J.P., Oliveira, L.C.: EmoVoice: A System to Generate Emotions in Speech. In: Proceedings of Interspeech - ICSLP. pp. 1798-1801. Pittsburgh (2006)
  • 18
    • 38149119121
    • Emotional Style Conversion in the TTS System with Cepstral Description
    • Přibil, J., Přibilová, A.: Emotional Style Conversion in the TTS System with Cepstral Description. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS (LNAI), vol. 4775, pp. 65-73. Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI) , vol.4775 , pp. 65-73
    • Přibil, J.1    Přibilová, A.2
  • 19
    • 33751438738
    • Non-Linear Frequency Scale Mapping for Voice Conversion in Text-to-Speech System with Cepstral Description
    • Přibilová, A., Přibil, J.: Non-Linear Frequency Scale Mapping for Voice Conversion in Text-to-Speech System with Cepstral Description. Speech Communication 48, 1691-1703 (2006)
    • (2006) Speech Communication , vol.48 , pp. 1691-1703
    • Přibilová, A.1    Přibil, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.