SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 5398 LNAI, 2009, Pages 232-241

Spectrum modification for emotional speech synthesis

Authors (2): Přibilová, Anna (a); Přibil, Jiří (b)

(a) SLOVAK UNIVERSITY OF TECHNOLOGY (Slovakia)

(b) INSTITUTE OF PHOTONICS AND ELECTRONICS (Czech Republic)

Author keywords

Emotional speech; Emotional voice conversion; Spectral envelope; Speech synthesis

Indexed keywords

EMOTIONAL SPEECH; EMOTIONAL SPEECH SYNTHESIS; EMOTIONAL STATE; EMOTIONAL VOICE CONVERSION; HIGH FREQUENCY; LISTENING TESTS; NON-LINEAR; SPECTRAL ENVELOPE; SPECTRAL ENVELOPES; SPECTRAL FLATNESS; SPECTRAL MODIFICATIONS; SPECTRAL NOISE; SPECTRUM MODIFICATION; SPEECH SPECTRA; SPEECH SYNTHESIS; SPEECH ANALYSIS

EID: 67650486451 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-00525-1_23 Document Type: Conference Paper

Times cited: 11

References (19)

1
- 0037384712
- Vocal Communication of Emotion: A Review of Research Paradigms
- Scherer, K.R.: Vocal Communication of Emotion: A Review of Research Paradigms. Speech Communication 40, 227-256 (2003)
- (2003) Speech Communication , vol.40 , pp. 227-256
- Scherer, K.R.¹

2
- 0242721417
- Speech Emotion Recognition Using Hidden Markov Models
- Nwe, T.L., Foo, S.W., De Silva, L.C.: Speech Emotion Recognition Using Hidden Markov Models. Speech Communication 41, 603-623 (2003)
- (2003) Speech Communication , vol.41 , pp. 603-623
- Nwe, T.L.¹ Foo, S.W.² De Silva, L.C.³

3
- 33746410556
- Emotional Speech Recognition: Resources, Features, and Methods
- Ververidis, D., Kotropoulos, C.: Emotional Speech Recognition: Resources, Features, and Methods. Speech Communication 48, 1162-1181 (2006)
- (2006) Speech Communication , vol.48 , pp. 1162-1181
- Ververidis, D.¹ Kotropoulos, C.²

4
- 33947164164
- An Evaluation of the Robustness of Existing Supervised Machine Learning Approaches to the Classification of Emotions in Speech
- Shami, M., Verhelst, W.: An Evaluation of the Robustness of Existing Supervised Machine Learning Approaches to the Classification of Emotions in Speech. Speech Communication 49, 201-212 (2007)
- (2007) Speech Communication , vol.49 , pp. 201-212
- Shami, M.¹ Verhelst, W.²

5
- 58349108289
- Speech Emotion Perception by Human and Machine
- Tóth, S.L., Sztahó, D., Vicsi, K.: Speech Emotion Perception by Human and Machine. In: Esposito, A., Bourbakis, N.G., Avouris, N., Hatzilygeroudis, I. (eds.) Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction. LNCS (LNAI), vol. 5042, pp. 213-224. Springer, Heidelberg (2008)
- (2008) LNCS (LNAI) , vol.5042 , pp. 213-224
- Tóth, S.L.¹ Sztahó, D.² Vicsi, K.³

6
- 35548963442
- Applying an Analysis of Acted Vocal Emotions to Improve the Simulation of Synthetic Speech
- Murray, I.R., Arnott, J.L.: Applying an Analysis of Acted Vocal Emotions to Improve the Simulation of Synthetic Speech. Computer Speech and Language 22, 107-129 (2008)
- (2008) Computer Speech and Language , vol.22 , pp. 107-129
- Murray, I.R.¹ Arnott, J.L.²

7
- 21844456055
- The Role of Intonation in Emotional Expressions
- Bänziger, T., Scherer, K.R.: The Role of Intonation in Emotional Expressions. Speech Communication 46, 252-267 (2005)
- (2005) Speech Communication , vol.46 , pp. 252-267
- Bänziger, T.¹ Scherer, K.R.²

8
- 33745433543
- Cepstral Speech Model, Padé Approximation, Excitation, and Gain Matching in Cepstral Speech Synthesis
- Vích, R.: Cepstral Speech Model, Padé Approximation, Excitation, and Gain Matching in Cepstral Speech Synthesis. In: Proceedings of Biosignal, Brno, pp. 77-82 (2000)
- (2000) Proceedings of Biosignal , pp. 77-82
- Vích, R.¹

9
- 33646614226
- Acoustical Analysis of Speech
- Fant, G.: Acoustical Analysis of Speech. In: Crocker, M.J. (ed.) Encyclopedia of Acoustics, pp. 1589-1598. John Wiley & Sons, Chichester (1997)
- (1997) Encyclopedia of Acoustics , pp. 1589-1598
- Fant, G.¹

10
- 33645743720
- Fant, G.: Speech Acoustics and Phonetics. Kluwer Academic Publishers, Dordrecht (2004)
- (2004) Speech Acoustics and Phonetics
- Fant, G.¹

11
- 0003772719
- Time and Pitch Scale Modification of Audio Signals
- Laroche, J.: Time and Pitch Scale Modification of Audio Signals. In: Kahrs, M., Brandenburg, K. (eds.) Applications of Digital Signal Processing to Audio and Acoustics, pp. 279-309. Kluwer Academic Publishers, Dordrecht (2001)
- (2001) Applications of Digital Signal Processing to Audio and Acoustics , pp. 279-309
- Laroche, J.¹

12
- 33846952503
- Ensemble Methods for Spoken Emotion Recognition in Call-Centres
- Morrison, D., Wang, R., De Silva, L.C.: Ensemble Methods for Spoken Emotion Recognition in Call-Centres. Speech Communication 49, 98-112 (2007)
- (2007) Speech Communication , vol.49 , pp. 98-112
- Morrison, D.¹ Wang, R.² De Silva, L.C.³

13
- 33749071464
- Filters
- Dutilleux, P., Zölzer, U.: Filters. In: Zölzer, U. (ed.) DAFX - Digital Audio Effects, pp. 31-62. John Wiley & Sons, Chichester (2002)
- (2002) DAFX - Digital Audio Effects , pp. 31-62
- Dutilleux, P.¹ Zölzer, U.²

14
- 9444237982
- Emotions and Voice Quality: Experiments with Sinusoidal Modeling
- Drioli, C., Tisato, G., Cosi, P., Tesser, F.: Emotions and Voice Quality: Experiments with Sinusoidal Modeling. In: Proceedings of Voice Quality, Geneva, pp. 127-132 (2003)
- (2003) Proceedings of Voice Quality , pp. 127-132
- Drioli, C.¹ Tisato, G.² Cosi, P.³ Tesser, F.⁴

15
- 21844479845
- Synthesis of F0 Contours Using Generation Process Model Parameters Predicted from Unlabeled Corpora: Application to Emotional Speech Synthesis
- Hirose, K., Sato, K., Asano, Y., Minematsu, N.: Synthesis of F0 Contours Using Generation Process Model Parameters Predicted from Unlabeled Corpora: Application to Emotional Speech Synthesis. Speech Communication 46, 385-404 (2005)
- (2005) Speech Communication , vol.46 , pp. 385-404
- Hirose, K.¹ Sato, K.² Asano, Y.³ Minematsu, N.⁴

16
- 34047248387
- An Objective and Subjective Study of the Role of Semantics and Prosodic Features in Building Corpora for Emotional TTS
- Navas, E., Hernáez, I., Luengo, I.: An Objective and Subjective Study of the Role of Semantics and Prosodic Features in Building Corpora for Emotional TTS. IEEE Transactions on Audio, Speech, and Language Processing 14, 1117-1127 (2006)
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , pp. 1117-1127
- Navas, E.¹ Hernáez, I.² Luengo, I.³

17
- 41049115081
- Cabral, J.P., Oliveira, L.C.: EmoVoice: A System to Generate Emotions in Speech. In: Proceedings of Interspeech - ICSLP. pp. 1798-1801. Pittsburgh (2006)

18
- 38149119121
- Emotional Style Conversion in the TTS System with Cepstral Description
- Přibil, J., Přibilová, A.: Emotional Style Conversion in the TTS System with Cepstral Description. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS (LNAI), vol. 4775, pp. 65-73. Springer, Heidelberg (2007)
- (2007) LNCS (LNAI) , vol.4775 , pp. 65-73
- Přibil, J.¹ Přibilová, A.²

19
- 33751438738
- Non-Linear Frequency Scale Mapping for Voice Conversion in Text-to-Speech System with Cepstral Description
- Přibilová, A., Přibil, J.: Non-Linear Frequency Scale Mapping for Voice Conversion in Text-to-Speech System with Cepstral Description. Speech Communication 48, 1691-1703 (2006)
- (2006) Speech Communication , vol.48 , pp. 1691-1703
- Přibilová, A.¹ Přibil, J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.