SCOPUS 정보 검색 플랫폼

IEEE Transactions on Multimedia

Volumn 12, Issue 6, 2010, Pages 490-501

Feature analysis and evaluation for automatic emotion identification in speech

(3) Luengo, Iker a Navas, Eva a Hernaez, Inmaculada a

a UNIVERSITY OF THE BASQUE COUNTRY UPV EHU (Spain)

Author keywords

Emotion identification; information fusion; parametrization

Indexed keywords

EARLY FUSION; EMOTION IDENTIFICATION; EMOTIONAL INFORMATION; FEATURE ANALYSIS; INFORMATION SOURCES; LATE FUSION; PARAMETRIZATIONS; PROSODIC FEATURES; SPECTRAL ENVELOPE PARAMETERS; SPECTRAL ENVELOPES; SPECTRAL STATISTICS; SYSTEMATIC STUDY; VOICE QUALITY;

FACE RECOGNITION; INFORMATION FUSION;

IDENTIFICATION (CONTROL SYSTEMS);

EID: 77956733663 PISSN: 15209210 EISSN: None Source Type: Journal
DOI: 10.1109/TMM.2010.2051872 Document Type: Article

Times cited : (131)

References (47)

1
- 0037384712
- Vocal communication of emotion: A review of research paradigms
- Apr.
- K. R. Scherer, "Vocal communication of emotion: A review of research paradigms," Speech Commun., vol.40, pp. 227-256, Apr. 2003.
- (2003) Speech Commun. , vol.40 , pp. 227-256
- Scherer, K.R.¹

2
- 0002171967
- Psychological models of emotion
- J. Borod, Ed. Oxford, U.K.: Oxford Univ. Press, ch. 6
- K. R. Scherer, "Psychological models of emotion," in The Neuropsychology of Emotion, J. Borod, Ed. Oxford, U.K.: Oxford Univ. Press, 2000, ch. 6, pp. 137-166.
- (2000) The Neuropsychology of Emotion , pp. 137-166
- Scherer, K.R.¹

3
- 84889960454
- An argument for basic emotions
- P. Ekman, "An argument for basic emotions," Cognit. Emotion, vol.6, pp. 169-200, 1992.
- (1992) Cognit. Emotion , vol.6 , pp. 169-200
- Ekman, P.¹

4
- 0003774595
- 3rd ed. Oxford, U.K.: Oxford Univ. Press
- C. Darwin, The Expression of the Emotions in Man and Animals, 3rd ed. Oxford, U.K.: Oxford Univ. Press, 1998.
- (1998) The Expression of the Emotions in Man and Animals
- Darwin, C.¹

5
- 0002689942
- Verification of acoustical correlates of emotional speech using formant-synthesis
- Belfast, Ireland, Sep.
- F. Burkhardt and W. F. Sendlmeier, "Verification of acoustical correlates of emotional speech using formant-synthesis," in Proc. ISCA Tutorial and Research Workshop Speech and Emotion, Belfast, Ireland, Sep. 2000, pp. 151-156.
- (2000) Proc. ISCA Tutorial and Research Workshop Speech and Emotion , pp. 151-156
- Burkhardt, F.¹ Sendlmeier, W.F.²

6
- 52949128737
- A three-layered model for expressive speech perception
- Oct.
- C. F. Huang and M. Akagi, "A three-layered model for expressive speech perception," Speech Commun., vol.50, pp. 810-828, Oct. 2008.
- (2008) Speech Commun. , vol.50 , pp. 810-828
- Huang, C.F.¹ Akagi, M.²

7
- 0000134170
- Vocal cues in emotion encoding and decoding
- K. R. Scherer, R. Banse, H. G.Wallbott, and T. Goldbeck, "Vocal cues in emotion encoding and decoding," Motiv. Emotion, vol.15, no.2, pp. 123-148, 1991.
- (1991) Motiv. Emotion , vol.15 , Issue.2 , pp. 123-148
- Scherer, K.R.¹ Banse, R.² Wallbott, H.G.³ Goldbeck, T.⁴

8
- 9444257562
- Ph.D. disservation, Universität des Saarlandes, Saarbrücken, Germany
- M. Schröder, "Speech and emotion research," Ph.D. disservation, Universität des Saarlandes, Saarbrücken, Germany, 2003.
- (2003) Speech and Emotion Research
- Schröder, M.¹

9
- 35048897466
- Acoustic analysis of emotional speech in standard Basque for emotion recognition
- ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct.
- E. Navas, I. Hernáez, A. Castelruiz, J. Sánchez, and I. Luengo, "Acoustic analysis of emotional speech in standard Basque for emotion recognition," in Progress in Pattern Recognition, Image Analysis and Applications, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct. 2004, vol.3287, pp. 386-393.
- (2004) Progress in Pattern Recognition, Image Analysis and Applications , vol.3287 , pp. 386-393
- Navas, E.¹ Hernáez, I.² Castelruiz, A.³ Sánchez, J.⁴ Luengo, I.⁵

10
- 23144458652
- Expressive speech: Production, perception and application to speech synthesis
- D. Erickson, "Expressive speech: Production, perception and application to speech synthesis," Acoust. Sci. Tech., vol.26, pp. 317-325, 2005.
- (2005) Acoust. Sci. Tech. , vol.26 , pp. 317-325
- Erickson, D.¹

11
- 0242721417
- Speech emotion recognition using hidden Markov models
- Jun.
- T. L. Nwe, S. W. Foo, and L. C. de Silva, "Speech emotion recognition using hidden Markov models," Speech Commun., vol.41, pp. 603-623, Jun. 2003.
- (2003) Speech Commun. , vol.41 , pp. 603-623
- Nwe, T.L.¹ Foo, S.W.² De Silva, L.C.³

12
- 48149087416
- Real-time emotion detection system using speech: Multi-modal fusion of different timescale features
- Crete, Greece, Oct.
- S. Kim, P. G. Georgiou, S. Lee, and S. Narayanan, "Real-time emotion detection system using speech: Multi-modal fusion of different timescale features," in Proc. Int. Workshop Multimedia Signal Processing, Crete, Greece, Oct. 2007, pp. 48-51.
- (2007) Proc. Int. Workshop Multimedia Signal Processing , pp. 48-51
- Kim, S.¹ Georgiou, P.G.² Lee, S.³ Narayanan, S.⁴

13
- 38049048651
- Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing
- ser. Lecture Notes in Computer Science. Berlin, Germany: Springer
- B. Vlasenko, B. Schuller, A. Wendemuth, and G. Rigoll, "Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing," in Affective Computing and Intelligent Interaction, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, 2007, vol.4738, pp. 139-147.
- (2007) Affective Computing and Intelligent Interaction , vol.4738 , pp. 139-147
- Vlasenko, B.¹ Schuller, B.² Wendemuth, A.³ Rigoll, G.⁴

14
- 0034346176
- Emotion recognition in speech using neural networks
- Dec.
- J. Nicholson, K. Takahashi, and R. Nakatsu, "Emotion recognition in speech using neural networks," Neural Comput. Appl., vol.9, pp. 290-296, Dec. 2000.
- (2000) Neural Comput. Appl. , vol.9 , pp. 290-296
- Nicholson, J.¹ Takahashi, K.² Nakatsu, R.³

15
- 85009223246
- Emotion recognition by speech signals
- Geneva, Switzerland
- O. W. Kwon, K. Chan, J. Hao, and T. W. Lee, "Emotion recognition by speech signals," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 125-128.
- (2003) Proc. Eurospeech , pp. 125-128
- Kwon, O.W.¹ Chan, K.² Hao, J.³ Lee, T.W.⁴

16
- 85034230784
- Improving automatic emotion recognition from speech via gender differentiation
- Genoa, Italy, May
- T. Vogt and E. André, "Improving automatic emotion recognition from speech via gender differentiation," in Proc. LREC, Genoa, Italy, May 2006.
- (2006) Proc. LREC
- Vogt, T.¹ André, E.²

17
- 33745198227
- Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
- Lisbon, Portugal, Sep.
- B. Schuller, R. Müller, M. Lang, and G. Rigoll, "Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 805-808.
- (2005) Proc. Interspeech , pp. 805-808
- Schuller, B.¹ Müller, R.² Lang, M.³ Rigoll, G.⁴

18
- 53049109134
- Two-level fusion to improve emotion classification in spoken dialogue systems
- ser. Lecture Notes in Computer Science. Berlin, Germany: Springer
- R. López-Cozar, Z. Callejas, M. Kroul, J. Nouza, and J. Silovský, "Two-level fusion to improve emotion classification in spoken dialogue systems," in Graphics Recognition. Recent Advances and New Opportunities, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, 2008, vol.5246, pp. 617-624.
- (2008) Graphics Recognition. Recent Advances and New Opportunities , vol.5246 , pp. 617-624
- López-Cozar, R.¹ Callejas, Z.² Kroul, M.³ Nouza, J.⁴ Silovský, J.⁵

19
- 34547496515
- The relevance of voice quality features in speaker independent emotion recognition
- Honolulu, HI, Apr.
- M. Lugger and B. Yang, "The relevance of voice quality features in speaker independent emotion recognition," in Proc. ICCASP, Honolulu, HI, Apr. 2007, vol.4, pp. 17-20.
- (2007) Proc. ICCASP , vol.4 , pp. 17-20
- Lugger, M.¹ Yang, B.²

20
- 0037380186
- The role of voice quality in communicating emotion, mood and attitude
- Apr.
- C. Gobl and A. N. Chasaide, "The role of voice quality in communicating emotion, mood and attitude," Speech Commun., vol.40, pp. 189-212, Apr. 2003.
- (2003) Speech Commun. , vol.40 , pp. 189-212
- Gobl, C.¹ Chasaide, A.N.²

21
- 85009159448
- Emotional space improves emotion recognition
- Sep.
- R. Tato, R. Santos, R. Kompe, and J. Pardo, "Emotional space improves emotion recognition," in Proc. ICSLP, Sep. 2002, pp. 2029-2032.
- (2002) Proc. ICSLP , pp. 2029-2032
- Tato, R.¹ Santos, R.² Kompe, R.³ Pardo, J.⁴

22
- 77956758858
- Enhanced robustness in speech emotion recognition combining acoustic and semantic analyses
- Santorino, Greece, Sep.
- R. Müller, B. Schuller, and G. Rigoll, "Enhanced robustness in speech emotion recognition combining acoustic and semantic analyses," in Proc. From Signals to Signs of Emotion and Vice Versa, Santorino, Greece, Sep. 2004.
- (2004) Proc. from Signals to Signs of Emotion and Vice Versa
- Müller, R.¹ Schuller, B.² Rigoll, G.³

23
- 85008007181
- Speech emotion recognition using hidden Markov models
- Aalborg, Denmark, Sep.
- A. Nogueiras, A. Moreno, A. Bonafonte, and J. B. Mariño, "Speech emotion recognition using hidden Markov models," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001, pp. 2679-2682.
- (2001) Proc. Eurospeech , pp. 2679-2682
- Nogueiras, A.¹ Moreno, A.² Bonafonte, A.³ Mariño, J.B.⁴

24
- 0032021555
- On combining classifiers
- Mar.
- J. Kittler, M. Hatef, R. P. W. Duin, and J. Matas, "On combining classifiers," IEEE Trans. Pattern Anal. Mach. Intell., vol.20, no.3, pp. 226-239, Mar. 1998.
- (1998) IEEE Trans. Pattern Anal. Mach. Intell. , vol.20 , Issue.3 , pp. 226-239
- Kittler, J.¹ Hatef, M.² Duin, R.P.W.³ Matas, J.⁴

25
- 0005946314
- An overview of classifier fusion methods
- Feb.
- D. Ruta and B. Gabrys, "An overview of classifier fusion methods," Comput. Inf. Syst., vol.7, no.1, pp. 1-10, Feb. 2000.
- (2000) Comput. Inf. Syst. , vol.7 , Issue.1 , pp. 1-10
- Ruta, D.¹ Gabrys, B.²

26
- 33745202280
- A database of German emotional speech
- Lisbon, Portugal, Sep.
- F. Burkhardt, A. Paeschke, M. Rolfes, W. F. Sendlmeier, and B. Weiss, "A database of German emotional speech," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 1517-1520.
- (2005) Proc. Interspeech , pp. 1517-1520
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.F.⁴ Weiss, B.⁵

27
- 1842476689
- Efficient voice activity detection algorithms using long-term speech information
- Apr.
- J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol.42, pp. 271-287, Apr. 2004.
- (2004) Speech Commun. , vol.42 , pp. 271-287
- Ramirez, J.¹ Segura, J.C.² Benitez, C.³ De La Torre, A.⁴ Rubio, A.⁵

28
- 0032875050
- A method for generating natural-sounding speech stimuli for cognitive brain research
- Aug.
- P. Alku, H. Tiitinen, and R. Näätänen, "A method for generating natural-sounding speech stimuli for cognitive brain research," Clin. Neurophysiol., vol.110, pp. 1329-1333, Aug. 1999.
- (1999) Clin. Neurophysiol. , vol.110 , pp. 1329-1333
- Alku, P.¹ Tiitinen, H.² Näätänen, R.³

29
- 34547503468
- Evaluation of pitch detection algorithms under real conditions
- Honolulu, HI, Apr.
- I. Luengo, I. Saratxaga, E. Navas, I. Hernáez, J. Sánchez, and I. n. Sainz, "Evaluation of pitch detection algorithms under real conditions," in Proc. ICASSP, Honolulu, HI, Apr. 2007, pp. 1057-1060.
- (2007) Proc. ICASSP , pp. 1057-1060
- Luengo, I.¹ Saratxaga, I.² Navas, E.³ Hernáez, I.⁴ Sánchez, J.⁵ Sainz, I.N.⁶

30
- 77956760083
- Detección de vocales mediante modelado de clusters de fonemas
- Sep.
- I. Luengo, E. Navas, J. Sánchez, and I. Hernáez, "Detección de vocales mediante modelado de clusters de fonemas," Procesado Del Lenguaje Natural, vol.43, pp. 121-128, Sep. 2009.
- (2009) Procesado Del Lenguaje Natural , vol.43 , pp. 121-128
- Luengo, I.¹ Navas, E.² Sánchez, J.³ Hernáez, I.⁴

31
- 58349084477
- Exploiting a vowel based approach for acted emotion recognition
- ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct.
- F. Ringeval and M. Chetouani, "Exploiting a vowel based approach for acted emotion recognition," in Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct. 2008, vol.5042, pp. 243-254.
- (2008) Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction , vol.5042 , pp. 243-254
- Ringeval, F.¹ Chetouani, M.²

32
- 0036508041
- Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range
- Mar.
- T. Bäckström, P. Alku, and E. Vilkman, "Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range," IEEE Trans. Speech Audio Process., vol.10, no.3, pp. 186-192, Mar. 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 186-192
- Bäckström, T.¹ Alku, P.² Vilkman, E.³

33
- 0032623865
- An acoustic description of consonant reduction
- Jun.
- R. van Son and L. Pols, "An acoustic description of consonant reduction," Speech Commun., vol.28, pp. 125-140, Jun. 1999.
- (1999) Speech Commun. , vol.28 , pp. 125-140
- Van Son, R.¹ Pols, L.²

34
- 0032097263
- New York: Academic
- K. Fukunaga, Introduction to Statistical Pattern Recognition. New York: Academic, 1990.
- (1990) Introduction to Statistical Pattern Recognition
- Fukunaga, K.¹

35
- 0003922190
- New York: Wiley
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. New York: Wiley, 2001.
- (2001) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

36
- 33745561205
- An introduction to variable and feature selection
- Mar.
- I. Guyon and A. Elisseeff, "An introduction to variable and feature selection," J. Mach. Learn. Res., vol.3, pp. 1157-1182, Mar. 2003.
- (2003) J. Mach. Learn. Res. , vol.3 , pp. 1157-1182
- Guyon, I.¹ Elisseeff, A.²

37
- 24344458137
- Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
- Aug.
- H. Peng, F. Long, and C. Ding, "Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy, " IEEE Trans. Pattern Anal. Mach. Intell., vol.27, no.8, pp. 1226-1238, Aug. 2005.
- (2005) IEEE Trans. Pattern Anal. Mach. Intell. , vol.27 , Issue.8 , pp. 1226-1238
- Peng, H.¹ Long, F.² Ding, C.³

38
- 27144489164
- A tutorial on support vector machines for pattern recognition
- C. J. Burges, "A tutorial on support vector machines for pattern recognition," Data Min. Knowl. Discov., vol.2, pp. 121-167, 1998.
- (1998) Data Min. Knowl. Discov. , vol.2 , pp. 121-167
- Burges, C.J.¹

39
- 0036505670
- A comparison of methods for multi-class support vector machines
- Mar.
- C. W. Hsu and C. J. Lin, "A comparison of methods for multi-class support vector machines," IEEE Trans. Neural Netw., vol.13, no.2, pp. 415-425, Mar. 2002.
- (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.2 , pp. 415-425
- Hsu, C.W.¹ Lin, C.J.²

40
- 27644562797
- Adapted user-dependent multimodal biometric authentication exploiting general information
- Dec.
- J. Fierrez-Aguilar, D. Garcia-Romero, J. Ortega-Garcia, and J. Gonzalez-Rodriguez, "Adapted user-dependent multimodal biometric authentication exploiting general information," Pattern Recognit. Lett., vol.26, no.16, pp. 2628-2639, Dec. 2005.
- (2005) Pattern Recognit. Lett. , vol.26 , Issue.16 , pp. 2628-2639
- Fierrez-Aguilar, J.¹ Garcia-Romero, D.² Ortega-Garcia, J.³ Gonzalez-Rodriguez, J.⁴

41
- 84901473777
- Multi-modal identity verification using support vector machines (SVM)
- Paris, France, Jul.
- B. Gutschoven and P. Verlinde, "Multi-modal identity verification using support vector machines (SVM)," in Proc. Int. Conf. Information Fusion, Paris, France, Jul. 2000, vol.2, pp. 3-8.
- (2000) Proc. Int. Conf. Information Fusion , vol.2 , pp. 3-8
- Gutschoven, B.¹ Verlinde, P.²

42
- 0030093965
- Acoustic profiles in vocal emotion expression
- R. Banse and K. R. Scherer, "Acoustic profiles in vocal emotion expression," J. Personal. Social Pathol., vol.70, no.3, pp. 614-636, 1996.
- (1996) J. Personal. Social Pathol. , vol.70 , Issue.3 , pp. 614-636
- Banse, R.¹ Scherer, K.R.²

43
- 21544475426
- o and pause features analysis for anger and fear detection in real-life spoken dialogs
- Nara, Japan, Mar.
- o and pause features analysis for anger and fear detection in real-life spoken dialogs," in Proc. Speech Prosody, Nara, Japan, Mar. 2004, pp. 205-208.
- (2004) Proc. Speech Prosody , pp. 205-208
- Devillers, L.¹ Vasilescu, I.² Vidrascu, L.³

44
- 34547505647
- Combining efforts for improving automatic classification of emotional user states
- Ljubljana, Slovenia, Oct.
- A. Batliner, S. Steidl, B. Schuller, D. Seppi, K. Laskowski, T. Vogt, L. Devillers, L. Vidrascu, N. Amir, L. Kessous, and V. Aharonson, "Combining efforts for improving automatic classification of emotional user states," in Proc. Information Society-Language Technologies Conf. (IS-LTC), Ljubljana, Slovenia, Oct. 2006, pp. 240-245.
- (2006) Proc. Information Society-Language Technologies Conf. (IS-LTC) , pp. 240-245
- Batliner, A.¹ Steidl, S.² Schuller, B.³ Seppi, D.⁴ Laskowski, K.⁵ Vogt, T.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Amir, N.⁹ Kessous, L.¹⁰ Aharonson, V.¹¹

45
- 70450161311
- Combining spectral and prosodic information for emotion recognition in the interspeech 2009 emotion challenge
- Brighton, U.K., Sep.
- I. Luengo, E. Navas, and I. Hernáez, "Combining spectral and prosodic information for emotion recognition in the interspeech 2009 emotion challenge," in Proc. Interspeech, Brighton, U.K., Sep. 2009, pp. 332-335.
- (2009) Proc. Interspeech , pp. 332-335
- Luengo, I.¹ Navas, E.² Hernáez, I.³

46
- 33744926919
- Automatic emotion recognition using prosodic parameters
- Lisbon, Portugal, Sep.
- I. Luengo, E. Navas, I. Hernáez, and J. Sanchez, "Automatic emotion recognition using prosodic parameters," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 493-496.
- (2005) Proc. Interspeech , pp. 493-496
- Luengo, I.¹ Navas, E.² Hernáez, I.³ Sanchez, J.⁴

47
- 34047248387
- An objective and subjective study of the role of semantics in building corpora for TTS
- Jul.
- E. Navas, I. Hernáez, and I. Luengo, "An objective and subjective study of the role of semantics in building corpora for TTS," IEEE Trans. Speech Audio Process., vol.14, no.4, pp. 1117-1127, Jul. 2006.
- (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.4 , pp. 1117-1127
- Navas, E.¹ Hernáez, I.² Luengo, I.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.