



Volume 12, Issue 6, 2010, Pages 490-501

Feature analysis and evaluation for automatic emotion identification in speech

Author keywords

Emotion identification; information fusion; parametrization

Indexed keywords

EARLY FUSION; EMOTION IDENTIFICATION; EMOTIONAL INFORMATION; FEATURE ANALYSIS; INFORMATION SOURCES; LATE FUSION; PARAMETRIZATIONS; PROSODIC FEATURES; SPECTRAL ENVELOPE PARAMETERS; SPECTRAL ENVELOPES; SPECTRAL STATISTICS; SYSTEMATIC STUDY; VOICE QUALITY

EID: 77956733663     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2010.2051872     Document Type: Article
Times cited: 131

References (47)
  • 1
    • 0037384712
    • Vocal communication of emotion: A review of research paradigms
    • Apr.
    • K. R. Scherer, "Vocal communication of emotion: A review of research paradigms," Speech Commun., vol.40, pp. 227-256, Apr. 2003.
    • (2003) Speech Commun. , vol.40 , pp. 227-256
    • Scherer, K.R.1
  • 2
    • 0002171967
    • Psychological models of emotion
    • J. Borod, Ed. Oxford, U.K.: Oxford Univ. Press, ch. 6
    • K. R. Scherer, "Psychological models of emotion," in The Neuropsychology of Emotion, J. Borod, Ed. Oxford, U.K.: Oxford Univ. Press, 2000, ch. 6, pp. 137-166.
    • (2000) The Neuropsychology of Emotion , pp. 137-166
    • Scherer, K.R.1
  • 3
    • 84889960454
    • An argument for basic emotions
    • P. Ekman, "An argument for basic emotions," Cognit. Emotion, vol.6, pp. 169-200, 1992.
    • (1992) Cognit. Emotion , vol.6 , pp. 169-200
    • Ekman, P.1
  • 5
    • 0002689942
    • Verification of acoustical correlates of emotional speech using formant-synthesis
    • Belfast, Ireland, Sep.
    • F. Burkhardt and W. F. Sendlmeier, "Verification of acoustical correlates of emotional speech using formant-synthesis," in Proc. ISCA Tutorial and Research Workshop Speech and Emotion, Belfast, Ireland, Sep. 2000, pp. 151-156.
    • (2000) Proc. ISCA Tutorial and Research Workshop Speech and Emotion , pp. 151-156
    • Burkhardt, F.1    Sendlmeier, W.F.2
  • 6
    • 52949128737
    • A three-layered model for expressive speech perception
    • Oct.
    • C. F. Huang and M. Akagi, "A three-layered model for expressive speech perception," Speech Commun., vol.50, pp. 810-828, Oct. 2008.
    • (2008) Speech Commun. , vol.50 , pp. 810-828
    • Huang, C.F.1    Akagi, M.2
  • 7
    • 0000134170
    • Vocal cues in emotion encoding and decoding
    • K. R. Scherer, R. Banse, H. G. Wallbott, and T. Goldbeck, "Vocal cues in emotion encoding and decoding," Motiv. Emotion, vol.15, no.2, pp. 123-148, 1991.
    • (1991) Motiv. Emotion , vol.15 , Issue.2 , pp. 123-148
    • Scherer, K.R.1    Banse, R.2    Wallbott, H.G.3    Goldbeck, T.4
  • 8
    • 9444257562
    • Ph.D. dissertation, Universität des Saarlandes, Saarbrücken, Germany
    • M. Schröder, "Speech and emotion research," Ph.D. dissertation, Universität des Saarlandes, Saarbrücken, Germany, 2003.
    • (2003) Speech and Emotion Research
    • Schröder, M.1
  • 9
    • 35048897466
    • Acoustic analysis of emotional speech in standard Basque for emotion recognition
    • ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct.
    • E. Navas, I. Hernáez, A. Castelruiz, J. Sánchez, and I. Luengo, "Acoustic analysis of emotional speech in standard Basque for emotion recognition," in Progress in Pattern Recognition, Image Analysis and Applications, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct. 2004, vol.3287, pp. 386-393.
    • (2004) Progress in Pattern Recognition, Image Analysis and Applications , vol.3287 , pp. 386-393
    • Navas, E.1    Hernáez, I.2    Castelruiz, A.3    Sánchez, J.4    Luengo, I.5
  • 10
    • 23144458652
    • Expressive speech: Production, perception and application to speech synthesis
    • D. Erickson, "Expressive speech: Production, perception and application to speech synthesis," Acoust. Sci. Tech., vol.26, pp. 317-325, 2005.
    • (2005) Acoust. Sci. Tech. , vol.26 , pp. 317-325
    • Erickson, D.1
  • 11
    • 0242721417
    • Speech emotion recognition using hidden Markov models
    • Jun.
    • T. L. Nwe, S. W. Foo, and L. C. de Silva, "Speech emotion recognition using hidden Markov models," Speech Commun., vol.41, pp. 603-623, Jun. 2003.
    • (2003) Speech Commun. , vol.41 , pp. 603-623
    • Nwe, T.L.1    Foo, S.W.2    De Silva, L.C.3
  • 12
    • 48149087416
    • Real-time emotion detection system using speech: Multi-modal fusion of different timescale features
    • Crete, Greece, Oct.
    • S. Kim, P. G. Georgiou, S. Lee, and S. Narayanan, "Real-time emotion detection system using speech: Multi-modal fusion of different timescale features," in Proc. Int. Workshop Multimedia Signal Processing, Crete, Greece, Oct. 2007, pp. 48-51.
    • (2007) Proc. Int. Workshop Multimedia Signal Processing , pp. 48-51
    • Kim, S.1    Georgiou, P.G.2    Lee, S.3    Narayanan, S.4
  • 13
    • 38049048651
    • Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing
    • ser. Lecture Notes in Computer Science. Berlin, Germany: Springer
    • B. Vlasenko, B. Schuller, A. Wendemuth, and G. Rigoll, "Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing," in Affective Computing and Intelligent Interaction, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, 2007, vol.4738, pp. 139-147.
    • (2007) Affective Computing and Intelligent Interaction , vol.4738 , pp. 139-147
    • Vlasenko, B.1    Schuller, B.2    Wendemuth, A.3    Rigoll, G.4
  • 14
    • 0034346176
    • Emotion recognition in speech using neural networks
    • Dec.
    • J. Nicholson, K. Takahashi, and R. Nakatsu, "Emotion recognition in speech using neural networks," Neural Comput. Appl., vol.9, pp. 290-296, Dec. 2000.
    • (2000) Neural Comput. Appl. , vol.9 , pp. 290-296
    • Nicholson, J.1    Takahashi, K.2    Nakatsu, R.3
  • 15
    • 85009223246
    • Emotion recognition by speech signals
    • Geneva, Switzerland
    • O. W. Kwon, K. Chan, J. Hao, and T. W. Lee, "Emotion recognition by speech signals," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 125-128.
    • (2003) Proc. Eurospeech , pp. 125-128
    • Kwon, O.W.1    Chan, K.2    Hao, J.3    Lee, T.W.4
  • 16
    • 85034230784
    • Improving automatic emotion recognition from speech via gender differentiation
    • Genoa, Italy, May
    • T. Vogt and E. André, "Improving automatic emotion recognition from speech via gender differentiation," in Proc. LREC, Genoa, Italy, May 2006.
    • (2006) Proc. LREC
    • Vogt, T.1    André, E.2
  • 17
    • 33745198227
    • Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
    • Lisbon, Portugal, Sep.
    • B. Schuller, R. Müller, M. Lang, and G. Rigoll, "Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 805-808.
    • (2005) Proc. Interspeech , pp. 805-808
    • Schuller, B.1    Müller, R.2    Lang, M.3    Rigoll, G.4
  • 18
    • 53049109134
    • Two-level fusion to improve emotion classification in spoken dialogue systems
    • ser. Lecture Notes in Computer Science. Berlin, Germany: Springer
    • R. López-Cozar, Z. Callejas, M. Kroul, J. Nouza, and J. Silovský, "Two-level fusion to improve emotion classification in spoken dialogue systems," in Graphics Recognition. Recent Advances and New Opportunities, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, 2008, vol.5246, pp. 617-624.
    • (2008) Graphics Recognition. Recent Advances and New Opportunities , vol.5246 , pp. 617-624
    • López-Cozar, R.1    Callejas, Z.2    Kroul, M.3    Nouza, J.4    Silovský, J.5
  • 19
    • 34547496515
    • The relevance of voice quality features in speaker independent emotion recognition
    • Honolulu, HI, Apr.
    • M. Lugger and B. Yang, "The relevance of voice quality features in speaker independent emotion recognition," in Proc. ICASSP, Honolulu, HI, Apr. 2007, vol.4, pp. 17-20.
    • (2007) Proc. ICASSP , vol.4 , pp. 17-20
    • Lugger, M.1    Yang, B.2
  • 20
    • 0037380186
    • The role of voice quality in communicating emotion, mood and attitude
    • Apr.
    • C. Gobl and A. N. Chasaide, "The role of voice quality in communicating emotion, mood and attitude," Speech Commun., vol.40, pp. 189-212, Apr. 2003.
    • (2003) Speech Commun. , vol.40 , pp. 189-212
    • Gobl, C.1    Chasaide, A.N.2
  • 21
    • 85009159448
    • Emotional space improves emotion recognition
    • Sep.
    • R. Tato, R. Santos, R. Kompe, and J. Pardo, "Emotional space improves emotion recognition," in Proc. ICSLP, Sep. 2002, pp. 2029-2032.
    • (2002) Proc. ICSLP , pp. 2029-2032
    • Tato, R.1    Santos, R.2    Kompe, R.3    Pardo, J.4
  • 22
    • 77956758858
    • Enhanced robustness in speech emotion recognition combining acoustic and semantic analyses
    • Santorini, Greece, Sep.
    • R. Müller, B. Schuller, and G. Rigoll, "Enhanced robustness in speech emotion recognition combining acoustic and semantic analyses," in Proc. From Signals to Signs of Emotion and Vice Versa, Santorini, Greece, Sep. 2004.
    • (2004) Proc. from Signals to Signs of Emotion and Vice Versa
    • Müller, R.1    Schuller, B.2    Rigoll, G.3
  • 23
    • 85008007181
    • Speech emotion recognition using hidden Markov models
    • Aalborg, Denmark, Sep.
    • A. Nogueiras, A. Moreno, A. Bonafonte, and J. B. Mariño, "Speech emotion recognition using hidden Markov models," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001, pp. 2679-2682.
    • (2001) Proc. Eurospeech , pp. 2679-2682
    • Nogueiras, A.1    Moreno, A.2    Bonafonte, A.3    Mariño, J.B.4
  • 25
    • 0005946314
    • An overview of classifier fusion methods
    • Feb.
    • D. Ruta and B. Gabrys, "An overview of classifier fusion methods," Comput. Inf. Syst., vol.7, no.1, pp. 1-10, Feb. 2000.
    • (2000) Comput. Inf. Syst. , vol.7 , Issue.1 , pp. 1-10
    • Ruta, D.1    Gabrys, B.2
  • 27
    • 1842476689
    • Efficient voice activity detection algorithms using long-term speech information
    • Apr.
    • J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, and A. Rubio, "Efficient voice activity detection algorithms using long-term speech information," Speech Commun., vol.42, pp. 271-287, Apr. 2004.
    • (2004) Speech Commun. , vol.42 , pp. 271-287
    • Ramirez, J.1    Segura, J.C.2    Benitez, C.3    De La Torre, A.4    Rubio, A.5
  • 28
    • 0032875050
    • A method for generating natural-sounding speech stimuli for cognitive brain research
    • Aug.
    • P. Alku, H. Tiitinen, and R. Näätänen, "A method for generating natural-sounding speech stimuli for cognitive brain research," Clin. Neurophysiol., vol.110, pp. 1329-1333, Aug. 1999.
    • (1999) Clin. Neurophysiol. , vol.110 , pp. 1329-1333
    • Alku, P.1    Tiitinen, H.2    Näätänen, R.3
  • 29
    • 34547503468
    • Evaluation of pitch detection algorithms under real conditions
    • Honolulu, HI, Apr.
    • I. Luengo, I. Saratxaga, E. Navas, I. Hernáez, J. Sánchez, and I. Sainz, "Evaluation of pitch detection algorithms under real conditions," in Proc. ICASSP, Honolulu, HI, Apr. 2007, pp. 1057-1060.
    • (2007) Proc. ICASSP , pp. 1057-1060
    • Luengo, I.1    Saratxaga, I.2    Navas, E.3    Hernáez, I.4    Sánchez, J.5    Sainz, I.N.6
  • 30
    • 77956760083
    • Detección de vocales mediante modelado de clusters de fonemas
    • Sep.
    • I. Luengo, E. Navas, J. Sánchez, and I. Hernáez, "Detección de vocales mediante modelado de clusters de fonemas," Procesado Del Lenguaje Natural, vol.43, pp. 121-128, Sep. 2009.
    • (2009) Procesado Del Lenguaje Natural , vol.43 , pp. 121-128
    • Luengo, I.1    Navas, E.2    Sánchez, J.3    Hernáez, I.4
  • 31
    • 58349084477
    • Exploiting a vowel based approach for acted emotion recognition
    • ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct.
    • F. Ringeval and M. Chetouani, "Exploiting a vowel based approach for acted emotion recognition," in Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction, ser. Lecture Notes in Computer Science. Berlin, Germany: Springer, Oct. 2008, vol.5042, pp. 243-254.
    • (2008) Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction , vol.5042 , pp. 243-254
    • Ringeval, F.1    Chetouani, M.2
  • 32
    • 0036508041
    • Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range
    • Mar.
    • T. Bäckström, P. Alku, and E. Vilkman, "Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range," IEEE Trans. Speech Audio Process., vol.10, no.3, pp. 186-192, Mar. 2002.
    • (2002) IEEE Trans. Speech Audio Process. , vol.10 , Issue.3 , pp. 186-192
    • Bäckström, T.1    Alku, P.2    Vilkman, E.3
  • 33
    • 0032623865
    • An acoustic description of consonant reduction
    • Jun.
    • R. van Son and L. Pols, "An acoustic description of consonant reduction," Speech Commun., vol.28, pp. 125-140, Jun. 1999.
    • (1999) Speech Commun. , vol.28 , pp. 125-140
    • Van Son, R.1    Pols, L.2
  • 36
    • 33745561205
    • An introduction to variable and feature selection
    • Mar.
    • I. Guyon and A. Elisseeff, "An introduction to variable and feature selection," J. Mach. Learn. Res., vol.3, pp. 1157-1182, Mar. 2003.
    • (2003) J. Mach. Learn. Res. , vol.3 , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 37
    • 24344458137
    • Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy
    • Aug.
    • H. Peng, F. Long, and C. Ding, "Feature selection based on mutual information: Criteria of max-dependency, max-relevance, and min-redundancy," IEEE Trans. Pattern Anal. Mach. Intell., vol.27, no.8, pp. 1226-1238, Aug. 2005.
    • (2005) IEEE Trans. Pattern Anal. Mach. Intell. , vol.27 , Issue.8 , pp. 1226-1238
    • Peng, H.1    Long, F.2    Ding, C.3
  • 38
    • 27144489164
    • A tutorial on support vector machines for pattern recognition
    • C. J. Burges, "A tutorial on support vector machines for pattern recognition," Data Min. Knowl. Discov., vol.2, pp. 121-167, 1998.
    • (1998) Data Min. Knowl. Discov. , vol.2 , pp. 121-167
    • Burges, C.J.1
  • 39
    • 0036505670
    • A comparison of methods for multi-class support vector machines
    • Mar.
    • C. W. Hsu and C. J. Lin, "A comparison of methods for multi-class support vector machines," IEEE Trans. Neural Netw., vol.13, no.2, pp. 415-425, Mar. 2002.
    • (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.2 , pp. 415-425
    • Hsu, C.W.1    Lin, C.J.2
  • 40
    • 27644562797
    • Adapted user-dependent multimodal biometric authentication exploiting general information
    • Dec.
    • J. Fierrez-Aguilar, D. Garcia-Romero, J. Ortega-Garcia, and J. Gonzalez-Rodriguez, "Adapted user-dependent multimodal biometric authentication exploiting general information," Pattern Recognit. Lett., vol.26, no.16, pp. 2628-2639, Dec. 2005.
    • (2005) Pattern Recognit. Lett. , vol.26 , Issue.16 , pp. 2628-2639
    • Fierrez-Aguilar, J.1    Garcia-Romero, D.2    Ortega-Garcia, J.3    Gonzalez-Rodriguez, J.4
  • 41
    • 84901473777
    • Multi-modal identity verification using support vector machines (SVM)
    • Paris, France, Jul.
    • B. Gutschoven and P. Verlinde, "Multi-modal identity verification using support vector machines (SVM)," in Proc. Int. Conf. Information Fusion, Paris, France, Jul. 2000, vol.2, pp. 3-8.
    • (2000) Proc. Int. Conf. Information Fusion , vol.2 , pp. 3-8
    • Gutschoven, B.1    Verlinde, P.2
  • 42
    • 0030093965
    • Acoustic profiles in vocal emotion expression
    • R. Banse and K. R. Scherer, "Acoustic profiles in vocal emotion expression," J. Personal. Social Psychol., vol.70, no.3, pp. 614-636, 1996.
    • (1996) J. Personal. Social Psychol. , vol.70 , Issue.3 , pp. 614-636
    • Banse, R.1    Scherer, K.R.2
  • 43
    • 21544475426
    • F0 and pause features analysis for anger and fear detection in real-life spoken dialogs
    • Nara, Japan, Mar.
    • L. Devillers, I. Vasilescu, and L. Vidrascu, "F0 and pause features analysis for anger and fear detection in real-life spoken dialogs," in Proc. Speech Prosody, Nara, Japan, Mar. 2004, pp. 205-208.
    • (2004) Proc. Speech Prosody , pp. 205-208
    • Devillers, L.1    Vasilescu, I.2    Vidrascu, L.3
  • 45
    • 70450161311
    • Combining spectral and prosodic information for emotion recognition in the interspeech 2009 emotion challenge
    • Brighton, U.K., Sep.
    • I. Luengo, E. Navas, and I. Hernáez, "Combining spectral and prosodic information for emotion recognition in the interspeech 2009 emotion challenge," in Proc. Interspeech, Brighton, U.K., Sep. 2009, pp. 332-335.
    • (2009) Proc. Interspeech , pp. 332-335
    • Luengo, I.1    Navas, E.2    Hernáez, I.3
  • 46
    • 33744926919
    • Automatic emotion recognition using prosodic parameters
    • Lisbon, Portugal, Sep.
    • I. Luengo, E. Navas, I. Hernáez, and J. Sanchez, "Automatic emotion recognition using prosodic parameters," in Proc. Interspeech, Lisbon, Portugal, Sep. 2005, pp. 493-496.
    • (2005) Proc. Interspeech , pp. 493-496
    • Luengo, I.1    Navas, E.2    Hernáez, I.3    Sanchez, J.4
  • 47
    • 34047248387
    • An objective and subjective study of the role of semantics in building corpora for TTS
    • Jul.
    • E. Navas, I. Hernáez, and I. Luengo, "An objective and subjective study of the role of semantics in building corpora for TTS," IEEE Trans. Speech Audio Process., vol.14, no.4, pp. 1117-1127, Jul. 2006.
    • (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.4 , pp. 1117-1127
    • Navas, E.1    Hernáez, I.2    Luengo, I.3


* This information was extracted through KISTI's analysis of Elsevier's SCOPUS database.