메뉴 건너뛰기




Volumn 15, Issue 2, 2012, Pages 131-150

Speaker-independent emotion recognition exploiting a psychologically- inspired binary cascade classification schema

Author keywords

Binary classification schema; Classifier comparison; Emotion recognition; Large scale feature extraction; Speaker independent protocol

Indexed keywords

CLASSIFICATION (OF INFORMATION); NEAREST NEIGHBOR SEARCH; PSYCHOLOGY COMPUTING; RADIAL BASIS FUNCTION NETWORKS; SUPPORT VECTOR MACHINES;

EID: 84864723353     PISSN: 13812416     EISSN: 15728110     Source Type: Journal    
DOI: 10.1007/s10772-012-9127-7     Document Type: Article
Times cited : (69)

References (67)
  • 1
    • 60249092335 scopus 로고    scopus 로고
    • Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection
    • Altun, H., & Polat, G. (2009). Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection. Expert Systems With Applications, 36(4), 8197-8203.
    • (2009) Expert Systems With Applications , vol.36 , Issue.4 , pp. 8197-8203
    • Altun, H.1    Polat, G.2
  • 5
    • 77956401353 scopus 로고    scopus 로고
    • Class-level spectral features for emotion recognition
    • Bitouk, D., Verma, R., & Nenkova, A. (2010). Class-level spectral features for emotion recognition. Speech Communication, 52(7-8), 613-625.
    • (2010) Speech Communication , vol.52 , Issue.7-8 , pp. 613-625
    • Bitouk, D.1    Verma, R.2    Nenkova, A.3
  • 6
    • 0001835850 scopus 로고
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • Boersma, P. (1993). Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. In Proc. institute of phonetic sciences (Vol. 17, pp. 97-110).
    • (1993) Proc. institute of phonetic sciences , vol.17 , pp. 97-110
    • Boersma, P.1
  • 10
    • 65249116503 scopus 로고    scopus 로고
    • Analysis of emotionally salient aspects of fundamental frequency for emotion detection
    • Busso, C., Lee, S., & Narayanan, S. (2009). Analysis of emotionally salient aspects of fundamental frequency for emotion detection. IEEE Transactions on Speech and Audio Processing, 17(4), 582-596.
    • (2009) IEEE Transactions on Speech and Audio Processing , vol.17 , Issue.4 , pp. 582-596
    • Busso, C.1    Lee, S.2    Narayanan, S.3
  • 11
    • 79953822842 scopus 로고    scopus 로고
    • Affect detection: An interdisciplinary review of models, methods, and their applications
    • Calvo, R. A., & D'Mello, S. (2011). Affect detection: An interdisciplinary review of models, methods, and their applications. IEEE Transactions on Affective Computing, 1(1), 18-37.
    • (2011) IEEE Transactions on Affective Computing , vol.1 , Issue.1 , pp. 18-37
    • Calvo, R.A.1    D'Mello, S.2
  • 12
    • 61549105958 scopus 로고    scopus 로고
    • Support vector machines employing cross-correlation for emotional speech recognition
    • Chandaka, S., Chatterjee, A., & Munshi, S. (2009). Support vector machines employing cross-correlation for emotional speech recognition. Measurement, 42(4), 611-618.
    • (2009) Measurement , vol.42 , Issue.4 , pp. 611-618
    • Chandaka, S.1    Chatterjee, A.2    Munshi, S.3
  • 16
    • 0002255015 scopus 로고    scopus 로고
    • Facial expression in affective disorders
    • What the face reveals, London: Oxford Press. Chap. 15
    • Ekman, P., Matsumoto, D., & Friesen, W. (2005). Facial expression in affective disorders. In Series in affective science. What the face reveals (pp. 331-342). London: Oxford Press. Chap. 15.
    • (2005) Series in affective science , pp. 331-342
    • Ekman, P.1    Matsumoto, D.2    Friesen, W.3
  • 17
    • 78649328053 scopus 로고    scopus 로고
    • Survey on speech emotion recognition: Features, classification schemes, and databases
    • El Ayadi, M., Kamel, M. S., & Karray, F. (2011). Survey on speech emotion recognition: Features, classification schemes, and databases. Pattern Recognition, 44(3), 572-587.
    • (2011) Pattern Recognition , vol.44 , Issue.3 , pp. 572-587
    • El Ayadi, M.1    Kamel, M.S.2    Karray, F.3
  • 22
    • 33745561205 scopus 로고    scopus 로고
    • An introduction to variable and feature selection
    • Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3(7-8), 1157-1182.
    • (2003) Journal of Machine Learning Research , vol.3 , Issue.7-8 , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 25
    • 70449585153 scopus 로고    scopus 로고
    • Statistical evaluation of speech features for emotion recognition
    • Colmar, France, July 2009
    • Iliou, T., & Anagnostopoulos, C. (2009). Statistical evaluation of speech features for emotion recognition. In Proc. 4th int. conf. digital telecommunications, Colmar, France, July 2009 (pp. 121-126).
    • (2009) Proc. 4th int. conf. digital telecommunications , pp. 121-126
    • Iliou, T.1    Anagnostopoulos, C.2
  • 26
    • 33644609617 scopus 로고    scopus 로고
    • Emotive alert: HMM-based emotion detection in voicemail messages
    • San Diego, USA, January 2005
    • Inanoglu, Z., & Caneel, R. (2005). Emotive alert: HMM-based emotion detection in voicemail messages. In Proc. 10th int. conf. intelligent user interfaces, San Diego, USA, January 2005 (pp. 251-253).
    • (2005) Proc. 10th int. conf. intelligent user interfaces , pp. 251-253
    • Inanoglu, Z.1    Caneel, R.2
  • 28
    • 0141764789 scopus 로고    scopus 로고
    • Communication of Emotions in Vocal Expression and Music Performance: Different Channels, Same Code?
    • DOI 10.1037/0033-2909.129.5.770
    • Juslin, P. N., & Laukka, P. (2003). Communication of emotions in vocal expression and music performance: Different channels, same code? Psychological Bulletin, 129(5), 770-814. (Pubitemid 37394950)
    • (2003) Psychological Bulletin , vol.129 , Issue.5 , pp. 770-814
    • Juslin, P.N.1    Laukka, P.2
  • 31
    • 77957969670 scopus 로고    scopus 로고
    • Gender classification in two emotional speech databases
    • Tampa, USA, December 2008
    • Kotti, M., & Kotropoulos, C. (2008). Gender classification in two emotional speech databases. In Proc. 19th int. conf. pattern recognition, Tampa, USA, December 2008 (pp. 1-4).
    • (2008) Proc. 19th int. conf. pattern recognition , pp. 1-4
    • Kotti, M.1    Kotropoulos, C.2
  • 33
    • 14644439843 scopus 로고    scopus 로고
    • Toward detecting emotions in spoken dialogs
    • DOI 10.1109/TSA.2004.838534
    • Lee, C. M., & Narayanan, S. (2005). Towards detecting emotions in spoken dialogs. IEEE Transactions on Speech and Audio Processing, 13(12), 293-303. (Pubitemid 40320247)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.2 , pp. 293-303
    • Lee, C.M.1    Narayanan, S.S.2
  • 37
    • 63649163674 scopus 로고    scopus 로고
    • Variational Gaussian mixture models for speech emotion recognition
    • Kolkata, India, February 2009
    • Mishra, H. K., & Sekhar, C. C. (2009). Variational Gaussian mixture models for speech emotion recognition. In Proc. 7th int. conf. advances in pattern recognition, Kolkata, India, February 2009 (pp. 183-186).
    • (2009) Proc. 7th int. conf. advances in pattern recognition , pp. 183-186
    • Mishra, H.K.1    Sekhar, C.C.2
  • 38
    • 0033692964 scopus 로고    scopus 로고
    • A novel approach to the fully automatic extraction of Fujisaki model parameters
    • June 2000
    • Mixdorff, H. (2000). A novel approach to the fully automatic extraction of Fujisaki model parameters. In Proc. IEEE int. conf. acoustics, speech, and signal processing, June 2000 (pp. 1281-1284).
    • (2000) Proc. IEEE int. conf. acoustics, speech, and signal processing , pp. 1281-1284
    • Mixdorff, H.1
  • 39
    • 0027447292 scopus 로고
    • Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
    • DOI 10.1121/1.405558
    • Murray, I. R., & Arnott, J. L. (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. The Journal of the Acoustical Society of America, 93(2), 1097-1108. (Pubitemid 23059837)
    • (1993) Journal of the Acoustical Society of America , vol.93 , Issue.2 , pp. 1097-1108
    • Murray, I.R.1    Arnott, J.L.2
  • 42
    • 2942590310 scopus 로고    scopus 로고
    • Toward an affect-sensitive multimodal human-computer interaction
    • DOI 10.1109/JPROC.2003.817122, Human-Computer Multimodal Interface
    • Pantic, M., & Rothkrantz, L. J. M. (2003). Toward an affect-sensitive multimodal human-computer interaction. Proceedings of the IEEE, 91(9), 1370-1390. (Pubitemid 40890819)
    • (2003) Proceedings of the IEEE , vol.91 , Issue.9 , pp. 1370-1390
    • Pantic, M.1    Rothkrantz, L.J.M.2
  • 44
    • 34047207805 scopus 로고    scopus 로고
    • Mandarin emotional speech recognition based on SVM and NN
    • Hong Kong, Hong Kong, August 2006
    • Pao, T. L., Chen, Y. T., Yeh, J. H., & Li, P. J. (2006). Mandarin emotional speech recognition based on SVM and NN. In Proc. 18th int. conf. pattern recognition, Hong Kong, Hong Kong, August 2006 (pp. 1096-1100).
    • (2006) Proc. 18th int. conf. pattern recognition , pp. 1096-1100
    • Pao, T.L.1    Chen, Y.T.2    Yeh, J.H.3    Li, P.J.4
  • 47
    • 84879797312 scopus 로고    scopus 로고
    • Speech emotion recognition approaches in human computer interaction
    • doi:10.1007/s11235-011-9624-z
    • Ramakrishnan, S., & El Emary, I. (2011). Speech emotion recognition approaches in human computer interaction. Telecommunication Systems, 1-12. doi:10.1007/s11235-011-9624-z.
    • (2011) Telecommunication Systems , pp. 1-12
    • Ramakrishnan, S.1    El Emary, I.2
  • 48
    • 77955560086 scopus 로고    scopus 로고
    • A learning approach to hierarchical feature selection and aggregation for audio classification
    • Ruvolo, P., Fasel, I., & Movellan, J. R. (2010). A learning approach to hierarchical feature selection and aggregation for audio classification. Pattern Recognition Letters, 31(12), 1535-1542.
    • (2010) Pattern Recognition Letters , vol.31 , Issue.12 , pp. 1535-1542
    • Ruvolo, P.1    Fasel, I.2    Movellan, J.R.3
  • 49
    • 63649147868 scopus 로고    scopus 로고
    • Emotion recognition using mel-frequency cepstral coefficients
    • Sato, N., & Obuchi, Y. (2007). Emotion recognition using mel-frequency cepstral coefficients. Journal ofNatural Language Processing, 14(4), 83-96.
    • (2007) Journal ofNatural Language Processing , vol.14 , Issue.4 , pp. 83-96
    • Sato, N.1    Obuchi, Y.2
  • 50
    • 0037384712 scopus 로고    scopus 로고
    • Vocal communication of emotion: A review of research paradigms
    • Scherer, K. R. (2003). Vocal communication of emotion: A review of research paradigms. Speech Communication, 40(1-2), 227-256.
    • (2003) Speech Communication , vol.40 , Issue.1-2 , pp. 227-256
    • Scherer, K.R.1
  • 51
    • 33750541433 scopus 로고    scopus 로고
    • Speaker independent speech emotion recognition by ensemble classification
    • DOI 10.1109/ICME.2005.1521560, 1521560, IEEE International Conference on Multimedia and Expo, ICME 2005
    • Schuller, B., Reiter, S., Muller, R., Al-Hames, M., Lang, M., & Rigoll, G. (2005a). Speaker independent speech emotion recognition by ensemble classification. In Proc. IEEE int. conf. multimedia and expo, Amsterdam, The Netherlands, July 2005 (pp. 864-867). (Pubitemid 44669004)
    • (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 864-867
    • Schuller, B.1    Reiter, S.2    Muller, R.3    Al-Hames, M.4    Lang, M.5    Rigoll, G.6
  • 60
    • 33750552511 scopus 로고    scopus 로고
    • Emotional speech classification using Gaussian mixture models and the sequential floating forward selection algorithm
    • DOI 10.1109/ICME.2005.1521717, 1521717, IEEE International Conference on Multimedia and Expo, ICME 2005
    • Ververidis, D., & Kotropoulos, C. (2005). Emotional speech classification using Gaussian mixture models and the sequential floating forward selection algorithm. In Proceedings of IEEE int. conf. multimedia and expo, Los Alamitos, USA, July 2005 (pp. 1500-1503). (Pubitemid 44669161)
    • (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 1500-1503
    • Ververidis, D.1    Kotropoulos, C.2
  • 61
    • 70350619300 scopus 로고    scopus 로고
    • Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collections
    • Florence, Italy, September 2006
    • Ververidis, D., & Kotropoulos, C. (2006). Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collections. In Proc. 14th European signal processing conference, Florence, Italy, September 2006.
    • (2006) Proc. 14th European signal processing conference
    • Ververidis, D.1    Kotropoulos, C.2
  • 63
    • 84864665428 scopus 로고    scopus 로고
    • Tech. Rep., Cambridge University, Cavendish Lab
    • Wallach, H. (2006). Evaluation metrics for hard classifiers (Tech. Rep.). Cambridge University, Cavendish Lab. URL www. inference.phy.cam.ac.uk/hmw26/ papers/evaluation.ps
    • (2006) Evaluation metrics for hard classifiers
    • Wallach, H.1
  • 66
    • 57149131874 scopus 로고    scopus 로고
    • A survey of affect recognition methods: Audio, visual and spontaneous expressions
    • Nagoya, Japan, November 2007
    • Zeng, Z., Pantic, M., Roisman, G. I., & Huang, T. S. (2007). A survey of affect recognition methods: Audio, visual and spontaneous expressions. In Proc. 9th int. conf. multimodal interfaces, Nagoya, Japan, November 2007 (pp. 126-133).
    • (2007) Proc. 9th int. conf. multimodal interfaces , pp. 126-133
    • Zeng, Z.1    Pantic, M.2    Roisman, G.I.3    Huang, T.S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.