SCOPUS Information Retrieval Platform

International Journal of Speech Technology

Volume 15, Issue 2, 2012, Pages 131-150

Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema

Author keywords

Binary classification schema; Classifier comparison; Emotion recognition; Large scale feature extraction; Speaker independent protocol

Indexed keywords

CLASSIFICATION (OF INFORMATION); NEAREST NEIGHBOR SEARCH; PSYCHOLOGY COMPUTING; RADIAL BASIS FUNCTION NETWORKS; SUPPORT VECTOR MACHINES;

BINARY CLASSIFICATION; EMOTION RECOGNITION; EMOTIONAL SPEECH DATABASE; GAUSSIAN RADIAL BASIS FUNCTIONS; K-NEAREST NEIGHBORHOODS; SPEAKER INDEPENDENTS; SPEECH EMOTION RECOGNITION; STATE-OF-THE-ART APPROACH;

SPEECH RECOGNITION;

EID: 84864723353 PISSN: 13812416 EISSN: 15728110 Source Type: Journal
DOI: 10.1007/s10772-012-9127-7 Document Type: Article

Times cited : (69)

References (67)

1
- 60249092335
- Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection
- Altun, H., & Polat, G. (2009). Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection. Expert Systems With Applications, 36(4), 8197-8203.
- (2009) Expert Systems With Applications , vol.36 , Issue.4 , pp. 8197-8203
- Altun, H.¹ Polat, G.²

2
- 50249181525
- Prosody based emotion recognition for MEXI
- Canada, August
- Austermann, A., Esau, N., Kleinjohann, L., & Kleinjohann, B. (2005). Prosody based emotion recognition for MEXI. In Proc. IEEE/RSJ int. conf. intelligent robots and systems, Edmonton, Canada, August 2005 (pp. 201-208).
- (2005) Proc. IEEE/RSJ int. conf. intelligent robots and systems, Edmonton , pp. 201-208
- Austermann, A.¹ Esau, N.² Kleinjohann, L.³ Kleinjohann, B.⁴

3
- 77956280275
- Non-negative tensor factorization applied to music genre classification
- Benetos, E., & Kotropoulos, C. (2010). Non-negative tensor factorization applied to music genre classification. IEEE Transactions on Audio, Speech, and Language Processing, 18(8), 1955-1967.
- (2010) IEEE Transactions on Audio, Speech, and Language Processing , vol.18 , Issue.8 , pp. 1955-1967
- Benetos, E.¹ Kotropoulos, C.²

4
- 84905169221
- Large scale musical instrument identification
- Greece, July 2007
- Benetos, E., Kotti, M., & Kotropoulos, C. (2007). Large scale musical instrument identification. In Proc. 4th sound and music computing conference, Lefkada, Greece, July 2007 (pp. 283-286).
- (2007) Proc. 4th sound and music computing conference, Lefkada , pp. 283-286
- Benetos, E.¹ Kotti, M.² Kotropoulos, C.³

5
- 77956401353
- Class-level spectral features for emotion recognition
- Bitouk, D., Verma, R., & Nenkova, A. (2010). Class-level spectral features for emotion recognition. Speech Communication, 52(7-8), 613-625.
- (2010) Speech Communication , vol.52 , Issue.7-8 , pp. 613-625
- Bitouk, D.¹ Verma, R.² Nenkova, A.³

6
- 0001835850
- Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
- Boersma, P. (1993). Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. In Proc. institute of phonetic sciences (Vol. 17, pp. 97-110).
- (1993) Proc. institute of phonetic sciences , vol.17 , pp. 97-110
- Boersma, P.¹

7
- 9444223925
- Exploiting emotions to disambiguate dialogue acts
- IUI 04: 2004 International Conference on Intelligent User Interfaces
- Bosma, W., & André, E. (2004). Exploiting emotions to disambiguate dialogue acts. In Proc. 9th int. conf. intelligent user interfaces, Funchal, Portugal, January 2004 (pp. 85-92). (Pubitemid 40673706)
- (2004) International Conference on Intelligent User Interfaces, Proceedings IUI , pp. 85-92
- Bosma, W.¹ Andre, E.²

8
- 33745202280
- A database of German emotional speech
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W., & Weiss, B. (2005). A database of German emotional speech. In Proc. 9th European conf. speech communication and technology, Lisbon, Portugal, September 2005 (pp. 1517-1520). (Pubitemid 43908362)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 1517-1520
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.⁴ Weiss, B.⁵

9
- 70349217076
- Detecting anger in automated voice portal dialogs
- Pittsburgh, USA, September 2006
- Burkhardt, F., Ajmera, J., Englert, R., Stegmann, J., & Burleson, W. (2006). Detecting anger in automated voice portal dialogs. In Proc. 9th int. conf. spoken language processing, Pittsburgh, USA, September 2006 (pp. 1-4).
- (2006) Proc. 9th int. conf. spoken language processing , pp. 1-4
- Burkhardt, F.¹ Ajmera, J.² Englert, R.³ Stegmann, J.⁴ Burleson, W.⁵

10
- 65249116503
- Analysis of emotionally salient aspects of fundamental frequency for emotion detection
- Busso, C., Lee, S., & Narayanan, S. (2009). Analysis of emotionally salient aspects of fundamental frequency for emotion detection. IEEE Transactions on Speech and Audio Processing, 17(4), 582-596.
- (2009) IEEE Transactions on Speech and Audio Processing , vol.17 , Issue.4 , pp. 582-596
- Busso, C.¹ Lee, S.² Narayanan, S.³

11
- 79953822842
- Affect detection: An interdisciplinary review of models, methods, and their applications
- Calvo, R. A., & D'Mello, S. (2011). Affect detection: An interdisciplinary review of models, methods, and their applications. IEEE Transactions on Affective Computing, 1(1), 18-37.
- (2011) IEEE Transactions on Affective Computing , vol.1 , Issue.1 , pp. 18-37
- Calvo, R.A.¹ D'Mello, S.²

12
- 61549105958
- Support vector machines employing cross-correlation for emotional speech recognition
- Chandaka, S., Chatterjee, A., & Munshi, S. (2009). Support vector machines employing cross-correlation for emotional speech recognition. Measurement, 42(4), 611-618.
- (2009) Measurement , vol.42 , Issue.4 , pp. 611-618
- Chandaka, S.¹ Chatterjee, A.² Munshi, S.³

13
- 85032751766
- Emotion recognition in human-computer interaction
- DOI 10.1109/79.911197
- Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., & Taylor, J. G. (2001). Emotion recognition in human-computer interaction. IEEE Signal Processing Magazine, 18(1), 32-80. (Pubitemid 32287669)
- (2001) IEEE Signal Processing Magazine , vol.18 , Issue.1 , pp. 32-80
- Cowie, R.¹ Douglas-Cowie, E.² Tsapatsoulis, N.³ Votsis, G.⁴ Kollias, S.⁵ Fellenz, W.⁶ Taylor, J.G.⁷

14
- 70349182284
- Comparing emotions using acoustics and human perceptual dimensions
- Boston, USA, April 2009
- Dai, K., Fell, H., & MacAuslan, J. (2009). Comparing emotions using acoustics and human perceptual dimensions. In Proc. 27th int. conf. extended abstracts on human factors in computing systems, Boston, USA, April 2009 (pp. 3341-3346).
- (2009) Proc. 27th int. conf. extended abstracts on human factors in computing systems , pp. 3341-3346
- Dai, K.¹ Fell, H.² MacAuslan, J.³

15
- 0003895612
- New York: Oxford University Press
- Ekman, P., & Davidson, R. (1994). The nature of emotion: Fundamental questions. New York: Oxford University Press.
- (1994) The nature of emotion: Fundamental questions
- Ekman, P.¹ Davidson, R.²

16
- 0002255015
- Facial expression in affective disorders
- What the face reveals, London: Oxford Press. Chap. 15
- Ekman, P., Matsumoto, D., & Friesen, W. (2005). Facial expression in affective disorders. In Series in affective science. What the face reveals (pp. 331-342). London: Oxford Press. Chap. 15.
- (2005) Series in affective science , pp. 331-342
- Ekman, P.¹ Matsumoto, D.² Friesen, W.³

17
- 78649328053
- Survey on speech emotion recognition: Features, classification schemes, and databases
- El Ayadi, M., Kamel, M. S., & Karray, F. (2011). Survey on speech emotion recognition: Features, classification schemes, and databases. Pattern Recognition, 44(3), 572-587.
- (2011) Pattern Recognition , vol.44 , Issue.3 , pp. 572-587
- El Ayadi, M.¹ Kamel, M.S.² Karray, F.³

18
- 36049044257
- Online web resource
- Ellis, D. P. W. (2005). PLP and RASTA (and MFCC, and inversion) in Matlab. URL http://www.ee.columbia.edu/~dpwe/resources/matlab/rastamat/. Online web resource.
- (2005) PLP and RASTA (and MFCC, and inversion) in Matlab
- Ellis, D.P.W.¹

19
- 84908320056
- Detection of negative emotional state in speech with anfis and genetic algorithms
- Florence, Italy, December 2009
- Espinosa, H. P., & Reyes-García, C. (2009). Detection of negative emotional state in speech with anfis and genetic algorithms. In Proc. 6th int. workshop models and analysis of vocal emissions for biomedical applications, Florence, Italy, December 2009 (pp. 24-28).
- (2009) Proc. 6th int. workshop models and analysis of vocal emissions for biomedical applications , pp. 24-28
- Espinosa, H.P.¹ Reyes-García, C.²

20
- 70350235187
- Audio-based emotion recognition in judicial domain: A multilayer support vector machines approach
- Leipzig, Germany, July 2009
- Fersini, E., Messina, E., Arosio, G., & Archetti, F. (2009). Audio-based emotion recognition in judicial domain: A multilayer support vector machines approach. In Proc. 6th int. conf. machine learning and data mining in pattern recognition, Leipzig, Germany, July 2009 (pp. 594-602).
- (2009) Proc. 6th int. conf. machine learning and data mining in pattern recognition , pp. 594-602
- Fersini, E.¹ Messina, E.² Arosio, G.³ Archetti, F.⁴

21
- 79958702587
- Emotion representation, analysis and synthesis in continuous space: A survey
- Santa Barbara, USA, March 2011
- Gunes, H., Schuller, B., Pantic, M., & Cowie, R. (2011). Emotion representation, analysis and synthesis in continuous space: A survey. In Proc. of IEEE int. conf. automatic face and gesture recognition, Santa Barbara, USA, March 2011 (pp. 827-834).
- (2011) Proc. of IEEE int. conf. automatic face and gesture recognition , pp. 827-834
- Gunes, H.¹ Schuller, B.² Pantic, M.³ Cowie, R.⁴

22
- 33745561205
- An introduction to variable and feature selection
- Guyon, I., & Elisseeff, A. (2003). An introduction to variable and feature selection. Journal of Machine Learning Research, 3(7-8), 1157-1182.
- (2003) Journal of Machine Learning Research , vol.3 , Issue.7-8 , pp. 1157-1182
- Guyon, I.¹ Elisseeff, A.²

23
- 0031676721
- What size test set gives good error rate estimates?
- Guyon, I., Makhoul, J., Schwartz, R., & Vapnik, V. (1998). What size test set gives good error rate estimates? IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(1), 52-64. (Pubitemid 128741307)
- (1998) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.20 , Issue.1 , pp. 52-64
- Guyon, I.¹ Makhoul, J.² Schwartz, R.³ Vapnik, V.⁴

24
- 33745205327
- Distinguishing deceptive from non-deceptive speech
- 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
- Hirschberg, J., Benus, S., Brenier, J. M., Enos, F., & Friedman, S. (2005). Distinguishing deceptive from non-deceptive speech. In Proc. 9th European conf. speech communication and technology, Lisbon, Portugal, September 2005 (pp. 1833-1836). (Pubitemid 43908441)
- (2005) 9th European Conference on Speech Communication and Technology , pp. 1833-1836
- Hirschberg, J.¹ Benus, S.² Brenier, J.M.³ Enos, F.⁴ Friedman, S.⁵ Gilman, S.⁶ Girand, C.⁷ Graciarena, M.⁸ Kathol, A.⁹ Michaelis, L.¹⁰ Pellom, B.¹¹ Shriberg, E.¹² Stolcke, A.¹³

25
- 70449585153
- Statistical evaluation of speech features for emotion recognition
- Colmar, France, July 2009
- Iliou, T., & Anagnostopoulos, C. (2009). Statistical evaluation of speech features for emotion recognition. In Proc. 4th int. conf. digital telecommunications, Colmar, France, July 2009 (pp. 121-126).
- (2009) Proc. 4th int. conf. digital telecommunications , pp. 121-126
- Iliou, T.¹ Anagnostopoulos, C.²

26
- 33644609617
- Emotive alert: HMM-based emotion detection in voicemail messages
- San Diego, USA, January 2005
- Inanoglu, Z., & Caneel, R. (2005). Emotive alert: HMM-based emotion detection in voicemail messages. In Proc. 10th int. conf. intelligent user interfaces, San Diego, USA, January 2005 (pp. 251-253).
- (2005) Proc. 10th int. conf. intelligent user interfaces , pp. 251-253
- Inanoglu, Z.¹ Caneel, R.²

27
- 0003597650
- 2nd ed., New York: Kluwer Academic
- Jackson, L. B. (1989). Digital filters and signal processing (2nd ed.). New York: Kluwer Academic.
- (1989) Digital filters and signal processing
- Jackson, L.B.¹

28
- 0141764789
- Communication of Emotions in Vocal Expression and Music Performance: Different Channels, Same Code?
- DOI 10.1037/0033-2909.129.5.770
- Juslin, P. N., & Laukka, P. (2003). Communication of emotions in vocal expression and music performance: Different channels, same code? Psychological Bulletin, 129(5), 770-814. (Pubitemid 37394950)
- (2003) Psychological Bulletin , vol.129 , Issue.5 , pp. 770-814
- Juslin, P.N.¹ Laukka, P.²

29
- 70450225593
- Using affective avatars and rich multimedia content for education of children with autism
- Corfu, Greece, June 2009
- Konstantinidis, E. I., Hitoglou-Antoniadou, M., Luneski, A., Bamidis, P. D., & Nikolaidou, M. M. (2009). Using affective avatars and rich multimedia content for education of children with autism. In Proc. 2nd int. conf. pervasive technologies related to assistive environments, Corfu, Greece, June 2009 (pp. 1-6).
- (2009) Proc. 2nd int. conf. pervasive technologies related to assistive environments , pp. 1-6
- Konstantinidis, E.I.¹ Hitoglou-Antoniadou, M.² Luneski, A.³ Bamidis, P.D.⁴ Nikolaidou, M.M.⁵

30
- 60349112485
- A speaker dependent emotion recognition framework
- Patras, Greece, July 2006
- Kostoulas, T. P., & Fakotakis, N. (2006). A speaker dependent emotion recognition framework. In Proc. 5th int. symposium communication systems, networks and digital signal processing, Patras, Greece, July 2006 (pp. 305-309).
- (2006) Proc. 5th int. symposium communication systems, networks and digital signal processing , pp. 305-309
- Kostoulas, T.P.¹ Fakotakis, N.²

31
- 77957969670
- Gender classification in two emotional speech databases
- Tampa, USA, December 2008
- Kotti, M., & Kotropoulos, C. (2008). Gender classification in two emotional speech databases. In Proc. 19th int. conf. pattern recognition, Tampa, USA, December 2008 (pp. 1-4).
- (2008) Proc. 19th int. conf. pattern recognition , pp. 1-4
- Kotti, M.¹ Kotropoulos, C.²

32
- 78349272093
- Speaker-independent negative emotion recognition
- Elba Island, Italy, June 2010
- Kotti, M., Paterno, F., & Kotropoulos, C. (2010). Speaker-independent negative emotion recognition. In Proc. 2nd int. workshop cognitive information processing, Elba Island, Italy, June 2010.
- (2010) Proc. 2nd int. workshop cognitive information processing
- Kotti, M.¹ Paterno, F.² Kotropoulos, C.³

33
- 14644439843
- Toward detecting emotions in spoken dialogs
- DOI 10.1109/TSA.2004.838534
- Lee, C. M., & Narayanan, S. (2005). Toward detecting emotions in spoken dialogs. IEEE Transactions on Speech and Audio Processing, 13(2), 293-303. (Pubitemid 40320247)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.2 , pp. 293-303
- Lee, C.M.¹ Narayanan, S.S.²

34
- 0004272772
- Cambridge: Cambridge University Press
- MacKay, D. J. C. (2003). Information theory, inference and learning algorithms. Cambridge: Cambridge University Press.
- (2003) Information theory, inference and learning algorithms
- MacKay, D.J.C.¹

35
- 0003874959
- New York: Springer
- Markel, J. D., & Gray, A. H. (1976). Linear prediction of speech. New York: Springer.
- (1976) Linear prediction of speech
- Markel, J.D.¹ Gray, A.H.²

36
- 67349221483
- Challenges in speech-based human-computer interfaces
- Minker, W., Pittermann, J., Pittermann, A., Straus, P. M., & Bühler, D. (2007). Challenges in speech-based human-computer interfaces. International Journal of Speech Technology, 10(2-3), 109-119.
- (2007) International Journal of Speech Technology , vol.10 , Issue.2-3 , pp. 109-119
- Minker, W.¹ Pittermann, J.² Pittermann, A.³ Straus, P.M.⁴ Bühler, D.⁵

37
- 63649163674
- Variational Gaussian mixture models for speech emotion recognition
- Kolkata, India, February 2009
- Mishra, H. K., & Sekhar, C. C. (2009). Variational Gaussian mixture models for speech emotion recognition. In Proc. 7th int. conf. advances in pattern recognition, Kolkata, India, February 2009 (pp. 183-186).
- (2009) Proc. 7th int. conf. advances in pattern recognition , pp. 183-186
- Mishra, H.K.¹ Sekhar, C.C.²

38
- 0033692964
- A novel approach to the fully automatic extraction of Fujisaki model parameters
- June 2000
- Mixdorff, H. (2000). A novel approach to the fully automatic extraction of Fujisaki model parameters. In Proc. IEEE int. conf. acoustics, speech, and signal processing, June 2000 (pp. 1281-1284).
- (2000) Proc. IEEE int. conf. acoustics, speech, and signal processing , pp. 1281-1284
- Mixdorff, H.¹

39
- 0027447292
- Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
- DOI 10.1121/1.405558
- Murray, I. R., & Arnott, J. L. (1993). Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. The Journal of the Acoustical Society of America, 93(2), 1097-1108. (Pubitemid 23059837)
- (1993) Journal of the Acoustical Society of America , vol.93 , Issue.2 , pp. 1097-1108
- Murray, I.R.¹ Arnott, J.L.²

40
- 84869128296
- Improving automotive safety by pairing driver emotion and car voice emotion
- Portland, OR, USA, April 2005
- Nass, C., Jonsson, I. M., Harris, H., Reaves, B., Endo, J., Brave, S., & Takayama, L. (2005). Improving automotive safety by pairing driver emotion and car voice emotion. In Proc. int. conf. human-computer interaction, extended abstracts on human factors in computing systems, Portland, OR, USA, April 2005 (pp. 1973-1976).
- (2005) Proc. int. conf. human-computer interaction, extended abstracts on human factors in computing systems , pp. 1973-1976
- Nass, C.¹ Jonsson, I.M.² Harris, H.³ Reaves, B.⁴ Endo, J.⁵ Brave, S.⁶ Takayama, L.⁷

41
- 70449366099
- An adaptive framework for acoustic monitoring of potential hazards
- doi: 10.1155/2009/594103
- Ntalampiras, S., Potamitis, I., & Fakotakis, N. (2009). An adaptive framework for acoustic monitoring of potential hazards. EURASIP Journal on Audio, Speech, and Music Processing. doi: 10.1155/2009/594103.
- (2009) EURASIP Journal on Audio, Speech, and Music Processing
- Ntalampiras, S.¹ Potamitis, I.² Fakotakis, N.³

42
- 2942590310
- Toward an affect-sensitive multimodal human-computer interaction
- DOI 10.1109/JPROC.2003.817122, Human-Computer Multimodal Interface
- Pantic, M., & Rothkrantz, L. J. M. (2003). Toward an affect-sensitive multimodal human-computer interaction. Proceedings of the IEEE, 91(9), 1370-1390. (Pubitemid 40890819)
- (2003) Proceedings of the IEEE , vol.91 , Issue.9 , pp. 1370-1390
- Pantic, M.¹ Rothkrantz, L.J.M.²

43
- 34547284980
- Human computing and machine understanding of human behavior: A survey
- DOI 10.1145/1180995.1181044, ICMI'06: 8th International Conference on Multimodal Interfaces, Conference Proceedings
- Pantic, M., Pentland, A., Nijholt, A., & Huang, T. (2006). Human computing and machine understanding of human behavior: A survey. In Proc. 8th int. conf. multimodal interfaces, Banff, Canada, November 2006 (pp. 239-248). (Pubitemid 47129475)
- (2006) ICMI'06: 8th International Conference on Multimodal Interfaces, Conference Proceedings , pp. 239-248
- Pantic, M.¹ Pentland, A.² Nijholt, A.³ Huang, T.⁴

44
- 34047207805
- Mandarin emotional speech recognition based on SVM and NN
- Hong Kong, Hong Kong, August 2006
- Pao, T. L., Chen, Y. T., Yeh, J. H., & Li, P. J. (2006). Mandarin emotional speech recognition based on SVM and NN. In Proc. 18th int. conf. pattern recognition, Hong Kong, Hong Kong, August 2006 (pp. 1096-1100).
- (2006) Proc. 18th int. conf. pattern recognition , pp. 1096-1100
- Pao, T.L.¹ Chen, Y.T.² Yeh, J.H.³ Li, P.J.⁴

45
- 0003959340
- Cambridge: MIT Press
- Picard, R. W. (1997). Affective computing. Cambridge: MIT Press.
- (1997) Affective computing
- Picard, R.W.¹

46
- 77950021520
- Emotion recognition and adaptation in spoken dialogue systems
- Pittermann, J., Pittermann, A., & Minker, W. (2010). Emotion recognition and adaptation in spoken dialogue systems. International Journal of Speech Technology, 13(1), 49-60.
- (2010) International Journal of Speech Technology , vol.13 , Issue.1 , pp. 49-60
- Pittermann, J.¹ Pittermann, A.² Minker, W.³

47
- 84879797312
- Speech emotion recognition approaches in human computer interaction
- doi:10.1007/s11235-011-9624-z
- Ramakrishnan, S., & El Emary, I. (2011). Speech emotion recognition approaches in human computer interaction. Telecommunication Systems, 1-12. doi:10.1007/s11235-011-9624-z.
- (2011) Telecommunication Systems , pp. 1-12
- Ramakrishnan, S.¹ El Emary, I.²

48
- 77955560086
- A learning approach to hierarchical feature selection and aggregation for audio classification
- Ruvolo, P., Fasel, I., & Movellan, J. R. (2010). A learning approach to hierarchical feature selection and aggregation for audio classification. Pattern Recognition Letters, 31(12), 1535-1542.
- (2010) Pattern Recognition Letters , vol.31 , Issue.12 , pp. 1535-1542
- Ruvolo, P.¹ Fasel, I.² Movellan, J.R.³

49
- 63649147868
- Emotion recognition using mel-frequency cepstral coefficients
- Sato, N., & Obuchi, Y. (2007). Emotion recognition using mel-frequency cepstral coefficients. Journal of Natural Language Processing, 14(4), 83-96.
- (2007) Journal of Natural Language Processing , vol.14 , Issue.4 , pp. 83-96
- Sato, N.¹ Obuchi, Y.²

50
- 0037384712
- Vocal communication of emotion: A review of research paradigms
- Scherer, K. R. (2003). Vocal communication of emotion: A review of research paradigms. Speech Communication, 40(1-2), 227-256.
- (2003) Speech Communication , vol.40 , Issue.1-2 , pp. 227-256
- Scherer, K.R.¹

51
- 33750541433
- Speaker independent speech emotion recognition by ensemble classification
- DOI 10.1109/ICME.2005.1521560, 1521560, IEEE International Conference on Multimedia and Expo, ICME 2005
- Schuller, B., Reiter, S., Muller, R., Al-Hames, M., Lang, M., & Rigoll, G. (2005a). Speaker independent speech emotion recognition by ensemble classification. In Proc. IEEE int. conf. multimedia and expo, Amsterdam, The Netherlands, July 2005 (pp. 864-867). (Pubitemid 44669004)
- (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 864-867
- Schuller, B.¹ Reiter, S.² Muller, R.³ Al-Hames, M.⁴ Lang, M.⁵ Rigoll, G.⁶

52
- 33646758175
- Metaclassifiers in acoustic and linguistic feature fusion-based affect recognition
- Philadelphia, USA, March 2005
- Schuller, B., Villar, R. J., Rigoll, G., & Lang, M. (2005b). Metaclassifiers in acoustic and linguistic feature fusion-based affect recognition. In Proc. IEEE int. conf. acoustics, speech, and signal processing, Philadelphia, USA, March 2005 (pp. 325-328).
- (2005) Proc. IEEE int. conf. acoustics, speech, and signal processing , pp. 325-328
- Schuller, B.¹ Villar, R.J.² Rigoll, G.³ Lang, M.⁴

53
- 48249092791
- Audiovisual recognition of spontaneous interest within conversations
- Nagoya, Japan, November 2007
- Schuller, B., Müller, R., Hörnler, B., Höthker, A., Konosu, H., & Rigoll, G. (2007). Audiovisual recognition of spontaneous interest within conversations. In Proceedings of 9th int. conf. multimodal interfaces, Nagoya, Japan, November 2007 (pp. 30-37).
- (2007) Proceedings of 9th int. conf. multimodal interfaces , pp. 30-37
- Schuller, B.¹ Müller, R.² Hörnler, B.³ Höthker, A.⁴ Konosu, H.⁵ Rigoll, G.⁶

54
- 52949090823
- Emotion sensitive speech control for human-robot interaction in minimal invasive surgery
- Munich, Germany, August 2008
- Schuller, B., Rigoll, G., Can, S., & Feussner, H. (2008). Emotion sensitive speech control for human-robot interaction in minimal invasive surgery. In Proc. 17th IEEE int. symposium robot and human interactive communication, Munich, Germany, August 2008 (pp. 453-458).
- (2008) Proc. 17th IEEE int. symposium robot and human interactive communication , pp. 453-458
- Schuller, B.¹ Rigoll, G.² Can, S.³ Feussner, H.⁴

55
- 70349292240
- Being bored? Recognising natural interest by extensive audiovisual integration for real-life application
- Schuller, B., Müller, R., Eyben, F., Gast, J., Hörnler, B., Wöllmer, M., Rigoll, G., Höthker, A., & Konosu, H. (2009a). Being bored? Recognising natural interest by extensive audiovisual integration for real-life application. Image and Vision Computing, 27(12), 1760-1774.
- (2009) Image and Vision Computing , vol.27 , Issue.12 , pp. 1760-1774
- Schuller, B.¹ Müller, R.² Eyben, F.³ Gast, J.⁴ Hörnler, B.⁵ Wöllmer, M.⁶ Rigoll, G.⁷ Höthker, A.⁸ Konosu, H.⁹

56
- 70450206416
- The INTERSPEECH 2009 emotion challenge
- Brighton, UK, September 2009
- Schuller, B., Steidl, S., & Batliner, A. (2009b). The INTERSPEECH 2009 emotion challenge. In Proc. 10th annual int. conf. speech communication association, Brighton, UK, September 2009 (pp. 312-315).
- (2009) Proc. 10th annual int. conf. speech communication association , pp. 312-315
- Schuller, B.¹ Steidl, S.² Batliner, A.³

57
- 0000618817
- New methods of pitch extraction
- Sondhi, M. M. (1968). New methods of pitch extraction. IEEE Transactions on Audio and Electroacoustics, 16(2), 262-266.
- (1968) IEEE Transactions on Audio and Electroacoustics , vol.16 , Issue.2 , pp. 262-266
- Sondhi, M.M.¹

58
- 85009159448
- Emotional space improves emotion recognition
- September 2002
- Tato, R., Santos, R., Kompe, R., & Pardo, J. M. (2002). Emotional space improves emotion recognition. In Proc. 7th int. conf. spoken language processing, September 2002 (pp. 2029-2032).
- (2002) Proc. 7th int. conf. spoken language processing , pp. 2029-2032
- Tato, R.¹ Santos, R.² Kompe, R.³ Pardo, J.M.⁴

59
- 84864675690
- Evaluation of a pitch estimation algorithm for speech emotion recognition
- Florence, Italy, December 2009
- Vanello, N., Martini, N., Milanesi, M., Keiser, H., Calisti, M., Bocchi, L., Manfredi, C., & Landini, L. (2009). Evaluation of a pitch estimation algorithm for speech emotion recognition. In Proc. 6th int. workshop models and analysis of vocal emissions for biomedical applications, Florence, Italy, December 2009 (pp. 29-32).
- (2009) Proc. 6th int. workshop models and analysis of vocal emissions for biomedical applications , pp. 29-32
- Vanello, N.¹ Martini, N.² Milanesi, M.³ Keiser, H.⁴ Calisti, M.⁵ Bocchi, L.⁶ Manfredi, C.⁷ Landini, L.⁸

60
- 33750552511
- Emotional speech classification using Gaussian mixture models and the sequential floating forward selection algorithm
- DOI 10.1109/ICME.2005.1521717, 1521717, IEEE International Conference on Multimedia and Expo, ICME 2005
- Ververidis, D., & Kotropoulos, C. (2005). Emotional speech classification using Gaussian mixture models and the sequential floating forward selection algorithm. In Proceedings of IEEE int. conf. multimedia and expo, Los Alamitos, USA, July 2005 (pp. 1500-1503). (Pubitemid 44669161)
- (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 1500-1503
- Ververidis, D.¹ Kotropoulos, C.²

61
- 70350619300
- Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collections
- Florence, Italy, September 2006
- Ververidis, D., & Kotropoulos, C. (2006). Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collections. In Proc. 14th European signal processing conference, Florence, Italy, September 2006.
- (2006) Proc. 14th European signal processing conference
- Ververidis, D.¹ Kotropoulos, C.²

62
- 48249117060
- EmoVoice - A framework for online recognition of emotions from voice
- Irsee, Germany, June 2008
- Vogt, T., André, E., & Bee, N. (2008). EmoVoice - A framework for online recognition of emotions from voice. In Proc. 4th IEEE tutorial and research workshop on perception and interactive technologies for speech-based systems, Irsee, Germany, June 2008 (pp. 188-199).
- (2008) Proc. 4th IEEE tutorial and research workshop on perception and interactive technologies for speech-based systems , pp. 188-199
- Vogt, T.¹ André, E.² Bee, N.³

63
- 84864665428
- Tech. Rep., Cambridge University, Cavendish Lab
- Wallach, H. (2006). Evaluation metrics for hard classifiers (Tech. Rep.). Cambridge University, Cavendish Lab. URL www.inference.phy.cam.ac.uk/hmw26/papers/evaluation.ps
- (2006) Evaluation metrics for hard classifiers
- Wallach, H.¹

64
- 0003410739
- New York: Guilford Press
- Watson, D. (2000). Mood and temperament. New York: Guilford Press.
- (2000) Mood and temperament
- Watson, D.¹

65
- 75249100219
- Emotion recognition from speech signals using new harmony features
- Yang, B., & Lugger, M. (2010). Emotion recognition from speech signals using new harmony features. Signal Processing, Special Section on Statistical Signal & Array Processing, 90(5), 1415-1423.
- (2010) Signal Processing, Special Section on Statistical Signal & Array Processing , vol.90 , Issue.5 , pp. 1415-1423
- Yang, B.¹ Lugger, M.²

66
- 57149131874
- A survey of affect recognition methods: Audio, visual and spontaneous expressions
- Nagoya, Japan, November 2007
- Zeng, Z., Pantic, M., Roisman, G. I., & Huang, T. S. (2007). A survey of affect recognition methods: Audio, visual and spontaneous expressions. In Proc. 9th int. conf. multimodal interfaces, Nagoya, Japan, November 2007 (pp. 126-133).
- (2007) Proc. 9th int. conf. multimodal interfaces , pp. 126-133
- Zeng, Z.¹ Pantic, M.² Roisman, G.I.³ Huang, T.S.⁴

67
- 33745806206
- Employing Fujisaki's intonation model parameters for emotion recognition
- DOI 10.1007/11752912-44, Advances in Artificial Intelligence - 4th Hellenic Conference on AI, SETN 2006, Proceedings
- Zervas, P., Mporas, I., Fakotakis, N., & Kokkinakis, G. K. (2006). Employing Fujisaki's intonation model parameters for emotion recognition. In Proc. 4th hellenic conf. artificial intelligence, Heraclion, Greece, May 2006 (pp. 443-453). (Pubitemid 44030004)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.3955 LNAI , pp. 443-453
- Zervas, P.¹ Mporas, I.² Fakotakis, N.³ Kokkinakis, G.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.