SCOPUS Information Search Platform

ACM Transactions on Interactive Intelligent Systems

Volume 2, Issue 1, 2012, Pages 1-29

A multitask approach to continuous five-dimensional affect sensing in natural speech

(3) Eyben, Florian ᵃ; Wöllmer, Martin ᵃ; Schuller, Björn ᵃ

ᵃ TECHNICAL UNIVERSITY OF MUNICH (Germany)

Author keywords

Audio features; Dimensional affect; Emotion recognition; Long short term memory; Neural networks; SEMAINE

Indexed keywords

BRAIN; CHEMICAL ACTIVATION; INTERACTIVE COMPUTER SYSTEMS; LEARNING SYSTEMS; LONG SHORT-TERM MEMORY; NEURAL NETWORKS; REAL TIME SYSTEMS; SUPPORT VECTOR REGRESSION;

AFFECT RECOGNITION; AFFECTIVE STATE; AUDIO FEATURES; CORRELATION COEFFICIENT; DIMENSIONAL AFFECT; EMOTION RECOGNITION; SEMAINE; TECHNICAL SYSTEMS;

FEEDFORWARD NEURAL NETWORKS;

EID: 84983561287 PISSN: 21606455 EISSN: 21606463 Source Type: Journal
DOI: 10.1145/2133366.2133372 Document Type: Article

Times cited : (50)

References (73)

1
- 78349274056
- Segmenting into adequate units for automatic recognition of emotion-related episodes: A speech-based approach
- article ID 782802
- BATLINER, A., SEPPI, D., STEIDL, S., AND SCHULLER, B. 2010. Segmenting into adequate units for automatic recognition of emotion-related episodes: A speech-based approach. Advan. Hum. Comput. Interact. article ID 782802..
- (2010) Advan. Hum. Comput. Interact
- Batliner, A.¹ Seppi, D.² Steidl, S.³ Schuller, B.⁴

2
- 77955412961
- Whodunnit - Searching for the most important feature types signalling emotion-related user states in speech
- BATLINER, A., STEIDL, S., SCHULLER, B., SEPPI, D., VOGT, T., WAGNER, J., DEVILLERS, L., VIDRASCU, L., KESSOUS, V. A. L, AND AMIR, N. 2011. Whodunnit - Searching for the most important feature types signalling emotion-related user states in speech. Comput. Speech Lang. 25, 1..
- (2011) Comput. Speech Lang. , vol.25 , pp. 1
- Batliner, A.¹ Steidl, S.² Schuller, B.³ Seppi, D.⁴ Vogt, T.⁵ Wagner, J.⁶ Devillers, L.⁷ Vidrascu, L.⁸ Kessous, V.A.L.⁹ Amir, N.¹⁰

3
- 33745202280
- A database of German emotional speech
- BURKHARDT, F., PAESCHKE, A., ROLFES, M., SENDLMEIER, W., AND WEISS, B. 2005. A database of German emotional speech. In Proceedings of the Interspeech Conference. 1517-1520..
- (2005) Proceedings of the Interspeech Conference , pp. 1517-1520
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.⁴ Weiss, B.⁵

4
- 65249111241
- Using neutral speech models for emotional speech analysis
- BUSSO, C., LEE, S., AND NARAYANAN, S. S. 2007. Using neutral speech models for emotional speech analysis. In Proceedings of the Interspeech Conference. 2225-2228..
- (2007) Proceedings of the Interspeech Conference , pp. 2225-2228
- Busso, C.¹ Lee, S.² Narayanan, S.S.³

5
- 34547153664
- Modeling naturalistic affective states via facial and vocal expressions recognition
- CARIDAKIS, G., MALATESTA, L., KESSOUS, L., AMIR, N., RAOUZAIOU, A., AND KARPOUZIS, K. 2006. Modeling naturalistic affective states via facial and vocal expressions recognition. In Proceedings of the ACM International Conference on Multimodal Interfaces. 146-154..
- (2006) Proceedings of the ACM International Conference on Multimodal Interfaces , pp. 146-154
- Caridakis, G.¹ Malatesta, L.² Kessous, L.³ Amir, N.⁴ Raouzaiou, A.⁵ Karpouzis, K.⁶

6
- 0003710380
- CHANG, C.-C. AND LIN, C.-J. 2001. LibSVM: A Library for Support Vector Machines. http://www.csie.ntu.edu.tw/cjlin/libsvm..
- (2001) LibSVM: A Library for Support Vector Machines
- Chang, C.-C.¹ Lin, C.-J.²

7
- 0003519438
- Lawrence Erlbaum Associates, Hillsdale, NJ
- COHEN, J., COHEN, P., WEST, S. G., AND AIKEN, L. S. 2003. Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences, 2nd ed. Lawrence Erlbaum Associates, Hillsdale, NJ..
- (2003) Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences, 2nd Ed.
- Cohen, J.¹ Cohen, P.² West, S.G.³ Aiken, L.S.⁴

8
- 0002896902
- Feeltrace: An instrument for recording perceived emotion in real time
- COWIE, R., DOUGLAS-COWIE, E., SAVVIDOU, S., MCMAHON, E., SAWEY, M., AND SCHRÖDER, M. 2000. Feeltrace: an instrument for recording perceived emotion in real time. In Proceedings of the ISCA Workshop on Speech and Emotion. 19-24..
- (2000) Proceedings of the ISCA Workshop on Speech and Emotion , pp. 19-24
- Cowie, R.¹ Douglas-Cowie, E.² Savvidou, S.³ McMahon, E.⁴ Sawey, M.⁵ Schröder, M.⁶

9
- 21544459345
- Challenges in real-life emotion annotation and machine learning based detection
- DEVILLERS, L., VIDRASCU, L., AND LAMEL, L. 2005. Challenges in real-life emotion annotation and machine learning based detection. Neural Netw. 18, 4, 407-422..
- (2005) Neural Netw. , vol.18 , Issue.4 , pp. 407-422
- Devillers, L.¹ Vidrascu, L.² Lamel, L.³

10
- 38049052968
- The HUMAINE database: Addressing the collection and annotation of naturalistic and induced emotional data
- Springer
- DOUGLAS-COWIE, E., COWIE, R., SNEDDON, I., COX, C., LOWRY, O., MCRORIE, M., MARTIN, J. C., DEVILLERS, L., ABRILIAN, S., BATLINER, A., AMIR, N., AND KARPOUZIS, K. 2007a. The HUMAINE database: Addressing the collection and annotation of naturalistic and induced emotional data. In Affective Computing and Intelligent Interaction. Springer, 488-500..
- (2007) Affective Computing and Intelligent Interaction , pp. 488-500
- Douglas-Cowie, E.¹ Cowie, R.² Sneddon, I.³ Cox, C.⁴ Lowry, O.⁵ McRorie, M.⁶ Martin, J.C.⁷ Devillers, L.⁸ Abrilian, S.⁹ Batliner, A.¹⁰ Amir, N.¹¹ Karpouzis, K.¹²

11
- 38049052968
- The humaine database: Addressing the collection and annotation of naturalistic and induced emotional data
- Springer
- DOUGLAS-COWIE, E., COWIE, R., SNEDDON, I., COX, C., O., L., MCRORIE, M., MARTIN, J., DEVILLERS, L., ABRILIAN, S., BATLINER, A., AMIR, N., AND KARPOUZIS, K. 2007b. The humaine database: Addressing the collection and annotation of naturalistic and induced emotional data. In Lecture Notes in Computer Science, Springer, vol. 4738, 488-501..
- (2007) Lecture Notes in Computer Science , vol.4738 , pp. 488-501
- Douglas-Cowie, E.¹ Cowie, R.² Sneddon, I.³ Cox, C.O.L.⁴ McRorie, M.⁵ Martin, J.⁶ Devillers, L.⁷ Abrilian, S.⁸ Batliner, A.⁹ Amir, N.¹⁰ Karpouzis, K.¹¹

12
- 0004079405
- Prentice Hall, Englewood Cliffs, NJ
- EKMAN, P. AND FRIESEN, W. V. 1975. Unmasking the Face: A Guide to Recognizing Emotions from Facial Expressions. Prentice Hall, Englewood Cliffs, NJ..
- (1975) Unmasking the Face: A Guide to Recognizing Emotions from Facial Expressions
- Ekman, P.¹ Friesen, W.V.²

13
- 77949304464
- On-line emotion recognition in a 3-d activation-valence-time continuum using acoustic and linguistic cues
- EYBEN, F., WÖLLMER, M., GRAVES, A., SCHULLER, B., DOUGLAS-COWIE, E., AND COWIE, R. 2010a. On-line emotion recognition in a 3-d activation-valence-time continuum using acoustic and linguistic cues. J. Multimodal User Interfaces 3, 1-2, 7-19..
- (2010) J. Multimodal User Interfaces , vol.3 , Issue.1-2 , pp. 7-19
- Eyben, F.¹ Wöllmer, M.² Graves, A.³ Schuller, B.⁴ Douglas-Cowie, E.⁵ Cowie, R.⁶

14
- 78650977476
- openSMILE - The Munich versatile and fast open-source audio feature extractor
- EYBEN, F., WÖLLMER, M., AND SCHULLER, B. 2010b. openSMILE - The Munich versatile and fast open-source audio feature extractor. In Proceedings of ACM Multimedia Conference. 1459-1462..
- (2010) Proceedings of ACM Multimedia Conference , pp. 1459-1462
- Eyben, F.¹ Wöllmer, M.² Schuller, B.³

15
- 79958694881
- String-based audiovisual fusion of behavioural events for the assessment of dimensional affect
- EYBEN, F., WÖLLMER, M., VALSTAR, M., GUNES, H., SCHULLER, B., AND PANTIC, M. 2011. String-based audiovisual fusion of behavioural events for the assessment of dimensional affect. In Proceedings FG Conference (to appear)..
- (2011) Proceedings FG Conference (To Appear)
- Eyben, F.¹ Wöllmer, M.² Valstar, M.³ Gunes, H.⁴ Schuller, B.⁵ Pantic, M.⁶

16
- 77949295239
- Tech. rep., IDSIA
- FERNANDEZ, S., GRAVES, A., AND SCHMIDHUBER, J. 2008. Phoneme recognition in TIMIT with BLSTM-CTC. Tech. rep., IDSIA..
- (2008) Phoneme Recognition in TIMIT with BLSTM-CTC
- Fernandez, S.¹ Graves, A.² Schmidhuber, J.³

17
- 36348934700
- The world of emotions is not two-dimensional
- FONTAINE, J. R. J., SCHERER, K. R., ROESCH, E. B., AND ELLSWORTH, P. C. 2007. The world of emotions is not two-dimensional. Psychol. Sci. 18, 2, 1050-1057..
- (2007) Psychol. Sci. , vol.18 , Issue.2 , pp. 1050-1057
- Fontaine, J.R.J.¹ Scherer, K.R.² Roesch, E.B.³ Ellsworth, P.C.⁴

18
- 21544458365
- Emotion recognition in human-computer interaction
- FRAGOPANAGOS, N. AND TAYLOR, J. G. 2005. Emotion recognition in human-computer interaction. Neural Netw. 18, 4, 389-405..
- (2005) Neural Netw. , vol.18 , Issue.4 , pp. 389-405
- Fragopanagos, N.¹ Taylor, J.G.²

19
- 51849085268
- Technique for automatic emotion recognition by body gesture analysis
- GLOWINSKI, D., CAMURRI, A., VOLPE, G., DAEL, N., AND SCHERER, K. 2008. Technique for automatic emotion recognition by body gesture analysis. In Proceedings of Computer Vision and Pattern Recognition Workshops. 1-6..
- (2008) Proceedings of Computer Vision and Pattern Recognition Workshops , pp. 1-6
- Glowinski, D.¹ Camurri, A.² Volpe, G.³ Dael, N.⁴ Scherer, K.⁵

20
- 27744588611
- Bidirectional LSTM networks for improved phoneme classification and recognition
- GRAVES, A., FERNANDEZ, S., AND SCHMIDHUBER, J. 2005. Bidirectional LSTM networks for improved phoneme classification and recognition. In Proceedings of ICANN. Vol. 18. 602-610..
- (2005) Proceedings of ICANN , vol.18 , pp. 602-610
- Graves, A.¹ Fernandez, S.² Schmidhuber, J.³

21
- 27744588611
- Framewise phoneme classification with bidirectional LSTM and other neural network architectures
- GRAVES, A. AND SCHMIDHUBER, J. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18, 5-6, 602-610..
- (2005) Neural Netw. , vol.18 , Issue.5-6 , pp. 602-610
- Graves, A.¹ Schmidhuber, J.²

22
- 34547518166
- Support vector regression for automatic recognition of spontaneous emotions in speech
- IEEE
- GRIMM, M., KROSCHEL, K., AND NARAYANAN, S. 2007a. Support vector regression for automatic recognition of spontaneous emotions in speech. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP). Vol. 4. IEEE..
- (2007) Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP) , vol.4
- Grimm, M.¹ Kroschel, K.² Narayanan, S.³

23
- 34547940048
- Primitives based estimation and evaluation of emotions in speech
- GRIMM, M., MOWER, E., KROSCHEL, K., AND NARAYANAN, S. 2007b. Primitives based estimation and evaluation of emotions in speech. Speech Comm. 49, 787-800..
- (2007) Speech Comm. , vol.49 , pp. 787-800
- Grimm, M.¹ Mower, E.² Kroschel, K.³ Narayanan, S.⁴

24
- 78049394179
- Automatic, dimensional and continuous emotion recognition
- GUNES, H. AND PANTIC, M. 2010a. Automatic, dimensional and continuous emotion recognition. Int. J. Synth. Emot. 1, 1, 68-99..
- (2010) Int. J. Synth. Emot. , vol.1 , Issue.1 , pp. 68-99
- Gunes, H.¹ Pantic, M.²

25
- 79958730446
- Automatic measurement of affect in dimensional and continuous spaces: Why, what, and how?
- GUNES, H. AND PANTIC, M. 2010b. Automatic measurement of affect in dimensional and continuous spaces: Why, what, and how? In Proceedings of the Conference on Measuring Behavior. 122-126..
- (2010) Proceedings of the Conference on Measuring Behavior , pp. 122-126
- Gunes, H.¹ Pantic, M.²

26
- 78049368043
- Dimensional emotion prediction from spontaneous head gestures for interaction with sensitive artificial listeners
- GUNES, H. AND PANTIC, M. 2010c. Dimensional emotion prediction from spontaneous head gestures for interaction with sensitive artificial listeners. In Proceedings of International Conference on Intelligent Virtual Agents. 371-377..
- (2010) Proceedings of International Conference on Intelligent Virtual Agents , pp. 371-377
- Gunes, H.¹ Pantic, M.²

27
- 0004060921
- Ph.D. thesis, University of Waikato, Hamilton, New Zealand
- HALL, M. A. 1998. Correlation-based feature subset selection for machine learning. Ph.D. thesis, University of Waikato, Hamilton, New Zealand..
- (1998) Correlation-based Feature Subset Selection for Machine Learning.
- Hall, M.A.¹

28
- 0041914606
- Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
- S. C. Kremer and J. F. Kolen, Eds., IEEE Press
- HOCHREITER, S., BENGIO, Y., FRASCONI, P., AND SCHMIDHUBER, J. 2001. Gradient flow in recurrent nets: the difficulty of learning long-term dependencies. In A Field Guide to Dynamical Recurrent Neural Networks, S. C. Kremer and J. F. Kolen, Eds., IEEE Press..
- (2001) A Field Guide to Dynamical Recurrent Neural Networks
- Hochreiter, S.¹ Bengio, Y.² Frasconi, P.³ Schmidhuber, J.⁴

29
- 0031573117
- Long short-term memory
- HOCHREITER, S. AND SCHMIDHUBER, J. 1997. Long short-term memory. Neural Comput. 9, 8, 1735-1780..
- (1997) Neural Comput. , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

30
- 21544480590
- Emotion recognition through facial expression analysis based on a neurofuzzy method
- IOANNOU, S., RAOUZAIOU, A., TZOUVARAS, V., MAILIS, T., KARPOUZIS, K., AND KOLLIAS, S. 2005. Emotion recognition through facial expression analysis based on a neurofuzzy method. J. Neural Netw. 18, 423-435..
- (2005) J. Neural Netw. , vol.18 , pp. 423-435
- Ioannou, S.¹ Raouzaiou, A.² Tzouvaras, V.³ Mailis, T.⁴ Karpouzis, K.⁵ Kollias, S.⁶

31
- 70450185596
- Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions
- LEE, C., BUSSO, C., LEE, S., AND NARAYANAN, S. 2009. Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions. In Proceedings of the Interspeech Conference. 1983-1986..
- (2009) Proceedings of the Interspeech Conference , pp. 1983-1986
- Lee, C.¹ Busso, C.² Lee, S.³ Narayanan, S.⁴

32
- 84983586176
- Cost-Effective solution to synchronised audio-visual data capture using multiple sensors
- LICHTENAUER, J., SHEN, J., VALSTAR, M., AND PANTIC, M. 2010. Cost-Effective solution to synchronised audio-visual data capture using multiple sensors. J. Vis. Comm. Image Represent., 1-39..
- (2010) J. Vis. Comm. Image Represent. , pp. 1-39
- Lichtenauer, J.¹ Shen, J.² Valstar, M.³ Pantic, M.⁴

33
- 78349272018
- The semaine corpus of emotionally coloured character interactions
- IEEE
- MCKEOWN, G., VALSTAR, M. F., PANTIC, M., AND COWIE, R. 2010. The semaine corpus of emotionally coloured character interactions. In Proceedings of the ICME Conference. IEEE, 1-6..
- (2010) Proceedings of the ICME Conference , pp. 1-6
- McKeown, G.¹ Valstar, M.F.² Pantic, M.³ Cowie, R.⁴

34
- 80051654269
- Tracking changes in continuous emotion states using body language and prosodic cues
- IEEE
- METALLINOU, A., KATSAMANIS, A., WANG, Y., AND NARAYANAN, S. S. 2011. Tracking changes in continuous emotion states using body language and prosodic cues. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing. IEEE, 2288-2291..
- (2011) Proceedings of the International Conference on Acoustics, Speech and Signal Processing , pp. 2288-2291
- Metallinou, A.¹ Katsamanis, A.² Wang, Y.³ Narayanan, S.S.⁴

35
- 85008006613
- A framework for automatic human emotion classification using emotional profiles
- MOWER, E., MATARIC, M. J., AND NARAYANAN, S. S. 2011. A framework for automatic human emotion classification using emotional profiles. IEEE Trans. Audio, Speech Lang. Process. 19, 5, 1057-1070..
- (2011) IEEE Trans. Audio, Speech Lang. Process. , vol.19 , Issue.5 , pp. 1057-1070
- Mower, E.¹ Mataric, M.J.² Narayanan, S.S.³

36
- 80051607532
- A hierarchical static-dynamic framework for emotion classification
- IEEE
- MOWER, E. AND NARAYANAN, S. S. 2011. A hierarchical static-dynamic framework for emotion classification. In Proceedings of International Conference on Acoustics, Speech and Signal Processing. IEEE, 2372-2375..
- (2011) Proceedings of International Conference on Acoustics, Speech and Signal Processing , pp. 2372-2375
- Mower, E.¹ Narayanan, S.S.²

37
- 78149479604
- Audio-visual classification and fusion of spontaneous affective data in likelihood space
- NICOLAOU, M., GUNES, H., AND PANTIC, M. 2010. Audio-visual classification and fusion of spontaneous affective data in likelihood space. In Proceedings of IEEE International Conference on Pattern Recognition. 3695-3699..
- (2010) Proceedings of IEEE International Conference on Pattern Recognition , pp. 3695-3699
- Nicolaou, M.¹ Gunes, H.² Pantic, M.³

38
- 0038548330
- The production and recognition of emotions in speech: Features and algorithms
- OUDEYER, P. Y. 2003. The production and recognition of emotions in speech: Features and algorithms. Int. J. Hum.-Comput. Studies 59, 157-183..
- (2003) Int. J. Hum.-Comput. Studies , vol.59 , pp. 157-183
- Oudeyer, P.Y.¹

39
- 0038764011
- Kalman filters improve LSTM network performance in problems unsolvable by traditional recurrent nets
- PÉREZ-ORTIZ, J. A., GERS, F. A., ECK, D., AND SCHMIDHUBER, J. 2003. Kalman filters improve LSTM network performance in problems unsolvable by traditional recurrent nets. Neural Netw. 16, 2, 241-250..
- (2003) Neural Netw. , vol.16 , Issue.2 , pp. 241-250
- Pérez-Ortiz, J.A.¹ Gers, F.A.² Eck, D.³ Schmidhuber, J.⁴

40
- 0036919726
- Synthetic vision and memory for autonomous virtual humans
- PETERS, C. AND O'SULLIVAN, C. 2002. Synthetic vision and memory for autonomous virtual humans. Comput. Graph. Forum 21, 4, 743-753..
- (2002) Comput. Graph. Forum , vol.21 , Issue.4 , pp. 743-753
- Peters, C.¹ O'Sullivan, C.²

41
- 84943274699
- A direct adaptive method for faster backpropagation learning: The rprop algorithm
- RIEDMILLER, M. AND BRAUN, H. 1993. A direct adaptive method for faster backpropagation learning: The rprop algorithm. In Proceedings of the IEEE International Conference on Neural Networks. 586-591..
- (1993) Proceedings of the IEEE International Conference on Neural Networks , pp. 586-591
- Riedmiller, M.¹ Braun, H.²

42
- 84555174290
- Prediction of time-varying musical mood distributions from audio
- SCHMIDT, E. M. AND KIM, Y. E. 2010. Prediction of time-varying musical mood distributions from audio. In Proceedings of the International Society for Music Information Retrieval Conference..
- (2010) Proceedings of the International Society for Music Information Retrieval Conference
- Schmidt, E.M.¹ Kim, Y.E.²

43
- 80052606383
- Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge
- to appear
- SCHULLER, B., BATLINER, A., STEIDL, S., AND SEPPI, D. 2010a. Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge. Speech Comm. (Special Issue on Sensing Emotion and Affect - Facing Realism in Speech Processing) (to appear)..
- (2010) Speech Comm. (Special Issue on Sensing Emotion and Affect - Facing Realism in Speech Processing)
- Schuller, B.¹ Batliner, A.² Steidl, S.³ Seppi, D.⁴

44
- 70349292240
- Being bored? Recognising natural interest by extensive audiovisual integration for real-life application
- SCHULLER, B., MÜLLER, R., EYBEN, F., GAST, J., HÖRNLER, B., WÖLLMER, M., RIGOLL, G., HÖTHKER, A., AND KONOSU, H. 2009a. Being bored? Recognising natural interest by extensive audiovisual integration for real-life application. Image Vis. Comput. J. 27, 12, 1760-1774..
- (2009) Image Vis. Comput. J. , vol.27 , Issue.12 , pp. 1760-1774
- Schuller, B.¹ Müller, R.² Eyben, F.³ Gast, J.⁴ Hörnler, B.⁵ Wöllmer, M.⁶ Rigoll, G.⁷ Höthker, A.⁸ Konosu, H.⁹

45
- 34247624725
- Evolutionary feature generation in speech emotion recognition
- SCHULLER, B., REITER, S., AND RIGOLL, G. 2006. Evolutionary feature generation in speech emotion recognition. In Proceedings of the International Conference on Multimedia and Expo (ICME). 5-8..
- (2006) Proceedings of the International Conference on Multimedia and Expo (ICME) , pp. 5-8
- Schuller, B.¹ Reiter, S.² Rigoll, G.³

46
- 38049067290
- Timing levels in segment-based speech emotion recognition
- SCHULLER, B. AND RIGOLL, G. 2006. Timing levels in segment-based speech emotion recognition. In Proceedings of the Interspeech Conference. 1818-1821..
- (2006) Proceedings of the Interspeech Conference , pp. 1818-1821
- Schuller, B.¹ Rigoll, G.²

47
- 0141478857
- Hidden Markov model-based speech emotion recognition
- IEEE
- SCHULLER, B., RIGOLL, G., AND LANG, M. 2003. Hidden Markov model-based speech emotion recognition. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing. Vol. II. IEEE, 1-4..
- (2003) Proceedings of the International Conference on Acoustics, Speech and Signal Processing , vol.2 , pp. 1-4
- Schuller, B.¹ Rigoll, G.² Lang, M.³

48
- 34547549142
- Towards more reality in the recognition of emotional speech
- IEEE
- SCHULLER, B., SEPPI, D., BATLINER, A., MAIER, A., AND STEIDL, S. 2007a. Towards more reality in the recognition of emotional speech. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing. Vol. IV. IEEE, 941-944..
- (2007) Proceedings of the International Conference on Acoustics, Speech and Signal Processing , vol.4 , pp. 941-944
- Schuller, B.¹ Seppi, D.² Batliner, A.³ Maier, A.⁴ Steidl, S.⁵

49
- 70450206416
- The INTERSPEECH 2009 emotion challenge
- SCHULLER, B., STEIDL, S., AND BATLINER, A. 2009b. The INTERSPEECH 2009 emotion challenge. In Proceedings of the Interspeech Conference. 312-315..
- (2009) Proceedings of the Interspeech Conference , pp. 312-315
- Schuller, B.¹ Steidl, S.² Batliner, A.³

50
- 79954999224
- The interspeech 2010 paralinguistic challenge
- SCHULLER, B., STEIDL, S., BATLINER, A., BURKHARDT, F., DEVILLERS, L., MÜLLER, C., AND NARAYANAN, S. 2010b. The interspeech 2010 paralinguistic challenge. In Proceedings of the Interspeech Conference. 2794-2797..
- (2010) Proceedings of the Interspeech Conference , pp. 2794-2797
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Burkhardt, F.⁴ Devillers, L.⁵ Müller, C.⁶ Narayanan, S.⁷

51
- 84865716918
- The interspeech 2011 speaker state challenge
- SCHULLER, B., STEIDL, S., BATLINER, A., SCHIEL, F., AND KRAJEWSKI, J. 2011. The interspeech 2011 speaker state challenge. In Proceedings of the Interspeech Conference..
- (2011) Proceedings of the Interspeech Conference
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Schiel, F.⁴ Krajewski, J.⁵

52
- 77949395673
- Acoustic emotion recognition: A benchmark comparison of performances
- IEEE
- SCHULLER, B., VLASENKO, B., EYBEN, F., RIGOLL, G., AND WENDEMUTH, A. 2009c. Acoustic emotion recognition: A benchmark comparison of performances. In Proceedings of the ASRU Conference. IEEE..
- (2009) Proceedings of the ASRU Conference
- Schuller, B.¹ Vlasenko, B.² Eyben, F.³ Rigoll, G.⁴ Wendemuth, A.⁵

53
- 80053925819
- Cross-corpus acoustic emotion recognition: Variances and strategies
- SCHULLER, B., VLASENKO, B., EYBEN, F., WÖLLMER, M., STUHLSATZ, A., WENDEMUTH, A., AND RIGOLL, G. 2010c. Cross-corpus acoustic emotion recognition: Variances and strategies. IEEE Trans. Affective Comput. 1, 2..
- (2010) IEEE Trans. Affective Comput. , vol.1 , pp. 2
- Schuller, B.¹ Vlasenko, B.² Eyben, F.³ Wöllmer, M.⁴ Stuhlsatz, A.⁵ Wendemuth, A.⁶ Rigoll, G.⁷

54
- 44849100275
- Comparing one and two-stage acoustic modeling in the recognition of emotion in speech
- SCHULLER, B., VLASENKO, B., MINGUEZ, R., RIGOLL, G., AND WENDEMUTH, A. 2007b. Comparing one and two-stage acoustic modeling in the recognition of emotion in speech. In Proceedings of the Automatic Speech Recognition and Understanding Workshop. 596-600..
- (2007) Proceedings of the Automatic Speech Recognition and Understanding Workshop , pp. 596-600
- Schuller, B.¹ Vlasenko, B.² Minguez, R.³ Rigoll, G.⁴ Wendemuth, A.⁵

55
- 51449104640
- Brute-Forcing hierarchical functionals for paralinguistics: A waste of feature space?
- SCHULLER, B., WIMMER, M., MÖSENLECHNER, L., KERN, C., ARSIC, D., AND RIGOLL, G. 2008. Brute-Forcing hierarchical functionals for paralinguistics: A waste of feature space? In Proceedings of the International Conference on Acoustics, Speech and Signal Processing. 4501-4504..
- (2008) Proceedings of the International Conference on Acoustics, Speech and Signal Processing , pp. 4501-4504
- Schuller, B.¹ Wimmer, M.² Mösenlechner, L.³ Kern, C.⁴ Arsic, D.⁵ Rigoll, G.⁶

56
- 84983583785
- Cinemo a french spoken language resource for complex emotions: Facts and baselines
- SCHULLER, B., ZACCARELLI, R., ROLLET, N., AND DEVILLERS, L. 2010d. Cinemo a french spoken language resource for complex emotions: Facts and baselines. In Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC)..
- (2010) Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC)
- Schuller, B.¹ Zaccarelli, R.² Rollet, N.³ Devillers, L.⁴

57
- 0031268931
- Bidirectional recurrent neural networks
- SCHUSTER, M. AND PALIWAL, K. K. 1997. Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45, 2673-2681..
- (1997) IEEE Trans. Signal Process. , vol.45 , pp. 2673-2681
- Schuster, M.¹ Paliwal, K.K.²

58
- 70449388050
- Logos, Berlin
- STEIDL, S. 2009. Automatic Classification of Emotion-Related User States in Spontaneous Children's Speech. Logos, Berlin..
- (2009) Automatic Classification of Emotion-Related User States in Spontaneous Children's Speech
- Steidl, S.¹

59
- 77949400109
- The hinterland of emotions: Facing the open-microphone challenge
- IEEE
- STEIDL, S., SCHULLER, B., BATLINER, A., AND SEPPI, D. 2009. The hinterland of emotions: Facing the open-microphone challenge. In Proceedings of the 4th International HUMAINE Association Conference on Affective Computing and Intelligent Interaction (ACII). Vol. I. IEEE, 690-697..
- (2009) Proceedings of the 4th International HUMAINE Association Conference on Affective Computing and Intelligent Interaction (ACII) , vol.1 , pp. 690-697
- Steidl, S.¹ Schuller, B.² Batliner, A.³ Seppi, D.⁴

60
- 84880315493
- Emotions analysis and emotion-handling subdialogues
- W. Wahlster, Ed., Springer
- STREIT, M., BATLINER, A., AND PORTELE, T. 2006. Emotions analysis and emotion-handling subdialogues. In SmartKom: Foundations of Multimodal Dialogue Systems, W. Wahlster, Ed., Springer, 317-332..
- (2006) SmartKom: Foundations of Multimodal Dialogue Systems , pp. 317-332
- Streit, M.¹ Batliner, A.² Portele, T.³

61
- 38049048651
- Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing
- Springer
- VLASENKO, B., SCHULLER, B., WENDEMUTH, A., AND RIGOLL, G. 2007. Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing. In Proceedings of the 2nd International Conference on Affective Computing and Intelligent Interaction (ACII), Lecture Notes in Computer Science, vol. 4738. Springer, 139-147..
- (2007) Proceedings of the 2nd International Conference on Affective Computing and Intelligent Interaction (ACII), Lecture Notes in Computer Science , vol.4738 , pp. 139-147
- Vlasenko, B.¹ Schuller, B.² Wendemuth, A.³ Rigoll, G.⁴

62
- 84976221270
- Tuning hidden Markov model for speech emotion recognition
- VLASENKO, B. AND WENDEMUTH, A. 2007. Tuning hidden Markov model for speech emotion recognition. In Proceedings of DAGA 33rd German Annual Conference on Acoustics..
- (2007) Proceedings of DAGA 33rd German Annual Conference on Acoustics
- Vlasenko, B.¹ Wendemuth, A.²

63
- 33750564952
- Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition
- VOGT, T. AND ANDRE, E. 2005. Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition. In Proceedings of the ICME Conference. 474-477..
- (2005) Proceedings of the ICME Conference , pp. 474-477
- Vogt, T.¹ Andre, E.²

64
- 0025503558
- Backpropagation through time: What it does and how to do it
- WERBOS, P. 1990. Backpropagation through time: What it does and how to do it. Proc. IEEE 78, 1550-1560..
- (1990) Proc. IEEE , vol.78 , pp. 1550-1560
- Werbos, P.¹

65
- 80855135228
- Morgan Kaufmann, San Francisco
- WITTEN, I. H. AND FRANK, E. 2005. Data Mining: Practical Machine Learning Tools and Techniques, 2nd ed. Morgan Kaufmann, San Francisco..
- (2005) Data Mining: Practical Machine Learning Tools and Techniques, 2nd Ed
- Witten, I.H.¹ Frank, E.²

66
- 84862156369
- Abandoning emotion classes - Towards continuous emotion recognition with modelling of long-range dependencies
- WÖLLMER, M., EYBEN, F., REITER, S., SCHULLER, B., COX, C., DOUGLAS-COWIE, E., AND COWIE, R. 2008. Abandoning emotion classes - Towards continuous emotion recognition with modelling of long-range dependencies. In Proceedings of the Interspeech Conference. 597-600..
- (2008) Proceedings of the Interspeech Conference , pp. 597-600
- Wöllmer, M.¹ Eyben, F.² Reiter, S.³ Schuller, B.⁴ Cox, C.⁵ Douglas-Cowie, E.⁶ Cowie, R.⁷

67
- 70450186589
- Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks
- WÖLLMER, M., EYBEN, F., SCHULLER, B., DOUGLAS-COWIE, E., AND COWIE, R. 2009. Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks. In Proceedings of Interspeech Conference. 1595-1598..
- (2009) Proceedings of Interspeech Conference , pp. 1595-1598
- Wöllmer, M.¹ Eyben, F.² Schuller, B.³ Douglas-Cowie, E.⁴ Cowie, R.⁵

68
- 79958734716
- Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling
- WÖLLMER, M., METALLINOU, A., EYBEN, F., SCHULLER, B., AND NARAYANAN, S. 2010a. Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling. In Proceedings of Interspeech Conference. 2362-2365..
- (2010) Proceedings of Interspeech Conference , pp. 2362-2365
- Wöllmer, M.¹ Metallinou, A.² Eyben, F.³ Schuller, B.⁴ Narayanan, S.⁵

69
- 77956721304
- Combining long short-term memory and dynamic Bayesian networks for incremental emotion-sensitive artificial listening
- WÖLLMER, M., SCHULLER, B., EYBEN, F., AND RIGOLL, G. 2010b. Combining long short-term memory and dynamic Bayesian networks for incremental emotion-sensitive artificial listening. IEEE J. Select. Topics Signal Process. 4, 5, 867-881..
- (2010) IEEE J. Select. Topics Signal Process. , vol.4 , Issue.5 , pp. 867-881
- Wöllmer, M.¹ Schuller, B.² Eyben, F.³ Rigoll, G.⁴

70
- 78349237283
- Speech emotion estimation in 3d space
- WU, D., PARSONS, T., MOWER, E., AND NARAYANAN, S. S. 2010a. Speech emotion estimation in 3d space. In Proceedings of the ICME Conference. 737-742..
- (2010) Proceedings of the ICME Conference , pp. 737-742
- Wu, D.¹ Parsons, T.² Mower, E.³ Narayanan, S.S.⁴

71
- 79959848810
- Acoustic feature analysis in speech emotion primitives estimation
- WU, D., PARSONS, T., AND NARAYANAN, S. S. 2010b. Acoustic feature analysis in speech emotion primitives estimation. In Proceedings of the Interspeech Conference. 785-788..
- (2010) Proceedings of the Interspeech Conference , pp. 785-788
- Wu, D.¹ Parsons, T.² Narayanan, S.S.³

72
- 0003786273
- John Wiley
- YEE, P. V. AND HAYKIN, S. 2001. Regularized Radial Basis Function Networks: Theory and Applications. John Wiley..
- (2001) Regularized Radial Basis Function Networks: Theory and Applications
- Yee, P.V.¹ Haykin, S.²

73
- 57149144228
- A survey of affect recognition methods: Audio, visual, and spontaneous expressions
- ZENG, Z., PANTIC, M., ROISMAN, G. I., AND HUANG, T. 2009. A survey of affect recognition methods: Audio, visual, and spontaneous expressions. IEEE Trans. Pattern Anal. Mach. Intell. 31, 1, 39-58..
- (2009) IEEE Trans. Pattern Anal. Mach. Intell. , vol.31 , Issue.1 , pp. 39-58
- Zeng, Z.¹ Pantic, M.² Roisman, G.I.³ Huang, T.⁴

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.