SCOPUS 정보 검색 플랫폼

Image and Vision Computing

Volumn 31, Issue 2, 2013, Pages 153-163

LSTM-modeling of continuous emotions in an audiovisual affect recognition framework

(5) Wöllmer, Martin a Kaiser, Moritz a Eyben, Florian a Schuller, Björn a Rigoll, Gerhard a

a TECHNICAL UNIVERSITY OF MUNICH (Germany)

Author keywords

Context modeling; Emotion recognition; Facial movement features; Long short term memory

Indexed keywords

BRAIN;

AFFECTIVE COMPUTING; AUDIO-VISUAL AFFECT RECOGNITION; CONTEXT MODELING; DIMENSIONAL REPRESENTATION; EMOTION RECOGNITION; FACIAL MOVEMENTS; LONG SHORT-TERM MEMORY; RECOGNIZING HUMAN EMOTION;

HUMAN COMPUTER INTERACTION;

EID: 84886418479 PISSN: 02628856 EISSN: None Source Type: Journal
DOI: 10.1016/j.imavis.2012.03.001 Document Type: Article

Times cited : (274)

References (69)

1
- 85032751766
- Emotion recognition in human-computer interaction
- DOI 10.1109/79.911197
- R. Cowie, E. Douglas-Cowie, N. Tsapatsoulis, G. Votsis, S. Kollias, W. Fellenz, J.G. Taylor, Emotion recognition in human-computer interaction, IEEE Signal Process. Mag. 18 (1) (2001) 32-80. (Pubitemid 32287669)
- (2001) IEEE Signal Processing Magazine , vol.18 , Issue.1 , pp. 32-80
- Cowie, R.¹ Douglas-Cowie, E.² Tsapatsoulis, N.³ Votsis, G.⁴ Kollias, S.⁵ Fellenz, W.⁶ Taylor, J.G.⁷

2
- 70349337757
- On the role of emotion in embodied cognitive architectures: From organisms to robots
- T. Ziemke, R. Lowe, On the role of emotion in embodied cognitive architectures: from organisms to robots, Cogn. Comput. 1 (1) (2009) 104-117.
- (2009) Cogn. Comput. , vol.1 , Issue.1 , pp. 104-117
- Ziemke, T.¹ Lowe, R.²

3
- 79959846823
- Real-life emotion-related states detection in call centers: A cross-corpora study
- Makuhari, Japan
- L. Devillers, C. Vaudable, C. Chastagnol, Real-life emotion-related states detection in call centers: A cross-corpora study, Proc. of Interspeech, Makuhari, Japan, 2010, pp. 2350-2353.
- (2010) Proc. of Interspeech , pp. 2350-2353
- Devillers, L.¹ Vaudable, C.² Chastagnol, C.³

4
- 84964036733
- Building autonomous sensitive artificial listeners
- doi:10.1109/T-AFFC.2011.34
- M. Schröder, E. Bevacqua, R. Cowie, F. Eyben, H. Gunes, D. Heylen, M. Maat, G. McKeown, S. Pammi, M. Pantic, C. Pelachaud, B. Schuller, E. de Sevin, M. Valstar, M. Wöllmer, Building autonomous sensitive artificial listeners, IEEE Trans. Affective Comput. doi:10.1109/T-AFFC.2011.34.
- IEEE Trans. Affective Comput.
- Schröder, M.¹ Bevacqua, E.² Cowie, R.³ Eyben, F.⁴ Gunes, H.⁵ Heylen, D.⁶ Maat, M.⁷ McKeown, G.⁸ Pammi, S.⁹ Pantic, M.¹⁰ Pelachaud, C.¹¹ Schuller, B.¹² De Sevin, E.¹³ Valstar, M.¹⁴ Wöllmer, M.¹⁵

5
- 70349292240
- Being bored? Recognising natural interest by extensive audiovisual integration for real-life application
- B. Schuller, R. Müller, F. Eyben, J. Gast, B. Hörnler, M. Wöllmer, G. Rigoll, A. Höthker, H. Konosu, Being bored? Recognising natural interest by extensive audiovisual integration for real-life application, Image and Vision Computing Journal, Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior, 27 (12), 2009, pp. 1760-1774.
- (2009) Image and Vision Computing Journal, Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior , vol.27 , Issue.12 , pp. 1760-1774
- Schuller, B.¹ Müller, R.² Eyben, F.³ Gast, J.⁴ Hörnler, B.⁵ Wöllmer, M.⁶ Rigoll, G.⁷ Höthker, A.⁸ Konosu, H.⁹

6
- 80054831955
- AVEC the first international audio/visual emotion challenge
- Memphis, Tennessee, USA
- B. Schuller, M. Valstar, F. Eyben, G. McKeown, R. Cowie, M. Pantic, AVEC - the First International Audio/Visual Emotion Challenge, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 415-424.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 415-424
- Schuller, B.¹ Valstar, M.² Eyben, F.³ McKeown, G.⁴ Cowie, R.⁵ Pantic, M.⁶

7
- 51449104640
- Brute-forcinghierarchical functionals for paralinguistics: A waste of feature space?
- Las Vegas, NV
- B. Schuller, M.Wimmer, L.Mösenlechner, D. Arsic, G. Rigoll, Brute-forcingHierarchical Functionals for Paralinguistics: A Waste of Feature Space? Proc. of ICASSP, Las Vegas, NV, 2008, pp. 4501-4504.
- (2008) Proc. of ICASSP , pp. 4501-4504
- Schuller, B.¹ Wimmer, M.² Mösenlechner, L.³ Arsic, D.⁴ Rigoll, G.⁵

8
- 77949400109
- The hinterland of emotions: Facing the open-microphone challenge
- Amsterdam, The Netherlands
- S. Steidl, B. Schuller, A. Batliner, D. Seppi, The Hinterland of Emotions: Facing the Open-microphone Challenge, Proc. of ACII, Amsterdam, The Netherlands, 2009, pp. 690-697.
- (2009) Proc. of ACII , pp. 690-697
- Steidl, S.¹ Schuller, B.² Batliner, A.³ Seppi, D.⁴

9
- 79958702587
- Emotion representation, analysis and synthesis in continuous space: A survey
- Santa Barbara, CA, USA
- H. Gunes, B. Schuller, M. Pantic, R. Cowie, Emotion Representation, Analysis and Synthesis in Continuous Space: A Survey, Proc. of IEEE Conference on Face and Gesture Recognition, Santa Barbara, CA, USA, 2011, pp. 827-834.
- (2011) Proc. of IEEE Conference on Face and Gesture Recognition , pp. 827-834
- Gunes, H.¹ Schuller, B.² Pantic, M.³ Cowie, R.⁴

10
- 34547518166
- Support vector regression for automatic recognition of spontaneous emotions in speech
- Honolulu, Hawaii
- M. Grimm, K. Kroschel, S. Narayanan, Support Vector Regression for Automatic Recognition of Spontaneous Emotions in Speech, Proc. of ICASSP, Honolulu, Hawaii, 2007, pp. 1085-1088.
- (2007) Proc. of ICASSP , pp. 1085-1088
- Grimm, M.¹ Kroschel, K.² Narayanan, S.³

11
- 77949304464
- On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues
- F. Eyben, M. Wöllmer, A. Graves, B. Schuller, E. Douglas-Cowie, R. Cowie, On-line Emotion Recognition in a 3-D Activation-Valence-Time Continuum Using Acoustic and Linguistic Cues, Journal on Multimodal User Interfaces (JMUI), Special Issue on Real-time Affect Analysis and Interpretation: Closing the Loop in Virtual Agents, 3, 2009, pp. 7-19.
- (2009) Journal on Multimodal User Interfaces (JMUI), Special Issue on Real-time Affect Analysis and Interpretation: Closing the Loop in Virtual Agents , vol.3 , pp. 7-19
- Eyben, F.¹ Wöllmer, M.² Graves, A.³ Schuller, B.⁴ Douglas-Cowie, E.⁵ Cowie, R.⁶

12
- 77949395673
- Acoustic emotion recognition: A benchmark comparison of performances
- Merano, Italy
- B. Schuller, B. Vlasenko, F. Eyben, G. Rigoll, A.Wendemuth, Acoustic Emotion Recognition: A Benchmark Comparison of Performances, Proc. of ASRU, Merano, Italy, 2009, pp. 552-557.
- (2009) Proc. of ASRU , pp. 552-557
- Schuller, B.¹ Vlasenko, B.² Eyben, F.³ Rigoll, G.⁴ Wendemuth, A.⁵

13
- 79958734716
- Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling
- Makuhari, Japan
- M. Wöllmer, A. Metallinou, F. Eyben, B. Schuller, S. Narayanan, Context-sensitive Multimodal Emotion Recognition from Speech and Facial Expression Using Bidirectional LSTM Modeling, Proc. of Interspeech, Makuhari, Japan, 2010, pp. 2362-2365.
- (2010) Proc. of Interspeech , pp. 2362-2365
- Wöllmer, M.¹ Metallinou, A.² Eyben, F.³ Schuller, B.⁴ Narayanan, S.⁵

14
- 79958719285
- The SEMAINE corpus of emotionally coloured character interactions
- G. McKeown, M.F. Valstar, M. Pantic, R. Cowie, The SEMAINE Corpus of Emotionally Coloured Character Interactions, Proc. of ICME, 2010, pp. 1-6.
- (2010) Proc. of ICME , pp. 1-6
- McKeown, G.¹ Valstar, M.F.² Pantic, M.³ Cowie, R.⁴

15
- 0141478857
- Hidden markov model-based speech emotion recognition
- Hong Kong, China
- B. Schuller, G. Rigoll, M. Lang, Hidden Markov Model-based Speech Emotion Recognition, Proc. of ICASSP, Hong Kong, China, 2003, pp. 1-4.
- (2003) Proc. of ICASSP , pp. 1-4
- Schuller, B.¹ Rigoll, G.² Lang, M.³

16
- 77951250940
- Context is routinely encoded during emotion perception
- L.F. Barrett, E.A. Kensinger, Context is routinely encoded during emotion perception, Psychol. Sci. 21 (2010) 595-599.
- (2010) Psychol. Sci. , vol.21 , pp. 595-599
- Barrett, L.F.¹ Kensinger, E.A.²

17
- 77956721304
- Combining long short-term memory and dynamic bayesian networks for incremental emotion-sensitive artificial listening
- M. Wöllmer, B. Schuller, F. Eyben, G. Rigoll, Combining Long Short-Term Memory and Dynamic Bayesian Networks for incremental emotion-sensitive artificial listening, IEEE J. Sel. Top. Sign. Proces. 4 (5) (2010) 867-881.
- (2010) IEEE J. Sel. Top. Sign. Proces. , vol.4 , Issue.5 , pp. 867-881
- Wöllmer, M.¹ Schuller, B.² Eyben, F.³ Rigoll, G.⁴

18
- 0031573117
- Long Short-Term Memory
- S. Hochreiter, J. Schmidhuber, Long Short-Term Memory, Neural Comput. 9 (8) (1997) 1735-1780. (Pubitemid 127462305)
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

19
- 0041914606
- Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
- C. Kremer, J.F. Kolen (Eds.), IEEE Press
- S. Hochreiter, Y. Bengio, P. Frasconi, J. Schmidhuber, Gradient flow in recurrent nets: The difficulty of learning long-term dependencies, in: C. Kremer, J.F. Kolen (Eds.), A Field Guide to Dynamical Recurrent Neural Networks, IEEE Press, 2001, pp. 1-15.
- (2001) A Field Guide to Dynamical Recurrent Neural Networks , pp. 1-15
- Hochreiter, S.¹ Bengio, Y.² Frasconi, P.³ Schmidhuber, J.⁴

20
- 27744588611
- Framewise phoneme classification with bidirectional LSTM and other neural network architectures
- DOI 10.1016/j.neunet.2005.06.042, PII S0893608005001206
- A. Graves, J. Schmidhuber, Framewise phoneme classification with bidirectional LSTM and other neural network architectures, Neural Networks 18 (5-6) (2005) 602-610. (Pubitemid 43186580)
- (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
- Graves, A.¹ Schmidhuber, J.²

21
- 38149014113
- An application of recurrent neural networks to discriminative keyword spotting
- Porto, Portugal
- S. Fernandez, A. Graves, J. Schmidhuber, An Application of Recurrent Neural Networks to Discriminative Keyword Spotting, Proc. of ICANN, Porto, Portugal, 2007, pp. 220-229.
- (2007) Proc. of ICANN , pp. 220-229
- Fernandez, S.¹ Graves, A.² Schmidhuber, J.³

22
- 70349203870
- Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
- Taipei, Taiwan
- M. Wöllmer, F. Eyben, J. Keshet, A. Graves, B. Schuller, G. Rigoll, Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech Using Bidirectional LSTM Networks, Proc. of ICASSP, Taipei, Taiwan, 2009, pp. 3949-3952.
- (2009) Proc. of ICASSP , pp. 3949-3952
- Wöllmer, M.¹ Eyben, F.² Keshet, J.³ Graves, A.⁴ Schuller, B.⁵ Rigoll, G.⁶

23
- 78651563436
- Bidirectional LSTM networks for context-sensitive keyword detection in a cognitive virtual agent framework
- M. Wöllmer, F. Eyben, A. Graves, B. Schuller, G. Rigoll, Bidirectional LSTM networks for context-sensitive keyword detection in a cognitive virtual agent framework, Cogn. Comput. 2 (3) (2010) 180-190.
- (2010) Cogn. Comput. , vol.2 , Issue.3 , pp. 180-190
- Wöllmer, M.¹ Eyben, F.² Graves, A.³ Schuller, B.⁴ Rigoll, G.⁵

24
- 80051637579
- A multi-stream ASR framework for BLSTM modeling of conversational speech
- Prague, Czech Republic
- M. Wöllmer, F. Eyben, B. Schuller, G. Rigoll, A Multi-stream ASR Framework for BLSTM Modeling of Conversational Speech, Proc. of ICASSP, Prague, Czech Republic, 2011, pp. 4860-4863.
- (2011) Proc. of ICASSP , pp. 4860-4863
- Wöllmer, M.¹ Eyben, F.² Schuller, B.³ Rigoll, G.⁴

25
- 84858961864
- A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition
- Waikoloa, Big Island, Hawaii
- M. Wöllmer, B. Schuller, G. Rigoll, A Novel Bottleneck-BLSTM Front-end for Feature-level Context Modeling in Conversational Speech Recognition, Proc. of ASRU, Waikoloa, Big Island, Hawaii, 2011, pp. 36-41.
- (2011) Proc. of ASRU , pp. 36-41
- Wöllmer, M.¹ Schuller, B.² Rigoll, G.³

26
- 84862156369
- Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies
- Brisbane, Australia
- M. Wöllmer, F. Eyben, S. Reiter, B. Schuller, C. Cox, E. Douglas-Cowie, R. Cowie, Abandoning Emotion Classes - Towards Continuous Emotion Recognition with Modelling of Long-range Dependencies, Proc. of Interspeech, Brisbane, Australia, 2008, pp. 597-600.
- (2008) Proc. of Interspeech , pp. 597-600
- Wöllmer, M.¹ Eyben, F.² Reiter, S.³ Schuller, B.⁴ Cox, C.⁵ Douglas-Cowie, E.⁶ Cowie, R.⁷

27
- 80054842318
- Continuous prediction of spontaneous affect from multiple cues and modalities in valence-arousal space
- M.A. Nicolaou, H. Gunes, M. Pantic, Continuous prediction of spontaneous affect from multiple cues and modalities in valence-arousal space, IEEE Trans. Affect. Comput. 2 (2011) 92-105.
- (2011) IEEE Trans. Affect. Comput. , vol.2 , pp. 92-105
- Nicolaou, M.A.¹ Gunes, H.² Pantic, M.³

28
- 78650977476
- Open SMILE the munich versatile and fast open-source audio feature extractor
- Firenze, Italy
- F. Eyben, M. Wöllmer, B. Schuller, open SMILE - The Munich Versatile and Fast Open-source Audio Feature Extractor, Proc. of ACM Multimedia, Firenze, Italy, 2010, pp. 1459-1462.
- (2010) Proc. of ACM Multimedia , pp. 1459-1462
- Eyben, F.¹ Wöllmer, M.² Schuller, B.³

29
- 70449580639
- A hierarchical approach for visual suspicious behavior detection in aircrafts
- D. Arsic, B. Hörnler, B. Schuller, G. Rigoll, A Hierarchical Approach for Visual Suspicious Behavior Detection in Aircrafts, Proceedings of the 16th international conference on Digital Signal Processing, 2009, pp. 639-645.
- (2009) Proceedings of the 16th International Conference on Digital Signal Processing , pp. 639-645
- Arsic, D.¹ Hörnler, B.² Schuller, B.³ Rigoll, G.⁴

30
- 80054844663
- Audio-based emotion recognition from natural conversations based on co-occurrence matrix and frequency domain energy distribution features
- Memphis, Tennessee, USA
- A. Sayedelahl, P. Fewzee, M. Kamel, F. Karray, Audio-based Emotion Recognition from Natural Conversations Based on Co-Occurrence Matrix and Frequency Domain Energy Distribution Features, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 407-414.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 407-414
- Sayedelahl, A.¹ Fewzee, P.² Kamel, M.³ Karray, F.⁴

31
- 80054842000
- Speech emotion recognition system based on L1 regularized linear regression and decision fusion
- Memphis, Tennessee, USA
- L. Cen, Z.L. Yu, M.H. Dong, Speech Emotion Recognition System based on L1 Regularized Linear Regression and Decision Fusion, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 332-340.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 332-340
- Cen, L.¹ Yu, Z.L.² Dong, M.H.³

32
- 80054832593
- The CASIA audio emotion recognition method for audio/visual emotion challenge 2011
- Memphis, Tennessee, USA
- S. Pan, J. Tao, Y. Li, The CASIA Audio Emotion Recognition Method for Audio/Visual Emotion Challenge 2011, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 388-395.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 388-395
- Pan, S.¹ Tao, J.² Li, Y.³

33
- 80054842861
- Modeling latent discriminative dynamic of multi-dimensional affective signals
- Memphis, Tennessee, USA
- G. Ramirez, T. Baltrusaitis, L.P. Morency, Modeling Latent Discriminative Dynamic of Multi-dimensional Affective Signals, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 396-406.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 396-406
- Ramirez, G.¹ Baltrusaitis, T.² Morency, L.P.³

34
- 80054842597
- Investigating the use of formant based features for detection of affective dimensions in speech
- Memphis, Tennessee, USA
- J.C. Kim, H. Rao, M.A. Clements, Investigating the Use of Formant Based Features for Detection of Affective Dimensions in Speech, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 369-377.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 369-377
- Kim, J.C.¹ Rao, H.² Clements, M.A.³

35
- 80054836331
- Multiple classifier systems for the classification of audio-visual emotional states
- Memphis, Tennessee, USA
- M. Glodek, S. Tschechne, G. Layher, M. Schels, T. Brosch, S. Scherer, M. Kächele, M. Schmidt, H. Neumann, G. Palm, F. Schwenker, Multiple Classifier Systems for the Classification Of Audio-Visual Emotional States, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 359-368.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 359-368
- Glodek, M.¹ Tschechne, S.² Layher, G.³ Schels, M.⁴ Brosch, T.⁵ Scherer, S.⁶ Kächele, M.⁷ Schmidt, M.⁸ Neumann, H.⁹ Palm, G.¹⁰ Schwenker, F.¹¹

36
- 36348934700
- The world of emotions is not two-dimensional
- J.R.J. Fontaine, K.R. Scherer, E.B. Roesch, P. Ellsworth, The world of emotions is not two-dimensional, Psychol. Sci. 18 (2) (2007) 1050-1057.
- (2007) Psychol. Sci. , vol.18 , Issue.2 , pp. 1050-1057
- Fontaine, J.R.J.¹ Scherer, K.R.² Roesch, E.B.³ Ellsworth, P.⁴

37
- 84865716918
- The interspeech 2011 speaker state challenge
- Florence, Italy
- B. Schuller, S. Steidl, A. Batliner, F. Schiel, J. Krajewski, The Interspeech 2011 Speaker State Challenge, Proc. of Interspeech 2011, Florence, Italy, 2011, pp. 3201-3204.
- (2011) Proc. of Interspeech 2011 , pp. 3201-3204
- Schuller, B.¹ Steidl, S.² Batliner, A.³ Schiel, F.⁴ Krajewski, J.⁵

38
- 77950555854
- Recent development of open-source speech recognition engine julius
- Sapporo, Japan
- A. Lee, T. Kawahara, Recent Development of Open-source Speech Recognition Engine Julius, Proc. of APSIPA ASC, Sapporo, Japan, 2009, pp. 131-137.
- (2009) Proc. of APSIPA ASC , pp. 131-137
- Lee, A.¹ Kawahara, T.²

39
- 79959404069
- The design and collection of COSINE, A multi-microphone in situ speech corpus recorded in noisy environments
- A. Stupakov, E. Hanusa, D. Vijaywargi, D. Fox, J. Bilmes, The design and collection of COSINE, a multi-microphone in situ speech corpus recorded in noisy environments, Comput. Speech Lang. 26 (1) (2011) 52-66.
- (2011) Comput. Speech Lang. , vol.26 , Issue.1 , pp. 52-66
- Stupakov, A.¹ Hanusa, E.² Vijaywargi, D.³ Fox, D.⁴ Bilmes, J.⁵

40
- 38049052968
- The HUMAINE database: Addressing the collection and annotation of naturalistic and induced emotional data
- 2007, Springer
- E. Douglas-Cowie, R. Cowie, I. Sneddon, C. Cox, O. Lowry, M. McRorie, J.C. Martin, L. Devillers, S. Abrilian, A. Batliner, N. Amir, K. Karpouzis, The HUMAINE Database: Addressing the Collection and Annotation of Naturalistic and Induced Emotional Data, Affective Computing and Intelligent Interaction, vol. 4738/2007, Springer, 2007, pp. 488-500.
- (2007) Affective Computing and Intelligent Interaction , vol.4738 , pp. 488-500
- Douglas-Cowie, E.¹ Cowie, R.² Sneddon, I.³ Cox, C.⁴ Lowry, O.⁵ McRorie, M.⁶ Martin, J.C.⁷ Devillers, L.⁸ Abrilian, S.⁹ Batliner, A.¹⁰ Amir, N.¹¹ Karpouzis, K.¹²

41
- 63449136395
- Facial expression recognition based on local binary patterns: A comprehensive study
- C. Shan, S. Gong, P.W. McOwan, Facial expression recognition based on local binary patterns: A comprehensive study, Image Vision Comput. 27 (6) (2009) 803-816.
- (2009) Image Vision Comput , vol.27 , Issue.6 , pp. 803-816
- Shan, C.¹ Gong, S.² McOwan, P.W.³

42
- 33847747430
- Facial expression recognition in image sequences using geometric deformation features and support vector machines
- DOI 10.1109/TIP.2006.884954
- I. Kotsia, I. Pitas, Facial expression recognition in image sequences using geometric deformation features and support vector machines, IEEE Trans. Image Process. 16 (1) (2007) 172-187. (Pubitemid 46437480)
- (2007) IEEE Transactions on Image Processing , vol.16 , Issue.1 , pp. 172-187
- Kotsia, I.¹ Pitas, I.²

43
- 56049094605
- Boosting encoded dynamic features for facial expression recognition
- P. Yang, Q. Liu, D.N. Metaxas, Boosting encoded dynamic features for facial expression recognition, Pattern Recognit. Lett. 30 (2) (2009) 132-139.
- (2009) Pattern Recognit. Lett. , vol.30 , Issue.2 , pp. 132-139
- Yang, P.¹ Liu, Q.² Metaxas, D.N.³

44
- 42249104358
- An analysis of facial expression recognition under partial facial image occlusion
- I. Kotsia, I. Buciu, I. Pitas, An analysis of facial expression recognition under partial facial image occlusion, Image Vision Comput 26 (7) (2008) 1052-1067.
- (2008) Image Vision Comput , vol.26 , Issue.7 , pp. 1052-1067
- Kotsia, I.¹ Buciu, I.² Pitas, I.³

45
- 63049094206
- Pose-invariant facial expression recognition using variable-intensity templates
- S. Kumano, K. Otsuka, J. Yamato, E. Maeda, Y. Sato, Pose-invariant facial expression recognition using variable-intensity templates, Int. J. Comput. Vis. 83 (2) (2009) 178-194.
- (2009) Int. J. Comput. Vis. , vol.83 , Issue.2 , pp. 178-194
- Kumano, S.¹ Otsuka, K.² Yamato, J.³ Maeda, E.⁴ Sato, Y.⁵

46
- 4344611819
- Handbook of face recognition
- Springer, London
- Y. Tian, T. Kanade, J.F. Cohn, Handbook of Face Recognition, Facial Expression Analysis, Springer, London, 2011, pp. 487-519.
- (2011) Facial Expression Analysis , pp. 487-519
- Tian, Y.¹ Kanade, T.² Cohn, J.F.³

47
- 79958694881
- String-based audiovisual fusion of behavioural events for the assessment of dimensional affect
- F. Eyben, M. Wöllmer, M. Valstar, H. Gunes, B. Schuller, M. Pantic, String-based audiovisual fusion of behavioural events for the assessment of dimensional affect, Proc. of FG, 2011, pp. 322-329.
- (2011) Proc. of FG , pp. 322-329
- Eyben, F.¹ Wöllmer, M.² Valstar, M.³ Gunes, H.⁴ Schuller, B.⁵ Pantic, M.⁶

48
- 70449526103
- A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams
- M. Wöllmer, M. Al-Hames, F. Eyben, B. Schuller, G. Rigoll, A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams, Neurocomputing 73 (1-3) (2009) 366-380.
- (2009) Neurocomputing , vol.73 , Issue.1-3 , pp. 366-380
- Wöllmer, M.¹ Al-Hames, M.² Eyben, F.³ Schuller, B.⁴ Rigoll, G.⁵

49
- 62949227381
- Audio-visual emotion recognition using gaussian mixture models for face and voice
- Los Alamitos, CA, USA
- A. Metallinou, S. Lee, S. Narayanan, Audio-Visual Emotion Recognition Using Gaussian Mixture Models for Face and Voice, International Symposium on Multimedia, Los Alamitos, CA, USA, 2008, pp. 250-257.
- (2008) International Symposium on Multimedia , pp. 250-257
- Metallinou, A.¹ Lee, S.² Narayanan, S.³

50
- 44049099067
- Audio-visual affective expression recognition through multistream fused HMM
- DOI 10.1109/TMM.2008.921737, 4523967
- Z. Zeng, J. Tu, B. Pianfetti, T.S. Huang, Audio-visual affective expression recognition through multistream fused HMM, IEEE Trans. Multimedia 10 (4) (2008) 570-577. (Pubitemid 351711233)
- (2008) IEEE Transactions on Multimedia , vol.10 , Issue.4 , pp. 570-577
- Zeng, Z.¹ Tu, J.² Pianfetti Jr., B.M.³ Huang, T.S.⁴

51
- 55549106433
- Audio-visual integration of emotion expression
- O. Collignon, S. Girard, F. Gosselin, S. Roy, D. Saint-Amour, M. Lassonde, F. Lepore, Audio-visual integration of emotion expression, Brain Res. 1242 (2008) 126-135.
- (2008) Brain Res , vol.1242 , pp. 126-135
- Collignon, O.¹ Girard, S.² Gosselin, F.³ Roy, S.⁴ Saint-Amour, D.⁵ Lassonde, M.⁶ Lepore, F.⁷

52
- 80054841755
- A psychologically-inspired match-score fusion model for video-based facial expression recognition
- Memphis, Tennessee, USA
- A. Cruz, B. Bhanu, S. Yang, A psychologically-inspired match-score fusion model for video-based facial expression recognition, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 341-350.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 341-350
- Cruz, A.¹ Bhanu, B.² Yang, S.³

53
- 80054837844
- Continuous emotion recognition using gabor energy filters
- Memphis, Tennessee, USA
- M. Dahmane, J. Meunier, Continuous emotion recognition using Gabor energy filters, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 351-358.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 351-358
- Dahmane, M.¹ Meunier, J.²

54
- 2142812371
- Robust real-time face detection
- P.A. Viola, M.J. Jones, Robust real-time face detection, Int. J. Comput. Vis. 57 (2) (2004) 137-154.
- (2004) Int. J. Comput. Vis. , vol.57 , Issue.2 , pp. 137-154
- Viola, P.A.¹ Jones, M.J.²

55
- 0036647193
- Multiresolution gray-scale and rotation invariant texture classification with local binary patterns
- DOI 10.1109/TPAMI.2002.1017623
- T. Ojala, M. Pietikäinen, T. Mäenpää, Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Trans. Pattern Anal. Mach. Intell. 24 (7) (2002) 971-987. (Pubitemid 34835471)
- (2002) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.24 , Issue.7 , pp. 971-987
- Ojala, T.¹ Pietikainen, M.² Maenpaa, T.³

56
- 77951183242
- Video object detection speedup using staggered sampling
- D. Greig, Video object detection speedup using staggered sampling, IEEE Workshop on Applications of Computer Vision (WACV), 2009, pp. 23-29.
- (2009) IEEE Workshop on Applications of Computer Vision (WACV) , pp. 23-29
- Greig, D.¹

57
- 0004342235
- Computer vision face tracking for use in a perceptual user interface
- G.R. Bradski, Computer vision face tracking for use in a perceptual user interface, Tech. Rep. Q2, Intel Technol. J. (1998) 1-15.
- (1998) Tech. Rep. Q2, Intel Technol. J. , pp. 1-15
- Bradski, G.R.¹

58
- 76749092270
- The WEKA data mining software: An update
- M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, I.H. Witten, The WEKA data mining software: An update, ACM SIGKDD Explor. Newslett. 11 (2009) 10-18.
- (2009) ACM SIGKDD Explor. Newslett. , vol.11 , pp. 10-18
- Hall, M.¹ Frank, E.² Holmes, G.³ Pfahringer, B.⁴ Reutemann, P.⁵ Witten, I.H.⁶

59
- 80054840264
- Naturalistic affective expression classification by a multi-stage approach based on hidden markov models
- Memphis, Tennessee, USA
- H. Meng, N. Bianchi-Berthouze, Naturalistic affective expression classification by a multi-stage approach based on Hidden Markov Models, Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) held in conjunction with ACII, Memphis, Tennessee, USA, 2011, pp. 378-387.
- (2011) Proc. of First International Audio/Visual Emotion Challenge and Workshop (AVEC 2011) Held in Conjunction with ACII , pp. 378-387
- Meng, H.¹ Bianchi-Berthouze, N.²

60
- 0031268931
- Bidirectional recurrent neural networks
- PII S1053587X97080550
- M. Schuster, K.K. Paliwal, Bidirectional recurrent neural networks, IEEE Trans. Signal Process. 45 (1997) 2673-2681. (Pubitemid 127766336)
- (1997) IEEE Transactions on Signal Processing , vol.45 , Issue.11 , pp. 2673-2681
- Schuster, M.¹ Paliwal, K.K.²

61
- 0034293152
- Learning to forget: Continual prediction with LSTM
- F. Gers, J. Schmidhuber, F. Cummins, Learning to forget: continual prediction with LSTM, Neural Comput. 12 (10) (2000) 2451-2471.
- (2000) Neural Comput , vol.12 , Issue.10 , pp. 2451-2471
- Gers, F.¹ Schmidhuber, J.² Cummins, F.³

62
- 70349284484
- Ph.D. thesis, Technische Universität München
- A. Graves, Supervised sequence labelling with recurrent neural networks, Ph.D. thesis, Technische Universität München (2008).
- (2008) Supervised Sequence Labelling with Recurrent Neural Networks
- Graves, A.¹

63
- 80051977348
- Tandemdecoding of children's speech for keyword detection in a child-robot interaction scenario
- M.Wöllmer, B. Schuller, A. Batliner, S. Steidl, D. Seppi, Tandemdecoding of children's speech for keyword detection in a child-robot interaction scenario, IEEE Trans. Audio Speech Lang. Process. 7 (4) (2011) 1-26.
- (2011) IEEE Trans. Audio Speech Lang. Process. , vol.7 , Issue.4 , pp. 1-26
- Wöllmer, M.¹ Schuller, B.² Batliner, A.³ Steidl, S.⁴ Seppi, D.⁵

64
- 79958176949
- On-line driver distraction detection using long short-term memory
- M. Wöllmer, C. Blaschke, T. Schindl, B. Schuller, B. Färber, S. Mayer, B. Trefflich, On-line driver distraction detection using Long Short-Term Memory, IEEE Trans. Intell. Transp. Syst. 12 (2) (2011) 574-582.
- (2011) IEEE Trans. Intell. Transp. Syst. , vol.12 , Issue.2 , pp. 574-582
- Wöllmer, M.¹ Blaschke, C.² Schindl, T.³ Schuller, B.⁴ Färber, B.⁵ Mayer, S.⁶ Trefflich, B.⁷

65
- 84867614588
- Analyzing the memory of BLSTM neural networks for enhanced emotion classification in dyadic spoken interactions
- Kyoto, Japan
- M. Wöllmer, A. Metallinou, N. Katsamanis, B. Schuller, S. Narayanan, Analyzing the Memory of BLSTM Neural Networks for Enhanced Emotion Classification in Dyadic Spoken Interactions, Proc. of ICASSP, Kyoto, Japan, 2012.
- (2012) Proc. of ICASSP
- Wöllmer, M.¹ Metallinou, A.² Katsamanis, N.³ Schuller, B.⁴ Narayanan, S.⁵

66
- 0003957032
- 2nd Edition Morgan Kaufmann, San Francisco
- I.H. Witten, E. Frank, Data Mining: Practical Machine Learning Tools and Techniques, 2nd Edition Morgan Kaufmann, San Francisco, 2005.
- (2005) Data Mining: Practical Machine Learning Tools and Techniques
- Witten, I.H.¹ Frank, E.²

67
- 0003632935
- 8th ed., Iowa State University Press
- G.W. Snedecor, W.G. Cochran, Statistical methods, 8th ed. Iowa State University Press, 1989.
- (1989) Statistical Methods
- Snedecor, G.W.¹ Cochran, W.G.²

68
- 34547548235
- Probabilistic and bottle-neck features for LVCSR of meetings
- Honolulu, Hawaii
- F. Grezl, M. Karafiat, K. Stanislav, J. Cernocky, Probabilistic and Bottle-neck Features for LVCSR of Meetings, Proc. of ICASSP, Honolulu, Hawaii, 2007, pp. 757-760.
- (2007) Proc. of ICASSP , pp. 757-760
- Grezl, F.¹ Karafiat, M.² Stanislav, K.³ Cernocky, J.⁴

69
- 84898971246
- An asynchronous hidden markov model for audio-visual speech recognition
- S. Bengio, An asynchronous Hidden Markov Model for audio-visual speech recognition, Adv. NIPS 15 (2003) 1-8.
- (2003) Adv. NIPS , vol.15 , pp. 1-8
- Bengio, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.