SCOPUS Information Retrieval Platform

Volume 45, Issue 9, 2015, Pages 1927-1941

Temporal Bayesian Fusion for Affect Sensing: Combining Video, Audio, and Lexical Modalities

Authors (4): Savran, Arman (a); Cao, Houwei (a); Nenkova, Ani (b); Verma, Ragini (a)

a UNIVERSITY OF PENNSYLVANIA (United States)

b UNIVERSITY OF PENNSYLVANIA (United States)

Author keywords

Acoustic; affective computing; arousal; Bayesian fusion; emotion recognition; facial expressions; lexical; multimodal; particle filter; power; speech; temporal fusion; turn based; valence

Indexed keywords

ACOUSTICS; FACE RECOGNITION; SPEECH;

AFFECTIVE COMPUTING; AROUSAL; BAYESIAN FUSION; EMOTION RECOGNITION; FACIAL EXPRESSIONS; LEXICAL; MULTI-MODAL; PARTICLE FILTER; POWER; VALENCE;

SPEECH RECOGNITION;

AUTOMATED PATTERN RECOGNITION; BAYES THEOREM; EMOTION; FACIAL EXPRESSION; HUMAN; PHYSIOLOGY; PROCEDURES; VIDEORECORDING;

BAYES THEOREM; EMOTIONS; FACIAL EXPRESSION; HUMANS; PATTERN RECOGNITION, AUTOMATED; VIDEO RECORDING;

EID: 84939825651 PISSN: 21682267 EISSN: None Source Type: Journal
DOI: 10.1109/TCYB.2014.2362101 Document Type: Article

Times cited : (45)

References (41)

1
- 84870186039
- Consistent but modest: A meta-analysis on unimodal and multimodal affect detection accuracies from 30 studies
- S. D'Mello and J. Kory, "Consistent but modest: A meta-analysis on unimodal and multimodal affect detection accuracies from 30 studies," in Proc. ACM Int. Conf. Multimodal Interact., New York, NY, USA, 2012, pp. 31-38.
- (2012) Proc. ACM Int. Conf. Multimodal Interact., New York, NY, USA , pp. 31-38
- D'Mello, S.¹ Kory, J.²

2
- 84867336190
- Multisensor data fusion: A review of the state-of-the-art
- B. Khaleghi, A. Khamis, F. O. Karray, and S. N. Razavi, "Multisensor data fusion: A review of the state-of-the-art," Inf. Fusion, vol. 14, no. 1, pp. 28-44, Jan. 2013.
- (2013) Inf. Fusion , vol.14 , Issue.1 , pp. 28-44
- Khaleghi, B.¹ Khamis, A.² Karray, F.O.³ Razavi, S.N.⁴

3
- 78049394179
- Automatic, dimensional and continuous emotion recognition
- H. Gunes and M. Pantic, "Automatic, dimensional and continuous emotion recognition," Int. J. Syn. Emot., vol. 1, no. 1, pp. 68-99, Jan. 2010.
- (2010) Int. J. Syn. Emot. , vol.1 , Issue.1 , pp. 68-99
- Gunes, H.¹ Pantic, M.²

4
- 36348934700
- The world of emotion is not two-dimensional
- J. R. Fontaine, K. R. Scherer, E. B. Roesch, and P. Ellsworth, "The world of emotion is not two-dimensional," Psychol. Sci., vol. 18, pp. 1050-1057, Dec. 2007.
- (2007) Psychol. Sci. , vol.18 , pp. 1050-1057
- Fontaine, J.R.¹ Scherer, K.R.² Roesch, E.B.³ Ellsworth, P.⁴

5
- 79953822842
- Affect detection: An interdisciplinary review of models, methods, and their applications
- R. A. Calvo and S. D'Mello, "Affect detection: An interdisciplinary review of models, methods, and their applications," IEEE Trans. Affect. Comput., vol. 1, no. 1, pp. 18-37, Jan. 2010.
- (2010) IEEE Trans. Affect. Comput. , vol.1 , Issue.1 , pp. 18-37
- Calvo, R.A.¹ D'Mello, S.²

6
- 57149144228
- A survey of affect recognition methods: Audio, visual, and spontaneous expressions
- Z. Zeng, M. Pantic, G. I. Roisman, and T. S. Huang, "A survey of affect recognition methods: Audio, visual, and spontaneous expressions," IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, no. 1, pp. 39-58, Jan. 2009.
- (2009) IEEE Trans. Pattern Anal. Mach. Intell. , vol.31 , Issue.1 , pp. 39-58
- Zeng, Z.¹ Pantic, M.² Roisman, G.I.³ Huang, T.S.⁴

7
- 0037209464
- Automatic facial expression analysis: A survey
- B. Fasel and J. Luettin, "Automatic facial expression analysis: A survey," Pattern Recognit., vol. 36, no. 1, pp. 259-275, 2003.
- (2003) Pattern Recognit. , vol.36 , Issue.1 , pp. 259-275
- Fasel, B.¹ Luettin, J.²

8
- 1342316500
- P. Ekman, W. V. Friesen, and J. C. Hager, Facial Action Coding System. Salt Lake City, UT, USA: A Human Face, 2002.
- (2002) Facial Action Coding System
- Ekman, P.¹ Friesen, W.V.² Hager, J.C.³

9
- 84862961455
- Error weighted semi-coupled hidden Markov model for audio-visual emotion recognition
- J.-C. Lin, C.-H. Wu, and W.-L. Wei, "Error weighted semi-coupled hidden Markov model for audio-visual emotion recognition," IEEE Trans. Multimedia, vol. 14, no. 1, pp. 142-156, Feb. 2012.
- (2012) IEEE Trans. Multimedia , vol.14 , Issue.1 , pp. 142-156
- Lin, J.-C.¹ Wu, C.-H.² Wei, W.-L.³

10
- 84870210348
- Robust continuous prediction of human emotions using multiscale dynamic cues
- J. Nicolle, V. Rapp, K. Bailly, L. Prevost, and M. Chetouani, "Robust continuous prediction of human emotions using multiscale dynamic cues," in Proc. ACM Int. Conf. Multimodal Interact. (ICMI), New York, NY, USA, 2012, pp. 501-508.
- (2012) Proc. ACM Int. Conf. Multimodal Interact. (ICMI) , pp. 501-508
- Nicolle, J.¹ Rapp, V.² Bailly, K.³ Prevost, L.⁴ Chetouani, M.⁵

11
- 84866713347
- Regression-based intensity estimation of facial action units
- A. Savran, B. Sankur, and M. T. Bilge, "Regression-based intensity estimation of facial action units," Image Vis. Comput., vol. 30, no. 10, pp. 774-784, Oct. 2012.
- (2012) Image Vis. Comput. , vol.30 , Issue.10 , pp. 774-784
- Savran, A.¹ Sankur, B.² Bilge, M.T.³

12
- 84870213533
- AVEC 2012: The continuous audio/visual emotion challenge
- B. Schuller, M. Valstar, F. Eyben, R. Cowie, and M. Pantic, "AVEC 2012: The continuous audio/visual emotion challenge," in Proc. ACM Int. Conf. Multimodal Interact., New York, NY, USA, 2012, pp. 361-362.
- (2012) Proc. ACM Int. Conf. Multimodal Interact , pp. 361-362
- Schuller, B.¹ Valstar, M.² Eyben, F.³ Cowie, R.⁴ Pantic, M.⁵

13
- 33746410556
- Emotional speech recognition: Resources, features, and methods
- D. Ververidis and C. Kotropoulos, "Emotional speech recognition: Resources, features, and methods," Speech Commun., vol. 48, pp. 1162-1181, Sep. 2006.
- (2006) Speech Commun. , vol.48 , pp. 1162-1181
- Ververidis, D.¹ Kotropoulos, C.²

14
- 77949395673
- Acoustic emotion recognition: A benchmark comparison of performances
- B. Schuller, B. Vlasenko, F. Eyben, G. Rigoll, and A. Wendemuth, "Acoustic emotion recognition: A benchmark comparison of performances," in Proc. Autom. Speech Recognit. Understanding (ASRU), Merano, Italy, 2009, pp. 552-557.
- (2009) Proc. Autom. Speech Recognit. Understanding (ASRU), Merano, Italy , pp. 552-557
- Schuller, B.¹ Vlasenko, B.² Eyben, F.³ Rigoll, G.⁴ Wendemuth, A.⁵

15
- 33745198227
- Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
- B. Schuller, R. Müller, M. K. Lang, and G. Rigoll, "Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles," in Proc. Interspeech, Lisbon, Portugal, 2005, pp. 805-808.
- (2005) Proc. Interspeech, Lisbon, Portugal , pp. 805-808
- Schuller, B.¹ Müller, R.² Lang, M.K.³ Rigoll, G.⁴

16
- 77949298130
- Multimodal emotion recognition in speech-based interaction using facial expression, body gesture and acoustic analysis
- L. Kessous, G. Castellano, and G. Caridakis, "Multimodal emotion recognition in speech-based interaction using facial expression, body gesture and acoustic analysis," J. Multimodal User Interf., vol. 3, nos. 1-2, pp. 33-48, 2010.
- (2010) J. Multimodal User Interf. , vol.3 , Issue.1-2 , pp. 33-48
- Kessous, L.¹ Castellano, G.² Caridakis, G.³

17
- 33646766470
- Fusing face and body display for bi-modal emotion recognition: Single frame analysis and multi-frame post integration
- H. Gunes and M. Piccardi, "Fusing face and body display for bi-modal emotion recognition: Single frame analysis and multi-frame post integration," in Proc. Affect. Comput. Intell. Interact. (ACII), Beijing, China, 2005, pp. 102-111.
- (2005) Proc. Affect. Comput. Intell. Interact. (ACII), Beijing, China , pp. 102-111
- Gunes, H.¹ Piccardi, M.²

18
- 61549119152
- Automatic temporal segment detection and affect recognition from face and body display
- H. Gunes and M. Piccardi, "Automatic temporal segment detection and affect recognition from face and body display," IEEE Trans. Cybern., vol. 39, no. 1, pp. 64-84, Feb. 2009.
- (2009) IEEE Trans. Cybern. , vol.39 , Issue.1 , pp. 64-84
- Gunes, H.¹ Piccardi, M.²

19
- 84863938861
- Multimodal emotion recognition in response to videos
- M. Soleymani, M. Pantic, and T. Pun, "Multimodal emotion recognition in response to videos," IEEE Trans. Affect. Comput., vol. 3, no. 2, pp. 211-223, Apr./Jun. 2012.
- (2012) IEEE Trans. Affect. Comput. , vol.3 , Issue.2 , pp. 211-223
- Soleymani, M.¹ Pantic, M.² Pun, T.³

20
- 84864127007
- Facial action recognition combining heterogeneous features via multikernel learning
- T. Senechal et al., "Facial action recognition combining heterogeneous features via multikernel learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 4, pp. 993-1005, Aug. 2012.
- (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.42 , Issue.4 , pp. 993-1005
- Senechal, T.¹

21
- 77956338010
- Bimodal emotion recognition using speech and physiological changes
- J. Kim, "Bimodal emotion recognition using speech and physiological changes," in Robust Speech Recognition and Understanding, M. Grimm and K. Kroschel, Eds. Vienna, Austria: I-Tech, 2007, pp. 268-280.
- (2007) Robust Speech Recognition and Understanding , pp. 268-280
- Kim, J.¹

22
- 84897564299
- Automatic detection of emotion valence on faces using consumer depth cameras
- A. Savran, R. Gur, and R. Verma, "Automatic detection of emotion valence on faces using consumer depth cameras," in Proc. IEEE ICCV Workshop Consum. Depth Cameras Comput. Vis., Sydney, NSW, Australia, 2013, pp. 75-82.
- (2013) Proc. IEEE ICCV Workshop Consum. Depth Cameras Comput. Vis., Sydney, NSW, Australia , pp. 75-82
- Savran, A.¹ Gur, R.² Verma, R.³

23
- 80052968042
- Comparative evaluation of 3D vs. 2D modality for automatic detection of facial action units
- A. Savran, B. Sankur, and M. T. Bilge, "Comparative evaluation of 3D vs. 2D modality for automatic detection of facial action units," Pattern Recognit., vol. 45, no. 2, pp. 767-782, 2012.
- (2012) Pattern Recognit. , vol.45 , Issue.2 , pp. 767-782
- Savran, A.¹ Sankur, B.² Bilge, M.T.³

24
- 84857911091
- Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels
- C.-H. Wu and W.-B. Liang, "Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels," IEEE Trans. Affect. Comput., vol. 2, no. 1, pp. 10-21, Jan./Jun. 2011.
- (2011) IEEE Trans. Affect. Comput. , vol.2 , Issue.1 , pp. 10-21
- Wu, C.-H.¹ Liang, W.-B.²

25
- 84883100945
- Multimodal affect recognition in learning environments
- A. Kapoor and R. W. Picard, "Multimodal affect recognition in learning environments," in Proc. ACM Multimedia, Singapore, 2005, pp. 677-682.
- (2005) Proc. ACM Multimedia, Singapore , pp. 677-682
- Kapoor, A.¹ Picard, R.W.²

26
- 79958694881
- String-based audiovisual fusion of behavioral events for the assessment of dimensional affect
- F. Eyben et al., "String-based audiovisual fusion of behavioral events for the assessment of dimensional affect," in Proc. IEEE Autom. Face Gesture Recognit. Workshop., Santa Barbara, CA, USA, 2011, pp. 322-329.
- (2011) Proc. IEEE Autom. Face Gesture Recognit. Workshop , pp. 322-329
- Eyben, F.¹

27
- 84870212389
- Step-wise emotion recognition using concatenated-HMM
- D. Ozkan, S. Scherer, and L.-P. Morency, "Step-wise emotion recognition using concatenated-HMM," in Proc. ACM Int. Conf. Multimodal Interact. (ICMI), Santa Monica, CA, USA, 2012, pp. 477-484.
- (2012) Proc. ACM Int. Conf. Multimodal Interact. (ICMI) , pp. 477-484
- Ozkan, D.¹ Scherer, S.² Morency, L.-P.³

28
- 84886418479
- LSTM-modeling of continuous emotions in an audiovisual affect recognition framework
- M. Wollmer, M. Kaiser, F. Eyben, B. Schuller, and G. Rigoll, "LSTM-modeling of continuous emotions in an audiovisual affect recognition framework," Image Vis. Comput., vol. 31, no. 2, pp. 153-163, 2013.
- (2013) Image Vis. Comput. , vol.31 , Issue.2 , pp. 153-163
- Wollmer, M.¹ Kaiser, M.² Eyben, F.³ Schuller, B.⁴ Rigoll, G.⁵

29
- 84939806160
- AVEC 2011, the audio/visual emotion challenge
- B. Schuller et al., "AVEC 2011, the audio/visual emotion challenge," in Proc. Int. Conf. Affect. Comput. Intell. Interact., Berlin, Germany, 2011, pp. 415-424.
- (2011) Proc. Int. Conf. Affect. Comput. Intell. Interact., Berlin, Germany , pp. 415-424
- Schuller, B.¹

30
- 84870213555
- A multimodal fuzzy inference system using a continuous facial expression representation for emotion detection
- C. Soladié, H. Salam, C. Pelachaud, N. Stoiber, and R. Séguier, "A multimodal fuzzy inference system using a continuous facial expression representation for emotion detection," in Proc. ACM Int. Conf. Multimodal Interact. (ICMI), Santa Monica, CA, USA, 2012, pp. 493-500.
- (2012) Proc. ACM Int. Conf. Multimodal Interact. (ICMI) , pp. 493-500
- Soladié, C.¹ Salam, H.² Pelachaud, C.³ Stoiber, N.⁴ Séguier, R.⁵

31
- 84870177354
- Combining video, audio and lexical indicators of affect in spontaneous conversation via particle filtering
- A. Savran, H. Cao, M. Shah, A. Nenkova, and R. Verma, "Combining video, audio and lexical indicators of affect in spontaneous conversation via particle filtering," in Proc. ACM Int. Conf. Multimodal Interact. (ICMI), New York, NY, USA, 2012, pp. 485-492.
- (2012) Proc. ACM Int. Conf. Multimodal Interact. (ICMI) , pp. 485-492
- Savran, A.¹ Cao, H.² Shah, M.³ Nenkova, A.⁴ Verma, R.⁵

32
- 84939859550
- (2012). AVEC 2012, 2nd International Audio/Visual Emotion Challenge and Workshop [Online]. Available: http://sspnet.eu/avec2012/
- (2012) AVEC 2012, 2nd International Audio/Visual Emotion Challenge and Workshop

33
- 84859899698
- The SEMAINE database: Annotated multimodal records of emotionally colored conversations between a person and a limited agent
- G. McKeown, M. Valstar, R. Cowie, M. Pantic, and M. Schroder, "The SEMAINE database: Annotated multimodal records of emotionally colored conversations between a person and a limited agent," IEEE Trans. Affect. Comput., vol. 3, no. 1, pp. 5-17, Jan./Mar. 2012.
- (2012) IEEE Trans. Affect. Comput. , vol.3 , Issue.1 , pp. 5-17
- McKeown, G.¹ Valstar, M.² Cowie, R.³ Pantic, M.⁴ Schroder, M.⁵

34
- 78650977476
- openSMILE: The Munich versatile and fast open-source audio feature extractor
- F. Eyben, M. Wöllmer, and B. Schuller, "openSMILE: The Munich versatile and fast open-source audio feature extractor," in Proc. Int. Conf. Multimedia, New York, NY, USA, 2010, pp. 1459-1462.
- (2010) Proc. Int. Conf. Multimedia, New York, NY, USA , pp. 1459-1462
- Eyben, F.¹ Wöllmer, M.² Schuller, B.³

35
- 14644439843
- Toward detecting emotions in spoken dialogs
- C. M. Lee and S. S. Narayanan, "Toward detecting emotions in spoken dialogs," IEEE Trans. Speech Audio Process., vol. 13, no. 2, pp. 293-303, Mar. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 293-303
- Lee, C.M.¹ Narayanan, S.S.²

36
- 4544316885
- Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture
- B. Schuller, G. Rigoll, and M. Lang, "Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture," in Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2004, pp. 577-580.
- (2004) Proc. IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP) , pp. 577-580
- Schuller, B.¹ Rigoll, G.² Lang, M.³

37
- 0012745713
- Desperately seeking emotions or: Actors, wizards, and human beings
- A. Batliner, K. Fischer, R. Huber, J. Spilker, and E. Nöth, "Desperately seeking emotions or: Actors, wizards, and human beings," in Proc. ISCA Workshop Speech Emot., Newcastle, U.K., 2000, pp. 195-200.
- (2000) Proc. ISCA Workshop Speech Emot., Newcastle, U.K. , pp. 195-200
- Batliner, A.¹ Fischer, K.² Huber, R.³ Spilker, J.⁴ Nöth, E.⁵

38
- 84939859551
- Acoustic and lexical representations for affect prediction in spontaneous conversations
- H. Cao, A. Savran, R. Verma, and A. Nenkova, "Acoustic and lexical representations for affect prediction in spontaneous conversations," Comput. Speech Lang., to be published.
- Comput. Speech Lang
- Cao, H.¹ Savran, A.² Verma, R.³ Nenkova, A.⁴

39
- 84867201207
- Balancing spoken content adaptation and unit length in the recognition of emotion and interest
- B. Vlasenko, B. Schuller, K. T. Mengistu, G. Rigoll, and A. Wendemuth, "Balancing spoken content adaptation and unit length in the recognition of emotion and interest," in Proc. Interspeech, 2008, pp. 805-808.
- (2008) Proc. Interspeech , pp. 805-808
- Vlasenko, B.¹ Schuller, B.² Mengistu, K.T.³ Rigoll, G.⁴ Wendemuth, A.⁵

40
- 85036258669
- Distribution of the estimators for autoregressive time series with a unit root
- D. A. Dickey and W. A. Fuller, "Distribution of the estimators for autoregressive time series with a unit root," J. Amer. Stat. Assoc., vol. 74, no. 1, pp. 427-431, 1979.
- (1979) J. Amer. Stat. Assoc. , vol.74 , Issue.1 , pp. 427-431
- Dickey, D.A.¹ Fuller, W.A.²

41
- 0003665481
- M. K. Pitt and N. Shephard, Sequential Monte Carlo Methods in Practice, N. de Freitas, A. Doucet, and N. J. Gordon, Eds. New York, NY, USA: Springer, 2001.
- (2001) Sequential Monte Carlo Methods in Practice
- Pitt, M.K.¹ Shephard, N.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.