메뉴 건너뛰기




Volumn 3, Issue 1, 2010, Pages 49-66

Multimodal user's affective state analysis in naturalistic interaction

Author keywords

Affective computing; Emotion dynamics; Emotion recognition; Multimodal analysis; Recurrent neural network

Indexed keywords

AFFECTIVE COMPUTING; AFFECTIVE STATE; APPROXIMATION CAPABILITIES; AUDIO-VISUAL DATABASE; AUDIO-VISUAL MATERIAL; DIMENSIONAL REPRESENTATION; DYNAMIC EVENTS; EMOTION RECOGNITION; EMOTIONAL STATE; FACIAL EXPRESSIONS; HUMAN MACHINE INTERACTION; HUMAN-CENTERED COMPUTING; MULTI-MODAL; MULTIMODAL ANALYSIS; PROSODY INFORMATION; REAL WORLD SITUATIONS; RECOGNITION RATES; SHORT TERM MEMORY; VIDEO SEQUENCES;

EID: 77949305931     PISSN: 17837677     EISSN: 17838738     Source Type: Journal    
DOI: 10.1007/s12193-009-0030-8     Document Type: Article
Times cited : (35)

References (104)
  • 2
    • 11944264057 scopus 로고
    • Thin slices of expressive B predictors of interpersonal consequences: A meta-analysis
    • Ambady A, Rosenthal R (1992) Thin slices of expressive B predictors of interpersonal consequences: a meta-analysis. Psychol Bull 111(2): 256-274.
    • (1992) Psychol Bull , vol.111 , Issue.2 , pp. 256-274
    • Ambady, A.1    Rosenthal, R.2
  • 3
    • 85009145332 scopus 로고    scopus 로고
    • Prosody based automatic detection of annoyance and frustration in human computer dialog
    • Ang J, Dhilon R, Krupski A, Shriberg E, Stolcke A (2002) Prosody based automatic detection of annoyance and frustration in human computer dialog. In: Proc of ICSLP, pp 2037-2040.
    • (2002) Proc of ICSLP , pp. 2037-2040
    • Ang, J.1    Dhilon, R.2    Krupski, A.3    Shriberg, E.4    Stolcke, A.5
  • 8
    • 34047218363 scopus 로고    scopus 로고
    • Early feature stream integration versus decision level combination in a multiple classifier system for text line recognition
    • Bertolami R, Bunke H Early feature stream integration versus decision level combination in a multiple classifier system for text line recognition. In: 18th international conference on pattern recognition (ICPR'06).
    • 18th International Conference on Pattern Recognition (ICPR'06)
    • Bertolami, R.1    Bunke, H.2
  • 9
    • 14944351245 scopus 로고    scopus 로고
    • Analysis of emotion recognition using facial expressions, speech and multimodal information
    • Busso C et al. (2004) Analysis of emotion recognition using facial expressions, speech and multimodal information. In: Proc sixth ACM int'l conf multimodal interfaces (ICMI'04), pp 205-211.
    • (2004) Proc Sixth ACM Int'l Conf Multimodal Interfaces (ICMI'04) , pp. 205-211
    • Busso, C.1
  • 11
    • 0141860866 scopus 로고    scopus 로고
    • Facial expression recognition from video sequences: Temporal and static modeling
    • Cohen I, Sebe N, Garg A, Chen LS, Huang TS (2003) Facial expression recognition from video sequences: temporal and static modeling. Comput Vis Image Underst 91: 160-187.
    • (2003) Comput Vis Image Underst , vol.91 , pp. 160-187
    • Cohen, I.1    Sebe, N.2    Garg, A.3    Chen, L.S.4    Huang, T.S.5
  • 12
    • 77949309520 scopus 로고    scopus 로고
    • Multimodal interaction: A new focal area for AI
    • Cohen PR (2001) Multimodal interaction: a new focal area for AI. In: IJCAI, pp 1467-1473.
    • (2001) IJCAI , pp. 1467-1473
    • Cohen, P.R.1
  • 16
    • 0035363218 scopus 로고    scopus 로고
    • Active appearance models
    • Cootes T, Edwards G, Taylor C (2001) Active appearance models. IEEE PAMI 23(6): 681-685.
    • (2001) IEEE PAMI , vol.23 , Issue.6 , pp. 681-685
    • Cootes, T.1    Edwards, G.2    Taylor, C.3
  • 17
    • 0037382510 scopus 로고    scopus 로고
    • Describing the emotional states that are expressed in speech
    • Cowie R, Cornelius R (2003) Describing the emotional states that are expressed in speech. Speech Commun 40: 5-32.
    • (2003) Speech Commun , vol.40 , pp. 5-32
    • Cowie, R.1    Cornelius, R.2
  • 22
    • 77949307895 scopus 로고    scopus 로고
    • Real-life emotion recognition human-human call center data with acoustic and lexical cues
    • In: Müller C, Schötz S (eds), Springer, Berlin (to appear)
    • Devillers L, Vidrascu L (2007) Real-life emotion recognition human-human call center data with acoustic and lexical cues. In: Müller C, Schötz S (eds) Speaker characterization. Springer, Berlin (to appear).
    • (2007) Speaker Characterization
    • Devillers, L.1    Vidrascu, L.2
  • 23
    • 0043262133 scopus 로고    scopus 로고
    • Integrating perceptual and cognitive modeling for adaptive and intelligent human-computer interaction
    • Duric Z, Gray WD, Heishman R, Li F, Rosenfeld A, Schoelles MJ, Schunn C, Wechsler H (2002) Integrating perceptual and cognitive modeling for adaptive and intelligent human-computer interaction. In: Proc IEEE, vol 90(7), pp 1272-1289.
    • (2002) Proc IEEE , vol.90 , Issue.7 , pp. 1272-1289
    • Duric, Z.1    Gray, W.D.2    Heishman, R.3    Li, F.4    Rosenfeld, A.5    Schoelles, M.J.6    Schunn, C.7    Wechsler, H.8
  • 25
    • 33845516816 scopus 로고
    • Felt, false, and miserable smiles
    • Ekman P, Friesen WV (1982) Felt, false, and miserable smiles. J Nonverbal Behav, 6: 238-252.
    • (1982) J Nonverbal Behav , vol.6 , pp. 238-252
    • Ekman, P.1    Friesen, W.V.2
  • 27
    • 0027588084 scopus 로고
    • Facial expression and emotion
    • Ekman P (1993) Facial expression and emotion. Am Psychol 48: 384-392.
    • (1993) Am Psychol , vol.48 , pp. 384-392
    • Ekman, P.1
  • 28
    • 26444565569 scopus 로고
    • Finding structure in time
    • Elman JL (1990) Finding structure in time. Cogn Sci 14: 179-211.
    • (1990) Cogn Sci , vol.14 , pp. 179-211
    • Elman, J.L.1
  • 29
    • 0001419757 scopus 로고
    • Distributed representations, simple recurrent networks, and grammatical structure
    • Elman JL (1991) Distributed representations, simple recurrent networks, and grammatical structure. Mach Learn 7: 195-224.
    • (1991) Mach Learn , vol.7 , pp. 195-224
    • Elman, J.L.1
  • 30
    • 0031187271 scopus 로고    scopus 로고
    • Coding, analysis, interpretation, and recognition of facial expressions
    • Essa IA, Pentland AP (1997) Coding, analysis, interpretation, and recognition of facial expressions. IEEE Trans Pattern Anal Mach Intell 19(7): 757-763.
    • (1997) IEEE Trans Pattern Anal Mach Intell , vol.19 , Issue.7 , pp. 757-763
    • Essa, I.A.1    Pentland, A.P.2
  • 31
    • 0037209464 scopus 로고    scopus 로고
    • Automatic facial expression analysis: Survey
    • Fasel B, Luttin J (2003) Automatic facial expression analysis: survey. Pattern Recogn 36(1): 259-275.
    • (2003) Pattern Recogn , vol.36 , Issue.1 , pp. 259-275
    • Fasel, B.1    Luttin, J.2
  • 32
    • 21544458365 scopus 로고    scopus 로고
    • Emotion recognition in human computer interaction
    • Fragopanagos N, Taylor JG (2005) Emotion recognition in human computer interaction. Neural Netw 18: 389-405.
    • (2005) Neural Netw , vol.18 , pp. 389-405
    • Fragopanagos, N.1    Taylor, J.G.2
  • 33
    • 0003025321 scopus 로고
    • Not all smiles are created equal: Differences between enjoyment and other smiles
    • Frank MG, Ekman P (1993) Not all smiles are created equal: differences between enjoyment and other smiles. Humor: Int J Res Humor 6: 9-26.
    • (1993) Humor: Int J Res Humor , vol.6 , pp. 9-26
    • Frank, M.G.1    Ekman, P.2
  • 36
    • 0042326343 scopus 로고    scopus 로고
    • Recurrent neural networks with small weights implement definite memory machines
    • Hammer A, Tino P (2003) Recurrent neural networks with small weights implement definite memory machines. Neural Comput 15(8): 1897-1929.
    • (2003) Neural Comput , vol.15 , Issue.8 , pp. 1897-1929
    • Hammer, A.1    Tino, P.2
  • 38
    • 33646820042 scopus 로고    scopus 로고
    • Hoch S, Althoff F, McGlaun G, Rigoll G (2005) Bimodal fusion of emotional data in an automotive environment. In: Proc 30th int'l conf acoustics, speech, and signal processing (ICASSP '05), vol II, pp 1085-1088.
  • 39
    • 77949287892 scopus 로고    scopus 로고
    • http://emotion-research. net/toolbox/toolboxdatabase Humaine.
  • 40
    • 21544480590 scopus 로고    scopus 로고
    • Emotion recognition through facial expression analysis based on a neurofuzzy network
    • Special issue on emotion: understanding & recognition
    • Ioannou S, Raouzaiou A, Tzouvaras V, Mailis T, Karpouzis K, Kollias S (2005) Emotion recognition through facial expression analysis based on a neurofuzzy network. Neural Netw 18(4): 423-435. Special issue on emotion: understanding & recognition.
    • (2005) Neural Netw , vol.18 , Issue.4 , pp. 423-435
    • Ioannou, S.1    Raouzaiou, A.2    Tzouvaras, V.3    Mailis, T.4    Karpouzis, K.5    Kollias, S.6
  • 43
    • 31744446221 scopus 로고    scopus 로고
    • Human-centered multimedia: Culture, deployment, and access
    • Jaimes A (2006) Human-centered multimedia: culture, deployment, and access. IEEE Multimedia Mag 13(1).
    • (2006) IEEE Multimedia Mag , vol.13 , Issue.1
    • Jaimes, A.1
  • 44
    • 10044235774 scopus 로고    scopus 로고
    • Probabilistic combination of multiple modalities to detect interest
    • Kapoor A, Picard RW, Ivanov Y (2004) Probabilistic combination of multiple modalities to detect interest. In: Proc of IEEE ICPR.
    • (2004) Proc of IEEE ICPR
    • Kapoor, A.1    Picard, R.W.2    Ivanov, Y.3
  • 46
    • 49949087271 scopus 로고    scopus 로고
    • Modeling naturalistic affective states via facial, vocal, and bodily expressions recognition
    • Special Volume on AI for Human Computing, T. Huang, A. Nijholt, M. Pantic, and A. Pentland (Eds.), Berlin: Springer
    • Karpouzis K, Caridakis G, Kessous L, Amir N, Raouzaiou A, Malatesta L, Kollias S (2007) Modeling naturalistic affective states via facial, vocal, and bodily expressions recognition. In: Huang T, Nijholt A, Pantic M, Pentland A (eds) Lecture notes in artificial intelligence, vol 4451. Springer, Berlin. pp 91-112. Special Volume on AI for Human Computing.
    • (2007) Lecture Notes in Artificial Intelligence , vol.4451 , pp. 91-112
    • Karpouzis, K.1    Caridakis, G.2    Kessous, L.3    Amir, N.4    Raouzaiou, A.5    Malatesta, L.6    Kollias, S.7
  • 48
    • 1242263389 scopus 로고    scopus 로고
    • Lip image segmentation using fuzzy clustering incorporating an elliptic shape function
    • Leung SH, Wang SL, Lau WH (2004) Lip image segmentation using fuzzy clustering incorporating an elliptic shape function. IEEE Trans Image Process 13(1).
    • (2004) IEEE Trans Image Process , vol.13 , Issue.1
    • Leung, S.H.1    Wang, S.L.2    Lau, W.H.3
  • 53
    • 0001185920 scopus 로고
    • Communication without words
    • Mehrabian A (1968) Communication without words. Psychol. Today 2(4): 53-56.
    • (1968) Psychol. Today , vol.2 , Issue.4 , pp. 53-56
    • Mehrabian, A.1
  • 54
    • 34547224809 scopus 로고    scopus 로고
    • The prosogram: Semi-automatic transcription of prosody based on a tonal perception model
    • In: Bel B, Marlien I (eds), Japan
    • Mertens P (2004) The prosogram: semi-automatic transcription of prosody based on a tonal perception model. In: Bel B, Marlien I (eds) Proc of speech Prosody, Japan.
    • (2004) Proc of speech Prosody
    • Mertens, P.1
  • 56
    • 0038548330 scopus 로고    scopus 로고
    • The production and recognition of emotions in speech: Features and algorithms
    • Oudeyer PY (2003) The production and recognition of emotions in speech: features and algorithms. Int J Human-Comput Interact 59(1-2): 157-183.
    • (2003) Int J Human-Comput Interact , vol.59 , Issue.1-2 , pp. 157-183
    • Oudeyer, P.Y.1
  • 57
    • 0002126112 scopus 로고    scopus 로고
    • Ten myths of multimodal interaction
    • Oviatt S (1999) Ten myths of multimodal interaction. Commun ACM 42(11): 74-81.
    • (1999) Commun ACM , vol.42 , Issue.11 , pp. 74-81
    • Oviatt, S.1
  • 60
    • 33645009609 scopus 로고    scopus 로고
    • Dynamics of facial expression: Recognition of facial actions and their temporal segments from face profile image sequences
    • Pantic M, Patras I (2006) Dynamics of facial expression: recognition of facial actions and their temporal segments from face profile image sequences. IEEE Trans Syst Man Cybern, Part B 36(2): 433-449.
    • (2006) IEEE Trans Syst Man Cybern, Part B , vol.36 , Issue.2 , pp. 433-449
    • Pantic, M.1    Patras, I.2
  • 61
    • 2942590310 scopus 로고    scopus 로고
    • Towards an affect-sensitive multimodal human-computer interaction
    • Pantic M, Rothkrantz LJM (2003) Towards an affect-sensitive multimodal human-computer interaction. Proc IEEE 91(9): 1370-1390.
    • (2003) Proc IEEE , vol.91 , Issue.9 , pp. 1370-1390
    • Pantic, M.1    Rothkrantz, L.J.M.2
  • 63
    • 0034498178 scopus 로고    scopus 로고
    • Automatic analysis of facial expressions: The state of the art
    • Pantic M, Rothkrantz LJM (2000) Automatic analysis of facial expressions: the state of the art. IEEE Trans Pattern Anal Mach Intell 22(12): 1424-1445.
    • (2000) IEEE Trans Pattern Anal Mach Intell , vol.22 , Issue.12 , pp. 1424-1445
    • Pantic, M.1    Rothkrantz, L.J.M.2
  • 64
    • 2942590310 scopus 로고    scopus 로고
    • Toward an affect-sensitive multimodal human-computer interaction
    • Pantic M, Rothkrantz LJM (2003) Toward an affect-sensitive multimodal human-computer interaction. Proc IEEE 91(9): 1370-1390.
    • (2003) Proc IEEE , vol.91 , Issue.9 , pp. 1370-1390
    • Pantic, M.1    Rothkrantz, L.J.M.2
  • 65
    • 0034247225 scopus 로고    scopus 로고
    • Expert system for automatic analysis of facial expressions
    • Pantic M, Rothkrantz LJM (2000) Expert system for automatic analysis of facial expressions. Image Vis Comput 18: 881-905.
    • (2000) Image Vis Comput , vol.18 , pp. 881-905
    • Pantic, M.1    Rothkrantz, L.J.M.2
  • 66
    • 27944439029 scopus 로고    scopus 로고
    • Face for interface
    • M. Pagani (Ed.), Hershey: Idea Group Reference
    • Pantic M (2005) Face for interface. In: Pagani M (ed) The encyclopedia of multimedia technology and networking. Idea Group Reference, Hershey, vol 1, pp 308-314.
    • (2005) The Encyclopedia of Multimedia Technology and Networking , vol.1 , pp. 308-314
    • Pantic, M.1
  • 68
    • 17644419274 scopus 로고    scopus 로고
    • Socially aware computation and communication
    • Pentland A (2005) Socially aware computation and communication. Computer 38(3): 33-40.
    • (2005) Computer , vol.38 , Issue.3 , pp. 33-40
    • Pentland, A.1
  • 71
    • 0034513122 scopus 로고    scopus 로고
    • Towards computers that recognize and respond to user emotion
    • Picard RW (2000) Towards computers that recognize and respond to user emotion. IBM Syst J 39(3-4): 705-719.
    • (2000) IBM Syst J , vol.39 , Issue.3-4 , pp. 705-719
    • Picard, R.W.1
  • 72
    • 0002358797 scopus 로고    scopus 로고
    • Discriminative learning of visual data for audiovisual speech recognition
    • Rogozan A (1999) Discriminative learning of visual data for audiovisual speech recognition. Int J Artif Intell Tools 8: 43-52.
    • (1999) Int J Artif Intell Tools , vol.8 , pp. 43-52
    • Rogozan, A.1
  • 73
    • 0017712350 scopus 로고
    • Evidence for a three-factor theory of emotions
    • Russell JA, Mehrabian A (1977) Evidence for a three-factor theory of emotions. J Res Pers 11: 273-294.
    • (1977) J Res Pers , vol.11 , pp. 273-294
    • Russell, J.A.1    Mehrabian, A.2
  • 75
    • 0026755044 scopus 로고
    • Automatic recognition and analysis of human faces and facial expressions: A survey
    • Samal A, Iyengar PA (1992) Automatic recognition and analysis of human faces and facial expressions: a survey. Pattern Recogn 25(1): 65-77.
    • (1992) Pattern Recogn , vol.25 , Issue.1 , pp. 65-77
    • Samal, A.1    Iyengar, P.A.2
  • 76
    • 21544442336 scopus 로고    scopus 로고
    • A system approach to appraisal mechanisms in emotion
    • Sander D, Grandjean D, Scherer KR (2005) A system approach to appraisal mechanisms in emotion. Neural Netw 18: 317-352.
    • (2005) Neural Netw , vol.18 , pp. 317-352
    • Sander, D.1    Grandjean, D.2    Scherer, K.R.3
  • 77
    • 33749820013 scopus 로고    scopus 로고
    • Recurrent neural networks are universal approximators
    • Schaefer M, Zimmermann HG (2006) Recurrent neural networks are universal approximators, ICANN 2006, pp 632-640.
    • (2006) ICANN 2006 , pp. 632-640
    • Schaefer, M.1    Zimmermann, H.G.2
  • 78
    • 0002224746 scopus 로고    scopus 로고
    • Appraisal theory
    • T. Dalgleish and M. J. Power (Eds.), New York: Wiley
    • Scherer KR (1999) Appraisal theory. In: Dalgleish T, Power MJ (eds) Handbook of cognition and emotion, pp 637-663. Wiley, New York.
    • (1999) Handbook of Cognition and Emotion , pp. 637-663
    • Scherer, K.R.1
  • 79
    • 58149431345 scopus 로고
    • A scale for judgment of facial expressions
    • Schlosberg H (1954) A scale for judgment of facial expressions. J Exp Psychol 29: 497-510.
    • (1954) J Exp Psychol , vol.29 , pp. 497-510
    • Schlosberg, H.1
  • 81
    • 84968724649 scopus 로고    scopus 로고
    • Handbook of pattern recognition and computer vision, Singapore: World Scientific
    • Sebe N, Cohen I, Huang TS (2005) Multimodal emotion recognition. Handbook of pattern recognition and computer vision. World Scientific, Singapore.
    • (2005) Multimodal Emotion Recognition
    • Sebe, N.1    Cohen, I.2    Huang, T.S.3
  • 86
    • 0035250305 scopus 로고    scopus 로고
    • Recognizing action units for facial expression analysis
    • Tian YL, Kanade T, Cohn JF (2001) Recognizing action units for facial expression analysis. IEEE Trans PAMI 23(2).
    • (2001) IEEE Trans PAMI , vol.23 , Issue.2
    • Tian, Y.L.1    Kanade, T.2    Cohn, J.F.3
  • 87
    • 33646749463 scopus 로고    scopus 로고
    • Facial expression analysis
    • S. Z. Li and A. K. Jain (Eds.), Berlin: Springer
    • Tian YL, Kanade T, Cohn JF (2005) Facial expression analysis. In: Li SZ, Jain AK (eds) Handbook of face recognition, pp 247-276. Springer, Berlin.
    • (2005) Handbook of Face Recognition , pp. 247-276
    • Tian, Y.L.1    Kanade, T.2    Cohn, J.F.3
  • 92
    • 84911514327 scopus 로고
    • ELIZA-a computer program for the study of natural language communication between man and machine
    • Weizenbaum J (1966) ELIZA-a computer program for the study of natural language communication between man and machine. Commun ACM 9(1): 36-35.
    • (1966) Commun ACM , vol.9 , Issue.1 , pp. 36-35
    • Weizenbaum, J.1
  • 93
    • 0003367662 scopus 로고
    • The dictionary of affect in language
    • R. Plutchnik and H. Kellerman (Eds.), New York: Academic Press
    • Whissel CM (1989) The dictionary of affect in language. In: Plutchnik R, Kellerman H (eds) Emotion: theory, research and experience: the measurement of emotions. Academic Press, New York, vol 4, pp 113-131.
    • (1989) Emotion: Theory, Research and Experience: The Measurement of Emotions , vol.4 , pp. 113-131
    • Whissel, C.M.1
  • 94
    • 0037674537 scopus 로고    scopus 로고
    • Facial expression of pain: An evolutionary account
    • Williams A (2002) Facial expression of pain: an evolutionary account. Behav Brain Sci 25(4): 439-488.
    • (2002) Behav Brain Sci , vol.25 , Issue.4 , pp. 439-488
    • Williams, A.1
  • 99
    • 34547223416 scopus 로고    scopus 로고
    • Training combination strategy of multi-stream fused hidden Markov model for audio-visual affect recognition
    • Zeng Z, Hu Y, Liu M, Fu Y, Huang TS (2006) Training combination strategy of multi-stream fused hidden Markov model for audio-visual affect recognition. In: Proc 14th ACM int'l conf multimedia (Multimedia'06), pp 65-68.
    • (2006) Proc 14th ACM Int'l Conf Multimedia (Multimedia'06) , pp. 65-68
    • Zeng, Z.1    Hu, Y.2    Liu, M.3    Fu, Y.4    Huang, T.S.5
  • 100
    • 49949108559 scopus 로고    scopus 로고
    • Audio-visual spontaneous emotion recognition
    • T. S. Huang, A. Nijholt, M. Pantic, and A. Pentland (Eds.), Berlin: Springer
    • Zeng Z, Hu Y, Roisman GI, Wen Z, Fu Y, Huang TS (2007) Audio-visual spontaneous emotion recognition. In: Huang TS, Nijholt A, Pantic M, Pentland A (eds) Artificial intelligence for human computing, pp 72-90. Springer, Berlin.
    • (2007) Artificial Intelligence for Human Computing , pp. 72-90
    • Zeng, Z.1    Hu, Y.2    Roisman, G.I.3    Wen, Z.4    Fu, Y.5    Huang, T.S.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.