Volume 2, Issue 1, 2012, Pages 1-29

A multitask approach to continuous five-dimensional affect sensing in natural speech

Author keywords

Audio features; Dimensional affect; Emotion recognition; Long short term memory; Neural networks; SEMAINE

Indexed keywords

BRAIN; CHEMICAL ACTIVATION; INTERACTIVE COMPUTER SYSTEMS; LEARNING SYSTEMS; LONG SHORT-TERM MEMORY; NEURAL NETWORKS; REAL TIME SYSTEMS; SUPPORT VECTOR REGRESSION;

EID: 84983561287     PISSN: 21606455     EISSN: 21606463     Source Type: Journal    
DOI: 10.1145/2133366.2133372     Document Type: Article
Times cited : (50)

References (73)
  • 1
    • 78349274056
    • Segmenting into adequate units for automatic recognition of emotion-related episodes: A speech-based approach
    • article ID 782802
    • BATLINER, A., SEPPI, D., STEIDL, S., AND SCHULLER, B. 2010. Segmenting into adequate units for automatic recognition of emotion-related episodes: A speech-based approach. Advan. Hum. Comput. Interact. article ID 782802.
    • (2010) Advan. Hum. Comput. Interact
    • Batliner, A.1    Seppi, D.2    Steidl, S.3    Schuller, B.4
  • 9
    • 21544459345
    • Challenges in real-life emotion annotation and machine learning based detection
    • DEVILLERS, L., VIDRASCU, L., AND LAMEL, L. 2005. Challenges in real-life emotion annotation and machine learning based detection. Neural Netw. 18, 4, 407-422.
    • (2005) Neural Netw. , vol.18 , Issue.4 , pp. 407-422
    • Devillers, L.1    Vidrascu, L.2    Lamel, L.3
  • 13
    • 77949304464
    • On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues
    • EYBEN, F., WÖLLMER, M., GRAVES, A., SCHULLER, B., DOUGLAS-COWIE, E., AND COWIE, R. 2010a. On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues. J. Multimodal User Interfaces 3, 1-2, 7-19.
    • (2010) J. Multimodal User Interfaces , vol.3 , Issue.1-2 , pp. 7-19
    • Eyben, F.1    Wöllmer, M.2    Graves, A.3    Schuller, B.4    Douglas-Cowie, E.5    Cowie, R.6
  • 14
    • 78650977476
    • openSMILE - The Munich versatile and fast open-source audio feature extractor
    • EYBEN, F., WÖLLMER, M., AND SCHULLER, B. 2010b. openSMILE - The Munich versatile and fast open-source audio feature extractor. In Proceedings of ACM Multimedia Conference. 1459-1462.
    • (2010) Proceedings of ACM Multimedia Conference , pp. 1459-1462
    • Eyben, F.1    Wöllmer, M.2    Schuller, B.3
  • 18
    • 21544458365
    • Emotion recognition in human-computer interaction
    • FRAGOPANAGOS, N. AND TAYLOR, J. G. 2005. Emotion recognition in human-computer interaction. Neural Netw. 18, 4, 389-405.
    • (2005) Neural Netw. , vol.18 , Issue.4 , pp. 389-405
    • Fragopanagos, N.1    Taylor, J.G.2
  • 20
    • 27744588611
    • Bidirectional LSTM networks for improved phoneme classification and recognition
    • GRAVES, A., FERNANDEZ, S., AND SCHMIDHUBER, J. 2005. Bidirectional LSTM networks for improved phoneme classification and recognition. In Proceedings of ICANN. Vol. 18. 602-610.
    • (2005) Proceedings of ICANN , vol.18 , pp. 602-610
    • Graves, A.1    Fernandez, S.2    Schmidhuber, J.3
  • 21
    • 27744588611
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • GRAVES, A. AND SCHMIDHUBER, J. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Netw. 18, 5-6, 602-610.
    • (2005) Neural Netw. , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 23
    • 34547940048
    • Primitives based estimation and evaluation of emotions in speech
    • GRIMM, M., MOWER, E., KROSCHEL, K., AND NARAYANAN, S. 2007b. Primitives based estimation and evaluation of emotions in speech. Speech Comm. 49, 787-800.
    • (2007) Speech Comm. , vol.49 , pp. 787-800
    • Grimm, M.1    Mower, E.2    Kroschel, K.3    Narayanan, S.4
  • 24
    • 78049394179
    • Automatic, dimensional and continuous emotion recognition
    • GUNES, H. AND PANTIC, M. 2010a. Automatic, dimensional and continuous emotion recognition. Int. J. Synth. Emot. 1, 1, 68-99.
    • (2010) Int. J. Synth. Emot. , vol.1 , Issue.1 , pp. 68-99
    • Gunes, H.1    Pantic, M.2
  • 25
    • 79958730446
    • Automatic measurement of affect in dimensional and continuous spaces: Why, what, and how?
    • GUNES, H. AND PANTIC, M. 2010b. Automatic measurement of affect in dimensional and continuous spaces: Why, what, and how? In Proceedings of the Conference on Measuring Behavior. 122-126.
    • (2010) Proceedings of the Conference on Measuring Behavior , pp. 122-126
    • Gunes, H.1    Pantic, M.2
  • 26
    • 78049368043
    • Dimensional emotion prediction from spontaneous head gestures for interaction with sensitive artificial listeners
    • GUNES, H. AND PANTIC, M. 2010c. Dimensional emotion prediction from spontaneous head gestures for interaction with sensitive artificial listeners. In Proceedings of International Conference on Intelligent Virtual Agents. 371-377.
    • (2010) Proceedings of International Conference on Intelligent Virtual Agents , pp. 371-377
    • Gunes, H.1    Pantic, M.2
  • 29
  • 31
    • 70450185596
    • Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions
    • LEE, C., BUSSO, C., LEE, S., AND NARAYANAN, S. 2009. Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions. In Proceedings of the Interspeech Conference. 1983-1986.
    • (2009) Proceedings of the Interspeech Conference , pp. 1983-1986
    • Lee, C.1    Busso, C.2    Lee, S.3    Narayanan, S.4
  • 32
    • 84983586176
    • Cost-effective solution to synchronised audio-visual data capture using multiple sensors
    • LICHTENAUER, J., SHEN, J., VALSTAR, M., AND PANTIC, M. 2010. Cost-effective solution to synchronised audio-visual data capture using multiple sensors. J. Vis. Comm. Image Represent., 1-39.
    • (2010) J. Vis. Comm. Image Represent. , pp. 1-39
    • Lichtenauer, J.1    Shen, J.2    Valstar, M.3    Pantic, M.4
  • 35
    • 85008006613
    • A framework for automatic human emotion classification using emotional profiles
    • MOWER, E., MATARIC, M. J., AND NARAYANAN, S. S. 2011. A framework for automatic human emotion classification using emotional profiles. IEEE Trans. Audio, Speech Lang. Process. 19, 5, 1057-1070.
    • (2011) IEEE Trans. Audio, Speech Lang. Process. , vol.19 , Issue.5 , pp. 1057-1070
    • Mower, E.1    Mataric, M.J.2    Narayanan, S.S.3
  • 38
    • 0038548330
    • The production and recognition of emotions in speech: Features and algorithms
    • OUDEYER, P. Y. 2003. The production and recognition of emotions in speech: Features and algorithms. Int. J. Hum.-Comput. Studies 59, 157-183.
    • (2003) Int. J. Hum.-Comput. Studies , vol.59 , pp. 157-183
    • Oudeyer, P.Y.1
  • 39
    • 0038764011
    • Kalman filters improve LSTM network performance in problems unsolvable by traditional recurrent nets
    • PÉREZ-ORTIZ, J. A., GERS, F. A., ECK, D., AND SCHMIDHUBER, J. 2003. Kalman filters improve LSTM network performance in problems unsolvable by traditional recurrent nets. Neural Netw. 16, 2, 241-250.
    • (2003) Neural Netw. , vol.16 , Issue.2 , pp. 241-250
    • Pérez-Ortiz, J.A.1    Gers, F.A.2    Eck, D.3    Schmidhuber, J.4
  • 40
    • 0036919726
    • Synthetic vision and memory for autonomous virtual humans
    • PETERS, C. AND O'SULLIVAN, C. 2002. Synthetic vision and memory for autonomous virtual humans. Comput. Graph. Forum 21, 4, 743-753.
    • (2002) Comput. Graph. Forum , vol.21 , Issue.4 , pp. 743-753
    • Peters, C.1    O'Sullivan, C.2
  • 63
    • 33750564952
    • Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition
    • VOGT, T. AND ANDRE, E. 2005. Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition. In Proceedings of the ICME Conference. 474-477.
    • (2005) Proceedings of the ICME Conference , pp. 474-477
    • Vogt, T.1    Andre, E.2
  • 64
    • 0025503558
    • Backpropagation through time: What it does and how to do it
    • WERBOS, P. 1990. Backpropagation through time: What it does and how to do it. Proc. IEEE 78, 1550-1560.
    • (1990) Proc. IEEE , vol.78 , pp. 1550-1560
    • Werbos, P.1
  • 67
    • 70450186589
    • Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks
    • WÖLLMER, M., EYBEN, F., SCHULLER, B., DOUGLAS-COWIE, E., AND COWIE, R. 2009. Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks. In Proceedings of Interspeech Conference. 1595-1598.
    • (2009) Proceedings of Interspeech Conference , pp. 1595-1598
    • Wöllmer, M.1    Eyben, F.2    Schuller, B.3    Douglas-Cowie, E.4    Cowie, R.5
  • 68
    • 79958734716
    • Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling
    • WÖLLMER, M., METALLINOU, A., EYBEN, F., SCHULLER, B., AND NARAYANAN, S. 2010a. Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling. In Proceedings of Interspeech Conference. 2362-2365.
    • (2010) Proceedings of Interspeech Conference , pp. 2362-2365
    • Wöllmer, M.1    Metallinou, A.2    Eyben, F.3    Schuller, B.4    Narayanan, S.5
  • 69
    • 77956721304
    • Combining long short-term memory and dynamic Bayesian networks for incremental emotion-sensitive artificial listening
    • WÖLLMER, M., SCHULLER, B., EYBEN, F., AND RIGOLL, G. 2010b. Combining long short-term memory and dynamic Bayesian networks for incremental emotion-sensitive artificial listening. IEEE J. Select. Topics Signal Process. 4, 5, 867-881.
    • (2010) IEEE J. Select. Topics Signal Process. , vol.4 , Issue.5 , pp. 867-881
    • Wöllmer, M.1    Schuller, B.2    Eyben, F.3    Rigoll, G.4
  • 73
    • 57149144228
    • A survey of affect recognition methods: Audio, visual, and spontaneous expressions
    • ZENG, Z., PANTIC, M., ROISMAN, G. I., AND HUANG, T. 2009. A survey of affect recognition methods: Audio, visual, and spontaneous expressions. IEEE Trans. Pattern Anal. Mach. Intell. 31, 1, 39-58.
    • (2009) IEEE Trans. Pattern Anal. Mach. Intell. , vol.31 , Issue.1 , pp. 39-58
    • Zeng, Z.1    Pantic, M.2    Roisman, G.I.3    Huang, T.4


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.