메뉴 건너뛰기




Volumn 53, Issue 9-10, 2011, Pages 1062-1087

Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge

Author keywords

Adaptation; Affect; Automatic classification; Emotion; Evaluation; Feature selection; Feature types; Noise robustness; Standardisation; Usability

Indexed keywords

ADAPTATION; AFFECT; AUTOMATIC CLASSIFICATION; EMOTION; EVALUATION; FEATURE TYPES; NOISE ROBUSTNESS; USABILITY;

EID: 79960846940     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2011.01.011     Document Type: Article
Times cited : (600)

References (252)
  • 1
    • 44949128361 scopus 로고    scopus 로고
    • Using system and user performance features to improve emotion detection in spoken tutoring dialogs
    • Pittsburgh, PA, USA
    • Ai, H.; Litman, D.; Forbes-Riley, K.; Rotaru, M.; Tetreault, J.; Purandare, A.; 2006. Using system and user performance features to improve emotion detection in spoken tutoring dialogs. In: Proc. Interspeech, Pittsburgh, PA, USA, pp. 797-800.
    • (2006) Proc. Interspeech , pp. 797-800
    • Ai, H.1    Litman, D.2    Forbes-Riley, K.3    Rotaru, M.4    Tetreault, J.5    Purandare, A.6
  • 2
    • 33947625143 scopus 로고    scopus 로고
    • Reduced complexity and scaling for asynchronous HMMs in a bimodal input fusion application
    • Toulouse, France
    • Al-Hames, M.; Rigoll, G.; 2006. Reduced complexity and scaling for asynchronous HMMs in a bimodal input fusion application. In: Proc. ICASSP, Toulouse, France, pp. 757-760.
    • (2006) Proc. ICASSP , pp. 757-760
    • Al-Hames, M.1    Rigoll, G.2
  • 3
    • 60249092335 scopus 로고    scopus 로고
    • Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection
    • H. Altun, and G. Polata Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection Expert Systems Appl. 36 4 2009 8197 8203
    • (2009) Expert Systems Appl. , vol.36 , Issue.4 , pp. 8197-8203
    • Altun, H.1    Polata, G.2
  • 4
    • 85009145332 scopus 로고    scopus 로고
    • Prosody-based automatic detection of annoyance and frustration in human-computer dialog
    • Denver, CO, USA
    • Ang, J.; Dhillon, R.; Shriberg, E.; Stolcke, A.; 2002. Prosody-based automatic detection of annoyance and frustration in human-computer dialog. In: Proc. Interspeech, Denver, CO, USA, pp. 2037-2040.
    • (2002) Proc. Interspeech , pp. 2037-2040
    • Ang, J.1    Dhillon, R.2    Shriberg, E.3    Stolcke, A.4
  • 5
    • 34249316820 scopus 로고    scopus 로고
    • Significance tests harm progress in forecasting
    • J. Armstrong Significance tests harm progress in forecasting Internat. J. Forecast. 23 2007 321 327
    • (2007) Internat. J. Forecast. , vol.23 , pp. 321-327
    • Armstrong, J.1
  • 6
    • 85009069271 scopus 로고    scopus 로고
    • Politeness and frustration language in child-machine interactions
    • Aalborg, Denmark
    • Arunachalam, S.; Gould, D.; Anderson, E.; Byrd, D.; Narayanan, S.; 2001. Politeness and frustration language in child-machine interactions. In: Proc. Eurospeech, Aalborg, Denmark, pp. 2675-2678.
    • (2001) Proc. Eurospeech , pp. 2675-2678
    • Arunachalam, S.1    Gould, D.2    Anderson, E.3    Byrd, D.4    Narayanan, S.5
  • 7
    • 0015112070 scopus 로고
    • Speech analysis and synthesis by linear prediction of the speech wave
    • B. Atal, and S.L. Hanauer Speech analysis and synthesis by linear prediction of the speech wave J. Acoust. Soc. Amer. 50 1971 637 655
    • (1971) J. Acoust. Soc. Amer. , vol.50 , pp. 637-655
    • Atal, B.1    Hanauer, S.L.2
  • 8
    • 21544466181 scopus 로고    scopus 로고
    • ASR for emotional speech: Clarifying the issues and enhancing performance
    • DOI 10.1016/j.neunet.2005.03.008, PII S0893608005000419, Emotion and Brain
    • T. Athanaselis, S. Bakamidis, I. Dologlu, R. Cowie, E. Douglas-Cowie, and C. Cox ASR for emotional speech: clarifying the issues and enhancing performance Neural Networks 18 2005 437 444 (Pubitemid 40922650)
    • (2005) Neural Networks , vol.18 , Issue.4 , pp. 437-444
    • Athanaselis, T.1    Bakamidis, S.2    Dologlou, I.3    Cowie, R.4    Douglas-Cowie, E.5    Cox, C.6
  • 9
    • 34547535917 scopus 로고    scopus 로고
    • Speech emotion recognition using gaussian mixture vector autoregressive models
    • Honolulu, HY
    • Ayadi, M.M.H.E.; Kamel, M.S.; Karray, F.; 2007. Speech emotion recognition using gaussian mixture vector autoregressive models. In: Proc. ICASSP, Honolulu, HY, pp. 957-960.
    • (2007) Proc. ICASSP , pp. 957-960
    • Ayadi, M.M.H.E.1    Kamel, M.S.2    Karray, F.3
  • 14
    • 84938335075 scopus 로고    scopus 로고
    • Boiling down prosody for the classification of boundaries and accents in German and English
    • Aalborg, Denmark
    • Batliner, A.; Buckow, J.; Huber, R.; Warnke, V.; Nöth, E.; Niemann, H.; 2001. Boiling down prosody for the classification of boundaries and accents in German and English. In: Proc. Eurospeech, Aalborg, Denmark, pp. 2781-2784.
    • (2001) Proc. Eurospeech , pp. 2781-2784
    • Batliner, A.1    Buckow, J.2    Huber, R.3    Warnke, V.4    Nöth, E.5    Niemann, H.6
  • 16
    • 78349297410 scopus 로고    scopus 로고
    • We are not amused - But how do you know? User states in a multi-modal dialogue system
    • Geneva, Switzerland
    • Batliner, A.; Zeissler, V.; Frank, C.; Adelhardt, J.; Shi, R.P.; Nöth, E.; 2003b. We are not amused - but how do you know? User states in a multi-modal dialogue system. In: Proc. Interspeech, Geneva, Switzerland, pp. 733-736.
    • (2003) Proc. Interspeech , pp. 733-736
    • Batliner, A.1    Zeissler, V.2    Frank, C.3    Adelhardt, J.4    Shi . R, P.5    Nöth, E.6
  • 19
    • 36248967391 scopus 로고    scopus 로고
    • A taxonomy of applications that utilize emotional awareness
    • Ljubliana, Slovenia
    • Batliner, A.; Burkhardt, F.; van Ballegooy, M.; Nöth, E.; 2006a. A taxonomy of applications that utilize emotional awareness. In: Proc. IS-LTC 2006, Ljubliana, Slovenia, pp. 246-250.
    • (2006) Proc. IS-LTC 2006 , pp. 246-250
    • Batliner, A.1    Burkhardt, F.2    Van Ballegooy, M.3    Nöth, E.4
  • 22
    • 51449094634 scopus 로고    scopus 로고
    • Mothers, adults, children, pets - Towards the acoustics of intimacy
    • Las Vegas, NV
    • Batliner, A.; Schuller, B.; Schaeffler, S.; Steidl, S.; 2008a. Mothers, adults, children, pets - towards the acoustics of intimacy. In: Proc. ICASSP 2008, Las Vegas, NV, pp. 4497-4500.
    • (2008) Proc. ICASSP 2008 , pp. 4497-4500
    • Batliner, A.1    Schuller, B.2    Schaeffler, S.3    Steidl, S.4
  • 23
    • 38749108114 scopus 로고    scopus 로고
    • Private emotions versus social interaction: A data-driven approach towards analysing emotion in speech
    • DOI 10.1007/s11257-007-9039-4, Special Issue on Affective Modeling and Adaptation
    • A. Batliner, S. Steidl, C. Hacker, and E. Nöth Private emotions vs. social interaction - a data-driven approach towards analysing emotions in speech User Model. User-Adapted Interact. 18 2008 175 206 (Pubitemid 351185636)
    • (2008) User Modelling and User-Adapted Interaction , vol.18 , Issue.1-2 , pp. 175-206
    • Batliner, A.1    Steidl, S.2    Hacker, C.3    Noth, E.4
  • 24
    • 78349274056 scopus 로고    scopus 로고
    • Segmenting into adequate units for automatic recognition of emotion-related episodes: A speech-based approach
    • Article ID 782802
    • Batliner, A.; Seppi, D.; Steidl, S.; Schuller, B.; 2010. Segmenting into adequate units for automatic recognition of emotion-related episodes: a speech-based approach. Advances in Human-Computer Interaction, Vol. 2010. Article ID 782802, 15 pages.
    • (2010) Advances in Human-Computer Interaction , pp. 15
    • Batliner, A.1    Seppi, D.2    Steidl, S.3    Schuller, B.4
  • 27
    • 84898971246 scopus 로고    scopus 로고
    • An asynchronous hidden markov model for audio-visual speech recognition
    • Bengio, S.; 2003. An asynchronous hidden markov model for audio-visual speech recognition. Advances in NIPS 15.
    • (2003) Advances in NIPS , vol.15
    • Bengio, S.1
  • 30
    • 0001835850 scopus 로고
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • P. Boersma Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound Proc. Inst. Phonetic Sci. (Univ. Amsterdam) 17 1993 97 110
    • (1993) Proc. Inst. Phonetic Sci. (Univ. Amsterdam) , vol.17 , pp. 97-110
    • Boersma, P.1
  • 32
    • 0002161311 scopus 로고
    • The quefrency analysis of time series for echoes: Cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking
    • Rosenblatt, M. (Ed.) John Wiley & Sons, New York
    • Bogert, B.; Healy, M.; Tukey, J.; 1963. The quefrency analysis of time series for echoes: cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking. In: Rosenblatt, M. (Ed.), Symposium on Time Series Analysis. John Wiley & Sons, New York, pp. 209-243.
    • (1963) Symposium on Time Series Analysis , pp. 209-243
    • Bogert, B.1    Healy, M.2    Tukey, J.3
  • 33
    • 70450177656 scopus 로고    scopus 로고
    • Improving automatic emotion recognition from speech signals
    • Brighton
    • Bozkurt, E.; Erzin, E.; Erdem,.E.; Erdem, A.T.; 2009. Improving automatic emotion recognition from speech signals. In: Proc. Interspeech, Brighton, pp. 324-327.
    • (2009) Proc. Interspeech , pp. 324-327
    • Bozkurt, E.1    Erzin, E.2    Erdem, E.3    Erdem, A.T.4
  • 34
    • 17144380230 scopus 로고    scopus 로고
    • Modeling emotional state and personality for conversational agents
    • Microsoft
    • Breese, J.; Ball, G.; 1998. Modeling emotional state and personality for conversational agents. Technical Report MS-TR-98-41, Microsoft.
    • (1998) Technical Report MS-TR-98-41
    • Breese, J.1    Ball, G.2
  • 35
    • 79959824353 scopus 로고    scopus 로고
    • Towards measuring similarity between emotional corpora
    • Workshop on EMOTION (satellite of LREC): Corpora for Research on Emotion and Affect, Valetta
    • Brendel, M.; Zaccarelli, R.; Schuller, B.; Devillers, L.; 2010. Towards measuring similarity between emotional corpora. In: Proc. 3rd ELR0A Internat. Workshop on EMOTION (satellite of LREC): Corpora for Research on Emotion and Affect, Valetta, pp. 58-64.
    • (2010) Proc. 3rd ELR0A Internat , pp. 58-64
    • Brendel, M.1    Zaccarelli, R.2    Schuller, B.3    Devillers, L.4
  • 37
    • 77949401701 scopus 로고    scopus 로고
    • Emotion detection in dialog systems: Applications, strategies and challenges
    • Amsterdam, Netherlands
    • Burkhardt, F.; van Ballegooy, M.; Engelbrecht, K.-P.; Polzehl, T.; Stegmann, J.; 2009. Emotion detection in dialog systems: applications, strategies and challenges. In: Proc. ACII, Amsterdam, Netherlands, pp. 1-6.
    • (2009) Proc. ACII , pp. 1-6
    • Burkhardt, F.1    Van Ballegooy, M.2    Engelbrecht, K.-P.3    Polzehl, T.4    Stegmann, J.5
  • 42
    • 0036214787 scopus 로고    scopus 로고
    • Yin: A fundamental frequency estimator for speech and music
    • A.D. Cheveigne, and H. Kawahara Yin: a fundamental frequency estimator for speech and music J. Acoust. Soc. Amer. 111 4 2002 1917 1930
    • (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4 , pp. 1917-1930
    • Cheveigne, A.D.1    Kawahara, H.2
  • 47
    • 21544457223 scopus 로고    scopus 로고
    • Beyond emotion archetypes: Databases for emotion modelling using neural networks
    • R. Cowie, E. Douglas-Cowie, and C. Cox Beyond emotion archetypes: databases for emotion modelling using neural networks Neural Networks 18 2005 3388
    • (2005) Neural Networks , vol.18 , pp. 3388
    • Cowie, R.1    Douglas-Cowie, E.2    Cox, C.3
  • 49
    • 0025482241 scopus 로고
    • Wavelet transform, time-frequency localization and signal analysis
    • DOI 10.1109/18.57199
    • I. Daubechies The wavelet transform, time-frequency localization and signal analysis TransIT 36 5 1990 961 1005 (Pubitemid 20738359)
    • (1990) IEEE Transactions on Information Theory , vol.36 , Issue.5 , pp. 961-1005
    • Daubechies Ingrid1
  • 50
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • S. Davis, and P. Mermelstein Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences IEEE Trans. Acoust. Speech Signal Process. 29 1980 917 919
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.29 , pp. 917-919
    • Davis, S.1    Mermelstein, P.2
  • 51
    • 0034040864 scopus 로고    scopus 로고
    • The perception of emotions by ear and eye
    • B. de Gelder, and J. Vroomen The perception of emotions by ear and eye Cognition Emotion 14 3 2000 289 311
    • (2000) Cognition Emotion , vol.14 , Issue.3 , pp. 289-311
    • De Gelder, B.1    Vroomen, J.2
  • 52
    • 0344177597 scopus 로고    scopus 로고
    • The combined perception of emotion from voice and face: Early interaction revealed by human electric brain responses
    • DOI 10.1016/S0304-3940(98)00963-X, PII S030439409800963X
    • B. de Gelder, K.B.E. Böcker, J. Tuomainen, M. Hensen, and J. Vroomen The combined perception of emotion from voice and face: early interaction revealed by human electric brain responses Neurosci. Lett. 260 2 1999 133 136 (Pubitemid 29066883)
    • (1999) Neuroscience Letters , vol.260 , Issue.2 , pp. 133-136
    • De Gelder, B.1    Bocker, K.B.E.2    Tuomainen, J.3    Hensen, M.4    Vroomen, J.5
  • 53
    • 0030353343 scopus 로고    scopus 로고
    • Recognizing emotion in speech
    • Philadelphia, PA, USA
    • Dellaert, F.; Polzin, T.; Waibel, A.; 1996. Recognizing emotion in speech. In: Proc. ICSLP, Philadelphia, PA, USA, pp. 1970-1973.
    • (1996) Proc. ICSLP , pp. 1970-1973
    • Dellaert, F.1    Polzin, T.2    Waibel, A.3
  • 55
    • 36249013993 scopus 로고    scopus 로고
    • Real-life emotion recognition in speech
    • C. Müller, Lecture Notes in Computer Science Springer Berlin/Heidelberg
    • L. Devillers, and L. Vidrascu Real-life emotion recognition in speech C. Müller, Speaker Classification II Lecture Notes in Computer Science Vol. 4441/2007 2007 Springer Berlin/Heidelberg 34 42
    • (2007) Speaker Classification II , pp. 34-42
    • Devillers, L.1    Vidrascu, L.2
  • 57
    • 21544459345 scopus 로고    scopus 로고
    • Challenges in real-life emotion annotation and machine learning based detection
    • DOI 10.1016/j.neunet.2005.03.007, PII S0893608005000407, Emotion and Brain
    • L. Devillers, L. Vidrascu, and L. Lamel Challenges in real-life emotion annotation and machine learning based detection Neural Networks 18 2005 407 422 (Pubitemid 40922648)
    • (2005) Neural Networks , vol.18 , Issue.4 , pp. 407-422
    • Devillers, L.1    Vidrascu, L.2    Lamel, L.3
  • 61
    • 85131413217 scopus 로고    scopus 로고
    • Design, recording and verification of a Danish emotional speech database
    • Rhodes, Greece
    • Engberg, I.S.; Hansen, A.V.; Andersen, O.; Dalsgaard, P.; 1997. Design, recording and verification of a Danish emotional speech database. In: Proc. Eurospeech, Rhodes, Greece, pp. 1695-1698.
    • (1997) Proc. Eurospeech , pp. 1695-1698
    • Engberg . I, S.1    Hansen . A, V.2    Andersen, O.3    Dalsgaard, P.4
  • 62
    • 33644698701 scopus 로고    scopus 로고
    • Exploratory study of some acoustic and articulatory characteristics of sad speech
    • DOI 10.1159/000091404
    • D. Erickson, K. Yoshida, C. Menezes, A. Fujino, T. Mochida, and Y. Shibuya Exploratory study of some acoustic and articulatory characteristics of sad speech Phonetica 63 2004 1 25 (Pubitemid 43333525)
    • (2006) Phonetica , vol.63 , Issue.1 , pp. 1-25
    • Erickson, D.1    Yoshida, K.2    Menezes, C.3    Fujino, A.4    Mochida, T.5    Shibuya, Y.6
  • 63
    • 77949415384 scopus 로고    scopus 로고
    • OpenEAR - Introducing the Munich Open-Source Emotion and Affect Recognition Toolkit
    • Amsterdam, Netherlands
    • Eyben, F.; Wöllmer, M.; Schuller, B.; 2009. openEAR - Introducing the Munich Open-Source Emotion and Affect Recognition Toolkit. In: Proc. ACII, Amsterdam, Netherlands, pp. 576-581.
    • (2009) Proc. ACII , pp. 576-581
    • Eyben, F.1    Wöllmer, M.2    Schuller, B.3
  • 65
    • 77949304464 scopus 로고    scopus 로고
    • On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues
    • Special Issue on "Real-Time Affect Analysis and Interpretation: Closing the Affective Loop in Virtual Agents and Robots"
    • F. Eyben, M. Wöllmer, A. Graves, B. Schuller, E. Douglas-Cowie, and R. Cowie On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues J. Multimodal User Interfaces 3 1-2 2010 7 12 Special Issue on "Real-Time Affect Analysis and Interpretation: Closing the Affective Loop in Virtual Agents and Robots"
    • (2010) J. Multimodal User Interfaces , vol.3 , Issue.12 , pp. 7-12
    • Eyben, F.1    Wöllmer, M.2    Graves, A.3    Schuller, B.4    Douglas-Cowie, E.5    Cowie, R.6
  • 66
    • 78650977476 scopus 로고    scopus 로고
    • OpenSMILE - The munich versatile and fast open-source audio feature Extractor
    • Florence, Italy
    • Eyben, F.; Wöllmer, M.; Schuller, B.; 2010c. openSMILE - the munich versatile and fast open-source audio feature Extractor. In: Proc. ACM Multimedia, Florence, Italy, pp. 1459-1462.
    • (2010) Proc. ACM Multimedia , pp. 1459-1462
    • Eyben, F.1    Wöllmer, M.2    Schuller, B.3
  • 67
    • 0009890860 scopus 로고
    • The concept of statistical significance and the controversy about one-tailed tests
    • H. Eysenck The concept of statistical significance and the controversy about one-tailed tests Psychol. Rev. 67 1960 269 271
    • (1960) Psychol. Rev. , vol.67 , pp. 269-271
    • Eysenck, H.1
  • 69
    • 0001600718 scopus 로고
    • Concept of emotion viewed from a prototype perspective
    • B. Fehr, and J.A. Russel Concept of emotion viewed from a prototype perspective J. Exp. Psychol.: Gen. 113 1984 464 486
    • (1984) J. Exp. Psychol.: Gen. , vol.113 , pp. 464-486
    • Fehr, B.1    Russel, J.A.2
  • 71
    • 70449109467 scopus 로고    scopus 로고
    • An effect size primer: A guide for clinicians and researchers
    • C.J. Ferguson An effect size primer: a guide for clinicians and researchers Prof. Psychol.: Res. Practice 40 2009 532 538
    • (2009) Prof. Psychol.: Res. Practice , vol.40 , pp. 532-538
    • Ferguson, C.J.1
  • 72
    • 0037382608 scopus 로고    scopus 로고
    • Modeling drivers' speech under stress
    • R. Fernandez, and R.W. Picard Modeling drivers' speech under stress Speech Comm. 40 2003 145 159
    • (2003) Speech Comm. , vol.40 , pp. 145-159
    • Fernandez, R.1    Picard, R.W.2
  • 73
    • 0013944260 scopus 로고
    • Memory for gist: Some relevant variables
    • S. Fillenbaum Memory for gist: some relevant variables Lang. Speech 9 1966 217 227
    • (1966) Lang. Speech , vol.9 , pp. 217-227
    • Fillenbaum, S.1
  • 74
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    • Santa Barbara, CA, USA
    • Fiscus, J.; 1997. A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER). In: Proc. ASRU, Santa Barbara, CA, USA, pp. 347-352.
    • (1997) Proc. ASRU , pp. 347-352
    • Fiscus, J.1
  • 75
    • 33645066726 scopus 로고
    • Large sample standard errors of kappa and weighted kappa
    • J. Fleiss, J. Cohen, and B. Everitt Large sample standard errors of kappa and weighted kappa Psychol. Bull. 72 5 1969 323 327
    • (1969) Psychol. Bull. , vol.72 , Issue.5 , pp. 323-327
    • Fleiss, J.1    Cohen, J.2    Everitt, B.3
  • 77
    • 0000989095 scopus 로고
    • Communicating emotion: The role of prosodic features
    • R. Frick Communicating emotion: the role of prosodic features Psychol. Bull. 97 1985 412 429
    • (1985) Psychol. Bull. , vol.97 , pp. 412-429
    • Frick, R.1
  • 80
    • 10144262549 scopus 로고    scopus 로고
    • Mindless statistics
    • DOI 10.1016/j.socec.2004.09.033, PII S1053535704000927, Statistical Significance
    • G. Gigerenzer Mindless statistics J. Socio-Econ. 33 2004 587 606 (Pubitemid 39615265)
    • (2004) Journal of Socio-Economics , vol.33 , Issue.5 , pp. 587-606
    • Gigerenzer, G.1
  • 83
  • 85
  • 87
    • 85089273681 scopus 로고    scopus 로고
    • Getting started with susas: A speech under simulated and actual stress database
    • Rhodes, Greece
    • Hansen, J.; Bou-Ghazale, S.; 1997. Getting started with susas: a speech under simulated and actual stress database. In: Proc. EUROSPEECH-97, Vol. 4, Rhodes, Greece, pp. 1743-1746.
    • (1997) Proc. EUROSPEECH-97 , vol.4 , pp. 1743-1746
    • Hansen, J.1    Bou-Ghazale, S.2
  • 88
    • 0003938958 scopus 로고
    • The Groundwork of Cognition University Press Cambridge
    • S. Harnad Categorical Perception The Groundwork of Cognition 1987 University Press Cambridge
    • (1987) Categorical Perception
    • Harnad, S.1
  • 89
    • 0025041264 scopus 로고
    • Perceptual linear predictive (PLP) analysis of speech
    • DOI 10.1121/1.399423
    • H. Hermansky Perceptual linear predictive (plp) analysis for speech J. Acoust. Soc. Amer. (JASA) 87 1990 1738 1752 (Pubitemid 20256470)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.4 , pp. 1738-1752
    • Hermansky, H.1
  • 92
    • 33947620674 scopus 로고    scopus 로고
    • A pitch detection algorithm based on amdf and acf
    • Toulouse, France
    • Hui, L.; Dai, B.-Q.; Wei, L.; 2006. A pitch detection algorithm based on amdf and acf. In: Proc. ICASSP, Toulouse, France, p. I.
    • (2006) Proc. ICASSP , pp. 1
    • Hui, L.1    Dai, B.-Q.2    Wei, L.3
  • 95
    • 84957069814 scopus 로고    scopus 로고
    • Text Categorization with Support Vector Machines: Learning with many Relevant Features
    • Machine Learning: ECML-98
    • Joachims, T.; 1998. Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C.; Rouveirol, C. (Eds.), Proc. ECML-98, 10th European Conf. on Machine Learning. Springer, Heidelberg, Chemnitz, Germany, pp. 137-142. (Pubitemid 128067178)
    • (1998) Proc. ECML-98, 10th European Conf. on Machine Learning , Issue.1398 , pp. 137-142
    • Joachims, T.1
  • 96
    • 0000191372 scopus 로고    scopus 로고
    • Vocal communication of emotion
    • M. Lewis, J.M. Haviland-Jones, Guilford Press New York, London second ed. (Chapter 14)
    • T. Johnstone, and K.R. Scherer Vocal communication of emotion M. Lewis, J.M. Haviland-Jones, Handbook of Emotions 2000 Guilford Press New York, London 220 235 second ed. (Chapter 14)
    • (2000) Handbook of Emotions , pp. 220-235
    • Johnstone, T.1    Scherer, K.R.2
  • 98
    • 48749113415 scopus 로고    scopus 로고
    • Human emotion recognition system using optimally designed SVM with different facial feature extraction techniques
    • G.U. Kharat, and S.V. Dudul Human emotion recognition system using optimally designed SVM with different facial feature extraction techniques WSEAS Trans. Comput. 7 6 2008
    • (2008) WSEAS Trans. Comput. , vol.7 , Issue.6
    • Kharat, G.U.1    Dudul, S.V.2
  • 99
    • 4243783982 scopus 로고    scopus 로고
    • Extraktion und Klassifikation prosodischer Merkmale in der automatischen Sprachverarbeitung
    • Shaker, Aachen, Germany
    • Kießling, A.; 1997. Extraktion und Klassifikation prosodischer Merkmale in der automatischen Sprachverarbeitung. Berichte aus der Informatik. Shaker, Aachen, Germany.
    • (1997) Berichte Aus der Informatik
    • Kießling, A.1
  • 101
    • 2942731250 scopus 로고    scopus 로고
    • Emotion recognition system using short-term monitoring of physiological signals
    • DOI 10.1007/BF02344719
    • K.H. Kim, S.W. Bang, and S.R. Kim Emotion recognition system using short-term monitoring of physiological signals Medical Biological Eng. Comput. 42 3 2004 419 427 (Pubitemid 38786706)
    • (2004) Medical and Biological Engineering and Computing , vol.42 , Issue.3 , pp. 419-427
    • Kim, K.H.1    Bang, S.W.2    Kim, S.R.3
  • 102
    • 33745184901 scopus 로고    scopus 로고
    • Integrating information from speech and physiological signals to achieve emotional sensitivity
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • Kim, J.; André, E.; Rehm, M.; Vogt, T.; Wagner, J.; 2005. Integrating information from speech and physiological signals to achieve emotional sensitivity. In: Proc. Interspeech, Lisbon, Portugal, pp. 809-812. (Pubitemid 43908185)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 809-812
    • Kim, J.1    Andre, E.2    Rehm, M.3    Vogt, T.4    Wagner, J.5
  • 104
    • 70450177653 scopus 로고    scopus 로고
    • Brno University of Technology System for Interspeech 2009 Emotion Challenge
    • Brighton
    • Kockmann, M.; Burget, L.; Černocký, J.; 2009. Brno University of Technology System for Interspeech 2009 Emotion Challenge. In: Proc. Interspeech, Brighton, pp. 348-351.
    • (2009) Proc. Interspeech , pp. 348-351
    • Kockmann, M.1    Burget, L.2    Černocký, J.3
  • 106
    • 70349223033 scopus 로고    scopus 로고
    • Contrasting emotion-bearing laughter types in multiparticipant vocal activity detection for meetings
    • IEEE, Taipei, Taiwan
    • Laskowski, K.; 2009. Contrasting emotion-bearing laughter types in multiparticipant vocal activity detection for meetings. In: Proc. ICASSP, IEEE, Taipei, Taiwan, pp. 4765-4768.
    • (2009) Proc. ICASSP , pp. 4765-4768
    • Laskowski, K.1
  • 108
    • 0033592606 scopus 로고    scopus 로고
    • Learning the parts of objects by non-negative matrix factorization
    • D.D. Lee, and H.S. Seung Learning the parts of objects by non-negative matrix factorization Nature 401 6755 1999 788 791
    • (1999) Nature , vol.401 , Issue.6755 , pp. 788-791
    • Lee, D.D.1    Seung, H.S.2
  • 109
    • 85009233228 scopus 로고    scopus 로고
    • Combining acoustic and language information for emotion recognition
    • Denver, CO, USA
    • Lee, C.M.; Narayanan, S.S.; Pieraccini, R.; 2002. Combining acoustic and language information for emotion recognition. In: Proc. Interspeech, Denver, CO, USA, pp. 873-376.
    • (2002) Proc. Interspeech , pp. 873-376
    • Lee . C, M.1    Narayanan . S, S.2    Pieraccini, R.3
  • 110
    • 70450179812 scopus 로고    scopus 로고
    • Emotion recognition using a hierarchical binary decision tree approach
    • Brighton
    • Lee, C.; Mower, E.; Busso, C.; Lee, S.; Narayanan, S.; 2009. Emotion recognition using a hierarchical binary decision tree approach. In: Proc. Interspeech, Brighton, pp. 320-323.
    • (2009) Proc. Interspeech , pp. 320-323
    • Lee, C.1    Mower, E.2    Busso, C.3    Lee, S.4    Narayanan, S.5
  • 114
    • 84976216481 scopus 로고    scopus 로고
    • Classifying subject ratings of emotional speech using acoustic features
    • Geneva, Switzerland
    • Liscombe, J.; Venditti, J.; Hirschberg, J.; 2003. Classifying subject ratings of emotional speech using acoustic features. In: Proc. Eurospeech, Geneva, Switzerland, pp. 725-728.
    • (2003) Proc. Eurospeech , pp. 725-728
    • Liscombe, J.1    Venditti, J.2    Hirschberg, J.3
  • 115
    • 84946012706 scopus 로고    scopus 로고
    • Recognizing emotions from student speech in tutoring dialogues
    • Virgin Island, USA
    • Litman, D.; Forbes, K.; 2003. Recognizing emotions from student speech in tutoring dialogues. In: Proc. ASRU, Virgin Island, USA, pp. 25-30.
    • (2003) Proc. ASRU , pp. 25-30
    • Litman, D.1    Forbes, K.2
  • 119
    • 70450161311 scopus 로고    scopus 로고
    • Combining spectral and prosodic information for emotion recognition in the interspeech 2009 emotion challenge
    • Brighton
    • Luengo, I.; Navas, E.; Hernáez, I.; 2009. Combining spectral and prosodic information for emotion recognition in the interspeech 2009 emotion challenge. In: Proc. Interspeech, Brighton, pp. 332-335.
    • (2009) Proc. Interspeech , pp. 332-335
    • Luengo, I.1    Navas, E.2    Hernáez, I.3
  • 120
    • 75249105202 scopus 로고    scopus 로고
    • Psychological motivated multi-stage emotion classification exploiting voice quality features
    • Mihelic, F.; Zibert, J. (Eds.) IN-TECH
    • Lugger, M.; Yang, B.; 2008. Psychological motivated multi-stage emotion classification exploiting voice quality features. In: Mihelic, F.; Zibert, J. (Eds.), Speech Recognition, IN-TECH, p. 1.
    • (2008) Speech Recognition , pp. 1
    • Lugger, M.1    Yang, B.2
  • 121
    • 33947615772 scopus 로고    scopus 로고
    • Robust estimation of voice quality parameters under real world disturbances
    • Toulouse, France
    • Lugger, M.; Yang, B.; Wokurek, W.; 2006. Robust estimation of voice quality parameters under real world disturbances. In: Proc. ICASSP, Toulouse, France, pp. 1097-1100.
    • (2006) Proc. ICASSP , pp. 1097-1100
    • Lugger, M.1    Yang, B.2    Wokurek, W.3
  • 122
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • J. Makhoul Linear prediction: a tutorial review Proc. IEEE 63 1975 561 580
    • (1975) Proc. IEEE , vol.63 , pp. 561-580
    • Makhoul, J.1
  • 124
    • 33746622941 scopus 로고    scopus 로고
    • Emotion recognition in non-structured utterances for human-robot interaction
    • DOI 10.1109/ROMAN.2005.1513750, 1513750, 14th IEEE Workshop on Robot and Human Interactive Communication, RO-MAN 2005
    • Martinez, C.A.; Cruz, A.; 2005. Emotion recognition in non-structured utterances for human-robot interaction. In: IEEE Internat. Workshop on Robot and Human Interactive Communication, Nashville, TN, USA, pp. 19-23. (Pubitemid 44144353)
    • (2005) Proceedings - IEEE International Workshop on Robot and Human Interactive Communication , vol.2005 , pp. 19-23
    • Martinez, C.A.1    Cruz, A.B.2
  • 125
    • 33744988731 scopus 로고    scopus 로고
    • Detection of cough signals in continuous audio recordings using hidden markov models
    • S. Matos, S. Birring, I. Pavord, and D. Evans Detection of cough signals in continuous audio recordings using hidden markov models IEEE Trans. Biomed. Eng. 2006 1078 1108
    • (2006) IEEE Trans. Biomed. Eng. , pp. 1078-1108
    • Matos, S.1    Birring, S.2    Pavord, I.3    Evans, D.4
  • 128
    • 67650705553 scopus 로고    scopus 로고
    • Using WordNet's semantic relations for opinion detection in blogs
    • Lecture Notes in Computer Science Springer
    • Missen, M.; Boughanem, M.; 2009. Using WordNet's semantic relations for opinion detection in blogs. In: Advances in Information Retrieval, Lecture Notes in Computer Science, Vol. 5478/2009. Springer, pp. 729-733.
    • (2009) Advances in Information Retrieval , pp. 729-733
    • Missen, M.1    Boughanem, M.2
  • 129
    • 34547954381 scopus 로고    scopus 로고
    • Voting ensembles for spoken affect classification
    • DOI 10.1016/j.jnca.2006.09.005, PII S1084804506000737
    • D. Morrison, and L.C.D. Silva Voting ensembles for spoken affect classification J. Network Comput. Appl. 30 2007 1356 1365 (Pubitemid 47263519)
    • (2007) Journal of Network and Computer Applications , vol.30 , Issue.4 , pp. 1356-1365
    • Morrison, D.1    De Silva, L.C.2
  • 130
    • 33846952503 scopus 로고    scopus 로고
    • Ensemble methods for spoken emotion recognition in call-centres
    • D. Morrison, R. Wang, and L.C.D. Silva Ensemble methods for spoken emotion recognition in call-centres Speech Comm. 49 2 2007 98 112
    • (2007) Speech Comm. , vol.49 , Issue.2 , pp. 98-112
    • Morrison, D.1    Wang, R.2    Silva, L.C.D.3
  • 131
    • 77955422208 scopus 로고    scopus 로고
    • Incremental learning for spoken affect classification and its application in call-centres
    • D. Morrison, R. Wang, W. Xu, and L.C.D. Silva Incremental learning for spoken affect classification and its application in call-centres Int. J. Intell. Systems Technol. Appl. 2 2007 242 254
    • (2007) Int. J. Intell. Systems Technol. Appl. , vol.2 , pp. 242-254
    • Morrison, D.1    Wang, R.2    Xu, W.3    Silva, L.C.D.4
  • 133
    • 27144489770 scopus 로고    scopus 로고
    • Emotion recognition from physiological signals using wireless sensors for presence technologies
    • DOI 10.1007/s10111-003-0143-x
    • F. Nasoz, K. Alvarez, C.L. Lisetti, and N. Finkelstein Emotion recognition from physiological signals using wireless sensors for presence technologies Cognition Technol. Work 6 1 2004 4 14 (Pubitemid 38305312)
    • (2004) Cognition Technol. Work , vol.6 , Issue.1 , pp. 4-14
    • Nasoz, F.1    Alvarez, K.2    Lisetti, C.L.3    Finkelstein, N.4
  • 134
    • 0036297183 scopus 로고    scopus 로고
    • A coupled HMM for audio-visual speech recognition
    • Orlando, FL, USA
    • Nefian, A.V.; Luhong, L.; Xiaobo, P.; Liu, X.; Mao, C.; Murphy, K.; 2002. A coupled HMM for audio-visual speech recognition. In: Proc. ICASSP, Orlando, FL, USA, pp. 2013-2016.
    • (2002) Proc. ICASSP , pp. 2013-2016
    • Nefian . A, V.1    Luhong, L.2    Xiaobo, P.3    Liu, X.4    Mao, C.5    Murphy, K.6
  • 135
    • 38749103707 scopus 로고    scopus 로고
    • Emotion recognition in spontaneous speech using GMMs
    • Pittsburgh, PA, USA
    • Neiberg, D.; Elenius, K.; Laskowski, K.; 2006. Emotion recognition in spontaneous speech using GMMs. In: Proc. Interspeech, Pittsburgh, PA, USA, pp. 809-812.
    • (2006) Proc. Interspeech , pp. 809-812
    • Neiberg, D.1    Elenius, K.2    Laskowski, K.3
  • 136
    • 0034203345 scopus 로고    scopus 로고
    • Null hypothesis significance testing: A review of an old and continuing controversy
    • R.S. Nickerson Null hypothesis significance testing: a review of an old and continuing controversy Psychol. Methods 5 2000 241 301
    • (2000) Psychol. Methods , vol.5 , pp. 241-301
    • Nickerson, R.S.1
  • 137
    • 85008007181 scopus 로고    scopus 로고
    • Speech emotion recognition using hidden markov models
    • Aalborg, Denmark
    • Nogueiras, A.; Moreno, A.; Bonafonte, A.; Mariño, J.B.; 2001. Speech emotion recognition using hidden markov models. In: Proc. Eurospeech, Aalborg, Denmark, pp. 2267-2270.
    • (2001) Proc. Eurospeech , pp. 2267-2270
    • Nogueiras, A.1    Moreno, A.2    Bonafonte, A.3    Mariño . J, B.4
  • 138
  • 139
    • 70349219560 scopus 로고    scopus 로고
    • Style estimation of speech based on multiple regression hidden semi-markov model
    • Antwerp, Belgium
    • Nose, T.; Kato, Y.; Kobayashi, T.; 2007. Style estimation of speech based on multiple regression hidden semi-markov model. In: Proc. Interspeech, Antwerp, Belgium, pp. 2285-2288.
    • (2007) Proc. Interspeech , pp. 2285-2288
    • Nose, T.1    Kato, Y.2    Kobayashi, T.3
  • 141
    • 0242721417 scopus 로고    scopus 로고
    • Speech emotion recognition using hidden markov models
    • T. Nwe, S. Foo, and L.D. Silva Speech emotion recognition using hidden markov models Speech Comm. 41 2003 603 623
    • (2003) Speech Comm. , vol.41 , pp. 603-623
    • Nwe, T.1    Foo, S.2    Silva, L.D.3
  • 142
    • 65849105927 scopus 로고    scopus 로고
    • Analytical features: A knowledge-based approach to audio feature generation
    • 23 pages
    • F. Pachet, and P. Roy Analytical features: a knowledge-based approach to audio feature generation EURASIP J. Audio Speech Music Process. 2009 23 pages
    • (2009) EURASIP J. Audio Speech Music Process.
    • Pachet, F.1    Roy, P.2
  • 143
    • 33947616723 scopus 로고    scopus 로고
    • Emotion detection from infant facial expressions and cries
    • Toulouse, France
    • Pal, P.; Iyer, A.; Yantorno, R.; 2006. Emotion detection from infant facial expressions and cries. In: Proc. ICASSP, Toulouse, France, pp. 809-812.
    • (2006) Proc. ICASSP , pp. 809-812
    • Pal, P.1    Iyer, A.2    Yantorno, R.3
  • 145
    • 2942590310 scopus 로고    scopus 로고
    • Toward an affect-sensitive multimodal human-computer interaction
    • DOI 10.1109/JPROC.2003.817122, Human-Computer Multimodal Interface
    • M. Pantic, and L. Rothkrantz Toward an affect-sensitive multimodal human-computer interaction Proc. IEEE 91 9 2003 1370 1390 (Pubitemid 40890819)
    • (2003) Proceedings of the IEEE , vol.91 , Issue.9 , pp. 1370-1390
    • Pantic, M.1    Rothkrantz, L.J.M.2
  • 146
    • 0032542897 scopus 로고    scopus 로고
    • What's wrong with Bonferroni adjustment
    • T.V. Pernegger What's wrong with Bonferroni adjustment Brit. Med. J. 316 1998 1236 1238
    • (1998) Brit. Med. J. , vol.316 , pp. 1236-1238
    • Pernegger, T.V.1
  • 147
  • 149
    • 70450186224 scopus 로고    scopus 로고
    • GTM-URL contribution to the INTERSPEECH 2009 Emotion Challenge
    • Brighton
    • Planet, S.; Iriondo, I.; Socoró, J.-C.; Monzo, C.; Adell, J.; 2009. GTM-URL contribution to the INTERSPEECH 2009 Emotion Challenge. In: Proc. Interspeech, Brighton, pp. 316-319.
    • (2009) Proc. Interspeech , pp. 316-319
    • Planet, S.1    Iriondo, I.2    Socoró, J.-C.3    Monzo, C.4    Adell, J.5
  • 150
    • 70450191308 scopus 로고    scopus 로고
    • Emotion classification in children's speech using fusion of acoustic and linguistic features
    • Brighton
    • Polzehl, T.; Sundaram, S.; Ketabdar, H.; Wagner, M.; Metze, F.; 2009. Emotion classification in children's speech using fusion of acoustic and linguistic features. In: Proc. Interspeech, Brighton, pp. 340-343.
    • (2009) Proc. Interspeech , pp. 340-343
    • Polzehl, T.1    Sundaram, S.2    Ketabdar, H.3    Wagner, M.4    Metze, F.5
  • 151
    • 0012645358 scopus 로고    scopus 로고
    • Emotion-sensitive human-computer interfaces
    • Newcastle, Northern Ireland
    • Polzin, T.S.; Waibel, A.; 2000. Emotion-sensitive human-computer interfaces. In: Proc. ISCA Workshop on Speech and Emotion, Newcastle, Northern Ireland, pp. 201-206.
    • (2000) Proc. ISCA Workshop on Speech and Emotion , pp. 201-206
    • Polzin . T, S.1    Waibel, A.2
  • 153
    • 84948481845 scopus 로고
    • An algorithm for suffix stripping
    • M. Porter An algorithm for suffix stripping Program 14 3 1980 130 137
    • (1980) Program , vol.14 , Issue.3 , pp. 130-137
    • Porter, M.1
  • 158
    • 34248961406 scopus 로고
    • Cognitive representations of semantic categories
    • E. Rosch Cognitive representations of semantic categories J. Exp. Psychol.: Gen. 104 3 1975 192 233
    • (1975) J. Exp. Psychol.: Gen. , vol.104 , Issue.3 , pp. 192-233
    • Rosch, E.1
  • 159
    • 85101155057 scopus 로고    scopus 로고
    • Augmenting the kappa statistic to determine interannotator reliability for multiply labeled data points
    • Dumais, D.M.; Roukos, S. (Eds.) Boston, MA, USA
    • Rosenberg, A.; Binkowski, E.; 2004. Augmenting the kappa statistic to determine interannotator reliability for multiply labeled data points. In: Dumais, D.M.; Roukos, S. (Eds.), HLT-NAACL 2004: Short Papers. Association for Computational Linguistics, Boston, MA, USA, pp. 77-80.
    • (2004) HLT-NAACL 2004: Short Papers. Association for Computational Linguistics , pp. 77-80
    • Rosenberg, A.1    Binkowski, E.2
  • 160
    • 0000130677 scopus 로고
    • The fallacy of the null-hypothesis significance test
    • W. Rozeboom The fallacy of the null-hypothesis significance test Psychol. Bull. 57 1960 416 428
    • (1960) Psychol. Bull. , vol.57 , pp. 416-428
    • Rozeboom, W.1
  • 162
    • 84864888751 scopus 로고
    • Recognition memory for syntactic and semantic aspects od connected discourse
    • J. Sachs Recognition memory for syntactic and semantic aspects od connected discourse Percept. Psychophys. 2 1967 437 442
    • (1967) Percept. Psychophys. , vol.2 , pp. 437-442
    • Sachs, J.1
  • 163
    • 79953023044 scopus 로고    scopus 로고
    • Graded representations of emotional expressions in the left superior temporal sulcus
    • doi:10.3389/fnsys.2010.00006
    • C. Said, C. Moore, K. Norman, J. Haxby, and A. Todorov Graded representations of emotional expressions in the left superior temporal sulcus Front. Systems Neurosci. 4 2010 6 doi:10.3389/fnsys.2010.00006
    • (2010) Front. Systems Neurosci. , vol.4 , pp. 6
    • Said, C.1    Moore, C.2    Norman, K.3    Haxby, J.4    Todorov, A.5
  • 164
    • 27144463192 scopus 로고    scopus 로고
    • On comparing classifiers: Pitfalls to avoid and a recommended approach
    • S. Salzberg On comparing classifiers: pitfalls to avoid and a recommended approach Data Mining Knowl. Discov. 1 3 1997 317 328
    • (1997) Data Mining Knowl. Discov. , vol.1 , Issue.3 , pp. 317-328
    • Salzberg, S.1
  • 165
    • 70349193509 scopus 로고    scopus 로고
    • Emotion recognition using mel-frequency cepstral coefficients
    • N. Sato, and Y. Obuchi Emotion recognition using mel-frequency cepstral coefficients Inform. Media Technol. 2 3 2007 835 848
    • (2007) Inform. Media Technol. , vol.2 , Issue.3 , pp. 835-848
    • Sato, N.1    Obuchi, Y.2
  • 166
    • 0347613216 scopus 로고    scopus 로고
    • Vocal expression of emotion
    • R.J. Davidson, K.R. Scherer, H.H. Goldsmith, Oxford University Press Oxford, New York (Chapter 23)
    • K.R. Scherer, T. Johnstone, and G. Klasmeyer Vocal expression of emotion R.J. Davidson, K.R. Scherer, H.H. Goldsmith, Handbook of Affective Sciences 2003 Oxford University Press Oxford, New York 433 456 (Chapter 23)
    • (2003) Handbook of Affective Sciences , pp. 433-456
    • Scherer, K.R.1    Johnstone, T.2    Klasmeyer, G.3
  • 167
    • 23944443102 scopus 로고    scopus 로고
    • Automatic phonetic transcription of non-prompted speech
    • San Francisco, CA, USA
    • Schiel, F.; 1999. Automatic phonetic transcription of non-prompted speech. In: Proc. ICPhS, San Francisco, CA, USA, pp. 607-610.
    • (1999) Proc. ICPhS , pp. 607-610
    • Schiel, F.1
  • 168
    • 38049013081 scopus 로고    scopus 로고
    • First suggestions for an emotion annotation and representation language
    • Devillers, L.; Martin, J.-C.; Cowie, R.; Douglas-Cowie, E.; Batliner, A. (Eds.) Genoa, Italy
    • Schröder, M.; Pirker, H.; Lamolle, M.; 2006. First suggestions for an emotion annotation and representation language. In: Devillers, L.; Martin, J.-C.; Cowie, R.; Douglas-Cowie, E.; Batliner, A. (Eds.), Proc. Satellite Workshop of LREC 2006 on Corpora for Research on Emotion and Affect, Genoa, Italy, pp. 88-92.
    • (2006) Proc. Satellite Workshop of LREC 2006 on Corpora for Research on Emotion and Affect , pp. 88-92
    • Schröder, M.1    Pirker, H.2    Lamolle, M.3
  • 171
    • 0141478857 scopus 로고    scopus 로고
    • Hidden markov model-based speech emotion recognition
    • Hong Kong
    • Schuller, B.; Rigoll, G.; Lang, M.; 2003. Hidden markov model-based speech emotion recognition. In: Proc. ICASSP, Hong Kong, pp. 1-4.
    • (2003) Proc. ICASSP , pp. 1-4
    • Schuller, B.1    Rigoll, G.2    Lang, M.3
  • 172
    • 4544316885 scopus 로고    scopus 로고
    • Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture
    • Montreal, Canada
    • Schuller, B.; Rigoll, G.; Lang, M.; 2004. Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture. In: Proc. ICASSP, Montreal, Canada, pp. 577-580.
    • (2004) Proc. ICASSP , pp. 577-580
    • Schuller, B.1    Rigoll, G.2    Lang, M.3
  • 173
    • 33646758175 scopus 로고    scopus 로고
    • Meta-classifiers in acoustic and linguistic feature fusion-based affect recognition
    • Philadelphia, PA, USA
    • Schuller, B.; Jiménez Villar, R.; Rigoll, G.; Lang, M.; 2005a. Meta-classifiers in acoustic and linguistic feature fusion-based affect recognition. In: Proc. ICASSP, Philadelphia, PA, USA, pp. I:325-328.
    • (2005) Proc. ICASSP
    • Schuller, B.1    Jiménez Villar, R.2    Rigoll, G.3    Lang, M.4
  • 174
    • 33745198227 scopus 로고    scopus 로고
    • Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles
    • 9th European Conference on Speech Communication and Technology, Eurospeech Interspeech
    • Schuller, B.; Müller, R.; Lang, M.; Rigoll, G.; 2005b. Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensemble. In: Proc. Interspeech, Lisbon, Portugal, pp. 805-808. (Pubitemid 43908184)
    • (2005) 9th European Conference on Speech Communication and Technology , pp. 805-808
    • Schuller, B.1    Muller, R.2    Lang, M.3    Rigoll, G.4
  • 176
    • 78149472083 scopus 로고    scopus 로고
    • Emotion recognition in the noise applying large acoustic feature sets
    • Dresden, Germany, no pagination
    • Schuller, B.; Arsić, D.; Wallhoff, F.; Rigoll, G.; 2006. Emotion recognition in the noise applying large acoustic feature sets. In: Proc. Speech Prosody 2006, Dresden, Germany, no pagination.
    • (2006) Proc. Speech Prosody 2006
    • Schuller, B.1    Arsić, D.2    Wallhoff, F.3    Rigoll, G.4
  • 177
    • 38049067290 scopus 로고    scopus 로고
    • Timing levels in segment-based speech emotion recognition
    • Pittsburgh, PA, USA
    • Schuller, B.; Rigoll, G.; 2006. Timing levels in segment-based speech emotion recognition. In: Proc. Interspeech, Pittsburgh, PA, USA, pp. 1818-1821.
    • (2006) Proc. Interspeech , pp. 1818-1821
    • Schuller, B.1    Rigoll, G.2
  • 178
    • 44949160056 scopus 로고    scopus 로고
    • Recognition of interest in human conversational speech
    • Pittsburgh, PA, USA
    • Schuller, B.; Köhler, N.; Müller, R.; Rigoll, G.; 2006b. Recognition of interest in human conversational speech. In: Proc. Interspeech, Pittsburgh, PA, USA, pp. 793-796.
    • (2006) Proc. Interspeech , pp. 793-796
    • Schuller, B.1    Köhler, N.2    Müller, R.3    Rigoll, G.4
  • 180
    • 79953811661 scopus 로고    scopus 로고
    • Affect-robust speech recognition by dynamic emotional adaptation
    • Dresden, Germany, no pagination
    • Schuller, B.; Stadermann, J.; Rigoll, G.; 2006d. Affect-robust speech recognition by dynamic emotional adaptation. In: Proc. Speech Prosody 2006, Dresden, Germany, no pagination.
    • (2006) Proc. Speech Prosody 2006
    • Schuller, B.1    Stadermann, J.2    Rigoll, G.3
  • 182
    • 48249092791 scopus 로고    scopus 로고
    • Audiovisual recognition of spontaneous interest within conversations
    • Special Session on Multimodal Analysis of Human Spontaneous Behaviour. ACM SIGCHI, Nagoya, Japan
    • Schuller, B.; Müller, R.; Hörnler, B.; Höthker, A.; Konosu, H.; Rigoll, G.; 2007b. Audiovisual recognition of spontaneous interest within conversations. In: Proc. 9th Internat. Conf. on Multimodal Interfaces (ICMI), Special Session on Multimodal Analysis of Human Spontaneous Behaviour. ACM SIGCHI, Nagoya, Japan, pp. 30-37.
    • (2007) Proc. 9th Internat. Conf. on Multimodal Interfaces (ICMI) , pp. 30-37
    • Schuller, B.1    Müller, R.2    Hörnler, B.3    Höthker, A.4    Konosu, H.5    Rigoll, G.6
  • 183
    • 34547549142 scopus 로고    scopus 로고
    • Towards more reality in the recognition of emotional speech
    • Honolulu, HY, USA
    • Schuller, B.; Seppi, D.; Batliner, A.; Meier, A.; Steidl, S.; 2007c. Towards more reality in the recognition of emotional speech. In: Proc. ICASSP, Honolulu, HY, USA, pp. 941-944.
    • (2007) Proc. ICASSP , pp. 941-944
    • Schuller, B.1    Seppi, D.2    Batliner, A.3    Meier, A.4    Steidl, S.5
  • 186
    • 54049132987 scopus 로고    scopus 로고
    • Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition
    • Hannover, Germany
    • Schuller, B.; Vlasenko, B.; Arsic, D.; Rigoll, G.; Wendemuth, A.; 2008c. Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition. In: Proc. ICME, Hannover, Germany, pp. 1333-1336.
    • (2008) Proc. ICME , pp. 1333-1336
    • Schuller, B.1    Vlasenko, B.2    Arsic, D.3    Rigoll, G.4    Wendemuth, A.5
  • 187
    • 84867198846 scopus 로고    scopus 로고
    • Detection of security related affect and behaviour in passenger transport
    • Brisbane, Australia
    • Schuller, B.; Wimmer, M.; Arsic, D.; Moosmayr, T.; Rigoll, G.; 2008d. Detection of security related affect and behaviour in passenger transport. In: Proc. Interspeech, Brisbane, Australia, pp. 265-268.
    • (2008) Proc. Interspeech , pp. 265-268
    • Schuller, B.1    Wimmer, M.2    Arsic, D.3    Moosmayr, T.4    Rigoll, G.5
  • 188
    • 51449104640 scopus 로고    scopus 로고
    • Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space?
    • Las Vegas, NV
    • Schuller, B.; Wimmer, M.; Mösenlechner, L.; Kern, C.; Arsic, D.; Rigoll, G.; 2008e. Brute-forcing hierarchical functionals for paralinguistics: a waste of feature space? In: Proc. ICASSP, Las Vegas, NV, pp. 4501-4504.
    • (2008) Proc. ICASSP , pp. 4501-4504
    • Schuller, B.1    Wimmer, M.2    Mösenlechner, L.3    Kern, C.4    Arsic, D.5    Rigoll, G.6
  • 189
    • 70450185591 scopus 로고    scopus 로고
    • Recognising interest in conversational speech - Comparing bag of frames and supra-segmental features
    • Brighton, UK
    • Schuller, B.; Rigoll, G.; 2009. Recognising interest in conversational speech - comparing bag of frames and supra-segmental features. In: Proc. Interspeech, Brighton, UK, pp. 1999-2002.
    • (2009) Proc. Interspeech , pp. 1999-2002
    • Schuller, B.1    Rigoll, G.2
  • 190
    • 70349193703 scopus 로고    scopus 로고
    • Emotion recognition from speech: Putting ASR in the loop
    • IEEE, Taipei, Taiwan
    • Schuller, B.; Batliner, A.; Steidl, S.; Seppi, D.; 2009a. Emotion recognition from speech: putting ASR in the loop. In: Proc. ICASSP, IEEE, Taipei, Taiwan, pp. 4585-4588.
    • (2009) Proc. ICASSP , pp. 4585-4588
    • Schuller, B.1    Batliner, A.2    Steidl, S.3    Seppi, D.4
  • 191
    • 70349292240 scopus 로고    scopus 로고
    • Being bored? Recognising natural interest by extensive audiovisual integration for real-life Application
    • (Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior)
    • Schuller, B.; Müller, R.; Eyben, F.; Gast, J.; Hörnler, B.; Wöllmer, M.; Rigoll, G.; Höthker, A.; Konosu, H.; 2009b. Being bored? Recognising natural interest by extensive audiovisual integration for real-life Application. Image Vision Comput. J. (IMAVIS) 27, 1760-1774 (Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior).
    • (2009) Image Vision Comput. J. (IMAVIS) , vol.27 , pp. 1760-1774
    • Schuller, B.1    Müller, R.2    Eyben, F.3    Gast, J.4    Hörnler, B.5    Wöllmer, M.6    Rigoll, G.7    Höthker, A.8    Konosu, H.9
  • 192
    • 71249091768 scopus 로고    scopus 로고
    • The "godfather" vs. "chaos": Comparing linguistic analysis based on online knowledge sources and Bags-of-N-grams for movie review valence estimation
    • Barcelona, Spain
    • Schuller, B.; Schenk, J.; Rigoll, G.; Knaup, T.; 2009c. The "Godfather" vs. "Chaos": comparing linguistic analysis based on online knowledge sources and Bags-of-N-grams for movie review valence estimation. In: Proc. Internat. Conf. on Document Analysis and Recognition, Barcelona, Spain, pp. 858-862.
    • (2009) Proc. Internat. Conf. on Document Analysis and Recognition , pp. 858-862
    • Schuller, B.1    Schenk, J.2    Rigoll, G.3    Knaup, T.4
  • 193
    • 70450206416 scopus 로고    scopus 로고
    • The INTERSPEECH 2009 Emotion Challenge
    • Brighton, UK
    • Schuller, B.; Steidl, S.; Batliner, A.; 2009d. The INTERSPEECH 2009 Emotion Challenge. In: Proc. Interspeech, Brighton, UK, pp. 312-315.
    • (2009) Proc. Interspeech , pp. 312-315
    • Schuller, B.1    Steidl, S.2    Batliner, A.3
  • 196
    • 67650135931 scopus 로고    scopus 로고
    • Recognition of noisy speech: A comparative survey of robust model architectures and feature enhancement
    • (JASMP) Article ID 942617
    • Schuller, B.; Wöllmer, M.; Moosmayr, T.; Rigoll, G.; 2009g. Recognition of noisy speech: a comparative survey of robust model architectures and feature enhancement. EURASIP J. Audio Speech Music Process. (JASMP), 17 pages, Article ID 942617.
    • (2009) EURASIP J. Audio Speech Music Process , pp. 17
    • Schuller, B.1    Wöllmer, M.2    Moosmayr, T.3    Rigoll, G.4
  • 198
    • 78049361438 scopus 로고    scopus 로고
    • Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization
    • Dallas
    • Schuller, B.; Weninger, F.; 2010. Discrimination of speech and non-linguistic vocalizations by non-negative matrix factorization. In: Proc. ICASSP, Dallas, pp. 5054-5057.
    • (2010) Proc. ICASSP , pp. 5054-5057
    • Schuller, B.1    Weninger, F.2
  • 206
    • 38149027682 scopus 로고    scopus 로고
    • Automatic classification of expressiveness in speech: A multi-corpus study
    • Müller, C. (Ed.) Springer, Heidelberg-Berlin-New York
    • Shami, M.; Verhelst, W.; 2007. Automatic classification of expressiveness in speech: a multi-corpus study. In: Müller, C. (Ed.), Speaker Classification II, Lecture Notes in Computer Science/Artificial Intelligence, Vol. 4441. Springer, Heidelberg-Berlin-New York, pp. 43-56.
    • (2007) Speaker Classification II, Lecture Notes in Computer Science/Artificial Intelligence , vol.4441 , pp. 43-56
    • Shami, M.1    Verhelst, W.2
  • 207
    • 0001403157 scopus 로고
    • Cross-cultural similarities and differences in emotion and its representation: A prototype approach
    • P.R. Shaver, S. Wu, and J.C. Schwartz Cross-cultural similarities and differences in emotion and its representation: a prototype approach Emotion 1992 175 212
    • (1992) Emotion , pp. 175-212
    • Shaver, P.R.1    Wu, S.2    Schwartz, J.C.3
  • 209
    • 22944452795 scopus 로고    scopus 로고
    • Looking at the last two turns, I'd say this dialogue is doomed - Measuring dialogue success
    • Sojka, P.; Kopeček, I.; Pala, K. (Eds.) Berlin, Heidelberg
    • Steidl, S.; Ruff, C.; Batliner, A.; Nöth, E.; Haas, J.; 2004. Looking at the last two turns, I'd say this dialogue is doomed - measuring dialogue success. In: Sojka, P.; Kopeček, I.; Pala, K. (Eds.), 7th Internat. Conf. on Text, Speech and Dialogue, TSD 2004, Berlin, Heidelberg, pp. 629-636.
    • (2004) 7th Internat. Conf. on Text, Speech and Dialogue, TSD 2004 , pp. 629-636
    • Steidl, S.1    Ruff, C.2    Batliner, A.3    Nöth, E.4    Haas, J.5
  • 212
    • 77949400109 scopus 로고    scopus 로고
    • The hinterland of emotions: Facing the open-microphone challenge
    • Amsterdam, Netherlands
    • Steidl, S.; Schuller, B.; Batliner, A.; Seppi, D.; 2009. The hinterland of emotions: facing the open-microphone challenge. In: Proc. ACII, Amsterdam, Netherlands, pp. 690-697.
    • (2009) Proc. ACII , pp. 690-697
    • Steidl, S.1    Schuller, B.2    Batliner, A.3    Seppi, D.4
  • 213
    • 77951457701 scopus 로고    scopus 로고
    • On the impact of children's emotional speech on acoustic and language models
    • doi:10.1155/2010/783954
    • Steidl, S.; Batliner, A.; Seppi, D.; Schuller, B.; 2010. On the impact of children's emotional speech on acoustic and language models. EURASIP J. Audio Speech Music Process.; 14. doi:10.1155/2010/783954.
    • (2010) EURASIP J. Audio Speech Music Process , pp. 14
    • Steidl, S.1    Batliner, A.2    Seppi, D.3    Schuller, B.4
  • 215
    • 0037382560 scopus 로고    scopus 로고
    • Emotions, speech and the ASR framework
    • L. tenBosch Emotions, speech and the ASR framework Speech Comm. 40 1-2 2003 213 225
    • (2003) Speech Comm. , vol.40 , Issue.12 , pp. 213-225
    • Tenbosch, L.1
  • 216
    • 0029747053 scopus 로고    scopus 로고
    • Integrating audio and visual information to provide highly robust speech recognition
    • Atlanta, GA, USA
    • Tomlinson, M.J.; Russell, M.J.; Brooke, N.M.; 1996. Integrating audio and visual information to provide highly robust speech recognition. In: Proc. ICASSP, Atlanta, GA, USA, pp. 812-824.
    • (1996) Proc. ICASSP , pp. 812-824
    • Tomlinson . M, J.1    Russell . M, J.2    Brooke . N, M.3
  • 218
  • 219
    • 84862624179 scopus 로고    scopus 로고
    • Fast sequential floating forward selection applied to emotional speech features estimated on des and SUSAS data collection
    • Florence, Italy, no pagination
    • Ververidis, D.; Kotropoulos, C.; 2006. Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collection. In: Proc. European Signal Processing Conf. (EUSIPCO 2006), Florence, Italy, no pagination.
    • (2006) Proc. European Signal Processing Conf. (EUSIPCO 2006)
    • Ververidis, D.1    Kotropoulos, C.2
  • 220
    • 77955418889 scopus 로고    scopus 로고
    • Five emotion classes in real-world call center data: The use of various types of paralinguistic features
    • Vidrascu, L.; Devillers, L.; 2007. Five emotion classes in real-world call center data: the use of various types of paralinguistic features. In: Proc. PARALING07, pp. 11-16.
    • (2007) Proc. PARALING07 , pp. 11-16
    • Vidrascu, L.1    Devillers, L.2
  • 222
    • 70450176954 scopus 로고    scopus 로고
    • Processing affected speech within human machine interaction
    • Brighton
    • Vlasenko, B.; 2009. Processing affected speech within human machine interaction. In: Proc. Interspeech, Brighton, pp. 2039-2042.
    • (2009) Proc. Interspeech , pp. 2039-2042
    • Vlasenko, B.1
  • 223
    • 56149115138 scopus 로고    scopus 로고
    • Combining frame and turn-level information for robust recognition of emotions within speech
    • Antwerp, Belgium
    • Vlasenko, B.; Schuller, B.; Wendemuth, A.; Rigoll, G.; 2007a. Combining frame and turn-level information for robust recognition of emotions within speech. In: Proc. Interspeech, Antwerp, Belgium, pp. 2249-2252.
    • (2007) Proc. Interspeech , pp. 2249-2252
    • Vlasenko, B.1    Schuller, B.2    Wendemuth, A.3    Rigoll, G.4
  • 224
    • 38049048651 scopus 로고    scopus 로고
    • Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing
    • A. Paiva, R. Prada, R.W. Picard, Springer Berlin-Heidelberg
    • B. Vlasenko, B. Schuller, A. Wendemuth, and G. Rigoll Frame vs. turn-level: emotion recognition from speech considering static and dynamic processing A. Paiva, R. Prada, R.W. Picard, Affective Computing and Intelligent Interaction 2007 Springer Berlin-Heidelberg 139 147
    • (2007) Affective Computing and Intelligent Interaction , pp. 139-147
    • Vlasenko, B.1    Schuller, B.2    Wendemuth, A.3    Rigoll, G.4
  • 225
    • 84867201207 scopus 로고    scopus 로고
    • Balancing spoken content adaptation and unit length in the recognition of emotion and interest
    • Brisbane, Australia
    • Vlasenko, B.; Schuller, B.; Mengistu, T.K.; Rigoll, G.A.W.; 2008. Balancing spoken content adaptation and unit length in the recognition of emotion and interest. In: Proc. Interspeech, Brisbane, Australia, pp. 805-808.
    • (2008) Proc. Interspeech , pp. 805-808
    • Vlasenko, B.1    Schuller, B.2    Mengistu, T.K.3    Rigoll, A.G.W.4
  • 226
    • 33750564952 scopus 로고    scopus 로고
    • Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition
    • DOI 10.1109/ICME.2005.1521463, 1521463, IEEE International Conference on Multimedia and Expo, ICME 2005
    • Vogt, T.; André, E.; 2005. Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition. In: Proc. Multimedia and Expo (ICME05), Amsterdam, Netherlands, pp. 474-477. (Pubitemid 44668907)
    • (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 474-477
    • Vogt, T.1    Andre, E.2
  • 227
    • 70450177657 scopus 로고    scopus 로고
    • Exploring the benefits of discretization of acoustic features for speech emotion recognition
    • Brighton
    • Vogt, T.; André, E.; 2009. Exploring the benefits of discretization of acoustic features for speech emotion recognition. In: Proc. Interspeech, Brighton, pp. 328-331.
    • (2009) Proc. Interspeech , pp. 328-331
    • Vogt, T.1    André, E.2
  • 229
    • 77949371114 scopus 로고    scopus 로고
    • Real-time vocal emotion recognition in artistic installations and interactive storytelling: Experiences and lessons learnt from CALLAS and IRIS
    • Amsterdam, Netherlands
    • Vogt, T.; André, E.; Wagner, J.; Gilroy, S.; Charles, F.; Cavazza, M.; 2009. Real-time vocal emotion recognition in artistic installations and interactive storytelling: Experiences and lessons learnt from CALLAS and IRIS. In: Proc. ACII, Amsterdam, Netherlands, pp. 670-677.
    • (2009) Proc. ACII , pp. 670-677
    • Vogt, T.1    André, E.2    Wagner, J.3    Gilroy, S.4    Charles, F.5    Cavazza, M.6
  • 230
    • 33749572603 scopus 로고    scopus 로고
    • From physiological signals to emotions: Implementing and comparing selected methods for feature extraction and classification
    • DOI 10.1109/ICME.2005.1521579, 1521579, IEEE International Conference on Multimedia and Expo, ICME 2005
    • Wagner, J.; Kim, J.; André, E.; 2005. From physiological signals to emotions: implementing and comparing selected methods for feature extraction and classification. In: Proc. ICME, Amsterdam, Netherlands, pp. 940-943. (Pubitemid 44669023)
    • (2005) IEEE International Conference on Multimedia and Expo, ICME 2005 , vol.2005 , pp. 940-943
    • Wagner, J.1    Kim, J.2    Andre, E.3
  • 231
    • 38049079660 scopus 로고    scopus 로고
    • A systematic comparison of different HMM designs for emotion recognition from acted and spontaneous speech
    • Paiva, A.; Prada, R.; Picard, R.W. (Eds.) Springer, Berlin-Heidelberg
    • Wagner, J.; Vogt, T.; André, 2007. A systematic comparison of different HMM designs for emotion recognition from acted and spontaneous speech. In: Paiva, A.; Prada, R.; Picard, R.W. (Eds.), Affective Computing and Intelligent Interaction. Springer, Berlin-Heidelberg, pp. 114-125.
    • (2007) Affective Computing and Intelligent Interaction , pp. 114-125
    • Wagner, J.1    Vogt, T.2    André3
  • 232
    • 33646784723 scopus 로고    scopus 로고
    • Recognizing human emotion from audiovisual information
    • Philadelphia, PA, USA
    • Wang, Y.; Guan, L.; 2005. Recognizing human emotion from audiovisual information. In: Proc. ICASSP, Vol. 2. Philadelphia, PA, USA, pp. 1125-1128.
    • (2005) Proc. ICASSP , vol.2 , pp. 1125-1128
    • Wang, Y.1    Guan, L.2
  • 236
    • 84862156369 scopus 로고    scopus 로고
    • Abandoning emotion classes - Towards continuous emotion recognition with modelling of long-range dependencies
    • Brisbane, Australia
    • Wöllmer, M.; Eyben, F.; Reiter, S.; Schuller, B.; Cox, C.; Douglas-Cowie, E.; Cowie, R.; 2008. Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies. In: Proc. Interspeech, Brisbane, Australia, pp. 597-600.
    • (2008) Proc. Interspeech , pp. 597-600
    • Wöllmer, M.1    Eyben, F.2    Reiter, S.3    Schuller, B.4    Cox, C.5    Douglas-Cowie, E.6    Cowie, R.7
  • 237
    • 70449526103 scopus 로고    scopus 로고
    • A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams
    • M. Wöllmer, M. Al-Hames, F. Eyben, B. Schuller, and G. Rigoll A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams Neurocomputing 73 2009 366 380
    • (2009) Neurocomputing , vol.73 , pp. 366-380
    • Wöllmer, M.1    Al-Hames, M.2    Eyben, F.3    Schuller, B.4    Rigoll, G.5
  • 238
    • 70349203870 scopus 로고    scopus 로고
    • Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
    • Taipei, Taiwan
    • Wöllmer, M.; Eyben, F.; Keshet, J.; Graves, A.; Schuller, B.; Rigoll, G.; 2009. Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks. In: Proc. ICASSP, Taipei, Taiwan, pp. 3949-3952.
    • (2009) Proc. ICASSP , pp. 3949-3952
    • Wöllmer, M.1    Eyben, F.2    Keshet, J.3    Graves, A.4    Schuller, B.5    Rigoll, G.6
  • 239
    • 77956721304 scopus 로고    scopus 로고
    • Combining long short-term memory and dynamic bayesian networks for incremental emotion-sensitive artificial listening
    • (Special Issue on "Speech Processing for Natural Interaction with Intelligent Environments")
    • Wöllmer, M.; Schuller, B.; Eyben, F.; Rigoll, G.; 2010. Combining long short-term memory and dynamic bayesian networks for incremental emotion-sensitive artificial listening. IEEE J. Select. Topics Signal Process. 4, 867-881 (Special Issue on "Speech Processing for Natural Interaction with Intelligent Environments").
    • (2010) IEEE J. Select. Topics Signal Process , vol.4 , pp. 867-881
    • Wöllmer, M.1    Schuller, B.2    Eyben, F.3    Rigoll, G.4
  • 240
    • 0026692226 scopus 로고
    • Stacked generalization
    • D. Wolpert Stacked generalization Neural Networks 5 1992 241 259
    • (1992) Neural Networks , vol.5 , pp. 241-259
    • Wolpert, D.1
  • 241
    • 77952107788 scopus 로고    scopus 로고
    • Posting act tagging using transformation-based learning
    • T.Y. Lin, S. Ohsuga, C.-J. Liau, X. Hu, S. Tsumoto, Springer Berlin-Heidelberg
    • T. Wu, F. Khan, T. Fisher, L. Shuler, and W. Pottenger Posting act tagging using transformation-based learning T.Y. Lin, S. Ohsuga, C.-J. Liau, X. Hu, S. Tsumoto, Foundations of Data Mining and Knowledge Discovery 2005 Springer Berlin-Heidelberg 319 331
    • (2005) Foundations of Data Mining and Knowledge Discovery , pp. 319-331
    • Wu, T.1    Khan, F.2    Fisher, T.3    Shuler, L.4    Pottenger, W.5
  • 243
    • 84867228433 scopus 로고    scopus 로고
    • Long-term spectro-temporal information for improved automatic speech emotion classification
    • Brisbane, Australia
    • Wu, S.; Falk, T.; Chan, W.-Y.; 2008b. Long-term spectro-temporal information for improved automatic speech emotion classification. In: Proc. INTERSPEECH, Brisbane, Australia, pp. 638-641.
    • (2008) Proc. INTERSPEECH , pp. 638-641
    • Wu, S.1    Falk, T.2    Chan, W.-Y.3
  • 244
    • 29344473560 scopus 로고    scopus 로고
    • Sentiment analyzer: Extracting sentiments about a given topic using natural language processing techniques
    • Melbourne, FL
    • Yi, J.; Nasukawa, T.; Bunescu, R.; Niblack, W.; 2003. Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques. In: Proc. IEEE Internat. Conf. on Data Mining (ICDM), Melbourne, FL, pp. 427-434.
    • (2003) Proc. IEEE Internat. Conf. on Data Mining (ICDM) , pp. 427-434
    • Yi, J.1    Nasukawa, T.2    Bunescu, R.3    Niblack, W.4
  • 247
    • 84961322808 scopus 로고    scopus 로고
    • Detecting user engagement in everyday conversations
    • Yu, C.; Aoki, P.; Woodruf, A.; 2004. Detecting user engagement in everyday conversations. In: Proc. ICSLP, pp. 1329-1332.
    • (2004) Proc. ICSLP , pp. 1329-1332
    • Yu, C.1    Aoki, P.2    Woodruf, A.3
  • 251
    • 32844468513 scopus 로고    scopus 로고
    • Text-to-emotion engine for real time internet communication
    • Networks, and DSPs, Staffordshire University
    • Zhe, X.; Boucouvalas, A.; 2002. Text-to-emotion engine for real time internet communication. In: Proc. Internat. Symp. on Communication Systems, Networks, and DSPs, Staffordshire University, pp. 164-168.
    • (2002) Proc. Internat. Symp. on Communication Systems , pp. 164-168
    • Zhe, X.1    Boucouvalas, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.