Volume 4, Issue 5, 2010, Pages 867-881

Combining long short-term memory and dynamic Bayesian networks for incremental emotion-sensitive artificial listening

Author keywords

Dynamic Bayesian networks (DBNs); emotion recognition; intelligent environments; long short term memory (LSTM); recurrent neural nets; virtual agents

Indexed keywords

DYNAMIC BAYESIAN NETWORKS; EMOTION RECOGNITION; INTELLIGENT ENVIRONMENT; NEURAL NET; SHORT TERM MEMORY; VIRTUAL AGENT;

EID: 77956721304     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2057200     Document Type: Article
Times cited : (155)

References (90)
  • 1
    • 6644225702
    • Multimodal human-computer interaction
    • Waseda, Japan
    • M. T. Vo and A. Waibel, "Multimodal human-computer interaction," in Proc. ISSD'93, Waseda, Japan, 1993.
    • (1993) Proc. ISSD'93
    • Vo, M.T.1    Waibel, A.2
  • 2
    • 0028862886
    • Toward the ultimate synthesis/recognition
    • S. Furui, "Toward the ultimate synthesis/recognition," Proc. Nat. Acad. Sci. USA, vol.92, no.22, pp. 10040-10045, 1995.
    • (1995) Proc. Nat. Acad. Sci. USA , vol.92 , Issue.22 , pp. 10040-10045
    • Furui, S.1
  • 4
    • 0005540533
    • Toward agents that recognize emotion
    • R. W. Picard, "Toward agents that recognize emotion," in Actes Proc. IMAGINA, 1998, pp. 153-165.
    • (1998) Actes Proc. IMAGINA , pp. 153-165
    • Picard, R.W.1
  • 5
    • 33745828208
    • How people really talk and why engineers should care
    • Lisbon, Portugal
    • E. Shriberg, "How people really talk and why engineers should care," in Proc. Interspeech, Lisbon, Portugal, 2005, pp. 1781-1784.
    • (2005) Proc. Interspeech , pp. 1781-1784
    • Shriberg, E.1
  • 6
    • 57149144228
    • A survey of affect recognition methods: Audio, visual, and spontaneous expressions
    • Jan
    • Z. Zeng, M. Pantic, G. I. Roisman, and T. S. Huang, "A survey of affect recognition methods: Audio, visual, and spontaneous expressions," IEEE Trans. Pattern Anal. Mach. Intell., vol.31, no.1, pp. 39-58, Jan. 2009.
    • (2009) IEEE Trans. Pattern Anal. Mach. Intell. , vol.31 , Issue.1 , pp. 39-58
    • Zeng, Z.1    Pantic, M.2    Roisman, G.I.3    Huang, T.S.4
  • 9
    • 51449104640
    • Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space?
    • Las Vegas, NV
    • B. Schuller, M. Wimmer, L. Mösenlechner, C. Kern, D. Arsic, and G. Rigoll, "Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space?," in Proc. ICASSP'08, Las Vegas, NV, 2008, pp. 4501-4504.
    • (2008) Proc. ICASSP'08 , pp. 4501-4504
    • Schuller, B.1    Wimmer, M.2    Mösenlechner, L.3    Kern, C.4    Arsic, D.5    Rigoll, G.6
  • 10
    • 14644439843
    • Toward detecting emotions in spoken dialogs
    • Mar
    • M. Lee and S. S. Narayanan, "Toward detecting emotions in spoken dialogs," IEEE Trans. Speech Audio Process., vol.13, no.2, pp. 293-303, Mar. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 293-303
    • Lee, M.1    Narayanan, S.S.2
  • 11
    • 21544459345
    • Challenges in real-life emotion annotation and machine learning based detection
    • L. Devillers, L. Vidrascu, and L. Lamel, "Challenges in real-life emotion annotation and machine learning based detection," Neural Netw., vol.18, no.4, pp. 407-422, 2005.
    • (2005) Neural Netw , vol.18 , Issue.4 , pp. 407-422
    • Devillers, L.1    Vidrascu, L.2    Lamel, L.3
  • 12
    • 21544482187
    • Emotion in speech: Recognition and application to call centers
    • V. Petrushin, "Emotion in speech: Recognition and application to call centers," Artif. Neural Netw Eng. (ANNIE), pp. 7-12, 1999.
    • (1999) Artif. Neural Netw Eng. (ANNIE) , pp. 7-12
    • Petrushin, V.1
  • 15
    • 68949168239
    • Releasing a thoroughly annotated and processed spontaneous emotional database: The FAU Aibo Emotion Corpus
    • L. Devillers, J. C. Martin, R. Cowie, E. Douglas-Cowie, and A. Batliner, Eds
    • A. Batliner, S. Steidl, and E. Nöth, "Releasing a thoroughly annotated and processed spontaneous emotional database: The FAU Aibo Emotion Corpus," in Proc. Satellite Workshop of LREC 2008 Corpora Res. Emotion and Affect, L. Devillers, J. C. Martin, R. Cowie, E. Douglas-Cowie, and A. Batliner, Eds., 2008, pp. 28-31.
    • (2008) Proc. Satellite Workshop of LREC 2008 Corpora Res. Emotion and Affect , pp. 28-31
    • Batliner, A.1    Steidl, S.2    Nöth, E.3
  • 18
    • 85089273681
    • Getting started with SUSAS: A speech under simulated and actual stress database
    • Rhodes, Greece
    • J. Hansen and S. Bou-Ghazale, "Getting started with SUSAS: A speech under simulated and actual stress database," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 1743-1746.
    • (1997) Proc. Eurospeech , pp. 1743-1746
    • Hansen, J.1    Bou-Ghazale, S.2
  • 19
    • 33750564952
    • Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition
    • Amsterdam, The Netherlands
    • T. Vogt and E. André, "Comparing feature sets for acted and spontaneous speech in view of automatic emotion recognition," in Proc. ICME, Amsterdam, The Netherlands, 2005.
    • (2005) Proc. ICME
    • Vogt, T.1    André, E.2
  • 21
    • 85131413217
    • Design, recording and verification of a Danish emotional speech database
    • Rhodes, Greece
    • I. S. Engberg, A. V. Hansen, O. Andersen, and P. Dalsgaard, "Design, recording and verification of a Danish emotional speech database," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 1695-1698.
    • (1997) Proc. Eurospeech , pp. 1695-1698
    • Engberg, I.S.1    Hansen, A.V.2    Andersen, O.3    Dalsgaard, P.4
  • 23
    • 70450206416
    • The Interspeech 2009 emotion challenge
    • Brighton, U.K
    • B. Schuller, S. Steidl, and A. Batliner, "The Interspeech 2009 emotion challenge," in Proc. Interspeech, Brighton, U.K., 2009, pp. 312-315.
    • (2009) Proc. Interspeech , pp. 312-315
    • Schuller, B.1    Steidl, S.2    Batliner, A.3
  • 24
    • 77949400109
    • The hinterland of emotions: Facing the open-microphone challenge
    • Amsterdam, The Netherlands
    • S. Steidl, B. Schuller, A. Batliner, and D. Seppi, "The hinterland of emotions: Facing the open-microphone challenge," in Proc. ACII, Amsterdam, The Netherlands, 2009, pp. 690-697.
    • (2009) Proc. ACII , pp. 690-697
    • Steidl, S.1    Schuller, B.2    Batliner, A.3    Seppi, D.4
  • 26
    • 84862156369
    • Abandoning emotion classes-Towards continuous emotion recognition with modelling of long-range dependencies
    • Brisbane, Australia
    • M. Wöllmer, F. Eyben, S. Reiter, B. Schuller, C. Cox, E. Douglas-Cowie, and R. Cowie, "Abandoning emotion classes-Towards continuous emotion recognition with modelling of long-range dependencies," in Proc. Interspeech, Brisbane, Australia, 2008, pp. 597-600.
    • (2008) Proc. Interspeech , pp. 597-600
    • Wöllmer, M.1    Eyben, F.2    Reiter, S.3    Schuller, B.4    Cox, C.5    Douglas-Cowie, E.6    Cowie, R.7
  • 27
    • 0031573117
    • Long short-term memory
    • S. Hochreiter and J. Schmidhuber, "Long short-term memory," Neural Comput., vol.9, no.8, pp. 1735-1780, 1997.
    • (1997) Neural Comput , vol.9 , Issue.8 , pp. 1735-1780
    • Hochreiter, S.1    Schmidhuber, J.2
  • 28
    • 27744588611
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • Jun
    • A. Graves and J. Schmidhuber, "Framewise phoneme classification with bidirectional LSTM and other neural network architectures," Neural Netw., vol.18, no.5-6, pp. 602-610, Jun. 2005.
    • (2005) Neural Netw , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 29
    • 70349203870
    • Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks
    • Taipei, Taiwan
    • M. Wöllmer, F. Eyben, J. Keshet, A. Graves, B. Schuller, and G. Rigoll, "Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks," in Proc. ICASSP, Taipei, Taiwan, 2009, pp. 3949-3952.
    • (2009) Proc. ICASSP , pp. 3949-3952
    • Wöllmer, M.1    Eyben, F.2    Keshet, J.3    Graves, A.4    Schuller, B.5    Rigoll, G.6
  • 31
  • 32
    • 57149144228
    • A survey of affect recognition methods: Audio, visual, and spontaneous expressions
    • Jan
    • Z. Zeng, M. Pantic, G. I. Roisman, and T. S. Huang, "A survey of affect recognition methods: Audio, visual, and spontaneous expressions," IEEE Trans. Pattern Anal. Mach. Intell., vol.31, no.1, pp. 39-58, Jan. 2009.
    • (2009) IEEE Trans. Pattern Anal. Mach. Intell. , vol.31 , Issue.1 , pp. 39-58
    • Zeng, Z.1    Pantic, M.2    Roisman, G.I.3    Huang, T.S.4
  • 33
    • 34247624725
    • Evolutionary feature generation in speech emotion recognition
    • Toronto, ON, Canada
    • B. Schuller, S. Reiter, and G. Rigoll, "Evolutionary feature generation in speech emotion recognition," in Proc. ICME'06, Toronto, ON, Canada, 2006, pp. 5-8.
    • (2006) Proc. ICME'06 , pp. 5-8
    • Schuller, B.1    Reiter, S.2    Rigoll, G.3
  • 34
    • 84880315493
    • Emotions analysis and emotion-handling subdialogues
    • W. Wahlster, Ed. Berlin, Germany: Springer
    • M. Streit, A. Batliner, and T. Portele, "Emotions analysis and emotion-handling subdialogues," in SmartKom: Foundations of Multimodal Dialogue Systems, W. Wahlster, Ed. Berlin, Germany: Springer, 2006, pp. 317-332.
    • (2006) SmartKom: Foundations of Multimodal Dialogue Systems , pp. 317-332
    • Streit, M.1    Batliner, A.2    Portele, T.3
  • 36
    • 38049067290
    • Timing levels in segment-based speech emotion recognition
    • Pittsburgh, PA, ISCA
    • B. Schuller and G. Rigoll, "Timing levels in segment-based speech emotion recognition," in Proc. Interspeech'06, Pittsburgh, PA, 2006, pp. 1818-1821, ISCA.
    • (2006) Proc. Interspeech'06 , pp. 1818-1821
    • Schuller, B.1    Rigoll, G.2
  • 37
    • 44849100275
    • Comparing one and two-stage acoustic modeling in the recognition of emotion in speech
    • Kyoto, Japan
    • B. Schuller, B. Vlasenko, R. Minguez, G. Rigoll, and A. Wendemuth, "Comparing one and two-stage acoustic modeling in the recognition of emotion in speech," in Proc. ASRU'07, Kyoto, Japan, 2007, pp. 596-600.
    • (2007) Proc. ASRU'07 , pp. 596-600
    • Schuller, B.1    Vlasenko, B.2    Minguez, R.3    Rigoll, G.4    Wendemuth, A.5
  • 38
    • 0141478857
    • Hidden Markov model-based speech emotion recognition
    • Hong Kong, China
    • B. Schuller, G. Rigoll, and M. Lang, "Hidden Markov model-based speech emotion recognition," in Proc. ICASSP'03, Hong Kong, China, 2003, pp. 1-4.
    • (2003) Proc. ICASSP'03 , pp. 1-4
    • Schuller, B.1    Rigoll, G.2    Lang, M.3
  • 39
    • 38049048651
    • Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing
    • Lisbon, Portugal, A. Paiva, Ed., Heidelberg, Germany, Springer Berlin
    • B. Vlasenko, B. Schuller, A. Wendemuth, and G. Rigoll, "Frame vs. turn-level: Emotion recognition from speech considering static and dynamic processing," in Proc. ACII'07, Lisbon, Portugal, A. Paiva, Ed., Heidelberg, Germany, 2007, vol.LNCS 4738, pp. 139-147, Springer Berlin.
    • (2007) Proc. ACII'07 , vol.LNCS 4738 , pp. 139-147
    • Vlasenko, B.1    Schuller, B.2    Wendemuth, A.3    Rigoll, G.4
  • 40
    • 0031268931
    • Bidirectional recurrent neural networks
    • Nov
    • M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural networks," IEEE Trans. Signal Process., vol.45, no.11, pp. 2673-2681, Nov. 1997.
    • (1997) IEEE Trans. Signal Process. , vol.45 , Issue.11 , pp. 2673-2681
    • Schuster, M.1    Paliwal, K.K.2
  • 41
    • 77949372271
    • A tandem BLSTM-DBN architecture for keyword spotting with enhanced context modeling
    • Vic, Spain
    • M. Wöllmer, F. Eyben, A. Graves, B. Schuller, and G. Rigoll, "A tandem BLSTM-DBN architecture for keyword spotting with enhanced context modeling," in Proc. NOLISP, Vic, Spain, 2009.
    • (2009) Proc. NOLISP
    • Wöllmer, M.1    Eyben, F.2    Graves, A.3    Schuller, B.4    Rigoll, G.5
  • 43
    • 70450185596
    • Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions
    • Brighton, U.K
    • C. Lee, C. Busso, S. Lee, and S. Narayanan, "Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions," in Proc. Interspeech, Brighton, U.K., 2009, pp. 1983-1986.
    • (2009) Proc. Interspeech , pp. 1983-1986
    • Lee, C.1    Busso, C.2    Lee, S.3    Narayanan, S.4
  • 47
    • 77949415384
    • openEAR-Introducing the Munich open-source emotion and affect recognition toolkit
    • Amsterdam, The Netherlands
    • F. Eyben, M. Wöllmer, and B. Schuller, "openEAR-Introducing the Munich open-source emotion and affect recognition toolkit," in Proc. ACII, Amsterdam, The Netherlands, 2009, pp. 576-581.
    • (2009) Proc. ACII , pp. 576-581
    • Eyben, F.1    Wöllmer, M.2    Schuller, B.3
  • 50
    • 70450185591
    • Recognising interest in conversational speech-comparing bag of frames and supra-segmental features
    • Brighton, U.K
    • B. Schuller and G. Rigoll, "Recognising interest in conversational speech-comparing bag of frames and supra-segmental features," in Proc. Interspeech, Brighton, U.K., 2009, pp. 1999-2002.
    • (2009) Proc. Interspeech , pp. 1999-2002
    • Schuller, B.1    Rigoll, G.2
  • 52
    • 0041914606
    • Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
    • S. C. Kremer and J. F. Kolen, Eds. Piscataway, NJ: IEEE Press
    • S. Hochreiter, Y. Bengio, P. Frasconi, and J. Schmidhuber, "Gradient flow in recurrent nets: The difficulty of learning long-term dependencies," in A Field Guide to Dynamical Recurrent Neural Networks, S. C. Kremer and J. F. Kolen, Eds. Piscataway, NJ: IEEE Press, 2001.
    • (2001) A Field Guide to Dynamical Recurrent Neural Networks
    • Hochreiter, S.1    Bengio, Y.2    Frasconi, P.3    Schmidhuber, J.4
  • 53
    • 27744588611
    • Framewise phoneme classification with bidirectional LSTM and other neural network architectures
    • DOI 10.1016/j.neunet.2005.06.042, PII S0893608005001206
    • A. Graves, S. Fernandez, and J. Schmidhuber, "Bidirectional LSTM networks for improved phoneme classification and recognition," in Proc. ICANN, Warsaw, Poland, 2005, vol.18, pp. 602-610. (Pubitemid 43186580)
    • (2005) Neural Networks , vol.18 , Issue.5-6 , pp. 602-610
    • Graves, A.1    Schmidhuber, J.2
  • 57
    • 85009069271
    • Politeness and frustration language in child-machine interactions
    • Aalborg, Denmark
    • S. Arunachalam, D. Gould, E. Anderson, D. Byrd, and S. Narayanan, "Politeness and frustration language in child-machine interactions," in Proc. Eurospeech, Aalborg, Denmark, 2001, pp. 2675-2678.
    • (2001) Proc. Eurospeech , pp. 2675-2678
    • Arunachalam, S.1    Gould, D.2    Anderson, E.3    Byrd, D.4    Narayanan, S.5
  • 58
    • 11244258301
    • Emotion recognition using acoustic features and textual content
    • Z. J. Chuang and C. H. Wu, "Emotion recognition using acoustic features and textual content," in Proc. ICME, 2004, pp. 53-56.
    • (2004) Proc. ICME , pp. 53-56
    • Chuang, Z.J.1    Wu, C.H.2
  • 59
    • 77952101226
    • Use of lexical and affective prosodic cues to emotion by younger and older adults
    • Antwerp, Belgium
    • K. Dupuis and K. Pichora-Fuller, "Use of lexical and affective prosodic cues to emotion by younger and older adults," in Proc. Interspeech, Antwerp, Belgium, 2007, pp. 2237-2240.
    • (2007) Proc. Interspeech , pp. 2237-2240
    • Dupuis, K.1    Pichora-Fuller, K.2
  • 62
    • 84946012706
    • Recognizing emotions from student speech in tutoring dialogues
    • Virgin Islands
    • D. Litman and K. Forbes, "Recognizing emotions from student speech in tutoring dialogues," in Proc. ASRU, Virgin Islands, 2003, pp. 25-30.
    • (2003) Proc. ASRU , pp. 25-30
    • Litman, D.1    Forbes, K.2
  • 63
    • 32844468513
    • Text-to-emotion engine for real time internet communication
    • Staffordshire Univ., Stoke-on-Trent, U.K
    • X. Zhe and A. Boucouvalas, "Text-to-emotion engine for real time internet communication," in Proc. Int. Symp. Commun. Syst., Netw., DSPs, 2002, pp. 164-168, Staffordshire Univ., Stoke-on-Trent, U.K.
    • (2002) Proc. Int. Symp. Commun. Syst., Netw., DSPs , pp. 164-168
    • Zhe, X.1    Boucouvalas, A.2
  • 65
    • 77952107788
    • Posting act tagging using transformation-based learning
    • T. Y. Lin, S. Ohsuga, C. J. Liau, X. Hu, and S. Tsumoto, Eds
    • T. Wu, F. Khan, T. Fisher, L. Shuler, and W. Pottenger, "Posting act tagging using transformation-based learning," in Foundations of Data Mining and Knowledge Discovery, T. Y. Lin, S. Ohsuga, C. J. Liau, X. Hu, and S. Tsumoto, Eds., 2005, pp. 319-331.
    • (2005) Foundations of Data Mining and Knowledge Discovery , pp. 319-331
    • Wu, T.1    Khan, F.2    Fisher, T.3    Shuler, L.4    Pottenger, W.5
  • 66
    • 0038382154
    • A model of textual affect sensing using real-world knowledge
    • H. Liu, H. Lieberman, and T. Selker, "A model of textual affect sensing using real-world knowledge," in Proc. IUI, 2003, pp. 125-132.
    • (2003) Proc. IUI , pp. 125-132
    • Liu, H.1    Lieberman, H.2    Selker, T.3
  • 67
    • 4544316885
    • Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture
    • B. Schuller, G. Rigoll, and M. Lang, "Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture," in Proc. ICASSP, 2004, pp. 577-580.
    • (2004) Proc. ICASSP , pp. 577-580
    • Schuller, B.1    Rigoll, G.2    Lang, M.3
  • 68
    • 17144380230
    • Modeling emotional state and personality for conversational agents
    • J. Breese and G. Ball, "Modeling emotional state and personality for conversational agents," Microsoft, Tech. Rep., 1998.
    • (1998) Microsoft, Tech. Rep
    • Breese, J.1    Ball, G.2
  • 69
    • 77950309848
    • Speech emotion recognition exploiting acoustic and linguistic information sources
    • Patras, Greece
    • G. Rigoll, R. Müller, and B. Schuller, "Speech emotion recognition exploiting acoustic and linguistic information sources," in Proc. SPECOM, Patras, Greece, 2005, pp. 61-67.
    • (2005) Proc. SPECOM , pp. 61-67
    • Rigoll, G.1    Müller, R.2    Schuller, B.3
  • 71
    • 85009145332
    • Prosody-based automatic detection of annoyance and frustration in human-computer dialog
    • Denver, CO
    • J. Ang, R. Dhillon, E. Shriberg, and A. Stolcke, "Prosody-based automatic detection of annoyance and frustration in human-computer dialog," in Proc. Interspeech, Denver, CO, 2002, pp. 2037-2040.
    • (2002) Proc. Interspeech , pp. 2037-2040
    • Ang, J.1    Dhillon, R.2    Shriberg, E.3    Stolcke, A.4
  • 72
    • 85009233228
    • Combining acoustic and language information for emotion recognition
    • C. M. Lee, S. Narayanan, and R. Pieraccini, "Combining acoustic and language information for emotion recognition," in Proc. ICSLP, 2002, pp. 873-1376.
    • (2002) Proc. ICSLP , pp. 873-1376
    • Lee, C.M.1    Narayanan, S.2    Pieraccini, R.3
  • 73
    • 4544240852
    • Emotion detection in task-oriented spoken dialogs
    • Baltimore, MD
    • L. Devillers, L. Lamel, and I. Vasilescu, "Emotion detection in task-oriented spoken dialogs," in Proc. ICME, Baltimore, MD, 2003.
    • (2003) Proc. ICME
    • Devillers, L.1    Lamel, L.2    Vasilescu, I.3
  • 74
    • 33745198227
    • Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensemble
    • Lisbon, Portugal
    • B. Schuller, R. Müller, M. Lang, and G. Rigoll, "Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensemble," in Proc. Interspeech, Lisbon, Portugal, 2005, pp. 805-808.
    • (2005) Proc. Interspeech , pp. 805-808
    • Schuller, B.1    Müller, R.2    Lang, M.3    Rigoll, G.4
  • 76
    • 84957069814
    • Text categorization with support vector machines: Learning with many relevant features
    • Chemnitz, Germany
    • T. Joachims, "Text categorization with support vector machines: Learning with many relevant features," in Proc. ECML, Chemnitz, Germany, 1998, pp. 137-142.
    • (1998) Proc. ECML , pp. 137-142
    • Joachims, T.1
  • 77
    • 70349193703
    • Emotion recognition from speech: Putting ASR in the loop
    • Taipei, Taiwan
    • B. Schuller, A. Batliner, S. Steidl, and D. Seppi, "Emotion recognition from speech: Putting ASR in the loop," in Proc. ICASSP, Taipei, Taiwan, 2009, pp. 4585-4588.
    • (2009) Proc. ICASSP , pp. 4585-4588
    • Schuller, B.1    Batliner, A.2    Steidl, S.3    Seppi, D.4
  • 78
    • 71249091768
    • "The Godfather" vs. "chaos": Comparing linguistic analysis based on online knowledge sources and bags-of-n-grams for movie review valence estimation
    • Barcelona, Spain
    • B. Schuller, J. Schenk, and G. Rigoll, ""The Godfather" vs. "chaos": Comparing linguistic analysis based on online knowledge sources and bags-of-n-grams for movie review valence estimation," in Proc. ICDAR, Barcelona, Spain, 2009, pp. 858-862.
    • (2009) Proc. ICDAR , pp. 858-862
    • Schuller, B.1    Schenk, J.2    Rigoll, G.3
  • 79
    • 62949122697
    • Modeling for optimal probability prediction
    • Sydney, Australia
    • Y. Wang and I. H. Witten, "Modeling for optimal probability prediction," in Proc. 19th Int. Conf. Mach. Learn., Sydney, Australia, 2002, pp. 650-657.
    • (2002) Proc. 19th Int. Conf. Mach. Learn. , pp. 650-657
    • Wang, Y.1    Witten, I.H.2
  • 80
    • 77949350062
    • Robust vocabulary independent keyword spotting with graphical models
    • Merano, Italy
    • M. Wöllmer, F. Eyben, B. Schuller, and G. Rigoll, "Robust vocabulary independent keyword spotting with graphical models," in Proc. ASRU, Merano, Italy, 2009, pp. 349-353.
    • (2009) Proc. ASRU , pp. 349-353
    • Wöllmer, M.1    Eyben, F.2    Schuller, B.3    Rigoll, G.4
  • 81
    • 0037841402
    • Graphical models and automatic speech recognition
    • R. Rosenfeld, M. Ostendorf, S. Khudanpur, and M. Johnson, Eds. New York: Springer Verlag
    • J. A. Bilmes, "Graphical models and automatic speech recognition," in Mathematical Foundations of Speech and Language Processing, R. Rosenfeld, M. Ostendorf, S. Khudanpur, and M. Johnson, Eds. New York: Springer Verlag, 2003, pp. 191-246.
    • (2003) Mathematical Foundations of Speech and Language Processing , pp. 191-246
    • Bilmes, J.A.1
  • 83
    • 0036293559
    • The graphical models toolkit: An open source software system for speech and time-series processing
    • J. Bilmes and G. Zweig, "The graphical models toolkit: An open source software system for speech and time-series processing," in Proc. ICASSP, 2002, pp. 3916-3919.
    • (2002) Proc. ICASSP , pp. 3916-3919
    • Bilmes, J.1    Zweig, G.2
  • 85
    • 70450186589
    • Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks
    • Brighton, U.K
    • M. Wöllmer, F. Eyben, B. Schuller, E. Douglas-Cowie, and R. Cowie, "Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks," in Proc. Interspeech, Brighton, U.K., 2009, pp. 1595-1598.
    • (2009) Proc. Interspeech , pp. 1595-1598
    • Wöllmer, M.1    Eyben, F.2    Schuller, B.3    Douglas-Cowie, E.4    Cowie, R.5
  • 86
    • 84943274699
    • A direct adaptive method for faster backpropagation learning: The RPROP algorithm
    • M. Riedmiller and H. Braun, "A direct adaptive method for faster backpropagation learning: The RPROP algorithm," in Proc. IEEE Int. Conf. Neural Netw., 1993, pp. 586-591.
    • (1993) Proc. IEEE Int. Conf. Neural Netw , pp. 586-591
    • Riedmiller, M.1    Braun, H.2
  • 87
    • 34547518166
    • Support vector regression for automatic recognition of spontaneous emotions in speech
    • Honolulu, HI
    • M. Grimm, K. Kroschel, and S. Narayanan, "Support vector regression for automatic recognition of spontaneous emotions in speech," in Proc. ICASSP, Honolulu, HI, 2007, pp. 1085-1088.
    • (2007) Proc. ICASSP , pp. 1085-1088
    • Grimm, M.1    Kroschel, K.2    Narayanan, S.3
  • 88
    • 52049124063
    • Emotion recognition through multiple modalities: Face, body gesture, speech
    • C. Peter and R. Beale, Eds. New York: Springer, LNCS
    • G. Castellano, L. Kessous, and G. Caridakis, "Emotion recognition through multiple modalities: Face, body gesture, speech," in Affect and Emotion in Human-Computer Interaction, C. Peter and R. Beale, Eds. New York: Springer, 2008, vol.4868, LNCS.
    • (2008) Affect and Emotion in Human-Computer Interaction , vol.4868
    • Castellano, G.1    Kessous, L.2    Caridakis, G.3
  • 89
    • 84898971246
    • An asynchronous hidden Markov model for audio-visual speech recognition
    • S. Bengio, "An asynchronous hidden Markov model for audio-visual speech recognition," Advances in NIPS 15, 2003.
    • (2003) Advances in NIPS , vol.15
    • Bengio, S.1
  • 90
    • 70449526103
    • A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams
    • M. Wöllmer, M. Al-Hames, F. Eyben, B. Schuller, and G. Rigoll, "A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams," Neurocomputing, vol.73, pp. 366-380, 2009.
    • (2009) Neurocomputing , vol.73 , pp. 366-380
    • Wöllmer, M.1    Al-Hames, M.2    Eyben, F.3    Schuller, B.4    Rigoll, G.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.