메뉴 건너뛰기




Volumn , Issue , 2014, Pages 3724-3728

Emotion detection in speech using deep networks

Author keywords

CRBMs; CRF; Deep Networks; Emotion Recognition; Hybrid Models

Indexed keywords

IMAGE RETRIEVAL; SIGNAL PROCESSING;

EID: 84905252886     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854297     Document Type: Conference Paper
Times cited : (33)

References (32)
  • 1
    • 84946012706 scopus 로고    scopus 로고
    • Recognizing emotions from student speech in tutoring dialogues
    • D. Litman and K. Forbes, "Recognizing emotions from student speech in tutoring dialogues," in ASRU, 2003.
    • (2003) ASRU
    • Litman, D.1    Forbes, K.2
  • 2
    • 48149092146 scopus 로고    scopus 로고
    • The montreal affective voices: A validated set of nonverbal affect bursts for research on auditory affective processing
    • P. Belin, S. Fillion-Bilodeau, and F. Gosselin, "The montreal affective voices: A validated set of nonverbal affect bursts for research on auditory affective processing," in Behavior Research Methods, 2008.
    • (2008) Behavior Research Methods
    • Belin, P.1    Fillion-Bilodeau, S.2    Gosselin, F.3
  • 3
    • 80051631315 scopus 로고    scopus 로고
    • Deep neural networks for acoustic emotion recognition: Raising the benchmarks
    • A. Stuhlsatz, C. Meyer, F. Eyben, T. ZieIke, G. Meier, and B. Schuller, "Deep neural networks for acoustic emotion recognition: Raising the benchmarks," in ICASSP, 2011.
    • (2011) ICASSP
    • Stuhlsatz, A.1    Meyer, C.2    Eyben, F.3    Zieike, T.4    Meier, G.5    Schuller, B.6
  • 4
    • 80054836058 scopus 로고    scopus 로고
    • Avec 2011-the first international audio visual emotion challenge
    • B. Schuller and et al., "Avec 2011-the first international audio visual emotion challenge," in ACII, 2011.
    • (2011) ACII
    • Schuller, B.1
  • 5
    • 84881518935 scopus 로고    scopus 로고
    • Modeling latent discriminative dynamic of multi-dimensional affective signals
    • G. Ramirez, T. Baltrusaitis, and L. P. Morency, "Modeling latent discriminative dynamic of multi-dimensional affective signals," in ACII, 2011.
    • (2011) ACII
    • Ramirez, G.1    Baltrusaitis, T.2    Morency, L.P.3
  • 6
    • 84885679134 scopus 로고    scopus 로고
    • Affect analysis in natural human interactions using joint hidden conditional random fields
    • B. Siddiquie, S. Khan, A. Divakaran, and H. Sawhney, "Affect analysis in natural human interactions using joint hidden conditional random fields," in ICME, 2013.
    • (2013) ICME
    • Siddiquie, B.1    Khan, S.2    Divakaran, A.3    Sawhney, H.4
  • 7
    • 77949395673 scopus 로고    scopus 로고
    • Acoustic emotion recognition: A benchmark comparison of performances
    • Bjorn Schuller, Bogdan Vlasenko, Florian Eyben, Gerhard Rigoll, and Andreas Wendemuth, "Acoustic emotion recognition: A benchmark comparison of performances," in ASRU, 2009.
    • (2009) ASRU
    • Schuller, B.1    Vlasenko, B.2    Eyben, F.3    Rigoll, G.4    Wendemuth, A.5
  • 8
    • 84885629060 scopus 로고    scopus 로고
    • Multiple classifier systems for the classification of audio-visual emotional states
    • M. Glodek and et al., "Multiple classifier systems for the classification of audio-visual emotional states," in ACII, 2011.
    • (2011) ACII
    • Glodek, M.1
  • 9
    • 69349090197 scopus 로고    scopus 로고
    • Learning deep architectures for ai
    • Y. Bengio, "Learning deep architectures for ai," in FTML, 2009.
    • (2009) FTML
    • Bengio, Y.1
  • 10
    • 84890526837 scopus 로고    scopus 로고
    • New types of deep neural network leaning for speech recognition and related applications: An overview
    • L. Deng, G. Hinton, and B. Kingsbury, "New types of deep neural network leaning for speech recognition and related applications: An overview," in ICASSP, 2013.
    • (2013) ICASSP
    • Deng, L.1    Hinton, G.2    Kingsbury, B.3
  • 11
    • 84905225084 scopus 로고    scopus 로고
    • Deep learning for signal and information processing
    • L. Deng and D. Yu, "Deep learning for signal and information processing," in FTML, 2013.
    • (2013) FTML
    • Deng, L.1    Yu, D.2
  • 13
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G. E. Hinton, S. Osindero, and Y. W. Teh, "A fast learning algorithm for deep belief nets," in NC, 2006.
    • (2006) NC
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.W.3
  • 15
    • 84890526379 scopus 로고    scopus 로고
    • Deep learning for robust feature generation in audiovisual emotion recognition
    • Yelin Kim, Honglak Lee, and Emily Mower Provost, "Deep learning for robust feature generation in audiovisual emotion recognition," in ICASSP, 2013.
    • (2013) ICASSP
    • Kim, Y.1    Lee, H.2    Mower Provost, E.3
  • 16
    • 84864026688 scopus 로고    scopus 로고
    • Modeling human motion using binary latent variables
    • G.W. Taylor and et. al., "Modeling human motion using binary latent variables," in NIPS, 2007.
    • (2007) NIPS
    • Taylor, G.W.1
  • 17
    • 34547997421 scopus 로고    scopus 로고
    • Learning multilevel distributed representations for high-dimensional sequences
    • I. Sutskever and G. E. Hinton, "Learning multilevel distributed representations for high-dimensional sequences," in AISTATS, 2007.
    • (2007) AISTATS
    • Sutskever, I.1    Hinton, G.E.2
  • 18
    • 84904696764 scopus 로고    scopus 로고
    • Phone recognition using restricted boltzmann machines
    • A. R. Mohamed and G. E. Hinton, "Phone recognition using restricted boltzmann machines," in ICASSP, 2009.
    • (2009) ICASSP
    • Mohamed, A.R.1    Hinton, G.E.2
  • 19
    • 84867129058 scopus 로고    scopus 로고
    • Modeling temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription
    • N. B. Lewandowski, Y. Bengio, and P. Vincent, "Modeling temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription," in ICML, 2012.
    • (2012) ICML
    • Lewandowski, N.B.1    Bengio, Y.2    Vincent, P.3
  • 20
    • 84867614591 scopus 로고    scopus 로고
    • Scalable stacking and learning for building deep architectures
    • L. Deng, D. Yu, and J. Platt, "Scalable stacking and learning for building deep architectures," in Interspeech, 2012.
    • (2012) Interspeech
    • Deng, L.1    Yu, D.2    Platt, J.3
  • 21
    • 84879301618 scopus 로고    scopus 로고
    • Tensor deep stacking networks
    • B. Hutchinson, L. Deng, and D. Yu, "Tensor deep stacking networks," in TPAMI, 2013.
    • (2013) TPAMI
    • Hutchinson, B.1    Deng, L.2    Yu, D.3
  • 22
    • 84055222005 scopus 로고    scopus 로고
    • Contextdependent pre-trained deep neural networks for large vocabulary speech recognition
    • G. Dahl, D. Yu, L. Deng, and A. Acero, "Contextdependent pre-trained deep neural networks for large vocabulary speech recognition," in ICASSP, 2012.
    • (2012) ICASSP
    • Dahl, G.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 23
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech, 2011.
    • (2011) Interspeech
    • Seide, F.1    Li, G.2    Yu, D.3
  • 24
    • 84898996216 scopus 로고    scopus 로고
    • Speech recognition using svms
    • N. Smith and M. Gales, "Speech recognition using svms," in NIPS, 2002.
    • (2002) NIPS
    • Smith, N.1    Gales, M.2
  • 25
    • 39549089484 scopus 로고    scopus 로고
    • Semi-supervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle
    • A. Fujino, N. Ueda, and K. Saito, "Semi-supervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle," in TPAMI, 2008.
    • (2008) TPAMI
    • Fujino, A.1    Ueda, N.2    Saito, K.3
  • 26
    • 56449110012 scopus 로고    scopus 로고
    • Classification using discriminative restricted boltzmann machines
    • H. Larochelle and Y. Bengio, "Classification using discriminative restricted boltzmann machines," in ICML, 2008.
    • (2008) ICML
    • Larochelle, H.1    Bengio, Y.2
  • 28
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in ICML, 2001.
    • (2001) ICML
    • Lafferty, J.1    McCallum, A.2    Pereira, F.3
  • 29
    • 0013344078 scopus 로고    scopus 로고
    • Training products of experts by minimizing contrastive divergence
    • G. E. Hinton, "Training products of experts by minimizing contrastive divergence," in NC, 2002.
    • (2002) NC
    • Hinton, G.E.1
  • 31
    • 54049132925 scopus 로고    scopus 로고
    • The vera am mittag german audio-visual emotional speech database
    • M. Grimm, K. Kroschel, and S. Narayanan, "The vera am mittag german audio-visual emotional speech database," in ICME, 2008.
    • (2008) ICME
    • Grimm, M.1    Kroschel, K.2    Narayanan, S.3
  • 32
    • 84893945649 scopus 로고    scopus 로고
    • Opensmile: The munich versatile and fast open-source audio feature extractor
    • Florian Eyben, Martin Wollmer, and Bjorn Schuller, "opensmile: The munich versatile and fast open-source audio feature extractor," in ACM MM, 2010.
    • (2010) ACM MM
    • Eyben, F.1    Wollmer, M.2    Schuller, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.