SCOPUS Information Retrieval Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2014, Pages 3724-3728

Emotion detection in speech using deep networks

(4) Amer, Mohamed R.ᵃ; Siddiquie, Behjatᵃ; Richey, Colleenᵃ; Divakaran, Ajayᵃ

ᵃ SRI International (United States)

Author keywords

CRBMs; CRF; Deep Networks; Emotion Recognition; Hybrid Models

Indexed keywords

IMAGE RETRIEVAL; SIGNAL PROCESSING;

CRBMS; CRF; DISCRIMINATIVE CLASSIFIERS; DISCRIMINATIVE MODELS; EMOTION DETECTION; EMOTION RECOGNITION; HYBRID MODEL; TEMPORAL DYNAMICS;

SPEECH RECOGNITION;

EID: 84905252886
PISSN: 15206149
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2014.6854297
Document Type: Conference Paper

Times cited (33)

References (32)

1
- 84946012706
- Recognizing emotions from student speech in tutoring dialogues
- D. Litman and K. Forbes, "Recognizing emotions from student speech in tutoring dialogues," in ASRU, 2003.
- (2003) ASRU
- Litman, D.¹ Forbes, K.²

2
- 48149092146
- The Montreal Affective Voices: A validated set of nonverbal affect bursts for research on auditory affective processing
- P. Belin, S. Fillion-Bilodeau, and F. Gosselin, "The Montreal Affective Voices: A validated set of nonverbal affect bursts for research on auditory affective processing," in Behavior Research Methods, 2008.
- (2008) Behavior Research Methods
- Belin, P.¹ Fillion-Bilodeau, S.² Gosselin, F.³

3
- 80051631315
- Deep neural networks for acoustic emotion recognition: Raising the benchmarks
- A. Stuhlsatz, C. Meyer, F. Eyben, T. Zielke, G. Meier, and B. Schuller, "Deep neural networks for acoustic emotion recognition: Raising the benchmarks," in ICASSP, 2011.
- (2011) ICASSP
- Stuhlsatz, A.¹ Meyer, C.² Eyben, F.³ Zielke, T.⁴ Meier, G.⁵ Schuller, B.⁶

4
- 80054836058
- AVEC 2011 - the first international audio visual emotion challenge
- B. Schuller et al., "AVEC 2011 - the first international audio visual emotion challenge," in ACII, 2011.
- (2011) ACII
- Schuller, B.¹

5
- 84881518935
- Modeling latent discriminative dynamic of multi-dimensional affective signals
- G. Ramirez, T. Baltrusaitis, and L. P. Morency, "Modeling latent discriminative dynamic of multi-dimensional affective signals," in ACII, 2011.
- (2011) ACII
- Ramirez, G.¹ Baltrusaitis, T.² Morency, L.P.³

6
- 84885679134
- Affect analysis in natural human interactions using joint hidden conditional random fields
- B. Siddiquie, S. Khan, A. Divakaran, and H. Sawhney, "Affect analysis in natural human interactions using joint hidden conditional random fields," in ICME, 2013.
- (2013) ICME
- Siddiquie, B.¹ Khan, S.² Divakaran, A.³ Sawhney, H.⁴

7
- 77949395673
- Acoustic emotion recognition: A benchmark comparison of performances
- Bjorn Schuller, Bogdan Vlasenko, Florian Eyben, Gerhard Rigoll, and Andreas Wendemuth, "Acoustic emotion recognition: A benchmark comparison of performances," in ASRU, 2009.
- (2009) ASRU
- Schuller, B.¹ Vlasenko, B.² Eyben, F.³ Rigoll, G.⁴ Wendemuth, A.⁵

8
- 84885629060
- Multiple classifier systems for the classification of audio-visual emotional states
- M. Glodek et al., "Multiple classifier systems for the classification of audio-visual emotional states," in ACII, 2011.
- (2011) ACII
- Glodek, M.¹

9
- 69349090197
- Learning deep architectures for AI
- Y. Bengio, "Learning deep architectures for AI," in FTML, 2009.
- (2009) FTML
- Bengio, Y.¹

10
- 84890526837
- New types of deep neural network learning for speech recognition and related applications: An overview
- L. Deng, G. Hinton, and B. Kingsbury, "New types of deep neural network learning for speech recognition and related applications: An overview," in ICASSP, 2013.
- (2013) ICASSP
- Deng, L.¹ Hinton, G.² Kingsbury, B.³

11
- 84905225084
- Deep learning for signal and information processing
- L. Deng and D. Yu, "Deep learning for signal and information processing," in FTML, 2013.
- (2013) FTML
- Deng, L.¹ Yu, D.²

12
- 84905256039
- Recursive compositional models for computer vision
- L. Zhu, Y. H. Chen, and A. Yuille, "Recursive compositional models for computer vision," Journal of Mathematical Imaging and Vision, 2011.
- (2011) Journal of Mathematical Imaging and Vision
- Zhu, L.¹ Chen, Y.H.² Yuille, A.³

13
- 33745805403
- A fast learning algorithm for deep belief nets
- G. E. Hinton, S. Osindero, and Y. W. Teh, "A fast learning algorithm for deep belief nets," in NC, 2006.
- (2006) NC
- Hinton, G.E.¹ Osindero, S.² Teh, Y.W.³

14
- 80053437179
- Multimodal deep learning
- J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A.Y. Ng, "Multimodal deep learning," in ICML, 2011.
- (2011) ICML
- Ngiam, J.¹ Khosla, A.² Kim, M.³ Nam, J.⁴ Lee, H.⁵ Ng, A.Y.⁶

15
- 84890526379
- Deep learning for robust feature generation in audiovisual emotion recognition
- Yelin Kim, Honglak Lee, and Emily Mower Provost, "Deep learning for robust feature generation in audiovisual emotion recognition," in ICASSP, 2013.
- (2013) ICASSP
- Kim, Y.¹ Lee, H.² Mower Provost, E.³

16
- 84864026688
- Modeling human motion using binary latent variables
- G. W. Taylor et al., "Modeling human motion using binary latent variables," in NIPS, 2007.
- (2007) NIPS
- Taylor, G.W.¹

17
- 34547997421
- Learning multilevel distributed representations for high-dimensional sequences
- I. Sutskever and G. E. Hinton, "Learning multilevel distributed representations for high-dimensional sequences," in AISTATS, 2007.
- (2007) AISTATS
- Sutskever, I.¹ Hinton, G.E.²

18
- 84904696764
- Phone recognition using restricted Boltzmann machines
- A. R. Mohamed and G. E. Hinton, "Phone recognition using restricted Boltzmann machines," in ICASSP, 2009.
- (2009) ICASSP
- Mohamed, A.R.¹ Hinton, G.E.²

19
- 84867129058
- Modeling temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription
- N. B. Lewandowski, Y. Bengio, and P. Vincent, "Modeling temporal dependencies in high-dimensional sequences: Application to polyphonic music generation and transcription," in ICML, 2012.
- (2012) ICML
- Lewandowski, N.B.¹ Bengio, Y.² Vincent, P.³

20
- 84867614591
- Scalable stacking and learning for building deep architectures
- L. Deng, D. Yu, and J. Platt, "Scalable stacking and learning for building deep architectures," in Interspeech, 2012.
- (2012) Interspeech
- Deng, L.¹ Yu, D.² Platt, J.³

21
- 84879301618
- Tensor deep stacking networks
- B. Hutchinson, L. Deng, and D. Yu, "Tensor deep stacking networks," in TPAMI, 2013.
- (2013) TPAMI
- Hutchinson, B.¹ Deng, L.² Yu, D.³

22
- 84055222005
- Context-dependent pre-trained deep neural networks for large vocabulary speech recognition
- G. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large vocabulary speech recognition," in ICASSP, 2012.
- (2012) ICASSP
- Dahl, G.¹ Yu, D.² Deng, L.³ Acero, A.⁴

23
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- F. Seide, G. Li, and D. Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech, 2011.
- (2011) Interspeech
- Seide, F.¹ Li, G.² Yu, D.³

24
- 84898996216
- Speech recognition using SVMs
- N. Smith and M. Gales, "Speech recognition using SVMs," in NIPS, 2002.
- (2002) NIPS
- Smith, N.¹ Gales, M.²

25
- 39549089484
- Semi-supervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle
- A. Fujino, N. Ueda, and K. Saito, "Semi-supervised learning for a hybrid generative/discriminative classifier based on the maximum entropy principle," in TPAMI, 2008.
- (2008) TPAMI
- Fujino, A.¹ Ueda, N.² Saito, K.³

26
- 56449110012
- Classification using discriminative restricted Boltzmann machines
- H. Larochelle and Y. Bengio, "Classification using discriminative restricted Boltzmann machines," in ICML, 2008.
- (2008) ICML
- Larochelle, H.¹ Bengio, Y.²

27
- 84904680320
- Multimodal fusion using dynamic hybrid models
- M. R. Amer, B. Siddiquie, S. Khan, A. Divakaran, and H. Sawhney, "Multimodal fusion using dynamic hybrid models," in WACV, 2014.
- (2014) WACV
- Amer, M.R.¹ Siddiquie, B.² Khan, S.³ Divakaran, A.⁴ Sawhney, H.⁵

28
- 0142192295
- Conditional random fields: Probabilistic models for segmenting and labeling sequence data
- J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in ICML, 2001.
- (2001) ICML
- Lafferty, J.¹ McCallum, A.² Pereira, F.³

29
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G. E. Hinton, "Training products of experts by minimizing contrastive divergence," in NC, 2002.
- (2002) NC
- Hinton, G.E.¹

30
- 84872942522
- Mark Schmidt, "UGM: Matlab code for undirected graphical models," 2012.
- (2012) UGM: Matlab Code for Undirected Graphical Models
- Schmidt, M.¹

31
- 54049132925
- The Vera am Mittag German audio-visual emotional speech database
- M. Grimm, K. Kroschel, and S. Narayanan, "The Vera am Mittag German audio-visual emotional speech database," in ICME, 2008.
- (2008) ICME
- Grimm, M.¹ Kroschel, K.² Narayanan, S.³

32
- 84893945649
- openSMILE: The Munich versatile and fast open-source audio feature extractor
- Florian Eyben, Martin Wollmer, and Bjorn Schuller, "openSMILE: The Munich versatile and fast open-source audio feature extractor," in ACM MM, 2010.
- (2010) ACM MM
- Eyben, F.¹ Wollmer, M.² Schuller, B.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.