Volume 2016-January, 2016, Pages 2196-2202

Discriminatively trained recurrent neural networks for continuous dimensional emotion recognition from audio

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COST FUNCTIONS; ERRORS; MEAN SQUARE ERROR; NEURAL NETWORKS; SPEECH RECOGNITION;

EID: 85006110321     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 60

References (27)
  • 1
    • Eduardo Coutinho and Angelo Cangelosi. A neural network model for the prediction of musical emotions. In S. Nefti-Meziani and J. G. Grey, editors, Advances in Cognitive Systems, pages 331-368. IET Publisher, London, UK, 2010.
  • 4
    • Felix A. Gers, Jürgen Schmidhuber, and Fred Cummins. Learning to forget: Continual prediction with LSTM. Neural Computation, 12(10): 2451-2471, 2000.
  • 5
    • Kun Han, Dong Yu, and Ivan Tashev. Speech emotion recognition using deep neural network and extreme learning machine. In Proc. of INTERSPEECH, pages 223-227, Singapore, September 2014. ISCA.
  • 6
    • Lang He, Dongmei Jiang, Le Yang, Ercheng Pei, Peng Wu, and Hichem Sahli. Multimodal affective dimension prediction using deep bidirectional long short-term memory recurrent neural networks. In Proc. of AVEC, pages 73-80, Brisbane, Australia, October 2015. ACM.
  • 7
    • Po-Sen Huang, Minje Kim, Mark Hasegawa-Johnson, and Paris Smaragdis. Joint optimization of masks and deep recurrent neural networks for monaural source separation. IEEE Transactions on Audio, Speech, and Language Processing, 23(12): 2136-2147, December 2015.
  • 8
    • Robert Jenke, Angelika Peer, and Martin Buss. A comparison of evaluation measures for emotion recognition in dimensional space. In Proc. of ACII, pages 822-826, Geneva, Switzerland, September 2013. IEEE.
  • 9
    • Markus Kächele, Patrick Thiam, Günther Palm, Friedhelm Schwenker, and Martin Schels. Ensemble methods for continuous affect recognition: Multimodality, temporality, and challenges. In Proc. of AVEC (held in conjunction with ACM MM), pages 9-16, Brisbane, Australia, October 2015. ACM.
  • 10
    • Brian Kingsbury, Tara N. Sainath, and Hagen Soltau. Scalable minimum Bayes risk training of deep neural network acoustic models using distributed Hessian-free optimization. In Proc. of ICASSP, Kyoto, Japan, 2012.
  • 11
    • Longfei Li, Yong Zhao, Dongmei Jiang, Yanning Zhang, Fengna Wang, I. Gonzalez, E. Valentin, and H. Sahli. Hybrid deep neural network-hidden Markov model (DNN-HMM) based speech emotion recognition. In Proc. of ACII, pages 312-317, Geneva, Switzerland, September 2013. IEEE.
  • 12
    • Lawrence I. Lin. A concordance correlation coefficient to evaluate reproducibility. Biometrics, 45(1): 255-268, March 1989.
  • 13
    • Soroosh Mariooryad and Carlos Busso. Correcting time-continuous emotional labels by modeling the reaction lag of evaluators. IEEE Transactions on Affective Computing, 6(2): 97-108, April-June 2015.
  • 14
    • Angeliki Metallinou, Athanasios Katsamanis, and Shrikanth Narayanan. Tracking continuous emotional trends of participants during affective dyadic interactions using body language and speech information. Image and Vision Computing, 31(2): 137-152, February 2013.
  • 15
    • Fabien Ringeval, Andreas Sonderegger, Jürgen Sauer, and Denis Lalanne. Introducing the RECOLA multimodal corpus of remote collaborative and affective interactions. In Proc. of EmoSPACE (held in conjunction with ACM FG), Shanghai, China, April 2013. 8 pages.
  • 19
    • Erik M. Schmidt and Youngmoo E. Kim. Modeling musical emotion dynamics with conditional random fields. In Proc. of ISMIR, pages 777-782, Miami, FL, USA, 2011.
  • 20
    • Erik M. Schmidt, Jeffrey Scott, and Youngmoo E. Kim. Feature learning in dynamic environments: Modeling the acoustic structure of musical emotion. In Proc. of ISMIR, pages 325-330, 2012.
  • 21
    • Björn Schuller, Michel Valstar, Florian Eyben, Roddy Cowie, and Maja Pantic. AVEC 2012 - the continuous audio/visual emotion challenge. In Louis-Philippe Morency, Dan Bohus, Hamid K. Aghajan, Justine Cassell, Anton Nijholt, and Julien Epps, editors, Proc. of ICMI, pages 449-456, Santa Monica, CA, October 2012. ACM.
  • 23
    • Karel Veselý, Arnab Ghoshal, Lukáš Burget, and Daniel Povey. Sequence-discriminative training of deep neural networks. In Proc. of INTERSPEECH, pages 2345-2349, Lyon, France, 2013. ISCA.
  • 24
    • Fengna Wang, Hichem Sahli, Junbin Gao, Dongmei Jiang, and Werner Verhelst. Relevance units machine based dimensional and continuous speech emotion prediction. Multimedia Tools and Applications, 74(22): 9983-10000, November 2015.
  • 25
    • Felix Weninger, Florian Eyben, and Björn Schuller. The TUM approach to the MediaEval music emotion task using generic affective audio features. In Martha Larson, Xavier Anguera, Timo Reuter, Gareth J. F. Jones, Bogdan Ionescu, Markus Schedl, Tomas Piatrik, Claudia Hauff, and Mohammad Soleymani, editors, Proc. of MediaEval, Barcelona, Spain, October 2013. CEUR.
  • 26
    • Felix Weninger, John R. Hershey, Jonathan Le Roux, and Björn Schuller. Discriminatively trained recurrent neural networks for single-channel speech separation. In Proc. of GlobalSIP, pages 740-744, Atlanta, GA, USA, 2014. IEEE.
  • 27
    • Felix Weninger, Johannes Bergmann, and Björn Schuller. Introducing CURRENNT - the Munich open-source CUDA RecurREnt Neural Network Toolkit. Journal of Machine Learning Research, 16: 547-551, 2015.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.