SCOPUS 정보 검색 플랫폼

Proceedings - International Conference on Pattern Recognition

Volumn , Issue , 2014, Pages 803-808

Prosodic, Spectral and Voice Quality Feature Selection Using a Long-Term Stopping Criterion for Audio-Based Emotion Recognition

(4) Kachele, Markus a Zharkov, Dimitrij a Meudt, Sascha a Schwenker, Friedhelm a

a UNIVERSITY OF ULM (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

FEATURE EXTRACTION;

EMOTION RECOGNITION; EMOTION RECOGNITION FROM SPEECH; HUMAN MACHINE INTERFACE; STATE-OF-THE-ART PERFORMANCE; STOPPING CRITERIA; SUPRASEGMENTAL FEATURES; TERMINATION CRITERIA; VOICE QUALITY FEATURES;

SPEECH RECOGNITION;

EID: 84918807880 PISSN: 10514651 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICPR.2014.148 Document Type: Conference Paper

Times cited : (22)

References (43)

1
- 84919932657
- A long-term stopping criterion for the forward-backward feature selection algorithm for the classification of emotions in speech
- D. Zharkov, M. Kä chele, S. Meudt, and F. Schwenker, "A long-term stopping criterion for the forward-backward feature selection algorithm for the classification of emotions in speech, " in Joint Proceedings of the T2CT and CCGL Workshops. Otto von Guericke University Magdeburg, 2013, pp. 27-35.
- (2013) Joint Proceedings of the T2CT and CCGL Workshops , pp. 27-35
- Otto von Guericke University Magdeburg¹ Zharkov, D.² Kächele, M.³ Meudt, S.⁴ Schwenker, F.⁵

2
- 85032751766
- Emotion recognition in human-computer interaction
- Jan
- R. Cowie, E. Douglas-Cowie, N. Tsapatsoulis, G. Votsis, S. Kollias, W. Fellenz, and J. Taylor, "Emotion recognition in human-computer interaction, " IEEE Signal Processing Magazine, vol. 18, no. 1, pp. 32-80, Jan 2001.
- (2001) IEEE Signal Processing Magazine , vol.18 , Issue.1 , pp. 32-80
- Cowie, R.¹ Douglas-Cowie, E.² Tsapatsoulis, N.³ Votsis, G.⁴ Kollias, S.⁵ Fellenz, W.⁶ Taylor, J.⁷

3
- 21544459345
- Challenges in real-life emotion annotation and machine learning based detection
- L. Devillers, L. Vidrascu, and L. Lamel, "Challenges in real-life emotion annotation and machine learning based detection, " Neural Networks, vol. 18, pp. 407-422, 2005.
- (2005) Neural Networks , vol.18 , pp. 407-422
- Devillers, L.¹ Vidrascu, L.² Lamel, L.³

4
- 0038548330
- The production and recognition of emotions in speech: Features and algorithms
- P.-Y. Oudeyer, "The production and recognition of emotions in speech: features and algorithms, " International Journal of Human Computer Interaction, vol. 59(1-2), pp. 157-183, 2003.
- (2003) International Journal of Human Computer Interaction , vol.59 , Issue.1-2 , pp. 157-183
- Oudeyer, P.-Y.¹

5
- 80054838542
- Classifier fusion for emotion recognition from speech
- W. Minker, M. Weber, H. Hagras, V. Callagan, and A. D. Kameas, Eds. Springer
- S. Scherer, F. Schwenker, and G. Palm, "Classifier fusion for emotion recognition from speech, " in Advanced Intelligent Environments, W. Minker, M. Weber, H. Hagras, V. Callagan, and A. D. Kameas, Eds. Springer, 2009, pp. 95-117.
- (2009) Advanced Intelligent Environments , pp. 95-117
- Scherer, S.¹ Schwenker, F.² Palm, G.³

6
- 84902358176
- Fusion of audio-visual features using hierarchical classifier systems for the recognition of affective states and the state of depression
- M. De Marsico, A. Tabbone, and A. Fred, Eds. SciTePress
- M. Kä chele, M. Glodek, D. Zharkov, S. Meudt, and F. Schwenker, "Fusion of audio-visual features using hierarchical classifier systems for the recognition of affective states and the state of depression, " in Proceedings of the International Conference on Pattern Recognition Applications and Methods (ICPRAM), M. De Marsico, A. Tabbone, and A. Fred, Eds. SciTePress, 2014, pp. 671-678.
- (2014) Proceedings of the International Conference on Pattern Recognition Applications and Methods (ICPRAM) , pp. 671-678
- Kächele, M.¹ Glodek, M.² Zharkov, D.³ Meudt, S.⁴ Schwenker, F.⁵

7
- 21544458365
- Emotion recognition in humancomputer interaction
- N. Fragopanagos and J. Taylor, "Emotion recognition in humancomputer interaction, " Neural Networks, vol. 18, pp. 389-405, 2005.
- (2005) Neural Networks , vol.18 , pp. 389-405
- Fragopanagos, N.¹ Taylor, J.²

8
- 0345525831
- ser. Affective Science. Oxford University Press, ch. 23
- K. R. Scherer, T. Johnstone, and G. Klasmeyer, Handbook of Affective Sciences-Vocal expression of emotion, ser. Affective Science. Oxford University Press, 2003, ch. 23, pp. 433-456.
- (2003) Handbook of Affective Sciences-Vocal Expression of Emotion , pp. 433-456
- Scherer, K.R.¹ Johnstone, T.² Klasmeyer, G.³

9
- 54249126029
- Real-time emotion recognition from speech using echo state networks
- Springer Berlin Heidelberg
- S. Scherer, M. Oubbati, F. Schwenker, and G. Palm, "Real-time emotion recognition from speech using echo state networks, " in Artificial Neural Networks in Pattern Recognition. Springer Berlin Heidelberg, 2008, pp. 205-216.
- (2008) Artificial Neural Networks in Pattern Recognition , pp. 205-216
- Scherer, S.¹ Oubbati, M.² Schwenker, F.³ Palm, G.⁴

10
- 37549042396
- Emotion recognition from speech using multi-classifier systems and rbf-ensembles
- Springer Berlin Heidelberg
- S. Scherer, F. Schwenker, and G. Palm, "Emotion recognition from speech using multi-classifier systems and rbf-ensembles, " in Speech, Audio, Image and Biomedical Signal Processing using Neural Networks. Springer Berlin Heidelberg, 2008, pp. 49-70.
- (2008) Speech, Audio, Image and Biomedical Signal Processing Using Neural Networks , pp. 49-70
- Scherer, S.¹ Schwenker, F.² Palm, G.³

11
- 0034346176
- Emotion recognition in speech using neural networks
- J. Nicholson, K. Takahashi, and R. Nakatsu, "Emotion recognition in speech using neural networks, " Neural Computing and Applications, vol. 9, pp. 290-296, 2000.
- (2000) Neural Computing and Applications , vol.9 , pp. 290-296
- Nicholson, J.¹ Takahashi, K.² Nakatsu, R.³

12
- 34247610490
- A database of german emotional speech
- F. Burkhardt, A. Paeschke, M. Rolfes, W. Sendlmeier, and B. Weiss, "A database of german emotional speech, " in Proceedings of Interspeech 2005, 2005.
- (2005) Proceedings of Interspeech 2005
- Burkhardt, F.¹ Paeschke, A.² Rolfes, M.³ Sendlmeier, W.⁴ Weiss, B.⁵

13
- 84892621508
- Sensor-fusion in neural networks
- E. Shahbazian, G. Rogova, and M. J. DeWeert, Eds. Springer
- G. Palm and F. Schwenker, "Sensor-fusion in neural networks, " in Harbour Protection Through Data Fusion Technologies, E. Shahbazian, G. Rogova, and M. J. DeWeert, Eds. Springer, 2009, pp. 299-306.
- (2009) Harbour Protection Through Data Fusion Technologies , pp. 299-306
- Palm, G.¹ Schwenker, F.²

14
- 77952032920
- Multiple classifier systems for the recognition of human emotions
- ser. LNCS 5997, N. E. Gayar, J. Kittler, and F. Roli, Eds. Springer
- F. Schwenker, S. Scherer, M. Schmidt, M. Schels, and M. Glodek, "Multiple classifier systems for the recognition of human emotions, " in Proceedings of the 9th International Workshop on Multiple Classifier Systems (MCS'10), ser. LNCS 5997, N. E. Gayar, J. Kittler, and F. Roli, Eds. Springer, 2010, pp. 315-324.
- (2010) Proceedings of the 9th International Workshop on Multiple Classifier Systems (MCS'10) , pp. 315-324
- Schwenker, F.¹ Scherer, S.² Schmidt, M.³ Schels, M.⁴ Glodek, M.⁵

15
- 85009063582
- Emotion recognition based on phoneme classes
- C. M. Lee, S. Yildirim, M. Bulut, A. Kazemzadeh, C. Busso, Z. Deng, S. Lee, and S. S. Narayanan, "Emotion recognition based on phoneme classes, " in Proceedings of ICSLP 04, 2004.
- (2004) Proceedings of ICSLP 04
- Lee, C.M.¹ Yildirim, S.² Bulut, M.³ Kazemzadeh, A.⁴ Busso, C.⁵ Deng, Z.⁶ Lee, S.⁷ Narayanan, S.S.⁸

16
- 84867629933
- On instance selection in audio based emotion recognition
- Springer
- S. Meudt and F. Schwenker, "On instance selection in audio based emotion recognition, " in Proceedings of the 5th IAPR TC3 Workshop on Artificial Neural Networks for Pattern Recognition (ANNPR'12). Springer, 2012, pp. 186-192.
- (2012) Proceedings of the 5th IAPR TC3 Workshop on Artificial Neural Networks for Pattern Recognition (ANNPR'12) , pp. 186-192
- Meudt, S.¹ Schwenker, F.²

17
- 0242721417
- Speech emotion recognition using hidden markov models
- T. L. Nwe, S. W. Foo, and L. C. De Silva, "Speech emotion recognition using hidden markov models, " Speech communication, vol. 41, no. 4, pp. 603-623, 2003.
- (2003) Speech Communication , vol.41 , Issue.4 , pp. 603-623
- Nwe, T.L.¹ Foo, S.W.² De Silva, L.C.³

18
- 33745805403
- A fast learning algorithm for deep belief nets
- Jul
- G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets, " Neural Comput., vol. 18, no. 7, pp. 1527-1554, Jul. 2006.
- (2006) Neural Comput , vol.18 , Issue.7 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.-W.³

19
- 84863380535
- Unsupervised feature learning for audio classification using convolutional deep belief networks
- H. Lee, Y. Largman, P. Pham, and A. Y. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks, " in Advances in Neural Information Processing Systems 22, 2009, pp. 1096-1104.
- (2009) Advances in Neural Information Processing Systems 22 , pp. 1096-1104
- Lee, H.¹ Largman, Y.² Pham, P.³ Ng, A.Y.⁴

20
- 80051631315
- Deep neural networks for acoustic emotion recognition: Raising the benchmarks
- A. Stuhlsatz, C. Meyer, F. Eyben, T. ZieIke, G. Meier, and B. Schuller, "Deep neural networks for acoustic emotion recognition: raising the benchmarks, " in Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on. IEEE, 2011, pp. 5688-5691.
- (2011) Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference On. IEEE , pp. 5688-5691
- Stuhlsatz, A.¹ Meyer, C.² Eyben, F.³ Zieike, T.⁴ Meier, G.⁵ Schuller, B.⁶

21
- 84883693964
- Classification of different speaking groups by means of voice quality parameters
- M. Lugger and B. Yang, "Classification of different speaking groups by means of voice quality parameters, " ITG-Fachbericht-Sprachkommunikation 2006, 2006.
- (2006) ITG-Fachbericht-Sprachkommunikation 2006
- Lugger, M.¹ Yang, B.²

22
- 77956733663
- Feature analysis and evaluation for automatic emotion identification in speech
- I. Luengo, E. Navas, and I. Herná ez, "Feature analysis and evaluation for automatic emotion identification in speech, " Multimedia, IEEE Transactions on, vol. 12, no. 6, pp. 490-501, 2010.
- (2010) Multimedia, IEEE Transactions on , vol.12 , Issue.6 , pp. 490-501
- Luengo, I.¹ Navas, E.² Hernáez, I.³

23
- 84867329306
- Investigating fuzzyinput fuzzy-output support vector machines for robust voice quality classification
- Jan
- S. Scherer, J. Kane, C. Gobl, and F. Schwenker, "Investigating fuzzyinput fuzzy-output support vector machines for robust voice quality classification, " Computer Speech and Language, vol. 27, no. 1, pp. 263-287, Jan. 2012.
- (2012) Computer Speech and Language , vol.27 , Issue.1 , pp. 263-287
- Scherer, S.¹ Kane, J.² Gobl, C.³ Schwenker, F.⁴

24
- 79960846940
- Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge
- sensing Emotion and Affect-Facing Realism in Speech Processing
- B. Schuller, A. Batliner, S. Steidl, and D. Seppi, "Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge, " Speech Communication, vol. 53, no. 9-10, pp. 1062-1087, 2011, sensing Emotion and Affect-Facing Realism in Speech Processing.
- (2011) Speech Communication , vol.53 , Issue.9-10 , pp. 1062-1087
- Schuller, B.¹ Batliner, A.² Steidl, S.³ Seppi, D.⁴

25
- 77949415384
- Openear-introducing the munich open-source emotion and affect recognition toolkit
- F. Eyben, M. Wollmer, and B. Schuller, "Openear-introducing the munich open-source emotion and affect recognition toolkit, " in Affective Computing and Intelligent Interaction and Workshops, 2009. ACII 2009. 3rd International Conference on, 2009, pp. 1-6.
- (2009) Affective Computing and Intelligent Interaction and Workshops, 2009. ACII 2009. 3rd International Conference on , pp. 1-6
- Eyben, F.¹ Wollmer, M.² Schuller, B.³

26
- 33745561205
- An introduction to variable and feature selection
- Mar
- I. Guyon and A. Elisseeff, "An introduction to variable and feature selection, " J. Mach. Learn. Res., vol. 3, pp. 1157-1182, Mar. 2003.
- (2003) J. Mach. Learn. Res , vol.3 , pp. 1157-1182
- Guyon, I.¹ Elisseeff, A.²

27
- 0031381525
- Wrappers for feature subset selection
- Dec
- R. Kohavi and G. H. John, "Wrappers for feature subset selection, " Artif. Intell., vol. 97, no. 1-2, pp. 273-324, Dec. 1997.
- (1997) Artif. Intell , vol.97 , Issue.1-2 , pp. 273-324
- Kohavi, R.¹ John, G.H.²

28
- 34547496515
- The relevance of voice quality features in speaker independent emotion recognition
- IEEE
- M. Lugger and B. Yang, "The relevance of voice quality features in speaker independent emotion recognition, " in Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, vol. 4. IEEE, 2007, pp. IV-17.
- (2007) Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on , vol.4 , pp. IV-17
- Lugger, M.¹ Yang, B.²

29
- 84865726860
- Identifying regions of non-modal phonation using features of the wavelet transform
- J. Kane and C. Gobl, "Identifying regions of non-modal phonation using features of the wavelet transform." in INTERSPEECH, 2011, pp. 177-180.
- (2011) INTERSPEECH , pp. 177-180
- Kane, J.¹ Gobl, C.²

30
- 79955528226
- Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation
- T. Drugman, B. Bozkurt, and T. Dutoit, "Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation, " Speech Communication, vol. 53, no. 6, pp. 855-866, 2011.
- (2011) Speech Communication , vol.53 , Issue.6 , pp. 855-866
- Drugman, T.¹ Bozkurt, B.² Dutoit, T.³

31
- 33947684811
- A four-parameter model of glottal flow
- G. Fant, J. Liljencrants, and Q.-g. Lin, "A four-parameter model of glottal flow, " STL-QPSR, vol. 4, no. 1985, pp. 1-13, 1985.
- (1985) STL-QPSR , vol.4 , Issue.1985 , pp. 1-13
- Fant, G.¹ Liljencrants, J.² Lin, Q.³

32
- 0036339929
- Normalized amplitude quotient for parametrization of the glottal flow
- P. Alku, T. Bä ckströ m, and E. Vilkman, "Normalized amplitude quotient for parametrization of the glottal flow, " the Journal of the Acoustical Society of America, vol. 112, p. 701, 2002.
- (2002) The Journal of the Acoustical Society of America , vol.112 , pp. 701
- Alku, P.¹ Bäckström, T.² Vilkman, E.³

33
- 70450163450
- Comparison of multiple voice source parameters in different phonation types
- M. Airas and P. Alku, "Comparison of multiple voice source parameters in different phonation types." in INTERSPEECH, 2007, pp. 1410-1413.
- (2007) INTERSPEECH , pp. 1410-1413
- Airas, M.¹ Alku, P.²

34
- 84919928885
- J. Kane and C. Gobl, "Wavelet maxima dispersion for breathy to tense voice discrimination, " 2013.
- (2013) Wavelet Maxima Dispersion for Breathy to Tense Voice Discrimination
- Kane, J.¹ Gobl, C.²

35
- 0032762471
- A statistical model-based voice activity detection
- J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection, " Signal Processing Letters, IEEE, vol. 6, no. 1, pp. 1-3, 1999.
- (1999) Signal Processing Letters, IEEE , vol.6 , Issue.1 , pp. 1-3
- Sohn, J.¹ Kim, N.S.² Sung, W.³

36
- 84868554211
- Automatic emotion classification vs human perception: Comparing machine performance to the human benchmark
- J. Esparza, S. Scherer, A. Brechmann, and F. Schwenker, "Automatic emotion classification vs. human perception: Comparing machine performance to the human benchmark, " in Proceedings of the 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA'12). IEEE, 2012, pp. 1286-1291.
- (2012) Proceedings of the 11th International Conference on Information Science, Signal Processing and Their Applications (ISSPA'12). IEEE , pp. 1286-1291
- Esparza, J.¹ Scherer, S.² Brechmann, A.³ Schwenker, F.⁴

37
- 27244456854
- Comparison of multiclass svm decomposition schemes for visual object recognition
- Springer Berlin Heidelberg
- L. Kahsay, F. Schwenker, and G. Palm, "Comparison of multiclass svm decomposition schemes for visual object recognition, " in Pattern Recognition. Springer Berlin Heidelberg, 2005, pp. 334-341.
- (2005) Pattern Recognition , pp. 334-341
- Kahsay, L.¹ Schwenker, F.² Palm, G.³

38
- 0036505670
- A comparison of methods for multiclass support vector machines
- C.-W. Hsu and C.-J. Lin, "A comparison of methods for multiclass support vector machines, " Neural Networks, IEEE Transactions on, vol. 13, no. 2, pp. 415-425, 2002.
- (2002) Neural Networks, IEEE Transactions on , vol.13 , Issue.2 , pp. 415-425
- Hsu, C.-W.¹ Lin, C.-J.²

39
- 38149007434
- Logos-Verlag
- A. Paeschke, Prosodische Analyse emotionaler Sprechweise. Logos-Verlag, 2003.
- (2003) Prosodische Analyse Emotionaler Sprechweise
- Paeschke, A.¹

40
- 0017712350
- Evidence for a three-factor theory of emotions
- J. A. Russell and A. Mehrabian, "Evidence for a three-factor theory of emotions, " Journal of Research in Personality, vol. 11, no. 3, pp. 273-294, 1977.
- (1977) Journal of Research in Personality , vol.11 , Issue.3 , pp. 273-294
- Russell, J.A.¹ Mehrabian, A.²

41
- 84919928884
- Using unlabeled data to improve classification of emotional states in human computer interaction
- M. Schels, M. Kä chele, M. Glodek, D. Hrabal, S. Walter, and F. Schwenker, "Using unlabeled data to improve classification of emotional states in human computer interaction, " Journal on Multimodal User Interfaces (JMUI), 2013.
- (2013) Journal on Multimodal User Interfaces (JMUI)
- Schels, M.¹ Kächele, M.² Glodek, M.³ Hrabal, D.⁴ Walter, S.⁵ Schwenker, F.⁶

42
- 84890861989
- Semi-supervised dictionary learning of sparse representations for emotion recognition
- ser. Lecture Notes in Computer Science, Z.-H. Zhou and F. Schwenker, Eds. Springer Berlin Heidelberg
- M. Kä chele and F. Schwenker, "Semi-supervised dictionary learning of sparse representations for emotion recognition, " in Partially Supervised Learning, ser. Lecture Notes in Computer Science, Z.-H. Zhou and F. Schwenker, Eds. Springer Berlin Heidelberg, 2013, pp. 21-35.
- (2013) Partially Supervised Learning , pp. 21-35
- Kächele, M.¹ Schwenker, F.²

43
- 84908097374
- Combination of sequential class distributions from multiple channels using markov fusion networks
- M. Glodek, M. Schels, F. Schwenker, and G. Palm, "Combination of sequential class distributions from multiple channels using markov fusion networks, " Journal on Multimodal User Interfaces, pp. 1-16, 2014
- (2014) Journal on Multimodal User Interfaces , pp. 1-16
- Glodek, M.¹ Schels, M.² Schwenker, F.³ Palm, G.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.