



Volume , Issue , 2014, Pages 803-808

Prosodic, Spectral and Voice Quality Feature Selection Using a Long-Term Stopping Criterion for Audio-Based Emotion Recognition

Author keywords

[No Author keywords available]

Indexed keywords

FEATURE EXTRACTION;

EID: 84918807880     PISSN: 10514651     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICPR.2014.148     Document Type: Conference Paper
Times cited : (22)

References (43)
  • 3
    • 21544459345
    • Challenges in real-life emotion annotation and machine learning based detection
    • L. Devillers, L. Vidrascu, and L. Lamel, "Challenges in real-life emotion annotation and machine learning based detection," Neural Networks, vol. 18, pp. 407-422, 2005.
    • (2005) Neural Networks , vol.18 , pp. 407-422
    • Devillers, L.1    Vidrascu, L.2    Lamel, L.3
  • 4
    • 0038548330
    • The production and recognition of emotions in speech: Features and algorithms
    • P.-Y. Oudeyer, "The production and recognition of emotions in speech: features and algorithms," International Journal of Human Computer Interaction, vol. 59, no. 1-2, pp. 157-183, 2003.
    • (2003) International Journal of Human Computer Interaction , vol.59 , Issue.1-2 , pp. 157-183
    • Oudeyer, P.-Y.1
  • 5
    • 80054838542
    • Classifier fusion for emotion recognition from speech
    • S. Scherer, F. Schwenker, and G. Palm, "Classifier fusion for emotion recognition from speech," in Advanced Intelligent Environments, W. Minker, M. Weber, H. Hagras, V. Callagan, and A. D. Kameas, Eds. Springer, 2009, pp. 95-117.
    • (2009) Advanced Intelligent Environments , pp. 95-117
    • Scherer, S.1    Schwenker, F.2    Palm, G.3
  • 6
  • 7
    • 21544458365
    • Emotion recognition in human-computer interaction
    • N. Fragopanagos and J. Taylor, "Emotion recognition in human-computer interaction," Neural Networks, vol. 18, pp. 389-405, 2005.
    • (2005) Neural Networks , vol.18 , pp. 389-405
    • Fragopanagos, N.1    Taylor, J.2
  • 13
    • 84892621508
    • Sensor-fusion in neural networks
    • G. Palm and F. Schwenker, "Sensor-fusion in neural networks," in Harbour Protection Through Data Fusion Technologies, E. Shahbazian, G. Rogova, and M. J. DeWeert, Eds. Springer, 2009, pp. 299-306.
    • (2009) Harbour Protection Through Data Fusion Technologies , pp. 299-306
    • Palm, G.1    Schwenker, F.2
  • 17
    • 0242721417
    • Speech emotion recognition using hidden Markov models
    • T. L. Nwe, S. W. Foo, and L. C. De Silva, "Speech emotion recognition using hidden Markov models," Speech Communication, vol. 41, no. 4, pp. 603-623, 2003.
    • (2003) Speech Communication , vol.41 , Issue.4 , pp. 603-623
    • Nwe, T.L.1    Foo, S.W.2    De Silva, L.C.3
  • 18
    • 33745805403
    • A fast learning algorithm for deep belief nets
    • G. E. Hinton, S. Osindero, and Y.-W. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, no. 7, pp. 1527-1554, Jul. 2006.
    • (2006) Neural Comput , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.-W.3
  • 19
    • 84863380535
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • H. Lee, Y. Largman, P. Pham, and A. Y. Ng, "Unsupervised feature learning for audio classification using convolutional deep belief networks," in Advances in Neural Information Processing Systems 22, 2009, pp. 1096-1104.
    • (2009) Advances in Neural Information Processing Systems 22 , pp. 1096-1104
    • Lee, H.1    Largman, Y.2    Pham, P.3    Ng, A.Y.4
  • 21
    • 84883693964
    • Classification of different speaking groups by means of voice quality parameters
    • M. Lugger and B. Yang, "Classification of different speaking groups by means of voice quality parameters," ITG-Fachbericht-Sprachkommunikation 2006, 2006.
    • (2006) ITG-Fachbericht-Sprachkommunikation 2006
    • Lugger, M.1    Yang, B.2
  • 22
    • 77956733663
    • Feature analysis and evaluation for automatic emotion identification in speech
    • I. Luengo, E. Navas, and I. Hernáez, "Feature analysis and evaluation for automatic emotion identification in speech," Multimedia, IEEE Transactions on, vol. 12, no. 6, pp. 490-501, 2010.
    • (2010) Multimedia, IEEE Transactions on , vol.12 , Issue.6 , pp. 490-501
    • Luengo, I.1    Navas, E.2    Hernáez, I.3
  • 23
    • 84867329306
    • Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification
    • S. Scherer, J. Kane, C. Gobl, and F. Schwenker, "Investigating fuzzy-input fuzzy-output support vector machines for robust voice quality classification," Computer Speech and Language, vol. 27, no. 1, pp. 263-287, Jan. 2012.
    • (2012) Computer Speech and Language , vol.27 , Issue.1 , pp. 263-287
    • Scherer, S.1    Kane, J.2    Gobl, C.3    Schwenker, F.4
  • 24
    • 79960846940
    • Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge
    • B. Schuller, A. Batliner, S. Steidl, and D. Seppi, "Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge," Speech Communication, vol. 53, no. 9-10, pp. 1062-1087, 2011, Sensing Emotion and Affect-Facing Realism in Speech Processing.
    • (2011) Speech Communication , vol.53 , Issue.9-10 , pp. 1062-1087
    • Schuller, B.1    Batliner, A.2    Steidl, S.3    Seppi, D.4
  • 26
    • 33745561205
    • An introduction to variable and feature selection
    • I. Guyon and A. Elisseeff, "An introduction to variable and feature selection," J. Mach. Learn. Res., vol. 3, pp. 1157-1182, Mar. 2003.
    • (2003) J. Mach. Learn. Res , vol.3 , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 27
    • 0031381525
    • Wrappers for feature subset selection
    • R. Kohavi and G. H. John, "Wrappers for feature subset selection," Artif. Intell., vol. 97, no. 1-2, pp. 273-324, Dec. 1997.
    • (1997) Artif. Intell , vol.97 , Issue.1-2 , pp. 273-324
    • Kohavi, R.1    John, G.H.2
  • 29
    • 84865726860
    • Identifying regions of non-modal phonation using features of the wavelet transform
    • J. Kane and C. Gobl, "Identifying regions of non-modal phonation using features of the wavelet transform," in INTERSPEECH, 2011, pp. 177-180.
    • (2011) INTERSPEECH , pp. 177-180
    • Kane, J.1    Gobl, C.2
  • 30
    • 79955528226
    • Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation
    • T. Drugman, B. Bozkurt, and T. Dutoit, "Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation," Speech Communication, vol. 53, no. 6, pp. 855-866, 2011.
    • (2011) Speech Communication , vol.53 , Issue.6 , pp. 855-866
    • Drugman, T.1    Bozkurt, B.2    Dutoit, T.3
  • 31
    • 33947684811
    • A four-parameter model of glottal flow
    • G. Fant, J. Liljencrants, and Q.-g. Lin, "A four-parameter model of glottal flow," STL-QPSR, vol. 4, no. 1985, pp. 1-13, 1985.
    • (1985) STL-QPSR , vol.4 , Issue.1985 , pp. 1-13
    • Fant, G.1    Liljencrants, J.2    Lin, Q.3
  • 33
    • 70450163450
    • Comparison of multiple voice source parameters in different phonation types
    • M. Airas and P. Alku, "Comparison of multiple voice source parameters in different phonation types," in INTERSPEECH, 2007, pp. 1410-1413.
    • (2007) INTERSPEECH , pp. 1410-1413
    • Airas, M.1    Alku, P.2
  • 35
    • 0032762471
    • A statistical model-based voice activity detection
    • J. Sohn, N. S. Kim, and W. Sung, "A statistical model-based voice activity detection," Signal Processing Letters, IEEE, vol. 6, no. 1, pp. 1-3, 1999.
    • (1999) Signal Processing Letters, IEEE , vol.6 , Issue.1 , pp. 1-3
    • Sohn, J.1    Kim, N.S.2    Sung, W.3
  • 37
    • 27244456854
    • Comparison of multiclass SVM decomposition schemes for visual object recognition
    • L. Kahsay, F. Schwenker, and G. Palm, "Comparison of multiclass SVM decomposition schemes for visual object recognition," in Pattern Recognition. Springer Berlin Heidelberg, 2005, pp. 334-341.
    • (2005) Pattern Recognition , pp. 334-341
    • Kahsay, L.1    Schwenker, F.2    Palm, G.3
  • 38
    • 0036505670
    • A comparison of methods for multiclass support vector machines
    • C.-W. Hsu and C.-J. Lin, "A comparison of methods for multiclass support vector machines," Neural Networks, IEEE Transactions on, vol. 13, no. 2, pp. 415-425, 2002.
    • (2002) Neural Networks, IEEE Transactions on , vol.13 , Issue.2 , pp. 415-425
    • Hsu, C.-W.1    Lin, C.-J.2
  • 40
    • 0017712350
    • Evidence for a three-factor theory of emotions
    • J. A. Russell and A. Mehrabian, "Evidence for a three-factor theory of emotions," Journal of Research in Personality, vol. 11, no. 3, pp. 273-294, 1977.
    • (1977) Journal of Research in Personality , vol.11 , Issue.3 , pp. 273-294
    • Russell, J.A.1    Mehrabian, A.2
  • 42
    • 84890861989
    • Semi-supervised dictionary learning of sparse representations for emotion recognition
    • M. Kächele and F. Schwenker, "Semi-supervised dictionary learning of sparse representations for emotion recognition," in Partially Supervised Learning, ser. Lecture Notes in Computer Science, Z.-H. Zhou and F. Schwenker, Eds. Springer Berlin Heidelberg, 2013, pp. 21-35.
    • (2013) Partially Supervised Learning , pp. 21-35
    • Kächele, M.1    Schwenker, F.2
  • 43
    • 84908097374
    • Combination of sequential class distributions from multiple channels using Markov fusion networks
    • M. Glodek, M. Schels, F. Schwenker, and G. Palm, "Combination of sequential class distributions from multiple channels using Markov fusion networks," Journal on Multimodal User Interfaces, pp. 1-16, 2014.
    • (2014) Journal on Multimodal User Interfaces , pp. 1-16
    • Glodek, M.1    Schels, M.2    Schwenker, F.3    Palm, G.4


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.