메뉴 건너뛰기




Volumn 23, Issue 1, 2013, Pages 215-227

Modular neural-SVM scheme for speech emotion recognition using ANOVA feature selection method

Author keywords

ANOVA; Modular; Neural network; Speech emotion recognition; SVM

Indexed keywords

AUTOMATIC SPEECH RECOGNITION SYSTEM; EMOTIONAL SPEECH RECOGNITION; FEATURE SELECTION METHODS; MODULAR; MULTI-LAYER PERCEPTRON NEURAL NETWORKS; RECOGNITION PERFORMANCE; SPEECH EMOTION RECOGNITION; SVM;

EID: 84879838571     PISSN: 09410643     EISSN: None     Source Type: Journal    
DOI: 10.1007/s00521-012-0814-8     Document Type: Article
Times cited : (82)

References (95)
  • 1
    • 0037382560 scopus 로고    scopus 로고
    • Emotions, speech and the ASR framework
    • Bosch L (2003) Emotions, speech and the ASR framework. Speech Commun 40: 213-225.
    • (2003) Speech Commun , vol.40 , pp. 213-225
    • Bosch, L.1
  • 2
    • 79952619486 scopus 로고    scopus 로고
    • Spoken emotion recognition using hierarchical classifiers
    • Albornoz EM, Milone DH, Rufiner HL (2011) Spoken emotion recognition using hierarchical classifiers. Comput Speech Lang 25: 556-570.
    • (2011) Comput Speech Lang , vol.25 , pp. 556-570
    • Albornoz, E.M.1    Milone, D.H.2    Rufiner, H.L.3
  • 4
    • 38749092393 scopus 로고    scopus 로고
    • Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs
    • Devillers L, Vidrascu L (2006) Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs. In: The proceedings of Interspeech, pp 801-804.
    • (2006) The proceedings of Interspeech , pp. 801-804
    • Devillers, L.1    Vidrascu, L.2
  • 7
    • 0036473090 scopus 로고    scopus 로고
    • This computer responds to user frustration: theory, design and results
    • Klein J, Moon Y, Picard RW (2002) This computer responds to user frustration: theory, design and results. Interact Comput 14: 119-140.
    • (2002) Interact Comput , vol.14 , pp. 119-140
    • Klein, J.1    Moon, Y.2    Picard, R.W.3
  • 8
    • 79960839416 scopus 로고    scopus 로고
    • Enhancement of emotion detection in spoken dialogue systems by combining several information sources
    • López-Cózar R, Silovsky J, Kroul M (2011) Enhancement of emotion detection in spoken dialogue systems by combining several information sources. Speech Commun 53: 1210-1228.
    • (2011) Speech Commun , vol.53 , pp. 1210-1228
    • López-Cózar, R.1    Silovsky, J.2    Kroul, M.3
  • 9
    • 79960846934 scopus 로고    scopus 로고
    • Recognizing affect from speech prosody using hierarchical graphical models
    • Fernandez R, Picard R (2011) Recognizing affect from speech prosody using hierarchical graphical models. Speech Commun 53: 1088-1103.
    • (2011) Speech Commun , vol.53 , pp. 1088-1103
    • Fernandez, R.1    Picard, R.2
  • 10
    • 0038548330 scopus 로고    scopus 로고
    • The production and recognition of emotions in speech: features and algorithms
    • Oudeyer PY (2003) The production and recognition of emotions in speech: features and algorithms. Int J Hum Comput Interact Stud 59: 157-183.
    • (2003) Int J Hum Comput Interact Stud , vol.59 , pp. 157-183
    • Oudeyer, P.Y.1
  • 13
    • 79960836821 scopus 로고    scopus 로고
    • Anger recognition in speech using acoustic and linguistic cues
    • Polzehl T, Schmitt A, Metze F, Wagner M (2011) Anger recognition in speech using acoustic and linguistic cues. Speech Commun 53: 1198-1209.
    • (2011) Speech Commun , vol.53 , pp. 1198-1209
    • Polzehl, T.1    Schmitt, A.2    Metze, F.3    Wagner, M.4
  • 14
    • 85009160710 scopus 로고    scopus 로고
    • Emotion recognition using a data-driven fuzzy inference system
    • Lee CM, Narayanan S (2003) Emotion recognition using a data-driven fuzzy inference system. In: The proceedings of Eurospeech, pp 157-160.
    • (2003) The proceedings of Eurospeech , pp. 157-160
    • Lee, C.M.1    Narayanan, S.2
  • 15
    • 33645971628 scopus 로고    scopus 로고
    • Recognizing student emotions and attitudes on the basis of utterances in spoken tutoring dialogues with both human and computer tutors
    • Litman DJ, Forbes-Riley K (2006) Recognizing student emotions and attitudes on the basis of utterances in spoken tutoring dialogues with both human and computer tutors. Speech Commun 48: 559-590.
    • (2006) Speech Commun , vol.48 , pp. 559-590
    • Litman, D.J.1    Forbes-Riley, K.2
  • 19
    • 0030283946 scopus 로고    scopus 로고
    • Classification of speech under stress using target driven features
    • Womack BD, Hansen JHL (1996) Classification of speech under stress using target driven features. Speech Commun 20: 131-150.
    • (1996) Speech Commun , vol.20 , pp. 131-150
    • Womack, B.D.1    Hansen, J.H.L.2
  • 20
    • 42449116646 scopus 로고    scopus 로고
    • Stressed speech recognition using a warped frequency scale
    • Gharavian D, Ahadi SM (2008) Stressed speech recognition using a warped frequency scale. IEICE Electron Express 5: 187-191.
    • (2008) IEICE Electron Express , vol.5 , pp. 187-191
    • Gharavian, D.1    Ahadi, S.M.2
  • 21
    • 77955421904 scopus 로고    scopus 로고
    • Expression of affect in spontaneous speech: acoustic correlates and automatic detection of irritation and resignation
    • Laukka P, Neiberg D, Forsell M, Karlsson I, Elenius K (2011) Expression of affect in spontaneous speech: acoustic correlates and automatic detection of irritation and resignation. Comput Speech Lang 25: 84-104.
    • (2011) Comput Speech Lang , vol.25 , pp. 84-104
    • Laukka, P.1    Neiberg, D.2    Forsell, M.3    Karlsson, I.4    Elenius, K.5
  • 23
    • 0028630509 scopus 로고
    • Nonlinear analysis and detection of speech under stressed conditions
    • Cairns D, Hansen JHL (1994) Nonlinear analysis and detection of speech under stressed conditions. J Acoust Soc Am 96: 3392-3400.
    • (1994) J Acoust Soc Am , vol.96 , pp. 3392-3400
    • Cairns, D.1    Hansen, J.H.L.2
  • 27
    • 33746410556 scopus 로고    scopus 로고
    • Emotional speech recognition: resources, features, and methods
    • Ververidis D, Kotropoulos C (2006) Emotional speech recognition: resources, features, and methods. Speech Commun 48: 1162-1181.
    • (2006) Speech Commun , vol.48 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2
  • 28
    • 33947164164 scopus 로고    scopus 로고
    • An evaluation of the robustness of existing supervised machine learning approaches to the classifications of emotions in speech
    • Shami M, Verhelst W (2007) An evaluation of the robustness of existing supervised machine learning approaches to the classifications of emotions in speech. Speech Commun 49: 201-212.
    • (2007) Speech Commun , vol.49 , pp. 201-212
    • Shami, M.1    Verhelst, W.2
  • 29
    • 60249092335 scopus 로고    scopus 로고
    • Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection
    • Altun H, Polat G (2009) Boosting selection of speech related features to improve performance of multi-class SVMs in emotion detection. Expert Syst Appl 36: 8197-8203.
    • (2009) Expert Syst Appl , vol.36 , pp. 8197-8203
    • Altun, H.1    Polat, G.2
  • 30
    • 84867721963 scopus 로고    scopus 로고
    • Speech emotion recognition using FCBF feature selection method and GA-optimized fuzzy ARTMAP neural network
    • (published online 27 May 2011). doi: 10. 1007/s00521-011-0643-1
    • Gharavian D, Sheikhan M, Nazerieh AR, Garoucy S (2011) Speech emotion recognition using FCBF feature selection method and GA-optimized fuzzy ARTMAP neural network. Neural Comput Appl (published online 27 May 2011). doi: 10. 1007/s00521-011-0643-1.
    • (2011) Neural Comput Appl
    • Gharavian, D.1    Sheikhan, M.2    Nazerieh, A.R.3    Garoucy, S.4
  • 31
    • 84864944381 scopus 로고    scopus 로고
    • Emotion recognition of speech using small-size selected feature set and ANN-based classifiers: a comparative study
    • Sheikhan M, Safdarkhani MK, Gharavian D (2011) Emotion recognition of speech using small-size selected feature set and ANN-based classifiers: a comparative study. World Appl Sci J 14: 616-625.
    • (2011) World Appl Sci J , vol.14 , pp. 616-625
    • Sheikhan, M.1    Safdarkhani, M.K.2    Gharavian, D.3
  • 32
    • 84864960135 scopus 로고    scopus 로고
    • GMM-based emotion recognition in Farsi language using feature selection algorithms
    • Gharavian D, Sheikhan M, Pezhmanpour M (2011) GMM-based emotion recognition in Farsi language using feature selection algorithms. World Appl Sci J 14: 626-638.
    • (2011) World Appl Sci J , vol.14 , pp. 626-638
    • Gharavian, D.1    Sheikhan, M.2    Pezhmanpour, M.3
  • 33
    • 80052725463 scopus 로고    scopus 로고
    • Emotional states in judicial courtrooms: an experimental investigation
    • Fersini E, Messina E, Archetti F (2012) Emotional states in judicial courtrooms: an experimental investigation. Speech Commun 54: 11-22.
    • (2012) Speech Commun , vol.54 , pp. 11-22
    • Fersini, E.1    Messina, E.2    Archetti, F.3
  • 35
    • 77951137078 scopus 로고    scopus 로고
    • SPSS Inc, Integral Solutions Limited, Chicago
    • SPSS Inc. (2007) Clementine® 12. 0 algorithms guide. Integral Solutions Limited, Chicago.
    • (2007) Clementine® 12.0 algorithms guide
  • 37
    • 64549147125 scopus 로고    scopus 로고
    • Acoustic feature selection for automatic emotion recognition from speech
    • Rong J, Li G, Chen YP (2009) Acoustic feature selection for automatic emotion recognition from speech. Info Process Manage 45: 315-328.
    • (2009) Info Process Manage , vol.45 , pp. 315-328
    • Rong, J.1    Li, G.2    Chen, Y.P.3
  • 38
    • 44949264114 scopus 로고    scopus 로고
    • Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language
    • Kao Y, Lee L (2006) Feature analysis for emotion recognition from Mandarin speech considering the special characteristics of Chinese language. In: The proceedings of international conference on spoken language processing, pp 1814-1817.
    • (2006) The proceedings of international conference on spoken language processing , pp. 1814-1817
    • Kao, Y.1    Lee, L.2
  • 40
    • 63649117187 scopus 로고    scopus 로고
    • Emotion recognition and evaluation of Mandarin speech using weighted D-KNN classification
    • Pao T, Chen Y, Yeh J, Chang Y (2008) Emotion recognition and evaluation of Mandarin speech using weighted D-KNN classification. Int J Innov Comput Info Control 4: 1695-1709.
    • (2008) Int J Innov Comput Info Control , vol.4 , pp. 1695-1709
    • Pao, T.1    Chen, Y.2    Yeh, J.3    Chang, Y.4
  • 43
    • 75249100219 scopus 로고    scopus 로고
    • Emotion recognition from speech signals using new harmony features
    • Yang B, Lugger M (2010) Emotion recognition from speech signals using new harmony features. Signal Process 90: 1415-1423.
    • (2010) Signal Process , vol.90 , pp. 1415-1423
    • Yang, B.1    Lugger, M.2
  • 44
    • 77956401353 scopus 로고    scopus 로고
    • Class-level spectral features for emotion recognition
    • Bitouk D, Verma R, Nenkova A (2010) Class-level spectral features for emotion recognition. Speech Commun 52: 613-625.
    • (2010) Speech Commun , vol.52 , pp. 613-625
    • Bitouk, D.1    Verma, R.2    Nenkova, A.3
  • 45
    • 79960286585 scopus 로고    scopus 로고
    • Segment-based emotion recognition from continuous Mandarin Chinese speech
    • Yeh J, Pao T, Lin C, Tsai Y, Chen Y (2010) Segment-based emotion recognition from continuous Mandarin Chinese speech. Comput Hum Behav 27: 1545-1552.
    • (2010) Comput Hum Behav , vol.27 , pp. 1545-1552
    • Yeh, J.1    Pao, T.2    Lin, C.3    Tsai, Y.4    Chen, Y.5
  • 46
    • 79953659944 scopus 로고    scopus 로고
    • Automatic speech emotion recognition using modulation spectral features
    • Wu S, Falk TH, Chan WY (2011) Automatic speech emotion recognition using modulation spectral features. Speech Commun 53: 768-785.
    • (2011) Speech Commun , vol.53 , pp. 768-785
    • Wu, S.1    Falk, T.H.2    Chan, W.Y.3
  • 47
    • 79952707334 scopus 로고    scopus 로고
    • Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech
    • He L, Lech M, Maddage NC, Allen NB (2011) Study of empirical mode decomposition and spectral analysis for stress and emotion classification in natural speech. Biomed Signal Process Control 6: 139-146.
    • (2011) Biomed Signal Process Control , vol.6 , pp. 139-146
    • He, L.1    Lech, M.2    Maddage, N.C.3    Allen, N.B.4
  • 48
    • 0031381525 scopus 로고    scopus 로고
    • Wrappers for feature subset selection
    • Kohavi R, John GH (1997) Wrappers for feature subset selection. Artif Intell 97: 273-324.
    • (1997) Artif Intell , vol.97 , pp. 273-324
    • Kohavi, R.1    John, G.H.2
  • 49
    • 0000466122 scopus 로고    scopus 로고
    • Survey of independent component analysis
    • Hyvarinen A (1999) Survey of independent component analysis. Neural Comput Surv 2: 94-128.
    • (1999) Neural Comput Surv , vol.2 , pp. 94-128
    • Hyvarinen, A.1
  • 52
    • 70350619300 scopus 로고    scopus 로고
    • Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collections
    • Ververidis D, Kotropoulos C (2006) Fast sequential floating forward selection applied to emotional speech features estimated on DES and SUSAS data collections. In: The proceedings of European signal processing conference, pp 1-5.
    • (2006) The proceedings of European signal processing conference , pp. 1-5
    • Ververidis, D.1    Kotropoulos, C.2
  • 55
    • 84855883762 scopus 로고    scopus 로고
    • Acoustic feature selection and classification of emotions in speech using a 3D continuous emotion model
    • (published online 3 April 2011). doi: 10. 1016/j. bspc. 2011. 02. 008
    • Pérez-Espinosa H, Reyes-García CA, Villaseñor-Pineda L (2011) Acoustic feature selection and classification of emotions in speech using a 3D continuous emotion model. Biomed Signal Process Control (published online 3 April 2011). doi: 10. 1016/j. bspc. 2011. 02. 008.
    • (2011) Biomed Signal Process Control
    • Pérez-Espinosa, H.1    Reyes-García, C.A.2    Villaseñor-Pineda, L.3
  • 57
    • 79551484916 scopus 로고    scopus 로고
    • Classification of emotion in spoken Finnish using vowel-length segments: increasing reliability with a fusion technique
    • Väyrynen E, Toivanen J, Seppänen T (2011) Classification of emotion in spoken Finnish using vowel-length segments: increasing reliability with a fusion technique. Speech Commun 53: 269-282.
    • (2011) Speech Commun , vol.53 , pp. 269-282
    • Väyrynen, E.1    Toivanen, J.2    Seppänen, T.3
  • 58
    • 77950073346 scopus 로고    scopus 로고
    • Spoken emotion recognition through optimum-path forest classification using glottal features
    • Iliev AI, Scordilis MS, Papa JP, Falcão AX (2010) Spoken emotion recognition through optimum-path forest classification using glottal features. Comput Speech Lang 24: 445-460.
    • (2010) Comput Speech Lang , vol.24 , pp. 445-460
    • Iliev, A.I.1    Scordilis, M.S.2    Papa, J.P.3    Falcão, A.X.4
  • 59
    • 78649328053 scopus 로고    scopus 로고
    • Survey on speech emotion recognition: features, classification schemes, and databases
    • El Ayadi M, Kamel MS, Karray F (2011) Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recognit 44: 572-587.
    • (2011) Pattern Recognit , vol.44 , pp. 572-587
    • El Ayadi, M.1    Kamel, M.S.2    Karray, F.3
  • 60
    • 0242721417 scopus 로고    scopus 로고
    • Speech emotion recognition using hidden Markov models
    • Nwe TL, Foo SV, De Silva LC (2003) Speech emotion recognition using hidden Markov models. Speech Commun 41: 603-623.
    • (2003) Speech Commun , vol.41 , pp. 603-623
    • Nwe, T.L.1    Foo, S.V.2    De Silva, L.C.3
  • 63
    • 79960848738 scopus 로고    scopus 로고
    • Application of speaker- and language identification state-of-the-art techniques for emotion recognition
    • (article in press). doi: 10. 1016/j. specom. 2011. 01. 007
    • Kockmann M, Burget L, Černocky JH (2011) Application of speaker- and language identification state-of-the-art techniques for emotion recognition. Speech Commun (article in press). doi: 10. 1016/j. specom. 2011. 01. 007.
    • (2011) Speech Commun
    • Kockmann, M.1    Burget, L.2    Černocky, J.H.3
  • 64
  • 67
    • 33846952503 scopus 로고    scopus 로고
    • Ensemble methods for spoken emotion recognition in call-centers
    • Morrison D, Wang R, de Silva LC (2007) Ensemble methods for spoken emotion recognition in call-centers. Speech Commun 49: 98-112.
    • (2007) Speech Commun , vol.49 , pp. 98-112
    • Morrison, D.1    Wang, R.2    de Silva, L.C.3
  • 68
    • 61549105958 scopus 로고    scopus 로고
    • Support vector machines employing cross-correlation for emotional speech recognition
    • Chandaka S, Chatterjee A, Munshi S (2009) Support vector machines employing cross-correlation for emotional speech recognition. Measurement 42: 611-618.
    • (2009) Measurement , vol.42 , pp. 611-618
    • Chandaka, S.1    Chatterjee, A.2    Munshi, S.3
  • 69
    • 80054835044 scopus 로고    scopus 로고
    • Relevance vector machine based speech emotion recognition. Lecture Notes in Computer Science
    • Wang F, Verhelst W, Sahli H (2011) Relevance vector machine based speech emotion recognition. Lecture Notes in Computer Science. Affect Comput Intell Interact 6975: 111-120.
    • (2011) Affect Comput Intell Interact , vol.6975 , pp. 111-120
    • Wang, F.1    Verhelst, W.2    Sahli, H.3
  • 73
    • 56449106584 scopus 로고    scopus 로고
    • User and context adaptive neural networks for emotion recognition
    • Caridakis G, Karpouzis K, Kollias S (2008) User and context adaptive neural networks for emotion recognition. Neurocomputing 71: 2553-2562.
    • (2008) Neurocomputing , vol.71 , pp. 2553-2562
    • Caridakis, G.1    Karpouzis, K.2    Kollias, S.3
  • 75
    • 79960847182 scopus 로고    scopus 로고
    • Emotion recognition using a hierarchical binary decision tree approach
    • Lee CC, Mower E, Busso C, Lee S, Narayanan S (2011) Emotion recognition using a hierarchical binary decision tree approach. Speech Commun 53: 1162-1171.
    • (2011) Speech Commun , vol.53 , pp. 1162-1171
    • Lee, C.C.1    Mower, E.2    Busso, C.3    Lee, S.4    Narayanan, S.5
  • 76
    • 77952032920 scopus 로고    scopus 로고
    • Multiple classifier systems for the recognition of human emotions. Lecture Notes in Computer Science
    • Schwenker F, Scherer S, Schmidt M, Schels M, Glodek M (2010) Multiple classifier systems for the recognition of human emotions. Lecture Notes in Computer Science. Multiple Classif Syst 5997: 315-324.
    • (2010) Multiple Classif Syst , vol.5997 , pp. 315-324
    • Schwenker, F.1    Scherer, S.2    Schmidt, M.3    Schels, M.4    Glodek, M.5
  • 77
    • 70350580828 scopus 로고    scopus 로고
    • The GMM-SVM supervector approach for the recognition of the emotional status from speech. Lecture Notes in Computer Science
    • Schwenker F, Scherer S, Magdi YM, Palm G (2009) The GMM-SVM supervector approach for the recognition of the emotional status from speech. Lecture Notes in Computer Science. Artif Neural Netw 5768: 894-903.
    • (2009) Artif Neural Netw , vol.5768 , pp. 894-903
    • Schwenker, F.1    Scherer, S.2    Magdi, Y.M.3    Palm, G.4
  • 79
    • 84857911091 scopus 로고    scopus 로고
    • Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels
    • Wu CH, Liang WB (2011) Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels. IEEE Trans Affect Comput 2: 10-21.
    • (2011) IEEE Trans Affect Comput , vol.2 , pp. 10-21
    • Wu, C.H.1    Liang, W.B.2
  • 80
    • 78049286797 scopus 로고    scopus 로고
    • Emotion recognition from speech by combining databases and fusion of classifiers. Lecture Notes in Computer Science
    • Lefter I, Rothkrantz LJM, Wiggers P, van Leeuwen DA (2010) Emotion recognition from speech by combining databases and fusion of classifiers. Lecture Notes in Computer Science. Text Speech Dialogue 6231: 353-360.
    • (2010) Text Speech Dialogue , vol.6231 , pp. 353-360
    • Lefter, I.1    Rothkrantz, L.J.M.2    Wiggers, P.3    van Leeuwen, D.A.4
  • 83
    • 77955423547 scopus 로고    scopus 로고
    • Fiction support for realistic portrayals of fear-type emotional manifestations
    • Clavel C, Vasilescu I, Devillers L (2011) Fiction support for realistic portrayals of fear-type emotional manifestations. Comput Speech Lang 25: 63-83.
    • (2011) Comput Speech Lang , vol.25 , pp. 63-83
    • Clavel, C.1    Vasilescu, I.2    Devillers, L.3
  • 85
    • 79960846940 scopus 로고    scopus 로고
    • Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge
    • (article in press). doi: 10. 1016/j. specom. 2011. 01. 011
    • Schuller B, Batliner A, Steidl S, Seppi D (2011) Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge. Speech Commun (article in press). doi: 10. 1016/j. specom. 2011. 01. 011.
    • (2011) Speech Commun
    • Schuller, B.1    Batliner, A.2    Steidl, S.3    Seppi, D.4
  • 86
    • 33745561205 scopus 로고    scopus 로고
    • An introduction to variable and feature selection
    • Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3: 1157-1182.
    • (2003) J Mach Learn Res , vol.3 , pp. 1157-1182
    • Guyon, I.1    Elisseeff, A.2
  • 88
    • 0013379958 scopus 로고    scopus 로고
    • NIST/SEMATECH
    • NIST/SEMATECH (2011) e-Handbook of statistical methods. (http://www. itl. nist. gov/div898/handbook/).
    • (2011) e-Handbook of statistical methods
  • 89
    • 0032355984 scopus 로고    scopus 로고
    • Classification by pairwise coupling
    • Hastie T, Tibshirani R (1998) Classification by pairwise coupling. Ann Stat 26: 451-471.
    • (1998) Ann Stat , vol.26 , pp. 451-471
    • Hastie, T.1    Tibshirani, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.