



Volume 6, Issue 1, 2015, Pages 69-75

Speech emotion recognition using Fourier parameters

Author keywords

affective computing; Fourier parameter model; speaker independent; speech emotion recognition

Indexed keywords

DATABASE SYSTEMS; FOURIER TRANSFORMS; SPEECH;

EID: 84924081025     PISSN: 19493045     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAFFC.2015.2392101     Document Type: Article
Times cited : (374)

References (58)
  • 1
    • 78649328053
    • Survey on speech emotion recognition: Features, classification schemes, and databases
    • M. El Ayadi, M. S. Kamel, and F. Karray, "Survey on speech emotion recognition: Features, classification schemes, and databases," Pattern Recog., vol. 44, no. 3, pp. 572-587, 2011.
    • (2011) Pattern Recog. , vol.44 , Issue.3 , pp. 572-587
    • El Ayadi, M.1    Kamel, M.S.2    Karray, F.3
  • 4
    • 84864723353
    • Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema
    • M. Kotti and F. Paternò, "Speaker-independent emotion recognition exploiting a psychologically-inspired binary cascade classification schema," Int. J. Speech Technol., vol. 15, pp. 131-150, 2012.
    • (2012) Int. J. Speech Technol. , vol.15 , pp. 131-150
    • Kotti, M.1    Paternò, F.2
  • 5
    • 84868138650
    • Opinions and attitudes toward humanoid robots in the middle east
    • N. Mavridis, M. S. Katsaiti, S. Naef, A. Falasi, A. Nuaimi, H. Araifi, and A. Kitbi, "Opinions and attitudes toward humanoid robots in the middle east," AI Soc., vol. 27, no. 4, pp. 517-534, 2012.
    • (2012) AI Soc. , vol.27 , Issue.4 , pp. 517-534
    • Mavridis, N.1    Katsaiti, M.S.2    Naef, S.3    Falasi, A.4    Nuaimi, A.5    Araifi, H.6    Kitbi, A.7
  • 6
    • 79953822842
    • Affect detection: An interdisciplinary review of models, methods, and their applications
    • Jan.-Jun.
    • R. A. Calvo and S. D'Mello, "Affect detection: An interdisciplinary review of models, methods, and their applications," IEEE Trans. Affective Comput., vol. 1, no. 1, pp. 18-37, Jan.-Jun. 2010.
    • (2010) IEEE Trans. Affective Comput. , vol.1 , Issue.1 , pp. 18-37
    • Calvo, R.A.1    D'Mello, S.2
  • 7
    • 65249116503
    • Analysis of emotionally salient aspects of fundamental frequency for emotion detection
    • May
    • C. Busso, S. Lee, and S. Narayanan, "Analysis of emotionally salient aspects of fundamental frequency for emotion detection," IEEE Trans. Audio, Speech, Language Process., vol. 17, no. 4, pp. 582-596, May 2009.
    • (2009) IEEE Trans. Audio, Speech, Language Process. , vol.17 , Issue.4 , pp. 582-596
    • Busso, C.1    Lee, S.2    Narayanan, S.3
  • 9
    • 70350125882
    • An overview of text independent speaker recognition: From features to supervectors
    • T. Kinnunen and H. Z. Li, "An overview of text independent speaker recognition: From features to supervectors," Speech Commun., vol. 52, pp. 12-40, 2010.
    • (2010) Speech Commun. , vol.52 , pp. 12-40
    • Kinnunen, T.1    Li, H.Z.2
  • 10
    • 84866447267
    • Using DTW neural-based MFCC warping to improve emotional speech recognition
    • M. Sheikhan, D. Gharavian, and F. Ashoftedel, "Using DTW neural-based MFCC warping to improve emotional speech recognition," Neural Comput. Appl., vol. 21, pp. 1765-1773, 2011.
    • (2011) Neural Comput. Appl. , vol.21 , pp. 1765-1773
    • Sheikhan, M.1    Gharavian, D.2    Ashoftedel, F.3
  • 12
    • 14644439843
    • Toward detecting emotions in spoken dialogs
    • Mar.
    • C. Lee and S. Narayanan, "Toward detecting emotions in spoken dialogs," IEEE Trans. Speech Audio Process., vol. 13, no. 2, pp. 293-303, Mar. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.2 , pp. 293-303
    • Lee, C.1    Narayanan, S.2
  • 13
    • 70350235187
    • Audio-based emotion recognition in judicial domain: A multilayer support vector machines approach
    • E. Messina, G. Arosio, and F. Archetti, "Audio-based emotion recognition in judicial domain: A multilayer support vector machines approach," Mach. Learn. Data Mining Pattern Recog., vol. 5632, pp. 594-602, 2009.
    • (2009) Mach. Learn. Data Mining Pattern Recog. , vol.5632 , pp. 594-602
    • Messina, E.1    Arosio, G.2    Archetti, F.3
  • 14
    • 0037380186
    • The role of voice quality in communicating emotion, mood and attitude
    • C. Gobl and A. Ni Chasaide, "The role of voice quality in communicating emotion, mood and attitude," Speech Commun., vol. 40, pp. 189-212, 2003.
    • (2003) Speech Commun. , vol.40 , pp. 189-212
    • Gobl, C.1    Ni Chasaide, A.2
  • 15
    • 75249100219
    • Emotion recognition from speech signals using new harmony features
    • B. Yang and M. Lugger, "Emotion recognition from speech signals using new harmony features," Signal Process., vol. 90, pp. 1415-1423, 2010.
    • (2010) Signal Process. , vol.90 , pp. 1415-1423
    • Yang, B.1    Lugger, M.2
  • 16
    • 77955560086
    • A learning approach to hierarchical feature selection and aggregation for audio classification
    • P. Ruvolo, I. Fasel, and J. R. Movellan, "A learning approach to hierarchical feature selection and aggregation for audio classification," Pattern Recog. Lett., vol. 31, pp. 1535-1542, 2010.
    • (2010) Pattern Recog. Lett. , vol.31 , pp. 1535-1542
    • Ruvolo, P.1    Fasel, I.2    Movellan, J.R.3
  • 17
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • Aug.
    • R. J. McAulay and T. F. Quatieri, "Speech analysis/synthesis based on a sinusoidal representation," IEEE Trans. Acoust., Speech Signal Process., vol. 34, no. 4, pp. 744-754, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech Signal Process. , vol.34 , Issue.4 , pp. 744-754
    • McAulay, R.J.1    Quatieri, T.F.2
  • 19
    • 34047256081
    • Sinusoidal model-based analysis and classification of stressed speech
    • May
    • S. Ramamohan and S. Dandapat, "Sinusoidal model-based analysis and classification of stressed speech," IEEE Trans. Audio, Speech, Language Process., vol. 14, no. 3, pp. 737-746, May 2006.
    • (2006) IEEE Trans. Audio, Speech, Language Process. , vol.14 , Issue.3 , pp. 737-746
    • Ramamohan, S.1    Dandapat, S.2
  • 20
    • 0025681006
    • Robust speaker-independent word recognition using static, dynamic and acceleration features: Experiments with Lombard and noisy speech
    • B. A. Hanson and T. H. Applebaum, "Robust speaker-independent word recognition using static, dynamic and acceleration features: Experiments with Lombard and noisy speech," in Proc. Int. Conf. Acoust., Speech, Signal Process., 1990, pp. 857-860.
    • Proc. Int. Conf. Acoust., Speech, Signal Process., 1990 , pp. 857-860
    • Hanson, B.A.1    Applebaum, T.H.2
  • 25
    • 84855898663
    • Cultural dependency analysis for understanding speech emotion
    • N. Kamaruddin, A. Wahab, and C. Quek, "Cultural dependency analysis for understanding speech emotion," Expert Syst. Appl., vol. 39, pp. 5115-5133, 2012.
    • (2012) Expert Syst. Appl. , vol.39 , pp. 5115-5133
    • Kamaruddin, N.1    Wahab, A.2    Quek, C.3
  • 26
    • 27144489164
    • A tutorial on support vector machines for pattern recognition
    • C. J. C. Burges, "A tutorial on support vector machines for pattern recognition," Knowl. Discovery Data Mining, vol. 2, pp. 121-167, 1998.
    • (1998) Knowl. Discovery Data Mining , vol.2 , pp. 121-167
    • Burges, C.J.C.1
  • 27
    • 84908401085
    • An automatic framework for textured 3D video-based facial expression recognition
    • Jul.-Sep.
    • M. Hayat and M. Bennamoun, "An automatic framework for textured 3D video-based facial expression recognition," IEEE Trans. Affective Comput., vol. 5, no. 3, pp. 301-313, Jul.-Sep. 2014.
    • (2014) IEEE Trans. Affective Comput. , vol.5 , Issue.3 , pp. 301-313
    • Hayat, M.1    Bennamoun, M.2
  • 28
    • 85059768339
    • Combination of generative models and SVM based classifier for speech emotion recognition
    • S. Chandrakala and C. C. Sekhar, "Combination of generative models and SVM based classifier for speech emotion recognition," in Proc. IEEE Int. Joint Conf. Neural Netw., 2009, pp. 1374-1379.
    • Proc. IEEE Int. Joint Conf. Neural Netw., 2009 , pp. 1374-1379
    • Chandrakala, S.1    Sekhar, C.C.2
  • 29
    • 4544316885
    • Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture
    • B. Schuller, G. Rigoll, and M. Lang, "Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004, vol. 1, pp. 577-580.
    • Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004 , vol.1 , pp. 577-580
    • Schuller, B.1    Rigoll, G.2    Lang, M.3
  • 31
    • 84876239811
    • Classifier-based learning of nonlinear feature manifold for visualization of emotional speech prosody
    • Jan.-Mar.
    • E. Vayrynen, J. Kortelainen, and T. Seppanen, "Classifier-based learning of nonlinear feature manifold for visualization of emotional speech prosody," IEEE Trans. Affective Comput., vol. 4, no. 1, pp. 47-56, Jan.-Mar. 2013.
    • (2013) IEEE Trans. Affective Comput. , vol.4 , Issue.1 , pp. 47-56
    • Vayrynen, E.1    Kortelainen, J.2    Seppanen, T.3
  • 32
    • 0003425258
    • Prentice Hall, Upper Saddle River, New Jersey 07458, USA
    • 1st ed. Prentice Hall, Upper Saddle River, New Jersey 07458, USA, 1978.
    • (1978) 1st Ed.
    • Rabiner, L.1    Schafer, R.2
  • 33
    • 0016067897
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • B. S. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. Am., vol. 55, no. 6, pp. 1304-1312, 1974.
    • (1974) J. Acoust. Soc. Am. , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.S.1
  • 34
    • 0034229795
    • A comparative study of traditional and newly proposed features for recognition of speech under stress
    • Jul.
    • S. E. Bou-Ghazale and J. Hansen, "A comparative study of traditional and newly proposed features for recognition of speech under stress," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 429-442, Jul. 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 429-442
    • Bou-Ghazale, S.E.1    Hansen, J.2
  • 35
    • 0019075685
    • Some observations on oral air flow during phonation
    • Oct.
    • H. Teager, "Some observations on oral air flow during phonation," IEEE Trans. Acoust. Speech Signal Process., vol. 28, no. 5, pp. 599-601, Oct. 1980.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , Issue.5 , pp. 599-601
    • Teager, H.1
  • 36
    • 0000606530
    • Communication of affects by single vowels
    • L. Kaiser, "Communication of affects by single vowels," Synthese, vol. 14, no. 4, pp. 300-319, 1962.
    • (1962) Synthese , vol.14 , Issue.4 , pp. 300-319
    • Kaiser, L.1
  • 37
    • 0028630509
    • Nonlinear analysis and detection of speech under stressed conditions
    • D. Cairns and J. Hansen, "Nonlinear analysis and detection of speech under stressed conditions," J. Acoust. Soc. Am., vol. 96, pp. 3392-3400, 1994.
    • (1994) J. Acoust. Soc. Am. , vol.96 , pp. 3392-3400
    • Cairns, D.1    Hansen, J.2
  • 38
    • 0035278948
    • Nonlinear feature based classification of speech under stress
    • Mar.
    • G. Zhou, J. Hansen, and J. Kaiser, "Nonlinear feature based classification of speech under stress," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 201-216, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.3 , pp. 201-216
    • Zhou, G.1    Hansen, J.2    Kaiser, J.3
  • 42
    • 75249105202
    • Psychological Motivated Multi-stage Emotion Classification Exploiting Voice Quality features
    • F. Mihelic and J. Zibert, Eds. In-Tech, Vienna, Austria
    • M. Lugger and B. Yang, "Psychological motivated multi-stage emotion classification exploiting voice quality features," in Speech Recognition, F. Mihelic and J. Zibert, Eds. In-Tech, Vienna, Austria, 2008.
    • (2008) Speech Recognition
    • Lugger, M.1    Yang, B.2
  • 44
    • 0019053271
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Aug.
    • S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech Signal Process., vol. 28, no. 4, pp. 357-366, Aug. 1980.
    • (1980) IEEE Trans. Acoust., Speech Signal Process. , vol.28 , Issue.4 , pp. 357-366
    • Davis, S.1    Mermelstein, P.2
  • 46
    • 34547940048
    • Primitives based evaluation and estimation of emotions in speech
    • M. Grimm, K. Kroschel, E. Mower, and S. Narayanan, "Primitives based evaluation and estimation of emotions in speech," Speech Commun., vol. 49, pp. 787-800, 2007.
    • (2007) Speech Commun. , vol.49 , pp. 787-800
    • Grimm, M.1    Kroschel, K.2    Mower, E.3    Narayanan, S.4
  • 47
    • 44149109121
    • Fear-type emotion recognition for future audio-based surveillance systems
    • C. Clavel, I. Vasilescu, L. Devillers, G. Richard, and T. Ehrette, "Fear-type emotion recognition for future audio-based surveillance systems," Speech Commun., vol. 50, pp. 487-503, 2008.
    • (2008) Speech Commun. , vol.50 , pp. 487-503
    • Clavel, C.1    Vasilescu, I.2    Devillers, L.3    Richard, G.4    Ehrette, T.5
  • 48
    • 77956401353
    • Class-level spectral features for emotion recognition
    • D. Bitouk, R. Verma, and A. Nenkova, "Class-level spectral features for emotion recognition," Speech Commun., vol. 52, no. 7-8, pp. 613-625, 2010.
    • (2010) Speech Commun. , vol.52 , Issue.7-8 , pp. 613-625
    • Bitouk, D.1    Verma, R.2    Nenkova, A.3
  • 51
    • 84897096415
    • Iterative feature normalization scheme for automatic emotion detection from speech
    • Oct.-Dec.
    • C. Busso, S. Mariooryad, S. Narayanan, and A. Metallinou, "Iterative feature normalization scheme for automatic emotion detection from speech," IEEE Trans. Affective Comput., vol. 4, no. 4, pp. 386-397, Oct.-Dec. 2013.
    • (2013) IEEE Trans. Affective Comput. , vol.4 , Issue.4 , pp. 386-397
    • Busso, C.1    Mariooryad, S.2    Narayanan, S.3    Metallinou, A.4
  • 52
    • 0036505670
    • A comparison of methods for multiclass support vector machines
    • Mar.
    • C. W. Hsu and C. J. Lin, "A comparison of methods for multiclass support vector machines," IEEE Trans. Neural Netw., vol. 13, no. 2, pp. 415-425, Mar. 2002.
    • (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.2 , pp. 415-425
    • Hsu, C.W.1    Lin, C.J.2
  • 56
    • 84924061481
    • Emotional speech recognition using a novel feature set
    • K. X. Wang, N. An, and L. Li, "Emotional speech recognition using a novel feature set," J. Comput. Inf. Syst., vol. 9, pp. 1-8, 2013.
    • (2013) J. Comput. Inf. Syst. , vol.9 , pp. 1-8
    • Wang, K.X.1    An, N.2    Li, L.3
  • 57
    • 85115260483
    • Floating search method for feature selection with nonmonotonic criterion functions
    • P. Pudil, F. Ferri, J. Novovicova, and J. Kittler, "Floating search method for feature selection with nonmonotonic criterion functions," Pattern Recog., vol. 2, pp. 279-283, 1994.
    • (1994) Pattern Recog. , vol.2 , pp. 279-283
    • Pudil, P.1    Ferri, F.2    Novovicova, J.3    Kittler, J.4
  • 58


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.