Volume 90, Issue 5, 2010, Pages 1415-1423

Emotion recognition from speech signals using new harmony features

Author keywords

Emotion recognition; Feature extraction; Harmony features; Pitch interval

Indexed keywords

EMOTION RECOGNITION; MUSIC THEORY; PITCH CONTOURS; RECOGNITION PERFORMANCE; SPEECH SIGNALS; STATE OF THE ART

EID: 75249100219     PISSN: 01651684     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.sigpro.2009.09.009     Document Type: Article
Times cited: 141

References (54)
  • 1
    • 85032751766
    • Emotion recognition in human-computer interaction
    • Cowie R., et al. Emotion recognition in human-computer interaction. IEEE Signal Processing Magazine 18 (2001) 32-81
    • (2001) IEEE Signal Processing Magazine , vol.18 , pp. 32-81
    • Cowie, R.1
  • 2
    • 0005504614
    • Speakers and hearers are people: reflections on speech deterioration as a consequence of acquired deafness
    • Spens K.E., and Plant G. (Eds), Wiley, New York
    • Cowie R., and Douglas-Cowie E. Speakers and hearers are people: reflections on speech deterioration as a consequence of acquired deafness. In: Spens K.E., and Plant G. (Eds). Profound Deafness and Speech Communication (1995), Wiley, New York
    • (1995) Profound Deafness and Speech Communication
    • Cowie, R.1    Douglas-Cowie, E.2
  • 8
    • 33846952503
    • Ensemble methods for spoken emotion recognition in call-centres
    • Morrison D., Wang R., et al. Ensemble methods for spoken emotion recognition in call-centres. Speech Communication 49 (2007) 98-112
    • (2007) Speech Communication , vol.49 , pp. 98-112
    • Morrison, D.1    Wang, R.2
  • 10
    • 34548015220
    • The role of prosody in disambiguating potentially ambiguous utterances in English and Italian
    • J. Hirschberg, C. Avesani, The role of prosody in disambiguating potentially ambiguous utterances in English and Italian, in: ESCA Workshop on Intonation, 1997.
    • (1997) ESCA Workshop on Intonation
    • Hirschberg, J.1    Avesani, C.2
  • 12
    • 38049133461
    • Emotional aspects of intrinsic speech variabilities in automatic speech recognition
    • M. Cernak, C. Wellekens, Emotional aspects of intrinsic speech variabilities in automatic speech recognition, in: International Conference on Speech and Computer, 2006, pp. 405-408.
    • (2006) International Conference on Speech and Computer , pp. 405-408
    • Cernak, M.1    Wellekens, C.2
  • 16
    • 84889960454
    • An argument for basic emotions
    • Ekman P. An argument for basic emotions. Cognition and Emotion 6 (1992) 169-200
    • (1992) Cognition and Emotion , vol.6 , pp. 169-200
    • Ekman, P.1
  • 17
    • 58149453035
    • Three dimensions of emotions
    • Schlosberg H. Three dimensions of emotions. Psychological Review 61 (1954) 81-88
    • (1954) Psychological Review , vol.61 , pp. 81-88
    • Schlosberg, H.1
  • 19
    • 85032752037
    • Extracting moods from pictures and sounds: towards truly personalized TV
    • Hanjalic A. Extracting moods from pictures and sounds: towards truly personalized TV. IEEE Signal Processing Magazine 23 (2006) 90-100
    • (2006) IEEE Signal Processing Magazine , vol.23 , pp. 90-100
    • Hanjalic, A.1
  • 21
    • 75249105202
    • Psychological motivated multi-stage emotion classification exploiting voice quality features
    • F. Mihelic, J. Zibert Eds, Chapter 22
    • M. Lugger, B. Yang, Psychological motivated multi-stage emotion classification exploiting voice quality features, in: F. Mihelic, J. Zibert (Eds.), Speech Recognition, In-Tech, 2008 (Chapter 22).
    • (2008) Speech Recognition, In-Tech
    • Lugger, M.1    Yang, B.2
  • 22
    • 0002686212
    • Dimensions of emotional meaning in speech
    • C. Pereira, Dimensions of emotional meaning in speech, in: ITRW on Speech and Emotion, 2000, pp. 25-28.
    • (2000) ITRW on Speech and Emotion , pp. 25-28
    • Pereira, C.1
  • 24
    • 0008618803
    • Royal Swedish Academy of Music
    • I. Fonagy, Emotions voice and music, Royal Swedish Academy of Music, 1981, pp. 51-79.
    • (1981) Emotions voice and music , pp. 51-79
    • Fonagy, I.1
  • 25
    • 34547493864
    • Emotion recognition in the noise applying large acoustic feature sets
    • Dresden
    • B. Schuller, D. Arsic, et al., Emotion recognition in the noise applying large acoustic feature sets, in: Speech Prosody, Dresden, 2006.
    • (2006) Speech Prosody
    • Schuller, B.1    Arsic, D.2
  • 29
    • 84902658348
    • Extracting voice quality contours using discrete hidden Markov models
    • M. Lugger, B. Yang, Extracting voice quality contours using discrete hidden Markov models, in: Proceedings of the Speech Prosody, 2008.
    • (2008) Proceedings of the Speech Prosody
    • Lugger, M.1    Yang, B.2
  • 30
    • 0038103326
    • Fundamentals of Statistical Signal Processing
    • Prentice-Hall, Englewood Cliffs, NJ
    • Kay S.M. Fundamentals of Statistical Signal Processing. Detection Theory Vol. 2 (1998), Prentice-Hall, Englewood Cliffs, NJ
    • (1998) Detection Theory , vol.2
    • Kay, S.M.1
  • 31
    • 0016355478
    • A new look at the statistical model identification
    • Akaike H. A new look at the statistical model identification. IEEE Transactions on Automatic Control 19 (1974) 716-723
    • (1974) IEEE Transactions on Automatic Control , vol.19 , pp. 716-723
    • Akaike, H.1
  • 34
    • 75249092878
    • Combining classifiers with diverse feature sets for robust speaker independent emotion recognition
    • M. Lugger, B. Yang, Combining classifiers with diverse feature sets for robust speaker independent emotion recognition, in: Proceedings of the EUSIPCO, 2009.
    • (2009) Proceedings of the EUSIPCO
    • Lugger, M.1    Yang, B.2
  • 35
    • 84890445089
    • Overfitting in making comparisons between variable selection methods
    • Reunanen J. Overfitting in making comparisons between variable selection methods. Journal of Machine Learning Research 3 (2003) 1371-1382
    • (2003) Journal of Machine Learning Research , vol.3 , pp. 1371-1382
    • Reunanen, J.1
  • 36
    • 85115260483
    • Floating search methods for feature selection with nonmonotonic criterion
    • Pudil P., Ferri F.J., et al. Floating search methods for feature selection with nonmonotonic criterion. Pattern Recognition-Conference B: Computer Vision 2 (1994) 279-283
    • (1994) Pattern Recognition-Conference B: Computer Vision , vol.2 , pp. 279-283
    • Pudil, P.1    Ferri, F.J.2
  • 37
    • 0242721417
    • Speech emotion recognition using hidden Markov models
    • Nwe T., Foo S., and Silva L.D. Speech emotion recognition using hidden Markov models. Speech Communication 41 (2003) 603-623
    • (2003) Speech Communication , vol.41 , pp. 603-623
    • Nwe, T.1    Foo, S.2    Silva, L.D.3
  • 38
    • 34547496515
    • The relevance of voice quality features in speaker independent emotion recognition
    • M. Lugger, B. Yang, The relevance of voice quality features in speaker independent emotion recognition, in: Proceedings of the IEEE ICASSP, vol. 4, 2007, pp. 17-20.
    • (2007) Proceedings of the IEEE ICASSP , vol.4 , pp. 17-20
    • Lugger, M.1    Yang, B.2
  • 39
    • 51449108623
    • Cascaded emotion classification via psychological emotion dimensions using a large set of voice quality parameters
    • M. Lugger, B. Yang, Cascaded emotion classification via psychological emotion dimensions using a large set of voice quality parameters, in: Proceedings of the IEEE ICASSP, 2008, pp. 4945-4948.
    • (2008) Proceedings of the IEEE ICASSP , pp. 4945-4948
    • Lugger, M.1    Yang, B.2
  • 44
    • 0000547455
    • Classification of glottal vibration from acoustic measurements
    • Fujimura O., and Hirano M. (Eds), Hiltop University Press
    • Stevens K., and Hanson H. Classification of glottal vibration from acoustic measurements. In: Fujimura O., and Hirano M. (Eds). Vocal Fold Physiology (1994), Hiltop University Press 147-170
    • (1994) Vocal Fold Physiology , pp. 147-170
    • Stevens, K.1    Hanson, H.2
  • 48
    • 0042125529
    • The statistical structure of human speech sounds predicts musical universals
    • D. Schwartz, C. Howe, D. Purves, The statistical structure of human speech sounds predicts musical universals, The Journal of Neuroscience (2003) 7160-7168.
    • (2003) The Journal of Neuroscience , pp. 7160-7168
    • Schwartz, D.1    Howe, C.2    Purves, D.3
  • 50
    • 0033338098
    • Mathematical representation of joint time-chroma distribution
    • G.H. Wakefield, Mathematical representation of joint time-chroma distribution, in: SPIE, vol. 3807, 1999.
  • 51
    • 33847034601
    • The psychophysics of harmony perception: harmony is a three-tone phenomenon
    • Cook N.D., and Fujisawa T.X. The psychophysics of harmony perception: harmony is a three-tone phenomenon. Empirical Musicology Review 1 (2) (2006) 106-126
    • (2006) Empirical Musicology Review , vol.1 , Issue.2 , pp. 106-126
    • Cook, N.D.1    Fujisawa, T.X.2
  • 54
    • 70450136545
    • An incremental analysis of different feature groups in speaker independent emotion recognition
    • M. Lugger, B. Yang, An incremental analysis of different feature groups in speaker independent emotion recognition, in: Proceedings of the International Conference on Phonetic Sciences, 2007, pp. 2149-2152.
    • (2007) Proceedings of the International Conference on Phonetic Sciences , pp. 2149-2152
    • Lugger, M.1    Yang, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.