메뉴 건너뛰기




Volumn 24, Issue 3, 2010, Pages 445-460

Spoken emotion recognition through optimum-path forest classification using glottal features

Author keywords

Emotion recognition; Glottal analysis; Optimum path forest; Speech analysis

Indexed keywords

ARTIFICIAL NEURAL NETWORK; BAYESIAN CLASSIFIER; CLASSIFICATION METHODS; EMOTION RECOGNITION; FOREST CLASSIFICATION; GAUSSIAN MIXTURE MODEL; INVERSE FILTERING; K-NEAREST NEIGHBOR RULES; MULTI LAYER PERCEPTRON; RECOGNITION RATES; SPEECH DATABASE; SPEECH FEATURES; SPEECH SIGNALS; WAVE FORMS;

EID: 77950073346     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2009.02.005     Document Type: Article
Times cited : (79)

References (58)
  • 1
    • 9444288190 scopus 로고    scopus 로고
    • Airas, M., Alku, P., 2004. Emotions in short vowel segments: effects of the glottal flow as reflected by the normalized amplitude quotient. Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science), v 3068, Affective Dialogue Systems, 2004, pp. 13-24.
    • Airas, M., Alku, P., 2004. Emotions in short vowel segments: effects of the glottal flow as reflected by the normalized amplitude quotient. Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science), v 3068, Affective Dialogue Systems, 2004, pp. 13-24.
  • 2
    • 0030123157 scopus 로고    scopus 로고
    • Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering
    • Alku P., and Vilkman E. Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering. Speech Communication 18 (1996) 131-138
    • (1996) Speech Communication , vol.18 , pp. 131-138
    • Alku, P.1    Vilkman, E.2
  • 7
    • 0037382560 scopus 로고    scopus 로고
    • Emotions, speech and the ASR framework
    • Bosch L. Emotions, speech and the ASR framework. Speech Communication 40 (2003) 213-225
    • (2003) Speech Communication , vol.40 , pp. 213-225
    • Bosch, L.1
  • 15
    • 0037382510 scopus 로고    scopus 로고
    • Describing the emotional states that are expressed in speech
    • Cowie R., and Cornelius R. Describing the emotional states that are expressed in speech. Speech Communication 40 (2003) 5-32
    • (2003) Speech Communication , vol.40 , pp. 5-32
    • Cowie, R.1    Cornelius, R.2
  • 16
    • 0029030164 scopus 로고
    • Analysis of the glottal excitation of emotionally styled and stressed speech
    • Cummings K.E., and Clements M.A. Analysis of the glottal excitation of emotionally styled and stressed speech. Journal of the Acoustical Society of America 98 1 (1995) 88-98
    • (1995) Journal of the Acoustical Society of America , vol.98 , Issue.1 , pp. 88-98
    • Cummings, K.E.1    Clements, M.A.2
  • 17
    • 26444573178 scopus 로고    scopus 로고
    • Which is the best multiclass SVM method: an empirical study
    • Duan K., and Keerthi S.S. Which is the best multiclass SVM method: an empirical study. Multiple Classifier Systems (2005) 278-285
    • (2005) Multiple Classifier Systems , pp. 278-285
    • Duan, K.1    Keerthi, S.S.2
  • 18
    • 84889960454 scopus 로고
    • An argument for basic emotions
    • Eckman P. An argument for basic emotions. Cognition and Emotion 6 3/4 (1992) 169-200
    • (1992) Cognition and Emotion , vol.6 , Issue.3-4 , pp. 169-200
    • Eckman, P.1
  • 20
    • 84928451959 scopus 로고
    • Glottal flow: models and interaction
    • Fant G. Glottal flow: models and interaction. Journal of Phonetics 14 (1986) 393-399
    • (1986) Journal of Phonetics , vol.14 , pp. 393-399
    • Fant, G.1
  • 22
    • 34147125327 scopus 로고    scopus 로고
    • Emotion recognition from the facial image and speech signal
    • Go, H., Kwak, K., Lee, D., Chun, M., 2003. Emotion recognition from the facial image and speech signal. In: SICE Annual Conference in Fukui, pp. 2890-2895.
    • (2003) SICE Annual Conference in Fukui , pp. 2890-2895
    • Go, H.1    Kwak, K.2    Lee, D.3    Chun, M.4
  • 23
    • 0037380186 scopus 로고    scopus 로고
    • The role of voice quality in communicating emotion, mood and attitude
    • Gobl C., and Chasaide A. The role of voice quality in communicating emotion, mood and attitude. Speech Communication 40 (2003) 189-212
    • (2003) Speech Communication , vol.40 , pp. 189-212
    • Gobl, C.1    Chasaide, A.2
  • 25
    • 0037380318 scopus 로고    scopus 로고
    • A corpus-based speech synthesis system with emotion
    • Iida A., Campbell N., Higuchi F., and Yasumara M. A corpus-based speech synthesis system with emotion. Speech Communication 40 1-2 (2003) 161-187
    • (2003) Speech Communication , vol.40 , Issue.1-2 , pp. 161-187
    • Iida, A.1    Campbell, N.2    Higuchi, F.3    Yasumara, M.4
  • 29
    • 0345580812 scopus 로고    scopus 로고
    • Rules for the generation of ToBI-based American English intonation
    • Jilka M., Moler G., and Dogil G. Rules for the generation of ToBI-based American English intonation. Speech Communication 28 (1999) 83-108
    • (1999) Speech Communication , vol.28 , pp. 83-108
    • Jilka, M.1    Moler, G.2    Dogil, G.3
  • 30
    • 0000139994 scopus 로고
    • Objective voice parameters to characterize the emotional content in speech
    • Klasmeyer, G., Sendlneier, W.F., 1995. Objective voice parameters to characterize the emotional content in speech. In: Proceedings of ICPhS 95, p. 1182.
    • (1995) Proceedings of ICPhS , vol.95 , pp. 1182
    • Klasmeyer, G.1    Sendlneier, W.F.2
  • 31
    • 85009223246 scopus 로고    scopus 로고
    • Kwon, O., Chan, K., Hao, J., Lee, T., 2003. Emotion Recognition by Speech Signals, Eurospeech-Geneva, pp. 125-128.
    • Kwon, O., Chan, K., Hao, J., Lee, T., 2003. Emotion Recognition by Speech Signals, Eurospeech-Geneva, pp. 125-128.
  • 32
    • 0030191296 scopus 로고    scopus 로고
    • Physical variation related to stress and emotional state: a preliminary study
    • Laukkanen A.-M., Vilkman E., Alku P., and Oksanen H. Physical variation related to stress and emotional state: a preliminary study. Journal of Phonetics 24 (1996) 313-335
    • (1996) Journal of Phonetics , vol.24 , pp. 313-335
    • Laukkanen, A.-M.1    Vilkman, E.2    Alku, P.3    Oksanen, H.4
  • 36
    • 0038381003 scopus 로고    scopus 로고
    • Automatic measurement of pressed/breathy phonation at acoustic centres of reliability in continuous speech
    • Mokhtari P., and Campbell N. Automatic measurement of pressed/breathy phonation at acoustic centres of reliability in continuous speech. IEICE Transactions on Information and Systems E86-D 3 (2003) 574-582
    • (2003) IEICE Transactions on Information and Systems , vol.E86-D , Issue.3 , pp. 574-582
    • Mokhtari, P.1    Campbell, N.2
  • 39
    • 37349079113 scopus 로고    scopus 로고
    • Moore, E., II, Clements, M.A., Peifer, J.W., Weisser, L., 2008. IEEE Transactions on Biomedical Engineering 55 (1) 96-107. Nissen S., 2003. Implementation of a Fast Artificial Neural Network Library (FANN), Department of Computer Science University of Copenhagen (DIKU), Software available at .
    • Moore, E., II, Clements, M.A., Peifer, J.W., Weisser, L., 2008. IEEE Transactions on Biomedical Engineering 55 (1) 96-107. Nissen S., 2003. Implementation of a Fast Artificial Neural Network Library (FANN), Department of Computer Science University of Copenhagen (DIKU), Software available at .
  • 40
    • 34548139674 scopus 로고
    • Adaptive emotion recognition in speech by feature selection based on KL-divergence
    • Man, and Cybernetics, October 8-11, Taipei, Taiwan. 2006
    • Noda, T., Yano, Y., Doki, S., Okuma, S., 2006. Adaptive emotion recognition in speech by feature selection based on KL-divergence. In: IEEE International Conference on Systems, Man, and Cybernetics, 1921-1926, October 8-11, Taipei, Taiwan.
    • (1921) IEEE International Conference on Systems
    • Noda, T.1    Yano, Y.2    Doki, S.3    Okuma, S.4
  • 43
    • 77950080138 scopus 로고    scopus 로고
    • Papa, J.P, Suzuki, C.T.N, Falcão A.X, 2008. LibOPF: A Library for the Design of Optimum-Path Forest Classifiers, Software version 1.0 available at
    • Papa, J.P., Suzuki, C.T.N., Falcão A.X., 2008. LibOPF: A Library for the Design of Optimum-Path Forest Classifiers, Software version 1.0 available at .
  • 46
    • 0002505832 scopus 로고    scopus 로고
    • Techniques for the phonetic description of emotional speech
    • Northern Ireland, pp
    • Roach, P., 2000. Techniques for the phonetic description of emotional speech. In: Proceedings of the ISCA Workshop on Speech and Emotion, Northern Ireland, pp. 53-59.
    • (2000) Proceedings of the ISCA Workshop on Speech and Emotion , pp. 53-59
    • Roach, P.1
  • 47
    • 0015799682 scopus 로고
    • A new inverse-filtering technique for deriving the glottal air flow waveform during voicing
    • Rothenberg M. A new inverse-filtering technique for deriving the glottal air flow waveform during voicing. Journal of the Acoustical Society of America 53 6 (1973) l632-1645
    • (1973) Journal of the Acoustical Society of America , vol.53 , Issue.6
    • Rothenberg, M.1
  • 48
    • 0037384712 scopus 로고    scopus 로고
    • Vocal communication of emotion: a review of research paradigms
    • Scherer K. Vocal communication of emotion: a review of research paradigms. Speech Communication 40 (2003) 227-256
    • (2003) Speech Communication , vol.40 , pp. 227-256
    • Scherer, K.1
  • 49
  • 51
    • 0002494136 scopus 로고    scopus 로고
    • Automated extraction of ToBI annotation data from the Reading/Leeds emotional speech corpus. ITRW on Speech and Emotion, ISCA, in Speech
    • Stibbard, R.,2000. Automated extraction of ToBI annotation data from the Reading/Leeds emotional speech corpus. ITRW on Speech and Emotion, ISCA, in Speech Emotion, pp. 60-65.
    • (2000) Emotion , pp. 60-65
    • Stibbard, R.1
  • 53
    • 33746410556 scopus 로고    scopus 로고
    • Emotional speech recognition: resources, features, and methods
    • Ververidis D., and Kotropoulos C. Emotional speech recognition: resources, features, and methods. Speech Communication 48 (2006) 1162-1181
    • (2006) Speech Communication , vol.48 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.