



Volume 57, 2014, Pages 155-169

Continuous emotion recognition with phonetic syllables

Author keywords

Affective computing; Feature extraction; Phonetic syllables; Valence Activation Dominance space

Indexed keywords

AFFECTIVE COMPUTING; AMOUNT OF INFORMATION; FEATURE EXTRACTION METHODS; PHONETIC SYLLABLES; REALTIME PROCESSING; STATE-OF-THE-ART PERFORMANCE; THREE-DIMENSIONAL MODEL; VALENCE-ACTIVATION-DOMINANCE SPACE;

EID: 84887045699     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2013.09.012     Document Type: Article
Times cited: 46

References (74)
  • 1
    • 65149087793
    • Rhythm, timing and the timing of rhythm
    • A. Arvaniti Rhythm, timing and the timing of rhythm Phonetica 66 2009 46 63
    • (2009) Phonetica , vol.66 , pp. 46-63
    • Arvaniti, A.1
  • 5
    • 78349274056
    • Segmenting into adequate units for automatic recognition of emotion-related episodes: A speech-based approach
    • A. Batliner, D. Seppi, S. Steidl, and B. Schuller Segmenting into adequate units for automatic recognition of emotion-related episodes: a speech-based approach Advances in Human-Computer Interaction 2010 2010 1 15
    • (2010) Advances in Human-Computer Interaction , vol.2010 , pp. 1-15
    • Batliner, A.1    Seppi, D.2    Steidl, S.3    Schuller, B.4
  • 6
    • 0001835850
    • Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound
    • Boersma, P.; 1993. Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound. In: Proc. of IFA, pp. 97-110.
    • (1993) Proc. of IFA , pp. 97-110
    • Boersma, P.1
  • 8
    • 0035135647
    • The contribution of speech rate and pitch variation to the perception of vocal emotions in a German and an American sample
    • C. Breitenstein, D. Van Lancker, and I. Daum The contribution of speech rate and pitch variation to the perception of vocal emotions in a German and an American sample Cognition and Emotion 15 2001 57 79 (Pubitemid 32095593)
    • (2001) Cognition and Emotion , vol.15 , Issue.1 , pp. 57-79
    • Breitenstein, C.1    Van Lancker, D.2    Daum, I.3
  • 13
    • 0032041832
    • Rhythmic constraints on stress timing in English
    • F. Cummins, and R. Port Rhythmic constraints on stress timing in English Journal of Phonetics 26 1998 145 171 (Pubitemid 128179514)
    • (1998) Journal of Phonetics , vol.26 , Issue.2 , pp. 145-171
    • Cummins, F.1    Port, R.2
  • 14
    • 0029342671
    • Automatic pitch contour stylization using a model of tonal perception
    • C. D'Alessandro, and P. Mertens Automatic pitch contour stylization using a model of tonal perception Computer Speech and Language 9 1995 257 288
    • (1995) Computer Speech and Language , vol.9 , pp. 257-288
    • D'Alessandro, C.1    Mertens, P.2
  • 15
    • 33646456450
    • Rhythm and speech rate: A variation coefficient for ΔC
    • Karnowski, P., Szigeti, I. (Eds.), Peter Lang, Frankfurt am Main
    • Dellwo, V.; 2006. Rhythm and speech rate: a variation coefficient for ΔC. In: Karnowski, P. Szigeti, I. (Eds.), Language and Language Processing, Peter Lang, Frankfurt am Main, pp. 231-241.
    • (2006) Language and Language Processing , pp. 231-241
    • Dellwo, V.1
  • 16
    • 24144491519
    • Relationships between speech rate and rhythm
    • Dellwo, V.; Wagner, P.; 2003. Relationships between speech rate and rhythm. In: Proc. of the ICPhS, pp. 471-474.
    • (2003) Proc. of the ICPhS , pp. 471-474
    • Dellwo, V.1    Wagner, P.2
  • 17
    • 9444237982
    • Emotions and voice quality: Experiments with sinusoidal modeling
    • Drioli, C.; Tisato, G.; Cosi, P.; Tesser, F.; 2003. Emotions and voice quality: experiments with sinusoidal modeling. In: Proc. of VOQUAL, pp. 127-132.
    • (2003) Proc. of VOQUAL , pp. 127-132
    • Drioli, C.1    Tisato, G.2    Cosi, P.3    Tesser, F.4
  • 18
    • 84889960454
    • An argument for basic emotions
    • P. Ekman An argument for basic emotions Cognition and Emotion 6 1992 169 200
    • (1992) Cognition and Emotion , vol.6 , pp. 169-200
    • Ekman, P.1
  • 20
    • 79960846934
    • Recognizing affect from speech prosody using hierarchical graphical models
    • R. Fernandez, and R. Picard Recognizing affect from speech prosody using hierarchical graphical models Speech Communication 53 2011 1088 1103
    • (2011) Speech Communication , vol.53 , pp. 1088-1103
    • Fernandez, R.1    Picard, R.2
  • 21
    • 0015297936
    • Combinations of amplitude and frequency differences in auditory discrimination
    • L.L. Feth Combinations of amplitude and frequency differences in auditory discrimination Acustica 26 1972 67 77
    • (1972) Acustica , vol.26 , pp. 67-77
    • Feth, L.L.1
  • 22
    • 21544458365
    • Emotion recognition in human-computer interaction
    • DOI 10.1016/j.neunet.2005.03.006, PII S0893608005000390, Emotion and Brain
    • N. Fragopanagos, and J.G. Taylor Special issue: emotion recognition in human-computer interaction Neural Networks 18 2005 389 405 (Pubitemid 40922647)
    • (2005) Neural Networks , vol.18 , Issue.4 , pp. 389-405
    • Fragopanagos, N.1    Taylor, J.G.2
  • 24
    • 84876449733
    • Emotion recognition improvement using normalized formant supplementary features by hybrid of DTW-MLP-GMM model
    • D. Gharavian, M. Sheikhan, and F. Ashoftedel Emotion recognition improvement using normalized formant supplementary features by hybrid of DTW-MLP-GMM model Neural Computing and Applications 22 2012 1 11
    • (2012) Neural Computing and Applications , vol.22 , pp. 1-11
    • Gharavian, D.1    Sheikhan, M.2    Ashoftedel, F.3
  • 27
    • 34547940048
    • Primitives-based evaluation and estimation of emotions in speech
    • DOI 10.1016/j.specom.2007.01.010, PII S0167639307000040
    • M. Grimm, E. Mower, K. Kroschel, and S. Narayanan Primitives-based evaluation and estimation of emotions in speech Speech Communication 49 2007 787 800 (Pubitemid 47268568)
    • (2007) Speech Communication , vol.49 , Issue.10-11 , pp. 787-800
    • Grimm, M.1    Kroschel, K.2    Mower, E.3    Narayanan, S.4
  • 28
    • 54049132925
    • The Vera am Mittag German audio-visual emotional speech database
    • Grimm, M.; Kroschel, K.; Narayanan, S.; 2008. The Vera am Mittag German audio-visual emotional speech database. In: Proc. of ICME, pp. 865-868.
    • (2008) Proc. of ICME , pp. 865-868
    • Grimm, M.1    Kroschel, K.2    Narayanan, S.3
  • 29
    • 79958702587
    • Emotion representation, analysis and synthesis in continuous space: A survey
    • Gunes, H.; Schuller, B.; Pantic, M.; Cowie, R.; 2011. Emotion representation, analysis and synthesis in continuous space: a survey. In: Proc. of FG, pp. 827-834.
    • (2011) Proc. of FG , pp. 827-834
    • Gunes, H.1    Schuller, B.2    Pantic, M.3    Cowie, R.4
  • 32
    • 85135174463
    • Perception of prepausal tonal contours: Implications for automatic stylization of intonation
    • House, D.; 1995. Perception of prepausal tonal contours: implications for automatic stylization of intonation. In: Proc. of Eurospeech, pp. 949-952.
    • (1995) Proc. of Eurospeech , pp. 949-952
    • House, D.1
  • 33
    • 0030351604
    • Differential perception of tonal contours through the syllable
    • House, D.; 1996. Differential perception of tonal contours through the syllable. In: Proc. of ICSLP, pp. 2048-2051.
    • (1996) Proc. of ICSLP , pp. 2048-2051
    • House, D.1
  • 37
    • 0025635254
    • On a simple algorithm to calculate the 'energy' of a signal
    • Kaiser, J.; 1990. On a simple algorithm to calculate the 'energy' of a signal. In: Proc. of IEEE ICASSP, pp. 381-384.
    • (1990) Proc. of IEEE ICASSP , pp. 381-384
    • Kaiser, J.1
  • 38
    • 44949264114
    • Feature analysis for emotion recognition from mandarin speech considering the special characteristics of Chinese language
    • Kao, Y.; Lee, L.; 2006. Feature analysis for emotion recognition from mandarin speech considering the special characteristics of Chinese language. In: Proc. of Interspeech, pp. 1814-1817.
    • (2006) Proc. of Interspeech , pp. 1814-1817
    • Kao, Y.1    Lee, L.2
  • 39
    • 0015534174
    • Discrimination of fundamental frequency contours in synthetic speech: Implications for models of pitch perception
    • D.H. Klatt Discrimination of fundamental frequency contours in synthetic speech: implications for models of pitch perception Journal of the Acoustical Society of America 53 1973 8 16
    • (1973) Journal of the Acoustical Society of America , vol.53 , pp. 8-16
    • Klatt, D.H.1
  • 40
    • 84865804487
    • On the use of the rhythmogram for automatic syllabic prominence annotation
    • Ludusan, B.; Origlia, A.; Cutugno, C.; 2011. On the use of the rhythmogram for automatic syllabic prominence annotation. In: Proc. of Interspeech, pp. 2413-2416.
    • (2011) Proc. of Interspeech , pp. 2413-2416
    • Ludusan, B.1    Origlia, A.2    Cutugno, C.3
  • 41
    • 0004383174
    • Ein Funktionsschema des Gehörs zur Beschreibung der Erkennbarkeit kleiner Frequenz-und-Amplitudenänderungen
    • D. Maiwald Ein Funktionsschema des Gehörs zur Beschreibung der Erkennbarkeit kleiner Frequenz-und-Amplitudenänderungen Acustica 18 1967 81 93
    • (1967) Acustica , vol.18 , pp. 81-93
    • Maiwald, D.1
  • 42
    • 85039167624
    • Prominence detection without syllabic segmentation
    • Martin, P.; 2010. Prominence detection without syllabic segmentation. In: Proc. of Speech Prosody (Online).
    • (2010) Proc. of Speech Prosody (Online)
    • Martin, P.1
  • 43
    • 52949094265
    • Extraction and representation of prosodic features for language and speaker recognition
    • L. Mary, and B. Yegnanarayana Extraction and representation of prosodic features for language and speaker recognition Speech Communication 50 2008 782 796
    • (2008) Speech Communication , vol.50 , pp. 782-796
    • Mary, L.1    Yegnanarayana, B.2
  • 44
    • 78349272018
    • The semaine corpus of emotionally coloured character interactions
    • McKeown, G.; Valstar, M.F.; Cowie, R.; Pantic, M.; 2010. The semaine corpus of emotionally coloured character interactions. In: Proc. of ICME, pp. 1079-1084.
    • (2010) Proc. of ICME , pp. 1079-1084
    • McKeown, G.1    Valstar, M.F.2    Cowie, R.3    Pantic, M.4
  • 45
    • 21344454051
    • Pleasure-arousal-dominance: A general framework for describing and measuring individual differences in temperament
    • A. Mehrabian Pleasure-arousal-dominance: a general framework for describing and measuring individual differences in temperament Current Psychology: Developmental, Learning, Personality, Social 14 1996 261 292
    • (1996) Current Psychology: Developmental, Learning, Personality, Social , vol.14 , pp. 261-292
    • Mehrabian, A.1
  • 47
    • 34547224809
    • The prosogram: Semi-automatic transcription of prosody based on a tonal perception model
    • Mertens, P.; 2004. The prosogram: semi-automatic transcription of prosody based on a tonal perception model. In: Proc. of Speech Prosody.
    • (2004) Proc. of Speech Prosody
    • Mertens, P.1
  • 48
    • 84867336294
    • Dynamic properties of cochlear nucleus units in response to excitatory and inhibitory tones
    • Møller, E.; 1974. Dynamic properties of cochlear nucleus units in response to excitatory and inhibitory tones. In: Proc. of Facts and Models in Hearing, pp. 227-240.
    • (1974) Proc. of Facts and Models in Hearing , pp. 227-240
    • Møller, E.1
  • 49
    • 80054842318
    • Continuous prediction of spontaneous affect from multiple cues and modalities in valence-arousal space
    • M.A. Nicolaou, H. Gunes, and M. Pantic Continuous prediction of spontaneous affect from multiple cues and modalities in valence-arousal space IEEE Transactions on Affective Computing 2 2011 92 105
    • (2011) IEEE Transactions on Affective Computing , vol.2 , pp. 92-105
    • Nicolaou, M.A.1    Gunes, H.2    Pantic, M.3
  • 50
    • 84867332831
    • A dynamic tonal perception model for optimal pitch stylization
    • A. Origlia, G. Abete, and C. Cutugno A dynamic tonal perception model for optimal pitch stylization Computer Speech and Language 27 2013 190 208
    • (2013) Computer Speech and Language , vol.27 , pp. 190-208
    • Origlia, A.1    Abete, G.2    Cutugno, C.3
  • 52
    • 51849111937
    • A syllable segmentation algorithm for English and Italian
    • Petrillo, M.; Cutugno, F.; 2003. A syllable segmentation algorithm for English and Italian. In: Proc. of Eurospeech, pp. 2913-2916.
    • (2003) Proc. of Eurospeech , pp. 2913-2916
    • Petrillo, M.1    Cutugno, F.2
  • 54
    • 0014320701
    • Detection of rate of change of auditory frequency
    • I. Pollack Detection of rate of change of auditory frequency Journal of Experimental Psychology 77 1968 535 541
    • (1968) Journal of Experimental Psychology , vol.77 , pp. 535-541
    • Pollack, I.1
  • 55
    • 0032725252
    • Correlates of linguistic rhythm in the speech signal
    • F. Ramus, M. Nespor, and J. Mehler Correlates of linguistic rhythm in the speech signal Cognition 73 1999 265 292
    • (1999) Cognition , vol.73 , pp. 265-292
    • Ramus, F.1    Nespor, M.2    Mehler, J.3
  • 57
    • 84939663443
    • Le seuil de glissando ou seuil de perception des variations tonales pour les sons de la parole
    • M. Rossi Le seuil de glissando ou seuil de perception des variations tonales pour les sons de la parole Phonetica 23 1971 1 33
    • (1971) Phonetica , vol.23 , pp. 1-33
    • Rossi, M.1
  • 58
    • 0018030350
    • Interactions of intensity glides and frequency glissandos
    • M. Rossi Interactions of intensity glides and frequency glissandos Language and Speech 21 1978 384 394
    • (1978) Language and Speech , vol.21 , pp. 384-394
    • Rossi, M.1
  • 60
    • 0347613216
    • Vocal expression of emotions
    • R.J. Davidson, K.R. Scherer, H.H. Goldsmith, Oxford University Press
    • K.R. Scherer, T. Johnstone, and G. Klasmeyer Vocal expression of emotions R.J. Davidson, K.R. Scherer, H.H. Goldsmith, Handbook of Affective Sciences 2003 Oxford University Press 433 456
    • (2003) Handbook of Affective Sciences , pp. 433-456
    • Scherer, K.R.1    Johnstone, T.2    Klasmeyer, G.3
  • 61
    • 0022040692
    • Identification and discrimination of sweep tones
    • H.E.M. Schouten Identification and discrimination of sweep tones Perception and Psychophysics 37 1985 369 376
    • (1985) Perception and Psychophysics , vol.37 , pp. 369-376
    • Schouten, H.E.M.1
  • 62
    • 54049132987
    • Combining speech recognition and acoustic word emotion models for robust text independent emotion recognition
    • Schuller, B.; Vlasenko, B.; Arsic, D.; Rigoll, G.; Wendemuth, A.; 2008. Combining speech recognition and acoustic word emotion models for robust text independent emotion recognition. In: Proc. of IEEE ICME, pp. 1333-1336.
    • (2008) Proc. of IEEE ICME , pp. 1333-1336
    • Schuller, B.1    Vlasenko, B.2    Arsic, D.3    Rigoll, G.4    Wendemuth, A.5
  • 64
    • 79960846940
    • Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge
    • B. Schuller, A. Batliner, S. Steidl, and D. Seppi Recognising realistic emotions and affect in speech: state of the art and lessons learnt from the first challenge Speech Communication 53 2011 1062 1087
    • (2011) Speech Communication , vol.53 , pp. 1062-1087
    • Schuller, B.1    Batliner, A.2    Steidl, S.3    Seppi, D.4
  • 67
    • 33645169036
    • Automatic transcription of prosodic stress for spontaneous English discourse
    • Silipo, R.; Greenberg, S.; 1999. Automatic transcription of prosodic stress for spontaneous English discourse. In: Proc. of ICPhS, pp. 2351-2354.
    • (1999) Proc. of ICPhS , pp. 2351-2354
    • Silipo, R.1    Greenberg, S.2
  • 68
    • 77955669484
    • On automatic prominence detection for German
    • Tamburini, F.; Wagner, P.; 2007. On automatic prominence detection for German. In: Proc. of Interspeech, pp. 1809-1812.
    • (2007) Proc. of Interspeech , pp. 1809-1812
    • Tamburini, F.1    Wagner, P.2
  • 73
    • 79953659944
    • Automatic speech emotion recognition using modulation spectral features
    • S. Wu, T.H. Falk, and W. Chan Automatic speech emotion recognition using modulation spectral features Speech Communication 53 2011 768 785
    • (2011) Speech Communication , vol.53 , pp. 768-785
    • Wu, S.1    Falk, T.H.2    Chan, W.3
  • 74
    • 0000496392
    • Direct comparisons between the sensations produced by frequency modulation and amplitude modulation
    • E. Zwicker Direct comparisons between the sensations produced by frequency modulation and amplitude modulation Journal of the Acoustical Society of America 34 1962 1425 1430
    • (1962) Journal of the Acoustical Society of America , vol.34 , pp. 1425-1430
    • Zwicker, E.1


* This information was extracted by KISTI through its analysis of Elsevier's SCOPUS database.