Volume 2013, Issue 1, 2013, Pages

On the use of speech parameter contours for emotion recognition

Author keywords

Emotion recognition; Formant contours; Glottal spectrum; LDC emotional prosody speech corpus; Paralinguistic information; Pitch contours; Temporal information

Indexed keywords

EMOTION RECOGNITION; FORMANT CONTOURS; GLOTTAL SPECTRUM; PARALINGUISTIC INFORMATION; PITCH CONTOURS; SPEECH CORPORA; TEMPORAL INFORMATION;

EID: 84887104454     PISSN: 16874714     EISSN: 16874722     Source Type: Journal    
DOI: 10.1186/1687-4722-2013-19     Document Type: Article
Times cited : (10)

References (50)
  • 4
    • 2942590310
    • Toward an affect-sensitive multimodal human-computer interaction
    • DOI 10.1109/JPROC.2003.817122, Human-Computer Multimodal Interface
    • Pantic M, Rothkrantz LJM: Toward an affect-sensitive multimodal human-computer interaction. Proc IEEE 2003, 91:1370-1390. (Pubitemid 40890819)
    • (2003) Proceedings of the IEEE , vol.91 , Issue.9 , pp. 1370-1390
    • Pantic, M.1    Rothkrantz, L.J.M.2
  • 5
    • 33746410556
    • Emotional speech recognition: Resources, features, and methods
    • DOI 10.1016/j.specom.2006.04.003, PII S0167639306000422
    • Ververidis D, Kotropoulos C: Emotional speech recognition: resources, features, and methods. Speech Communication 2006, 48:1162-1181. (Pubitemid 44128615)
    • (2006) Speech Communication , vol.48 , Issue.9 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2
  • 6
    • 77955418889
    • Five emotion classes detection in real-world call center data: The use of various types of paralinguistic features, in Proceedings of International Workshop on Paralinguistic Speech - 2007
    • August
    • Vidrascu L, Devillers L: Five emotion classes detection in real-world call center data: the use of various types of paralinguistic features, in Proceedings of International Workshop on Paralinguistic Speech - 2007. Saarbrücken August 2007, 3:11-16.
    • (2007) Saarbrücken , vol.3 , pp. 11-16
    • Vidrascu, L.1    Devillers, L.2
  • 7
    • 84947280249
    • Recognition of emotions in interactive voice response systems, in Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - INTERSPEECH 2003
    • September
    • Yacoub S, Simske S, Lin X, Burns J: Recognition of emotions in interactive voice response systems, in Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - INTERSPEECH 2003. Geneva September 2003, 1-4:729-732.
    • (2003) Geneva , vol.1-4 , pp. 729-732
    • Yacoub, S.1    Simske, S.2    Lin, X.3    Burns, J.4
  • 8
    • 77956401353
    • Class-level spectral features for emotion recognition
    • 10.1016/j.specom.2010.02.010
    • Bitouk D, Verma R, Nenkova A: Class-level spectral features for emotion recognition. Speech Communication 2010, 52:613-625.
    • (2010) Speech Communication , vol.52 , pp. 613-625
    • Bitouk, D.1    Verma, R.2    Nenkova, A.3
  • 9
    • 78649328053
    • Survey on speech emotion recognition: Features, classification schemes, and databases
    • 1207.68275 10.1016/j.patcog.2010.09.020
    • El Ayadi M, Kamel MS, Karray F: Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recognition 2011, 44:572-587.
    • (2011) Pattern Recognition , vol.44 , pp. 572-587
    • El Ayadi, M.1    Kamel, M.S.2    Karray, F.3
  • 17
    • 70450186224
    • GTM-URL contribution to the INTERSPEECH 2009 Emotion Challenge, in Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH-2009)
    • September
    • Planet S, Iriondo I, Socoró J, Monzo C, Adell J: GTM-URL contribution to the INTERSPEECH 2009 Emotion Challenge, in Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH-2009). Brighton September 2009, 6-10:316-319.
    • (2009) Brighton , vol.6-10 , pp. 316-319
    • Planet, S.1    Iriondo, I.2    Socoró, J.3    Monzo, C.4    Adell, J.5
  • 18
    • 79953659944
    • Automatic speech emotion recognition using modulation spectral features
    • 10.1016/j.specom.2010.08.013
    • Wu S, Falk TH, Chan W-Y: Automatic speech emotion recognition using modulation spectral features. Speech Communication 2011, 53:768-785.
    • (2011) Speech Communication , vol.53 , pp. 768-785
    • Wu, S.1    Falk, T.H.2    Chan, W.-Y.3
  • 19
    • 70450191305
    • Cepstral and long-term features for emotion recognition, in Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH-2009)
    • September
    • Dumouchel P, Dehak N, Attabi Y, Dehak R, Boufaden N: Cepstral and long-term features for emotion recognition, in Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH-2009). Brighton September 2009, 6-10:344-347.
    • (2009) Brighton , vol.6-10 , pp. 344-347
    • Dumouchel, P.1    Dehak, N.2    Attabi, Y.3    Dehak, R.4    Boufaden, N.5
  • 21
    • 70450163571
    • Pitch contour parameterisation based on linear stylisation for emotion recognition, in Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH-2009)
    • September
    • Sethu V, Ambikairajah E, Epps J: Pitch contour parameterisation based on linear stylisation for emotion recognition, in Proceedings of the 10th Annual Conference of the International Speech Communication Association (INTERSPEECH-2009). Brighton September 2009, 6-10:2011-2014.
    • (2009) Brighton , vol.6-10 , pp. 2011-2014
    • Sethu, V.1    Ambikairajah, E.2    Epps, J.3
  • 22
    • 0025786649
    • Vocal quality factors: Analysis, synthesis, and perception
    • 10.1121/1.402044
    • Childers DG, Lee CK: Vocal quality factors: analysis, synthesis, and perception. J. Acoust. Soc. Am. 1991, 90:2394-2410.
    • (1991) J. Acoust. Soc. Am , vol.90 , pp. 2394-2410
    • Childers, D.G.1    Lee, C.K.2
  • 25
    • 79959858434
    • On the importance of glottal flow spectral energy for the recognition of emotions in speech, in Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
    • September
    • He L, Lech M, Allen N: On the importance of glottal flow spectral energy for the recognition of emotions in speech, in Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010. Makuhari, Chiba, Japan September 2010, 26-30:2346-2349.
    • (2010) Makuhari, Chiba, Japan , vol.26-30 , pp. 2346-2349
    • He, L.1    Lech, M.2    Allen, N.3
  • 28
    • 33947684811
    • A four-parameter model of glottal flow
    • Fant G, Liljencrants J, Lin Q: A four-parameter model of glottal flow. STL-QPSR 1985, 4:1-13.
    • (1985) STL-QPSR , vol.4 , pp. 1-13
    • Fant, G.1    Liljencrants, J.2    Lin, Q.3
  • 29
    • 0015008817
    • Effect of glottal pulse shape on the quality of natural vowels
    • 10.1121/1.1912389
    • Rosenberg AE: Effect of glottal pulse shape on the quality of natural vowels. J. Acoust. Soc. Am. 1971, 49:583-590.
    • (1971) J. Acoust. Soc. Am , vol.49 , pp. 583-590
    • Rosenberg, A.E.1
  • 30
    • 0025321354
    • Analysis, synthesis, and perception of voice quality variations among female and male talkers
    • Klatt DH, Klatt LC: Analysis, synthesis, and perception of voice quality variations among female and male talkers. J. Acoust. Soc. Am. 1990, 87:820-857. (Pubitemid 20129722)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.2 , pp. 820-857
    • Klatt, D.H.1    Klatt, L.C.2
  • 31
    • 0031972775
    • A computationally efficient alternative for the Liljencrants-Fant model and its perceptual evaluation
    • DOI 10.1121/1.421103
    • Veldhuis R: A computationally efficient alternative for the Liljencrants-Fant model and its perceptual evaluation. J. Acoust. Soc. Am. 1998, 103:566-571. (Pubitemid 28042430)
    • (1998) Journal of the Acoustical Society of America , vol.103 , Issue.1 , pp. 566-571
    • Veldhuis, R.1
  • 35
    • 0034945901
    • Sim - Simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals
    • DOI 10.1121/1.1379076
    • Frohlich M, Michaelis D, Strube HW: SIM-simultaneous inverse filtering and matching of a glottal flow model for acoustic speech signals. J. Acoust. Soc. Am. 2001, 110:479-488. (Pubitemid 32642587)
    • (2001) Journal of the Acoustical Society of America , vol.110 , Issue.1 , pp. 479-488
    • Frohlich, M.1    Michaelis, D.2    Strube, H.W.3
  • 38
    • 33745214458
    • Estimation of LF glottal source parameters based on an ARX model, in Proceedings of INTERSPEECH 2005 - EUROSPEECH, 9th European Conference on Speech Communication and Technology
    • September
    • Vincent D, Rosec O, Chonavel T: Estimation of LF glottal source parameters based on an ARX model, in Proceedings of INTERSPEECH 2005 - EUROSPEECH, 9th European Conference on Speech Communication and Technology. Lisbon September 2005, 4-8:333-336.
    • (2005) Lisbon , vol.4-8 , pp. 333-336
    • Vincent, D.1    Rosec, O.2    Chonavel, T.3
  • 40
    • 33745225154
    • Piecewise linear stylization of pitch via wavelet analysis, in Proceedings of INTERSPEECH 2005 - EUROSPEECH, 9th European Conference on Speech Communication and Technology
    • September
    • Wang D, Narayanan S: Piecewise linear stylization of pitch via wavelet analysis, in Proceedings of INTERSPEECH 2005 - EUROSPEECH, 9th European Conference on Speech Communication and Technology. Lisbon September 2005, 4-8:3277-3280.
    • (2005) Lisbon , vol.4-8 , pp. 3277-3280
    • Wang, D.1    Narayanan, S.2
  • 42
    • 52049109828
    • Linguistic Data Consortium (LDC) database LDC catalog no. LDC2002S28 ISBN 1-58563-237-6
    • Liberman M, Davis K, Grossman M, Martey N, Bell J: Emotional prosody speech and transcripts. LDC catalog no. LDC2002S28: Linguistic Data Consortium (LDC) database; 2007. http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2002S28 ISBN 1-58563-237-6
    • (2007) Emotional Prosody Speech and Transcripts
    • Liberman, M.1    Davis, K.2    Grossman, M.3    Martey, N.4    Bell, J.5
  • 48
    • 0242721417
    • Speech emotion recognition using hidden Markov models
    • 10.1016/S0167-6393(03)00099-2
    • Nwe TL, Foo SW, De Silva LC: Speech emotion recognition using hidden Markov models. Speech Communication 2003, 41:603-623.
    • (2003) Speech Communication , vol.41 , pp. 603-623
    • Nwe, T.L.1    Foo, S.W.2    De Silva, L.C.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.