



Volume 7, Issue 2, 2016, Pages 190-202

The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing

(11)  Eyben, Florian a,b,c   Scherer, Klaus R c,d   Schuller, Bjorn W a,e,f   Sundberg, Johan g   Andre, Elisabeth h   Busso, Carlos i   Devillers, Laurence Y j   Epps, Julien k,l   Laukka, Petri m   Narayanan, Shrikanth S n   Truong, Khiet P o  


Author keywords

Acoustic Features; Affective Computing; Emotion Recognition; Geneva Minimalistic Parameter Set; Speech Analysis; Standard

Indexed keywords

ELASTIC WAVES; PARAMETER ESTIMATION; REGULATORY COMPLIANCE; SPEECH ANALYSIS; STANDARDS;

EID: 84973513831     PISSN: 19493045     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAFFC.2015.2457417     Document Type: Article
Times cited : (1343)

References (68)
  • 1
    • 0022688124
    • Vocal affect expression: A review and a model for future research
    • K. R. Scherer, "Vocal affect expression: A review and a model for future research," Psychol. Bull., vol. 99, pp. 143-165, 1986.
    • (1986) Psychol. Bull. , vol.99 , pp. 143-165
    • Scherer, K.R.1
  • 2
    • 0030093965
    • Acoustic profiles in vocal emotion expression
    • R. Banse and K. R. Scherer, "Acoustic profiles in vocal emotion expression," J. Personality Soc. Psychol., vol. 70, no. 3, pp. 614-636, 1996.
    • (1996) J. Personality Soc. Psychol. , vol.70 , Issue.3 , pp. 614-636
    • Banse, R.1    Scherer, K.R.2
  • 3
    • 0012833861
    • Impact of intended emotion intensity on cue utilization and decoding accuracy in vocal expression of emotion
    • P. N. Juslin and P. Laukka, "Impact of intended emotion intensity on cue utilization and decoding accuracy in vocal expression of emotion," Emotion, vol. 1, pp. 381-412, 2001.
    • (2001) Emotion , vol.1 , pp. 381-412
    • Juslin, P.N.1    Laukka, P.2
  • 5
    • 37349079113
    • Critical analysis of the impact of glottal features in the classification of clinical depression in speech
    • Jan.
    • E. Moore, M. Clements, J. Peifer, L. Weisser, "Critical analysis of the impact of glottal features in the classification of clinical depression in speech," IEEE Trans. Biomed. Eng., vol. 55, no. 1, pp. 96-107, Jan. 2008.
    • (2008) IEEE Trans. Biomed. Eng. , vol.55 , Issue.1 , pp. 96-107
    • Moore, E.1    Clements, M.2    Peifer, J.3    Weisser, L.4
  • 6
    • 65249116503
    • Analysis of emotionally salient aspects of fundamental frequency for emotion detection
    • May
    • C. Busso, S. Lee, S. Narayanan, "Analysis of emotionally salient aspects of fundamental frequency for emotion detection," IEEE Trans. Audio, Speech Language Process., vol. 17, no. 4, pp. 582-596, May 2009.
    • (2009) IEEE Trans. Audio, Speech Language Process. , vol.17 , Issue.4 , pp. 582-596
    • Busso, C.1    Lee, S.2    Narayanan, S.3
  • 7
    • 80054843364
    • Interdependencies among voice source parameters in emotional speech
    • Jul.-Sep.
    • J. Sundberg, S. Patel, E. Bjorkner, K. R. Scherer, "Interdependencies among voice source parameters in emotional speech," IEEE Trans. Affective Comput., vol. 2, no. 3, pp. 162-174, Jul.-Sep. 2011.
    • (2011) IEEE Trans. Affective Comput. , vol.2 , Issue.3 , pp. 162-174
    • Sundberg, J.1    Patel, S.2    Bjorkner, E.3    Scherer, K.R.4
  • 9
    • 0141764789
    • Communication of emotions in vocal expression and music performance: Different channels, same code
    • Sep.
    • P. N. Juslin and P. Laukka, "Communication of emotions in vocal expression and music performance: Different channels, same code," Psychol. Bull., vol. 129, no. 5, pp. 770-814, Sep. 2003.
    • (2003) Psychol. Bull. , vol.129 , Issue.5 , pp. 770-814
    • Juslin, P.N.1    Laukka, P.2
  • 10
    • 84973512430
    • Berlin, Germany: Mouton-DeGruyter
    • S. Patel and K. R. Scherer, Vocal Behaviour. Berlin, Germany: Mouton-DeGruyter, 2013, pp. 167-204.
    • (2013) Vocal Behaviour , pp. 167-204
    • Patel, S.1    Scherer, K.R.2
  • 11
    • 4444257069
    • Praat, a system for doing phonetics by computer
    • P. Boersma, "Praat, a system for doing phonetics by computer," Glot Int., vol. 5, nos. 9/10, pp. 341-345, 2001.
    • (2001) Glot Int. , vol.5 , Issue.9-10 , pp. 341-345
    • Boersma, P.1
  • 14
    • 84893282872
    • Synthesis of emotional speech
    • K. R. Scherer, T. Banziger, E. Roesch, Eds. Oxford, U.K.: Oxford Univ. Press
    • M. Schroder, F. Burkhardt, S. Krstulovic, "Synthesis of emotional speech," in Blueprint for Affective Computing, K. R. Scherer, T. Banziger, E. Roesch, Eds. Oxford, U.K.: Oxford Univ. Press, 2010, pp. 222-231.
    • (2010) Blueprint for Affective Computing , pp. 222-231
    • Schroder, M.1    Burkhardt, F.2    Krstulovic, S.3
  • 15
    • 0037384712
    • Vocal communication of emotion: A review of research paradigms
    • K. R. Scherer, "Vocal communication of emotion: A review of research paradigms," Speech Commun., vol. 40, pp. 227-256, 2003.
    • (2003) Speech Commun. , vol.40 , pp. 227-256
    • Scherer, K.R.1
  • 16
    • 34548124536
    • Acoustical correlates of affective prosody
    • K. Hammerschmidt and U. Jurgens, "Acoustical correlates of affective prosody," J. Voice, vol. 21, pp. 531-540, 2007.
    • (2007) J. Voice , vol.21 , pp. 531-540
    • Hammerschmidt, K.1    Jurgens, U.2
  • 17
    • 77956412938
    • Beyond arousal: Valence and potency/control cues in the vocal expression of emotion
    • M. Goudbeek and K. R. Scherer, "Beyond arousal: Valence and potency/control cues in the vocal expression of emotion," J. Acoust. Soc. Amer., vol. 128, pp. 1322-1336, 2010.
    • (2010) J. Acoust. Soc. Amer. , vol.128 , pp. 1322-1336
    • Goudbeek, M.1    Scherer, K.R.2
  • 19
    • 84860573746
    • Emotion appraisal dimensions can be inferred from vocal expressions
    • P. Laukka and H. A. Elfenbein, "Emotion appraisal dimensions can be inferred from vocal expressions," Soc. Psychol. Personality Sci., vol. 3, pp. 529-536, 2012.
    • (2012) Soc. Psychol. Personality Sci. , vol.3 , pp. 529-536
    • Laukka, P.1    Elfenbein, H.A.2
  • 22
    • 84905716338
    • Robust unsupervised arousal rating: A rule-based framework with knowledge-inspired vocal features
    • Apr.-Jun.
    • D. Bone, C.-C. Lee, S. Narayanan, "Robust unsupervised arousal rating: A rule-based framework with knowledge-inspired vocal features," IEEE Trans. Affective Comput., vol. 5, no. 2, pp. 201-213, Apr.-Jun. 2014.
    • (2014) IEEE Trans. Affective Comput. , vol.5 , Issue.2 , pp. 201-213
    • Bone, D.1    Lee, C.-C.2    Narayanan, S.3
  • 24
    • 84878925980
    • On the acoustics of emotion in audio: What speech, music and sound have in common
    • May
    • F. Weninger, F. Eyben, B. W. Schuller, M. Mortillaro, K. R. Scherer, "On the acoustics of emotion in audio: What speech, music and sound have in common," Frontiers Psychol., vol. 4, no. Article ID 292, pp. 1-12, May 2013.
    • (2013) Frontiers Psychol. , vol.4 , pp. 1-12
    • Weninger, F.1    Eyben, F.2    Schuller, B.W.3    Mortillaro, M.4    Scherer, K.R.5
  • 25
    • 84903770147
    • Affect recognition in Real-life acoustic conditions-A new perspective on feature selection
    • Aug.
    • F. Eyben, F. Weninger, B. Schuller, "Affect recognition in Real-life acoustic conditions-A new perspective on feature selection," in Proc. Annu. Conf. Int. Speech Commun. Assoc., Aug. 2013, pp. 2044-2048.
    • (2013) Proc. Annu. Conf. Int. Speech Commun. Assoc. , pp. 2044-2048
    • Eyben, F.1    Weninger, F.2    Schuller, B.3
  • 26
    • 80051627806
    • Acoustic measures characterizing anger across corpora collected in artificial or natural context
    • Chicago, IL, USA
    • M. Tahon and L. Devillers, "Acoustic measures characterizing anger across corpora collected in artificial or natural context," in Proc. Speech Prosody, Chicago, IL, USA, 2010.
    • (2010) Proc. Speech Prosody
    • Tahon, M.1    Devillers, L.2
  • 27
  • 28
    • 84865756228
    • Phonologically-based biomarkers for major depressive disorder
    • A. C. Trevino, T. F. Quatieri, N. Malyska, "Phonologically-based biomarkers for major depressive disorder," EURASIP J. Adv. Signal Process., vol. 2011, no. 42, pp. 1-18, 2011.
    • (2011) EURASIP J. Adv. Signal Process. , vol.2011 , Issue.42 , pp. 1-18
    • Trevino, A.C.1    Quatieri, T.F.2    Malyska, N.3
  • 30
    • 79952364323
    • Investigation of spectral centroid features for cognitive load classification
    • P. Le, E. Ambikairajah, J. Epps, V. Sethu, E. H. C. Choi, "Investigation of spectral centroid features for cognitive load classification," Speech Commun., vol. 54, no. 4, pp. 540-551, 2011.
    • (2011) Speech Commun. , vol.54 , Issue.4 , pp. 540-551
    • Le, P.1    Ambikairajah, E.2    Epps, J.3    Sethu, V.4    Choi, E.H.C.5
  • 32
    • 33746410556
    • Emotional speech recognition: Resources, features, methods
    • Sep.
    • D. Ververidis and C. Kotropoulos, "Emotional speech recognition: Resources, features, methods," Speech Commun., vol. 48, no. 9, pp. 1162-1181, Sep. 2006.
    • (2006) Speech Commun. , vol.48 , Issue.9 , pp. 1162-1181
    • Ververidis, D.1    Kotropoulos, C.2
  • 34
    • 84879002588
    • Towards a standard set of acoustic features for the processing of emotion in speech
    • Jul.
    • F. Eyben, A. Batliner, B. Schuller, "Towards a standard set of acoustic features for the processing of emotion in speech," Proc. Meetings Acoust., vol. 9, no. 1, pp. 1-12, Jul. 2012.
    • (2012) Proc. Meetings Acoust. , vol.9 , Issue.1 , pp. 1-12
    • Eyben, F.1    Batliner, A.2    Schuller, B.3
  • 39
    • 84887104454
    • On the use of speech parameter contours for emotion recognition
    • V. Sethu, E. Ambikairajah, J. Epps, "On the use of speech parameter contours for emotion recognition," EURASIP J. Audio, Speech Music Process., vol. 2013, no. 1, pp. 1-14, 2013.
    • (2013) EURASIP J. Audio, Speech Music Process. , vol.2013 , Issue.1 , pp. 1-14
    • Sethu, V.1    Ambikairajah, E.2    Epps, J.3
  • 40
    • 70450185591
    • Recognising Interest in Conversational Speech-Comparing bag of frames and supra-segmental features
    • Sep.
    • B. Schuller and G. Rigoll, "Recognising Interest in Conversational Speech-Comparing bag of frames and supra-segmental features," in Proc. 10th Annu. Conf. Int. Speech Commun. Assoc., Sep. 2009, pp. 1999-2002.
    • (2009) Proc. 10th Annu. Conf. Int. Speech Commun. Assoc , pp. 1999-2002
    • Schuller, B.1    Rigoll, G.2
  • 49
    • 84874471178
    • Introducing the Geneva multimodal expression corpus for experimental research on emotion perception
    • T. Banziger, M. Mortillaro, K. R. Scherer, "Introducing the Geneva multimodal expression corpus for experimental research on emotion perception," Emotion, vol. 12, no. 5, pp. 1161-1179, 2012.
    • (2012) Emotion , vol.12 , Issue.5 , pp. 1161-1179
    • Banziger, T.1    Mortillaro, M.2    Scherer, K.R.3
  • 50
    • 84908510789
    • Comparing the acoustic expression of emotion in the speaking and the singing voice
    • Jan.
    • K. R. Scherer, J. Sundberg, L. Tamarit, G. L. Salomão, "Comparing the acoustic expression of emotion in the speaking and the singing voice," Comput. Speech Language, vol. 29, no. 1, pp. 218-235, Jan. 2015.
    • (2015) Comput. Speech Language , vol.29 , Issue.1 , pp. 218-235
    • Scherer, K.R.1    Sundberg, J.2    Tamarit, L.3    Salomao, G.L.4
  • 51
    • 54049132925
    • The Vera am Mittag German audio-visual emotional speech database
    • Hannover, Germany
    • M. Grimm, K. Kroschel, S. Narayanan, "The Vera am Mittag German audio-visual emotional speech database," in Proc. IEEE Int. Conf. Multimedia Expo, Hannover, Germany, 2008, pp. 865-868.
    • (2008) Proc. IEEE Int. Conf. Multimedia Expo , pp. 865-868
    • Grimm, M.1    Kroschel, K.2    Narayanan, S.3
  • 52
    • 34547940048
    • Primitives based estimation and evaluation of emotions in speech
    • M. Grimm, E. Mower, K. Kroschel, S. Narayanan, "Primitives based estimation and evaluation of emotions in speech," Speech Commun., vol. 49, pp. 787-800, 2007.
    • (2007) Speech Commun. , vol.49 , pp. 787-800
    • Grimm, M.1    Mower, E.2    Kroschel, K.3    Narayanan, S.4
  • 53
    • 58149453035
    • Three dimensions of emotion
    • H. Schlosberg, "Three dimensions of emotion," Psychol. Rev., vol. 61, pp. 81-88, 1954.
    • (1954) Psychol. Rev. , vol.61 , pp. 81-88
    • Schlosberg, H.1
  • 58
    • 79953328709
    • Mapping emotions into acoustic space: The role of voice production
    • S. Patel, K. R. Scherer, J. Sundberg, E. Bjorkner, "Mapping emotions into acoustic space: The role of voice production," Biol. Psychol., vol. 87, pp. 93-98, 2011.
    • (2011) Biol. Psychol. , vol.87 , pp. 93-98
    • Patel, S.1    Scherer, K.R.2    Sundberg, J.3    Bjorkner, E.4
  • 59
    • 84887494391
    • Recent developments in openSMILE, the Munich open-source multimedia feature extractor
    • F. Eyben, F. Weninger, F. Gross, B. Schuller, "Recent developments in openSMILE, the Munich open-source multimedia feature extractor," in Proc. 21st ACM Int. Conf. Multimedia, 2013, pp. 835-838.
    • (2013) Proc. 21st ACM Int. Conf. Multimedia , pp. 835-838
    • Eyben, F.1    Weninger, F.2    Gross, F.3    Schuller, B.4
  • 60
    • 0023833270
    • Measurement of pitch by subharmonic summation
    • D. J. Hermes, "Measurement of pitch by subharmonic summation," J. Acoust. Soc. Amer., vol. 83, no. 1, pp. 257-264, 1988.
    • (1988) J. Acoust. Soc. Amer. , vol.83 , Issue.1 , pp. 257-264
    • Hermes, D.J.1
  • 63
    • 0025041264
    • Perceptual linear predictive (PLP) analysis for speech
    • H. Hermansky, "Perceptual linear predictive (PLP) analysis for speech," J. Acoust. Soc. Amer., vol. 87, pp. 1738-1752, 1990.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , pp. 1738-1752
    • Hermansky, H.1
  • 66
    • 84959118677
    • Acoustic markers of emotions based on voice physiology
    • Chicago, IL, USA
    • S. Patel, K. R. Scherer, J. Sundberg, E. Bjorkner, "Acoustic markers of emotions based on voice physiology," in Proc. Speech Prosody, Chicago, IL, USA, 2010, pp. 1-4.
    • (2010) Proc. Speech Prosody , pp. 1-4
    • Patel, S.1    Scherer, K.R.2    Sundberg, J.3    Bjorkner, E.4
  • 67
    • 0016495091
    • Linear prediction: A tutorial review
    • Apr.
    • J. Makhoul, "Linear prediction: A tutorial review," Proc. IEEE, vol. 63, no. 5, pp. 561-580, Apr. 1975.
    • (1975) Proc. IEEE , vol.63 , Issue.5 , pp. 561-580
    • Makhoul, J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.