SCOPUS 정보 검색 플랫폼

Signal Processing

Volumn 90, Issue 5, 2010, Pages 1415-1423

Emotion recognition from speech signals using new harmony features

(2) Yang, B a Lugger, M a

a UNIVERSITY OF STUTTGART (Germany)

Author keywords

Emotion recognition; Feature extraction; Harmony features; Pitch interval

Indexed keywords

EMOTION RECOGNITION; MUSIC THEORY; PITCH CONTOURS; RECOGNITION PERFORMANCE; SPEECH SIGNALS; STATE OF THE ART;

SPEECH RECOGNITION;

FEATURE EXTRACTION;

EID: 75249100219 PISSN: 01651684 EISSN: None Source Type: Journal
DOI: 10.1016/j.sigpro.2009.09.009 Document Type: Article

Times cited : (141)

References (54)

1
- 85032751766
- Emotion recognition in human-computer interaction
- Cowie R., et al. Emotion recognition in human-computer interaction. IEEE Signal Processing Magazine 18 (2001) 32-81
- (2001) IEEE Signal Processing Magazine , vol.18 , pp. 32-81
- Cowie, R.¹

2
- 0005504614
- Speakers and hearers are people: reflections on speech deterioration as a consequence of acquired deafness
- Spens K.E., and Plant G. (Eds), Wiley, New York
- Cowie R., and Douglas-Cowie E. Speakers and hearers are people: reflections on speech deterioration as a consequence of acquired deafness. In: Spens K.E., and Plant G. (Eds). Profound Deafness and Speech Communication (1995), Wiley, New York
- (1995) Profound Deafness and Speech Communication
- Cowie, R.¹ Douglas-Cowie, E.²

3
- 75249100570
- Paralinguistic phenomena
- Ammon U., Dittmar N., et al. (Eds), Walter de Gruyter, Berlin
- Traunmüller H. Paralinguistic phenomena. In: Ammon U., Dittmar N., et al. (Eds). Sociolinguistics: An International Handbook of the Science of Language and Society (2005), Walter de Gruyter, Berlin 653-665
- (2005) Sociolinguistics: An International Handbook of the Science of Language and Society , pp. 653-665
- Traunmüller, H.¹

4
- 0003557856
- Cambridge University Press, Cambridge
- Laver J. The Phonetic Description of Voice Quality (1980), Cambridge University Press, Cambridge
- (1980) The Phonetic Description of Voice Quality
- Laver, J.¹

5
- 36249021789
- An emotion-aware voice portal
- F. Burkhardt, M. van Ballegooy, et al., An emotion-aware voice portal, in: Electronic Speech Signal Processing Conference, 2005.
- (2005) Electronic Speech Signal Processing Conference
- Burkhardt, F.¹ van Ballegooy, M.²

6
- 70349199898
- Detecting real life anger
- F. Burkhardt, T. Polzehl, et al., Detecting real life anger, in: Proceedings of the IEEE ICASSP, 2009, pp. 4761-4764.
- (2009) Proceedings of the IEEE ICASSP , pp. 4761-4764
- Burkhardt, F.¹ Polzehl, T.²

7
- 38749092393
- INTERSPEECH
- L. Devillers, L. Vidrascu, Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs, in: INTERSPEECH, 2006.
- (2006) Real-life emotions detection with lexical and paralinguistic cues on human-human call center dialogs
- Devillers, L.¹ Vidrascu, L.²

8
- 33846952503
- Ensemble methods for spoken emotion recognition in call-centres
- Morrison D., Wang R., et al. Ensemble methods for spoken emotion recognition in call-centres. Speech communication 49 (2007) 98-112
- (2007) Speech communication , vol.49 , pp. 98-112
- Morrison, D.¹ Wang, R.²

9
- 85028812147
- You stupid tin box-children interacting with the AIBO robot: A cross-linguistic emotional speech corpus
- A. Batliner, et al., You stupid tin box-children interacting with the AIBO robot: a cross-linguistic emotional speech corpus, in: Proceedings of the Fourth International Conference of Language Resources and Evaluation, 2004, pp. 171-174.
- (2004) Proceedings of the Fourth International Conference of Language Resources and Evaluation , pp. 171-174
- Batliner, A.¹

10
- 34548015220
- The role of prosody in disambiguating potentially ambiguous utterances in English and Italian
- J. Hirschberg, C. Avesani, The role of prosody in disambiguating potentially ambiguous utterances in English and Italian, in: ESCA Workshop on Intonation, 1997.
- (1997) ESCA Workshop on Intonation
- Hirschberg, J.¹ Avesani, C.²

11
- 33745188470
- INTERSPEECH
- R. Gretter, D. Seppi, Using prosodic information for disambiguation purposes, in: INTERSPEECH, 2005, pp. 1821-1824.
- (2005) Using prosodic information for disambiguation purposes , pp. 1821-1824
- Gretter, R.¹ Seppi, D.²

12
- 38049133461
- Emotional aspects of intrinsic speech variabilities in automatic speech recognition
- M. Cernak, C. Wellekens, Emotional aspects of intrinsic speech variabilities in automatic speech recognition, in: International Conference on Speech and Computer, 2006, pp. 405-408.
- (2006) International Conference on Speech and Computer , pp. 405-408
- Cernak, M.¹ Wellekens, C.²

13
- 84971539709
- EUROSPEECH
- M. Schröder, Emotional speech synthesis: a review, in: EUROSPEECH, 2001, pp. 561-564.
- (2001) Emotional speech synthesis: A review , pp. 561-564
- Schröder, M.¹

14
- 85009089741
- EUROSPEECH
- M. Schröder, R. Cowie, et al., Acoustic correlates of emotion dimensions in view of speech synthesis, in: EUROSPEECH, 2001, pp. 87-90.
- (2001) Acoustic correlates of emotion dimensions in view of speech synthesis , pp. 87-90
- Schröder, M.¹ Cowie, R.²

15
- 0003596508
- The Guilford Press
- Lewis M., Haviland-Jones J., and Barrett L.F. Handbook of Emotions (2008), The Guilford Press
- (2008) Handbook of Emotions
- Lewis, M.¹ Haviland-Jones, J.² Barrett, L.F.³

16
- 84889960454
- An argument for basic emotions
- Ekman P. An argument for basic emotions. Cognition and Emotion 6 (1992) 169-200
- (1992) Cognition and Emotion , vol.6 , pp. 169-200
- Ekman, P.¹

17
- 58149453035
- Three dimensions of emotions
- Schlosberg H. Three dimensions of emotions. Psychological Review 61 (1954) 81-88
- (1954) Psychological Review , vol.61 , pp. 81-88
- Schlosberg, H.¹

18
- 0003603572
- Harper & Row, New York
- Plutchik R. Emotion: A Psychoevolutionary Synthesis (1980), Harper & Row, New York
- (1980) Emotion: A Psychoevolutionary Synthesis
- Plutchik, R.¹

19
- 85032752037
- Extracting moods from pictures and sounds: towards truly personalized TV
- Hanjalic A. Extracting moods from pictures and sounds: towards truly personalized TV. IEEE Signal Processing Magazine 23 (2006) 90-100
- (2006) IEEE Signal Processing Magazine , vol.23 , pp. 90-100
- Hanjalic, A.¹

20
- 0003922190
- Wiley, New York
- Duda R.O., Hart P.E., and Stork D.G. Pattern Classification. second ed. (2001), Wiley, New York
- (2001) Pattern Classification. second ed.
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

21
- 75249105202
- Psychological motivated multi-stage emotion classification exploiting voice quality features
- F. Mihelic, J. Zibert Eds, Chapter 22
- M. Lugger, B. Yang, Psychological motivated multi-stage emotion classification exploiting voice quality features, in: F. Mihelic, J. Zibert (Eds.), Speech Recognition, In-Tech, 2008 (Chapter 22).
- (2008) Speech Recognition, In-Tech
- Lugger, M.¹ Yang, B.²

22
- 0002686212
- Dimensions of emotional meaning in speech
- C. Pereira, Dimensions of emotional meaning in speech, in: ITRW on Speech and Emotion, 2000, pp. 25-28.
- (2000) ITRW on Speech and Emotion , pp. 25-28
- Pereira, C.¹

23
- 75249087923
- Perceiving anger and joy in speech through the size code
- Y. Xu, S. Chuenwattanapranithi, Perceiving anger and joy in speech through the size code, in: Proceedings of the International Conference on Phonetic Sciences, 2007, pp. 2105-2108.
- (2007) Proceedings of the International Conference on Phonetic Sciences , pp. 2105-2108
- Xu, Y.¹ Chuenwattanapranithi, S.²

24
- 0008618803
- Royal Swedish Academy of Music
- I. Fonagy, Emotions voice and music, Royal Swedish Academy of Music, 1981, pp. 51-79.
- (1981) Emotions voice and music , pp. 51-79
- Fonagy, I.¹

25
- 34547493864
- Emotion recognition in the noise applying large acoustic feature sets
- Dresden
- B. Schuller, D. Arsic, et al., Emotion recognition in the noise applying large acoustic feature sets, in: Speech Prosody, Dresden, 2006.
- (2006) Speech Prosody
- Schuller, B.¹ Arsic, D.²

26
- 0004094721
- MIT Press, Cambridge, MA
- Schölkopf B., and Smola A. Learning with Kernels (2002), MIT Press, Cambridge, MA
- (2002) Learning with Kernels
- Schölkopf, B.¹ Smola, A.²

27
- 0003450542
- Springer, Berlin
- Vapnik V.N. The Nature of Statistical Learning Theory (2000), Springer, Berlin
- (2000) The Nature of Statistical Learning Theory
- Vapnik, V.N.¹

28
- 0141478857
- Hidden Markov model-based speech emotion recognition
- B. Schuller, G. Rigoll, M. Lang, Hidden Markov model-based speech emotion recognition, in: Proceedings of the IEEE ICASSP, 2003.
- (2003) Proceedings of the IEEE ICASSP
- Schuller, B.¹ Rigoll, G.² Lang, M.³

29
- 84902658348
- Extracting voice quality contours using discrete hidden Markov models
- M. Lugger, B. Yang, Extracting voice quality contours using discrete hidden Markov models, in: Proceedings of the Speech Prosody, 2008.
- (2008) Proceedings of the Speech Prosody
- Lugger, M.¹ Yang, B.²

30
- 0038103326
- Fundamentals of Statistical Signal Processing
- Prentice-Hall, Englewood Cliffs, NJ
- Kay S.M. Fundamentals of Statistical Signal Processing. Detection Theory Vol. 2 (1998), Prentice-Hall, Englewood Cliffs, NJ
- (1998) Detection Theory , vol.2
- Kay, S.M.¹

31
- 0016355478
- A new look at the statistical model identification
- Akaike H. A new look at the statistical model identification. IEEE Transactions on Automatic Control 19 (1974) 716-723
- (1974) IEEE Transactions on Automatic Control , vol.19 , pp. 716-723
- Akaike, H.¹

32
- 14644422971
- Wiley, New York
- Kuncheva L.I. Combining Pattern Classifiers Methods and Algorithms (2004), Wiley, New York
- (2004) Combining Pattern Classifiers Methods and Algorithms
- Kuncheva, L.I.¹

33
- 33750733121
- Emotional speech classification using Gaussian mixture models
- D. Ververidis, C. Kotropoulos, Emotional speech classification using Gaussian mixture models, in: IEEE International Symposium on Circuits and Systems, 2005, pp. 2871-2874.
- (2005) IEEE International Symposium on Circuits and Systems , pp. 2871-2874
- Ververidis, D.¹ Kotropoulos, C.²

34
- 75249092878
- Combining classifiers with diverse feature sets for robust speaker independent emotion recognition
- M. Lugger, B. Yang, Combining classifiers with diverse feature sets for robust speaker independent emotion recognition, in: Proceedings of the EUSIPCO, 2009.
- (2009) Proceedings of the EUSIPCO
- Lugger, M.¹ Yang, B.²

35
- 84890445089
- Overfitting in making comparisons between variable selection methods
- Reunanen J. Overfitting in making comparisons between variable selection methods. Journal of Machine Learning Research 3 (2003) 1371-1382
- (2003) Journal of Machine Learning Research , vol.3 , pp. 1371-1382
- Reunanen, J.¹

36
- 85115260483
- Floating search methods for feature selection with nonmonotonic criterion
- Pudil P., Ferri F.J., et al. Floating search methods for feature selection with nonmonotonic criterion. Pattern Recognition-Conference B: Computer Vision 2 (1994) 279-283
- (1994) Pattern Recognition-Conference B: Computer Vision , vol.2 , pp. 279-283
- Pudil, P.¹ Ferri, F.J.²

37
- 0242721417
- Speech emotion recognition using hidden Markov models
- Nwe T., Foo S., and Silva L.D. Speech emotion recognition using hidden Markov models. Speech communication 41 (2003) 603-623
- (2003) Speech communication , vol.41 , pp. 603-623
- Nwe, T.¹ Foo, S.² Silva, L.D.³

38
- 34547496515
- The relevance of voice quality features in speaker independent emotion recognition
- M. Lugger, B. Yang, The relevance of voice quality features in speaker independent emotion recognition, in: Proceedings of the IEEE ICASSP, vol. 4, 2007, pp. 17-20.
- (2007) Proceedings of the IEEE ICASSP , vol.4 , pp. 17-20
- Lugger, M.¹ Yang, B.²

39
- 51449108623
- Cascaded emotion classification via psychological emotion dimensions using a large set of voice quality parameters
- M. Lugger, B. Yang, Cascaded emotion classification via psychological emotion dimensions using a large set of voice quality parameters, in: Proceedings of the IEEE ICASSP, 2008, pp. 4945-4948.
- (2008) Proceedings of the IEEE ICASSP , pp. 4945-4948
- Lugger, M.¹ Yang, B.²

40
- 21244491419
- D. Talkin, W. Kleijn, K. Paliwal, A robust algorithm for pitch tracking (RAPT), Speech Coding and Synthesis (1995) 495-518.
- (1995) A robust algorithm for pitch tracking (RAPT), Speech Coding and Synthesis , pp. 495-518
- Talkin, D.¹ Kleijn, W.² Paliwal, K.³

41
- 14644439843
- Toward detecting emotions in spoken dialogs
- Lee C.M., and Narayanan S.S. Toward detecting emotions in spoken dialogs. IEEE Transactions on Speech and Audio Processing 13 (2005) 293-303
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , pp. 293-303
- Lee, C.M.¹ Narayanan, S.S.²

42
- 0030093965
- Acoustic profiles in vocal emotion expression
- Banse R., and Scherer K.R. Acoustic profiles in vocal emotion expression. Journal of Personality and Social Psychology 70 (1996) 614-636
- (1996) Journal of Personality and Social Psychology , vol.70 , pp. 614-636
- Banse, R.¹ Scherer, K.R.²

43
- 0003418124
- The Hague, Mouton
- G. Fant, Acoustic Theory of Speech Production, The Hague, Mouton, 1970.
- (1970) Acoustic Theory of Speech Production
- Fant, G.¹

44
- 0000547455
- Classification of glottal vibration from acoustic measurements
- Fujimura O., and Hirano M. (Eds), Hiltop University Press
- Stevens K., and Hanson H. Classification of glottal vibration from acoustic measurements. In: Fujimura O., and Hirano M. (Eds). Vocal Fold Physiology (1994), Hiltop University Press 147-170
- (1994) Vocal Fold Physiology , pp. 147-170
- Stevens, K.¹ Hanson, H.²

45
- 0003421022
- Dover Publications, New York
- Helmholtz H.L.F. On the Sensations of Tone as a Physiological Basis for the Theory of Music. second ed. (1877), Dover Publications, New York
- (1877) On the Sensations of Tone as a Physiological Basis for the Theory of Music. second ed.
- Helmholtz, H.L.F.¹

46
- 0000082194
- Tonal consonance and critical bandwidth
- Plomp R., and Levelt J.M. Tonal consonance and critical bandwidth. Journal of the Acoustical Society of America 38 (1965) 548-560
- (1965) Journal of the Acoustical Society of America , vol.38 , pp. 548-560
- Plomp, R.¹ Levelt, J.M.²

47
- 0000241263
- Frequency ratios and the perception of done patterns
- G. Schellenberg, S. Trehub, Frequency ratios and the perception of done patterns, Psychonomic Bulletin & Review (1994) 191-201.
- (1994) Psychonomic Bulletin & Review , pp. 191-201
- Schellenberg, G.¹ Trehub, S.²

48
- 0042125529
- The statistical structure of human speech sounds predicts musical universals
- D. Schwartz, C. Howe, D. Purves, The statistical structure of human speech sounds predicts musical universals, The Journal of Neuroscience (2003) 7160-7168.
- (2003) The Journal of Neuroscience , pp. 7160-7168
- Schwartz, D.¹ Howe, C.² Purves, D.³

49
- 85099848325
- ICMC
- T. Fujishima, Real-time chord recognition of musical sound: a system using common lisp music, in: ICMC, 1999, pp. 464-467.
- (1999) Real-time chord recognition of musical sound: A system using common lisp music , pp. 464-467
- Fujishima, T.¹

50
- 0033338098
- G.H. Wakefield, Mathematical representation of joint time-chroma distribution, in: SPIE, 3807, 1999.
- G.H. Wakefield, Mathematical representation of joint time-chroma distribution, in: SPIE, vol. 3807, 1999.

51
- 33847034601
- The psychophysics of harmony perception: harmony is a three-tone phenomenon
- Cook N.D., and Fujidawa T.X. The psychophysics of harmony perception: harmony is a three-tone phenomenon. Empirical Musicology Review 1 2 (2006) 106-126
- (2006) Empirical Musicology Review , vol.1 , Issue.2 , pp. 106-126
- Cook, N.D.¹ Fujidawa, T.X.²

52
- 4544315904
- A state of the art review of emotional speech databases
- D. Ververidis, C. Kotropoulos, A state of the art review of emotional speech databases, in: Proceedings of the First Richmedia Conference, 2003.
- (2003) Proceedings of the First Richmedia Conference
- Ververidis, D.¹ Kotropoulos, C.²

53
- 33745202280
- A database of German emotional speech
- F. Burkhardt, A. Paeschke, et al., A database of German emotional speech, in: Proceedings of the Interspeech, 2005, pp. 1517-1520.
- (2005) Proceedings of the Interspeech , pp. 1517-1520
- Burkhardt, F.¹ Paeschke, A.²

54
- 70450136545
- An incremental analysis of different feature groups in speaker independent emotion recognition
- M. Lugger, B. Yang, An incremental analysis of different feature groups in speaker independent emotion recognition, in: Proceedings of the International Conference on Phonetic Sciences, 2007, pp. 2149-2152.
- (2007) Proceedings of the International Conference on Phonetic Sciences , pp. 2149-2152
- Lugger, M.¹ Yang, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.