메뉴 건너뛰기




Volumn 41, Issue 4, 2003, Pages 603-623

Speech emotion recognition using hidden Markov models

Author keywords

Emotional speech; Hidden Markov model; Human communication; Log frequency power coefficients; Recognition of emotion

Indexed keywords

ALGORITHMS; DATABASE SYSTEMS; MARKOV PROCESSES; SPEECH RECOGNITION;

EID: 0242721417     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(03)00099-2     Document Type: Article
Times cited : (803)

References (62)
  • 1
    • 0009937094 scopus 로고
    • Emotion and Personality
    • Columbia University Press, New York
    • Arnold, M.B., 1960. Emotion and Personality. Physiological Aspects, Vol. 2. Columbia University Press, New York.
    • (1960) Physiological Aspects , vol.2
    • Arnold, M.B.1
  • 2
    • 0016067897 scopus 로고
    • Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification
    • Atal B.S. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. J. Acoust. Soc. Amer. 55(6):1974;1304-1312.
    • (1974) J. Acoust. Soc. Amer. , vol.55 , Issue.6 , pp. 1304-1312
    • Atal, B.S.1
  • 4
    • 0002515370 scopus 로고
    • The generation of affect in synthesized speech
    • Cahn J.E. The generation of affect in synthesized speech. J. Amer. Voice I/O Soc. 8:1990;1-19.
    • (1990) J. Amer. Voice I/O Soc. , vol.8 , pp. 1-19
    • Cahn, J.E.1
  • 5
    • 0028630509 scopus 로고
    • Nonlinear analysis and classification of speech under stressed conditions
    • Cairns D.A., Hansen J.H.L. Nonlinear analysis and classification of speech under stressed conditions. J. Acoust. Soc. Amer. 96(6):1994;3392-3400.
    • (1994) J. Acoust. Soc. Amer. , vol.96 , Issue.6 , pp. 3392-3400
    • Cairns, D.A.1    Hansen, J.H.L.2
  • 6
    • 64549113519 scopus 로고
    • Identification of emotional states using perceptual and acoustic analyses
    • Lawrence, V., Weinberg, B. (Eds.), The Voice Foundation, New York
    • Coleman, R., Williams, R., 1979. Identification of emotional states using perceptual and acoustic analyses. In: Lawrence, V., Weinberg, B. (Eds.), Care of the Professional Voice, Vol. 1. The Voice Foundation, New York.
    • (1979) Care of the Professional Voice , vol.1
    • Coleman, R.1    Williams, R.2
  • 8
    • 0005490789 scopus 로고
    • Pitch and Intensity Characteristics of Stage of Speech
    • suppl. to Dec. issue
    • Cowan, M., 1936. Pitch and Intensity Characteristics of Stage of Speech. Arch. Speech, suppl. to Dec. issue.
    • (1936) Arch. Speech
    • Cowan, M.1
  • 14
    • 0019053271 scopus 로고
    • Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
    • Davis S.B., Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust. Speech Signal Process. 28:1980;357-366.
    • (1980) IEEE Trans. Acoust. Speech Signal Process. , vol.28 , pp. 357-366
    • Davis, S.B.1    Mermelstein, P.2
  • 16
    • 0031650643 scopus 로고    scopus 로고
    • Use of multimodal information in facial emotion recognition
    • De Silva L.C., Miyasato T., Nakatsu R. Use of multimodal information in facial emotion recognition. IEICE Trans. Inf. Syst. E81-D(1):1998;105-114.
    • (1998) IEICE Trans. Inf. Syst. , vol.E81-D , Issue.1 , pp. 105-114
    • De Silva, L.C.1    Miyasato, T.2    Nakatsu, R.3
  • 21
    • 0037906016 scopus 로고
    • New Statistical Methods for Assigning Device Tolerances
    • 1975, Newton, MA, USA
    • Elias, N.J., 1975. New Statistical Methods for Assigning Device Tolerances. Proc. IEEE Int. SYmp. Ccts. Sys., 1975, Newton, MA, USA, pp. 329-332.
    • (1975) Proc. IEEE Int. SYmp. Ccts. Sys. , pp. 329-332
    • Elias, N.J.1
  • 22
    • 0024752328 scopus 로고
    • A new vector quantization clustering algorithm
    • Equitz W.H. A new vector quantization clustering algorithm. IEEE Trans. Acoust. Speech Signal Process. 37(10):1989;1568-1575.
    • (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , Issue.10 , pp. 1568-1575
    • Equitz, W.H.1
  • 23
    • 0001988169 scopus 로고
    • An experimental study of the pitch characteristics of the voice during the expression of emotion
    • Fairbanks G., Pronovost W. An experimental study of the pitch characteristics of the voice during the expression of emotion. Speech Monograph. 6:1939;87-104.
    • (1939) Speech Monograph , vol.6 , pp. 87-104
    • Fairbanks, G.1    Pronovost, W.2
  • 24
    • 0017795674 scopus 로고
    • A new method of investigating the perception of prosodic features
    • Fonagy I. A new method of investigating the perception of prosodic features. Language and Speech. 21:1978;34-49.
    • (1978) Language and Speech , vol.21 , pp. 34-49
    • Fonagy, I.1
  • 27
    • 0026209914 scopus 로고
    • If it's not left it's right
    • Fox N.A. If it's not left it's right. Amer. Psychol. 46:1992;863-872.
    • (1992) Amer. Psychol. , vol.46 , pp. 863-872
    • Fox, N.A.1
  • 28
    • 0000989095 scopus 로고
    • Communicating Emotion: The Role of Prosodic Features
    • Frick R. Communicating Emotion: The Role of Prosodic Features. Psychol. Bull. 97(3):1985;412-429.
    • (1985) Psychol. Bull. , vol.97 , Issue.3 , pp. 412-429
    • Frick, R.1
  • 30
    • 0018345765 scopus 로고
    • Changes of the voice expression during suggestively influenced states of experiencing
    • Havrdova Z., Moravek M. Changes of the voice expression during suggestively influenced states of experiencing. Activitas Nervosa Superior. 21:1979;33-35.
    • (1979) Activitas Nervosa Superior , vol.21 , pp. 33-35
    • Havrdova, Z.1    Moravek, M.2
  • 31
    • 0014325315 scopus 로고
    • Relations between prosodic variables and emotions in normal american english utterances
    • Huttar G.L. Relations between prosodic variables and emotions in normal american english utterances. J. Speech Hearing Res. 11:1968;481-487.
    • (1968) J. Speech Hearing Res. , vol.11 , pp. 481-487
    • Huttar, G.L.1
  • 33
    • 0000606530 scopus 로고
    • Communication of affects by single vowels
    • Kaiser L. Communication of affects by single vowels. Synthese. 14:1962;300-319.
    • (1962) Synthese , vol.14 , pp. 300-319
    • Kaiser, L.1
  • 34
    • 0016954139 scopus 로고
    • Acoustic correlates of the emotional content of vocalized speech
    • Kotlyar G., Mozorov V. Acoustic correlates of the emotional content of vocalized speech. J. Acoust. Acad. Sci. USSR. 22:1976;208-211.
    • (1976) J. Acoust. Acad. Sci. USSR , vol.22 , pp. 208-211
    • Kotlyar, G.1    Mozorov, V.2
  • 36
    • 0024768209 scopus 로고
    • Speaker-independent phone recognition using Hidden Markov Models
    • Lee K.F., Hon H.W. Speaker-independent phone recognition using Hidden Markov Models. IEEE Trans. Acoust. Speech Signal Process. 37(11):1989;1641-1648.
    • (1989) IEEE Trans. Acoust. Speech Signal Process. , vol.37 , Issue.11 , pp. 1641-1648
    • Lee, K.F.1    Hon, H.W.2
  • 37
    • 0002792685 scopus 로고
    • A phonophotographic study of trained and untrained voices reading factual and dramatic material
    • Lynch G.E. A phonophotographic study of trained and untrained voices reading factual and dramatic material. Arch. Speech. 1:1934;9-25.
    • (1934) Arch. Speech , vol.1 , pp. 9-25
    • Lynch, G.E.1
  • 38
    • 0001677777 scopus 로고
    • Prosodic signs of emotion in speech: Preliminary results from a new technique for automatic statistical analysis
    • Stockholm, Sweden
    • McGilloway, S., Cowie, R., Douglas-Cowie, E., 1995. Prosodic signs of emotion in speech: preliminary results from a new technique for automatic statistical analysis. In: Proc. XIIIth Int. Congr. Phonetic Sciences, Vol. 1. Stockholm, Sweden, pp. 250-253.
    • (1995) Proc. XIIIth Int. Congr. Phonetic Sciences , vol.1 , pp. 250-253
    • McGilloway, S.1    Cowie, R.2    Douglas-Cowie, E.3
  • 41
    • 0027447292 scopus 로고
    • Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion
    • Murray I.R., Arnott J.L. Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. J. Acoust. Soc. Amer. 93(2):1993;1097-1108.
    • (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.2 , pp. 1097-1108
    • Murray, I.R.1    Arnott, J.L.2
  • 43
    • 0002798665 scopus 로고
    • Communicative theory of emotions: Empirical test, mental models and implications for social interaction
    • L. Martin, & A. Tessler. Hillsdale, NJ: Erlbaum
    • Oatley K., Johnson-Laird P. Communicative theory of emotions: Empirical test, mental models and implications for social interaction. Martin L., Tessler A. Goals and Affect. 1995;Erlbaum, Hillsdale, NJ.
    • (1995) Goals and Affect
    • Oatley, K.1    Johnson-Laird, P.2
  • 45
  • 50
    • 0001034398 scopus 로고
    • On the nature and function of emotion: A component process approach
    • K.R. Scherer, & P. Ekman. Hillsdale, NJ: Erlbaum
    • Scherer K.R. On the nature and function of emotion: A component process approach. Scherer K.R., Ekman P. Approaches to Emotion. 1984;Erlbaum, Hillsdale, NJ.
    • (1984) Approaches to Emotion
    • Scherer, K.R.1
  • 51
    • 0022688124 scopus 로고
    • Vocal effect expression: A review and a model for future research
    • Scherer K.R. Vocal effect expression: A review and a model for future research. Psychol. Bull. 99:1986;143-165.
    • (1986) Psychol. Bull. , vol.99 , pp. 143-165
    • Scherer, K.R.1
  • 52
    • 0022688124 scopus 로고
    • Vocal affect expression: A review and a model for future research
    • Scherer K.R. Vocal affect expression: A review and a model for future research. Psychol. Bull. 99:1986;143-165.
    • (1986) Psychol. Bull. , vol.99 , pp. 143-165
    • Scherer, K.R.1
  • 53
    • 23044523714 scopus 로고    scopus 로고
    • Emotion inferences from vocal expression correlate across languages and cultures
    • Scherer K.R., Banse R., Wallbott H.G. Emotion inferences from vocal expression correlate across languages and cultures. J. Cross-Cultural Psychol. 32(1):2001;76-92.
    • (2001) J. Cross-cultural Psychol. , vol.32 , Issue.1 , pp. 76-92
    • Scherer, K.R.1    Banse, R.2    Wallbott, H.G.3
  • 56
    • 0017528188 scopus 로고
    • Emotional changes in human voice
    • Sulc J. Emotional changes in human voice. Activitas Nervosa Superior. 19:1977;215-216.
    • (1977) Activitas Nervosa Superior , vol.19 , pp. 215-216
    • Sulc, J.1
  • 58
    • 0001576972 scopus 로고
    • Relationship between Emotional State and Fundamental Frequency of Speech
    • Japan Air Self-defense Force
    • Utsuki, N., Okamura, N., 1976. Relationship Between Emotional State and Fundamental Frequency of Speech. Rep. Aeromedical Laboratory. Japan Air Self-Defense Force 16, 179-188.
    • (1976) Rep. Aeromedical Laboratory , vol.16 , pp. 179-188
    • Utsuki, N.1    Okamura, N.2
  • 60
    • 0000859392 scopus 로고
    • On determining the emotional state of pilots during flight: An exploratory study
    • Williams C.E., Stevens K.N. On determining the emotional state of pilots during flight: An exploratory study. Aerospace Med. 40:1969;1369-1372.
    • (1969) Aerospace Med. , vol.40 , pp. 1369-1372
    • Williams, C.E.1    Stevens, K.N.2
  • 61
    • 0003938587 scopus 로고
    • Vocal correlates of emotional states
    • J.K. Darby. Grune and Stratton, Inc.
    • Williams C.E., Stevens K.N. Vocal correlates of emotional states. Darby J.K. Speech Evaluation in Psychiatry. 1981;189-220 Grune and Stratton, Inc.
    • (1981) Speech Evaluation in Psychiatry , pp. 189-220
    • Williams, C.E.1    Stevens, K.N.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.