



Volume 50, Issue 10, 2008, Pages 810-828

A three-layered model for expressive speech perception

Author keywords

Acoustic analysis; Expressive speech; Fuzzy inference system; Multi layer model; Perception; Rule based

Indexed keywords

ACOUSTICS; CORRELATION METHODS; EXPERIMENTS; FOOD PROCESSING; FUZZY INFERENCE; INFORMATION THEORY; MODAL ANALYSIS; POWER SPECTRUM; REGRESSION ANALYSIS; SEMANTICS

EID: 52949128737     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2008.05.017     Document Type: Article
Times cited: 41

References (52)
  • 1
    • 21844456055
    • The role of intonation in emotional expressions
    • Banziger T., and Scherer K.R. The role of intonation in emotional expressions. Speech Commun 46 (2005) 252-267
    • (2005) Speech Commun , vol.46 , pp. 252-267
    • Banziger, T.1    Scherer, K.R.2
  • 3
    • 52949119986
    • Cahn, J.E., 1990. Generating expression in synthesized speech, Master's Thesis, MIT, Media Laboratory.
  • 4
    • 84974743850
    • Fuzzy model identification based on cluster estimation
    • Chiu S. Fuzzy model identification based on cluster estimation. J. Intell. Fuzzy Syst. 2 3 (1994)
    • (1994) J. Intell. Fuzzy Syst. , vol.2 , Issue.3
    • Chiu, S.1
  • 5
    • 0037382510
    • Describing the emotional states that are expressed in speech
    • Cowie R., and Cornelius R.R. Describing the emotional states that are expressed in speech. Speech Commun. 40 (2003) 5-32
    • (2003) Speech Commun. , vol.40 , pp. 5-32
    • Cowie, R.1    Cornelius, R.R.2
  • 6
    • 0030352957
    • Cowie, R., Douglas-Cowie, E., 1996. Automatic statistical analysis of the signal and prosodic signs of emotion in speech. In: Proc. ICSLP96, Philadelphia.
  • 8
    • 52949111579
    • Darke, G., 2005. Assessment of timbre using verbal attributes. In: Proc. CIM05, Montreal.
  • 9
    • 52949135626
    • Devillers, L., Lamel, L., Vasilescu, I., 2003. Emotion detection in task-oriented spoken dialogues. In: Proc. International Conference on Multimedia and Expo, 2003.
  • 10
    • 21544459345
    • Challenges in real-life emotion annotation and machine learning based detection
    • Devillers L., Vidrascu L., and Lamel L. Challenges in real-life emotion annotation and machine learning based detection. Neural Networks 18 4 (2005) 407-422
    • (2005) Neural Networks , vol.18 , Issue.4 , pp. 407-422
    • Devillers, L.1    Vidrascu, L.2    Lamel, L.3
  • 12
    • 23144458652
    • Expressive speech: production, perception and application to speech synthesis
    • Erickson D. Expressive speech: production, perception and application to speech synthesis. Acoust. Sci. Technol. 26 (2005) 317-325
    • (2005) Acoust. Sci. Technol. , vol.26 , pp. 317-325
    • Erickson, D.1
  • 14
    • 52949125681
    • Friberg, A., 2004. A fuzzy analyzer of emotional expression in music performance and body motion. In: Proc. Music and Music Science, Stockholm, 2004.
  • 15
    • 34250878407
    • Overview of the KTH rule system for music performance
    • Friberg A., Bresin R., and Sundberg J. Overview of the KTH rule system for music performance. Adv. Cognitive Psych. 2 2-3 (2006) 145-161
    • (2006) Adv. Cognitive Psych. , vol.2 , Issue.2-3 , pp. 145-161
    • Friberg, A.1    Bresin, R.2    Sundberg, J.3
  • 16
    • 52949127054
    • Fujisaki, H., Manifestation of linguistic, para-linguistic, non-linguistic information in the prosodic characteristics of speech. In: IEICE 1994-0.
  • 17
    • 0002698187
    • Voice source variation
    • Hardcastle W.J., and Laver J. (Eds), Blackwell, Oxford
    • Gobl C., and Ni Chasaide A. Voice source variation. In: Hardcastle W.J., and Laver J. (Eds). The Handbook of Phonetic Sciences (1997), Blackwell, Oxford 427-461
    • (1997) The Handbook of Phonetic Sciences , pp. 427-461
    • Gobl, C.1    Ni Chasaide, A.2
  • 18
  • 19
    • 33646816432
    • Toward a Rule-Based Synthesis of Emotional Speech on Linguistic Descriptions of Perception
    • Huang C.F., and Akagi M. Toward a Rule-Based Synthesis of Emotional Speech on Linguistic Descriptions of Perception. Lect. Notes Comput. Sci. 3784/2005 (2005) 366-373
    • (2005) Lect. Notes Comput. Sci. , vol.3784-2005 , pp. 366-373
    • Huang, C.F.1    Akagi, M.2
  • 21
    • 0037734416
    • Communication of emotion in music performance: a review and a theoretical framework
    • Juslin P.N., and Sloboda J.A. (Eds), Oxford University Press, New York
    • Juslin P.N. Communication of emotion in music performance: a review and a theoretical framework. In: Juslin P.N., and Sloboda J.A. (Eds). Music and Emotion: Theory and Research (2001), Oxford University Press, New York 309-337
    • (2001) Music and Emotion: Theory and Research , pp. 309-337
    • Juslin, P.N.1
  • 22
    • 0032673049
    • Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds
    • Kawahara H., Masuda-Katsuse I., and de Cheveigne A. Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds. Speech Commun. 27 (1999) 187-207
    • (1999) Speech Commun. , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigne, A.3
  • 23
    • 52949113901
    • Keating, P., Esposito, C., 2006. Linguistic Voice Quality. Invited Keynote Paper Presented at 11th Australasian International Conference on Speech Science and Technology in Auckland, December. 2006.
  • 26
    • 52949101788
    • Kienast, M., Sendlmeier, W.F., 2000. Acoustical analysis of spectral and temporal changes in expressive speech. In: ISCA Workshop on Speech and Emotion, Belfast.
  • 27
    • 52949100668
    • Maekawa, K., 2004. Production and perception of 'paralinguistic' information. In: Proceedings of Speech Prosody, Nara, pp. 367-374.
  • 28
    • 23144441231
    • How does speech transmit paralinguistic information?
    • Maekawa K., and Kitagawa N. How does speech transmit paralinguistic information?. Cognitive Studies 9 (2002) 46-66
    • (2002) Cognitive Studies , vol.9 , pp. 46-66
    • Maekawa, K.1    Kitagawa, N.2
  • 31
    • 85089837502
    • Menezes, C., Maekawa, K., 2006. Paralinguistic effects on voice quality: a study in Japanese. In: Proc. Speech Prosody 2006, Dresden.
  • 32
    • 0027447292
    • Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion
    • Murray I.R., and Arnott J.L. Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion. J. Acoust. Soc. Am. 93 2 (1993) 1097-1108
    • (1993) J. Acoust. Soc. Am. , vol.93 , Issue.2 , pp. 1097-1108
    • Murray, I.R.1    Arnott, J.L.2
  • 33
    • 0029325035
    • Implementation and testing of a system for producing emotion-by-rule in synthetic speech
    • Murray I.R., and Arnott J.L. Implementation and testing of a system for producing emotion-by-rule in synthetic speech. Speech Commun. 16 (1995) 369-390
    • (1995) Speech Commun. , vol.16 , pp. 369-390
    • Murray, I.R.1    Arnott, J.L.2
  • 34
    • 0038719980
    • Modified restricted temporal decomposition and its application to low rate speech coding
    • Nguyen P.C., Ochi T., and Akagi M. Modified restricted temporal decomposition and its application to low rate speech coding. IEICE Trans. Inf. Syst. E86-D 3 (2003) 397-405
    • (2003) IEICE Trans. Inf. Syst. , vol.E86-D , Issue.3 , pp. 397-405
    • Nguyen, P.C.1    Ochi, T.2    Akagi, M.3
  • 36
    • 0037384712
    • Vocal communication of emotion: a review of research paradigms
    • Scherer K.R. Vocal communication of emotion: a review of research paradigms. Speech Commun. 40 (2003) 227-256
    • (2003) Speech Commun. , vol.40 , pp. 227-256
    • Scherer, K.R.1
  • 39
    • 85009089741
    • Schroder, M., Cowie, R., Douglas-Cowie, E., Westerdijk, M., Gielen, S., 2001. Acoustic correlates of emotion dimensions in view of speech synthesis. In: Proc. Eurospeech 2001, Denmark, pp. 87-90.
  • 40
    • 85009091534
    • Sobol Shikler T., Robinson, P., 2004. Visualizing dynamic features of expressions in speech. In: Proc. ICSLP2004, Korea.
  • 42
    • 0022759481
    • Effect of experimentally induced stress on vocal parameters
    • Tolkmitt F.J., and Scherer K.R. Effect of experimentally induced stress on vocal parameters. J. Exp. Psychol. [Hum. Percept.] 12 3 (1986) 302-313
    • (1986) J. Exp. Psychol. [Hum. Percept.] , vol.12 , Issue.3 , pp. 302-313
    • Tolkmitt, F.J.1    Scherer, K.R.2
  • 43
    • 52949131033
    • Traube, C., Depalle, P., Wanderley, M., 2003. Indirect acquisition of instrumental gesture based on signal, physical and perceptual information. In: Proc. 2003 Conf. New interfaces for Musical Expression.
  • 44
    • 52949149777
    • Should we assume a hierarchical structure for adjectives describing timbre?
    • (in Japanese)
    • Ueda K. Should we assume a hierarchical structure for adjectives describing timbre?. Acoust. Sci. Technol. 44 2 (1988) 102-107 (in Japanese)
    • (1988) Acoust. Sci. Technol. , vol.44 , Issue.2 , pp. 102-107
    • Ueda, K.1
  • 45
    • 79956009451
    • A hierarchical structure for adjectives describing timbre
    • Ueda K. A hierarchical structure for adjectives describing timbre. J. Acoust. Soc. Am. (1996)
    • (1996) J. Acoust. Soc. Am.
    • Ueda, K.1
  • 46
    • 0025269049
    • Sharpness and amplitude envelopes of broadband noise
    • Ueda K., and Akagi M. Sharpness and amplitude envelopes of broadband noise. J. Acoust. Soc. Am. 87 2 (1990) 814-819
    • (1990) J. Acoust. Soc. Am. , vol.87 , Issue.2 , pp. 814-819
    • Ueda, K.1    Akagi, M.2
  • 47
    • 52949139136
    • Van Bezooijen, R., 1984. The Characteristics and Recognizability of Vocal Expression of Emotions, Foris, Dordrecht, The Netherlands.
  • 49
    • 52949103762
    • Vickhoff, B., Malmgren, H., 2004. Why Does Music Move Us? Philosophical Communication, Web Series, No. 34.
  • 50
    • 0000859392
    • On determining the emotional state of pilots during flight: An exploratory study
    • Williams C.E., and Stevens K.N. On determining the emotional state of pilots during flight: An exploratory study. Aerospace Med. 40 12 (1969) 1369-1372
    • (1969) Aerospace Med. , vol.40 , Issue.12 , pp. 1369-1372
    • Williams, C.E.1    Stevens, K.N.2
  • 51
    • 0015409613
    • Emotions and speech: some acoustical correlates
    • Williams C.E., and Stevens K.N. Emotions and speech: some acoustical correlates. J. Acoust. Soc. Am. 52 (1972) 1238-1250
    • (1972) J. Acoust. Soc. Am. , vol.52 , pp. 1238-1250
    • Williams, C.E.1    Stevens, K.N.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.