메뉴 건너뛰기




Volumn 50, Issue 6, 2008, Pages 531-543

Automatic extraction of paralinguistic information using prosodic features related to F0, duration and voice quality

Author keywords

Automatic detection; Emotion; Paralinguistic information; Prosody; Speech act; Voice quality

Indexed keywords

FEATURE EXTRACTION; INFORMATION ANALYSIS; LINGUISTICS; PROBLEM SOLVING; SPEECH INTELLIGIBILITY;

EID: 44149121656     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2008.03.009     Document Type: Article
Times cited : (48)

References (31)
  • 1
    • 9444260412 scopus 로고    scopus 로고
    • What do people hear? A study of the perception of non-verbal affective information in conversational speech
    • Campbell N., and Erickson D. What do people hear? A study of the perception of non-verbal affective information in conversational speech. J. Phonet. Soc. Jpn. 8 1 (2004) 9-28
    • (2004) J. Phonet. Soc. Jpn. , vol.8 , Issue.1 , pp. 9-28
    • Campbell, N.1    Erickson, D.2
  • 2
    • 44149104969 scopus 로고    scopus 로고
    • Campbell, N., Mokhtari, P., 2003. Voice quality, the 4th prosodic dimension. In: Proceedings of 15th International Congress of Phonetic Sciences (ICPhS2003), Barcelona, pp. 2417-2420.
    • Campbell, N., Mokhtari, P., 2003. Voice quality, the 4th prosodic dimension. In: Proceedings of 15th International Congress of Phonetic Sciences (ICPhS2003), Barcelona, pp. 2417-2420.
  • 3
    • 0031012371 scopus 로고
    • Acoustic characteristics of the piriform fossa in models and humans
    • Dang J., and Honda K. Acoustic characteristics of the piriform fossa in models and humans. J. Acoust. Soc. Amer. 101 1 (1966) 456-465
    • (1966) J. Acoust. Soc. Amer. , vol.101 , Issue.1 , pp. 456-465
    • Dang, J.1    Honda, K.2
  • 4
    • 23144458652 scopus 로고    scopus 로고
    • Expressive speech: Production, perception and application to speech synthesis
    • Erickson D. Expressive speech: Production, perception and application to speech synthesis. Acoust. Sci. Tech. 26 4 (2005) 317-325
    • (2005) Acoust. Sci. Tech. , vol.26 , Issue.4 , pp. 317-325
    • Erickson, D.1
  • 5
    • 33745214017 scopus 로고    scopus 로고
    • Fernandez, R., Picard, R.W., 2005. Classical and novel discriminant features for affect recognition from speech. In: Proceedings of Interspeech 2005, Lisbon, Portugal, pp. 473-476.
    • Fernandez, R., Picard, R.W., 2005. Classical and novel discriminant features for affect recognition from speech. In: Proceedings of Interspeech 2005, Lisbon, Portugal, pp. 473-476.
  • 6
    • 33745180348 scopus 로고    scopus 로고
    • Fujie, S., Ejiri, Y., Matsusaka, Y., Kikuchi, H., Kobayashi, T., 2003. Recognition of paralinguistic information and its application to spoken dialogue system. In: Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU'03), St. Thomas, US, pp. 231-236.
    • Fujie, S., Ejiri, Y., Matsusaka, Y., Kikuchi, H., Kobayashi, T., 2003. Recognition of paralinguistic information and its application to spoken dialogue system. In: Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU'03), St. Thomas, US, pp. 231-236.
  • 7
    • 44149128028 scopus 로고    scopus 로고
    • Fujimoto, M., Maekawa, K. 2003. Variation of phonation types due to paralinguistic information: An analysis of high-speed video images. In: Proceedings of 15th International Congress of Phonetic Sciences (ICPhS2003), Barcelona, pp. 2401-2404.
    • Fujimoto, M., Maekawa, K. 2003. Variation of phonation types due to paralinguistic information: An analysis of high-speed video images. In: Proceedings of 15th International Congress of Phonetic Sciences (ICPhS2003), Barcelona, pp. 2401-2404.
  • 8
    • 0037380186 scopus 로고    scopus 로고
    • The role of voice quality in communicating emotion, mood and attitude
    • Gobl C., and Ní Chasaide A. The role of voice quality in communicating emotion, mood and attitude. Speech Comm. 40 (2003) 189-212
    • (2003) Speech Comm. , vol.40 , pp. 189-212
    • Gobl, C.1    Ní Chasaide, A.2
  • 9
    • 0035668083 scopus 로고    scopus 로고
    • Phonation types: A cross-linguistic overview
    • Gordon M., and Ladefoged P. Phonation types: A cross-linguistic overview. J. Phonet. 29 (2001) 383-406
    • (2001) J. Phonet. , vol.29 , pp. 383-406
    • Gordon, M.1    Ladefoged, P.2
  • 10
    • 0031023993 scopus 로고    scopus 로고
    • Glottal characteristics of female speakers: Acoustic correlates
    • Hanson H. Glottal characteristics of female speakers: Acoustic correlates. J. Acoust. Soc. Amer. 101 (1997) 466-481
    • (1997) J. Acoust. Soc. Amer. , vol.101 , pp. 466-481
    • Hanson, H.1
  • 11
    • 44149128266 scopus 로고    scopus 로고
    • Hayashi, Y., 1999. Recognition of vocal expression of emotions in Japanese: using the interjection "eh". In: Proceedings of ICPhS 99, San Francisco, USA, pp. 2355-2359.
    • Hayashi, Y., 1999. Recognition of vocal expression of emotions in Japanese: using the interjection "eh". In: Proceedings of ICPhS 99, San Francisco, USA, pp. 2355-2359.
  • 12
    • 0003391579 scopus 로고
    • Springer-Verlag, Berlin, Heidelberg, New York
    • Hess W. Pitch Determination of Speech Signals. Springer Series of Information Sciences vol. 3 (1983), Springer-Verlag, Berlin, Heidelberg, New York
    • (1983) Springer Series of Information Sciences , vol.3
    • Hess, W.1
  • 13
    • 44149125123 scopus 로고    scopus 로고
    • Imagawa, H., Sakakibara, K., Tayama, N., Niimi, S., 2003. The effect of the hypopharyngeal and supra-glottic shapes for the singing voice. In: Proceedings of the Stockholm Music Acoustics Conference (SMAC 2003), II, pp. 471-474.
    • Imagawa, H., Sakakibara, K., Tayama, N., Niimi, S., 2003. The effect of the hypopharyngeal and supra-glottic shapes for the singing voice. In: Proceedings of the Stockholm Music Acoustics Conference (SMAC 2003), II, pp. 471-474.
  • 14
    • 85009108067 scopus 로고    scopus 로고
    • Ishi, C.T., 2004. A new acoustic measure for aspiration noise detection. In: Proceedings of Interspeech 2004-ICSLP, Jeju, Korea, pp. 941-944.
    • Ishi, C.T., 2004. A new acoustic measure for aspiration noise detection. In: Proceedings of Interspeech 2004-ICSLP, Jeju, Korea, pp. 941-944.
  • 15
    • 24144436951 scopus 로고    scopus 로고
    • Perceptually-related F0 parameters for automatic classification of phrase final tones
    • Ishi C.T. Perceptually-related F0 parameters for automatic classification of phrase final tones. IEICE Trans. Inf. Syst. 88 3 (2005) 481-488
    • (2005) IEICE Trans. Inf. Syst. , vol.88 , Issue.3 , pp. 481-488
    • Ishi, C.T.1
  • 16
    • 33745208789 scopus 로고    scopus 로고
    • Ishi, C.T., Ishiguro, H., Hagita, N., 2005. Proposal of acoustic measures for automatic detection of vocal fry. In: Proceedings of Interspeech 2005, Lisbon, Portugal, pp. 481-484.
    • Ishi, C.T., Ishiguro, H., Hagita, N., 2005. Proposal of acoustic measures for automatic detection of vocal fry. In: Proceedings of Interspeech 2005, Lisbon, Portugal, pp. 481-484.
  • 17
    • 44149119143 scopus 로고    scopus 로고
    • Ito, M., 2004. Politeness and voice quality - The alternative method to measure aspiration noise. In: Proceedings of Speech Prosody 2004, Nara, Japan, pp. 213-216.
    • Ito, M., 2004. Politeness and voice quality - The alternative method to measure aspiration noise. In: Proceedings of Speech Prosody 2004, Nara, Japan, pp. 213-216.
  • 18
    • 44149118691 scopus 로고    scopus 로고
    • JST/CREST ESP Project homepage. .
    • JST/CREST ESP Project homepage. .
  • 19
    • 85009151996 scopus 로고    scopus 로고
    • Kasuya, H., Yoshizawa, M., Maekawa, K., 2000. Roles of voice source dynamics as a conveyer of paralinguistic features. In: Proceedings of International Conference on Spoken Language Processing (ICSLP2000), Beijing, pp. 345-348.
    • Kasuya, H., Yoshizawa, M., Maekawa, K., 2000. Roles of voice source dynamics as a conveyer of paralinguistic features. In: Proceedings of International Conference on Spoken Language Processing (ICSLP2000), Beijing, pp. 345-348.
  • 20
    • 12844282873 scopus 로고    scopus 로고
    • Individual variation of the hypopharyngeal cavities and its acoustic effects
    • Kitamura T., Honda K., and Takemoto H. Individual variation of the hypopharyngeal cavities and its acoustic effects. Acoust. Sci. Tech. 26 1 (2005) 16-26
    • (2005) Acoust. Sci. Tech. , vol.26 , Issue.1 , pp. 16-26
    • Kitamura, T.1    Honda, K.2    Takemoto, H.3
  • 21
    • 0012872760 scopus 로고    scopus 로고
    • Voice and Emotional States
    • Kent R.D., and Ball M.J. (Eds), Springer, Berlin
    • Klasmeyer G., and Sendlmeier W.F. Voice and Emotional States. In: Kent R.D., and Ball M.J. (Eds). Voice Quality Measurement (2000), Springer, Berlin 339-358
    • (2000) Voice Quality Measurement , pp. 339-358
    • Klasmeyer, G.1    Sendlmeier, W.F.2
  • 22
    • 0001825277 scopus 로고    scopus 로고
    • Measuring vocal quality
    • Kent R.D., and Ball M.J. (Eds), Singular Thomson Learning, San Diego
    • Kreiman J., and Gerratt B. Measuring vocal quality. In: Kent R.D., and Ball M.J. (Eds). Voice Quality Measurement (2000), Singular Thomson Learning, San Diego 73-102
    • (2000) Voice Quality Measurement , pp. 73-102
    • Kreiman, J.1    Gerratt, B.2
  • 23
    • 4344692648 scopus 로고
    • Phonatory settings
    • Cambridge University Press, Cambridge
    • Laver J. Phonatory settings. The Phonetic Description of Voice Quality (1980), Cambridge University Press, Cambridge 93-135
    • (1980) The Phonetic Description of Voice Quality , pp. 93-135
    • Laver, J.1
  • 24
    • 44149124688 scopus 로고    scopus 로고
    • Maekawa, K., 2004. Production and perception of 'Paralinguistic' information. In: Proceedings of Speech Prosody 2004, Nara, Japan, pp. 367-374.
    • Maekawa, K., 2004. Production and perception of 'Paralinguistic' information. In: Proceedings of Speech Prosody 2004, Nara, Japan, pp. 367-374.
  • 25
    • 38749103707 scopus 로고    scopus 로고
    • Neiberg, D., Elenius, K., Laskowski, K., 2006. Emotion recognition in spontaneous speech using GMMs. In: Proceedings of Interspeech 2006, Pittsburgh, USA, pp. 809-812.
    • Neiberg, D., Elenius, K., Laskowski, K., 2006. Emotion recognition in spontaneous speech using GMMs. In: Proceedings of Interspeech 2006, Pittsburgh, USA, pp. 809-812.
  • 26
    • 0242721417 scopus 로고    scopus 로고
    • Speech emotion recognition using hidden Markov models
    • Nwe T.L., Foo S.W., and De Silva L.C. Speech emotion recognition using hidden Markov models. Speech Comm. 41 (2003) 603-623
    • (2003) Speech Comm. , vol.41 , pp. 603-623
    • Nwe, T.L.1    Foo, S.W.2    De Silva, L.C.3
  • 27
    • 23144466342 scopus 로고    scopus 로고
    • A natural history of Japanese pressed voice
    • Sadanobu T. A natural history of Japanese pressed voice. J. Phonetic Soc. Jpn. 8 1 (2004) 29-44
    • (2004) J. Phonetic Soc. Jpn. , vol.8 , Issue.1 , pp. 29-44
    • Sadanobu, T.1
  • 29
    • 33745198227 scopus 로고    scopus 로고
    • Schuller, B., Muller, R., Lang, M., Rigoll, G., 2005. Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles. In: Proceedings of Interspeech 2005, Lisbon, Portugal, pp. 805-808.
    • Schuller, B., Muller, R., Lang, M., Rigoll, G., 2005. Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles. In: Proceedings of Interspeech 2005, Lisbon, Portugal, pp. 805-808.
  • 30
    • 44149094853 scopus 로고    scopus 로고
    • Turbulence noise at the glottis during breathy and modal voicing
    • The MIT Press, Cambridge
    • Stevens K. Turbulence noise at the glottis during breathy and modal voicing. Acoustic Phonetics (2000), The MIT Press, Cambridge 445-450
    • (2000) Acoustic Phonetics , pp. 445-450
    • Stevens, K.1
  • 31
    • 44149106447 scopus 로고    scopus 로고
    • Voice quality sample homepage. .
    • Voice quality sample homepage. .


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.