Volume 41, Issue 3-4, 1998, Pages 443-492

Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?

Author keywords

Automatic dialog act classification; Discourse modeling; Prosody; Recognition; Speech; Spontaneous speech; Understanding

Indexed keywords

ARTICLE; DECISION TREE; HUMAN; LINGUISTICS; PHONETICS; SOUND DETECTION; SPEECH PERCEPTION; TELEPHONE; VERBAL BEHAVIOR;

EID: 0342310704     PISSN: 00238309     EISSN: None     Source Type: Journal    
DOI: 10.1177/002383099804100410     Document Type: Article
Times cited: 192

References (70)
  • 1
    • 0040639199
    • Discourse functions of pitch range in spontaneous and read speech
    • Ohio State University
    • AYERS, G. M. (1994). Discourse functions of pitch range in spontaneous and read speech. In Working Papers in Linguistics No. 44 (pp. 1-49). Ohio State University.
    • (1994) Working Papers in Linguistics , vol.44 , pp. 1-49
    • Ayers, G.M.1
  • 2
    • 0009553224
    • An oral dialog model based on speech acts categorization
    • P. Dalsgaard, E. Larsen, E. Boves, & I. Thomsen (Eds.), Vigsø, Denmark
    • BENNACEF, S. K., NÉEL, F., & MAYNARD, H. B. (1995). An oral dialog model based on speech acts categorization. In P. Dalsgaard, E. Larsen, E. Boves, & I. Thomsen (Eds.), ESCA Workshop on Spoken Dialog Systems - Theories and applications (pp. 237-240). Vigsø, Denmark.
    • (1995) ESCA Workshop on Spoken Dialog Systems - Theories and Applications , pp. 237-240
    • Bennacef, S.K.1    Néel, F.2    Maynard, H.B.3
  • 5
    • 0000583248
    • Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition
    • F. Soulie & J. Herault (Eds.), Berlin: Springer
    • BRIDLE, J. (1990). Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition. In F. Soulie & J. Herault (Eds.), Neurocomputing: Algorithms, architectures and applications (pp. 227-236). Berlin: Springer.
    • (1990) Neurocomputing: Algorithms, Architectures and Applications , pp. 227-236
    • Bridle, J.1
  • 7
    • 77949291063
    • Assessing agreement on classification tasks: The Kappa statistic
    • CARLETTA, J. (1996). Assessing agreement on classification tasks: The Kappa statistic. Computational Linguistics, 22(2), 249-254.
    • (1996) Computational Linguistics , vol.22 , Issue.2 , pp. 249-254
    • Carletta, J.1
  • 9
    • 0039669806
    • A statistical model for discourse act recognition in dialog interactions
    • J. Chu-Carroll & N. Green (Eds.), Technical Report SS-98-01 Menlo Park, CA: AAAI Press
    • CHU-CARROLL, J. (1998). A statistical model for discourse act recognition in dialog interactions. In J. Chu-Carroll & N. Green (Eds.), Applying machine learning to discourse processing. Papers from the 1998 AAAI Spring Symposium. Technical Report SS-98-01 (pp. 12-17). Menlo Park, CA: AAAI Press.
    • (1998) Applying Machine Learning to Discourse Processing. Papers from the 1998 AAAI Spring Symposium , pp. 12-17
    • Chu-Carroll, J.1
  • 10
    • 1842653460
    • Cambridge, U.K.: Cambridge University Press
    • CLARK, H. (1996). Using language. Cambridge, U.K.: Cambridge University Press.
    • (1996) Using Language
    • Clark, H.1
  • 11
  • 13
    • 0346436960
    • Statistical and linguistic analyses of F0 in read and spontaneous speech
    • J. J. Ohala, T. M. Nearey, B. L. Derwing, M. M. Hodge, & G. E. Wiebe (Eds.), Banff, Canada
    • DALY, N. A., & ZUE, V. W. (1992). Statistical and linguistic analyses of F0 in read and spontaneous speech. In J. J. Ohala, T. M. Nearey, B. L. Derwing, M. M. Hodge, & G. E. Wiebe (Eds.), Proceedings of the International Conference on Spoken Language Processing (Vol. 1, pp. 763-766). Banff, Canada.
    • (1992) Proceedings of the International Conference on Spoken Language Processing , vol.1 , pp. 763-766
    • Daly, N.A.1    Zue, V.W.2
  • 16
    • 0038556626
    • Some intonational characteristics of discourse structure
    • J. J. Ohala, T. M. Nearey, B. L. Derwing, M. M. Hodge, & G. E. Wiebe (Eds.), Banff, Canada
    • GROSZ, B., & HIRSCHBERG, J. (1992). Some intonational characteristics of discourse structure. In J. J. Ohala, T. M. Nearey, B. L. Derwing, M. M. Hodge, & G. E. Wiebe (Eds.), Proceedings of the International Conference on Spoken Language Processing (Vol. 1, pp. 429-432). Banff, Canada.
    • (1992) Proceedings of the International Conference on Spoken Language Processing , vol.1 , pp. 429-432
    • Grosz, B.1    Hirschberg, J.2
  • 17
    • 0041059566
    • Intonational characteristics of declarativity and interrogativity in Dutch: A comparison
    • A. Botinis, G. Kouroupetroglou, & G. Carayiannis (Eds.), Athens, Greece
    • HAAN, J., HEUVEN, V. J. van, PACILLY, J. J. A., & BEZOOIJEN, R. van (1997a). Intonational characteristics of declarativity and interrogativity in Dutch: A comparison. In A. Botinis, G. Kouroupetroglou, & G. Carayiannis (Eds.), ESCA, Workshop on Intonation: Theory, Models and Applications (pp. 173-176). Athens, Greece.
    • (1997) ESCA, Workshop on Intonation: Theory, Models and Applications , pp. 173-176
    • Haan, J.1    Van Heuven, V.J.2    Pacilly, J.J.A.3    Van Bezooijen, R.4
  • 21
    • 0000262562
    • Hierarchical mixtures of experts and the EM algorithm
    • JORDAN, M. I., & JACOBS, R. A. (1994). Hierarchical mixtures of experts and the EM algorithm. Neural Computation, 6(2), 181-214.
    • (1994) Neural Computation , vol.6 , Issue.2 , pp. 181-214
    • Jordan, M.I.1    Jacobs, R.A.2
  • 23
    • 77950874070
    • Tech. Rep. No. 97-02. University of Colorado Institute of Cognitive Science
    • JURAFSKY, D., SHRIBERG, E., & BIASCA, D. (1997b). Switchboard-DAMSL Labeling Project Coder's Manual (Tech. Rep. No. 97-02). University of Colorado Institute of Cognitive Science. (http://stripe.colorado.edu/~jurafsky/manual.august1.html)
    • (1997) Switchboard-DAMSL Labeling Project Coder's Manual
    • Jurafsky, D.1    Shriberg, E.2    Biasca, D.3
  • 26
    • 0023312404
    • Estimation of probabilities from sparse data for the language model component of a speech recognizer
    • KATZ, S. M. (1987). Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Transactions on Acoustics, Speech, and Signal Processing, 35(3), 400-401.
    • (1987) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.35 , Issue.3 , pp. 400-401
    • Katz, S.M.1
  • 27
    • 0345805844
    • "Roger," "Sorry," "I'm still listening": Dialog guiding signals in information retrieval dialogs
    • D. House & P. Touati (Eds.), Lund, Sweden
    • KIESSLING, A., KOMPE, R., NIEMANN, H., NÖTH, E., & BATLINER, A. (1993). "Roger," "Sorry," "I'm still listening": Dialog guiding signals in information retrieval dialogs. In D. House & P. Touati (Eds.), ESCA, Workshop on Prosody (pp. 140-143). Lund, Sweden.
    • (1993) ESCA, Workshop on Prosody , pp. 140-143
    • Kiessling, A.1    Kompe, R.2    Niemann, H.3    Nöth, E.4    Batliner, A.5
  • 38
    • 0028699881
    • First steps toward statistical modeling of dialog to predict the speech act type of the next utterance
    • NAGATA, M., & MORIMOTO, T. (1994). First steps toward statistical modeling of dialog to predict the speech act type of the next utterance. Speech Communication, 15, 193-203.
    • (1994) Speech Communication , vol.15 , pp. 193-203
    • Nagata, M.1    Morimoto, T.2
  • 39
    • 0000897166
    • A study on prosody and discourse structure in cooperative dialogs
    • NAKAJIMA, S., & ALLEN, J. F. (1993). A study on prosody and discourse structure in cooperative dialogs. Phonetica, 50, 197-210.
    • (1993) Phonetica , vol.50 , pp. 197-210
    • Nakajima, S.1    Allen, J.F.2
  • 40
    • 0039453724
    • Prosodic features of utterances in task-oriented dialogs
    • Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), New York: Springer
    • NAKAJIMA, S., & TSUKADA, H. (1997). Prosodic features of utterances in task-oriented dialogs. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing prosody: Computational models for processing spontaneous speech (pp. 81-94). New York: Springer.
    • (1997) Computing Prosody: Computational Models for Processing Spontaneous Speech , pp. 81-94
    • Nakajima, S.1    Tsukada, H.2
  • 41
    • 0346436959
    • Microphone-independent robust signal processing using probabilistic optimum filtering
    • Plainsboro, NJ
    • NEUMEYER, L., & WEINTRAUB, M. (1994). Microphone-independent robust signal processing using probabilistic optimum filtering. In Proceedings of the ARPA Workshop on Human Language Technology (pp. 336-341). Plainsboro, NJ.
    • (1994) Proceedings of the ARPA Workshop on Human Language Technology , pp. 336-341
    • Neumeyer, L.1    Weintraub, M.2
  • 44
    • 0002267291
    • Multilevel recognition of intonation labels
    • Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), New York: Springer
    • OSTENDORF, M., & ROSS, K. (1997). Multilevel recognition of intonation labels. In Y. Sagisaka, N. Campbell, & N. Higuchi (Eds.), Computing prosody: Computational models for processing spontaneous speech (pp. 291-308). New York: Springer.
    • (1997) Computing Prosody: Computational Models for Processing Spontaneous Speech , pp. 291-308
    • Ostendorf, M.1    Ross, K.2
  • 52
    • 0030374907
    • Automatic linguistic segmentation of conversational speech
    • H. T. Bunnell & W. Idsardi (Eds.), Philadelphia
    • STOLCKE, A., & SHRIBERG, E. (1996). Automatic linguistic segmentation of conversational speech. In H. T. Bunnell & W. Idsardi (Eds.), Proceedings of the International Conference on Spoken Language Processing (Vol. 2, pp. 1005-1008). Philadelphia.
    • (1996) Proceedings of the International Conference on Spoken Language Processing , vol.2 , pp. 1005-1008
    • Stolcke, A.1    Shriberg, E.2
  • 55
    • 0031033301
    • Prosodic features at discourse boundaries of different strength
    • SWERTS, M. (1997). Prosodic features at discourse boundaries of different strength. Journal of the Acoustical Society of America, 101, 514-521.
    • (1997) Journal of the Acoustical Society of America , vol.101 , pp. 514-521
    • Swerts, M.1
  • 56
    • 0031185913
    • Prosodic and lexical indications of discourse structure in human-machine interactions
    • SWERTS, M., & OSTENDORF, M. (1997). Prosodic and lexical indications of discourse structure in human-machine interactions. Speech Communication, 22(1), 25-41.
    • (1997) Speech Communication , vol.22 , Issue.1 , pp. 25-41
    • Swerts, M.1    Ostendorf, M.2
  • 59
    • 0347128737
    • Intonation and dialog context as constraints for speech recognition
    • TAYLOR, P. A., KING, S., ISARD, S. D., & WRIGHT, H. (1998). Intonation and dialog context as constraints for speech recognition. Language and Speech, 41, 493-512.
    • (1998) Language and Speech , vol.41 , pp. 493-512
    • Taylor, P.A.1    King, S.2    Isard, S.D.3    Wright, H.4
  • 61
  • 62
    • 0003026505
    • Language-independent prosodic features
    • A. Cutler & D. R. Ladd (Eds.), Berlin: Springer
    • VAISSIÈRE, J. (1983). Language-independent prosodic features. In A. Cutler & D. R. Ladd (Eds.), Prosody: Models and measurements (pp. 53-66). Berlin: Springer.
    • (1983) Prosody: Models and Measurements , pp. 53-66
    • Vaissière, J.1
  • 65
    • 85135190196
    • Integrated dialog act segmentation and classification using prosodic features and language models
    • G. Kokkinakis, N. Fakotakis, & E. Dermatas (Eds.), Rhodes, Greece
    • WARNKE, V., KOMPE, R., NIEMANN, H., & NÖTH, E. (1997). Integrated dialog act segmentation and classification using prosodic features and language models. In G. Kokkinakis, N. Fakotakis, & E. Dermatas (Eds.), Proceedings of the Fifth European Conference on Speech Communication and Technology (Vol. 1, pp. 207-210). Rhodes, Greece.
    • (1997) Proceedings of the Fifth European Conference on Speech Communication and Technology , vol.1 , pp. 207-210
    • Warnke, V.1    Kompe, R.2    Niemann, H.3    Nöth, E.4
  • 69
    • 0347067189
    • Dialog interpretation model and its application to next utterance prediction for spoken language processing
    • Genova, Italy
    • YAMAOKA, T., & IIDA, H. (1991). Dialog interpretation model and its application to next utterance prediction for spoken language processing. Proceedings of the Second European Conference on Speech Communication and Technology (Vol. 2, pp. 849-852). Genova, Italy.
    • (1991) Proceedings of the Second European Conference on Speech Communication and Technology , vol.2 , pp. 849-852
    • Yamaoka, T.1    Iida, H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.