메뉴 건너뛰기




Volumn 17, Issue 1, 2009, Pages 2-12

Automatic detection of disfluency boundaries in spontaneous speech of children using audio-visual information

Author keywords

Disfluency detection; Feature selection; Information fusion; Spoken language processing; Spontaneous children speech

Indexed keywords

AUDIO-VISUAL INFORMATION; AUTOMATIC DETECTION; AUTOMATIC RECOGNITION; COGNITIVE STATE; COMPUTER GAME; DECISION LEVELS; DETECTION ACCURACY; DETECTION ERROR RATE; DETECTION SYSTEM; DISFLUENCIES; DISFLUENCY DETECTION; FEATURE LEVEL; FEATURE SELECTION; INFORMATION SOURCES; LANGUAGE FEATURES; MULTI-MODAL; SPOKEN LANGUAGE PROCESSING; SPONTANEOUS CHILDREN SPEECH; SPONTANEOUS SPEECH; VISUAL INFORMATION;

EID: 70350442414     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2008.2006728     Document Type: Article
Times cited : (26)

References (40)
  • 1
    • 51849142134 scopus 로고    scopus 로고
    • Evaluating the effect of predicting oral reading miscues
    • Geneva, Switzerland
    • S. Banerjee, J. E. Beck, and J. Mostow, "Evaluating the effect of predicting oral reading miscues," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 3165-3168.
    • (2003) Proc. Eurospeech , pp. 3165-3168
    • Banerjee, S.1    Beck, J.E.2    Mostow, J.3
  • 3
    • 67650622233 scopus 로고    scopus 로고
    • Automatic detection and classification of disfluent reading miscues in young childrens speech for the purpose of assessment
    • Antwerp, Belgium, Aug.
    • M. Black, J. Tepperman, S. Lee, P. Price, and S. Narayanan, "Automatic detection and classification of disfluent reading miscues in young childrens speech for the purpose of assessment," in Proc. InterSpeech ICSLP, Antwerp, Belgium, Aug. 2007, pp. 206-209.
    • (2007) Proc. InterSpeech ICSLP , pp. 206-209
    • Black, M.1    Tepperman, J.2    Lee, S.3    Price, P.4    Narayanan, S.5
  • 4
    • 0003798906 scopus 로고
    • Preliminaries to a Theory of Speech Disfluencies
    • Univ. of California, Berkley
    • E. E. Shriberg, "Preliminaries to a Theory of Speech Disfluencies," Ph.D. dissertation, Univ. of California, Berkley, 1994.
    • (1994) Ph.D. dissertation
    • Shriberg, E.E.1
  • 5
    • 0029765629 scopus 로고    scopus 로고
    • Statistical language modeling for speech disfluencies
    • Atlanta, GA
    • A. Stolcke and E. Shriberg, "Statistical language modeling for speech disfluencies," in Proc. ICASSP, Atlanta, GA, 1996, vol.1, pp. 405-408.
    • (1996) Proc. ICASSP , vol.1 , pp. 405-408
    • Stolcke, A.1    Shriberg, E.2
  • 6
    • 0010125082 scopus 로고    scopus 로고
    • A prosody-only decision-tree model for disfluency detection
    • E. Shriberg, R. Bates, and A. Stolcke, "A prosody-only decision-tree model for disfluency detection," in Proc. Eurospeech, 1997, pp. 2383-2386.
    • (1997) Proc. Eurospeech , pp. 2383-2386
    • Shriberg, E.1    Bates, R.2    Stolcke, A.3
  • 8
    • 85009223733 scopus 로고    scopus 로고
    • Automatic disfluency identification in conversational speech using multiple knowledge source
    • Geneva, Switzerland
    • Y. Liu, E. Shriberg, and A. Stolcke, "Automatic disfluency identification in conversational speech using multiple knowledge source," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 957-960.
    • (2003) Proc. Eurospeech , pp. 957-960
    • Liu, Y.1    Shriberg, E.2    Stolcke, A.3
  • 9
    • 0032969462 scopus 로고    scopus 로고
    • Acoustics of children's speech: Developmental changes of temporal and spectral parameters
    • Mar.
    • S. Lee, A. Potamianos, and S. Narayanan, "Acoustics of children's speech: Developmental changes of temporal and spectral parameters," J. Acoust. Soc. Amer., vol.105, pp. 1455-1468, Mar. 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.105 , pp. 1455-1468
    • Lee, S.1    Potamianos, A.2    Narayanan, S.3
  • 10
    • 0036475971 scopus 로고    scopus 로고
    • Creating conversational interfaces for children
    • Feb.
    • S. Narayanan and A. Potamianos, "Creating conversational interfaces for children," IEEE Trans. Speech Audio Process., vol.10, no.2, pp. 65-78, Feb. 2002.
    • (2002) IEEE Trans. Speech Audio Process , vol.10 , Issue.2 , pp. 65-78
    • Narayanan, S.1    Potamianos, A.2
  • 11
    • 0029747582 scopus 로고    scopus 로고
    • A study of speech recognition for children and elderly
    • J.Wilpon and C. Jacobsen, "A study of speech recognition for children and elderly," in Proc. ICASSP, 1996, pp. 349-352.
    • (1996) Proc. ICASSP , pp. 349-352
    • Wilpon, J.1    Jacobsen, C.2
  • 12
    • 0031644298 scopus 로고    scopus 로고
    • Improvements in children's speech recognition performance
    • S. Das, D. Nix, and M. Picheny, "Improvements in children's speech recognition performance," in Proc. ICASSP, 1998, pp. 433-436.
    • (1998) Proc. ICASSP , pp. 433-436
    • Das, S.1    Nix, D.2    Picheny, M.3
  • 13
    • 84946707630 scopus 로고    scopus 로고
    • Childrens speech recognition with application to interactive books and tutors
    • St. Thomas, Virgin Islands, Dec.
    • A. Hagen, B. Pellom, and R. Cole, "Childrens speech recognition with application to interactive books and tutors," in Proc. IEEE ASRUWorkshop, St. Thomas, Virgin Islands, Dec. 2003.
    • (2003) Proc. IEEE ASRUWorkshop
    • Hagen, A.1    Pellom, B.2    Cole, R.3
  • 14
    • 85009291880 scopus 로고    scopus 로고
    • An analysis of the causes of increased error rated in children's speech recognition
    • Denver, CO
    • Q. Li and M. J. Russell, "An analysis of the causes of increased error rated in children's speech recognition," in Proc. ICSLP, Denver, CO, 2002, pp. 2337-2340.
    • (2002) Proc. ICSLP , pp. 2337-2340
    • Li, Q.1    Russell, M.J.2
  • 15
    • 0038418668 scopus 로고    scopus 로고
    • Designing and evaluating conversational interfaces with animated characters
    • S. L. Oviatt and B. Adams, , J. Cassell, J. Sullivan, S. Prevost, and E. Churchill, Eds., Cambridge, MA: MIT Press
    • S. L. Oviatt and B. Adams, , J. Cassell, J. Sullivan, S. Prevost, and E. Churchill, Eds., "Designing and evaluating conversational interfaces with animated characters," in Embodied Conversational Agents.. Cambridge, MA: MIT Press, 2000, pp. 319-343.
    • (2000) Embodied Conversational Agents , pp. 319-343
  • 16
    • 4544316886 scopus 로고    scopus 로고
    • A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues
    • May
    • D. Wang and S. Narayanan, "A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues," in Proc. ICASSP, May 2004, vol.1, pp. 525-528.
    • (2004) Proc. ICASSP , vol.1 , pp. 525-528
    • Wang, D.1    Narayanan, S.2
  • 19
    • 42949107237 scopus 로고    scopus 로고
    • Interrelation between speech and facial gestures in emotional utterances: A single subject study
    • Nov.
    • C. Busso and S. Narayanan, "Interrelation between speech and facial gestures in emotional utterances: A single subject study," IEEE Trans. Speech, Audio, Lang. Process., vol.15, no.8, pp. 2331-2347, Nov. 2007.
    • (2007) IEEE Trans. Speech, Audio, Lang. Process , vol.15 , Issue.8 , pp. 2331-2347
    • Busso, C.1    Narayanan, S.2
  • 23
    • 70350481599 scopus 로고    scopus 로고
    • Gesture patterns during speech repairs
    • Denver, CO
    • L. Chen, M. Harper, and F. Quek, "Gesture patterns during speech repairs," in Proc. ICSLP, Denver, CO, 2002, pp. 629-632.
    • (2002) Proc. ICSLP , pp. 629-632
    • Chen, L.1    Harper, M.2    Quek, F.3
  • 24
    • 16244416858 scopus 로고    scopus 로고
    • Prosody based audiovisual coanalysis for coverbal gesture recognition
    • Apr.
    • S. Kettebekov, M. Yeasin, and R. Sharma, "Prosody based audiovisual coanalysis for coverbal gesture recognition," IEEE Trans. Multimedia, vol.7, no.2, pp. 234-242, Apr. 2005.
    • (2005) IEEE Trans. Multimedia , vol.7 , Issue.2 , pp. 234-242
    • Kettebekov, S.1    Yeasin, M.2    Sharma, R.3
  • 25
    • 14944345809 scopus 로고    scopus 로고
    • Multimodal model integration for sentence unit detection
    • State College, PA
    • L. Chen, Y. Liu, M. Harper, and E. Shriberg, "Multimodal model integration for sentence unit detection," in Proc. ICMI, State College, PA, 2004, pp. 121-128.
    • (2004) Proc. ICMI , pp. 121-128
    • Chen, L.1    Liu, Y.2    Harper, M.3    Shriberg, E.4
  • 26
    • 21244500957 scopus 로고    scopus 로고
    • Logistic model trees
    • N. Landwehr, M. Hall, and E. Frank, "Logistic model trees," Mach. Learn. J., vol.59, no.1-2, pp. 161-205, 2005.
    • (2005) Mach. Learn. J. , vol.59 , Issue.1-2 , pp. 161-205
    • Landwehr, N.1    Hall, M.2    Frank, E.3
  • 27
    • 85009243632 scopus 로고    scopus 로고
    • Cu animate tools for enabling conversations with animated characters
    • J. Ma, J.Yan, and R. Cole, "Cu animate tools for enabling conversations with animated characters," in Proc. ICSLP, 2002, vol.1, pp. 197-200.
    • (2002) Proc. ICSLP , vol.1 , pp. 197-200
    • Ma, J.1    Yan, J.2    Cole, R.3
  • 28
    • 0038120523 scopus 로고    scopus 로고
    • Jul. 20 2005 2005, retrieved from, [Online]. Available:, (version 4.3.19) [Computer Program]
    • P. Boersma and D. Weenink, "Praat: Doing Phonetics by Computer," Jul. 20, 2005 [Online]. Available: http://www.praat.org, (version 4.3.19) [computer program], 2005, retrieved from
    • Praat: Doing Phonetics by Computer
    • Boersma, P.1    Weenink, D.2
  • 29
    • 84859899617 scopus 로고    scopus 로고
    • Anvil-A generic annotation tool for multimodal dialogue
    • M. Kipp, "Anvil-A generic annotation tool for multimodal dialogue," in Proc. Eurospeech, 2001, pp. 1367-1370.
    • (2001) Proc. Eurospeech , pp. 1367-1370
    • Kipp, M.1
  • 30
    • 74049094559 scopus 로고    scopus 로고
    • Analyzing the interplay between spoken language and gestural cues in conversational child-machine interactions in pre/early literate age group
    • paper ID 047
    • S. Montanari, S. Yildirim, S. Khurana, M. Landes, L. Lawyer, E. Andersen, and S. Narayanan, "Analyzing the interplay between spoken language and gestural cues in conversational child-machine interactions in pre/early literate age group," in Proc. InStil, Jul. 2004, paper ID 047.
    • (2004) Proc. InStil, Jul.
    • Montanari, S.1    Yildirim, S.2    Khurana, S.3    Landes, M.4    Lawyer, L.5    Andersen, E.6    Narayanan, S.7
  • 31
    • 85009115741 scopus 로고    scopus 로고
    • Reference marking in childrens computer-directed speech: An integrated analysis of discourse and gesture
    • Oct.
    • S. Montanari, S. Yildirim, E. Andersen, and S. Narayanan, "Reference marking in childrens computer-directed speech: An integrated analysis of discourse and gesture," in Proc. ICSLP, Oct. 2004, pp. 1841-1844.
    • (2004) Proc. ICSLP , pp. 1841-1844
    • Montanari, S.1    Yildirim, S.2    Andersen, E.3    Narayanan, S.4
  • 32
    • 0029219786 scopus 로고
    • Predicting spoken disfluecies during human-computer interaction
    • S. Oviatt, "Predicting spoken disfluecies during human-computer interaction," Comput. Speech Lang., vol.9, pp. 19-35, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 19-35
    • Oviatt, S.1
  • 33
    • 0039100034 scopus 로고    scopus 로고
    • Disfluencies in switchboard
    • E. Shriberg, "Disfluencies in switchboard," in Proc. ICSLP, 1996, pp. 11-14.
    • (1996) Proc. ICSLP , pp. 11-14
    • Shriberg, E.1
  • 34
    • 84891308106 scopus 로고    scopus 로고
    • Srilm-An extensible language modeling toolkit
    • A. Stolcke, "Srilm-An extensible language modeling toolkit," in Proc. ICSLP, 2002, vol.2, pp. 901-904.
    • (2002) Proc. ICSLP , vol.2 , pp. 901-904
    • Stolcke, A.1
  • 37
    • 0036379257 scopus 로고    scopus 로고
    • Measuring the structure of dynamic visual signals
    • R. A. Peters, C. W. G. Clifford, and C. S. Evans, "Measuring the structure of dynamic visual signals," Animal Behaviour, vol.64, pp. 131-146, 2002.
    • (2002) Animal Behaviour , vol.64 , pp. 131-146
    • Peters, R.A.1    Clifford, C.W.G.2    Evans, C.S.3
  • 39
    • 0033713738 scopus 로고    scopus 로고
    • Combining multiple classifiers by averaging or by multiplying
    • D. Tax, M. van Breukelen, R. Duin, and J. Kittler, "Combining multiple classifiers by averaging or by multiplying," Pattern Recognition, vol.33, pp. 1475-1485, 2000.
    • (2000) Pattern Recognition , vol.33 , pp. 1475-1485
    • Tax, D.1    Van Breukelen, M.2    Duin, R.3    Kittler, J.4
  • 40
    • 0037403516 scopus 로고    scopus 로고
    • Measure of diversity in classifier esembles
    • L. Kuncheva and C. Whitaker, "Measure of diversity in classifier esembles," Mach. Learn., vol.51, pp. 181-207, 2003.
    • (2003) Mach. Learn. , vol.51 , pp. 181-207
    • Kuncheva, L.1    Whitaker, C.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.