Volume 14, Issue 5, 2006, Pages 1574-1583

Edit disfluency detection and correction using a cleanup language model and an alignment model

Author keywords

Edit disfluency; Language model; Potential interruption point (IP) detection; Rich transcription

Indexed keywords

EDIT DISFLUENCY; LANGUAGE MODELS; POTENTIAL INTERRUPTION POINT (IP) DETECTION; RICH TRANSCRIPTION

EID: 34047266604     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.878267     Document Type: Article
Times cited : (19)

References (43)
  • 3
    • 33646809491
    • Structural event detection for rich transcription of speech
    • Y. Liu, "Structural event detection for rich transcription of speech," Ph.D. dissertation, Purdue Univ., West Lafayette, IN, 2004.
    • (2004)
    • Liu, Y.1
  • 5
    • 33646786242
    • S. Strassel. (2004) Simple Metadata Annotation Specification Version 6.2. Linguistic Data Consortium. [Online]. Available: http://www.ldc.upenn.edu/Projects/MDE
    • (2004) Simple Metadata Annotation Specification
    • Strassel, S.1
  • 6
    • 18744415719
    • Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system
    • C.-H. Wu and G.-L. Yan, "Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 330-344, May 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 330-344
    • Wu, C.-H.1    Yan, G.-L.2
  • 7
    • 0034275920
    • Prosody-based automatic segmentation of speech into sentences and topics
    • E. Shriberg, A. Stolcke, D. Hakkani-Tur, and G. Tur, "Prosody-based automatic segmentation of speech into sentences and topics," Speech Commun., vol. 32, no. 1-2, pp. 127-154, 2000.
    • (2000) Speech Commun , vol.32 , Issue.1-2 , pp. 127-154
    • Shriberg, E.1    Stolcke, A.2    Hakkani-Tur, D.3    Tur, G.4
  • 8
    • 85059598545
    • Integrating multiple knowledge sources for detecting and correction of repairs in human computer dialog
    • J. Bear, J. Dowding, and E. Shriberg, "Integrating multiple knowledge sources for detecting and correction of repairs in human computer dialog," in Proc. ACL, 1992, pp. 56-63.
    • (1992) Proc. ACL , pp. 56-63
    • Bear, J.1    Dowding, J.2    Shriberg, E.3
  • 10
    • 84878523744
    • Prosodic features of four types of disfluencies
    • G. Savova and J. Bachenko, "Prosodic features of four types of disfluencies," in Proc. DiSS, 2003, pp. 91-94.
    • (2003) Proc. DiSS , pp. 91-94
    • Savova, G.1    Bachenko, J.2
  • 12
    • 33646819463
    • Detecting structural metadata with decision trees and transformation-based learning
    • J. Kim, S. E. Schwarm, and M. Ostendorf, "Detecting structural metadata with decision trees and transformation-based learning," in Proc. HLT/NAACL, 2004, pp. 137-144.
    • (2004) Proc. HLT/NAACL , pp. 137-144
    • Kim, J.1    Schwarm, S.E.2    Ostendorf, M.3
  • 13
    • 24144462548
    • Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese
    • S. Furui, M. Nakamura, T. Ichiba, and K. Iwano, "Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese," Speech Commun., vol. 47, pp. 208-219, 2005.
    • (2005) Speech Commun , vol.47 , pp. 208-219
    • Furui, S.1    Nakamura, M.2    Ichiba, T.3    Iwano, K.4
  • 14
    • 85120620835
    • Edit detection and parsing for transcribed speech
    • E. Charniak and M. Johnson, "Edit detection and parsing for transcribed speech," in Proc. NAACL, 2001, pp. 118-126.
    • (2001) Proc. NAACL , pp. 118-126
    • Charniak, E.1    Johnson, M.2
  • 15
    • 57849131781
    • A tag-based noisy channel model of speech repairs
    • M. Johnson and E. Charniak, "A tag-based noisy channel model of speech repairs," in Proc. ACL, 2004, pp. 33-39.
    • (2004) Proc. ACL , pp. 33-39
    • Johnson, M.1    Charniak, E.2
  • 16
    • 33646756210
    • Parsing and its applications for conversational speech
    • M. Lease, E. Charniak, and M. Johnson, "Parsing and its applications for conversational speech," in Proc. ICASSP, 2005, pp. 961-964.
    • (2005) Proc. ICASSP , pp. 961-964
    • Lease, M.1    Charniak, E.2    Johnson, M.3
  • 18
    • 0040958578
    • Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue
    • P. Heeman and J. Allen, "Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue," Comput. Ling., vol. 25, pp. 527-571, 1999.
    • (1999) Comput. Ling , vol.25 , pp. 527-571
    • Heeman, P.1    Allen, J.2
  • 20
    • 33646762857
    • Automatic disfluency removal on recognized spontaneous speech-Rapid adaptation to speaker dependent disfluencies
    • M. Honal and T. Schultz, "Automatic disfluency removal on recognized spontaneous speech-Rapid adaptation to speaker dependent disfluencies," in Proc. ICASSP, 2005, pp. 969-972.
    • (2005) Proc. ICASSP , pp. 969-972
    • Honal, M.1    Schultz, T.2
  • 21
    • 56149102222
    • Corrections of disfluencies in spontaneous speech using a noisy-channel approach
    • M. Honal and T. Schultz, "Corrections of disfluencies in spontaneous speech using a noisy-channel approach," in Proc. Eurospeech, 2003, pp. 2781-2784.
    • (2003) Proc. Eurospeech , pp. 2781-2784
    • Honal, M.1    Schultz, T.2
  • 22
    • 33646764337
    • A lexically-driven algorithm for disfluency detection
    • M. Snover, B. Dorr, and R. Schwartz, "A lexically-driven algorithm for disfluency detection," in Proc. HLT/NAACL, 2004, pp. 157-160.
    • (2004) Proc. HLT/NAACL , pp. 157-160
    • Snover, M.1    Dorr, B.2    Schwartz, R.3
  • 27
    • 85009291541
    • Maximum entropy model for punctuation annotation from speech
    • J. Huang and G. Zweig, "Maximum entropy model for punctuation annotation from speech," in Proc. ICSLP, pp. 917-920.
    • Proc. ICSLP , pp. 917-920
    • Huang, J.1    Zweig, G.2
  • 28
    • 33745191849
    • Comparing HMM, maximum entropy, and conditional random fields for disfluency detection
    • Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Comparing HMM, maximum entropy, and conditional random fields for disfluency detection," in Proc. Eurospeech, 2005, pp. 3313-3316.
    • (2005) Proc. Eurospeech , pp. 3313-3316
    • Liu, Y.1    Shriberg, E.2    Stolcke, A.3    Harper, M.4
  • 31
    • 0002629270
    • Maximum-likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum-likelihood from incomplete data via the EM algorithm," J. R. Statist Soc. B, pp. 1-39, 1977.
    • (1977) J. R. Statist Soc. B , pp. 1-39
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 33
    • 0029765629
    • Statistical language modeling for speech disfluencies
    • A. Stolcke and E. Shriberg, "Statistical language modeling for speech disfluencies," in Proc. ICASSP, vol. 1, 1996, pp. 405-408.
    • (1996) Proc. ICASSP , vol.1 , pp. 405-408
    • Stolcke, A.1    Shriberg, E.2
  • 34
    • 34047265603
    • An empirical study of smoothing techniques for language modeling
    • S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Center Res. Comput. Technol., Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.
  • 36
    • 85146676791
    • Verb semantics and lexical selection
    • Z. Wu and M. Palmer, "Verb semantics and lexical selection," in Proc. 32nd ACL, 1994, pp. 133-138.
    • (1994) Proc. 32nd ACL , pp. 133-138
    • Wu, Z.1    Palmer, M.2
  • 40
    • 34047261551
    • MAT Speech Database - TCC-300 [Online]. Available: http://rocling.iis.sinica.edu.tw/ROCLING/MAT/Tcc_300brief.htm
    • MAT Speech Database - TCC-300
  • 41
    • 36749015898
    • Rich Transcription (RT-04F) Evaluation Plan (2004). [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/docs/rt04f-eval-plan-vl4.doc
    • (2004) Rich Transcription (RT-04F) Evaluation Plan
  • 42
    • 85123721100
    • Important and new features with analysis for disfluency interruption point (IP) detection in spontaneous Mandarin speech
    • C.-K. Lin, S.-C. Tseng, and L.-S. Lee, "Important and new features with analysis for disfluency interruption point (IP) detection in spontaneous Mandarin speech," in Proc. DiSS, 2005, pp. 117-121.
    • (2005) Proc. DiSS , pp. 117-121
    • Lin, C.-K.1    Tseng, S.-C.2    Lee, L.-S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.