메뉴 건너뛰기




Volumn 17, Issue 7, 2009, Pages 1263-1278

Improved features and models for detecting edit disfluencies in transcribing spontaneous mandarin speech

Author keywords

Edit disfluency; Interruption point detection; Prosody; Speech recognition; Spontaneous speech

Indexed keywords

CHINESE CHARACTERS; CONDITIONAL RANDOM FIELD; DISFLUENCIES; EDIT DISFLUENCY; IMPROVED MODELS; INTERRUPTION POINT DETECTION; MAXIMUM ENTROPY; PROBABILISTIC FRAMEWORK; PROSODIC FEATURES; PROSODIC MODELING; PROSODIC STATE; PROSODY; RECOGNITION ACCURACY; SPEECH CORPORA; SPEECH PROSODY; SPONTANEOUS SPEECH;

EID: 68549130583     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2014792     Document Type: Article
Times cited : (20)

References (41)
  • 1
    • 85083514474 scopus 로고    scopus 로고
    • Parsing conversational speech using enhanced segmentation
    • J. G. Kahn, M. Ostendorf, and C. Chelba, "Parsing conversational speech using enhanced segmentation," in Proc. HLT/NAACL, 2004, pp. 121-128.
    • (2004) Proc. HLT/NAACL , pp. 121-128
    • Kahn, J.G.1    Ostendorf, M.2    Chelba, C.3
  • 2
    • 68549138413 scopus 로고    scopus 로고
    • S. Strassel, Simple metadata annotation specification V6.2, Linguistic Data Consortium, 2004 [Online]. Available: http://www.ldc. upenn.edu/Projects/MDE/Guidelines/SimpleMDE V6.2.pdf
    • S. Strassel, "Simple metadata annotation specification V6.2," Linguistic Data Consortium, 2004 [Online]. Available: http://www.ldc. upenn.edu/Projects/MDE/Guidelines/SimpleMDE V6.2.pdf
  • 4
    • 24144462548 scopus 로고    scopus 로고
    • Analysis and recognition of spontaneous speech using corpus of spontaneous japanese
    • S. Furui, M. Nakamura, T. Ichiba, and K. Iwano, "Analysis and recognition of spontaneous speech using corpus of spontaneous japanese," Speech Commun., vol. 47, pp. 208-219, 2005.
    • (2005) Speech Commun , vol.47 , pp. 208-219
    • Furui, S.1    Nakamura, M.2    Ichiba, T.3    Iwano, K.4
  • 8
    • 34047261805 scopus 로고    scopus 로고
    • An overview of automatic speaker diarization systems
    • Sep
    • S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1557-1565
    • Tranter, S.E.1    Reynolds, D.A.2
  • 11
    • 34047266604 scopus 로고    scopus 로고
    • Edit disfluency detection and correction using a cleanup language model and an alignment model
    • Sep
    • J.-F. Yeh and C.-H. Wu, "Edit disfluency detection and correction using a cleanup language model and an alignment model," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1574-1583, Sep. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1574-1583
    • Yeh, J.-F.1    Wu, C.-H.2
  • 12
    • 0040958578 scopus 로고    scopus 로고
    • Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue
    • P. Heeman and J. Allen, "Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue," Comput. Linguist., vol. 25, pp. 527-571, 1999.
    • (1999) Comput. Linguist , vol.25 , pp. 527-571
    • Heeman, P.1    Allen, J.2
  • 13
    • 85120620835 scopus 로고    scopus 로고
    • Edit detection and parsing for transcribed speech
    • E. Charniak and M. Johnson, "Edit detection and parsing for transcribed speech," in Proc. NAACL, 2001, pp. 118-126.
    • (2001) Proc. NAACL , pp. 118-126
    • Charniak, E.1    Johnson, M.2
  • 14
    • 57849131781 scopus 로고    scopus 로고
    • a TAG-based noisy channel model of speech repairs
    • m. Johnson and e. Charniak, "a TAG-based noisy channel model of speech repairs," in Proc. ACL, 2004.
    • (2004) Proc. ACL
    • Johnson, M.1    Charniak, E.2
  • 15
    • 33646762857 scopus 로고    scopus 로고
    • Automatic disfluency removal on recognized spontaneous speech - Rapid adaptation to speaker dependent disfluencies
    • M. Honal and T. Schultz, "Automatic disfluency removal on recognized spontaneous speech - Rapid adaptation to speaker dependent disfluencies," in Proc. ICASSP, 2005, pp. 969-972.
    • (2005) Proc. ICASSP , pp. 969-972
    • Honal, M.1    Schultz, T.2
  • 16
    • 56149102222 scopus 로고    scopus 로고
    • Corrections of disfluencies in spontaneous speech using a noisy channel approach
    • M. Honal and T. Schultz, "Corrections of disfluencies in spontaneous speech using a noisy channel approach," in Proc. Eurospeech, 2003, pp. 2781-2784.
    • (2003) Proc. Eurospeech , pp. 2781-2784
    • Honal, M.1    Schultz, T.2
  • 17
    • 0028215480 scopus 로고
    • A corpus-based study of repair cues in spontaneous speech
    • C. Nakatani and J. Hirschberg, "A corpus-based study of repair cues in spontaneous speech," J. Acoust. Soc. Amer., pp. 1603-1616, 1994.
    • (1994) J. Acoust. Soc. Amer , pp. 1603-1616
    • Nakatani, C.1    Hirschberg, J.2
  • 18
    • 0000703860 scopus 로고    scopus 로고
    • Phonetic consequences of speech disfluency
    • E. Shriberg, "Phonetic consequences of speech disfluency," in Proc. Int. Conf. Phonetics Sci., 1999, pp. 619-622.
    • (1999) Proc. Int. Conf. Phonetics Sci , pp. 619-622
    • Shriberg, E.1
  • 19
    • 0030351630 scopus 로고    scopus 로고
    • Juncture cues to disfluency
    • R. Lickley, "Juncture cues to disfluency," in Proc. ICSLP, 1996, pp. 2478-2481.
    • (1996) Proc. ICSLP , pp. 2478-2481
    • Lickley, R.1
  • 20
    • 84878523744 scopus 로고    scopus 로고
    • Prosodic features of four types of disfluencies
    • G. Savova and J. Bachenko, "Prosodic features of four types of disfluencies," in Proc. DiSS, 2003, pp. 91-94.
    • (2003) Proc. DiSS , pp. 91-94
    • Savova, G.1    Bachenko, J.2
  • 21
    • 0010125082 scopus 로고    scopus 로고
    • A prosody-only decision-tree model for disfluency detection
    • E. Shriberg and A. Stolcke, "A prosody-only decision-tree model for disfluency detection," in Proc. Eurospeech, 1997, pp. 2383-2386.
    • (1997) Proc. Eurospeech , pp. 2383-2386
    • Shriberg, E.1    Stolcke, A.2
  • 22
    • 0034275920 scopus 로고    scopus 로고
    • Prosody-based automatic segmentation of speech into sentences and topics
    • E. Shriberg, A. Stolcke, D. Hakkani-Tur, and G. Tur, "Prosody-based automatic segmentation of speech into sentences and topics," Speech Commun., pp. 127-154, 2000.
    • (2000) Speech Commun , pp. 127-154
    • Shriberg, E.1    Stolcke, A.2    Hakkani-Tur, D.3    Tur, G.4
  • 23
    • 33745191849 scopus 로고    scopus 로고
    • Comparing HMM, maximum entropy, and conditional random fields for disfluency detection
    • Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, "Comparing HMM, maximum entropy, and conditional random fields for disfluency detection," in Proc. Eurospeech, 2005, pp. 3313-3316.
    • (2005) Proc. Eurospeech , pp. 3313-3316
    • Liu, Y.1    Stolcke, A.2    Shriberg, E.3    Harper, M.4
  • 24
    • 33646800879 scopus 로고    scopus 로고
    • Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, Structural metadata research in the ears program, presented at the icassp, 2005, pp. 957-960, unpublished.
    • Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Structural metadata research in the ears program," presented at the icassp, 2005, pp. 957-960, unpublished.
  • 25
    • 85009223733 scopus 로고    scopus 로고
    • Automatic disfluency identification in conversational speech using multiple knowledge sources
    • Y. Liu, E. Shriberg, and A. Stolcke, "Automatic disfluency identification in conversational speech using multiple knowledge sources," in Proc. Eurospeech, 2003, pp. 957-960.
    • (2003) Proc. Eurospeech , pp. 957-960
    • Liu, Y.1    Shriberg, E.2    Stolcke, A.3
  • 26
    • 85009142186 scopus 로고    scopus 로고
    • Using machine learning to cope with imbalanced classes in natural speech: Evidence from sentence boundary and disfluency detection
    • Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Using machine learning to cope with imbalanced classes in natural speech: Evidence from sentence boundary and disfluency detection," in Proc. ICSLP, 2004, pp. 1525-1528.
    • (2004) Proc. ICSLP , pp. 1525-1528
    • Liu, Y.1    Shriberg, E.2    Stolcke, A.3    Harper, M.4
  • 27
    • 33646764337 scopus 로고    scopus 로고
    • A lexically-driven algorithm for disfluency detection
    • M. Snover, B. Dorr, and R. Schwartz, "A lexically-driven algorithm for disfluency detection," in Proc. HLT/NAACL, 2004, pp. 157-160.
    • (2004) Proc. HLT/NAACL , pp. 157-160
    • Snover, M.1    Dorr, B.2    Schwartz, R.3
  • 28
    • 33646819463 scopus 로고    scopus 로고
    • Detecting structural metadata with decision trees and transformation-based learning
    • J. Kim, S. Schwarm, and M. Ostendorf, "Detecting structural metadata with decision trees and transformation-based learning," in Proc. HLT/ NAACL, 2004.
    • (2004) Proc. HLT/ NAACL
    • Kim, J.1    Schwarm, S.2    Ostendorf, M.3
  • 29
    • 0002652285 scopus 로고    scopus 로고
    • A maximum entropy approach to natural language processing
    • A. L. Berger, S. A. Della Pietra, and V. J. Della Pietra, "A maximum entropy approach to natural language processing," Comput. Linguist., vol. 22, pp. 39-72, 1996.
    • (1996) Comput. Linguist , vol.22 , pp. 39-72
    • Berger, A.L.1    Della Pietra, S.A.2    Della Pietra, V.J.3
  • 30
    • 0000732463 scopus 로고
    • A limited memory algorithm for bound constrained optimization
    • R. H. Ryrd, P. Lu, and J. Nocedal, "A limited memory algorithm for bound constrained optimization," SIAM J. Sci. Statist. Comput., vol. 16, no. 5, pp. 1190-1208, 1995.
    • (1995) SIAM J. Sci. Statist. Comput , vol.16 , Issue.5 , pp. 1190-1208
    • Ryrd, R.H.1    Lu, P.2    Nocedal, J.3
  • 31
    • 0004014502 scopus 로고    scopus 로고
    • A Gaussian Prior for Smoothing Maximum Entropy Models Carnegie Mellon Univ., Pittsburgh, PA
    • Tech. Rep
    • S. Chen and R. Rosenfeld, A Gaussian Prior for Smoothing Maximum Entropy Models Carnegie Mellon Univ., Pittsburgh, PA, 1999, Tech. Rep..
    • (1999)
    • Chen, S.1    Rosenfeld, R.2
  • 33
    • 21844474040 scopus 로고    scopus 로고
    • Fluent speech prosody: Framework and modeling
    • July, Special Issue on Quantitative Prosody Modeling for Natural Speech Description and Generation
    • C.-Y. Tseng, S.-H. Pin, Y.-L. Lee, H.-M. Wang, and Y.-C. Chen, "Fluent speech prosody: Framework and modeling," Speech Commun., vol. 46, no. 3-4, pp. 284-309, July 2005, Special Issue on Quantitative Prosody Modeling for Natural Speech Description and Generation.
    • (2005) Speech Commun , vol.46 , Issue.3-4 , pp. 284-309
    • Tseng, C.-Y.1    Pin, S.-H.2    Lee, Y.-L.3    Wang, H.-M.4    Chen, Y.-C.5
  • 34
    • 11244330002 scopus 로고    scopus 로고
    • Probabilistic latent semantic analysis
    • T. Hofmann, "Probabilistic latent semantic analysis," in Uncertainty Artif. Intell., 1999, pp. 289-296.
    • (1999) Uncertainty Artif. Intell , pp. 289-296
    • Hofmann, T.1
  • 35
    • 40349098806 scopus 로고    scopus 로고
    • Learning the threshold in hierarchical agglomerative clustering
    • K. Daniels and C. Giraud-Carrier, "Learning the threshold in hierarchical agglomerative clustering," in Proc. ICMLA, 2006, pp. 270-278.
    • (2006) Proc. ICMLA , pp. 270-278
    • Daniels, K.1    Giraud-Carrier, C.2
  • 37
    • 0142192295 scopus 로고    scopus 로고
    • Conditional random fields: Probabilistic models for segmenting and labeling sequence data
    • J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proc. ICML, 2001, pp. 282-289.
    • (2001) Proc. ICML , pp. 282-289
    • Lafferty, J.1    McCallum, A.2    Pereira, F.3
  • 38
    • 68549091556 scopus 로고    scopus 로고
    • S.-C. Tseng, Processing spoken mandarin corpora, Traitement Automatique des Langues, 45, no. 2, pp. 89-108, Special Issue: Spoken Corpus Processing.
    • S.-C. Tseng, "Processing spoken mandarin corpora," Traitement Automatique des Langues, vol. 45, no. 2, pp. 89-108, Special Issue: Spoken Corpus Processing.
  • 39
    • 33947691278 scopus 로고    scopus 로고
    • Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)
    • Y.-C. Hsieh, Y.-T. Huang, C.-C. Wang, and L.-S. Lee, "Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)," in Proc. ICASSP, 2006, pp. 961-964.
    • (2006) Proc. ICASSP , pp. 961-964
    • Hsieh, Y.-C.1    Huang, Y.-T.2    Wang, C.-C.3    Lee, L.-S.4
  • 40
    • 68549116144 scopus 로고    scopus 로고
    • Rich Transcription (RT-04F)
    • Evaluation Plan, Online, Available
    • "Rich Transcription (RT-04F)," Evaluation Plan 2004 [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/docs/rt04f-eval-plan- v14.doc
    • (2004)
  • 41
    • 0001884644 scopus 로고
    • Individual comparisons by ranking methods
    • F. Wilcoxon, "Individual comparisons by ranking methods," Biometrics, vol. 1, pp. 80-83, 1945.
    • (1945) Biometrics , vol.1 , pp. 80-83
    • Wilcoxon, F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.