SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 5, 2006, Pages 1574-1583

Edit disfluency detection and correction using a cleanup language model and an alignment model

(2) Yeh, Jui Feng b Wu, Chung Hsien a,b

a IEEE (Taiwan)

b NATIONAL CHENG KUNG UNIVERSITY (Taiwan)

Author keywords

Edit disflucncy; Language model; Potential interruption point (IP) detection; Rich transcription

Indexed keywords

EDIT DISFLUCNCY; LANGUAGE MODELS; POTENTIAL INTERRUPTION POINT (IP) DETECTION; RICH TRANSCRIPTION;

BIT ERROR RATE; MATHEMATICAL MODELS; OPTIMIZATION; SPEECH PROCESSING; WORD PROCESSING;

FORMAL LANGUAGES;

EID: 34047266604 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.878267 Document Type: Article

Times cited : (19)

References (43)

1
- 3042820894
- Automatic recognition of spontaneous speech for access to multilingual oral history archives
- Jul
- W. Byrne, D. Doermann, M. Franz, S. Gustman, J. Hajic, D. Oard, M. Picheny, J. Psutka, B. Ramabhadran, D. Soergel, T. Ward, and Z. Wei-Jin, "Automatic recognition of spontaneous speech for access to multilingual oral history archives," IEEE Trans. Speech Audio Process., vol. 12, no. 4, pp. 420-435, Jul. 2004.
- (2004) IEEE Trans. Speech Audio Process , vol.12 , Issue.4 , pp. 420-435
- Byrne, W.¹ Doermann, D.² Franz, M.³ Gustman, S.⁴ Hajic, J.⁵ Oard, D.⁶ Picheny, M.⁷ Psutka, J.⁸ Ramabhadran, B.⁹ Soergel, D.¹⁰ Ward, T.¹¹ Wei-Jin, Z.¹²

2
- 85009168601
- Measuring the readability of automatic speech-to-text transcripts
- D. Jones, F. Wolf, E. Gibson, E. Williams, E. Fedorenko, D. Reynods, and M. Zissman, "Measuring the readability of automatic speech-to-text transcripts," in Proc. Eurospeech, 2003, pp. 1585-1588.
- (2003) Proc. Eurospeech , pp. 1585-1588
- Jones, D.¹ Wolf, F.² Gibson, E.³ Williams, E.⁴ Fedorenko, E.⁵ Reynods, D.⁶ Zissman, M.⁷

3
- 33646809491
- Structural event detection for rich transcription of speech,
- Ph.D. dissertation, Purdue Univ, West Lafayette, IN
- Y. Liu, "Structural event detection for rich transcription of speech," Ph.D. dissertation, Purdue Univ., West Lafayette, IN, 2004.
- (2004)
- Liu, Y.¹

4
- 0003798906
- Ph.D, Dept. Psychol, Univ. California, Berkeley
- E. Shriberg, "Preliminaries to a theory of speech disfluencies," Ph.D., Dept. Psychol., Univ. California, Berkeley, 1994.
- (1994) Preliminaries to a theory of speech disfluencies
- Shriberg, E.¹

5
- 33646786242
- Version 6.2. Linguistic Data Consortium, Online, Available
- S. Strassel. (2004) Simple Metadata Annotation Specification Version 6.2. Linguistic Data Consortium. [Online]. Available: http://www.ldc.upenn.edu/ Projects/MDE
- (2004) Simple Metadata Annotation Specification
- Strassel, S.¹

6
- 18744415719
- Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system
- May
- C.-H. Wu and G.-L. Yan, "Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 330-344, May 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 330-344
- Wu, C.-H.¹ Yan, G.-L.²

7
- 0034275920
- Prosody-based automatic segmentation of speech into sentences and topics
- E. Shriberg, A. Stolcke, D. Hakkani-Tur, and G. Tur, "Prosody-based automatic segmentation of speech into sentences and topics," Speech Commun., vol. 32, no. 1-2, pp. 127-154, 2000.
- (2000) Speech Commun , vol.32 , Issue.1-2 , pp. 127-154
- Shriberg, E.¹ Stolcke, A.² Hakkani-Tur, D.³ Tur, G.⁴

8
- 85059598545
- Integrating multiple knowledge sources for detecting and correction of repairs in human computer dialog
- J. Bear, J. Dowding, and E. Shriberg, "Integrating multiple knowledge sources for detecting and correction of repairs in human computer dialog," in Proc. ACL, 1992, pp. 56-63.
- (1992) Proc. ACL , pp. 56-63
- Bear, J.¹ Dowding, J.² Shriberg, E.³

9
- 21844454996
- Modeling prosodic feature sequences for speaker recognition
- E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition," Speech Commun., pp. 455-472, 2005.
- (2005) Speech Commun , pp. 455-472
- Shriberg, E.¹ Ferrer, L.² Kajarekar, S.³ Venkataraman, A.⁴ Stolcke, A.⁵

10
- 84878523744
- Prosodic features of four types of disfluencies
- G. Savova and J. Bachenko, "Prosodic features of four types of disfluencies," in Proc. DiSS, 2003, pp. 91-94.
- (2003) Proc. DiSS , pp. 91-94
- Savova, G.¹ Bachenko, J.²

11
- 33646798740
- The IBM2004 conversational telephony system forrichtranscription
- H. Soltau, B. Kingsbury, L. Mangu, D. Povey, G. Saon, and G. Zweig, "The IBM2004 conversational telephony system forrichtranscription, "inProc. IEEE Int. Conf. Acoustics, Speech, Signal Process.,2005, pp. 205-208.
- (2005) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process , pp. 205-208
- Soltau, H.¹ Kingsbury, B.² Mangu, L.³ Povey, D.⁴ Saon, G.⁵ Zweig, G.⁶

12
- 33646819463
- Detecting structural metadata with decision trees and transformation-based learning
- J. Kim, S. E. Schwarm, and M. Ostendorf, "Detecting structural metadata with decision trees and transformation-based learning," in Proc. HLT/NAACL, 2004, pp. 137-144.
- (2004) Proc. HLT/NAACL , pp. 137-144
- Kim, J.¹ Schwarm, S.E.² Ostendorf, M.³

13
- 24144462548
- Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese
- S. Furui, M. Nakamura, T. Ichiba, and K. Iwano, "Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese," Speech Commun., vol. 47, pp. 208-219, 2005.
- (2005) Speech Commun , vol.47 , pp. 208-219
- Furui, S.¹ Nakamura, M.² Ichiba, T.³ Iwano, K.⁴

14
- 85120620835
- Edit detection and parsing for transcribed speech
- E. Charniak and M. Johnson, "Edit detection and parsing for transcribed speech," in Proc. NAACL, 2001, pp. 118-126.
- (2001) Proc. NAACL , pp. 118-126
- Charniak, E.¹ Johnson, M.²

15
- 57849131781
- A tag-based noisy channel model of speech repairs
- M. Johnson and E. Charniak, "A tag-based noisy channel model of speech repairs," in Proc. ACL, 2004, pp. 33-39.
- (2004) Proc. ACL , pp. 33-39
- Johnson, M.¹ Charniak, E.²

16
- 33646756210
- Parsing and its applications for conversational speech
- M. Lease, E. Charniak, and M. Johnson, "Parsing and its applications for conversational speech," in Proc. ICASSP, 2005, pp. 961-964.
- (2005) Proc. ICASSP , pp. 961-964
- Lease, M.¹ Charniak, E.² Johnson, M.³

17
- 33646770096
- An improved model for recogizing disfluencies in conversational speech
- M. Johnson, E. Charniak, and M. Lease, "An improved model for recogizing disfluencies in conversational speech," in Proc. Rich Transcription 2004 Fall Workshop, 2004.
- (2004) Proc. Rich Transcription 2004 Fall Workshop
- Johnson, M.¹ Charniak, E.² Lease, M.³

18
- 0040958578
- Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue
- P. Heeman and J. Allen, "Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue," Comput. Ling., vol. 25, pp. 527-571, 1999.
- (1999) Comput. Ling , vol.25 , pp. 527-571
- Heeman, P.¹ Allen, J.²

19
- 0030351559
- Combining the detection and correction of speech repairs
- Oct
- P. A. Heeman, K. Loken-Kim, and J. F. Allen, "Combining the detection and correction of speech repairs," in Proc. 4th Int. Conf. Spoken Lang. Process., Oct. 1996, pp. 358-361.
- (1996) Proc. 4th Int. Conf. Spoken Lang. Process , pp. 358-361
- Heeman, P.A.¹ Loken-Kim, K.² Allen, J.F.³

20
- 33646762857
- Automatic disfluency removal on recognized spontaneous speech-Rapid adaptation to speaker dependent dislfuencies
- M. Honal and T. Schultz, "Automatic disfluency removal on recognized spontaneous speech-Rapid adaptation to speaker dependent dislfuencies," in Proc. ICASSP, 2005, pp. 969-972.
- (2005) Proc. ICASSP , pp. 969-972
- Honal, M.¹ Schultz, T.²

21
- 56149102222
- Corrections of disfluencies in spontaneous speech using a noisy-channel approach
- _, "Corrections of disfluencies in spontaneous speech using a noisy-channel approach," in Proc. Eurospeech, 2003, pp. 2781-2784.
- (2003) Proc. Eurospeech , pp. 2781-2784
- Honal, M.¹ Schultz, T.²

22
- 33646764337
- A lexically-driven algorithm for disfluency detection
- M. Snover, B. Dorr, and R. Schwartz, "A lexically-driven algorithm for disfluency detection," in Proc. HLT/NAACL, 2004, pp. 157-160.
- (2004) Proc. HLT/NAACL , pp. 157-160
- Snover, M.¹ Dorr, B.² Schwartz, R.³

23
- 27744599401
- Automatic transcription of conversational telephone speech
- Nov
- T. Hain, P. C. Woodland, G. Evermann, M. J. F. Gales, X. Liu, G. L. Moore, D. Povey, and L. Wang, "Automatic transcription of conversational telephone speech," IEEE Trans. Speech Audio Process., vol. 13, no. 6, pp. 1173-1185, Nov. 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.6 , pp. 1173-1185
- Hain, T.¹ Woodland, P.C.² Evermann, G.³ Gales, M.J.F.⁴ Liu, X.⁵ Moore, G.L.⁶ Povey, D.⁷ Wang, L.⁸

24
- 4544275060
- The 2003 ISL rich transcription system for conversational telephony speech
- H. Soltau, H. Yu, F. Metze, C. Fugen, J. Qin, and S.-C. Jou, "The 2003 ISL rich transcription system for conversational telephony speech," in Proc. Acoust., Speech, Signal Process., 2004, pp. 17-21.
- (2004) Proc. Acoust., Speech, Signal Process , pp. 17-21
- Soltau, H.¹ Yu, H.² Metze, F.³ Fugen, C.⁴ Qin, J.⁵ Jou, S.-C.⁶

25
- 48749092718
- Final report on parsing and spoken structural event detection
- M. Harper, B. J. Dorr, J. Hale, B. Roark, I. Shafran, M. Lease, Y. Liu, M. Snover, L. Yung, A. Krasnyanskaya, and R. Stewart, "Final report on parsing and spoken structural event detection," in Proc. Johns Hopkins Summer Workshop, 2005, pp. 1-116.
- (2005) Proc. Johns Hopkins Summer Workshop , pp. 1-116
- Harper, M.¹ Dorr, B.J.² Hale, J.³ Roark, B.⁴ Shafran, I.⁵ Lease, M.⁶ Liu, Y.⁷ Snover, M.⁸ Yung, L.⁹ Krasnyanskaya, A.¹⁰ Stewart, R.¹¹

26
- 44849094956
- Using conditional random fields for sentence boundary detection in speech
- Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, "Using conditional random fields for sentence boundary detection in speech," in Proc. 43nd Annu. Meeting Assoc. Computat. Ling., 2005, pp. 451-458.
- (2005) Proc. 43nd Annu. Meeting Assoc. Computat. Ling , pp. 451-458
- Liu, Y.¹ Stolcke, A.² Shriberg, E.³ Harper, M.⁴

27
- 85009291541
- Maximum entropy model for punctuation annotation from speech
- J. Huang and G. Zweig, "Maximum entropy model for punctuation annotation from speech," in Proc. ICSLP, pp. 917-920.
- Proc. ICSLP , pp. 917-920
- Huang, J.¹ Zweig, G.²

28
- 33745191849
- Comparing HMM, maximum entropy, and conditional random fields for disfluency detection
- Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Comparing HMM, maximum entropy, and conditional random fields for disfluency detection," in Proc. Eurospeech, 2005, pp. 3313-3316.
- (2005) Proc. Eurospeech , pp. 3313-3316
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³ Harper, M.⁴

29
- 33646800879
- Structural metadata research in the ears program
- presented at the paper
- Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendort, M. Tomalin, P. I. Woodland, and M. Harper, "Structural metadata research in the ears program," presented at the ICASSP, invited paper, 2005, pp. 957-960.
- (2005) ICASSP, invited , pp. 957-960
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³ Peskin, B.⁴ Ang, J.⁵ Hillard, D.⁶ Ostendort, M.⁷ Tomalin, M.⁸ Woodland, P.I.⁹ Harper, M.¹⁰

30
- 85128394891
- Automatic detection of sentence boundaries and disfluencies based on recognized words
- A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, G. Tur, and Y. Lu, "Automatic detection of sentence boundaries and disfluencies based on recognized words," in Proc. Int. Conf. Spoken Lang. Process., 1998, pp. 2247-2250.
- (1998) Proc. Int. Conf. Spoken Lang. Process , pp. 2247-2250
- Stolcke, A.¹ Shriberg, E.² Bates, R.³ Ostendorf, M.⁴ Hakkani, D.⁵ Plauche, M.⁶ Tur, G.⁷ Lu, Y.⁸

31
- 0002629270
- Maximum-likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum-likelihood from incomplete data via the EM algorithm," J. R. Statist Soc. B, pp. 1-39, 1977.
- (1977) J. R. Statist Soc. B , pp. 1-39
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

32
- 80054087185
- Log-linear models for word alignment
- Y. Liu, Q. Liu, and S. Lin, "Log-linear models for word alignment," in Proc. 43rd Annu. Meeting Assoc. Comput. Ling., 2005, pp. 459-466.
- (2005) Proc. 43rd Annu. Meeting Assoc. Comput. Ling , pp. 459-466
- Liu, Y.¹ Liu, Q.² Lin, S.³

33
- 0029765629
- Statistical language modeling for speech disfluencies
- A. Stolcke and E. Shriberg, "Statistical language modeling for speech disfluencies," in Proc. ICASSP, vol. 1, 1996, pp. 405-408.
- (1996) Proc. ICASSP , vol.1 , pp. 405-408
- Stolcke, A.¹ Shriberg, E.²

34
- 34047265603
- S. F. Chen and J. Goodman, An empirical study of smoothing techniques for language modeling, Center Res. Comput. Technol., Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.
- S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Center Res. Comput. Technol., Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.

35
- 0033874696
- Algorithms for statistical translation of spoken language
- Jan
- H. Ney, S. Niessen, F. J. Och, H. Sawaf, C. Tilhnmm, and S. Vogel, "Algorithms for statistical translation of spoken language," IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 24-36, Jan. 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.1 , pp. 24-36
- Ney, H.¹ Niessen, S.² Och, F.J.³ Sawaf, H.⁴ Tilhnmm, C.⁵ Vogel, S.⁶

36
- 85146676791
- Verb semantics and lexical selection
- Z. Wu and M. Palmer, "Verb semantics and lexical selection," in Proc. 32nd ACL, 1994, pp. 133-138.
- (1994) Proc. 32nd ACL , pp. 133-138
- Wu, Z.¹ Palmer, M.²

37
- 34047253517
- Academia Sinica, CKIP Tech. Rep.-01
- S.-C. Tseng and Y.-F. Liu, "Annotation of Mandarin Conversational Dialogue Corpus," Academia Sinica, CKIP Tech. Rep.-01, 2002.
- (2002) Annotation of Mandarin Conversational Dialogue Corpus
- Tseng, S.-C.¹ Liu, Y.-F.²

38
- 84907336951
- An efficient repair procedure for quick transcriptions
- Jeju Island, Korea, Oct
- A. Venkataraman, A. Stolcke, W. Wang, D. Vergyri, V. R. R. Gadde, and J. Zheng, "An efficient repair procedure for quick transcriptions," in Proc. Int. Conf. Spoken Lang. Process., Jeju Island, Korea, Oct. 2004, pp. 1961-1964.
- (2004) Proc. Int. Conf. Spoken Lang. Process , pp. 1961-1964
- Venkataraman, A.¹ Stolcke, A.² Wang, W.³ Vergyri, D.⁴ Gadde, V.R.R.⁵ Zheng, J.⁶

39
- 0003822743
- Cambridge, U.K: Cambridge Univ. Press
- S. J. Young, G. Evermann, T. Hain, D. Kershaw, G. L. Moore, J. J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK Book. Cambridge, U.K: Cambridge Univ. Press, 2003.
- (2003) The HTK Book
- Young, S.J.¹ Evermann, G.² Hain, T.³ Kershaw, D.⁴ Moore, G.L.⁵ Odell, J.J.⁶ Ollason, D.⁷ Povey, D.⁸ Valtchev, V.⁹ Woodland, P.C.¹⁰

40
- 34047261551
- Online, Available
- MAT Speech Database - TCC-300 [Online]. Available: http://rocling.iis. sinica.edu.tw/ROCLING/MAT/Tcc_300brief.htm
- MAT Speech Database - TCC-300

41
- 36749015898
- Online, Available
- Rich Transcription (RT-04F) Evaluation Plan (2004). [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/docs/rt04f-eval-plan-vl4.doc
- (2004) Rich Transcription (RT-04F) Evaluation Plan

42
- 85123721100
- Important and new features with analysis for disfluency interruption point (IP) detection in spontaneous mandarin speech
- C.-K. Lin, S.-C. Tseng, and L.-S. Lee, "Important and new features with analysis for disfluency interruption point (IP) detection in spontaneous mandarin speech," in Proc. DiSS, 2005, pp. 117-121.
- (2005) Proc. DiSS , pp. 117-121
- Lin, C.-K.¹ Tseng, S.-C.² Lee, L.-S.³

43
- 33646818116
- RT-S: Surface rich transcription scoring, methodology, and initial results
- M. Snover, R. Schwartz, B. Dorr, and J. Makhoul, "RT-S: Surface rich transcription scoring, methodology, and initial results," in Proc. DARPA Rich Transcription Workshop, 2004.
- (2004) Proc. DARPA Rich Transcription Workshop
- Snover, M.¹ Schwartz, R.² Dorr, B.³ Makhoul, J.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.