SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 17, Issue 7, 2009, Pages 1263-1278

Improved features and models for detecting edit disfluencies in transcribing spontaneous Mandarin speech

(2) Lin, Che Kuang (a); Lee, Lin Shan (a)

(a) National Taiwan University (Taiwan)

Author keywords

Edit disfluency; Interruption point detection; Prosody; Speech recognition; Spontaneous speech

Indexed keywords

CHINESE CHARACTERS; CONDITIONAL RANDOM FIELD; DISFLUENCIES; EDIT DISFLUENCY; IMPROVED MODELS; INTERRUPTION POINT DETECTION; MAXIMUM ENTROPY; PROBABILISTIC FRAMEWORK; PROSODIC FEATURES; PROSODIC MODELING; PROSODIC STATE; PROSODY; RECOGNITION ACCURACY; SPEECH CORPORA; SPEECH PROSODY; SPONTANEOUS SPEECH;

DECISION TREES; TRANSCRIPTION;

SPEECH RECOGNITION;

EID: 68549130583 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2009.2014792 Document Type: Article

Times cited : (20)

References (41)

1
- 85083514474
- Parsing conversational speech using enhanced segmentation
- J. G. Kahn, M. Ostendorf, and C. Chelba, "Parsing conversational speech using enhanced segmentation," in Proc. HLT/NAACL, 2004, pp. 121-128.
- (2004) Proc. HLT/NAACL , pp. 121-128
- Kahn, J.G.¹ Ostendorf, M.² Chelba, C.³

2
- 68549138413
- S. Strassel, Simple metadata annotation specification V6.2, Linguistic Data Consortium, 2004 [Online]. Available: http://www.ldc.upenn.edu/Projects/MDE/Guidelines/SimpleMDE V6.2.pdf
- S. Strassel, "Simple metadata annotation specification V6.2," Linguistic Data Consortium, 2004 [Online]. Available: http://www.ldc.upenn.edu/Projects/MDE/Guidelines/SimpleMDE V6.2.pdf

3
- 34047253517
- Academia Sinica, CKIP Tech. Rep.-01
- S.-C. Tseng and Y.-F. Liu, "Annotation of Mandarin Conversational Dialogue Corpus," Academia Sinica, CKIP Tech. Rep.-01, 2002.
- (2002) Annotation of Mandarin Conversational Dialogue Corpus
- Tseng, S.-C.¹ Liu, Y.-F.²

4
- 24144462548
- Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese
- S. Furui, M. Nakamura, T. Ichiba, and K. Iwano, "Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese," Speech Commun., vol. 47, pp. 208-219, 2005.
- (2005) Speech Commun , vol.47 , pp. 208-219
- Furui, S.¹ Nakamura, M.² Ichiba, T.³ Iwano, K.⁴

5
- 33646798740
- The IBM 2004 conversational telephony system for rich transcription
- H. Soltau, B. Kingsbury, L. Mangu, D. Povey, G. Saon, and G. Zweig, "The IBM 2004 conversational telephony system for rich transcription," in Proc. IEEE ICASSP, 2005, pp. 205-208.
- (2005) Proc. IEEE ICASSP , pp. 205-208
- Soltau, H.¹ Kingsbury, B.² Mangu, L.³ Povey, D.⁴ Saon, G.⁵ Zweig, G.⁶

6
- 27744599401
- Automatic transcription of conversational telephone speech
- Nov
- T. Hain, P. C. Woodland, G. Evermann, M. J. F. Gales, X. Liu, G. L. Moore, D. Povey, and L. Wang, "Automatic transcription of conversational telephone speech," IEEE Trans. Speech Audio Process., vol. 13, no. 6, pp. 1173-1185, Nov. 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.6 , pp. 1173-1185
- Hain, T.¹ Woodland, P.C.² Evermann, G.³ Gales, M.J.F.⁴ Liu, X.⁵ Moore, G.L.⁶ Povey, D.⁷ Wang, L.⁸

7
- 34047266609
- Multistage speaker diarization of broadcast news
- Sep
- C. Barras, X. Zhu, S. Meignier, and J.-L. Gauvain, "Multistage speaker diarization of broadcast news," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1505-1512, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1505-1512
- Barras, C.¹ Zhu, X.² Meignier, S.³ Gauvain, J.-L.⁴

8
- 34047261805
- An overview of automatic speaker diarization systems
- Sep
- S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1557-1565
- Tranter, S.E.¹ Reynolds, D.A.²

9
- 34047266607
- Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
- Sep
- Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper, "Enriching speech recognition with automatic detection of sentence boundaries and disfluencies," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1526-1540, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1526-1540
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³ Hillard, D.⁴ Ostendorf, M.⁵ Harper, M.⁶

10
- 34047271072
- Recognizing disfluencies in conversational speech
- Sep
- M. Lease, M. Johnson, and E. Charniak, "Recognizing disfluencies in conversational speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1566-1573, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1566-1573
- Lease, M.¹ Johnson, M.² Charniak, E.³

11
- 34047266604
- Edit disfluency detection and correction using a cleanup language model and an alignment model
- Sep
- J.-F. Yeh and C.-H. Wu, "Edit disfluency detection and correction using a cleanup language model and an alignment model," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1574-1583, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process , vol.14 , Issue.5 , pp. 1574-1583
- Yeh, J.-F.¹ Wu, C.-H.²

12
- 0040958578
- Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue
- P. Heeman and J. Allen, "Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue," Comput. Linguist., vol. 25, pp. 527-571, 1999.
- (1999) Comput. Linguist , vol.25 , pp. 527-571
- Heeman, P.¹ Allen, J.²

13
- 85120620835
- Edit detection and parsing for transcribed speech
- E. Charniak and M. Johnson, "Edit detection and parsing for transcribed speech," in Proc. NAACL, 2001, pp. 118-126.
- (2001) Proc. NAACL , pp. 118-126
- Charniak, E.¹ Johnson, M.²

14
- 57849131781
- A TAG-based noisy channel model of speech repairs
- M. Johnson and E. Charniak, "A TAG-based noisy channel model of speech repairs," in Proc. ACL, 2004.
- (2004) Proc. ACL
- Johnson, M.¹ Charniak, E.²

15
- 33646762857
- Automatic disfluency removal on recognized spontaneous speech - Rapid adaptation to speaker dependent disfluencies
- M. Honal and T. Schultz, "Automatic disfluency removal on recognized spontaneous speech - Rapid adaptation to speaker dependent disfluencies," in Proc. ICASSP, 2005, pp. 969-972.
- (2005) Proc. ICASSP , pp. 969-972
- Honal, M.¹ Schultz, T.²

16
- 56149102222
- Corrections of disfluencies in spontaneous speech using a noisy channel approach
- M. Honal and T. Schultz, "Corrections of disfluencies in spontaneous speech using a noisy channel approach," in Proc. Eurospeech, 2003, pp. 2781-2784.
- (2003) Proc. Eurospeech , pp. 2781-2784
- Honal, M.¹ Schultz, T.²

17
- 0028215480
- A corpus-based study of repair cues in spontaneous speech
- C. Nakatani and J. Hirschberg, "A corpus-based study of repair cues in spontaneous speech," J. Acoust. Soc. Amer., pp. 1603-1616, 1994.
- (1994) J. Acoust. Soc. Amer , pp. 1603-1616
- Nakatani, C.¹ Hirschberg, J.²

18
- 0000703860
- Phonetic consequences of speech disfluency
- E. Shriberg, "Phonetic consequences of speech disfluency," in Proc. Int. Conf. Phonetics Sci., 1999, pp. 619-622.
- (1999) Proc. Int. Conf. Phonetics Sci , pp. 619-622
- Shriberg, E.¹

19
- 0030351630
- Juncture cues to disfluency
- R. Lickley, "Juncture cues to disfluency," in Proc. ICSLP, 1996, pp. 2478-2481.
- (1996) Proc. ICSLP , pp. 2478-2481
- Lickley, R.¹

20
- 84878523744
- Prosodic features of four types of disfluencies
- G. Savova and J. Bachenko, "Prosodic features of four types of disfluencies," in Proc. DiSS, 2003, pp. 91-94.
- (2003) Proc. DiSS , pp. 91-94
- Savova, G.¹ Bachenko, J.²

21
- 0010125082
- A prosody-only decision-tree model for disfluency detection
- E. Shriberg and A. Stolcke, "A prosody-only decision-tree model for disfluency detection," in Proc. Eurospeech, 1997, pp. 2383-2386.
- (1997) Proc. Eurospeech , pp. 2383-2386
- Shriberg, E.¹ Stolcke, A.²

22
- 0034275920
- Prosody-based automatic segmentation of speech into sentences and topics
- E. Shriberg, A. Stolcke, D. Hakkani-Tur, and G. Tur, "Prosody-based automatic segmentation of speech into sentences and topics," Speech Commun., pp. 127-154, 2000.
- (2000) Speech Commun , pp. 127-154
- Shriberg, E.¹ Stolcke, A.² Hakkani-Tur, D.³ Tur, G.⁴

23
- 33745191849
- Comparing HMM, maximum entropy, and conditional random fields for disfluency detection
- Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, "Comparing HMM, maximum entropy, and conditional random fields for disfluency detection," in Proc. Eurospeech, 2005, pp. 3313-3316.
- (2005) Proc. Eurospeech , pp. 3313-3316
- Liu, Y.¹ Stolcke, A.² Shriberg, E.³ Harper, M.⁴

24
- 33646800879
- Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, Structural metadata research in the EARS program, presented at the ICASSP, 2005, pp. 957-960, unpublished.
- Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Structural metadata research in the EARS program," presented at the ICASSP, 2005, pp. 957-960, unpublished.

25
- 85009223733
- Automatic disfluency identification in conversational speech using multiple knowledge sources
- Y. Liu, E. Shriberg, and A. Stolcke, "Automatic disfluency identification in conversational speech using multiple knowledge sources," in Proc. Eurospeech, 2003, pp. 957-960.
- (2003) Proc. Eurospeech , pp. 957-960
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³

26
- 85009142186
- Using machine learning to cope with imbalanced classes in natural speech: Evidence from sentence boundary and disfluency detection
- Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Using machine learning to cope with imbalanced classes in natural speech: Evidence from sentence boundary and disfluency detection," in Proc. ICSLP, 2004, pp. 1525-1528.
- (2004) Proc. ICSLP , pp. 1525-1528
- Liu, Y.¹ Shriberg, E.² Stolcke, A.³ Harper, M.⁴

27
- 33646764337
- A lexically-driven algorithm for disfluency detection
- M. Snover, B. Dorr, and R. Schwartz, "A lexically-driven algorithm for disfluency detection," in Proc. HLT/NAACL, 2004, pp. 157-160.
- (2004) Proc. HLT/NAACL , pp. 157-160
- Snover, M.¹ Dorr, B.² Schwartz, R.³

28
- 33646819463
- Detecting structural metadata with decision trees and transformation-based learning
- J. Kim, S. Schwarm, and M. Ostendorf, "Detecting structural metadata with decision trees and transformation-based learning," in Proc. HLT/NAACL, 2004.
- (2004) Proc. HLT/NAACL
- Kim, J.¹ Schwarm, S.² Ostendorf, M.³

29
- 0002652285
- A maximum entropy approach to natural language processing
- A. L. Berger, S. A. Della Pietra, and V. J. Della Pietra, "A maximum entropy approach to natural language processing," Comput. Linguist., vol. 22, pp. 39-72, 1996.
- (1996) Comput. Linguist , vol.22 , pp. 39-72
- Berger, A.L.¹ Della Pietra, S.A.² Della Pietra, V.J.³

30
- 0000732463
- A limited memory algorithm for bound constrained optimization
- R. H. Byrd, P. Lu, and J. Nocedal, "A limited memory algorithm for bound constrained optimization," SIAM J. Sci. Statist. Comput., vol. 16, no. 5, pp. 1190-1208, 1995.
- (1995) SIAM J. Sci. Statist. Comput , vol.16 , Issue.5 , pp. 1190-1208
- Byrd, R.H.¹ Lu, P.² Nocedal, J.³

31
- 0004014502
- A Gaussian Prior for Smoothing Maximum Entropy Models
- Tech. Rep
- S. Chen and R. Rosenfeld, "A Gaussian Prior for Smoothing Maximum Entropy Models," Carnegie Mellon Univ., Pittsburgh, PA, Tech. Rep., 1999.
- (1999)
- Chen, S.¹ Rosenfeld, R.²

32
- 0032346848
- Bayesian CART model search
- H. Chipman, E. I. George, and R. E. McCulloch, "Bayesian CART model search," J. Amer. Statist. Assoc., vol. 93, no. 443, pp. 935-947, 1998.
- (1998) J. Amer. Statist. Assoc , vol.93 , Issue.443 , pp. 935-947
- Chipman, H.¹ George, E.I.² McCulloch, R.E.³

33
- 21844474040
- Fluent speech prosody: Framework and modeling
- July, Special Issue on Quantitative Prosody Modeling for Natural Speech Description and Generation
- C.-Y. Tseng, S.-H. Pin, Y.-L. Lee, H.-M. Wang, and Y.-C. Chen, "Fluent speech prosody: Framework and modeling," Speech Commun., vol. 46, no. 3-4, pp. 284-309, July 2005, Special Issue on Quantitative Prosody Modeling for Natural Speech Description and Generation.
- (2005) Speech Commun , vol.46 , Issue.3-4 , pp. 284-309
- Tseng, C.-Y.¹ Pin, S.-H.² Lee, Y.-L.³ Wang, H.-M.⁴ Chen, Y.-C.⁵

34
- 11244330002
- Probabilistic latent semantic analysis
- T. Hofmann, "Probabilistic latent semantic analysis," in Uncertainty Artif. Intell., 1999, pp. 289-296.
- (1999) Uncertainty Artif. Intell , pp. 289-296
- Hofmann, T.¹

35
- 40349098806
- Learning the threshold in hierarchical agglomerative clustering
- K. Daniels and C. Giraud-Carrier, "Learning the threshold in hierarchical agglomerative clustering," in Proc. ICMLA, 2006, pp. 270-278.
- (2006) Proc. ICMLA , pp. 270-278
- Daniels, K.¹ Giraud-Carrier, C.²

36
- 0003710380
- Online, Available
- C.-C. Chang and C.-J. Lin, "LIBSVM: A library for support vector machines," 2001 [Online]. Available: www.csie.ntu.edu.tw/~cjlin/libsvm.
- (2001) LIBSVM: A library for support vector machines
- Chang, C.-C.¹ Lin, C.-J.²

37
- 0142192295
- Conditional random fields: Probabilistic models for segmenting and labeling sequence data
- J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proc. ICML, 2001, pp. 282-289.
- (2001) Proc. ICML , pp. 282-289
- Lafferty, J.¹ McCallum, A.² Pereira, F.³

38
- 68549091556
- S.-C. Tseng, Processing spoken Mandarin corpora, Traitement Automatique des Langues, vol. 45, no. 2, pp. 89-108, Special Issue: Spoken Corpus Processing.
- S.-C. Tseng, "Processing spoken Mandarin corpora," Traitement Automatique des Langues, vol. 45, no. 2, pp. 89-108, Special Issue: Spoken Corpus Processing.

39
- 33947691278
- Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)
- Y.-C. Hsieh, Y.-T. Huang, C.-C. Wang, and L.-S. Lee, "Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)," in Proc. ICASSP, 2006, pp. 961-964.
- (2006) Proc. ICASSP , pp. 961-964
- Hsieh, Y.-C.¹ Huang, Y.-T.² Wang, C.-C.³ Lee, L.-S.⁴

40
- 68549116144
- Rich Transcription (RT-04F)
- Evaluation Plan, Online, Available
- "Rich Transcription (RT-04F)," Evaluation Plan 2004 [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/docs/rt04f-eval-plan-v14.doc
- (2004)

41
- 0001884644
- Individual comparisons by ranking methods
- F. Wilcoxon, "Individual comparisons by ranking methods," Biometrics, vol. 1, pp. 80-83, 1945.
- (1945) Biometrics , vol.1 , pp. 80-83
- Wilcoxon, F.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.