-
1
-
-
3042820894
-
Automatic recognition of spontaneous speech for access to multilingual oral history archives
-
Jul
-
W. Byrne, D. Doermann, M. Franz, S. Gustman, J. Hajic, D. Oard, M. Picheny, J. Psutka, B. Ramabhadran, D. Soergel, T. Ward, and Z. Wei-Jin, "Automatic recognition of spontaneous speech for access to multilingual oral history archives," IEEE Trans. Speech Audio Process., vol. 12, no. 4, pp. 420-435, Jul. 2004.
-
(2004)
IEEE Trans. Speech Audio Process
, vol.12
, Issue.4
, pp. 420-435
-
-
Byrne, W.1
Doermann, D.2
Franz, M.3
Gustman, S.4
Hajic, J.5
Oard, D.6
Picheny, M.7
Psutka, J.8
Ramabhadran, B.9
Soergel, D.10
Ward, T.11
Wei-Jin, Z.12
-
2
-
-
85009168601
-
Measuring the readability of automatic speech-to-text transcripts
-
D. Jones, F. Wolf, E. Gibson, E. Williams, E. Fedorenko, D. Reynods, and M. Zissman, "Measuring the readability of automatic speech-to-text transcripts," in Proc. Eurospeech, 2003, pp. 1585-1588.
-
(2003)
Proc. Eurospeech
, pp. 1585-1588
-
-
Jones, D.1
Wolf, F.2
Gibson, E.3
Williams, E.4
Fedorenko, E.5
Reynods, D.6
Zissman, M.7
-
3
-
-
33646809491
-
Structural event detection for rich transcription of speech,
-
Ph.D. dissertation, Purdue Univ, West Lafayette, IN
-
Y. Liu, "Structural event detection for rich transcription of speech," Ph.D. dissertation, Purdue Univ., West Lafayette, IN, 2004.
-
(2004)
-
-
Liu, Y.1
-
4
-
-
0003798906
-
-
Ph.D, Dept. Psychol, Univ. California, Berkeley
-
E. Shriberg, "Preliminaries to a theory of speech disfluencies," Ph.D., Dept. Psychol., Univ. California, Berkeley, 1994.
-
(1994)
Preliminaries to a theory of speech disfluencies
-
-
Shriberg, E.1
-
5
-
-
33646786242
-
-
Version 6.2. Linguistic Data Consortium, Online, Available
-
S. Strassel. (2004) Simple Metadata Annotation Specification Version 6.2. Linguistic Data Consortium. [Online]. Available: http://www.ldc.upenn.edu/ Projects/MDE
-
(2004)
Simple Metadata Annotation Specification
-
-
Strassel, S.1
-
6
-
-
18744415719
-
Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system
-
May
-
C.-H. Wu and G.-L. Yan, "Speech act modeling and verification of spontaneous speech with disfluency in a spoken dialogue system," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 330-344, May 2005.
-
(2005)
IEEE Trans. Speech Audio Process
, vol.13
, Issue.3
, pp. 330-344
-
-
Wu, C.-H.1
Yan, G.-L.2
-
7
-
-
0034275920
-
Prosody-based automatic segmentation of speech into sentences and topics
-
E. Shriberg, A. Stolcke, D. Hakkani-Tur, and G. Tur, "Prosody-based automatic segmentation of speech into sentences and topics," Speech Commun., vol. 32, no. 1-2, pp. 127-154, 2000.
-
(2000)
Speech Commun
, vol.32
, Issue.1-2
, pp. 127-154
-
-
Shriberg, E.1
Stolcke, A.2
Hakkani-Tur, D.3
Tur, G.4
-
8
-
-
85059598545
-
Integrating multiple knowledge sources for detecting and correction of repairs in human computer dialog
-
J. Bear, J. Dowding, and E. Shriberg, "Integrating multiple knowledge sources for detecting and correction of repairs in human computer dialog," in Proc. ACL, 1992, pp. 56-63.
-
(1992)
Proc. ACL
, pp. 56-63
-
-
Bear, J.1
Dowding, J.2
Shriberg, E.3
-
9
-
-
21844454996
-
Modeling prosodic feature sequences for speaker recognition
-
E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition," Speech Commun., pp. 455-472, 2005.
-
(2005)
Speech Commun
, pp. 455-472
-
-
Shriberg, E.1
Ferrer, L.2
Kajarekar, S.3
Venkataraman, A.4
Stolcke, A.5
-
10
-
-
84878523744
-
Prosodic features of four types of disfluencies
-
G. Savova and J. Bachenko, "Prosodic features of four types of disfluencies," in Proc. DiSS, 2003, pp. 91-94.
-
(2003)
Proc. DiSS
, pp. 91-94
-
-
Savova, G.1
Bachenko, J.2
-
11
-
-
33646798740
-
The IBM2004 conversational telephony system forrichtranscription
-
H. Soltau, B. Kingsbury, L. Mangu, D. Povey, G. Saon, and G. Zweig, "The IBM2004 conversational telephony system forrichtranscription, "inProc. IEEE Int. Conf. Acoustics, Speech, Signal Process.,2005, pp. 205-208.
-
(2005)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process
, pp. 205-208
-
-
Soltau, H.1
Kingsbury, B.2
Mangu, L.3
Povey, D.4
Saon, G.5
Zweig, G.6
-
12
-
-
33646819463
-
Detecting structural metadata with decision trees and transformation-based learning
-
J. Kim, S. E. Schwarm, and M. Ostendorf, "Detecting structural metadata with decision trees and transformation-based learning," in Proc. HLT/NAACL, 2004, pp. 137-144.
-
(2004)
Proc. HLT/NAACL
, pp. 137-144
-
-
Kim, J.1
Schwarm, S.E.2
Ostendorf, M.3
-
13
-
-
24144462548
-
Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese
-
S. Furui, M. Nakamura, T. Ichiba, and K. Iwano, "Analysis and recognition of spontaneous speech using corpus of spontaneous Japanese," Speech Commun., vol. 47, pp. 208-219, 2005.
-
(2005)
Speech Commun
, vol.47
, pp. 208-219
-
-
Furui, S.1
Nakamura, M.2
Ichiba, T.3
Iwano, K.4
-
14
-
-
85120620835
-
Edit detection and parsing for transcribed speech
-
E. Charniak and M. Johnson, "Edit detection and parsing for transcribed speech," in Proc. NAACL, 2001, pp. 118-126.
-
(2001)
Proc. NAACL
, pp. 118-126
-
-
Charniak, E.1
Johnson, M.2
-
15
-
-
57849131781
-
A tag-based noisy channel model of speech repairs
-
M. Johnson and E. Charniak, "A tag-based noisy channel model of speech repairs," in Proc. ACL, 2004, pp. 33-39.
-
(2004)
Proc. ACL
, pp. 33-39
-
-
Johnson, M.1
Charniak, E.2
-
16
-
-
33646756210
-
Parsing and its applications for conversational speech
-
M. Lease, E. Charniak, and M. Johnson, "Parsing and its applications for conversational speech," in Proc. ICASSP, 2005, pp. 961-964.
-
(2005)
Proc. ICASSP
, pp. 961-964
-
-
Lease, M.1
Charniak, E.2
Johnson, M.3
-
18
-
-
0040958578
-
Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue
-
P. Heeman and J. Allen, "Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue," Comput. Ling., vol. 25, pp. 527-571, 1999.
-
(1999)
Comput. Ling
, vol.25
, pp. 527-571
-
-
Heeman, P.1
Allen, J.2
-
19
-
-
0030351559
-
Combining the detection and correction of speech repairs
-
Oct
-
P. A. Heeman, K. Loken-Kim, and J. F. Allen, "Combining the detection and correction of speech repairs," in Proc. 4th Int. Conf. Spoken Lang. Process., Oct. 1996, pp. 358-361.
-
(1996)
Proc. 4th Int. Conf. Spoken Lang. Process
, pp. 358-361
-
-
Heeman, P.A.1
Loken-Kim, K.2
Allen, J.F.3
-
20
-
-
33646762857
-
Automatic disfluency removal on recognized spontaneous speech-Rapid adaptation to speaker dependent dislfuencies
-
M. Honal and T. Schultz, "Automatic disfluency removal on recognized spontaneous speech-Rapid adaptation to speaker dependent dislfuencies," in Proc. ICASSP, 2005, pp. 969-972.
-
(2005)
Proc. ICASSP
, pp. 969-972
-
-
Honal, M.1
Schultz, T.2
-
21
-
-
56149102222
-
Corrections of disfluencies in spontaneous speech using a noisy-channel approach
-
_, "Corrections of disfluencies in spontaneous speech using a noisy-channel approach," in Proc. Eurospeech, 2003, pp. 2781-2784.
-
(2003)
Proc. Eurospeech
, pp. 2781-2784
-
-
Honal, M.1
Schultz, T.2
-
22
-
-
33646764337
-
A lexically-driven algorithm for disfluency detection
-
M. Snover, B. Dorr, and R. Schwartz, "A lexically-driven algorithm for disfluency detection," in Proc. HLT/NAACL, 2004, pp. 157-160.
-
(2004)
Proc. HLT/NAACL
, pp. 157-160
-
-
Snover, M.1
Dorr, B.2
Schwartz, R.3
-
23
-
-
27744599401
-
Automatic transcription of conversational telephone speech
-
Nov
-
T. Hain, P. C. Woodland, G. Evermann, M. J. F. Gales, X. Liu, G. L. Moore, D. Povey, and L. Wang, "Automatic transcription of conversational telephone speech," IEEE Trans. Speech Audio Process., vol. 13, no. 6, pp. 1173-1185, Nov. 2005.
-
(2005)
IEEE Trans. Speech Audio Process
, vol.13
, Issue.6
, pp. 1173-1185
-
-
Hain, T.1
Woodland, P.C.2
Evermann, G.3
Gales, M.J.F.4
Liu, X.5
Moore, G.L.6
Povey, D.7
Wang, L.8
-
24
-
-
4544275060
-
The 2003 ISL rich transcription system for conversational telephony speech
-
H. Soltau, H. Yu, F. Metze, C. Fugen, J. Qin, and S.-C. Jou, "The 2003 ISL rich transcription system for conversational telephony speech," in Proc. Acoust., Speech, Signal Process., 2004, pp. 17-21.
-
(2004)
Proc. Acoust., Speech, Signal Process
, pp. 17-21
-
-
Soltau, H.1
Yu, H.2
Metze, F.3
Fugen, C.4
Qin, J.5
Jou, S.-C.6
-
25
-
-
48749092718
-
Final report on parsing and spoken structural event detection
-
M. Harper, B. J. Dorr, J. Hale, B. Roark, I. Shafran, M. Lease, Y. Liu, M. Snover, L. Yung, A. Krasnyanskaya, and R. Stewart, "Final report on parsing and spoken structural event detection," in Proc. Johns Hopkins Summer Workshop, 2005, pp. 1-116.
-
(2005)
Proc. Johns Hopkins Summer Workshop
, pp. 1-116
-
-
Harper, M.1
Dorr, B.J.2
Hale, J.3
Roark, B.4
Shafran, I.5
Lease, M.6
Liu, Y.7
Snover, M.8
Yung, L.9
Krasnyanskaya, A.10
Stewart, R.11
-
26
-
-
44849094956
-
Using conditional random fields for sentence boundary detection in speech
-
Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, "Using conditional random fields for sentence boundary detection in speech," in Proc. 43nd Annu. Meeting Assoc. Computat. Ling., 2005, pp. 451-458.
-
(2005)
Proc. 43nd Annu. Meeting Assoc. Computat. Ling
, pp. 451-458
-
-
Liu, Y.1
Stolcke, A.2
Shriberg, E.3
Harper, M.4
-
27
-
-
85009291541
-
Maximum entropy model for punctuation annotation from speech
-
J. Huang and G. Zweig, "Maximum entropy model for punctuation annotation from speech," in Proc. ICSLP, pp. 917-920.
-
Proc. ICSLP
, pp. 917-920
-
-
Huang, J.1
Zweig, G.2
-
28
-
-
33745191849
-
Comparing HMM, maximum entropy, and conditional random fields for disfluency detection
-
Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Comparing HMM, maximum entropy, and conditional random fields for disfluency detection," in Proc. Eurospeech, 2005, pp. 3313-3316.
-
(2005)
Proc. Eurospeech
, pp. 3313-3316
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
Harper, M.4
-
29
-
-
33646800879
-
Structural metadata research in the ears program
-
presented at the paper
-
Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendort, M. Tomalin, P. I. Woodland, and M. Harper, "Structural metadata research in the ears program," presented at the ICASSP, invited paper, 2005, pp. 957-960.
-
(2005)
ICASSP, invited
, pp. 957-960
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
Peskin, B.4
Ang, J.5
Hillard, D.6
Ostendort, M.7
Tomalin, M.8
Woodland, P.I.9
Harper, M.10
-
30
-
-
85128394891
-
Automatic detection of sentence boundaries and disfluencies based on recognized words
-
A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, G. Tur, and Y. Lu, "Automatic detection of sentence boundaries and disfluencies based on recognized words," in Proc. Int. Conf. Spoken Lang. Process., 1998, pp. 2247-2250.
-
(1998)
Proc. Int. Conf. Spoken Lang. Process
, pp. 2247-2250
-
-
Stolcke, A.1
Shriberg, E.2
Bates, R.3
Ostendorf, M.4
Hakkani, D.5
Plauche, M.6
Tur, G.7
Lu, Y.8
-
31
-
-
0002629270
-
Maximum-likelihood from incomplete data via the EM algorithm
-
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum-likelihood from incomplete data via the EM algorithm," J. R. Statist Soc. B, pp. 1-39, 1977.
-
(1977)
J. R. Statist Soc. B
, pp. 1-39
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
32
-
-
80054087185
-
Log-linear models for word alignment
-
Y. Liu, Q. Liu, and S. Lin, "Log-linear models for word alignment," in Proc. 43rd Annu. Meeting Assoc. Comput. Ling., 2005, pp. 459-466.
-
(2005)
Proc. 43rd Annu. Meeting Assoc. Comput. Ling
, pp. 459-466
-
-
Liu, Y.1
Liu, Q.2
Lin, S.3
-
33
-
-
0029765629
-
Statistical language modeling for speech disfluencies
-
A. Stolcke and E. Shriberg, "Statistical language modeling for speech disfluencies," in Proc. ICASSP, vol. 1, 1996, pp. 405-408.
-
(1996)
Proc. ICASSP
, vol.1
, pp. 405-408
-
-
Stolcke, A.1
Shriberg, E.2
-
34
-
-
34047265603
-
-
S. F. Chen and J. Goodman, An empirical study of smoothing techniques for language modeling, Center Res. Comput. Technol., Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.
-
S. F. Chen and J. Goodman, "An empirical study of smoothing techniques for language modeling," Center Res. Comput. Technol., Harvard Univ., Cambridge, MA, Tech. Rep. TR-10-98, 1998.
-
-
-
-
35
-
-
0033874696
-
Algorithms for statistical translation of spoken language
-
Jan
-
H. Ney, S. Niessen, F. J. Och, H. Sawaf, C. Tilhnmm, and S. Vogel, "Algorithms for statistical translation of spoken language," IEEE Trans. Speech Audio Process., vol. 8, no. 1, pp. 24-36, Jan. 2000.
-
(2000)
IEEE Trans. Speech Audio Process
, vol.8
, Issue.1
, pp. 24-36
-
-
Ney, H.1
Niessen, S.2
Och, F.J.3
Sawaf, H.4
Tilhnmm, C.5
Vogel, S.6
-
36
-
-
85146676791
-
Verb semantics and lexical selection
-
Z. Wu and M. Palmer, "Verb semantics and lexical selection," in Proc. 32nd ACL, 1994, pp. 133-138.
-
(1994)
Proc. 32nd ACL
, pp. 133-138
-
-
Wu, Z.1
Palmer, M.2
-
38
-
-
84907336951
-
An efficient repair procedure for quick transcriptions
-
Jeju Island, Korea, Oct
-
A. Venkataraman, A. Stolcke, W. Wang, D. Vergyri, V. R. R. Gadde, and J. Zheng, "An efficient repair procedure for quick transcriptions," in Proc. Int. Conf. Spoken Lang. Process., Jeju Island, Korea, Oct. 2004, pp. 1961-1964.
-
(2004)
Proc. Int. Conf. Spoken Lang. Process
, pp. 1961-1964
-
-
Venkataraman, A.1
Stolcke, A.2
Wang, W.3
Vergyri, D.4
Gadde, V.R.R.5
Zheng, J.6
-
39
-
-
0003822743
-
-
Cambridge, U.K: Cambridge Univ. Press
-
S. J. Young, G. Evermann, T. Hain, D. Kershaw, G. L. Moore, J. J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. C. Woodland, The HTK Book. Cambridge, U.K: Cambridge Univ. Press, 2003.
-
(2003)
The HTK Book
-
-
Young, S.J.1
Evermann, G.2
Hain, T.3
Kershaw, D.4
Moore, G.L.5
Odell, J.J.6
Ollason, D.7
Povey, D.8
Valtchev, V.9
Woodland, P.C.10
-
40
-
-
34047261551
-
-
Online, Available
-
MAT Speech Database - TCC-300 [Online]. Available: http://rocling.iis. sinica.edu.tw/ROCLING/MAT/Tcc_300brief.htm
-
MAT Speech Database - TCC-300
-
-
-
41
-
-
36749015898
-
-
Online, Available
-
Rich Transcription (RT-04F) Evaluation Plan (2004). [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/docs/rt04f-eval-plan-vl4.doc
-
(2004)
Rich Transcription (RT-04F) Evaluation Plan
-
-
-
42
-
-
85123721100
-
Important and new features with analysis for disfluency interruption point (IP) detection in spontaneous mandarin speech
-
C.-K. Lin, S.-C. Tseng, and L.-S. Lee, "Important and new features with analysis for disfluency interruption point (IP) detection in spontaneous mandarin speech," in Proc. DiSS, 2005, pp. 117-121.
-
(2005)
Proc. DiSS
, pp. 117-121
-
-
Lin, C.-K.1
Tseng, S.-C.2
Lee, L.-S.3
-
43
-
-
33646818116
-
RT-S: Surface rich transcription scoring, methodology, and initial results
-
M. Snover, R. Schwartz, B. Dorr, and J. Makhoul, "RT-S: Surface rich transcription scoring, methodology, and initial results," in Proc. DARPA Rich Transcription Workshop, 2004.
-
(2004)
Proc. DARPA Rich Transcription Workshop
-
-
Snover, M.1
Schwartz, R.2
Dorr, B.3
Makhoul, J.4
|