-
1
-
-
85083514474
-
Parsing conversational speech using enhanced segmentation
-
J. G. Kahn, M. Ostendorf, and C. Chelba, "Parsing conversational speech using enhanced segmentation," in Proc. HLT/NAACL, 2004, pp. 121-128.
-
(2004)
Proc. HLT/NAACL
, pp. 121-128
-
-
Kahn, J.G.1
Ostendorf, M.2
Chelba, C.3
-
2
-
-
68549138413
-
-
S. Strassel, Simple metadata annotation specification V6.2, Linguistic Data Consortium, 2004 [Online]. Available: http://www.ldc. upenn.edu/Projects/MDE/Guidelines/SimpleMDE V6.2.pdf
-
S. Strassel, "Simple metadata annotation specification V6.2," Linguistic Data Consortium, 2004 [Online]. Available: http://www.ldc. upenn.edu/Projects/MDE/Guidelines/SimpleMDE V6.2.pdf
-
-
-
-
4
-
-
24144462548
-
Analysis and recognition of spontaneous speech using corpus of spontaneous japanese
-
S. Furui, M. Nakamura, T. Ichiba, and K. Iwano, "Analysis and recognition of spontaneous speech using corpus of spontaneous japanese," Speech Commun., vol. 47, pp. 208-219, 2005.
-
(2005)
Speech Commun
, vol.47
, pp. 208-219
-
-
Furui, S.1
Nakamura, M.2
Ichiba, T.3
Iwano, K.4
-
5
-
-
33646798740
-
The IBM 2004 conversational telephony system for rich transcription
-
H. Soltau, B. Kingsbury, L. Mangu, D. Povey, G. Saon, and G. Zweig, "The IBM 2004 conversational telephony system for rich transcription," in Proc. IEEE ICASSP, 2005, pp. 205-208.
-
(2005)
Proc. IEEE ICASSP
, pp. 205-208
-
-
Soltau, H.1
Kingsbury, B.2
Mangu, L.3
Povey, D.4
Saon, G.5
Zweig, G.6
-
6
-
-
27744599401
-
Automatic transcription of conversational telephone speech
-
Nov
-
T. Hain, P. C. Woodland, G. Evermann, M. J. F. Gales, X. Liu, G. L. Moore, D. Povey, and L. Wang, "Automatic transcription of conversational telephone speech," IEEE Trans. Speech Audio Process., vol. 13, no. 6, pp. 1173-1185, Nov. 2005.
-
(2005)
IEEE Trans. Speech Audio Process
, vol.13
, Issue.6
, pp. 1173-1185
-
-
Hain, T.1
Woodland, P.C.2
Evermann, G.3
Gales, M.J.F.4
Liu, X.5
Moore, G.L.6
Povey, D.7
Wang, L.8
-
7
-
-
34047266609
-
Multistage speaker diarization of broadcast news
-
Sep
-
C. Barras, X. Zhu, S. Meignier, and J.-L. Gauvain, "Multistage speaker diarization of broadcast news," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1505-1512, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.5
, pp. 1505-1512
-
-
Barras, C.1
Zhu, X.2
Meignier, S.3
Gauvain, J.-L.4
-
8
-
-
34047261805
-
An overview of automatic speaker diarization systems
-
Sep
-
S. E. Tranter and D. A. Reynolds, "An overview of automatic speaker diarization systems," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1557-1565, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.5
, pp. 1557-1565
-
-
Tranter, S.E.1
Reynolds, D.A.2
-
9
-
-
34047266607
-
Enriching speech recognition with automatic detection of sentence boundaries and disfluencies
-
Sep
-
Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, and M. Harper, "Enriching speech recognition with automatic detection of sentence boundaries and disfluencies," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1526-1540, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.5
, pp. 1526-1540
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
Hillard, D.4
Ostendorf, M.5
Harper, M.6
-
10
-
-
34047271072
-
Recognizing disfluencies in conversational speech
-
Sep
-
M. Lease, M. Johnson, and E. Charniak, "Recognizing disfluencies in conversational speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1566-1573, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.5
, pp. 1566-1573
-
-
Lease, M.1
Johnson, M.2
Charniak, E.3
-
11
-
-
34047266604
-
Edit disfluency detection and correction using a cleanup language model and an alignment model
-
Sep
-
J.-F. Yeh and C.-H. Wu, "Edit disfluency detection and correction using a cleanup language model and an alignment model," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1574-1583, Sep. 2006.
-
(2006)
IEEE Trans. Audio, Speech, Lang. Process
, vol.14
, Issue.5
, pp. 1574-1583
-
-
Yeh, J.-F.1
Wu, C.-H.2
-
12
-
-
0040958578
-
Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue
-
P. Heeman and J. Allen, "Speech repairs, intonational phrases and discourse markers: Modeling speakers' utterances in spoken dialogue," Comput. Linguist., vol. 25, pp. 527-571, 1999.
-
(1999)
Comput. Linguist
, vol.25
, pp. 527-571
-
-
Heeman, P.1
Allen, J.2
-
13
-
-
85120620835
-
Edit detection and parsing for transcribed speech
-
E. Charniak and M. Johnson, "Edit detection and parsing for transcribed speech," in Proc. NAACL, 2001, pp. 118-126.
-
(2001)
Proc. NAACL
, pp. 118-126
-
-
Charniak, E.1
Johnson, M.2
-
14
-
-
57849131781
-
a TAG-based noisy channel model of speech repairs
-
m. Johnson and e. Charniak, "a TAG-based noisy channel model of speech repairs," in Proc. ACL, 2004.
-
(2004)
Proc. ACL
-
-
Johnson, M.1
Charniak, E.2
-
15
-
-
33646762857
-
Automatic disfluency removal on recognized spontaneous speech - Rapid adaptation to speaker dependent disfluencies
-
M. Honal and T. Schultz, "Automatic disfluency removal on recognized spontaneous speech - Rapid adaptation to speaker dependent disfluencies," in Proc. ICASSP, 2005, pp. 969-972.
-
(2005)
Proc. ICASSP
, pp. 969-972
-
-
Honal, M.1
Schultz, T.2
-
16
-
-
56149102222
-
Corrections of disfluencies in spontaneous speech using a noisy channel approach
-
M. Honal and T. Schultz, "Corrections of disfluencies in spontaneous speech using a noisy channel approach," in Proc. Eurospeech, 2003, pp. 2781-2784.
-
(2003)
Proc. Eurospeech
, pp. 2781-2784
-
-
Honal, M.1
Schultz, T.2
-
17
-
-
0028215480
-
A corpus-based study of repair cues in spontaneous speech
-
C. Nakatani and J. Hirschberg, "A corpus-based study of repair cues in spontaneous speech," J. Acoust. Soc. Amer., pp. 1603-1616, 1994.
-
(1994)
J. Acoust. Soc. Amer
, pp. 1603-1616
-
-
Nakatani, C.1
Hirschberg, J.2
-
18
-
-
0000703860
-
Phonetic consequences of speech disfluency
-
E. Shriberg, "Phonetic consequences of speech disfluency," in Proc. Int. Conf. Phonetics Sci., 1999, pp. 619-622.
-
(1999)
Proc. Int. Conf. Phonetics Sci
, pp. 619-622
-
-
Shriberg, E.1
-
19
-
-
0030351630
-
Juncture cues to disfluency
-
R. Lickley, "Juncture cues to disfluency," in Proc. ICSLP, 1996, pp. 2478-2481.
-
(1996)
Proc. ICSLP
, pp. 2478-2481
-
-
Lickley, R.1
-
20
-
-
84878523744
-
Prosodic features of four types of disfluencies
-
G. Savova and J. Bachenko, "Prosodic features of four types of disfluencies," in Proc. DiSS, 2003, pp. 91-94.
-
(2003)
Proc. DiSS
, pp. 91-94
-
-
Savova, G.1
Bachenko, J.2
-
21
-
-
0010125082
-
A prosody-only decision-tree model for disfluency detection
-
E. Shriberg and A. Stolcke, "A prosody-only decision-tree model for disfluency detection," in Proc. Eurospeech, 1997, pp. 2383-2386.
-
(1997)
Proc. Eurospeech
, pp. 2383-2386
-
-
Shriberg, E.1
Stolcke, A.2
-
22
-
-
0034275920
-
Prosody-based automatic segmentation of speech into sentences and topics
-
E. Shriberg, A. Stolcke, D. Hakkani-Tur, and G. Tur, "Prosody-based automatic segmentation of speech into sentences and topics," Speech Commun., pp. 127-154, 2000.
-
(2000)
Speech Commun
, pp. 127-154
-
-
Shriberg, E.1
Stolcke, A.2
Hakkani-Tur, D.3
Tur, G.4
-
23
-
-
33745191849
-
Comparing HMM, maximum entropy, and conditional random fields for disfluency detection
-
Y. Liu, A. Stolcke, E. Shriberg, and M. Harper, "Comparing HMM, maximum entropy, and conditional random fields for disfluency detection," in Proc. Eurospeech, 2005, pp. 3313-3316.
-
(2005)
Proc. Eurospeech
, pp. 3313-3316
-
-
Liu, Y.1
Stolcke, A.2
Shriberg, E.3
Harper, M.4
-
24
-
-
33646800879
-
-
Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, Structural metadata research in the ears program, presented at the icassp, 2005, pp. 957-960, unpublished.
-
Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Structural metadata research in the ears program," presented at the icassp, 2005, pp. 957-960, unpublished.
-
-
-
-
25
-
-
85009223733
-
Automatic disfluency identification in conversational speech using multiple knowledge sources
-
Y. Liu, E. Shriberg, and A. Stolcke, "Automatic disfluency identification in conversational speech using multiple knowledge sources," in Proc. Eurospeech, 2003, pp. 957-960.
-
(2003)
Proc. Eurospeech
, pp. 957-960
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
-
26
-
-
85009142186
-
Using machine learning to cope with imbalanced classes in natural speech: Evidence from sentence boundary and disfluency detection
-
Y. Liu, E. Shriberg, A. Stolcke, and M. Harper, "Using machine learning to cope with imbalanced classes in natural speech: Evidence from sentence boundary and disfluency detection," in Proc. ICSLP, 2004, pp. 1525-1528.
-
(2004)
Proc. ICSLP
, pp. 1525-1528
-
-
Liu, Y.1
Shriberg, E.2
Stolcke, A.3
Harper, M.4
-
27
-
-
33646764337
-
A lexically-driven algorithm for disfluency detection
-
M. Snover, B. Dorr, and R. Schwartz, "A lexically-driven algorithm for disfluency detection," in Proc. HLT/NAACL, 2004, pp. 157-160.
-
(2004)
Proc. HLT/NAACL
, pp. 157-160
-
-
Snover, M.1
Dorr, B.2
Schwartz, R.3
-
28
-
-
33646819463
-
Detecting structural metadata with decision trees and transformation-based learning
-
J. Kim, S. Schwarm, and M. Ostendorf, "Detecting structural metadata with decision trees and transformation-based learning," in Proc. HLT/ NAACL, 2004.
-
(2004)
Proc. HLT/ NAACL
-
-
Kim, J.1
Schwarm, S.2
Ostendorf, M.3
-
29
-
-
0002652285
-
A maximum entropy approach to natural language processing
-
A. L. Berger, S. A. Della Pietra, and V. J. Della Pietra, "A maximum entropy approach to natural language processing," Comput. Linguist., vol. 22, pp. 39-72, 1996.
-
(1996)
Comput. Linguist
, vol.22
, pp. 39-72
-
-
Berger, A.L.1
Della Pietra, S.A.2
Della Pietra, V.J.3
-
30
-
-
0000732463
-
A limited memory algorithm for bound constrained optimization
-
R. H. Ryrd, P. Lu, and J. Nocedal, "A limited memory algorithm for bound constrained optimization," SIAM J. Sci. Statist. Comput., vol. 16, no. 5, pp. 1190-1208, 1995.
-
(1995)
SIAM J. Sci. Statist. Comput
, vol.16
, Issue.5
, pp. 1190-1208
-
-
Ryrd, R.H.1
Lu, P.2
Nocedal, J.3
-
31
-
-
0004014502
-
A Gaussian Prior for Smoothing Maximum Entropy Models Carnegie Mellon Univ., Pittsburgh, PA
-
Tech. Rep
-
S. Chen and R. Rosenfeld, A Gaussian Prior for Smoothing Maximum Entropy Models Carnegie Mellon Univ., Pittsburgh, PA, 1999, Tech. Rep..
-
(1999)
-
-
Chen, S.1
Rosenfeld, R.2
-
32
-
-
0032346848
-
Bayesian CART model search
-
H. Chipman, E. I. George, and R. E. McCulloch, "Bayesian CART model search," J. Amer. Statist. Assoc., vol. 93, no. 443, pp. 935-947, 1998.
-
(1998)
J. Amer. Statist. Assoc
, vol.93
, Issue.443
, pp. 935-947
-
-
Chipman, H.1
George, E.I.2
McCulloch, R.E.3
-
33
-
-
21844474040
-
Fluent speech prosody: Framework and modeling
-
July, Special Issue on Quantitative Prosody Modeling for Natural Speech Description and Generation
-
C.-Y. Tseng, S.-H. Pin, Y.-L. Lee, H.-M. Wang, and Y.-C. Chen, "Fluent speech prosody: Framework and modeling," Speech Commun., vol. 46, no. 3-4, pp. 284-309, July 2005, Special Issue on Quantitative Prosody Modeling for Natural Speech Description and Generation.
-
(2005)
Speech Commun
, vol.46
, Issue.3-4
, pp. 284-309
-
-
Tseng, C.-Y.1
Pin, S.-H.2
Lee, Y.-L.3
Wang, H.-M.4
Chen, Y.-C.5
-
34
-
-
11244330002
-
Probabilistic latent semantic analysis
-
T. Hofmann, "Probabilistic latent semantic analysis," in Uncertainty Artif. Intell., 1999, pp. 289-296.
-
(1999)
Uncertainty Artif. Intell
, pp. 289-296
-
-
Hofmann, T.1
-
35
-
-
40349098806
-
Learning the threshold in hierarchical agglomerative clustering
-
K. Daniels and C. Giraud-Carrier, "Learning the threshold in hierarchical agglomerative clustering," in Proc. ICMLA, 2006, pp. 270-278.
-
(2006)
Proc. ICMLA
, pp. 270-278
-
-
Daniels, K.1
Giraud-Carrier, C.2
-
37
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," in Proc. ICML, 2001, pp. 282-289.
-
(2001)
Proc. ICML
, pp. 282-289
-
-
Lafferty, J.1
McCallum, A.2
Pereira, F.3
-
38
-
-
68549091556
-
-
S.-C. Tseng, Processing spoken mandarin corpora, Traitement Automatique des Langues, 45, no. 2, pp. 89-108, Special Issue: Spoken Corpus Processing.
-
S.-C. Tseng, "Processing spoken mandarin corpora," Traitement Automatique des Langues, vol. 45, no. 2, pp. 89-108, Special Issue: Spoken Corpus Processing.
-
-
-
-
39
-
-
33947691278
-
Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)
-
Y.-C. Hsieh, Y.-T. Huang, C.-C. Wang, and L.-S. Lee, "Improved spoken document retrieval with dynamic key term lexicon and probabilistic latent semantic analysis (PLSA)," in Proc. ICASSP, 2006, pp. 961-964.
-
(2006)
Proc. ICASSP
, pp. 961-964
-
-
Hsieh, Y.-C.1
Huang, Y.-T.2
Wang, C.-C.3
Lee, L.-S.4
-
40
-
-
68549116144
-
Rich Transcription (RT-04F)
-
Evaluation Plan, Online, Available
-
"Rich Transcription (RT-04F)," Evaluation Plan 2004 [Online]. Available: http://www.nist.gov/speech/tests/rt/rt2004/fall/docs/rt04f-eval-plan- v14.doc
-
(2004)
-
-
-
41
-
-
0001884644
-
Individual comparisons by ranking methods
-
F. Wilcoxon, "Individual comparisons by ranking methods," Biometrics, vol. 1, pp. 80-83, 1945.
-
(1945)
Biometrics
, vol.1
, pp. 80-83
-
-
Wilcoxon, F.1
|