-
1
-
-
84892179040
-
The BBN Byblos 1997 large vocabulary conversational speech recognition system
-
G. Zavaliagkos, J. McDonough, D. Miller, A. El-Jaroudi, J. Billa, F. Richardson, K. Ma, M. Siu, and H. Gish, "The BBN Byblos 1997 large vocabulary conversational speech recognition system," in Proc. ICASSP, 1998, pp. 905-908.
-
(1998)
Proc. ICASSP
, pp. 905-908
-
-
Zavaliagkos, G.1
McDonough, J.2
Miller, D.3
El-Jaroudi, A.4
Billa, J.5
Richardson, F.6
Ma, K.7
Siu, M.8
Gish, H.9
-
2
-
-
0034847002
-
The 1998 HTK system for transcription of conversational telephone speech
-
T. Hain, P. Woodland, T. Niesler, and E. Whittaker, "The 1998 HTK system for transcription of conversational telephone speech," in Proc. ICASSP, 1999, pp. 57-60.
-
(1999)
Proc. ICASSP
, pp. 57-60
-
-
Hain, T.1
Woodland, P.2
Niesler, T.3
Whittaker, E.4
-
4
-
-
44849090969
-
Recognition and understanding of meetings: The AMI and AMIDA projects
-
S. Renals, T. Hain, and H. Bourlard, "Recognition and understanding of meetings: The AMI and AMIDA projects," in Proc. ASRU, 2007, pp. 238-247.
-
(2007)
Proc. ASRU
, pp. 238-247
-
-
Renals, S.1
Hain, T.2
Bourlard, H.3
-
5
-
-
85009067726
-
Toward the realization of spontaneous speech recognition-Introduction of a Japanese priority program and preliminary results
-
S. Furui, K. Maekawa, and H. Isahara, "Toward the realization of spontaneous speech recognition-Introduction of a Japanese priority program and preliminary results," in Proc. ICSLP, 2000, pp. 518-521.
-
(2000)
Proc. ICSLP
, pp. 518-521
-
-
Furui, S.1
Maekawa, K.2
Isahara, H.3
-
6
-
-
0141591531
-
Language modeling and transcription of the TED corpus lectures
-
E. Leeuwis, M. Federico, and M. Cettolo, "Language modeling and transcription of the TED corpus lectures," in Proc. ICASSP, 2003, pp. 232-235.
-
(2003)
Proc. ICASSP
, pp. 232-235
-
-
Leeuwis, E.1
Federico, M.2
Cettolo, M.3
-
7
-
-
33745188014
-
Transcribing lectures and seminars
-
L. Lamel, G. Adda, E. Bilinski, and J. Gauvain, "Transcribing lectures and seminars," in Proc. Eurospeech, 2005, pp. 1657-1660.
-
(2005)
Proc. Eurospeech
, pp. 1657-1660
-
-
Lamel, L.1
Adda, G.2
Bilinski, E.3
Gauvain, J.4
-
8
-
-
43849107616
-
Recent progress in the MIT spoken lecture processing project
-
J. Glass, T. Hazen, S. Cyphers, I. Malioutov, D. Huynh, and R. Barzilay, "Recent progress in the MIT spoken lecture processing project," in Proc. Eurospeech, 2007, pp. 2553-2556.
-
(2007)
Proc. Eurospeech
, pp. 2553-2556
-
-
Glass, J.1
Hazen, T.2
Cyphers, S.3
Malioutov, I.4
Huynh, D.5
Barzilay, R.6
-
9
-
-
51449113481
-
Automatic lecture transcription by exploiting presentation slide information for language model adaptation
-
T. Kawahara, Y. Nemoto, and Y. Akita, "Automatic lecture transcription by exploiting presentation slide information for language model adaptation," in Proc. ICASSP, 2008, pp. 4929-4932.
-
(2008)
Proc. ICASSP
, pp. 4929-4932
-
-
Kawahara, T.1
Nemoto, Y.2
Akita, Y.3
-
10
-
-
85009286782
-
Automatic transcription of courtroom speech
-
R. Prasad, L. Nguyen, R. Schwartz, and J. Makhoul, "Automatic transcription of courtroom speech," in Proc. ICSLP, 2002, pp. 1745-1748.
-
(2002)
Proc. ICSLP
, pp. 1745-1748
-
-
Prasad, R.1
Nguyen, L.2
Schwartz, R.3
Makhoul, J.4
-
11
-
-
34547531456
-
The LIMSI 2006 TC-STAR EPPS transcription systems
-
L. Lamel, J.-L. Gauvain, G. Adda, C. Barras, E. Bilinski, O. Galibert, A. Pujol, H. Schwenk, and X. Zhu, "The LIMSI 2006 TC-STAR EPPS transcription systems," in Proc. ICASSP, 2007, vol.4, pp. 997-1000.
-
(2007)
Proc. ICASSP
, vol.4
, pp. 997-1000
-
-
Lamel, L.1
Gauvain, J.-L.2
Adda, G.3
Barras, C.4
Bilinski, E.5
Galibert, O.6
Pujol, A.7
Schwenk, H.8
Zhu, X.9
-
12
-
-
44949265179
-
The 2006 RWTH parliamentary speeches transcription system
-
J. Loof, M. Bisani, C. Gollan, G. Heigold, B. Hoffmeister, C. Plahl, R. Schluter, and H. Ney, "The 2006 RWTH parliamentary speeches transcription system," in Proc. ICSLP, 2006, pp. 105-108.
-
(2006)
Proc. ICSLP
, pp. 105-108
-
-
Loof, J.1
Bisani, M.2
Gollan, C.3
Heigold, G.4
Hoffmeister, B.5
Plahl, C.6
Schluter, R.7
Ney, H.8
-
13
-
-
44949221858
-
The IBM 2006 speech transcription system for European parliamentary speeches
-
B. Ramabhadran, O. Siohan, L. Mangu, G. Zweig, M. Westphal, H. Schulz, and A. Soneiro, "The IBM 2006 speech transcription system for European parliamentary speeches," in Proc. ICSLP, 2006, pp. 1225-1228.
-
(2006)
Proc. ICSLP
, pp. 1225-1228
-
-
Ramabhadran, B.1
Siohan, O.2
Mangu, L.3
Zweig, G.4
Westphal, M.5
Schulz, H.6
Soneiro, A.7
-
14
-
-
4544316882
-
Advances in the automatic transcription of lectures
-
M. Cettolo, F. Brugnara, and M. Federico, "Advances in the automatic transcription of lectures," in Proc. ICASSP, 2004, pp. 769-772.
-
(2004)
Proc. ICASSP
, pp. 769-772
-
-
Cettolo, M.1
Brugnara, F.2
Federico, M.3
-
15
-
-
85044611587
-
The mathematics of statistical machine translation: Parameter estimation
-
P. Brown, S. Pietra, V. Pietra, and R. Mercer, "The mathematics of statistical machine translation: Parameter estimation," Comput. Linguist., vol.19, no.2, pp. 263-311, 1993.
-
(1993)
Comput. Linguist.
, vol.19
, Issue.2
, pp. 263-311
-
-
Brown, P.1
Pietra, S.2
Pietra, V.3
Mercer, R.4
-
16
-
-
10444241409
-
Filled-pause modeling for medical transcriptions
-
H. Schramm, X. Aubert, C. Meyer, and J. Peters, "Filled-pause modeling for medical transcriptions," in Proc. Workshop Spontaneous Speech Process. Recognition, 2003, pp. 143-146.
-
(2003)
Proc. Workshop Spontaneous Speech Process. Recognition
, pp. 143-146
-
-
Schramm, H.1
Aubert, X.2
Meyer, C.3
Peters, J.4
-
17
-
-
34547522348
-
Reconstructing medical dictations from automatically recognized and non-literal transcripts with phonetic similarity matching
-
S. Petrik and G. Kubin, "Reconstructing medical dictations from automatically recognized and non-literal transcripts with phonetic similarity matching," in Proc. ICASSP, 2007, vol.4, pp. 1125-1128.
-
(2007)
Proc. ICASSP
, vol.4
, pp. 1125-1128
-
-
Petrik, S.1
Kubin, G.2
-
18
-
-
0141480041
-
Language model adaptation using WFST-based speaking-style translation
-
T. Hori, D.Willett, and Y. Minami, "Language model adaptation using WFST-based speaking-style translation," in Proc. ICASSP, 2003, vol.1, pp. 228-231.
-
(2003)
Proc. ICASSP
, vol.1
, pp. 228-231
-
-
Hori, T.1
Willett, D.2
Minami, Y.3
-
19
-
-
4043075534
-
Extended models and tools for high-performance part-of-speech tagger
-
M. Asahara and Y. Matsumoto, "Extended models and tools for high-performance part-of-speech tagger," in Proc. COLING, 2000, pp. 21-27.
-
(2000)
Proc. COLING
, pp. 21-27
-
-
Asahara, M.1
Matsumoto, Y.2
-
20
-
-
0002652285
-
A maximum entropy approach to natural language processing
-
A. Berger, V. Della Pietra, and S. Della Pietra, "A maximum entropy approach to natural language processing," Comput. Linguist., vol.22, no.1, pp. 39-71, 1996.
-
(1996)
Comput. Linguist.
, vol.22
, Issue.1
, pp. 39-71
-
-
Berger, A.1
Della Pietra, V.2
Della Pietra, S.3
-
21
-
-
0030351374
-
On designing pronunciation lexicons for large vocabulary, continuous speech recognition
-
L. Lamel and G. Adda, "On designing pronunciation lexicons for large vocabulary, continuous speech recognition," in Proc. ICSLP, 1996, pp. 6-9.
-
(1996)
Proc. ICSLP
, pp. 6-9
-
-
Lamel, L.1
Adda, G.2
-
22
-
-
0030363039
-
Dictionary learning for spontaneous speech recognition
-
T. Sloboda and A.Waibel, "Dictionary learning for spontaneous speech recognition," in Proc. ICSLP, 1996, pp. 2328-2331.
-
(1996)
Proc. ICSLP
, pp. 2328-2331
-
-
Sloboda, T.1
Waibel, A.2
-
23
-
-
3042704466
-
Language model and speaking rate adaptation for spontaneous presentation speech recognition
-
Jul.
-
H. Nanjo and T. Kawahara, "Language model and speaking rate adaptation for spontaneous presentation speech recognition," IEEE Trans. Speech Audio Process., vol.12, no.4, pp. 391-400, Jul. 2004.
-
(2004)
IEEE Trans. Speech Audio Process.
, vol.12
, Issue.4
, pp. 391-400
-
-
Nanjo, H.1
Kawahara, T.2
-
24
-
-
0033353288
-
Stochastic pronunciation modelling from hand-labelled phonetic corpora
-
M. Riley, W. Byrne, M. Finke, S. Khudanpur,A. Ljolje, J. McDonough, H. Nock, M. Saraclar, C. Wooters, and G. Zavaliagkos, "Stochastic pronunciation modelling from hand-labelled phonetic corpora," Speech Commun., vol.29, pp. 209-224, 1999.
-
(1999)
Speech Commun.
, vol.29
, pp. 209-224
-
-
Riley, M.1
Byrne, W.2
Finke, M.3
Khudanpur, S.4
Ljolje, A.5
McDonough, J.6
Nock, H.7
Saraclar, M.8
Wooters, C.9
Zavaliagkos, G.10
-
25
-
-
0033077780
-
Automatic generation of multiple pronunciations based on neural networks
-
T. Fukada, T. Yoshimura, and Y. Sagisaka, "Automatic generation of multiple pronunciations based on neural networks," Speech Communication, vol.27, pp. 63-73, 1999.
-
(1999)
Speech Communication
, vol.27
, pp. 63-73
-
-
Fukada, T.1
Yoshimura, T.2
Sagisaka, Y.3
-
26
-
-
0030672090
-
Automatic alternative transcription generation and vocabulary selection for flexible word recognizers
-
D. Torre, L. Villarrubia, J. Elvira, and L. Hernandez-Gomez, "Automatic alternative transcription generation and vocabulary selection for flexible word recognizers," in Proc. ICASSP, 1997, pp. 1463-1466.
-
(1997)
Proc. ICASSP
, pp. 1463-1466
-
-
Torre, D.1
Villarrubia, L.2
Elvira, J.3
Hernandez-Gomez, L.4
-
27
-
-
33646759445
-
Pronunciation variation modeling for ASR: Large improvements are possible but small ones are likely to achieve
-
Q. Yang, J.-P. Martens, P.-J. Ghesquiere, and D. Compernolle, "Pronunciation variation modeling for ASR: Large improvements are possible but small ones are likely to achieve," in Proc. ICSLP Workshop Pronunciation Modeling Lexicon Adaptation for Spoken Lang. Technol., 2002, pp. 123-128.
-
(2002)
Proc. ICSLP Workshop Pronunciation Modeling Lexicon Adaptation for Spoken Lang. Technol.
, pp. 123-128
-
-
Yang, Q.1
Martens, J.-P.2
Ghesquiere, P.-J.3
Compernolle, D.4
-
29
-
-
0029725604
-
A parametric approach to vocal tract length normalization
-
E. Eide and H. Gish, "A parametric approach to vocal tract length normalization," in Proc. ICASSP, 1996, vol.1, pp. 346-349.
-
(1996)
Proc. ICASSP
, vol.1
, pp. 346-349
-
-
Eide, E.1
Gish, H.2
-
30
-
-
0029747183
-
Speaker normalization using efficient frequency warping procedures
-
L. Lee and R. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. ICASSP, 1996, vol.1, pp. 353-356.
-
(1996)
Proc. ICASSP
, vol.1
, pp. 353-356
-
-
Lee, L.1
Rose, R.2
-
31
-
-
0036296863
-
Minimum phone error and I-smoothing for improved discriminative training
-
D. Povey and P. Woodland, "Minimum phone error and I-smoothing for improved discriminative training," in Proc. ICASSP, 2002, vol.1, pp. 105-108.
-
(2002)
Proc. ICASSP
, vol.1
, pp. 105-108
-
-
Povey, D.1
Woodland, P.2
-
32
-
-
33646809034
-
Generalized statistical modeling of pronunciation variations using variable-length phone context
-
Y. Akita and T. Kawahara, "Generalized statistical modeling of pronunciation variations using variable-length phone context," in Proc. ICASSP, 2005, vol.1, pp. 689-692.
-
(2005)
Proc. ICASSP
, vol.1
, pp. 689-692
-
-
Akita, Y.1
Kawahara, T.2
|