-
1
-
-
0030638031
-
A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
-
JG Fiscus, "A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)," in Proc. IEEE ASRU, 1997, pp. 347-352
-
(1997)
Proc. IEEE ASRU
, pp. 347-352
-
-
Fiscus, J.G.1
-
2
-
-
85061808589
-
Explicit word error minimization in n-best list rescoring
-
A Stolcke, Y Konig, and M Weintraub, "Explicit word error minimization in n-best list rescoring.," in EUROSPEECH, 1997
-
(1997)
EUROSPEECH
-
-
Stolcke, A.1
Konig, Y.2
Weintraub, M.3
-
3
-
-
0034296009
-
Finding consensus in speech recognition: Word error minimization and other appli-cations of confusion networks
-
L. Mangu, E. Brill, and A. Stolcke, "Finding consensus in speech recognition: word error minimization and other appli-cations of confusion networks," Computer Speech and Language, vol. 14, no. 4, pp. 373-400, 2000
-
(2000)
Computer Speech and Language
, vol.14
, Issue.4
, pp. 373-400
-
-
Mangu, L.1
Brill, E.2
Stolcke, A.3
-
5
-
-
44949249226
-
Generating complementary systems for speech recognition
-
C Breslin and MJF Gales, "Generating complementary systems for speech recognition.," in INTERSPEECH, 2006
-
(2006)
INTERSPEECH
-
-
Breslin, C.1
Gales, M.2
-
6
-
-
58149202339
-
Directed decision trees for generating complementary systems
-
C. Breslin and M. J. F. Gales, "Directed decision trees for generating complementary systems," Speech Communication, vol. 51, no. 3, pp. 284-295, 2009
-
(2009)
Speech Communication
, vol.51
, Issue.3
, pp. 284-295
-
-
Breslin, C.1
Gales, M.J.F.2
-
7
-
-
0009129790
-
Adaptively growing hierarchical mixtures of experts
-
J. Fritsch, M. Finke, and A. Waibel, "Adaptively growing hierarchical mixtures of experts," in Advances in Neural Information Processing Systems, 1997, pp. 459-465
-
(1997)
Advances in Neural Information Processing Systems
, pp. 459-465
-
-
Fritsch, J.1
Finke, M.2
Waibel, A.3
-
9
-
-
34547548235
-
Probabilistic and bottleneck features for LVCSR of meetings
-
F. Grezl, M Karafiat, S. Kontar, and J. Cernokcy, "Probabilistic and bottleneck features for LVCSR of meetings," in Proc. IEEE ICASSP, 2007
-
(2007)
Proc. IEEE ICASSP
-
-
Grezl, F.1
Karafiat, M.2
Kontar, S.3
Cernokcy, J.4
-
10
-
-
0033709098
-
Tandem connectionist feature extraction for conventional HMM systems
-
H Hermansky, DPW Ellis, and S Sharma, "Tandem connectionist feature extraction for conventional HMM systems," in Proc. IEEE ICASSP, 2000
-
(2000)
Proc. IEEE ICASSP
-
-
Hermansky, H.1
Ellis, D.P.W.2
Sharma, S.3
-
11
-
-
84874245054
-
Transcription of multigenre media archives using out-of-domain data
-
P. Bell, M. Gales, P. Lanchantin, X. Liu, Y. Long, S. Renals, P. Swietojanski, and P. Woodland, "Transcription of multigenre media archives using out-of-domain data," in Proc. IEEE Workshop on Spoken Language Technology, Miami, 2012
-
(2012)
Proc. IEEE Workshop on Spoken Language Technology, Miami
-
-
Bell, P.1
Gales, M.2
Lanchantin, P.3
Liu, X.4
Long, Y.5
Renals, S.6
Swietojanski, P.7
Woodland, P.8
-
13
-
-
84055222005
-
Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
-
GE Dahl, D Yu, L Deng, and A Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Transactions on Audio, Speech &Language Processing, vol. 20, no. 1, pp. 30-42, 2012
-
(2012)
IEEE Transactions on Audio, Speech &Language Processing
, vol.20
, Issue.1
, pp. 30-42
-
-
Dahl, G.E.1
Yu, D.2
Deng, L.3
Acero, A.4
-
14
-
-
0032050110
-
Maximum likelihood linear transformations for HMM-based speech recognition
-
April
-
MJF Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer Speech and Language, vol. 12, no. 2, pp. 75-98, April 1998
-
(1998)
Computer Speech and Language
, vol.12
, Issue.2
, pp. 75-98
-
-
Gales, M.1
-
15
-
-
85045373614
-
Overview of the IWSLT 2012 evaluation campaign
-
Hong Kong, HK, December
-
M Federico, M. Cettolo, L. Bentivogli, M. Paul, and S. Stuker, "Overview of the IWSLT 2012 evaluation campaign," in Proc. of the International Workshop on Spoken Language Translation, Hong Kong, HK, December 2012
-
(2012)
Proc. of the International Workshop on Spoken Language Translation
-
-
Federico, M.1
Cettolo, M.2
Bentivogli, L.3
Paul, M.4
Stuker, S.5
-
16
-
-
0028204660
-
Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition
-
jan
-
C. Dugast, L. Devillers, and X. Aubert, "Combining TDNN and HMM in a hybrid system for improved continuous-speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 1, pp. 217-223, jan 1994
-
(1994)
Speech and Audio Processing, IEEE Transactions on
, vol.2
, Issue.1
, pp. 217-223
-
-
Dugast, C.1
Devillers, L.2
Aubert, X.3
-
17
-
-
0028194709
-
Connectionist probability estimators in HMM speech recognition
-
S Renals, N Morgan, H Bourlard, M Cohen, and H Franco, "Connectionist probability estimators in HMM speech recognition," IEEE Transactions on Speech and Audio Processing, vol. 2, no. 1, pp. 161-174, 1994
-
(1994)
IEEE Transactions on Speech and Audio Processing
, vol.2
, Issue.1
, pp. 161-174
-
-
Renals, S.1
Morgan, N.2
Bourlard, H.3
Cohen, M.4
Franco, H.5
-
18
-
-
0002384092
-
Large vocabulary continuous speech recognition using a hybrid connectionist/ HMM system
-
1994
-
M. Hochberg, S. Renals, T. Robinson, and D. Kershaw, "Large vocabulary continuous speech recognition using a hybrid connectionist/ HMM system," in Proc. ICSLP, Yokohama
-
(1994)
Proc. ICSLP, Yokohama
-
-
Hochberg, M.1
Renals, S.2
Robinson, T.3
Kershaw, D.4
-
19
-
-
0028288775
-
A hybrid segmental neural net/hidden Markov model system for continuous speech recognition
-
jan
-
G. Zavaliagkos, Y. Zhao, R. Schwartz, and J. Makhoul, "A hybrid segmental neural net/hidden Markov model system for continuous speech recognition," Speech and Audio Processing, IEEE Transactions on, vol. 2, no. 1, pp. 151-160, jan 1994
-
(1994)
Speech and Audio Processing, IEEE Transactions on
, vol.2
, Issue.1
, pp. 151-160
-
-
Zavaliagkos, G.1
Zhao, Y.2
Schwartz, R.3
Makhoul, J.4
-
20
-
-
0029732695
-
Multilayer perceptrons for statedependent weightings of HMM likelihoods
-
Y. J. Chung and C. K. Un, "Multilayer perceptrons for statedependent weightings of HMM likelihoods," Speech Communication, vol. 18, no. 1, pp. 79-89, 1996
-
(1996)
Speech Communication
, vol.18
, Issue.1
, pp. 79-89
-
-
Chung, Y.J.1
Un, C.K.2
-
22
-
-
84878539964
-
Application of pretrained deep neural networks to large vocabulary speech recognition
-
N Jaitly, P Nguyen, A Senior, and V Vanhoucke, "Application of pretrained deep neural networks to large vocabulary speech recognition," in Interspeech, 2012
-
(2012)
Interspeech
-
-
Jaitly, N.1
Nguyen, P.2
Senior, A.3
Vanhoucke, V.4
-
23
-
-
79959814724
-
Scarf: A segmental conditional random field toolkit for speech recognition
-
G Zweig and P Nguyen, "Scarf: A segmental conditional random field toolkit for speech recognition," in Interspeech, 2010, pp. 2858-2861
-
(2010)
Interspeech
, pp. 2858-2861
-
-
Zweig, G.1
Nguyen, P.2
-
24
-
-
0034825241
-
Multi-stream adaptive evidence combination for noise robust ASR
-
A Morris, A Hagen, H Glotin, and H Bourlard, "Multi-stream adaptive evidence combination for noise robust ASR," Speech Communication, vol. 34, no. 1-2, pp. 25-40, 2001
-
(2001)
Speech Communication
, vol.34
, Issue.1-2
, pp. 25-40
-
-
Morris, A.1
Hagen, A.2
Glotin, H.3
Bourlard, H.4
-
25
-
-
79953250475
-
Minimum bayes risk decoding and system combination based on a recursion for edit distance
-
October
-
H Xu, D Povey, L Mangu, and J Zhu, "Minimum bayes risk decoding and system combination based on a recursion for edit distance," Computer Speech and Language, vol. 25, no. 4, pp. 802-828, October 2011
-
(2011)
Computer Speech and Language
, vol.25
, Issue.4
, pp. 802-828
-
-
Xu, H.1
Povey, D.2
Mangu, L.3
Zhu, J.4
-
26
-
-
85001124710
-
Wit3: Web inventory of transcribed and translated talks
-
Trento, Italy, May
-
M. Cettolo, C. Girardi, and M. Federico, "Wit3: Web inventory of transcribed and translated talks," in Proceedings of the 16th Conference of the European Association for Machine Translation (EAMT), Trento, Italy, May 2012, pp. 261-268
-
(2012)
Proceedings of the 16th Conference of the European Association for Machine Translation (EAMT)
, pp. 261-268
-
-
Cettolo, M.1
Girardi, C.2
Federico, M.3
-
27
-
-
84890543632
-
The UEDIN systems for the IWSLT 2012 evaluation
-
E. Hasler, P. Bell, A. Ghoshal, B. Haddow, P. Koehn, F. McInnes, S. Renals, and P. Swietojanski, "The UEDIN systems for the IWSLT 2012 evaluation," in Proc. IWSLT, 2012
-
(2012)
Proc. IWSLT
-
-
Hasler, E.1
Bell, P.2
Ghoshal, A.3
Haddow, B.4
Koehn, P.5
McInnes, F.6
Renals, S.7
Swietojanski, P.8
-
28
-
-
84874276847
-
The Kaldi speech recognition toolkit
-
December
-
D. Povey, A. Ghoshal, G. Boulianne, L. Burget, O. Glembek, N. Goel, M. Hannemann, P. Motl?cek, Y. Qian, P. Schwarz, J. Silovsky, G. Stemmer, and K. Vesely, "The Kaldi speech recognition toolkit," in Proc. IEEE ASRU, December 2011
-
(2011)
Proc. IEEE ASRU
-
-
Povey, D.1
Ghoshal, A.2
Boulianne, G.3
Burget, L.4
Glembek, O.5
Goel, N.6
Hannemann, M.7
Motlcek, P.8
Qian, Y.9
Schwarz, P.10
Silovsky, J.11
Stemmer, G.12
Vesely, K.13
-
29
-
-
84873443879
-
Theano: A CPU and GPU math expression compiler
-
J Bergstra, O Breuleux, F Bastien, P Lamblin, R Pascanu, G Desjardins, J Turian, D Warde-Farley, and Y Bengio, "Theano: A CPU and GPU math expression compiler," in Proc. SciPy, 2010
-
(2010)
Proc. SciPy
-
-
Bergstra, J.1
Breuleux, O.2
Bastien, F.3
Lamblin, P.4
Pascanu, R.5
Desjardins, G.6
Turian, J.7
Warde-Farley, D.8
Bengio, Y.9
-
30
-
-
51449120120
-
Boosted MMI for model and featurespace discriminative training
-
D Povey, D Kanevsky, B Kingsbury, B Ramabhadran, G Saon, and K Visweswariah, "Boosted MMI for model and featurespace discriminative training," in Proc. IEEE ICASSP, 2008, pp. 4057-4060
-
(2008)
Proc. IEEE ICASSP
, pp. 4057-4060
-
-
Povey, D.1
Kanevsky, D.2
Kingsbury, B.3
Ramabhadran, B.4
Saon, G.5
Visweswariah, K.6
-
31
-
-
85008520364
-
Transcribing meetings with the AMIDA systems
-
T. Hain, L. Burget, J. Dines, P.N. Garner, F. Grezl, A.E. Hannani, M. Huijbregts, M. Karafiat, M. Lincoln, and V. Wan, "Transcribing meetings with the AMIDA systems," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 2, pp. 486-498, 2012
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.2
, pp. 486-498
-
-
Hain, T.1
Burget, L.2
Dines, J.3
Garner, P.N.4
Grezl, F.5
Hannani, A.E.6
Huijbregts, M.7
Karafiat, M.8
Lincoln, M.9
Wan, V.10
-
34
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
GE Hinton, S Osindero, and Y Teh, "A fast learning algorithm for deep belief nets," Neural Computation, vol. 18, 2006
-
(2006)
Neural Computation
, vol.18
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
35
-
-
84858976070
-
Feature engineering in context-dependent deep neural networks for conversational speech transcription
-
F Seide, G Li, X Chen, and D Yu, "Feature engineering in context-dependent deep neural networks for conversational speech transcription," in Proc. IEEE ASRU, 2011
-
(2011)
Proc. IEEE ASRU
-
-
Seide, F.1
Li, G.2
Chen, X.3
Yu, D.4
-
36
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A Mohamed, GE Dahl, and GE Hinton, "Acoustic modeling using deep belief networks," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 1, 2012
-
(2012)
IEEE Transactions on Audio, Speech, and Language Processing
, vol.20
, Issue.1
-
-
Mohamed, A.1
Dahl, G.E.2
Hinton, G.E.3
|