-
1
-
-
84865801985
-
Conversational speech transcription using context-dependent deep neural networks
-
Frank Seide, Gang Li, and Dong Yu, "Conversational speech transcription using context-dependent deep neural networks," in Interspeech, 2011.
-
(2011)
Interspeech
-
-
Seide, F.1
Li, G.2
Yu, D.3
-
2
-
-
0000494467
-
Handwritten digit recognition with a back-propagation network
-
Y. Le Cun, B. Boser, J. Denker, D. Henderson, R. Howard, W. Hubbard, and L. Jackel, "Handwritten digit recognition with a back-propagation network," in Proc. NIPS, 1990.
-
(1990)
Proc. NIPS
-
-
Le Cun, Y.1
Boser, B.2
Denker, J.3
Henderson, D.4
Howard, R.5
Hubbard, W.6
Jackel, L.7
-
4
-
-
0023833469
-
Phoneme recognition: Neural networks vs hidden Markov models
-
A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, and K. Lang, "Phoneme recognition: Neural networks vs hidden Markov models," in Proc. ICASSP, 1988.
-
(1988)
Proc. ICASSP
-
-
Waibel, A.1
Hanazawa, T.2
Hinton, G.3
Shikano, K.4
Lang, K.5
-
5
-
-
84867605836
-
Applying convolutional neural network concepts to hybrid NNHMM model for speech recognition
-
o. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural network concepts to hybrid NNHMM model for speech recognition," in Proc. ICASSP, 2012.
-
(2012)
Proc. ICASSP
-
-
Abdel-Hamid, O.1
Mohamed, A.2
Jiang, H.3
Penn, G.4
-
6
-
-
84890525984
-
Deep convolutional neural networks for LVCSR
-
T. Sainath, A. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for LVCSR," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Sainath, T.1
Mohamed, A.2
Kingsbury, B.3
Ramabhadran, B.4
-
7
-
-
84893654379
-
Improvements to deep convolutional neural networks for LVCSR
-
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George Dahl, George Saon, Hagen Soltau, Tomas Beran, Aleksandr Y. Aravkin, and Bhuvana Ramabhadran, "Improvements to deep convolutional neural networks for LVCSR," in Proc. ASRU, 2013.
-
(2013)
Proc. ASRU
-
-
Sainath, T.N.1
Kingsbury, B.2
Mohamed, A.-R.3
Dahl, G.4
Saon, G.5
Soltau, H.6
Beran, T.7
Aravkin, A.Y.8
Ramabhadran, B.9
-
8
-
-
33745528628
-
Using MLP features in SRIs conversational speech recognition system
-
Qifeng Zhu, Andreas Stolcke, Barry Y. Chen, and Nelson Morgan, "Using MLP features in SRIs conversational speech recognition system," in in Proc. Interspeech, 2005, pp. 21412144.
-
(2005)
Proc. Interspeech
, pp. 21412144
-
-
Zhu, Q.1
Stolcke, A.2
Chen, B.Y.3
Morgan, N.4
-
9
-
-
84906275972
-
Development of the RWTH transcription system for slovenian
-
Lyon, France, Aug
-
Pavel Golik, Zoltan Tueske, Ralf Schlueter, and Hermann Ney, "Development of the RWTH Transcription System for Slovenian," in Interspeech, Lyon, France, Aug. 2013, pp. 31073111.
-
(2013)
Interspeech
, pp. 31073111
-
-
Golik, P.1
Tueske, Z.2
Schlueter, R.3
Ney, H.4
-
10
-
-
84906257050
-
Neural network acoustic models for the darpa rats program
-
Hagen Soltau, Hong-Kwang Kuo, Lidia Mangu, George Saon, and Tomas Beran, "Neural Network Acoustic Models for the DARPA RATS Program," in Proc. Interspeech, 2013.
-
(2013)
Proc. Interspeech
-
-
Soltau, H.1
Kuo, H.-K.2
Mangu, L.3
Saon, G.4
Beran, T.5
-
11
-
-
84878397276
-
Pipelined back-propagation for context-dependent deep neural networks
-
Xie Chen, Adam Eversole, Gang Li, Dong Yu, and Frank Seide, "Pipelined back-propagation for context-dependent deep neural networks.," in Proc. INTERSPEECH, 2012.
-
(2012)
Proc. INTERSPEECH
-
-
Chen, X.1
Eversole, A.2
Li, G.3
Yu, D.4
Seide, F.5
-
12
-
-
0031189914
-
Multitask learning
-
Rich Caruana, "Multitask learning," Machine Learning, vol. 28, no. 1,pp.41-75, 1997.
-
(1997)
Machine Learning
, vol.28
, Issue.1
, pp. 41-75
-
-
Caruana, R.1
-
13
-
-
84890545600
-
Multi-task learning in deep neural networks for improved phoneme recognition
-
M. Seltzer and J. Droppo, "Multi-task learning in deep neural networks for improved phoneme recognition," in Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Seltzer, M.1
Droppo, J.2
-
14
-
-
84890458846
-
Multitask learning in connectionist speech recognition'
-
Y. Lu, F. Lu, S. Sehgal, S. Gupta, J. Du, C. Tham, P. Green, and V. Wan, "Multitask learning in connectionist speech recognition'" in Proc. ASSTA, 2004.
-
(2004)
Proc. ASSTA
-
-
Lu, Y.1
Lu, F.2
Sehgal, S.3
Gupta, S.4
Du, J.5
Tham, C.6
Green, P.7
Wan, V.8
-
15
-
-
79951796005
-
The IBM Attila speech recognition toolkit
-
Hagen Soltau, George Saon, and Brian Kingsbury, "The IBM Attila speech recognition toolkit," in Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE, 2010, pp. 97-102.
-
(2010)
Spoken Language Technology Workshop (SLT), 2010 IEEE. IEEE
, pp. 97-102
-
-
Soltau, H.1
Saon, G.2
Kingsbury, B.3
-
16
-
-
84878379108
-
Scalable minimum Bayes risk training of neural network acoustic models using distributed Hessian-free optimization
-
B. Kingsbury, T. N. Sainath, and H. Soltau, "Scalable minimum Bayes risk training of neural network acoustic models using distributed Hessian-free optimization," in Proc. Interspeech, 2012.
-
(2012)
Proc. Interspeech
-
-
Kingsbury, B.1
Sainath, T.N.2
Soltau, H.3
-
17
-
-
79951609039
-
Front-end factor analysis for speaker verification
-
May
-
N. Dehak, P. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 4, May 2011.
-
(2011)
IEEE Trans. Audio, Speech and Language Processing
, vol.19
, Issue.4
-
-
Dehak, N.1
Kenny, P.2
Dehak, R.3
Dumouchel, P.4
Ouellet, P.5
-
18
-
-
84893691530
-
Speaker adaptation of neural network acoustic models using I-Vectors
-
George Saon, Hagen Soltau, David Nahamoo, and Michael Picheny, "Speaker adaptation of neural network acoustic models using I-Vectors," in Proc. ASRU, 2013.
-
(2013)
Proc. ASRU
-
-
Saon, G.1
Soltau, H.2
Nahamoo, D.3
Picheny, M.4
-
19
-
-
84905231268
-
The ibm spoken term detection system for the darpa rats program
-
Lidia Mangu, Hagen Soltau, Hong-Kwang Kuo, and George Saon, "The IBM Spoken Term Detection System for the DARPA RATS Program," in Proc. ASRU, 2013.
-
(2013)
Proc. ASRU
-
-
Mangu, L.1
Soltau, H.2
Kuo, H.-K.3
Saon, G.4
-
20
-
-
84906222432
-
The IBM speech activity detection system for the DARPA RATS program
-
George Saon, Samuel Thomas, Hagen Soltau, Sriram Ganapathy, and Brian Kingsbury, "The IBM speech activity detection system for the DARPA RATS program," in Proc. Interspeech, 2013.
-
(2013)
Proc. Interspeech
-
-
Saon, G.1
Thomas, S.2
Soltau, H.3
Ganapathy, S.4
Kingsbury, B.5
|