-
1
-
-
84973324686
-
Very deep multilingual convolutional neural networks for LVCSR
-
T. Sercu, C. Puhrsch, B. Kingsbury, and Y. LeCun, "Very deep multilingual convolutional neural networks for LVCSR," Proc. ICASSP, 2016.
-
(2016)
Proc. ICASSP
-
-
Sercu, T.1
Puhrsch, C.2
Kingsbury, B.3
LeCun, Y.4
-
2
-
-
84994298763
-
-
"IARPA Babel," http://www.iarpa.gov/index.php/researchprograms/babel.
-
IARPA Babel
-
-
-
4
-
-
84969584486
-
Batch normalization: Accelerating deep network training by reducing internal covariate shift
-
S. Ioffe and C. Szegedy, "Batch normalization: Accelerating deep network training by reducing internal covariate shift," Proc. ICML, 2015.
-
(2015)
Proc. ICML
-
-
Ioffe, S.1
Szegedy, C.2
-
5
-
-
84890525984
-
Deep convolutional neural networks for LVCSR
-
T. N. Sainath, A.-R. Mohamed, B. Kingsbury, and B. Ramabhadran, "Deep convolutional neural networks for LVCSR," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 8614-8618.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE
, pp. 8614-8618
-
-
Sainath, T.N.1
Mohamed, A.-R.2
Kingsbury, B.3
Ramabhadran, B.4
-
6
-
-
84905265980
-
Joint training of convolutional and non-convolutional neural networks
-
H. Soltau, G. Saon, and T. N. Sainath, "Joint training of convolutional and non-convolutional neural networks," in Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Soltau, H.1
Saon, G.2
Sainath, T.N.3
-
7
-
-
84959129849
-
The IBM 2015 English conversational telephone speech recognition system
-
G. Saon, H.-K. J. Kuo, S. Rennie, and M. Picheny, "The IBM 2015 English conversational telephone speech recognition system," Proc. Interspeech, 2015.
-
(2015)
Proc. Interspeech
-
-
Saon, G.1
Kuo, H.-K.J.2
Rennie, S.3
Picheny, M.4
-
8
-
-
84994201246
-
-
G. Saon, T. Sercu, S. Rennie, and H.-K. J. Kuo, "The IBM 2016 English conversational telephone speech recognition system," -, 2016.
-
(2016)
The IBM 2016 English Conversational Telephone Speech Recognition System
-
-
Saon, G.1
Sercu, T.2
Rennie, S.3
Kuo, H.-K.J.4
-
9
-
-
11144321031
-
Convolutional networks for images, speech, and time series
-
Y. LeCun and Y. Bengio, "Convolutional networks for images, speech, and time series," The handbook of brain theory and neural networks, vol. 3361, no. 10, p. 1995, 1995.
-
(1995)
The Handbook of Brain Theory and Neural Networks
, vol.3361
, Issue.10
, pp. 1995
-
-
LeCun, Y.1
Bengio, Y.2
-
10
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition," Proceedings of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
11
-
-
84990044091
-
Torch7: A matlab-like environment for machine learning
-
R. Collobert, K. Kavukcuoglu, and C. Farabet, "Torch7: A matlab-like environment for machine learning," in BigLearn, NIPS Workshop, no. EPFL-CONF-192376, 2011.
-
(2011)
BigLearn, NIPS Workshop, No. EPFL-CONF-192376
-
-
Collobert, R.1
Kavukcuoglu, K.2
Farabet, C.3
-
12
-
-
84897510162
-
On the importance of initialization and momentum in deep learning
-
I. Sutskever, J. Martens, G. Dahl, and G. Hinton, "On the importance of initialization and momentum in deep learning," in Proc. ICML, 2013, pp. 1139-1147.
-
(2013)
Proc. ICML
, pp. 1139-1147
-
-
Sutskever, I.1
Martens, J.2
Dahl, G.3
Hinton, G.4
-
13
-
-
84890543852
-
Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription
-
H. Su, G. Li, D. Yu, and F. Seide, "Error back propagation for sequence training of context-dependent deep networks for conversational speech transcription," in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 6664-6668.
-
(2013)
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE
, pp. 6664-6668
-
-
Su, H.1
Li, G.2
Yu, D.3
Seide, F.4
-
14
-
-
84876231242
-
Imagenet classification with deep convolutional neural networks
-
A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," in Advances in neural information processing systems, 2012, pp. 1097-1105.
-
(2012)
Advances in Neural Information Processing Systems
, pp. 1097-1105
-
-
Krizhevsky, A.1
Sutskever, I.2
Hinton, G.E.3
-
15
-
-
84906347546
-
-
arXiv preprint arXiv:1312.6229
-
P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. LeCun, "Overfeat: Integrated recognition, localization and detection using convolutional networks," arXiv preprint arXiv:1312.6229, 2013.
-
(2013)
Overfeat: Integrated Recognition, Localization and Detection Using Convolutional Networks
-
-
Sermanet, P.1
Eigen, D.2
Zhang, X.3
Mathieu, M.4
Fergus, R.5
LeCun, Y.6
-
16
-
-
84876258641
-
Learning hierarchical features for scene labeling
-
C. Farabet, C. Couprie, L. Najman, and Y. LeCun, "Learning hierarchical features for scene labeling," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 35, no. 8, pp. 1915-1929, 2013.
-
(2013)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.35
, Issue.8
, pp. 1915-1929
-
-
Farabet, C.1
Couprie, C.2
Najman, L.3
LeCun, Y.4
-
17
-
-
84867605836
-
Applying convolutional neural networks concepts to hybrid nn-hmm model for speech recognition
-
O. Abdel-Hamid, A.-R. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid nn-hmm model for speech recognition," in Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on. IEEE, 2012, pp. 4277-4280.
-
(2012)
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference On. IEEE
, pp. 4277-4280
-
-
Abdel-Hamid, O.1
Mohamed, A.-R.2
Jiang, H.3
Penn, G.4
-
19
-
-
84978952442
-
-
arXiv preprint arXiv:1508.06615
-
Y. Kim, Y. Jernite, D. Sontag, and A. M. Rush, "Character-aware neural language models," arXiv preprint arXiv:1508.06615, 2015.
-
(2015)
Character-aware Neural Language Models
-
-
Kim, Y.1
Jernite, Y.2
Sontag, D.3
Rush, A.M.4
-
20
-
-
84939821074
-
-
arXiv preprint arXiv:1502.03044
-
K. Xu, J. Ba, R. Kiros, A. Courville, R. Salakhutdinov, R. Zemel, and Y. Bengio, "Show, attend and tell: Neural image caption generation with visual attention," arXiv preprint arXiv:1502.03044, 2015.
-
(2015)
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
-
-
Xu, K.1
Ba, J.2
Kiros, R.3
Courville, A.4
Salakhutdinov, R.5
Zemel, R.6
Bengio, Y.7
-
23
-
-
0024634603
-
Phoneme recognition using time-delay neural networks
-
A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, and K. J. Lang, "Phoneme recognition using time-delay neural networks," Acoustics, Speech and Signal Processing, IEEE Transactions on, vol. 37, no. 3, pp. 328-339, 1989.
-
(1989)
Acoustics, Speech and Signal Processing, IEEE Transactions on
, vol.37
, Issue.3
, pp. 328-339
-
-
Waibel, A.1
Hanazawa, T.2
Hinton, G.3
Shikano, K.4
Lang, K.J.5
-
24
-
-
84951162898
-
Deep convolutional neural networks for large-scale speech tasks
-
T. N. Sainath, B. Kingsbury, G. Saon, H. Soltau, A.-R. Mohamed, G. Dahl, and B. Ramabhadran, "Deep convolutional neural networks for large-scale speech tasks," Neural Networks, 2014.
-
(2014)
Neural Networks
-
-
Sainath, T.N.1
Kingsbury, B.2
Saon, G.3
Soltau, H.4
Mohamed, A.-R.5
Dahl, G.6
Ramabhadran, B.7
-
25
-
-
84959078561
-
Convolutional neural networks for small-footprint keyword spotting
-
T. Sainath and C. Parada, "Convolutional neural networks for small-footprint keyword spotting," in Proc. Interspeech, 2015.
-
(2015)
Proc. Interspeech
-
-
Sainath, T.1
Parada, C.2
-
27
-
-
84971463350
-
-
CoRR arXiv:1512.02595
-
D. Amodei, R. Anubhai, E. Battenberg, C. Case, J. Casper, B. Catanzaro, J. Chen, M. Chrzanowski, A. Coates, G. Diamos et al., "Deep speech 2: End-to-end speech recognition in English and Mandarin," CoRR arXiv:1512.02595, 2015.
-
(2015)
Deep Speech 2: End-to-end Speech Recognition in English and Mandarin
-
-
Amodei, D.1
Anubhai, R.2
Battenberg, E.3
Case, C.4
Casper, J.5
Catanzaro, B.6
Chen, J.7
Chrzanowski, M.8
Coates, A.9
Diamos, G.10
-
28
-
-
70349213445
-
Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling
-
B. Kingsbury, "Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling," in Proc. ICASSP. IEEE, 2009, pp. 3761-3764.
-
(2009)
Proc. ICASSP IEEE
, pp. 3761-3764
-
-
Kingsbury, B.1
-
30
-
-
84905233897
-
Mean-normalized stochastic gradient for large-scale deep learning
-
S. Wiesler, A. Richard, R. Schluter, and H. Ney, "Mean-normalized stochastic gradient for large-scale deep learning," in Proc. ICASSP. IEEE, 2014, pp. 180-184.
-
(2014)
Proc. ICASSP IEEE
, pp. 180-184
-
-
Wiesler, S.1
Richard, A.2
Schluter, R.3
Ney, H.4
-
31
-
-
84990032289
-
-
CoRR arXiv:1512.00567
-
C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the inception architecture for computer vision," CoRR arXiv:1512.00567, 2015.
-
(2015)
Rethinking the Inception Architecture for Computer Vision
-
-
Szegedy, C.1
Vanhoucke, V.2
Ioffe, S.3
Shlens, J.4
Wojna, Z.5
-
32
-
-
84958589374
-
-
CoRR arXiv:1512.03385
-
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," CoRR arXiv:1512.03385, 2015.
-
(2015)
Deep Residual Learning for Image Recognition
-
-
He, K.1
Zhang, X.2
Ren, S.3
Sun, J.4
-
33
-
-
84973326024
-
Batch normalized recurrent neural networks
-
C. Laurent, G. Pereyra, P. Brakel, Y. Zhang, and Y. Bengio, "Batch normalized recurrent neural networks," Proc. ICASSP, 2016.
-
(2016)
Proc. ICASSP
-
-
Laurent, C.1
Pereyra, G.2
Brakel, P.3
Zhang, Y.4
Bengio, Y.5