-
1
-
-
84973911419
-
Delving deep into rectifiers: Surpassing human-level performance on imagenet classification
-
K. He, X. Zhang, S. Ren, and J. Sun, "Delving deep into rectifiers: Surpassing human-level performance on imagenet classification, " in Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 1026-1034.
-
(2015)
Proceedings of the IEEE International Conference on Computer Vision
, pp. 1026-1034
-
-
He, K.1
Zhang, X.2
Ren, S.3
Sun, J.4
-
2
-
-
85032751458
-
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups
-
G. Hinton, L. Deng, D. Yu, G. E. Dahl, A.-r. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, T. N. Sainath et al., "Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups, " Signal Processing Magazine, IEEE, vol. 29, no. 6, pp. 82-97, 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.E.4
Mohamed, A.-R.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, P.9
Sainath, T.N.10
-
3
-
-
84928547704
-
Sequence to sequence learning with neural networks
-
I. Sutskever, O. Vinyals, and Q. V. Le, "Sequence to sequence learning with neural networks, " in Advances in Neural Information Processing Systems, 2014, pp. 3104-3112.
-
(2014)
Advances in Neural Information Processing Systems
, pp. 3104-3112
-
-
Sutskever, I.1
Vinyals, O.2
Le, Q.V.3
-
4
-
-
84978835300
-
-
arXiv preprint arXiv:1506. 07285
-
A. Kumar, O. Irsoy, J. Su, J. Bradbury, R. English, B. Pierce, P. Ondruska, I. Gulrajani, and R. Socher, "Ask me anything: Dynamic memory networks for natural language processing, " arXiv preprint arXiv:1506. 07285, 2015.
-
(2015)
Ask Me Anything: Dynamic Memory Networks for Natural Language Processing
-
-
Kumar, A.1
Irsoy, O.2
Su, J.3
Bradbury, J.4
English, R.5
Pierce, B.6
Ondruska, P.7
Gulrajani, I.8
Socher, R.9
-
5
-
-
69349090197
-
Learning deep architectures for AI
-
Y. Bengio, "Learning deep architectures for AI, " Foundations and trendsR in Machine Learning, vol. 2, no. 1, pp. 1-127, 2009.
-
(2009)
Foundations and TrendsR in Machine Learning
, vol.2
, Issue.1
, pp. 1-127
-
-
Bengio, Y.1
-
6
-
-
84954310140
-
The loss surfaces of multilayer networks
-
A. Choromanska, M. Henaff, M. Mathieu, G. B. Arous, and Y. LeCun, "The loss surfaces of multilayer networks, " in International Conference on Artificial Intelligence and Statistics, 2015, pp. 192-204.
-
(2015)
International Conference on Artificial Intelligence and Statistics
, pp. 192-204
-
-
Choromanska, A.1
Henaff, M.2
Mathieu, M.3
Arous, G.B.4
LeCun, Y.5
-
7
-
-
59449087310
-
Exploring strategies for training deep neural networks
-
H. Larochelle, Y. Bengio, J. Louradour, and P. Lamblin, "Exploring strategies for training deep neural networks, " The Journal of Machine Learning Research, vol. 10, pp. 1-40, 2009.
-
(2009)
The Journal of Machine Learning Research
, vol.10
, pp. 1-40
-
-
Larochelle, H.1
Bengio, Y.2
Louradour, J.3
Lamblin, P.4
-
8
-
-
0041914606
-
Gradient flow in recurrent nets: The difficulty of learning long-term dependencies
-
IEEE Press
-
S. Hochreiter, Y. Bengio, P. Frasconi, and J. Schmidhuber, "Gradient flow in recurrent nets: the difficulty of learning long-term dependencies, " A field guide to dynamical recurrent neural networks. IEEE Press, 2001.
-
(2001)
A Field Guide to Dynamical Recurrent Neural Networks
-
-
Hochreiter, S.1
Bengio, Y.2
Frasconi, P.3
Schmidhuber, J.4
-
9
-
-
84862294866
-
Deep sparse rectifier networks
-
X. Glorot, A. Bordes, and Y. Bengio, "Deep sparse rectifier networks, " in Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP Volume, vol. 15, 2011, pp. 315-323.
-
(2011)
Proceedings of the 14th International Conference on Artificial Intelligence and Statistics. JMLR W&CP Volume
, vol.15
, pp. 315-323
-
-
Glorot, X.1
Bordes, A.2
Bengio, Y.3
-
10
-
-
84897543523
-
Maxout networks
-
I. Goodfellow, D. Warde-Farley, M. Mirza, A. Courville, and Y. Bengio, "Maxout networks, " in Proceedings of The 30th International Conference on Machine Learning, 2013, pp. 1319-1327.
-
(2013)
Proceedings of the 30th International Conference on Machine Learning
, pp. 1319-1327
-
-
Goodfellow, I.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
11
-
-
0031573117
-
Long short-term memory
-
S. Hochreiter and J. Schmidhuber, "Long short-term memory, " Neural computation, vol. 9, no. 8, pp. 1735-1780, 1997.
-
(1997)
Neural Computation
, vol.9
, Issue.8
, pp. 1735-1780
-
-
Hochreiter, S.1
Schmidhuber, J.2
-
13
-
-
84965164720
-
Training very deep networks
-
R. K. Srivastava, K. Greff, and J. Schmidhuber, "Training very deep networks, " in Advances in Neural Information Processing Systems, 2015, pp. 2368-2376.
-
(2015)
Advances in Neural Information Processing Systems
, pp. 2368-2376
-
-
Srivastava, R.K.1
Greff, K.2
Schmidhuber, J.3
-
14
-
-
84958589374
-
-
arXiv preprint arXiv:1512. 03385
-
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition, " arXiv preprint arXiv:1512. 03385, 2015.
-
(2015)
Deep Residual Learning for Image Recognition
-
-
He, K.1
Zhang, X.2
Ren, S.3
Sun, J.4
-
15
-
-
84943799837
-
-
arXiv preprint arXiv:1409. 1259
-
K. Cho, B. van Merriënboer, D. Bahdanau, and Y. Bengio, "On the properties of neural machine translation: Encoder-decoder approaches, " arXiv preprint arXiv:1409. 1259, 2014.
-
(2014)
On the Properties of Neural Machine Translation: Encoder-decoder Approaches
-
-
Cho, K.1
Van Merriënboer, B.2
Bahdanau, D.3
Bengio, Y.4
-
16
-
-
84961291190
-
Learning phrase representations using rnn encoder-decoder for statistical machine translation
-
K. Cho, B. Van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares, H. Schwenk, and Y. Bengio, "Learning phrase representations using rnn encoder-decoder for statistical machine translation, " in EMNLP, 2014, pp. 1724-1734.
-
(2014)
EMNLP
, pp. 1724-1734
-
-
Cho, K.1
Van Merriënboer, B.2
Gulcehre, C.3
Bahdanau, D.4
Bougares, F.5
Schwenk, H.6
Bengio, Y.7
-
17
-
-
84939821078
-
-
arXiv preprint arXiv:1412. 3555
-
J. Chung, C. Gulcehre, K. Cho, and Y. Bengio, "Empirical evaluation of gated recurrent neural networks on sequence modeling, " arXiv preprint arXiv:1412. 3555, 2014.
-
(2014)
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
-
-
Chung, J.1
Gulcehre, C.2
Cho, K.3
Bengio, Y.4
-
18
-
-
84897527816
-
-
T. Mikolov, I. Sutskever, A. Deoras, H.-S. Le, S. Kombrink, and J. Cernocky, "Subword language modeling with neural networks, " preprint (http://www. fit. vutbr. cz/imikolov/rnnlm/char. pdf), 2012.
-
(2012)
Subword Language Modeling with Neural Networks
-
-
Mikolov, T.1
Sutskever, I.2
Deoras, A.3
Le, H.-S.4
Kombrink, S.5
Cernocky, J.6
-
19
-
-
84978952442
-
-
arXiv preprint arXiv:1508. 06615
-
Y. Kim, Y. Jernite, D. Sontag, and A. M. Rush, "Character-aware neural language models, " arXiv preprint arXiv:1508. 06615, 2015.
-
(2015)
Character-aware Neural Language Models
-
-
Kim, Y.1
Jernite, Y.2
Sontag, D.3
Rush, A.M.4
-
20
-
-
85019122005
-
-
arXiv preprint arXiv:1602. 00357
-
T. Pham, T. Tran, D. Phung, and S. Venkatesh, "Deepcare: A deep dynamic memory model for predictive medicine, " arXiv preprint arXiv:1602. 00357, 2016.
-
(2016)
Deepcare: A Deep Dynamic Memory Model for Predictive Medicine
-
-
Pham, T.1
Tran, T.2
Phung, D.3
Venkatesh, S.4
|