-
3
-
-
85018926397
-
Efficient batchwise dropout training using submatrices
-
Benjamin Graham, Jeremy Reizenstein, and Leigh Robinson. Efficient batchwise dropout training using submatrices. CoRR, abs/1502.02478, 2015.
-
(2015)
CoRR
-
-
Graham, B.1
Reizenstein, J.2
Robinson, L.3
-
4
-
-
84867720412
-
-
Improving neural networks by preventing co-adaptation of feature detectors
-
Geoffrey E Hinton, Nitish Srivastava, Alex Krizhevsky, Ilya Sutskever, and Ruslan R Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580, 2012.
-
(2012)
arXiv preprint arXiv:1207.0580
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.R.5
-
6
-
-
85083951076
-
Adam: A method for stochastic optimization
-
Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization. CoRR, abs/1412.6980, 2014.
-
(2014)
CoRR
-
-
Kingma, D.P.1
Ba, J.2
-
7
-
-
85014556040
-
Variational dropout and the local reparameterization trick
-
Diederik P. Kingma, Tim Salimans, and Max Welling. Variational dropout and the local reparameterization trick. CoRR, abs/1506.02557, 2015.
-
(2015)
CoRR
-
-
Kingma, D.P.1
Salimans, T.2
Welling, M.3
-
10
-
-
0032203257
-
Gradient-based learning applied to document recognition
-
Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
-
(1998)
Proceedings of the IEEE
, vol.86
, Issue.11
, pp. 2278-2324
-
-
LeCun, Y.1
Bottou, L.2
Bengio, Y.3
Haffner, P.4
-
11
-
-
84865114495
-
Reading digits in natural images with unsupervised feature learning
-
Granada, Spain
-
Yuval Netzer, Tao Wang, Adam Coates, Alessandro Bissacco, Bo Wu, and Andrew Y Ng. Reading digits in natural images with unsupervised feature learning. In NIPS workshop on deep learning and unsupervised feature learning, volume 2011, page 4. Granada, Spain, 2011.
-
(2011)
NIPS Workshop on Deep Learning and Unsupervised Feature Learning
, vol.2011
, pp. 4
-
-
Netzer, Y.1
Wang, T.2
Coates, A.3
Bissacco, A.4
Wu, B.5
Ng, A.Y.6
-
13
-
-
84862277721
-
Factored 3-way restricted Boltzmann machines for modeling natural images
-
Marc'Aurelio Ranzato, Alex Krizhevsky, and Geoffrey E. Hinton. Factored 3-way restricted Boltzmann machines for modeling natural images. In AISTATS, pages 621-628, 2010.
-
(2010)
AISTATS
, pp. 621-628
-
-
Ranzato, M.1
Krizhevsky, A.2
Hinton, G.E.3
-
15
-
-
85162053390
-
Smoothness, low noise and fast rates
-
Nathan Srebro, Karthik Sridharan, and Ambuj Tewari. Smoothness, low noise and fast rates. In Advances in Neural Information Processing Systems 23 (NIPS), pages 2199-2207, 2010.
-
(2010)
Advances in Neural Information Processing Systems 23 (NIPS)
, pp. 2199-2207
-
-
Srebro, N.1
Sridharan, K.2
Tewari, A.3
-
16
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1):1929-1958, 2014.
-
(2014)
The Journal of Machine Learning Research
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
17
-
-
84897510162
-
On the importance of initialization and momentum in deep learning
-
Ilya Sutskever, James Martens, George Dahl, and Geoffrey Hinton. On the importance of initialization and momentum in deep learning. In Proceedings of the 30th international conference on machine learning (ICML-13), pages 1139-1147, 2013.
-
(2013)
Proceedings of the 30th International Conference on Machine Learning (ICML-13)
, pp. 1139-1147
-
-
Sutskever, I.1
Martens, J.2
Dahl, G.3
Hinton, G.4
-
19
-
-
84897550107
-
Regularization of neural networks using dropconnect
-
Li Wan, Matthew Zeiler, Sixin Zhang, Yann LeCun, and Rob Fergus. Regularization of neural networks using dropconnect. In Proceedings of the 30th International Conference on Machine Learning (ICML-13), pages 1058-1066, 2013.
-
(2013)
Proceedings of the 30th International Conference on Machine Learning (ICML-13)
, pp. 1058-1066
-
-
Wan, L.1
Zeiler, M.2
Zhang, S.3
LeCun, Y.4
Fergus, R.5
-
21
-
-
84926377572
-
Feature noising for log-linear structured prediction
-
Sida I Wang, Mengqiu Wang, Stefan Wager, Percy Liang, and Christopher D Manning. Feature noising for log-linear structured prediction. In EMNLP, pages 1170-1179, 2013.
-
(2013)
EMNLP
, pp. 1170-1179
-
-
Wang, S.I.1
Wang, M.2
Wager, S.3
Liang, P.4
Manning, C.D.5
-
23
-
-
84949895824
-
Adaptive dropout rates for learning with corrupted features
-
Jingwei Zhuo, Jun Zhu, and Bo Zhang. Adaptive dropout rates for learning with corrupted features. In IJCAI, pages 4126-4133, 2015.
-
(2015)
IJCAI
, pp. 4126-4133
-
-
Zhuo, J.1
Zhu, J.2
Zhang, B.3