-
1
-
-
84958264664
-
-
Software available from tensorflow.org
-
Abadi, Martín, Agarwal, Ashish, Barham, Paul, Brevdo, Eugene, Chen, Zhifeng, Citro, Craig, Corrado, Greg S., Davis, Andy, Dean, Jeffrey, Devin, Matthieu, Ghemawat, Sanjay, Goodfellow, Ian, Harp, Andrew, Irving, Geoffrey, Isard, Michael, Jia, Yangqing, Jozefowicz, Rafal, Kaiser, Lukasz, Kudlur, Manjunath, Leven-berg, Josh, Mané, Dan, Monga, Rajat, Moore, Sherry, Murray, Derek, Olah, Chris, Schuster, Mike, Shlens, Jonathon, Steiner, Benoit, Sutskever, Ilya, Talwar, Kunal, Tucker, Paul, Vanhoucke, Vincent, Vasudevan, Vijay, Viégas, Fernanda, Vinyals, Oriol, Warden, Pete, Wattenberg, Martin, Wicke, Martin, Yu, Yuan, and Zheng, Xiaoqiang. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. URL http://tensorflow.org/. Software available from tensorflow.org.
-
(2015)
TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
-
-
Abadi, M.1
Agarwal, A.2
Barham, P.3
Brevdo, E.4
Chen, Z.5
Citro, C.6
Corrado, G.S.7
Davis, A.8
Dean, J.9
Devin, M.10
Ghemawat, S.11
Goodfellow, I.12
Harp, A.13
Irving, G.14
Isard, M.15
Jia, Y.16
Jozefowicz, R.17
Kaiser, L.18
Kudlur, M.19
Levenberg, J.20
Mané, D.21
Monga, R.22
Moore, S.23
Murray, D.24
Olah, C.25
Schuster, M.26
Shlens, J.27
Steiner, B.28
Sutskever, I.29
Talwar, K.30
Tucker, P.31
Vanhoucke, V.32
Vasudevan, V.33
Viégas, F.34
Vinyals, O.35
Warden, P.36
Wattenberg, M.37
Wicke, M.38
Yu, Y.39
Zheng, X.40
more..
-
2
-
-
33749545215
-
Model compression
-
Buciluǎ, Cristian, Caruana, Rich, and Niculescu-Mizil, Alexandru. Model compression. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 535–541. ACM, 2006.
-
(2006)
Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, pp. 535-541
-
-
Buciluǎ, C.1
Caruana, R.2
Niculescu-Mizil, A.3
-
3
-
-
0000155950
-
-
Denver, CO, Morgan Kaufmann, San Mateo
-
Fahlman, Scott E. and Lebiere, Christian. The cascade-correlation learning architecture. pp. 524–532, Denver, CO, 1990. Morgan Kaufmann, San Mateo.
-
(1990)
The Cascade-Correlation Learning Architecture
, pp. 524-532
-
-
Fahlman, S.E.1
Lebiere, C.2
-
5
-
-
84897543523
-
Maxout networks
-
Goodfellow, Ian J., Warde-Farley, David, Mirza, Mehdi, Courville, Aaron, and Bengio, Yoshua. Maxout networks. In ICML’2013, 2013.
-
(2013)
ICML’2013
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.4
Bengio, Y.5
-
6
-
-
46249099599
-
Knowledge transfer in deep convolutional neural nets
-
Gutstein, Steven, Fuentes, Olac, and Freudenthal, Eric. Knowledge transfer in deep convolutional neural nets. International Journal on Artificial Intelligence Tools, 17(03):555–567, 2008.
-
(2008)
International Journal on Artificial Intelligence Tools
, vol.17
, Issue.3
, pp. 555-567
-
-
Gutstein, S.1
Fuentes, O.2
Freudenthal, E.3
-
8
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
Hinton, Geoffrey E., Osindero, Simon, and Teh, Yee Whye. A fast learning algorithm for deep belief nets. Neural Computation, 18:1527–1554, 2006.
-
(2006)
Neural Computation
, vol.18
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.W.3
-
10
-
-
77953183471
-
What is the best multi-stage architecture for object recognition?
-
Jarrett, Kevin, Kavukcuoglu, Koray, Ranzato, Marc’Aurelio, and LeCun, Yann. What is the best multi-stage architecture for object recognition? In Proc. International Conference on Computer Vision (ICCV’09), pp. 2146–2153. IEEE, 2009.
-
(2009)
Proc. International Conference on Computer Vision (ICCV’09)
, pp. 2146-2153
-
-
Jarrett, K.1
Kavukcuoglu, K.2
Ranzato, M.3
LeCun, Y.4
-
11
-
-
85070990006
-
-
Technical report, (unpublished)
-
Mahayri, Amjad, Ballas, Nicolas, and Courville, Aaron. FitNets and batch normalization. Technical report, (unpublished), 2015.
-
(2015)
FitNets and Batch Normalization
-
-
Mahayri, A.1
Ballas, N.2
Courville, A.3
-
12
-
-
84949887093
-
Never-ending learning
-
Mitchell, T., Cohen, W., Hruschka, E., Talukdar, P., Betteridge, J., Carlson, A., Dalvi, B., Gardner, M., Kisiel, B., Krishnamurthy, J., Lao, N., Mazaitis, K., Mohamed, T., Nakashole, N., Platanios, E., Ritter, A., Samadi, M., Settles, B., Wang, R., Wijaya, D., Gupta, A., Chen, X., Saparov, A., Greaves, M., and Welling, J. Never-ending learning. In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15), 2015.
-
(2015)
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI-15)
-
-
Mitchell, T.1
Cohen, W.2
Hruschka, E.3
Talukdar, P.4
Betteridge, J.5
Carlson, A.6
Dalvi, B.7
Gardner, M.8
Kisiel, B.9
Krishnamurthy, J.10
Lao, N.11
Mazaitis, K.12
Mohamed, T.13
Nakashole, N.14
Platanios, E.15
Ritter, A.16
Samadi, M.17
Settles, B.18
Wang, R.19
Wijaya, D.20
Gupta, A.21
Chen, X.22
Saparov, A.23
Greaves, M.24
Welling, J.25
more..
-
13
-
-
84964544562
-
-
Technical Report arXiv
-
Romero, Adriana, Ballas, Nicolas, Ebrahimi Kahou, Samira, Chassang, Antoine, Gatta, Carlo, and Bengio, Yoshua. FitNets: Hints for thin deep nets. Technical Report Arxiv report 1412.6550, arXiv, 2014.
-
(2014)
FitNets: Hints for Thin Deep Nets
-
-
Romero, A.1
Ballas, N.2
Ebrahimi Kahou, S.3
Chassang, A.4
Gatta, C.5
Bengio, Y.6
-
15
-
-
85083953063
-
Very deep convolutional networks for large-scale image recognition
-
Simonyan, Karen and Zisserman, Andrew. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
-
(2015)
ICLR
-
-
Simonyan, K.1
Zisserman, A.2
-
16
-
-
84893795422
-
Parsing with compositional vector grammars
-
Socher, Richard, Bauer, John, Manning, Christopher D., and Ng, Andrew Y. Parsing with compositional vector grammars. In In Proceedings of the ACL conference, 2013.
-
(2013)
Proceedings of the ACL Conference
-
-
Socher, R.1
Bauer, J.2
Manning, C.D.3
Ng, A.Y.4
-
17
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
Srivastava, Nitish, Hinton, Geoffrey, Krizhevsky, Alex, Sutskever, Ilya, and Salakhutdinov, Ruslan. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15: 1929–1958, 2014. URL http://jmlr.org/papers/v15/srivastava14a.html.
-
(2014)
Journal of Machine Learning Research
, vol.15
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
18
-
-
84964983441
-
-
Technical report
-
Szegedy, Christian, Liu, Wei, Jia, Yangqing, Sermanet, Pierre, Reed, Scott, Anguelov, Dragomir, Erhan, Du-mitru, Vanhoucke, Vincent, and Rabinovich, Andrew. Going deeper with convolutions. Technical report, arXiv:1409.4842, 2014.
-
(2014)
Going Deeper with Convolutions
-
-
Szegedy, C.1
Liu, W.2
Jia, Y.3
Sermanet, P.4
Reed, S.5
Anguelov, D.6
Erhan, D.-M.7
Vanhoucke, V.8
Rabinovich, A.9
-
19
-
-
2342641167
-
-
Technical Report CMU-CS-95-208, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, November
-
Thrun, Sebastian. Lifelong learning: A case study. Technical Report CMU-CS-95-208, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, November 1995.
-
(1995)
Lifelong Learning: A Case Study
-
-
Thrun, S.1
-
20
-
-
84893343292
-
Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
-
Tieleman, T and Hinton, G. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4, 2012.
-
(2012)
COURSERA: Neural Networks for Machine Learning
, vol.4
-
-
Tieleman, T.1
Hinton, G.2
|