-
1
-
-
84958264664
-
-
Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, et al. TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. URL http://tensorflow.org/. Software available from tensorflow.org.
-
(2015)
TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
-
-
Abadi, M.1
Agarwal, A.2
Barham, P.3
Brevdo, E.4
Chen, Z.5
Citro, C.6
Corrado, G.S.7
Davis, A.8
Dean, J.9
Devin, M.10
-
2
-
-
85018882182
-
Interaction networks for learning about objects, relations and physics
-
Peter Battaglia, Razvan Pascanu, Matthew Lai, Danilo Jimenez Rezende, and Koray Kavukcuoglu. Interaction networks for learning about objects, relations and physics. Advances in Neural Information Processing Systems, 2016.
-
(2016)
Advances in Neural Information Processing Systems
-
-
Battaglia, P.1
Pascanu, R.2
Lai, M.3
Rezende, D.J.4
Kavukcuoglu, K.5
-
5
-
-
84965103751
-
Learning continuous control policies by stochastic value gradients
-
Nicolas Heess, Gregory Wayne, David Silver, Tim Lillicrap, Tom Erez, and Yuval Tassa. Learning continuous control policies by stochastic value gradients. Advances in Neural Information Processing Systems, 2015.
-
(2015)
Advances in Neural Information Processing Systems
-
-
Heess, N.1
Wayne, G.2
Silver, D.3
Lillicrap, T.4
Erez, T.5
Tassa, Y.6
-
7
-
-
84999036937
-
Asynchronous methods for deep reinforcement learning
-
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. Asynchronous methods for deep reinforcement learning. Proceedings of the 33rd International Conference on Machine Learning, 2016.
-
(2016)
Proceedings of the 33rd International Conference on Machine Learning
-
-
Mnih, V.1
Badia, A.P.2
Mirza, M.3
Graves, A.4
Lillicrap, T.P.5
Harley, T.6
Silver, D.7
Kavukcuoglu, K.8
-
9
-
-
0026155868
-
Principles of metareasoning
-
Stuart Russell and Eric Wefald. Principles of metareasoning. Artificial Intelligence, 49(1):361 - 395, 1991.
-
(1991)
Artificial Intelligence
, vol.49
, Issue.1
, pp. 361-395
-
-
Russell, S.1
Wefald, E.2
-
10
-
-
85018927054
-
-
Aäron van den Oord, Nal Kalchbrenner, Oriol Vinyals, Lasse Espeholt, Alex Graves, and Koray Kavukcuoglu. Conditional image generation with PixelCNN decoders. arXiv:1606.05328, 2016.
-
(2016)
Conditional Image Generation with PixelCNN Decoders
-
-
Van Den Oord, A.1
Kalchbrenner, N.2
Vinyals, O.3
Espeholt, L.4
Graves, A.5
Kavukcuoglu, K.6
-
11
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Ronald J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3-4):229-256, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 229-256
-
-
Williams, R.J.1
-
12
-
-
0041154467
-
Function optimization using connectionist reinforcement learning algorithms
-
Ronald J. Williams and Jing Peng. Function optimization using connectionist reinforcement learning algorithms. Connection Science, 3(3):241-268, 1991.
-
(1991)
Connection Science
, vol.3
, Issue.3
, pp. 241-268
-
-
Williams, R.J.1
Peng, J.2
|