-
2
-
-
85031088945
-
-
arXiv
-
Charles Beattie, Joel Z. Leibo, Denis Teplyashin, Tom Ward, Marcus Wainwright, Heinrich Küttler, Andrew Lefrancq, Simon Green, Victor Valdes, Amir Sadik, Julian Schrittwieser, Keith Anderson, Sarah York, Max Cant, Adam Cain, Adrian Bolton, Stephen Gaffney, Helen King, Demis Hassabis, Shane Legg, and Stig Petersen. Deepmind lab. In arXiv, 2016. URL https://arxiv.org/abs/1612.03801.
-
(2016)
Deepmind Lab
-
-
Beattie, C.1
Leibo, J.Z.2
Teplyashin, D.3
Ward, T.4
Wainwright, M.5
Küttler, H.6
Lefrancq, A.7
Green, S.8
Valdes, V.9
Sadik, A.10
Schrittwieser, J.11
Anderson, K.12
York, S.13
Cant, M.14
Cain, A.15
Bolton, A.16
Gaffney, S.17
King, H.18
Hassabis, D.19
Legg, S.20
Petersen, S.21
-
3
-
-
0035357061
-
A solution to the simultaneous localization and map building (slam) problem
-
MWM Gamini Dissanayake, Paul Newman, Steve Clark, Hugh F. Durrant-Whyte, and Michael Csorba. A solution to the simultaneous localization and map building (slam) problem. IEEE Transactions on Robotics and Automation, 17(3):229-241, 2001.
-
(2001)
IEEE Transactions on Robotics and Automation
, vol.17
, Issue.3
, pp. 229-241
-
-
Gamini Dissanayake, M.W.M.1
Newman, P.2
Clark, S.3
Durrant-Whyte, H.F.4
Csorba, M.5
-
5
-
-
84890543083
-
Speech recognition with deep recurrent neural networks
-
Alex Graves, Abdel-rahman Mohamed, and Geoffrey Hinton. Speech recognition with deep recurrent neural networks. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP, 2013.
-
(2013)
Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP
-
-
Graves, A.1
Mohamed, A.2
Hinton, G.3
-
6
-
-
84993949467
-
Hybrid computing using a neural network with dynamic external memory
-
Alex Graves, Greg Wayne, Malcolm Reynolds, Tim Harley, Ivo Danihelka, Agnieszka Grabska-Barwińska, Sergio Gómez Colmenarejo, Edward Grefenstette, Tiago Ramalho, John Agapiou, et al. Hybrid computing using a neural network with dynamic external memory. Nature, 2016.
-
(2016)
Nature
-
-
Graves, A.1
Wayne, G.2
Reynolds, M.3
Harley, T.4
Danihelka, I.5
Grabska-Barwinska, A.6
Colmenarejo, S.G.7
Grefenstette, E.8
Ramalho, T.9
Agapiou, J.10
-
8
-
-
85088229768
-
Reinforcement learning with unsupervised auxiliary tasks
-
Max Jaderberg, Volodymyr Mnih, Wojciech Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, and Koray Kavukcuoglu. Reinforcement learning with unsupervised auxiliary tasks. In Submitted to Int'l Conference on Learning Representations, ICLR, 2017.
-
(2017)
Submitted to Int'l Conference on Learning Representations, ICLR
-
-
Jaderberg, M.1
Mnih, V.2
Czarnecki, W.3
Schaul, T.4
Leibo, J.Z.5
Silver, D.6
Kavukcuoglu, K.7
-
10
-
-
85041964859
-
-
CoRR, abs/1606.02396
-
Tejas D. Kulkarni, Ardavan Saeedi, Simanta Gautam, and Samuel J. Gershman. Deep successor reinforcement learning. CoRR, abs/1606.02396, 2016. URL http://arxiv.org/abs/1606.02396.
-
(2016)
Deep Successor Reinforcement Learning
-
-
Kulkarni, T.D.1
Saeedi, A.2
Gautam, S.3
Gershman, S.J.4
-
12
-
-
85063887900
-
Recurrent reinforcement learning: A hybrid approach
-
Xiujun Li, Lihong Li, Jianfeng Gao, Xiaodong He, Jianshu Chen, Li Deng, and Ji He. Recurrent reinforcement learning: A hybrid approach. In Proceedings of the International Conference on Learning Representations, ICLR, 2016. URL https://arxiv.org/abs/1509.03044.
-
(2016)
Proceedings of the International Conference on Learning Representations, ICLR
-
-
Li, X.1
Li, L.2
Gao, J.3
He, X.4
Chen, J.5
Deng, L.6
He, J.7
-
15
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, et al. Human-level control through deep reinforcement learning. Nature, 518:529-533, 2015.
-
(2015)
Nature
, vol.518
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
-
16
-
-
84999036937
-
Asynchronous methods for deep reinforcement learning
-
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. Asynchronous methods for deep reinforcement learning. In Proc. of Int'l Conf. on Machine Learning, ICML, 2016.
-
(2016)
Proc. Of Int'l Conf. On Machine Learning, ICML
-
-
Mnih, V.1
Badia, A.P.2
Mirza, M.3
Graves, A.4
Lillicrap, T.P.5
Harley, T.6
Silver, D.7
Kavukcuoglu, K.8
-
17
-
-
84980007683
-
Massively parallel methods for deep reinforcement learning
-
Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, et al. Massively parallel methods for deep reinforcement learning. In Proceedings of the International Conference on Machine Learning Deep Learning Workshop, ICML, 2015.
-
(2015)
Proceedings of the International Conference on Machine Learning Deep Learning Workshop, ICML
-
-
Nair, A.1
Srinivasan, P.2
Blackwell, S.3
Alcicek, C.4
Fearon, R.5
-
19
-
-
84999048282
-
Control of memory, active perception, and action in minecraft
-
Junhyuk Oh, Valliappa Chockalingam, Satinder P. Singh, and Honglak Lee. Control of memory, active perception, and action in minecraft. In Proc. of International Conference on Machine Learning, ICML, 2016.
-
(2016)
Proc. Of International Conference on Machine Learning, ICML
-
-
Oh, J.1
Chockalingam, V.2
Singh, S.P.3
Lee, H.4
-
20
-
-
0018633672
-
Hippocampus, space, and memory
-
03
-
David S Olton, James T Becker, and Gail E Handelmann. Hippocampus, space, and memory. Behavioral and Brain Sciences, 2(03):313-322, 1979.
-
(1979)
Behavioral and Brain Sciences
, vol.2
, pp. 313-322
-
-
Olton, D.S.1
Becker, J.T.2
Handelmann, G.E.3
-
22
-
-
84965136229
-
Semi-supervised learning with ladder networks
-
Antti Rasmus, Mathias Berglund, Mikko Honkala, Harri Valpola, and Tapani Raiko. Semi-supervised learning with ladder networks. In Advances in Neural Information Processing Systems, NIPS, 2015.
-
(2015)
Advances in Neural Information Processing Systems, NIPS
-
-
Rasmus, A.1
Berglund, M.2
Honkala, M.3
Valpola, H.4
Raiko, T.5
-
23
-
-
84998645243
-
Rule-injection hints as a means of improving network performance and learning time
-
Springer
-
Steven C Suddarth and YL Kergosien. Rule-injection hints as a means of improving network performance and learning time. In Neural Networks, pp. 120-129. Springer, 1990.
-
(1990)
Neural Networks
, pp. 120-129
-
-
Suddarth, S.C.1
Kergosien, Y.L.2
-
24
-
-
0033170372
-
Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning
-
Richard S Sutton, Doina Precup, and Satinder Singh. Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning. Artificial intelligence, 112(1):181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
-
26
-
-
85019201204
-
-
CoRR, abs/1604.07255
-
Chen Tessler, Shahar Givony, Tom Zahavy, Daniel J. Mankowitz, and Shie Mannor. A deep hierarchical approach to lifelong learning in minecraft. CoRR, abs/1604.07255, 2016. URL http://arxiv.org/abs/1604.07255.
-
(2016)
A Deep Hierarchical Approach to Lifelong Learning in Minecraft
-
-
Tessler, C.1
Givony, S.2
Zahavy, T.3
Mankowitz, D.J.4
Mannor, S.5
-
27
-
-
84893343292
-
Lecture 6.5 - RMSprop: Divide the gradient by a running average of its recent magnitude
-
Tijmen Tieleman and Geoffrey Hinton. Lecture 6.5 - rmsprop: Divide the gradient by a running average of its recent magnitude. In Coursera: Neural Networks for Machine Learning, volume 4, 2012.
-
(2012)
Coursera: Neural Networks for Machine Learning
, vol.4
-
-
Tieleman, T.1
Hinton, G.2
-
30
-
-
84998996019
-
Augmenting supervised neural networks with unsupervised objectives for large-scale image classification
-
Yuting Zhang, Kibok Lee, and Honglak Lee. Augmenting supervised neural networks with unsupervised objectives for large-scale image classification. In Proc. of International Conference on Machine Learning, ICML, 2016.
-
(2016)
Proc. Of International Conference on Machine Learning, ICML
-
-
Zhang, Y.1
Lee, K.2
Lee, H.3
-
31
-
-
84965095288
-
Stacked what-where auto-encoders
-
Junbo Zhao, Michaël Mathieu, Ross Goroshin, and Yann LeCun. Stacked what-where auto-encoders. Int'l Conf. on Learning Representations (Workshop), ICLR, 2015. URL http://arxiv.org/abs/1506.02351.
-
(2015)
Int'l Conf. On Learning Representations (Workshop), ICLR
-
-
Zhao, J.1
Mathieu, M.2
Goroshin, R.3
LeCun, Y.4
-
32
-
-
85027704536
-
-
CoRR, abs/1609.05143
-
Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, Joseph J. Lim, Abhinav Gupta, Li Fei-Fei, and Ali Farhadi. Target-driven visual navigation in indoor scenes using deep reinforcement learning. CoRR, abs/1609.05143, 2016. URL http://arxiv.org/abs/1609.05143.
-
(2016)
Target-Driven Visual Navigation in Indoor Scenes Using Deep Reinforcement Learning
-
-
Zhu, Y.1
Mottaghi, R.2
Kolve, E.3
Lim, J.J.4
Gupta, A.5
Fei-Fei, L.6
Farhadi, A.7