-
1
-
-
84884276459
-
Reinforcement learning in robotics: A survey
-
J. Kober, J. A. Bagnell, and J. Peters, "Reinforcement learning in robotics: A survey", International Journal of Robotics Research, vol. 32, no. 11, pp. 1238-1274, 2013.
-
(2013)
International Journal of Robotics Research
, vol.32
, Issue.11
, pp. 1238-1274
-
-
Kober, J.1
Bagnell, J.A.2
Peters, J.3
-
2
-
-
84943767635
-
-
arXiv:1504.00702 cs. LG
-
S. Levine, C. Finn, T. Darrell, and P. Abbeel, "End-to-end training of deep visuomotor policies", 2015, arXiv:1504.00702 [cs. LG].
-
(2015)
End-to-end Training of Deep Visuomotor Policies
-
-
Levine, S.1
Finn, C.2
Darrell, T.3
Abbeel, P.4
-
3
-
-
84971448181
-
-
arXiv:1602.01783 cs. LG
-
V. Mnih, A. P. Badia, M. Mirza, A. Graves, T. P. Lillicrap, T. Harley, D. Silver, and K. Kavukcuoglu, "Asynchronous methods for deep reinforcement learning", 2016, arXiv:1602.01783 [cs. LG].
-
(2016)
Asynchronous Methods for Deep Reinforcement Learning
-
-
Mnih, V.1
Badia, A.P.2
Mirza, M.3
Graves, A.4
Lillicrap, T.P.5
Harley, T.6
Silver, D.7
Kavukcuoglu, K.8
-
4
-
-
84963949906
-
Mastering the game of go with deep neural networks and tree search
-
D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. V. D. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, I. Sutskever, T. Lillicrap, M. Leach, K. Kavukcuoglu, T. Graepel, and D. Hassabis, "Mastering the game of go with deep neural networks and tree search", Nature, vol. 529, no. 7585, pp. 484-489, 2016.
-
(2016)
Nature
, vol.529
, Issue.7585
, pp. 484-489
-
-
Silver, D.1
Huang, A.2
Maddison, C.J.3
Guez, A.4
Sifre, L.5
Driessche, G.V.D.6
Schrittwieser, J.7
Antonoglou, I.8
Panneershelvam, V.9
Lanctot, M.10
Dieleman, S.11
Grewe, D.12
Nham, J.13
Kalchbrenner, N.14
Sutskever, I.15
Lillicrap, T.16
Leach, M.17
Kavukcuoglu, K.18
Graepel, T.19
Hassabis, D.20
more..
-
5
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L.-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching", Machine Learning, vol. 8, no. 3-4, pp. 293-321, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 293-321
-
-
Lin, L.-J.1
-
6
-
-
85083953657
-
Continuous control with deep reinforcement learning
-
T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning", in International Conference on Learning Representations (ICLR), 2016.
-
(2016)
International Conference on Learning Representations (ICLR)
-
-
Lillicrap, T.P.1
Hunt, J.J.2
Pritzel, A.3
Heess, N.4
Erez, T.5
Tassa, Y.6
Silver, D.7
Wierstra, D.8
-
7
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, "Human-level control through deep reinforcement learning", Nature, vol. 518, no. 7540, pp. 529-533, 2015.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
Petersen, S.11
Beattie, C.12
Sadik, A.13
Antonoglou, I.14
King, H.15
Kumaran, D.16
Wierstra, D.17
Legg, S.18
Hassabis, D.19
-
8
-
-
85006438211
-
-
deep Reinforcement Learning Workshop, Advances in Neural Information Processing Systems NIPS
-
T. de Bruin, J. Kober, K. Tuyls, and R. Babuška, "The importance of experience replay database composition in deep reinforcement learning", 2015, deep Reinforcement Learning Workshop, Advances in Neural Information Processing Systems (NIPS).
-
(2015)
The Importance of Experience Replay Database Composition in Deep Reinforcement Learning
-
-
De Bruin, T.1
Kober, J.2
Tuyls, K.3
Babuška, R.4
-
9
-
-
84980041049
-
-
arXiv:1511.05952 cs. LG
-
T. Schaul, J. Quan, I. Antonoglou, and D. Silver, "Prioritized experience replay", 2015, arXiv:1511.05952 [cs. LG].
-
(2015)
Prioritized Experience Replay
-
-
Schaul, T.1
Quan, J.2
Antonoglou, I.3
Silver, D.4
-
10
-
-
0004135065
-
-
Springer
-
G. Montavon, G. B. Orr, and K.-R. Müller, Eds., Neural Networks: Tricks of the Trade, 2nd ed., ser. Lecture Notes in Computer Science (LNCS). Springer, 2012, vol. 7700.
-
(2012)
Neural Networks: Tricks of the Trade, 2nd Ed., Ser. Lecture Notes in Computer Science (LNCS)
, vol.7700
-
-
Montavon, G.1
Orr, G.B.2
Müller, K.-R.3
-
11
-
-
84919793697
-
Deterministic policy gradient algorithms
-
D. Silver, G. Lever, N. Heess, T. Degris, D. Wierstra, and M. Riedmiller, "Deterministic policy gradient algorithms", in International Conference on Machine Learning (ICML), 2014, pp. 387-395.
-
(2014)
International Conference on Machine Learning (ICML)
, pp. 387-395
-
-
Silver, D.1
Lever, G.2
Heess, N.3
Degris, T.4
Wierstra, D.5
Riedmiller, M.6
-
12
-
-
84908477926
-
-
arXiv:1312.6211 stat. ML
-
I. Goodfellow, M. Mirza, X. Da, A. Courville, and Y. Bengio, "An empirical investigation of catastrophic forgeting in gradient-based neural networks", 2013, arXiv:1312.6211 [stat. ML].
-
(2013)
An Empirical Investigation of Catastrophic Forgeting in Gradient-based Neural Networks
-
-
Goodfellow, I.1
Mirza, M.2
Da, X.3
Courville, A.4
Bengio, Y.5
-
13
-
-
0001473437
-
On estimation of a probability density function and mode
-
E. Parzen, "On estimation of a probability density function and mode", The Annals of Mathematical Statistics, vol. 33, no. 3, pp. 1065-1076, 1962.
-
(1962)
The Annals of Mathematical Statistics
, vol.33
, Issue.3
, pp. 1065-1076
-
-
Parzen, E.1
-
14
-
-
77949422045
-
Robotic magnetic steering and locomotion of capsule endoscope for diagnostic and surgical endoluminal procedures
-
G. Ciuti, P. Valdastri, A. Menciassi, and P. Dario, "Robotic magnetic steering and locomotion of capsule endoscope for diagnostic and surgical endoluminal procedures", Robotica, vol. 28, no. 02, pp. 199-207, 2010.
-
(2010)
Robotica
, vol.28
, Issue.2
, pp. 199-207
-
-
Ciuti, G.1
Valdastri, P.2
Menciassi, A.3
Dario, P.4
|