-
1
-
-
84884276459
-
Reinforcement learning in robotics: A survey
-
J. Kober, J. A. Bagnell, and J. Peters, "Reinforcement learning in robotics: A survey", Int. J. Robotics Research, vol. 11, no. 32, pp. 1238-1274, 2013.
-
(2013)
Int. J. Robotics Research
, vol.11
, Issue.32
, pp. 1238-1274
-
-
Kober, J.1
Bagnell, J.A.2
Peters, J.3
-
2
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey", J. Artificial Intell. Research, vol. 4, 1996.
-
(1996)
J. Artificial Intell. Research
, vol.4
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
3
-
-
84893561637
-
Acquiring visual servoing reaching and grasping skills using neural reinforcement learning
-
T. Lampe and M. Riedmiller, "Acquiring visual servoing reaching and grasping skills using neural reinforcement learning", in IJCNN, 2013.
-
(2013)
IJCNN
-
-
Lampe, T.1
Riedmiller, M.2
-
4
-
-
84938227861
-
Towards learning hierarchical skills for multi-phase manipulation tasks
-
O. Kroemer, C. Daniel, G. Neumann, H. van Hoof, and J. Peters, "Towards learning hierarchical skills for multi-phase manipulation tasks", in ICRA, 2015.
-
(2015)
ICRA
-
-
Kroemer, O.1
Daniel, C.2
Neumann, G.3
Van Hoof, H.4
Peters, J.5
-
5
-
-
84455188451
-
Learning force control policies for compliant manipulation
-
M. Kalakrishnan, L. Righetti, P. Pastor, and S. Schaal, "Learning force control policies for compliant manipulation", in IROS, 2011.
-
(2011)
IROS
-
-
Kalakrishnan, M.1
Righetti, L.2
Pastor, P.3
Schaal, S.4
-
6
-
-
0034291488
-
Control of contact via tactile sensing
-
H. Zhang and N. Chen, "Control of contact via tactile sensing", Trans. Robotics and Automation, vol. 16, no. 5, pp. 482-495, 2000.
-
(2000)
Trans. Robotics and Automation
, vol.16
, Issue.5
, pp. 482-495
-
-
Zhang, H.1
Chen, N.2
-
7
-
-
84929223521
-
Limits to compliance and the role of tactile sensing in grasping
-
L. Jentoft, Q. Wan, and R. Howe, "Limits to compliance and the role of tactile sensing in grasping", in ICRA, 2014, pp. 6394-6399.
-
(2014)
ICRA
, pp. 6394-6399
-
-
Jentoft, L.1
Wan, Q.2
Howe, R.3
-
8
-
-
78651516759
-
Contact-reactive grasping of objects with partial shape information
-
K. Hsiao, S. Chitta, M. Ciocarlie, and E. Jones, "Contact-reactive grasping of objects with partial shape information", in IROS, 2010.
-
(2010)
IROS
-
-
Hsiao, K.1
Chitta, S.2
Ciocarlie, M.3
Jones, E.4
-
9
-
-
84891103646
-
Towards associative skill memories
-
P. Pastor, M. Kalakrishnan, L. Righetti, and S. Schaal, "Towards associative skill memories", in Humanoids, 2012.
-
(2012)
Humanoids
-
-
Pastor, P.1
Kalakrishnan, M.2
Righetti, L.3
Schaal, S.4
-
10
-
-
35248866146
-
An introduction to reinforcement learning theory: Value function methods
-
Springer Berlin Heidelberg
-
P. L. Bartlett, "An introduction to reinforcement learning theory: Value function methods", in Advanced lectures on Machine Learning. Springer Berlin Heidelberg, 2003, pp. 184-202.
-
(2003)
Advanced Lectures on Machine Learning
, pp. 184-202
-
-
Bartlett, P.L.1
-
11
-
-
84911470116
-
Learning robot tactile sensing for object manipulation
-
Y. Chebotar, O. Kroemer, and J. Peters, "Learning robot tactile sensing for object manipulation", in IROS, 2014, pp. 3368-3375.
-
(2014)
IROS
, pp. 3368-3375
-
-
Chebotar, Y.1
Kroemer, O.2
Peters, J.3
-
12
-
-
77955426970
-
Combining active learning and reactive control for robot grasping
-
O. Kroemer, R. Detry, J. Piater, and J. Peters, "Combining active learning and reactive control for robot grasping", Robotics and Autonomous Syst., vol. 58, no. 9, pp. 1105-1116, 2010.
-
(2010)
Robotics and Autonomous Syst.
, vol.58
, Issue.9
, pp. 1105-1116
-
-
Kroemer, O.1
Detry, R.2
Piater, J.3
Peters, J.4
-
13
-
-
84868008342
-
Skill learning and task outcome prediction for manipulation
-
P. Pastor, M. Kalakrishnan, S. Chitta, E. Theodorou, and S. Schaal, "Skill learning and task outcome prediction for manipulation", in ICRA, 2011.
-
(2011)
ICRA
-
-
Pastor, P.1
Kalakrishnan, M.2
Chitta, S.3
Theodorou, E.4
Schaal, S.5
-
14
-
-
67650835709
-
Learning perceptual coupling for motor primitives
-
J. Kober, B. Mohler, and J. Peters, "Learning perceptual coupling for motor primitives", in IROS, 2008, pp. 834-839.
-
(2008)
IROS
, pp. 834-839
-
-
Kober, J.1
Mohler, B.2
Peters, J.3
-
15
-
-
84962230791
-
Learning of non-parametric control policies with high-dimensional state features
-
H. van Hoof, J. Peters, and G. Neumann, "Learning of non-parametric control policies with high-dimensional state features", in AISTATS, 2015.
-
(2015)
AISTATS
-
-
Van Hoof, H.1
Peters, J.2
Neumann, G.3
-
16
-
-
84962314631
-
Learning robot in-hand manipulation with tactile features
-
H. van Hoof, T. Hermans, G. Neumann, and J. Peters, "Learning robot in-hand manipulation with tactile features", in Humanoids, 2015.
-
(2015)
Humanoids
-
-
Van Hoof, H.1
Hermans, T.2
Neumann, G.3
Peters, J.4
-
17
-
-
56449089103
-
Extracting and composing robust features with denoising autoencoders
-
P. Vincent, H. Larochelle, Y. Bengio, and P.-A. Manzagol, "Extracting and composing robust features with denoising autoencoders", in ICML, 2008, pp. 1096-1103.
-
(2008)
ICML
, pp. 1096-1103
-
-
Vincent, P.1
Larochelle, H.2
Bengio, Y.3
Manzagol, P.-A.4
-
18
-
-
69349090197
-
Learning deep architectures for AI
-
Y. Bengio, "Learning deep architectures for AI", Found. Trends Mach. Learn., vol. 2, no. 1, pp. 1-127, 2009.
-
(2009)
Found. Trends Mach. Learn.
, vol.2
, Issue.1
, pp. 1-127
-
-
Bengio, Y.1
-
19
-
-
85083952489
-
Auto-encoding variational Bayes
-
D. P. Kingma and M. Welling, "Auto-encoding variational Bayes", in ICLR, 2014.
-
(2014)
ICLR
-
-
Kingma, D.P.1
Welling, M.2
-
20
-
-
84962243554
-
Efficient movement representation by embedding dynamic movement primitives in deep autoencoders
-
N. Chen, J. Bayer, S. Urban, and P. van der Smagt, "Efficient movement representation by embedding dynamic movement primitives in deep autoencoders", in Humanoids, 2015.
-
(2015)
Humanoids
-
-
Chen, N.1
Bayer, J.2
Urban, S.3
Van Der Smagt, P.4
-
21
-
-
85006369951
-
Learn to swing up and balance a real pole based on raw visual input data
-
J. Mattner, S. Lange, and M. Riedmiller, "Learn to swing up and balance a real pole based on raw visual input data", in ICONIP, 2012.
-
(2012)
ICONIP
-
-
Mattner, J.1
Lange, S.2
Riedmiller, M.3
-
22
-
-
84977501896
-
Deep spatial autoencoders for visuomotor learning
-
C. Finn, X. Y. Tan, Y. Duan, T. Darrell, S. Levine, and P. Abbeel, "Deep spatial autoencoders for visuomotor learning", in ICRA, 2016.
-
(2016)
ICRA
-
-
Finn, C.1
Tan, X.Y.2
Duan, Y.3
Darrell, T.4
Levine, S.5
Abbeel, P.6
-
23
-
-
84980051362
-
-
ArXiv, Tech. Rep.
-
J.-A. M. Assael, N. Wahlström, T. B. Schön, and M. P. Deisenroth, "Data-efficient learning of feedback policies from image pixels using deep dynamical models", ArXiv, Tech. Rep., 2015.
-
(2015)
Data-efficient Learning of Feedback Policies from Image Pixels Using Deep Dynamical Models
-
-
Assael, M.J.-A.1
Wahlström, N.2
Schön, T.B.3
Deisenroth, M.P.4
-
24
-
-
84965104233
-
-
ArXiv, Tech. Rep
-
N. Wahlström, T. B. Schön, and M. P. Deisenroth, "From pixels to torques: Policy learning with deep dynamical models", ArXiv, Tech. Rep. 1502.02251, 2015.
-
(2015)
From Pixels to Torques: Policy Learning with Deep Dynamical Models
-
-
Wahlström, N.1
Schön, T.B.2
Deisenroth, M.P.3
-
25
-
-
84865083902
-
Autonomous reinforcement learning on raw visual input data in a real world application
-
S. Lange, M. Riedmiller, and A. Voigtländer, "Autonomous reinforcement learning on raw visual input data in a real world application", in IJCNN, 2012.
-
(2012)
IJCNN
-
-
Lange, S.1
Riedmiller, M.2
Voigtländer, A.3
-
26
-
-
33646687423
-
Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method
-
M. Riedmiller, "Neural fitted Q iteration-first experiences with a data efficient neural reinforcement learning method", in ECML, 2005.
-
(2005)
ECML
-
-
Riedmiller, M.1
-
27
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis, "Human-level control through deep reinforcement learning", Nature, vol. 518, no. 7540, 2015.
-
(2015)
Nature
, vol.518
, Issue.7540
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
Petersen, S.11
Beattie, C.12
Sadik, A.13
Antonoglou, I.14
King, H.15
Kumaran, D.16
Wierstra, D.17
Legg, S.18
Hassabis, D.19
-
28
-
-
84883060087
-
Evolving largescale neural networks for vision-based reinforcement learning
-
J. Koutník, G. Cuccu, J. Schmidhuber, and F. Gomez, "Evolving largescale neural networks for vision-based reinforcement learning", in Annu. Conf. Genetic and Evolutionary Computation, 2013.
-
(2013)
Annu. Conf. Genetic and Evolutionary Computation
-
-
Koutník, J.1
Cuccu, G.2
Schmidhuber, J.3
Gomez, F.4
-
29
-
-
84965135289
-
-
arXiv, Tech. Rep
-
T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning", arXiv, Tech. Rep. 1509.02971, 2015.
-
(2015)
Continuous Control with Deep Reinforcement Learning
-
-
Lillicrap, T.P.1
Hunt, J.J.2
Pritzel, A.3
Heess, N.4
Erez, T.5
Tassa, Y.6
Silver, D.7
Wierstra, D.8
-
30
-
-
84969963490
-
Trust region policy optimization
-
J. Schulman, S. Levine, P. Moritz, M. Jordan, and P. Abbeel, "Trust region policy optimization", in ICML, 2015.
-
(2015)
ICML
-
-
Schulman, J.1
Levine, S.2
Moritz, P.3
Jordan, M.4
Abbeel, P.5
-
31
-
-
84979924150
-
End-to-end training of deep visuomotor policies
-
S. Levine, C. Finn, T. Darrell, and P. Abbeel, "End-to-end training of deep visuomotor policies", Journal of Machine Learning Research, vol. 17, no. 39, pp. 1-40, 2016.
-
(2016)
Journal of Machine Learning Research
, vol.17
, Issue.39
, pp. 1-40
-
-
Levine, S.1
Finn, C.2
Darrell, T.3
Abbeel, P.4
-
32
-
-
84965129327
-
Embed to control: A locally linear latent dynamics model for control from raw images
-
M. Watter, J. T. Springenberg, J. Boedecker, and M. Riedmiller, "Embed to control: A locally linear latent dynamics model for control from raw images", in NIPS, 2015, pp. 2728-2736.
-
(2015)
NIPS
, pp. 2728-2736
-
-
Watter, M.1
Springenberg, J.T.2
Boedecker, J.3
Riedmiller, M.4
-
34
-
-
77958569725
-
Relative entropy policy search
-
J. Peters, K. Mülling, and Y. Altün, "Relative entropy policy search", in AAAI, 2010, pp. 1607-1612.
-
(2010)
AAAI
, pp. 1607-1612
-
-
Peters, J.1
Mülling, K.2
Altün, Y.3
-
36
-
-
77953218689
-
Random features for large-scale kernel machines
-
A. Rahimi and B. Recht, "Random features for large-scale kernel machines", in NIPS, 2007, pp. 1177-1184.
-
(2007)
NIPS
, pp. 1177-1184
-
-
Rahimi, A.1
Recht, B.2
|