-
1
-
-
0037288370
-
Recent advances in hierarchical reinforcement learning
-
Special Issue on Reinforcement Learning
-
A.G. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:41-77, 2003. Special Issue on Reinforcement Learning.
-
(2003)
Discrete Event Dynamic Systems
, vol.13
, pp. 41-77
-
-
Barto, A.G.1
Mahadevan, S.2
-
3
-
-
80055032021
-
Skill discovery in continuous reinforcement learning domains using skill chaining
-
G.D. Konidaris and A.G. Barto. Skill discovery in continuous reinforcement learning domains using skill chaining. In Advances in Neural Information Processing Systems 22, pages 1015-1023, 2009.
-
(2009)
Advances in Neural Information Processing Systems
, vol.22
, pp. 1015-1023
-
-
Konidaris, G.D.1
Barto, A.G.2
-
5
-
-
63149159130
-
A survey of robot learning from demonstration
-
B. Argall, S. Chernova, M. Veloso, and B. Browning. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57:469-483, 2009.
-
(2009)
Robotics and Autonomous Systems
, vol.57
, pp. 469-483
-
-
Argall, B.1
Chernova, S.2
Veloso, M.3
Browning, B.4
-
6
-
-
0031343489
-
A feedback control structure for on-line learning tasks
-
M. Huber and R.A. Grupen. A feedback control structure for on-line learning tasks. Robotics and Autonomous Systems, 22(3-4):303-315, 1997.
-
(1997)
Robotics and Autonomous Systems
, vol.22
, Issue.3-4
, pp. 303-315
-
-
Huber, M.1
Grupen, R.A.2
-
7
-
-
84979715630
-
Supervised actor-critic reinforcement learning
-
J. Si, A.G. Barto, A. Powell, and D. Wunsch, editors. John Wiley & Sons, Inc., New York
-
M. Rosenstein and A.G. Barto. Supervised actor-critic reinforcement learning. In J. Si, A.G. Barto, A. Powell, and D. Wunsch, editors, Learning and Approximate Dynamic Programming: Scaling Up the Real World, pages 359-380. John Wiley & Sons, Inc., New York, 2004.
-
(2004)
Learning and Approximate Dynamic Programming: Scaling Up the Real World
, pp. 359-380
-
-
Rosenstein, M.1
Barto, A.G.2
-
9
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
R.S. Sutton, D. Precup, and S.P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1-2
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.P.3
-
10
-
-
0021390267
-
Automatic synthesis of fine-motion strategies for robots
-
T. Lozano-Perez, M.T. Mason, and R.H. Taylor. Automatic synthesis of fine-motion strategies for robots. The International Journal of Robotics Research, 3(1):3-24, 1984.
-
(1984)
The International Journal of Robotics Research
, vol.3
, Issue.1
, pp. 3-24
-
-
Lozano-Perez, T.1
Mason, M.T.2
Taylor, R.H.3
-
11
-
-
0032647341
-
Sequential composition of dynamically dextrous robot behaviors
-
R.R. Burridge, A.A. Rizzi, and D.E. Koditschek. Sequential composition of dynamically dextrous robot behaviors. International Journal of Robotics Research, 18(6):534-555, 1999.
-
(1999)
International Journal of Robotics Research
, vol.18
, Issue.6
, pp. 534-555
-
-
Burridge, R.R.1
Rizzi, A.A.2
Koditschek, D.E.3
-
12
-
-
5844409947
-
Adaptive targeting of chaos
-
S. Boccaletti, A. Farini, E.J. Kostelich, and F.T. Arecchi. Adaptive targeting of chaos. Physical Review E, 55(5):4845-4848, 1997.
-
(1997)
Physical Review E
, vol.55
, Issue.5
, pp. 4845-4848
-
-
Boccaletti, S.1
Farini, A.2
Kostelich, E.J.3
Arecchi, F.T.4
-
14
-
-
56449130136
-
Automatic discovery and transfer of MAXQ hierarchies
-
N. Mehta, S. Ray, P. Tadepalli, and T. Dietterich. Automatic discovery and transfer of MAXQ hierarchies. In Proceedings of the Twenty Fifth International Conference on Machine Learning, pages 648-655, 2008.
-
(2008)
Proceedings of the Twenty Fifth International Conference on Machine Learning
, pp. 648-655
-
-
Mehta, N.1
Ray, S.2
Tadepalli, P.3
Dietterich, T.4
-
17
-
-
77957761338
-
LQR-Trees: Feedback motion planning on sparse randomized trees
-
R. Tedrake. LQR-Trees: Feedback motion planning on sparse randomized trees. In Proceedings of Robotics: Science and Systems, pages 18-24, 2009.
-
(2009)
Proceedings of Robotics: Science and Systems
, pp. 18-24
-
-
Tedrake, R.1
-
20
-
-
17144391260
-
Performance-derived behavior vocabularies: Data-driven acquisition of skills from motion
-
O.C. Jenkins and M. Matarić. Performance-derived behavior vocabularies: Data-driven acquisition of skills from motion. International Journal of Humanoid Robotics, 1(2):237-288, 2004.
-
(2004)
International Journal of Humanoid Robotics
, vol.1
, Issue.2
, pp. 237-288
-
-
Jenkins, O.C.1
Matarić, M.2
-
23
-
-
40649106649
-
Natural actor-critic
-
J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71(7-9):1180-1190, 2008.
-
(2008)
Neurocomputing
, vol.71
, Issue.7-9
, pp. 1180-1190
-
-
Peters, J.1
Schaal, S.2