[1] Abbeel, P., Coates, A., Quigley, M., and Ng, A. An application of reinforcement learning to aerobatic helicopter flight. In Advances in Neural Information Processing Systems (NIPS 19), 2006.
[3] Atkeson, C. and Stephens, B. Random sampling of states in dynamic programming. IEEE Transactions on Systems, Man, and Cybernetics, Part B, 38(4):924-929, 2008.
[4] Bagnell, A., Kakade, S., Ng, A., and Schneider, J. Policy search by dynamic programming. In Advances in Neural Information Processing Systems (NIPS 16), 2003.
[5] Baxter, J., Bartlett, P., and Weaver, L. Experiments with infinite-horizon, policy-gradient estimation. Journal of Artificial Intelligence Research, 15:351-381, 2001.
[8] Ijspeert, A., Nakanishi, J., and Schaal, S. Movement imitation with nonlinear dynamical systems in humanoid robots. In International Conference on Robotics and Automation, 2002.
[10] Kalakrishnan, M., Chitta, S., Theodorou, E., Pastor, P., and Schaal, S. STOMP: Stochastic trajectory optimization for motion planning. In International Conference on Robotics and Automation, 2011.
[13] Peters, J. and Schaal, S. Reinforcement learning of motor skills with policy gradients. Neural Networks, 21(4):682-697, 2008.
[15] Ross, S., Gordon, G., and Bagnell, A. A reduction of imitation learning and structured prediction to no-regret online learning. Journal of Machine Learning Research, 15:627-635, 2011.
[16] Sutton, R., McAllester, D., Singh, S., and Mansour, Y. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems (NIPS 11), 1999.
[18] Tassa, Y., Erez, T., and Todorov, E. Synthesis and stabilization of complex behaviors through online trajectory optimization. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012.
[19] Tedrake, R., Zhang, T., and Seung, H. Stochastic policy gradient reinforcement learning on a simple 3D biped. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 2004.
[20] Todorov, E., Erez, T., and Tassa, Y. MuJoCo: A physics engine for model-based control. In IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012.
[22] Yin, K., Loken, K., and van de Panne, M. SIMBICON: Simple biped locomotion control. ACM Transactions on Graphics, 26(3), 2007.