-
2
-
-
34250635407
-
Policy gradient methods for robotics
-
J. Peters and S. Schaal, "Policy gradient methods for robotics," in IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'06, Beijing, China, October 9-15 2006.
-
IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'06, Beijing, China, October 9-15 2006
-
-
Peters, J.1
Schaal, S.2
-
3
-
-
4043069840
-
On actor-critic algorithms
-
V. Konda and J. Tsitsiklis, "On actor-critic algorithms," SIAM Journal on Control and Optimization, vol. 42, number 4, pp. 1143-1166, 2003.
-
(2003)
SIAM Journal on Control and Optimization
, vol.42
, Issue.4
, pp. 1143-1166
-
-
Konda, V.1
Tsitsiklis, J.2
-
4
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
R. Sutton, D. McAllester, S. Singh, and Y. Mansour, "Policy gradient methods for reinforcement learning with function approximation," Advances in Neural Information Processing Systems, vol. 12, pp. 1057-1063, 2000.
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 1057-1063
-
-
Sutton, R.1
McAllester, D.2
Singh, S.3
Mansour, Y.4
-
5
-
-
14044262287
-
Stochastic policy gradient reinforcement learning on a simple 3D biped
-
R. Tedrake, T. W. Zhang, and H. S. Seung, "Stochastic policy gradient reinforcement learning on a simple 3D biped," in IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'04, Sendai, Japan, September 28 - October 2 2004.
-
IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'04, Sendai, Japan, September 28 - October 2 2004
-
-
Tedrake, R.1
Zhang, T.W.2
Seung, H.S.3
-
6
-
-
33846174631
-
Learning sensory feedback to CPG with policy gradient for biped locomotion
-
T. Matsubara, J. Morimoto, J. Nakanishi, M. Sato, and K. Doya, "Learning sensory feedback to CPG with policy gradient for biped locomotion," in Proceedings of the International Conference on Robotics and Automation ICRA, Barcelona, Spain, April 2005.
-
Proceedings of the International Conference on Robotics and Automation ICRA, Barcelona, Spain, April 2005
-
-
Matsubara, T.1
Morimoto, J.2
Nakanishi, J.3
Sato, M.4
Doya, K.5
-
7
-
-
70049104346
-
-
Ph.D. dissertation, Department of Computer Science, University of Southern California.
-
J. Peters, "Machine learning of motor skills for robotics," Ph.D. dissertation, Department of Computer Science, University of Southern California., 2007.
-
(2007)
Machine Learning of Motor Skills for Robotics
-
-
Peters, J.1
-
8
-
-
0000396062
-
Natural gradient works efficiently in learning
-
S. Amari, "Natural gradient works efficiently in learning," Neural Computation, vol. 10, pp. 251-276, 1998.
-
(1998)
Neural Computation
, vol.10
, pp. 251-276
-
-
Amari, S.1
-
9
-
-
84864064043
-
Natural actor-critic for road traffic optimisation
-
S. Richter, D. Aberdeen, and J. Yu, "Natural actor-critic for road traffic optimisation," in Neural Information Processing Systems, NIPS'06, 2006, pp. 1169-1176.
-
Neural Information Processing Systems, NIPS'06, 2006
, pp. 1169-1176
-
-
Richter, S.1
Aberdeen, D.2
Yu, J.3
-
10
-
-
34250613580
-
Stable learning of quasi-passive dynamic walking by an unstable biped robot based on off-policy natural actor-critic
-
T. Ueno, Y. Nakamura, T. Shibata, K. Hosoda, and S. Ishii, "Stable learning of quasi-passive dynamic walking by an unstable biped robot based on off-policy natural actor-critic," in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2006.
-
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2006
-
-
Ueno, T.1
Nakamura, Y.2
Shibata, T.3
Hosoda, K.4
Ishii, S.5
-
11
-
-
0004090962
-
-
Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May
-
W. Smart, "Making reinforcement learning work on real robots," Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May 2002.
-
(2002)
Making Reinforcement Learning Work on Real Robots
-
-
Smart, W.1
-
12
-
-
33846984231
-
Learning obstacle avoidance parameters from operator behavior
-
December
-
B. Hammer, S. Singh, and S. Scherer, "Learning obstacle avoidance parameters from operator behavior," Journal of Field Robotics, Special Issue on Machine Learning Based Robotics in Unstructured Environments, vol. 23 (11/12), December 2006.
-
(2006)
Journal of Field Robotics, Special Issue on Machine Learning Based Robotics in Unstructured Environments
, vol.23
, Issue.11-12
-
-
Hammer, B.1
Singh, S.2
Scherer, S.3
-
13
-
-
33646413135
-
Natural actor-critic
-
J. Peters, S. Vijayakumar, and S. Schaal, "Natural actor-critic," in ECML, 2005, pp. 280-291.
-
(2005)
ECML
, pp. 280-291
-
-
Peters, J.1
Vijayakumar, S.2
Schaal, S.3
-
14
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching." Machine Learning, vol. 8(3/4), pp. 293-321, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 293-321
-
-
Lin, L.1
-
17
-
-
36348971779
-
Ictineu auv wins the first sauc-e competition
-
D. Ribas, N. Palomeras, P. Ridao, M. Carreras, and E. Hernandez, "Ictineu auv wins the first sauc-e competition," in IEEE International Conference on Robotics and Automation, 2007.
-
IEEE International Conference on Robotics and Automation, 2007
-
-
Ribas, D.1
Palomeras, N.2
Ridao, P.3
Carreras, M.4
Hernandez, E.5
-
18
-
-
3342922286
-
On the identification of non-linear models of unmanned underwater vehicles
-
P. Ridao, A. Tiano, A. El-Fakdi, M. Carreras, and A. Zirilli, "On the identification of non-linear models of unmanned underwater vehicles," Control Engineering Practice, vol. 12, pp. 1483-1499, 2004.
-
(2004)
Control Engineering Practice
, vol.12
, pp. 1483-1499
-
-
Ridao, P.1
Tiano, A.2
El-Fakdi, A.3
Carreras, M.4
Zirilli, A.5
-
19
-
-
35248838766
-
Underwater cable tracking by visual feedback
-
J. Antich and A. Ortiz, "Underwater cable tracking by visual feedback," in First Iberian Conference on Pattern recognition and Image Analysis (IbPRIA, LNCS 2652), Port d'Andratx, Spain, 2003.
-
First Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA, LNCS 2652), Port D'Andratx, Spain, 2003
-
-
Antich, J.1
Ortiz, A.2
|