-
1
-
-
79551686776
-
Cross-entropy optimization of control policies with adaptive basis functions
-
Busoniu, L., Ernst, D., Schutter, B. De, and Babuska, R. Cross-entropy optimization of control policies with adaptive basis functions. IEEE Transactions on Systems, Man, and Cybernetics, 41(1):196-209, 2011.
-
(2011)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.41
, Issue.1
, pp. 196-209
-
-
Busoniu, L.1
Ernst, D.2
De Schutter, B.3
Babuska, R.4
-
3
-
-
0035377566
-
Completely derandomized self-adaptation in evolution strategies
-
Hansen, N. and Ostermeier, A. Completely derandomized self-adaptation in evolution strategies. Evolutionary Computation, 9(2):159-195, 2001.
-
(2001)
Evolutionary Computation
, vol.9
, Issue.2
, pp. 159-195
-
-
Hansen, N.1
Ostermeier, A.2
-
5
-
-
0036059542
-
Movement imitation with nonlinear dynamical systems in humanoid robots
-
Ijspeert, A. J., Nakanishi, J., and Schaal, S. Movement imitation with nonlinear dynamical systems in humanoid robots. In Proc. of the IEEE Int'l Conference on Robotics and Automation (ICRA), 2002.
-
Proc. of the IEEE Int'l Conference on Robotics and Automation (ICRA), 2002
-
-
Ijspeert, A.J.1
Nakanishi, J.2
Schaal, S.3
-
6
-
-
78049390740
-
Policy search for motor primitives in robotics
-
Kober, J. and Peters, J. Policy search for motor primitives in robotics. Machine Learning, 84:171-203, 2011.
-
(2011)
Machine Learning
, vol.84
, pp. 171-203
-
-
Kober, J.1
Peters, J.2
-
8
-
-
1942516890
-
The Cross-Entropy Method for fast policy search
-
Mannor, S., Rubinstein, R. Y., and Gat, Y. The Cross-Entropy Method for fast policy search. In Proceedings of the Int'l Conference on Machine Learning, 2003.
-
Proceedings of the Int'l Conference on Machine Learning, 2003
-
-
Mannor, S.1
Rubinstein, R.Y.2
Gat, Y.3
-
9
-
-
84860394829
-
Learning cost-efficient control policies with XCSF: Generalization capabilities and further improvement
-
Marin, D., Decock, J., Rigoux, L., and Sigaud, O. Learning cost-efficient control policies with XCSF: Generalization capabilities and further improvement. In Proc. of Genetic and evolutionary computation, 2011.
-
Proc. of Genetic and Evolutionary Computation, 2011
-
-
Marin, D.1
Decock, J.2
Rigoux, L.3
Sigaud, O.4
-
10
-
-
40649106649
-
Natural actor-critic
-
Peters, J. and Schaal, S.. Natural actor-critic. Neurocomputing, 71(7-9):1180-1190, 2008.
-
(2008)
Neurocomputing
, vol.71
, Issue.7-9
, pp. 1180-1190
-
-
Peters, J.1
Schaal, S.2
-
11
-
-
85141643084
-
Exploring parameter space in reinforcement learning
-
Rückstiess, T., Sehnke, F., Schaul, T., Wierstra, D., Sun, Y., and Schmidhuber, J.. Exploring parameter space in reinforcement learning. Paladyn. Journal of Behavioral Robotics, 1:14-24, 2010.
-
(2010)
Paladyn. Journal of Behavioral Robotics
, vol.1
, pp. 14-24
-
-
Rückstiess, T.1
Sehnke, F.2
Schaul, T.3
Wierstra, D.4
Sun, Y.5
Schmidhuber, J.6
-
12
-
-
84870910181
-
Learning to grasp under uncertainty
-
Stulp, F., Theodorou, E., Buchli, J., and Schaal, S. Learning to grasp under uncertainty. In Proceedings of the Int'l Conference on Robotics and Automation, 2011.
-
Proceedings of the Int'l Conference on Robotics and Automation, 2011
-
-
Stulp, F.1
Theodorou, E.2
Buchli, J.3
Schaal, S.4
-
13
-
-
79551503171
-
A generalized path integral control approach to reinforcement learning
-
Theodorou, E., Buchli, J., and Schaal, S.. A generalized path integral control approach to reinforcement learning. J. of Machine Learning Research, 11:3137-3181, 2010.
-
(2010)
J. of Machine Learning Research
, vol.11
, pp. 3137-3181
-
-
Theodorou, E.1
Buchli, J.2
Schaal, S.3
-
14
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Williams, R. J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 229-256
-
-
Williams, R.J.1
|