-
2
-
-
0037288370
-
Recent advances in hierarchical reinforcement learning
-
A. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13(1-2):41-77, 2003.
-
(2003)
Discrete Event Dynamic Systems
, vol.13
, Issue.1-2
, pp. 41-77
-
-
Barto, A.1
Mahadevan, S.2
-
3
-
-
0037592480
-
Evolution strategies-a comprehensive introduction
-
Hans-Georg Beyer and Hans-Paul Schwefel. Evolution strategies-a comprehensive introduction. Natural Computing, 1(1):3-52, 2002.
-
(2002)
Natural Computing
, vol.1
, Issue.1
, pp. 3-52
-
-
Beyer, H.-G.1
Schwefel, H.-P.2
-
4
-
-
79551686776
-
Cross-entropy optimization of control policies with adaptive basis functions
-
L. Busoniu, D. Ernst, B. De Schutter, and R. Babuska. Cross-entropy optimization of control policies with adaptive basis functions. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 41(1):196-209, 2011.
-
(2011)
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
, vol.41
, Issue.1
, pp. 196-209
-
-
Busoniu, L.1
Ernst, D.2
De Schutter, B.3
Babuska, R.4
-
6
-
-
0035377566
-
Completely derandomized self-adaptation in evolution strategies
-
N. Hansen and A. Ostermeier. Completely derandomized self-adaptation in evolution strategies. Evolutionary Computation, 9(2):159-195, 2001.
-
(2001)
Evolutionary Computation
, vol.9
, Issue.2
, pp. 159-195
-
-
Hansen, N.1
Ostermeier, A.2
-
9
-
-
84886993021
-
Similarities and differences between policy gradient methods and evolution strategies
-
Verena Heidrich-Meisner and Christian Igel. Similarities and differences between policy gradient methods and evolution strategies. In ESANN 2008, 16th European Symposium on Artificial Neural Networks, Bruges, Belgium, April 23-25, 2008, Proceedings, pages 149-154, 2008.
-
(2008)
ESANN 2008, 16th European Symposium on Artificial Neural Networks, Bruges, Belgium, April 23-25, 2008, Proceedings
, pp. 149-154
-
-
Heidrich-Meisner, V.1
Igel, C.2
-
10
-
-
84875592161
-
Dynamical Movement Primitives: Learning attractor models for motor behaviors
-
A. Ijspeert, J. Nakanishi, P. Pastor, H. Hoffmann, and S. Schaal. Dynamical Movement Primitives: Learning attractor models for motor behaviors. Neural Computation, 25(2):328-373, 2013.
-
(2013)
Neural Computation
, vol.25
, Issue.2
, pp. 328-373
-
-
Ijspeert, A.1
Nakanishi, J.2
Pastor, P.3
Hoffmann, H.4
Schaal, S.5
-
12
-
-
79958852534
-
Characterizing reinforcement learning methods through parameterized learning problems
-
Shivaram Kalyanakrishnan and Peter Stone. Characterizing reinforcement learning methods through parameterized learning problems. Machine Learning, 84(1-2):205-247, 2011.
-
(2011)
Machine Learning
, vol.84
, Issue.1-2
, pp. 205-247
-
-
Kalyanakrishnan, S.1
Stone, P.2
-
13
-
-
29044440299
-
Path integrals and symmetry breaking for optimal control theory
-
H.J. Kappen. Path integrals and symmetry breaking for optimal control theory. Journal of Statistical Mechanics: Theory and Experiment, 2005(11):P11011, 2005.
-
(2005)
Journal of Statistical Mechanics: Theory and Experiment
, vol.11
, pp. P11011
-
-
Kappen, H.J.1
-
14
-
-
80053623760
-
Learning stable non-linear dynamical systems with Gaussian mixture models
-
S. Mohammad Khansari-Zadeh and Aude Billard. Learning stable non-linear dynamical systems with Gaussian mixture models. IEEE Transactions on Robotics, 2011.
-
(2011)
IEEE Transactions on Robotics
-
-
Khansari-Zadeh, S.M.1
Billard, A.2
-
15
-
-
78651495944
-
Reinforcement learning to adjust robot movements to new situations
-
J. Kober, E. Oztop, and J. Peters. Reinforcement learning to adjust robot movements to new situations. In Proceedings of Robotics: Science and Systems, Zaragoza, Spain, June 2010.
-
(2010)
Proceedings of Robotics: Science and Systems, Zaragoza, Spain
-
-
Kober, J.1
Oztop, E.2
Peters, J.3
-
16
-
-
78049390740
-
Policy search for motor primitives in robotics
-
J. Kober and J. Peters. Policy search for motor primitives in robotics. Machine Learning, 84:171-203, 2011.
-
(2011)
Machine Learning
, vol.84
, pp. 171-203
-
-
Kober, J.1
Peters, J.2
-
17
-
-
84885895576
-
Towards fast and adaptive optimal control policies for robots: A direct policy search approach
-
D. Marin and O. Sigaud. Towards fast and adaptive optimal control policies for robots: A direct policy search approach. In Proceedings Robotica, pages 21-26, Guimaraes, Portugal, 2012.
-
(2012)
Proceedings Robotica
, pp. 21-26
-
-
Marin, D.1
Sigaud, O.2
-
18
-
-
84864436640
-
Closed-loop primitives: A method to generate and recognize reaching actions from demonstration
-
Mustafa Parlaktuna, Doruk Tunaoglu, Erol Sahin, and Emre Ugur. Closed-loop primitives: A method to generate and recognize reaching actions from demonstration. In International Conference on Robotics and Automation, pages 2015-2020, 2012.
-
(2012)
International Conference on Robotics and Automation
, pp. 2015-2020
-
-
Parlaktuna, M.1
Tunaoglu, D.2
Sahin, E.3
Ugur, E.4
-
20
-
-
40649106649
-
Natural actor-critic
-
Jan Peters and Stefan Schaal. Natural actor-critic. Neurocomputing, 71(7-9):1180-1190, 2008.
-
(2008)
Neurocomputing
, vol.71
, Issue.7-9
, pp. 1180-1190
-
-
Peters, J.1
Schaal, S.2
-
25
-
-
85141643084
-
Exploring parameter space in reinforcement learning
-
Thomas Rückstiess, Frank Sehnke, Tom Schaul, Daan Wierstra, Yi Sun, and Jürgen Schmidhuber. Exploring parameter space in reinforcement learning. Paladyn, Journal of Behavioral Robotics, 1:14-24, 2010. ISSN 2080-9778.
-
(2010)
Paladyn, Journal of Behavioral Robotics
, vol.1
, pp. 14-24
-
-
Rückstiess, T.1
Sehnke, F.2
Schaul, T.3
Wierstra, D.4
Sun, Y.5
Schmidhuber, J.6
-
26
-
-
0031231885
-
Experiments with reinforcement learning in problems with continuous state and action spaces
-
J.C. Santamaría, R.S. Sutton, and A. Ram. Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive Behavior, 6(2):163-217, 1997.
-
(1997)
Adaptive Behavior
, vol.6
, Issue.2
, pp. 163-217
-
-
Santamaría, J.C.1
Sutton, R.S.2
Ram, A.3
-
28
-
-
77950297907
-
Parameter-exploring policy gradients
-
Frank Sehnke, Christian Osendorfer, Thomas Rückstiess, Alex Graves, Jan Peters, and Jürgen Schmidhuber. Parameter-exploring policy gradients. Neural Networks, 23(4):551-559, 2010.
-
(2010)
Neural Networks
, vol.23
, Issue.4
, pp. 551-559
-
-
Sehnke, F.1
Osendorfer, C.2
Rückstiess, T.3
Graves, A.4
Peters, J.5
Schmidhuber, J.6
-
29
-
-
74049165047
-
From motor learning to interaction learning in robots
-
O. Sigaud and J. Peters. From motor learning to interaction learning in robots. In From Motor Learning to Interaction Learning in Robots, volume 264, pages 1-12. Springer-Verlag, 2010.
-
(2010)
From Motor Learning to Interaction Learning in Robots
, vol.264
, pp. 1-12
-
-
Sigaud, O.1
Peters, J.2
-
30
-
-
84867115622
-
Learning parameterized skills
-
Bruno Da Silva, George Konidaris, and Andrew Barto. Learning parameterized skills. In John Langford and Joelle Pineau, editors, Proceedings of the 29th International Conference on Machine Learning (ICML-12), ICML '12, pages 1679-1686, New York, NY, USA, July 2012. Omnipress. ISBN 978-1-4503-1285-1.
-
(2012)
Proceedings of the 29th International Conference on Machine Learning (ICML-12), ICML '12
, pp. 1679-1686
-
-
Da Silva, B.1
Konidaris, G.2
Barto, A.3
-
32
-
-
84455172101
-
Learning motion primitive goals for robust manipulation
-
Freek Stulp, Evangelos Theodorou, Mrinal Kalakrishnan, Peter Pastor, Ludovic Righetti, and Stefan Schaal. Learning motion primitive goals for robust manipulation. In International Conference on Intelligent Robots and Systems (IROS), 2011.
-
(2011)
International Conference on Intelligent Robots and Systems (IROS)
-
-
Stulp, F.1
Theodorou, E.2
Kalakrishnan, M.3
Pastor, P.4
Righetti, L.5
Schaal, S.6
-
33
-
-
84870935597
-
Reinforcement learning with sequences of motion primitives for robust manipulation
-
Freek Stulp, Evangelos Theodorou, and Stefan Schaal. Reinforcement learning with sequences of motion primitives for robust manipulation. IEEE Transactions on Robotics, 28(6):1360-1370, 2012. King-Sun Fu Best Paper Award of the IEEE Transactions on Robotics for the year 2012.
-
(2012)
IEEE Transactions on Robotics
, vol.28
, Issue.6
, pp. 1360-1370
-
-
Stulp, F.1
Theodorou, E.2
Schaal, S.3
-
35
-
-
80052851862
-
Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives
-
Minija Tamosiunaite, Bojan Nemec, Ales Ude, and Florentin Wörgötter. Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives. Robotics and Autonomous Systems, 59(11):910-922, 2011.
-
(2011)
Robotics and Autonomous Systems
, vol.59
, Issue.11
, pp. 910-922
-
-
Tamosiunaite, M.1
Nemec, B.2
Ude, A.3
Wörgötter, F.4
-
36
-
-
79551503171
-
A generalized path integral control approach to reinforcement learning
-
Evangelos Theodorou, Jonas Buchli, and Stefan Schaal. A generalized path integral control approach to reinforcement learning. Journal of Machine Learning Research, 11:3137-3181, 2010.
-
(2010)
Journal of Machine Learning Research
, vol.11
, pp. 3137-3181
-
-
Theodorou, E.1
Buchli, J.2
Schaal, S.3
-
37
-
-
79958789196
-
Ontogenetic and phylogenetic reinforcement learning
-
Julian Togelius, Tom Schaul, Daan Wierstra, Christian Igel, Faustino Gomez, and Jürgen Schmidhuber. Ontogenetic and phylogenetic reinforcement learning. Zeitschrift Künstliche Intelligenz, Special Issue on Reinforcement Learning, pages 30-33, 2009.
-
(2009)
Zeitschrift Künstliche Intelligenz, Special Issue on Reinforcement Learning
, pp. 30-33
-
-
Togelius, J.1
Schaul, T.2
Wierstra, D.3
Igel, C.4
Gomez, F.5
Schmidhuber, J.6
-
38
-
-
77957706006
-
Task-specific generalization of discrete and periodic dynamic movement primitives
-
Ales Ude, Andrej Gams, Tamim Asfour, and Jun Morimoto. Task-specific generalization of discrete and periodic dynamic movement primitives. IEEE Transactions on Robotics, 26(5):800-815, 2010.
-
(2010)
IEEE Transactions on Robotics
, vol.26
, Issue.5
, pp. 800-815
-
-
Ude, A.1
Gams, A.2
Asfour, T.3
Morimoto, J.4
-
39
-
-
0002891388
-
Locally weighted projection regression: An O(n) algorithm for incremental real time learning in high dimensional spaces
-
S. Vijayakumar and S. Schaal. Locally weighted projection regression: An O(n) algorithm for incremental real time learning in high dimensional spaces. In Proceedings of the 17th International Conference on Machine Learning (ICML), pages 288-293, 2000.
-
(2000)
Proceedings of the 17th International Conference on Machine Learning (ICML)
, pp. 288-293
-
-
Vijayakumar, S.1
Schaal, S.2
-
41
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 229-256
-
-
Williams, R.J.1