메뉴 건너뛰기




Volumn 4, Issue 1, 2013, Pages 49-61

Robot Skill Learning: From Reinforcement Learning to Evolution Strategies

Author keywords

black box optimization; dynamic movement primitives; evolution strategies; reinforcement learning

Indexed keywords

EVOLUTIONARY ALGORITHMS; LEARNING ALGORITHMS; OPTIMIZATION; ROBOTS;

EID: 84899573498     PISSN: None     EISSN: 20814836     Source Type: Journal    
DOI: 10.2478/pjbr-2013-0003     Document Type: Article
Times cited : (132)

References (41)
  • 2
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • A. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete event systems, 13(1-2):41-77, 2003.
    • (2003) Discrete Event Systems , vol.13 , Issue.1-2 , pp. 41-77
    • Barto, A.1    Mahadevan, S.2
  • 3
    • 0037592480 scopus 로고    scopus 로고
    • Evolution strategies-a comprehensive introduction
    • Hans-Georg Beyer and Hans-Paul Schwefel. Evolution strategies-a comprehensive introduction. Natural Computing, 1(1):3-52, 2002.
    • (2002) Natural Computing , vol.1 , Issue.1 , pp. 3-52
    • Beyer, H.-G.1    Schwefel, H.-P.2
  • 6
    • 0035377566 scopus 로고    scopus 로고
    • Completely derandomized selfadaptation in evolution strategies
    • N. Hansen and A. Ostermeier. Completely derandomized selfadaptation in evolution strategies. Evolutionary Computation, 9(2):159-195, 2001.
    • (2001) Evolutionary Computation , vol.9 , Issue.2 , pp. 159-195
    • Hansen, N.1    Ostermeier, A.2
  • 10
    • 84875592161 scopus 로고    scopus 로고
    • Dynamical Movement Primitives: Learning attractor models for motor behaviors
    • A. Ijspeert, J. Nakanishi, P Pastor, H. Hoffmann, and S. Schaal. Dynamical Movement Primitives: Learning attractor models for motor behaviors. Neural Computation, 25(2):328-373, 2013.
    • (2013) Neural Computation , vol.25 , Issue.2 , pp. 328-373
    • Ijspeert, A.1    Nakanishi, J.2    Pastor, P.3    Hoffmann, H.4    Schaal, S.5
  • 12
    • 79958852534 scopus 로고    scopus 로고
    • Characterizing reinforcement learning methods through parameterized learning problems
    • Shivaram Kalyanakrishnan and Peter Stone. Characterizing reinforcement learning methods through parameterized learning problems. Machine Learning, 84(1-2):205-247, 2011.
    • (2011) Machine Learning , vol.84 , Issue.1-2 , pp. 205-247
    • Kalyanakrishnan, S.1    Stone, P.2
  • 13
  • 14
    • 80053623760 scopus 로고    scopus 로고
    • Learning stable non-linear dynamical systems with gaussian mixture models
    • S. Mohammad Khansari-Zadeh and Aude Billard. Learning stable non-linear dynamical systems with gaussian mixture models. IEEE Transactions on Robotics, 2011.
    • (2011) IEEE Transactions on Robotics
    • Khansari-Zadeh, S.M.1    Billard, A.2
  • 16
    • 78049390740 scopus 로고    scopus 로고
    • Policy search for motor primitives in robotics
    • J. Kober and J. Peters. Policy search for motor primitives in robotics. Machine Learning, 84:171-203, 2011.
    • (2011) Machine Learning , vol.84 , pp. 171-203
    • Kober, J.1    Peters, J.2
  • 17
    • 84885895576 scopus 로고    scopus 로고
    • Towards fast and adaptive optimal control policies for robots: A direct policy search approach
    • Guimaraes, Portugal
    • D. Marin and O. Sigaud. Towards fast and adaptive optimal control policies for robots: A direct policy search approach. In Proceed-ings Robotica, pages 21-26, Guimaraes, Portugal, 2012.
    • (2012) Proceed-ings Robotica , pp. 21-26
    • Marin, D.1    Sigaud, O.2
  • 20
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor-critic
    • Jan Peters and Stefan Schaal. Natural actor-critic. Neurocom-puting, 71(7-9):1180-1190, 2008.
    • (2008) Neurocom-puting , vol.71 , Issue.7-9 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2
  • 26
    • 0031231885 scopus 로고    scopus 로고
    • Experiments with reinforcement learning in problems with continuous state and action spaces
    • J.C. Santamaría, R.S. Sutton, and A. Ram. Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive behavior, 6(2):163-217, 1997.
    • (1997) Adaptive Behavior , vol.6 , Issue.2 , pp. 163-217
    • Santamaría, J.C.1    Sutton, R.S.2    Ram, A.3
  • 29
    • 74049165047 scopus 로고    scopus 로고
    • From motor learning to interaction learning in robots
    • Springer-Verlag
    • O. Sigaud and J. Peters. From motor learning to interaction learning in robots. In From Motor Learning to Interaction Learningin Robots, volume 264, pages 1-12. Springer-Verlag, 2010.
    • (2010) From Motor Learning to Interaction Learningin Robots , vol.264 , pp. 1-12
    • Sigaud, O.1    Peters, J.2
  • 33
    • 84870935597 scopus 로고    scopus 로고
    • Reinforcement learning with sequences of motion primitives for robust manipulation
    • King-Sun Fu Best Paper Award of the IEEE Trans-actions on Robotics for the year 2012
    • Freek Stulp, Evangelos Theodorou, and Stefan Schaal. Reinforcement learning with sequences of motion primitives for robust manipulation. IEEE Transactions on Robotics, 28(6):1360-1370, 2012. King-Sun Fu Best Paper Award of the IEEE Trans-actions on Robotics for the year 2012.
    • (2012) IEEE Transactions on Robotics , vol.28 , Issue.6 , pp. 1360-1370
    • Stulp, F.1    Theodorou, E.2    Schaal, S.3
  • 35
    • 80052851862 scopus 로고    scopus 로고
    • Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives
    • Minija Tamosiumaite, Bojan Nemec, Ales Ude, and Florentin Wörgötter. Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives. Robots andAutonomous Systems, 59(11):910-922, 2011.
    • (2011) Robots andAutonomous Systems , vol.59 , Issue.11 , pp. 910-922
    • Tamosiumaite, M.1    Nemec, B.2    Ude, A.3    Wörgötter, F.4
  • 36
    • 79551503171 scopus 로고    scopus 로고
    • A generalized path integral control approach to reinforcement learning
    • Evangelos Theodorou, Jonas Buchli, and Stefan Schaal. A generalized path integral control approach to reinforcement learning. Journal of Machine Learning Research, 11:3137-3181, 2010.
    • (2010) Journal of Machine Learning Research , vol.11 , pp. 3137-3181
    • Theodorou, E.1    Buchli, J.2    Schaal, S.3
  • 38
    • 77957706006 scopus 로고    scopus 로고
    • Taskspecific generalization of discrete and periodic dynamic movement primitives
    • Ales Ude, Andrej Gams, Tamim Asfour, and Jun Morimoto. Taskspecific generalization of discrete and periodic dynamic movement primitives. IEEE Transactions on Robotics, 26(5): 800-815, 2010.
    • (2010) IEEE Transactions on Robotics , vol.26 , Issue.5 , pp. 800-815
    • Ude, A.1    Gams, A.2    Asfour, T.3    Morimoto, J.4
  • 39
    • 0002891388 scopus 로고    scopus 로고
    • Locally weighted projection regression: An o(n) algorithm for incremental real time learning in high dimensional spaces
    • S. Vijayakumar and S. Schaal. Locally weighted projection regression: An o(n) algorithm for incremental real time learning in high dimensional spaces. In Proceedings of the 17th InternationalConference on Machine Learning (ICML), pages 288-293, 2000.
    • (2000) Proceedings of the 17th InternationalConference on Machine Learning (ICML , pp. 288-293
    • Vijayakumar, S.1    Schaal, S.2
  • 41
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8: 229-256, 1992.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.