SCOPUS 정보 검색 플랫폼

Proceedings - IEEE International Conference on Robotics and Automation

Volumn , Issue , 2010, Pages 2397-2403

Reinforcement learning of motor skills in high dimensions: A path integral approach

(3) Theodorou, Evangelos a Buchli, Jonas a Schaal, Stefan a

a UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS STATE-ACTION SPACES; EMPIRICAL EVALUATIONS; ESTIMATION THEORY; GENERAL APPROACH; GRADIENT BASED; GRADIENT LEARNING; HIGH DIMENSIONS; HIGH-DIMENSIONAL; MATRIX INVERSIONS; MOTOR SKILLS; MOTOR SYSTEMS; NUMERICAL INSTABILITY; NUMERICALLY ROBUST; OPTIMAL CONTROL THEORY; PARAMETERIZED CONTROL; PATH INTEGRAL; PATH INTEGRAL APPROACH; PERFORMANCE IMPROVEMENTS; REAL-WORLD; STOCHASTIC OPTIMAL CONTROL;

CONTROL; OPTIMIZATION; QUANTUM THEORY; REINFORCEMENT LEARNING; ROBOTICS; ROBOTS;

LEARNING ALGORITHMS;

EID: 77955836276 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ROBOT.2010.5509336 Document Type: Conference Paper

Times cited : (249)

References (26)

1
- 70049104346
- PhD thesis
- J. Peters. Machine learning of motor skills for robotics. PhD thesis, 2007.
- (2007) Machine Learning of Motor Skills for Robotics
- Peters, J.¹

2
- 48349140736
- Rollout sampling approximate policy iteration
- September
- Christos Dimitrakakis and Michail G. Lagoudakis. Rollout sampling approximate policy iteration. Machine Learning, 72(3), September 2008.
- (2008) Machine Learning , vol.72 , Issue.3
- Dimitrakakis, C.¹ Lagoudakis, M.G.²

3
- 0001234682
- Using em for reinforcement learning
- P. Dayan and G. Hinton. Using em for reinforcement learning. Neural Computation, 9, 1997.
- (1997) Neural Computation , vol.9
- Dayan, P.¹ Hinton, G.²

4
- 77955839714
- Learning motor primitives in robotics
- D. Schuurmans, J. Benigio, and D. Koller, editors, Cambridge, MA: MIT Press. clmc
- J. Koeber and J. Peters. Learning motor primitives in robotics. In D. Schuurmans, J. Benigio, and D. Koller, editors, Advances in Neural Information Processing Systems 21 (NIPS 2008), Vancouver, BC, Dec. 8-11, 2009. Cambridge, MA: MIT Press. clmc.
- Advances in Neural Information Processing Systems 21 (NIPS 2008), Vancouver, BC, Dec. 8-11, 2009
- Koeber, J.¹ Peters, J.²

5
- 38649095925
- Learning to control in operational space
- clmc
- J. Peters and S. Schaal. Learning to control in operational space. International Journal of Robotics Research, 27:197-212, 2008. clmc.
- (2008) International Journal of Robotics Research , vol.27 , pp. 197-212
- Peters, J.¹ Schaal, S.²

6
- 77955811545
- M. Toussaint and A. Storkey. Probabilistic inference for solving discrete and continuous state markov decision processes, 2006.
- (2006) Probabilistic Inference for Solving Discrete and Continuous State Markov Decision Processes
- Toussaint, M.¹ Storkey, A.²

7
- 61849173491
- Gaussian Process Dynamic Programming
- March
- Marc P. Deisenroth, Carl E. Rasmussen, and Jan Peters. Gaussian Process Dynamic Programming. Neurocomputing, 72(7-9):1508-1524, March 2009.
- (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1508-1524
- Deisenroth, M.P.¹ Rasmussen, C.E.² Peters, J.³

8
- 34547996989
- Bayesian actor-critic algorithms
- New York, NY, USA, ACM
- Mohammad Ghavamzadeh and Yaakov Engel. Bayesian actor-critic algorithms. In ICML '07: Proceedings of the 24th international conference on Machine learning, pages 297-304, New York, NY, USA, 2007. ACM.
- (2007) ICML '07: Proceedings of the 24th International Conference on Machine Learning , pp. 297-304
- Ghavamzadeh, M.¹ Engel, Y.²

9
- 70349327392
- Learning model-free robot control by a monte carlo em algorithm
- Nikos Vlassis, Marc Toussaint, Georgios Kontes, and Savas Piperidis. Learning model-free robot control by a monte carlo em algorithm. Auton. Robots, 27(2):123-130, 2009.
- (2009) Auton. Robots , vol.27 , Issue.2 , pp. 123-130
- Vlassis, N.¹ Toussaint, M.² Kontes, G.³ Piperidis, S.⁴

10
- 77955818655
- A path integral approach to agent planning
- H. J. Kappen, W. Wiegerinck, and B. van den Broek. A path integral approach to agent planning. In AAMAS, 2007.
- (2007) AAMAS
- Kappen, H.J.¹ Wiegerinck, W.² Van Den Broek, B.³

11
- 52249107868
- Graphical model inference in optimal control of stochastic multi-agent systems
- B. van den Broek, W. Wiegerinck, and B. Kappen. Graphical model inference in optimal control of stochastic multi-agent systems. Journal of Artificial Intelligence Research, 32(1):95-122, 2008.
- (2008) Journal of Artificial Intelligence Research , vol.32 , Issue.1 , pp. 95-122
- Van Den Broek, B.¹ Wiegerinck, W.² Kappen, B.³

12
- 84899019754
- Learning attractor landscapes for learning motor primitives
- S. Becker, S. Thrun, and K. Obermayer, editors, Cambridge, MA: MIT Press
- A. Ijspeert, J. Nakanishi, and S. Schaal. Learning attractor landscapes for learning motor primitives. In S. Becker, S. Thrun, and K. Obermayer, editors, Advances in Neural Information Processing Systems 15, pages 1547-1554. Cambridge, MA: MIT Press, 2003.
- (2003) Advances in Neural Information Processing Systems 15 , pp. 1547-1554
- Ijspeert, A.¹ Nakanishi, J.² Schaal, S.³

13
- 0004102479
- MIT Press, Cambridge, 97026416 Richard S. Sutton and Andrew G. Barto. Includes bibliographical references and index
- Richard S. Sutton and Andrew G. Barto. Reinforcement learning : An introduction. Adaptive computation and machine learning. MIT Press, Cambridge, 1998. 97026416 Richard S. Sutton and Andrew G. Barto. Includes bibliographical references (p. [291]-312) and index.
- (1998) Reinforcement Learning: An Introduction. Adaptive Computation and Machine Learning , pp. 291-312
- Sutton, R.S.¹ Barto, A.G.²

14
- 0004294973
- Dover books on advanced mathematics. Dover Publications, New York, 94020406 Robert F. Stengel. ill. ; 21 cm. Originally published: Stochastic optimal control. New York ; Wiley, c1986. With new pref. Includes bibliographical references and index
- Robert F. Stengel. Optimal control and estimation. Dover books on advanced mathematics. Dover Publications, New York, 1994. 94020406 Robert F. Stengel. ill. ; 21 cm. Originally published: Stochastic optimal control. New York ; Wiley, c1986. With new pref. Includes bibliographical references and index.
- (1994) Optimal Control and Estimation
- Stengel, R.F.¹

15
- 0003423896
- Springer, New York, 2nd edition, 2005929857 Wendell H. Fleming, H. Mete Soner. 25 cm
- Wendell Helms Fleming and H. Mete Soner. Controlled Markov processes and viscosity solutions. Applications of mathematics. Springer, New York, 2nd edition, 2006. 2005929857 Wendell H. Fleming, H. Mete Soner. 25 cm.
- (2006) Controlled Markov Processes and Viscosity Solutions. Applications of Mathematics
- Fleming, W.H.¹ Mete Soner, H.²

16
- 0003722979
- Universitext. Springer, Berlin ; New York, 6th edition
- B. K. ksendal. Stochastic differential equations : an introduction with applications. Universitext. Springer, Berlin ; New York, 6th edition, 2003.
- (2003) Stochastic Differential Equations: An Introduction with Applications
- Ksendal, B.K.¹

17
- 0031341708
- vol.3, Dec
- Jiongmin Yong. Relations among odes, pdes, fsdes, bsdes, and fbsdes. volume 3, pages 2779-2784 vol.3, Dec 1997.
- (1997) Relations among Odes, Pdes, Fsdes, Bsdes, and Fbsdes , vol.3 , pp. 2779-2784
- Yong, J.¹

18
- 29044440299
- Path integrals and symmetry breaking for optimal control theory
- H J Kappen. Path integrals and symmetry breaking for optimal control theory. Journal of Statistical Mechanics: Theory and Experiment, 2005(11):P11011, 2005.
- (2005) Journal of Statistical Mechanics: Theory and Experiment , vol.2005 , Issue.11 , pp. 11011
- Kappen, H.J.¹

19
- 33947410345
- An introduction to stochastic control theory, path integrals and reinforcement learning
- J. Marro, P. L. Garrido, and J. J. Torres, editors, Cooperative Behavior in Neural Systems, February
- H. J. Kappen. An introduction to stochastic control theory, path integrals and reinforcement learning. In J. Marro, P. L. Garrido, and J. J. Torres, editors, Cooperative Behavior in Neural Systems, volume 887 of American Institute of Physics Conference Series, pages 149-181, February 2007.
- (2007) American Institute of Physics Conference Series , vol.887 , pp. 149-181
- Kappen, H.J.¹

20
- 28844435646
- Linear theory for control of nonlinear stochastic systems
- Nov
- Hilbert J. Kappen. Linear theory for control of nonlinear stochastic systems. Phys. Rev. Lett., 95(20):200201, Nov 2005.
- (2005) Phys. Rev. Lett. , vol.95 , Issue.20 , pp. 200201
- Kappen, H.J.¹

21
- 34250635407
- Policy gradient methods for robotics
- J. Peters and S. Schaal. Policy gradient methods for robotics. In Proceedings of the IEEE International Conference on Intelligent Robotics Systems (IROS 2006), Beijing, Oct. 9-15, 2006.
- Proceedings of the IEEE International Conference on Intelligent Robotics Systems (IROS 2006), Beijing, Oct. 9-15, 2006
- Peters, J.¹ Schaal, S.²

22
- 40649109346
- Reinforcement learning for parameterized motor primitives
- clmc
- J. Peters and S. Schaal. Reinforcement learning for parameterized motor primitives. In Proceedings of the 2006 International Joint Conference on Neural Networks (IJCNN 2006), 2006. clmc.
- Proceedings of the 2006 International Joint Conference on Neural Networks (IJCNN 2006), 2006
- Peters, J.¹ Schaal, S.²

23
- 67650822173
- Submitted
- E. Todorov. Classic maximum principles and estimation-control dualities for nonlinear stochastic systems. 2009. (Submitted).
- (2009) Classic Maximum Principles and Estimation-control Dualities for Nonlinear Stochastic Systems
- Todorov, E.¹

24
- 67650915125
- Efficient computation of optimal actions
- Emanuel Todorov. Efficient computation of optimal actions. Proc Natl Acad Sci U S A, 106(28):11478-83.
- Proc Natl Acad Sci U S A , vol.106 , Issue.28 , pp. 11478-11483
- Todorov, E.¹

25
- 28844435646
- Linear theory for control of nonlinear stochastic systems
- Journal Article United States
- H. J. Kappen. Linear theory for control of nonlinear stochastic systems. Phys Rev Lett, 95(20):200201, 2005. Journal Article United States.
- (2005) Phys Rev Lett , vol.95 , Issue.20 , pp. 200201
- Kappen, H.J.¹

26
- 49949095696
- Stochastic optimal control in continuous space-time multi-agent system
- W. Wiegerinck, B. van den Broek, and H. J. Kappen. Stochastic optimal control in continuous space-time multi-agent system. In UAI, 2006.
- (2006) UAI
- Wiegerinck, W.¹ Van Den Broek, B.² Kappen, H.J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.