SCOPUS Information Search Platform

Proceedings - IEEE International Conference on Robotics and Automation

Volume , Issue , 2014, Pages 3896-3902

Sample-based information-theoretic stochastic optimal control

(4) Lioutikov, Rudolf a; Paraschos, Alexandros a; Peters, Jan a,b; Neumann, Gerhard a

a DARMSTADT UNIVERSITY OF TECHNOLOGY (Germany)

b MAX PLANCK INSTITUTE FOR INTELLIGENT SYSTEMS (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

INFORMATION THEORY; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS; SYSTEM THEORY;

INFORMATION THEORETIC BOUNDS; MODEL-BASED REINFORCEMENT LEARNING; SIMULATED ROBOT; STATE-OF-THE-ART APPROACH; STOCHASTIC OPTIMAL CONTROL; SYSTEM DYNAMICS MODEL; UNDERLYING SYSTEMS; VALUE FUNCTIONS;

STOCHASTIC MODELS;

EID: 84908057666 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICRA.2014.6907424 Document Type: Conference Paper

Times cited : (53)

References (20)

1
- 0003886055
- John Wiley & Sons, Inc.
- R. Stengel, Stochastic Optimal Control: Theory and Application. John Wiley & Sons, Inc., 1986.
- (1986) Stochastic Optimal Control: Theory and Application
- Stengel, R.¹

2
- 33947410345
- An introduction to stochastic control theory, path integrals and reinforcement learning
- H. Kappen, "An Introduction to Stochastic Control Theory, Path Integrals and Reinforcement Learning," in Cooperative Behavior in Neural Systems, vol. 887, 2007.
- (2007) Cooperative Behavior in Neural Systems , vol.887
- Kappen, H.¹

3
- 71149083296
- Robot trajectory optimization using approximate inference
- M. Toussaint, "Robot Trajectory Optimization using Approximate Inference," in 26th International Conference on Machine Learning, ser. (ICML), 2009.
- (2009) 26th International Conference on Machine Learning, Ser. (ICML)
- Toussaint, M.¹

4
- 84877282363
- On stochastic optimal control and reinforcement learning by approximate inference
- K. Rawlik, M. Toussaint, and S. Vijayakumar, "On Stochastic Optimal Control and Reinforcement Learning by Approximate Inference," in Proceedings of Robotics: Science and Systems, 2012.
- (2012) Proceedings of Robotics: Science and Systems
- Rawlik, K.¹ Toussaint, M.² Vijayakumar, S.³

5
- 84887273882
- Learning sequential motor tasks
- submitted
- C. Daniel, G. Neumann, and J. Peters, "Learning Sequential Motor Tasks," in IEEE International Conference on Robotics and Automation (ICRA), 2013, submitted.
- (2013) IEEE International Conference on Robotics and Automation (ICRA)
- Daniel, C.¹ Neumann, G.² Peters, J.³

6
- 77958569725
- Relative entropy policy search
- J. Peters, K. Mülling, and Y. Altun, "Relative Entropy Policy Search," in Proceedings of the 24th National Conference on Artificial Intelligence (AAAI), 2010.
- (2010) Proceedings of the 24th National Conference on Artificial Intelligence (AAAI)
- Peters, J.¹ Mülling, K.² Altun, Y.³

7
- 0141708339
- Exploiting model uncertainty estimates for safe dynamic control learning
- J. G. Schneider, "Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning," in NIPS, 1997, pp. 1047-1053.
- (1997) NIPS , pp. 1047-1053
- Schneider, J.G.¹

8
- 80053441894
- PILCO: A model-based and data-efficient approach to policy search
- M. Deisenroth and C. Rasmussen, "PILCO: A Model-Based and Data-Efficient Approach to Policy Search," in 28th International Conference on Machine Learning (ICML), 2011.
- (2011) 28th International Conference on Machine Learning (ICML)
- Deisenroth, M.¹ Rasmussen, C.²

9
- 84887272277
- Minimax differential dynamic programming: An application to robust biped walking
- J. Morimoto and C. Atkeson, "Minimax differential dynamic programming: An application to robust biped walking," Neural Information Processing Systems 2002, 2002.
- (2002) Neural Information Processing Systems , vol.2002
- Morimoto, J.¹ Atkeson, C.²

10
- 23944452693
- A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
- E. Todorov and W. Li, "A Generalized Iterative LQG Method for Locally-Optimal Feedback Control of Constrained Nonlinear Stochastic Systems," in 24th American Control Conference (ACC), 2005.
- (2005) 24th American Control Conference (ACC)
- Todorov, E.¹

11
- 84880694195
- Stable function approximation in dynamic programming
- G. J. Gordon, "Stable Function Approximation in Dynamic Programming," in 12th International Conference on Machine Learning (ICML), 1995.
- (1995) 12th International Conference on Machine Learning (ICML)
- Gordon, G.J.¹

12
- 84870922246
- Dynamic policy programming
- no. Nov
- M. Azar, V. Gómez, and H. J. Kappen, "Dynamic Policy Programming," Journal of Machine Learning Research, vol. 13, no. Nov, pp. 3207-3245, 2012.
- (2012) Journal of Machine Learning Research , vol.13 , pp. 3207-3245
- Azar, M.¹ Gómez, V.² Kappen, H.J.³

13
- 4644323293
- Least-squares policy iteration
- December
- M. G. Lagoudakis and R. Parr, "Least-Squares Policy Iteration," Journal of Machine Learning Research, vol. 4, pp. 1107-1149, December 2003. [Online]. Available: http://dl.acm.org/citation.cfm?id=945365.964290
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

14
- 70049104729
- Fitted q-iteration by advantage weighted regression
- G. Neumann and J. Peters, "Fitted Q-Iteration by Advantage Weighted Regression," in Neural Information Processing Systems (NIPS), 2009.
- (2009) Neural Information Processing Systems (NIPS)
- Neumann, G.¹ Peters, J.²

15
- 77955836276
- Reinforcement learning of motor skills in high dimensions: A path integral approach
- E. Theodorou, J. Buchli, and S. Schaal, "Reinforcement Learning of Motor Skills in High Dimensions: A Path Integral Approach," in 2010 IEEE International Conference on Robotics and Automation (ICRA), 2010.
- (2010) 2010 IEEE International Conference on Robotics and Automation (ICRA)
- Theodorou, E.¹ Buchli, J.² Schaal, S.³

16
- 25444448065
- The MIT Press
- C. E. Rasmussen and C. K. I. Williams, Gaussian Processes for Machine Learning. The MIT Press, 2006.
- (2006) Gaussian Processes for Machine Learning
- Rasmussen, C.E.¹ Williams, C.K.I.²

17
- 0004055894
- Cambridge University Press
- S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University Press, 2004.
- (2004) Convex Optimization
- Boyd, S.¹ Vandenberghe, L.²

18
- 78650511722
- T. Mathworks. Matlab Optimization Toolbox User's Guide.
- Matlab Optimization Toolbox User's Guide
- Mathworks, T.¹

19
- 34250246420
- Elimination of bounds in optimization problems by transforming variables
- F. Sisser, "Elimination of bounds in Optimization Problems by Transforming Variables," Mathematical Programming, vol. 20, no. 1, pp. 110-121, 1981.
- (1981) Mathematical Programming , vol.20 , Issue.1 , pp. 110-121
- Sisser, F.¹

20
- 84884129561
- Ph.D. dissertation TU Darmstadt, Department of Computer Science, July 4
- T. Lens, "Physical human-robot interaction with a lightweight, elastic tendon driven robotic arm: Modeling, control, and safety analysis," Ph.D. dissertation, TU Darmstadt, Department of Computer Science, July 4 2012.
- (2012) Physical Human-robot Interaction with A Lightweight, Elastic Tendon Driven Robotic Arm: Modeling, Control, and Safety Analysis
- Lens, T.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.