SCOPUS Information Search Platform

31st International Conference on Machine Learning, ICML 2014

Volume 3, 2014, Pages 2411-2420

Learning complex neural network policies with trajectory optimization

Levine, Sergey (a); Koltun, Vladlen (b)

a Stanford University (United States)

b ADOBE RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; TRAJECTORIES;

COMPLEX NEURAL NETWORKS; DIRECT POLICY SEARCH; EXPECTED COSTS; HIGH-DIMENSIONAL; LEARNING CONTROLLERS; PUSH RECOVERIES; TRAJECTORY OPTIMIZATION; UNEVEN TERRAIN;

COMPLEX NETWORKS;

EID: 84919913608 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited: 82

References (26)

1
- 0021618667
- Generalized eigenproblem algorithms and software for algebraic Riccati equations
- Arnold, W.F. III and Laub, A.J. Generalized eigenproblem algorithms and software for algebraic Riccati equations. Proceedings of the IEEE, 72(12), 1984.
- (1984) Proceedings of the IEEE , vol.72 , Issue.12
- Arnold, W.F.¹ Laub, A.J.²

2
- 0004055894
- Cambridge University Press, New York, NY, USA
- Boyd, S. and Vandenberghe, L. Convex Optimization. Cambridge University Press, New York, NY, USA, 2004.
- (2004) Convex Optimization
- Boyd, S.¹ Vandenberghe, L.²

3
- 80053441894
- PILCO: A model-based and data-efficient approach to policy search
- Deisenroth, M. and Rasmussen, C. PILCO: a model-based and data-efficient approach to policy search. In International Conference on Machine Learning (ICML), 2011.
- (2011) International Conference on Machine Learning (ICML)
- Deisenroth, M.¹ Rasmussen, C.²

4
- 84903590417
- A survey on policy search for robotics
- Deisenroth, M., Neumann, G., and Peters, J. A survey on policy search for robotics. Foundations and Trends in Robotics, 2(1-2), 2013.
- (2013) Foundations and Trends in Robotics , vol.2 , Issue.1-2
- Deisenroth, M.¹ Neumann, G.² Peters, J.³

5
- 84899019754
- Learning attractor landscapes for learning motor primitives
- Ijspeert, A., Nakanishi, J., and Schaal, S. Learning attractor landscapes for learning motor primitives. In Advances in Neural Information Processing Systems (NIPS), 2003.
- (2003) Advances in Neural Information Processing Systems (NIPS)
- Ijspeert, A.¹ Nakanishi, J.² Schaal, S.³

6
- 84862024986
- Optimal control as a graphical model inference problem
- Kappen, H. J., Gomez, V., and Opper, M. Optimal control as a graphical model inference problem. Machine Learning, 87(2), 2012.
- (2012) Machine Learning , vol.87 , Issue.2
- Kappen, H.J.¹ Gomez, V.² Opper, M.³

7
- 85060321083
- Learning motor primitives for robotics
- Kober, J. and Peters, J. Learning motor primitives for robotics. In International Conference on Robotics and Automation (ICRA), 2009.
- (2009) International Conference on Robotics and Automation (ICRA)
- Kober, J.¹ Peters, J.²

8
- 84884276459
- Reinforcement learning in robotics: A survey
- Kober, J., Bagnell, J. A., and Peters, J. Reinforcement learning in robotics: A survey. International Journal of Robotics Research, 32(11), 2013.
- (2013) International Journal of Robotics Research , vol.32 , Issue.11
- Kober, J.¹ Bagnell, J.A.² Peters, J.³

9
- 84869390608
- Design, analysis and learning control of a fully actuated micro wind turbine
- Kolter, J. Z., Jackowski, Z., and Tedrake, R. Design, analysis and learning control of a fully actuated micro wind turbine. In American Control Conference (ACC), 2012.
- (2012) American Control Conference (ACC)
- Kolter, J.Z.¹ Jackowski, Z.² Tedrake, R.³

10
- 84919939713
- Exploring deep and recurrent architectures for optimal control
- Levine, S. Exploring deep and recurrent architectures for optimal control. In NIPS 2013 Workshop on Deep Learning, 2013.
- (2013) NIPS 2013 Workshop on Deep Learning
- Levine, S.¹

11
- 84897529781
- Guided policy search
- Levine, S. and Koltun, V. Guided policy search. In International Conference on Machine Learning (ICML), 2013a.
- (2013) International Conference on Machine Learning (ICML)
- Levine, S.¹ Koltun, V.²

12
- 84898932265
- Variational policy search via trajectory optimization
- Levine, S. and Koltun, V. Variational policy search via trajectory optimization. In Advances in Neural Information Processing Systems (NIPS), 2013b.
- (2013) Advances in Neural Information Processing Systems (NIPS)
- Levine, S.¹ Koltun, V.²

13
- 85131220210
- Combining the benefits of function approximation and trajectory optimization
- Mordatch, I. and Todorov, E. Combining the benefits of function approximation and trajectory optimization. In Robotics: Science and Systems (RSS), 2014.
- (2014) Robotics: Science and Systems (RSS)
- Mordatch, I.¹ Todorov, E.²

14
- 84898978448
- Probabilistic movement primitives
- Paraschos, A., Daniel, C., Peters, J., and Neumann, G. Probabilistic movement primitives. In Advances in Neural Information Processing Systems (NIPS), 2013.
- (2013) Advances in Neural Information Processing Systems (NIPS)
- Paraschos, A.¹ Daniel, C.² Peters, J.³ Neumann, G.⁴

15
- 84886998125
- Applying the episodic natural actor-critic architecture to motor primitive learning
- Peters, J. and Schaal, S. Applying the episodic natural actor-critic architecture to motor primitive learning. In European Symposium on Artificial Neural Networks (ESANN), 2007.
- (2007) European Symposium on Artificial Neural Networks (ESANN)
- Peters, J.¹ Schaal, S.²

16
- 44949241322
- Reinforcement learning of motor skills with policy gradients
- Peters, J. and Schaal, S. Reinforcement learning of motor skills with policy gradients. Neural Networks, 21(4), 2008.
- (2008) Neural Networks , vol.21 , Issue.4
- Peters, J.¹ Schaal, S.²

17
- 84877282363
- On stochastic optimal control and reinforcement learning by approximate inference
- Rawlik, K., Toussaint, M., and Vijayakumar, S. On stochastic optimal control and reinforcement learning by approximate inference. In Robotics: Science and Systems (RSS), 2012.
- (2012) Robotics: Science and Systems (RSS)
- Rawlik, K.¹ Toussaint, M.² Vijayakumar, S.³

18
- 84867115891
- Agnostic system identification for model-based reinforcement learning
- Ross, S. and Bagnell, A. Agnostic system identification for model-based reinforcement learning. In International Conference on Machine Learning (ICML), 2012.
- (2012) International Conference on Machine Learning (ICML)
- Ross, S.¹ Bagnell, A.²

19
- 84899437369
- A reduction of imitation learning and structured prediction to no-regret online learning
- Ross, S., Gordon, G., and Bagnell, A. A reduction of imitation learning and structured prediction to no-regret online learning. In International Conference on Artificial Intelligence and Statistics (AISTATS), 2011.
- (2011) International Conference on Artificial Intelligence and Statistics (AISTATS)
- Ross, S.¹ Gordon, G.² Bagnell, A.³

20
- 84887290913
- Learning monocular reactive UAV control in cluttered natural environments
- Ross, S., Melik-Barkhudarov, N., Shankar, K. Shaurya, Wendel, A., Dey, D., Bagnell, J. A., and Hebert, M. Learning monocular reactive UAV control in cluttered natural environments. In International Conference on Robotics and Automation (ICRA), 2013.
- (2013) International Conference on Robotics and Automation (ICRA)
- Ross, S.¹ Melik-Barkhudarov, N.² Shankar, K.S.³ Wendel, A.⁴ Dey, D.⁵ Bagnell, J.A.⁶ Hebert, M.⁷

21
- 77955836276
- Reinforcement learning of motor skills in high dimensions: A path integral approach
- Theodorou, E., Buchli, J., and Schaal, S. Reinforcement learning of motor skills in high dimensions: a path integral approach. In International Conference on Robotics and Automation (ICRA), 2010.
- (2010) International Conference on Robotics and Automation (ICRA)
- Theodorou, E.¹ Buchli, J.² Schaal, S.³

22
- 84923382376
- Linearly-solvable Markov decision problems
- Todorov, E. Linearly-solvable Markov decision problems. In Advances in Neural Information Processing Systems (NIPS), 2006.
- (2006) Advances in Neural Information Processing Systems (NIPS)
- Todorov, E.¹

23
- 23944452693
- A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
- Todorov, E. and Li, W. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. In American Control Conference (ACC), 2005.
- (2005) American Control Conference (ACC)
- Todorov, E.¹ Li, W.²

24
- 84872292044
- MuJoCo: A physics engine for model-based control
- Todorov, E., Erez, T., and Tassa, Y. MuJoCo: A physics engine for model-based control. In International Conference on Intelligent Robots and Systems (IROS), 2012.
- (2012) International Conference on Intelligent Robots and Systems (IROS)
- Todorov, E.¹ Erez, T.² Tassa, Y.³

25
- 34547691027
- Simbicon: Simple biped locomotion control
- Yin, K., Loken, K., and van de Panne, M. SIMBICON: simple biped locomotion control. ACM Transactions on Graphics, 26(3), 2007.
- (2007) ACM Transactions on Graphics , vol.26 , Issue.3
- Yin, K.¹ Loken, K.² Van De Panne, M.³

26
- 84856113877
- PhD thesis, Carnegie Mellon University
- Ziebart, B. Modeling purposeful adaptive behavior with the principle of maximum causal entropy. PhD thesis, Carnegie Mellon University, 2010.
- (2010) Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy
- Ziebart, B.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.