메뉴 건너뛰기




Volumn 3, Issue , 2014, Pages 2411-2420

Learning complex neural network policies with trajectory optimization

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; TRAJECTORIES;

EID: 84919913608     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (82)

References (26)
  • 1
    • 0021618667 scopus 로고
    • Generalized eigen problem algorithms and software for algebraic Riccati equations
    • Arnold, W.F. III and Laub, A.J. Generalized eigen problem algorithms and software for algebraic Riccati equations. Proceedings of the IEEE, 72(12), 1984.
    • (1984) Proceedings of the IEEE , vol.72 , Issue.12
    • Arnold, W.F.1    Laub, A.J.2
  • 6
    • 84862024986 scopus 로고    scopus 로고
    • Optimal control as a graphical model inference problem
    • Kappen, H. J., Gomez, V., and Opper, M. Optimal control as a graphical model inference problem. Machine Learning, 87(2), 2012.
    • (2012) Machine Learning , vol.87 , Issue.2
    • Kappen, H.J.1    Gomez, V.2    Opper, M.3
  • 10
    • 84919939713 scopus 로고    scopus 로고
    • Exploring deep and recurrent architectures for optimal control
    • Levine, S. Exploring deep and recurrent architectures for optimal control. In NIPS 2013 Workshop on Deep Learning, 2013.
    • (2013) NIPS 2013 Workshop on Deep Learning
    • Levine, S.1
  • 13
    • 85131220210 scopus 로고    scopus 로고
    • Combining the benefits of function approximation and trajectory optimization
    • Mordatch, I. and Todorov, E. Combining the benefits of function approximation and trajectory optimization. In Robotics: Science and Systems (RSS), 2014.
    • (2014) Robotics: Science and Systems (RSS)
    • Mordatch, I.1    Todorov, E.2
  • 16
    • 44949241322 scopus 로고    scopus 로고
    • Reinforcement learning of motor skills with policy gradients
    • Peters, J. and Schaal, S. Reinforcement learning of motor skills with policy gradients. Neural Networks, 21(4), 2008.
    • (2008) Neural Networks , vol.21 , Issue.4
    • Peters, J.1    Schaal, S.2
  • 23
    • 23944452693 scopus 로고    scopus 로고
    • A generalized iterative LQG method for locally-optimal feedback control of constrained non-linear stochastic systems
    • Todorov, E. and Li, W. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. In American Control Conference (ACC), 2005.
    • (2005) American Control Conference (ACC)
    • Todorov, E.1    Li, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.