메뉴 건너뛰기




Volumn , Issue , 2013, Pages

Variational policy search via trajectory optimization

Author keywords

[No Author keywords available]

Indexed keywords

AERODYNAMICS; ALGORITHMS; DYNAMICAL SYSTEMS; SPACE RESEARCH; TRAJECTORIES;

EID: 84898932265     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (116)

References (23)
  • 12
    • 44949241322 scopus 로고    scopus 로고
    • Reinforcement learning of motor skills with policy gradients
    • J. Peters and S. Schaal. Reinforcement learning of motor skills with policy gradients. Neural Networks, 21(4):682-697, 2008.
    • (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
    • Peters, J.1    Schaal, S.2
  • 13
    • 84877282363 scopus 로고    scopus 로고
    • On stochastic optimal control and reinforcement learning by approximate inference
    • K. Rawlik, M. Toussaint, and S. Vijayakumar. On stochastic optimal control and reinforcement learning by approximate inference. In Robotics: Science and Systems, 2012.
    • (2012) Robotics: Science and Systems
    • Rawlik, K.1    Toussaint, M.2    Vijayakumar, S.3
  • 14
    • 84862273266 scopus 로고    scopus 로고
    • A reduction of imitation learning and structured prediction to no-regret online learning
    • S. Ross, G. Gordon, and A. Bagnell. A reduction of imitation learning and structured prediction to no-regret online learning. Journal of Machine Learning Research, 15:627-635, 2011.
    • (2011) Journal of Machine Learning Research , vol.15 , pp. 627-635
    • Ross, S.1    Gordon, G.2    Bagnell, A.3
  • 15
    • 84950871099 scopus 로고
    • Accurate approximations for posterior moments and marginal densities
    • L. Tierney and J. B. Kadane. Accurate approximations for posterior moments and marginal densities. Journal of the American Statistical Association, 81(393):82-86, 1986.
    • (1986) Journal of the American Statistical Association , vol.81 , Issue.393 , pp. 82-86
    • Tierney, L.1    Kadane, J.B.2
  • 17
    • 23944452693 scopus 로고    scopus 로고
    • A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
    • E. Todorov and W. Li. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. In American Control Conference, 2005.
    • (2005) American Control Conference
    • Todorov, E.1    Li, W.2
  • 21
    • 70349327392 scopus 로고    scopus 로고
    • Learning model-free robot control by a Monte Carlo em algorithm
    • N. Vlassis, M. Toussaint, G. Kontes, and S. Piperidis. Learning model-free robot control by a Monte Carlo EM algorithm. Autonomous Robots, 27(2):123-130, 2009.
    • (2009) Autonomous Robots , vol.27 , Issue.2 , pp. 123-130
    • Vlassis, N.1    Toussaint, M.2    Kontes, G.3    Piperidis, S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.