2014, Pages 3896-3902

Sample-based information-theoretic stochastic optimal control

Author keywords

[No Author keywords available]

Indexed keywords

INFORMATION THEORY; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC SYSTEMS; SYSTEM THEORY;

EID: 84908057666     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICRA.2014.6907424     Document Type: Conference Paper
Times cited: 53

References (20)
  • 2
    • 33947410345
    • An introduction to stochastic control theory, path integrals and reinforcement learning
    • H. Kappen, "An Introduction to Stochastic Control Theory, Path Integrals and Reinforcement Learning," in Cooperative Behavior in Neural Systems, vol. 887, 2007.
    • (2007) Cooperative Behavior in Neural Systems , vol.887
    • Kappen, H.1
  • 7
    • 0141708339
    • Exploiting model uncertainty estimates for safe dynamic control learning
    • J. G. Schneider, "Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning," in NIPS, 1997, pp. 1047-1053.
    • (1997) NIPS , pp. 1047-1053
    • Schneider, J.G.1
  • 9
    • 84887272277
    • Minimax differential dynamic programming: An application to robust biped walking
    • J. Morimoto and C. Atkeson, "Minimax differential dynamic programming: An application to robust biped walking," Neural Information Processing Systems 2002, 2002.
    • (2002) Neural Information Processing Systems , vol.2002
    • Morimoto, J.1    Atkeson, C.2
  • 10
    • 23944452693
    • A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
    • E. Todorov and W. L., "A Generalized Iterative LQG Method for Locally-Optimal Feedback Control of Constrained Nonlinear Stochastic Systems," in 24th American Control Conference (ACC), 2005.
    • (2005) 24th American Control Conference (ACC)
    • Todorov, E.1
  • 13
    • 4644323293
    • Least-squares policy iteration
    • M. G. Lagoudakis and R. Parr, "Least-Squares Policy Iteration," Journal of Machine Learning Research, vol. 4, pp. 1107-1149, December 2003. [Online]. Available: http://dl.acm.org/citation.cfm?id=945365.964290
    • (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
    • Lagoudakis, M.G.1    Parr, R.2
  • 19
    • 34250246420
    • Elimination of bounds in optimization problems by transforming variables
    • F. Sisser, "Elimination of bounds in Optimization Problems by Transforming Variables," Mathematical Programming, vol. 20, no. 1, pp. 110-121, 1981.
    • (1981) Mathematical Programming , vol.20 , Issue.1 , pp. 110-121
    • Sisser, F.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.