메뉴 건너뛰기




Volumn , Issue , 2010, Pages

Policy gradients in linearly-solvable MDPs

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS TIME SYSTEMS;

EID: 85162021468     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (15)

References (18)
  • 1
    • 0000396062 scopus 로고    scopus 로고
    • Natural gradient works efficiently in learning
    • S. Amari. Natural gradient works efficiently in learning. Neural Computation, 10:251-276, 1998.
    • (1998) Neural Computation , vol.10 , pp. 251-276
    • Amari, S.1
  • 4
    • 0020203191 scopus 로고
    • Optimal control and nonlinear filtering for nondegenerate diffusion processes
    • W. Fleming and S. Mitter. Optimal control and nonlinear filtering for nondegenerate diffusion processes. Stochastics, 8:226-261, 1982.
    • (1982) Stochastics , vol.8 , pp. 226-261
    • Fleming, W.1    Mitter, S.2
  • 7
    • 28844435646 scopus 로고    scopus 로고
    • Linear theory for control of nonlinear stochastic systems
    • H. Kappen. Linear theory for control of nonlinear stochastic systems. Physical Review Letters, 95, 2005.
    • (2005) Physical Review Letters , vol.95
    • Kappen, H.1
  • 11
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor-critic
    • J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71:1180-1190, 2008.
    • (2008) Neurocomputing , vol.71 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2
  • 16
    • 67650915125 scopus 로고    scopus 로고
    • Efficient computation of optimal actions
    • E. Todorov. Efficient computation of optimal actions. PNAS, 106:11478-11483, 2009.
    • (2009) PNAS , vol.106 , pp. 11478-11483
    • Todorov, E.1
  • 17
    • 85162042971 scopus 로고    scopus 로고
    • Eigen-function approximation methods for linearly-solvable optimal control problems
    • E. Todorov. Eigen-function approximation methods for linearly-solvable optimal control problems. IEEE ADPRL, 2009.
    • (2009) IEEE ADPRL
    • Todorov, E.1
  • 18
    • 0000337576 scopus 로고
    • Simple statistical gradient following algorithms for connectionist reinforcement learning
    • R. Williams. Simple statistical gradient following algorithms for connectionist reinforcement learning. Machine Learning, pages 229-256, 1992.
    • (1992) Machine Learning , pp. 229-256
    • Williams, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.