메뉴 건너뛰기




Volumn , Issue , 2013, Pages

Projected natural actor-critic

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES; REINFORCEMENT LEARNING;

EID: 84899017702     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (23)

References (32)
  • 1
    • 0000396062 scopus 로고    scopus 로고
    • Natural gradient works efficiently in learning
    • S. Amari. Natural gradient works efficiently in learning. Neural Computation, 10:251-276, 1998.
    • (1998) Neural Computation , vol.10 , pp. 251-276
    • Amari, S.1
  • 3
    • 0037211015 scopus 로고    scopus 로고
    • Fast calculation of stabilizing PID controllers
    • M. T. Söylemez, N. Munro, and H. Baki. Fast calculation of stabilizing PID controllers. Automatica, 39 (1):121-126, 2003.
    • (2003) Automatica , vol.39 , Issue.1 , pp. 121-126
    • Söylemez, M.T.1    Munro, N.2    Baki, H.3
  • 11
    • 84898984859 scopus 로고    scopus 로고
    • Control design for Markov chains under safety constraints: A convex approach
    • abs/1209.2883
    • E. Arvelo and N. C. Martins. Control design for Markov chains under safety constraints: A convex approach. CoRR, abs/1209.2883, 2012.
    • (2012) CoRR
    • Arvelo, E.1    Martins, N.C.2
  • 12
    • 31144477417 scopus 로고    scopus 로고
    • Risk-sensitive reinforcement learning applied to control under constraints
    • P. Geibel and F. Wysotzki. Risk-sensitive reinforcement learning applied to control under constraints. Journal of Artificial Intelligence Research 24, pages 81-108, 2005.
    • (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 81-108
    • Geibel, P.1    Wysotzki, F.2
  • 13
  • 18
    • 0037403111 scopus 로고    scopus 로고
    • Mirror descent and nonlinear projected subgradient methods for convex optimization
    • A. Beck and M. Teboulle. Mirror descent and nonlinear projected subgradient methods for convex optimization. Operations Research Letters, 2003.
    • (2003) Operations Research Letters
    • Beck, A.1    Teboulle, M.2
  • 26
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor-critic
    • J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71:1180-1190, 2008.
    • (2008) Neurocomputing , vol.71 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2
  • 30
    • 67349216631 scopus 로고    scopus 로고
    • Combined feedforward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system
    • D. Blana, R. F. Kirsch, and E. K. Chadwick. Combined feedforward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system. Medical and Biological Engineering and Computing, 47: 533-542, 2009.
    • (2009) Medical and Biological Engineering and Computing , vol.47 , pp. 533-542
    • Blana, D.1    Kirsch, R.F.2    Chadwick, E.K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.