메뉴 건너뛰기




Volumn 3, Issue , 2010, Pages 1607-1612

Relative Entropy Policy Search

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; ENTROPY;

EID: 77958569725     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (512)

References (16)
  • 1
    • 0039816976 scopus 로고
    • Using local trajectory optimizers to speed up global optimization in dynamic programming
    • Atkeson, C. G. 1993. Using local trajectory optimizers to speed up global optimization in dynamic programming. In NIPS, 663-670.
    • (1993) NIPS , pp. 663-670
    • Atkeson, C.G.1
  • 4
    • 0348090400 scopus 로고    scopus 로고
    • The linear programming approach to approximate dynamic programming
    • de Farias, D. P., and Roy, B. V. 2003. The linear programming approach to approximate dynamic programming. Operations Research 51(6): 850-856.
    • (2003) Operations Research , vol.51 , Issue.6 , pp. 850-856
    • De Farias, D.P.1    Roy, B.V.2
  • 6
    • 77958597914 scopus 로고    scopus 로고
    • Variational methods for reinforcement learning
    • Furmston, T., and Barber, D. 2010. Variational methods for reinforcement learning. In AISTATS.
    • (2010) AISTATS
    • Furmston, T.1    Barber, D.2
  • 7
    • 57749096203 scopus 로고    scopus 로고
    • Adaptive importance sampling with automatic model selection in value function approximation
    • Hachiya, H.; Akiyama, T.; Sugiyama, M.; Peters, J.; 2008. Adaptive importance sampling with automatic model selection in value function approximation. In AAAI, 1351-1356.
    • (2008) AAAI , pp. 1351-1356
    • Hachiya, H.1    Akiyama, T.2    Sugiyama, M.3    Peters, J.4
  • 8
    • 77958608223 scopus 로고    scopus 로고
    • Hernandez, J. 2010. http://www.dia.fi.upm.es/ja-martin/download.htm.
    • (2010)
    • Hernandez, J.1
  • 12
    • 18544382314 scopus 로고    scopus 로고
    • Learning from scarce experience
    • Peshkin, L., and Shelton, C. R. 2002. Learning from scarce experience. In ICML, 498-505.
    • (2002) ICML , pp. 498-505
    • Peshkin, L.1    Shelton, C.R.2
  • 13
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor critic
    • Peters, J., and Schaal, S. 2008. Natural actor critic. Neuro-compuring 71(7-9): 1180-1190.
    • (2008) Neuro-compuring , vol.71 , Issue.7-9 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.