메뉴 건너뛰기




Volumn 1, Issue , 2011, Pages 465-470

Optimal rewards versus leaf-evaluation heuristics in planning agents

Author keywords

[No Author keywords available]

Indexed keywords

AGENT DESIGN; ALTERNATIVE APPROACH; COMPUTATIONAL CONSTRAINTS; COMPUTATIONAL RESOURCES; DESIGN APPROACHES; EVALUATION FUNCTION; HEURISTIC APPROACH; OPTIMAL REWARD; PLANNING AGENTS; REWARD FUNCTION; SPARSE SAMPLING; STATE SPACE;

EID: 80055052859     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (13)

References (11)
  • 1
    • 57749177069 scopus 로고    scopus 로고
    • Potential-based shaping in model-based reinforcement learning
    • AAAI Press
    • Asmuth, J.; Littman, M. L.; and Zinkov, R. 2008. Potential-based shaping in model-based reinforcement learning. In Proceedings of the 23rd AAAI, 604-609. AAAI Press.
    • (2008) Proceedings of the 23rd AAAI , pp. 604-609
    • Asmuth, J.1    Littman, M.L.2    Zinkov, R.3
  • 5
    • 84880649215 scopus 로고    scopus 로고
    • A sparse sampling algorithm for near-optimal planning in large Markov decision processes
    • Kearns, M.; Mansour, Y.; and Ng, A. Y. 1999. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. In Proceedings of the 16th IJCAI, 1324-1331.
    • (1999) Proceedings of the 16th IJCAI , pp. 1324-1331
    • Kearns, M.1    Mansour, Y.2    Ng, A.Y.3
  • 7
    • 80053212134 scopus 로고    scopus 로고
    • Apprenticeship learning using inverse reinforcement learning and gradient methods
    • Neu, G., and Szepesvári, C. 2007. Apprenticeship learning using inverse reinforcement learning and gradient methods. In Proceedings of the 23rd UAI, 295-302.
    • (2007) Proceedings of the 23rd UAI , pp. 295-302
    • Neu, G.1    Szepesvári, C.2
  • 8
    • 0141596576 scopus 로고    scopus 로고
    • Policy invariance under reward transformations: Theory and application to reward shaping
    • Ng, A. Y.; Russell, S. J.; and Harada, D. 1999. Policy invariance under reward transformations: theory and application to reward shaping. In Proceedings of the 16th ICML, 278-287.
    • (1999) Proceedings of the 16th ICML , pp. 278-287
    • Ng, A.Y.1    Russell, S.J.2    Harada, D.3
  • 9
    • 0000218399 scopus 로고
    • Programming a computer for playing chess
    • Shannon, C. E. 1950. Programming a computer for playing chess. In Philosophical Magazine Vol. 41, 256-275.
    • (1950) Philosophical Magazine , vol.41 , pp. 256-275
    • Shannon, C.E.1
  • 10
    • 80055057262 scopus 로고    scopus 로고
    • Gradient methods for internal reward optimization
    • Sorg, J.; Singh, S.; and Lewis, R. L. 2010a. Gradient methods for internal reward optimization. In Advances in NIPS 23.
    • (2010) Advances in NIPS , vol.23
    • Sorg, J.1    Singh, S.2    Lewis, R.L.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.