SCOPUS Information Search Platform

Proceedings of the National Conference on Artificial Intelligence

Volume 1, 2011, Pages 465-470

Optimal rewards versus leaf-evaluation heuristics in planning agents

Authors (3): Sorg, Jonathan (a); Singh, Satinder (a); Lewis, Richard L. (a)

(a) University of Michigan (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AGENT DESIGN; ALTERNATIVE APPROACH; COMPUTATIONAL CONSTRAINTS; COMPUTATIONAL RESOURCES; DESIGN APPROACHES; EVALUATION FUNCTION; HEURISTIC APPROACH; OPTIMAL REWARD; PLANNING AGENTS; REWARD FUNCTION; SPARSE SAMPLING; STATE SPACE;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; DESIGN; HEURISTIC METHODS; PLANT EXTRACTS; TREES (MATHEMATICS);

FUNCTION EVALUATION;

EID: 80055052859
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 13

References (11)

1
- 57749177069
- Potential-based shaping in model-based reinforcement learning
- AAAI Press
- Asmuth, J.; Littman, M. L.; and Zinkov, R. 2008. Potential-based shaping in model-based reinforcement learning. In Proceedings of the 23rd AAAI, 604-609. AAAI Press.
- (2008) Proceedings of the 23rd AAAI , pp. 604-609
- Asmuth, J.¹ Littman, M.L.² Zinkov, R.³

2
- 0034439308
- Stochastic optimization of controlled partially observable Markov decision processes
- Bartlett, P. L., and Baxter, J. 2000. Stochastic optimization of controlled partially observable Markov decision processes. In Proceedings of the 39th IEEE Conference on Decision and Control.
- (2000) Proceedings of the 39th IEEE Conference on Decision and Control
- Bartlett, P.L.¹ Baxter, J.²

3
- 57749181518
- Simulation-based approach to general game playing
- Finnsson, H., and Björnsson, Y. 2008. Simulation-based approach to general game playing. In Proceedings of the 23rd AAAI, 259-264.
- (2008) Proceedings of the 23rd AAAI , pp. 259-264
- Finnsson, H.¹ Björnsson, Y.²

4
- 70349295261
- Achieving master level play in 9 x 9 computer Go
- Gelly, S., and Silver, D. 2008. Achieving master level play in 9 x 9 computer Go. Proceedings of the 23rd AAAI.
- (2008) Proceedings of the 23rd AAAI
- Gelly, S.¹ Silver, D.²

5
- 84880649215
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- Kearns, M.; Mansour, Y.; and Ng, A. Y. 1999. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. In Proceedings of the 16th IJCAI, 1324-1331.
- (1999) Proceedings of the 16th IJCAI , pp. 1324-1331
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

6
- 33750293964
- Bandit based Monte-Carlo planning
- Kocsis, L., and Szepesvári, C. 2006. Bandit based Monte-Carlo planning. In Proceedings of the 17th ECML, 282-293.
- (2006) Proceedings of the 17th ECML , pp. 282-293
- Kocsis, L.¹ Szepesvári, C.²

7
- 80053212134
- Apprenticeship learning using inverse reinforcement learning and gradient methods
- Neu, G., and Szepesvári, C. 2007. Apprenticeship learning using inverse reinforcement learning and gradient methods. In Proceedings of the 23rd UAI, 295-302.
- (2007) Proceedings of the 23rd UAI , pp. 295-302
- Neu, G.¹ Szepesvári, C.²

8
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- Ng, A. Y.; Russell, S. J.; and Harada, D. 1999. Policy invariance under reward transformations: theory and application to reward shaping. In Proceedings of the 16th ICML, 278-287.
- (1999) Proceedings of the 16th ICML , pp. 278-287
- Ng, A.Y.¹ Russell, S.J.² Harada, D.³

9
- 0000218399
- Programming a computer for playing chess
- Shannon, C. E. 1950. Programming a computer for playing chess. In Philosophical Magazine Vol. 41, 256-275.
- (1950) Philosophical Magazine , vol.41 , pp. 256-275
- Shannon, C.E.¹

10
- 80055057262
- Gradient methods for internal reward optimization
- Sorg, J.; Singh, S.; and Lewis, R. L. 2010a. Gradient methods for internal reward optimization. In Advances in NIPS 23.
- (2010) Advances in NIPS , vol.23
- Sorg, J.¹ Singh, S.² Lewis, R.L.³

11
- 77956525933
- Internal rewards mitigate agent boundedness
- Sorg, J.; Singh, S.; and Lewis, R. L. 2010b. Internal rewards mitigate agent boundedness. In Proceedings of the 27th ICML.
- (2010) Proceedings of the 27th ICML
- Sorg, J.¹ Singh, S.² Lewis, R.L.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.