SCOPUS Information Retrieval Platform

IJCAI International Joint Conference on Artificial Intelligence

Volume , Issue , 2003, Pages 1003-1010

Generalizing plans to new environments in relational MDPs

(4) Guestrin, Carlos ᵃ; Koller, Daphne ᵃ; Gearhart, Chris ᵃ; Kanodia, Neal ᵃ

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE VALUE FUNCTION; DIFFERENT CLASS; LOSS OF PERFORMANCE; MARKOV DECISION PROCESSES; MULTI-AGENT SETTING; PLANNING TIME; POLYNOMIAL NUMBER; PROBLEM-BASED;

MARKOV PROCESSES; MULTI AGENT SYSTEMS; OPTIMIZATION;

COMPUTER GAMES;

EID: 84880803349 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (151)

References (17)

1
- 84880891360
- Symbolic dynamic programming for first-order MDPs
- C. Boutilier, R. Reiter, and B. Price. Symbolic dynamic programming for first-order MDPs. In IJCAI-01, 2001.
- (2001) IJCAI-01
- Boutilier, C.¹ Reiter, R.² Price, B.³

2
- 13444255536
- On constraint sampling for the linear programming approach to approximate dynamic programming
- Submitted to
- D.P. de Farias and B. Van Roy. On constraint sampling for the linear programming approach to approximate dynamic programming. Submitted to Math. of Operations Research, 2001.
- (2001) Math. of Operations Research
- De Farias, D.P.¹ Van Roy, B.²

3
- 85055854784
- Probabilistic temporal reasoning
- T. Dean and K. Kanazawa. Probabilistic temporal reasoning. In AAAI-88, 1988.
- (1988) AAAI-88
- Dean, T.¹ Kanazawa, K.²

4
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

5
- 0015440625
- Learning and executing generalized robot plans
- R. E. Fikes, P. E. Hart, and N. J. Nilsson. Learning and executing generalized robot plans. Artf. Intel., 3(4):251-288, 1972.
- (1972) Artf. Intel. , vol.3 , Issue.4 , pp. 251-288
- Fikes, R.E.¹ Hart, P.E.² Nilsson, N.J.³

6
- 0012296128
- Multiagent planning with factored MDPs
- C. E. Guestrin, D. Koller, and R. Parr. Multiagent planning with factored MDPs. In NIPS-14, 2001.
- (2001) NIPS-14
- Guestrin, C.E.¹ Koller, D.² Parr, R.³

7
- 1142281638
- Context specific multiagent coordination and planning with factored MDPs
- C. E. Guestrin, S. Venkataraman, and D. Koller. Context specific multiagent coordination and planning with factored MDPs. In AAAI-02, 2002.
- (2002) AAAI-02
- Guestrin, C.E.¹ Venkataraman, S.² Koller, D.³

8
- 0006419533
- Hierarchical solution of Markov decision processes using macro-actions
- M. Hauskrecht, N. Meuleau, L. Kaelbling, T. Dean, and C. Boutilier. Hierarchical solution of Markov decision processes using macro-actions. In UAI, 1998.
- (1998) UAI
- Hauskrecht, M.¹ Meuleau, N.² Kaelbling, L.³ Dean, T.⁴ Boutilier, C.⁵

9
- 0033189384
- Learning action strategies for planning domains
- DOI 10.1016/S0004-3702(99)00060-0
- R. Khardon. Learning action strategies for planning domains. Artificial Intelligence, 113:125-148, 1999. (Pubitemid 30542740)
- (1999) Artificial Intelligence , vol.113 , Issue.1 , pp. 125-148
- Khardon, R.¹

10
- 0003199849
- Probabilistic frame-based systems
- D. Koller and A. Pfeffer. Probabilistic frame-based systems. In AAAI, 1998.
- (1998) AAAI
- Koller, D.¹ Pfeffer, A.²

11
- 0038362668
- Learning generalized policies in planning using concept languages
- M. Martin and H. Geffner. Learning generalized policies in planning using concept languages. In KR, 2000.
- (2000) KR
- Martin, M.¹ Geffner, H.²

12
- 0346738900
- Flexible decomposition algorithms for weakly coupled Markov decision problems
- R. Parr. Flexible decomposition algorithms for weakly coupled Markov decision problems. In UAI-98, 1998.
- (1998) UAI-98
- Parr, R.¹

13
- 1542342765
- Direct value-approximation for factored MDPs
- D. Schuurmans and R. Patrascu. Direct value-approximation for factored MDPs. In NIPS-14, 2001.
- (2001) NIPS-14
- Schuurmans, D.¹ Patrascu, R.²

14
- 0000273218
- Generalized polynomial approximations in Markovian decision processes
- P. Schweitzer and A. Seidmann. Generalized polynomial approximations in Markovian decision processes. J. of Mathematical Analysis and Applications, 110:568-582, 1985.
- (1985) J. of Mathematical Analysis and Applications , vol.110 , pp. 568-582
- Schweitzer, P.¹ Seidmann, A.²

15
- 0004102479
- MIT Press, Cambridge, MA
- R. Sutton and A. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

16
- 0042496037
- Discovering structure in multiple learning tasks: The TC algorithm
- S. Thrun and J. O'Sullivan. Discovering structure in multiple learning tasks: The TC algorithm. In ICML-96, 1996.
- (1996) ICML-96
- Thrun, S.¹ O'Sullivan, J.²

17
- 13444310066
- Inductive policy selection for first-order MDPs
- S. W. Yoon, A. Fern, and B. Givan. Inductive policy selection for first-order MDPs. In UAI-02, 2002.
- (2002) UAI-02
- Yoon, S.W.¹ Fern, A.² Givan, B.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.