-
1
-
-
84880891360
-
Symbolic dynamic programming for first-order MDPs
-
C. Boutilier, R. Reitcr, and B. Price. Symbolic dynamic programming for first-order MDPs. In IJCA1-0I, 2001.
-
(2001)
IJCA1-0I
-
-
Boutilier, C.1
Reitcr, R.2
Price, B.3
-
2
-
-
13444255536
-
On constraint sampling for the linear programming approach to approximate dynamic programming
-
Submitted to
-
D.P. de Farias and B. Van Roy. On constraint sampling for the linear programming approach to approximate dynamic programming. Submitted to Math, of Operations Research, 2001.
-
(2001)
Math, of Operations Research
-
-
De Farias, D.P.1
Van Roy, B.2
-
3
-
-
85055854784
-
Probabilistic temporal reasoning
-
T. Dean and K. Kanazawa. Probabilistic temporal reasoning. In AAA1-88, 1988.
-
(1988)
AAA1-88
-
-
Dean, T.1
Kanazawa, K.2
-
4
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
5
-
-
0015440625
-
Learning and executing generalized robot plans
-
R E. Fikes, P. E. Hart, and N. J. Nilsson. Learning and executing generalized robot plans. Artf. Intel, 3(4):251-288, 1972.
-
(1972)
Artf. Intel
, vol.3
, Issue.4
, pp. 251-288
-
-
Fikes, R.E.1
Hart, P.E.2
Nilsson, N.J.3
-
7
-
-
1142281638
-
Context specific multiagent coordination and planning with factored MDPs
-
C. E. Guestrin, S. Venkataraman, and D. Roller. Context specific multiagent coordination and planning with factored MDPs. In AAA/-02, 2002.
-
(2002)
AAA/-02
-
-
Guestrin, C.E.1
Venkataraman, S.2
Roller, D.3
-
8
-
-
0006419533
-
Hierarchical solution of Markov decision processes using macro-actions
-
M. Hauskrecht, N. Meuleau, L. Kaelbling, T. Dean, and C. Boutilier. Hierarchical solution of Markov decision processes using macro-actions. In UAI, 1998.
-
(1998)
UAI
-
-
Hauskrecht, M.1
Meuleau, N.2
Kaelbling, L.3
Dean, T.4
Boutilier, C.5
-
9
-
-
0033189384
-
Learning action strategies for planning domains
-
DOI 10.1016/S0004-3702(99)00060-0
-
R. Khardon. Learning action strategics for planning domains. Artificial Intelligence, 113:125-148, 1999. (Pubitemid 30542740)
-
(1999)
Artificial Intelligence
, vol.113
, Issue.1
, pp. 125-148
-
-
Khardon, R.1
-
11
-
-
0038362668
-
Learning generalized policies in planning using concept languages
-
M. Martin and H. Gcffner. Learning generalized policies in planning using concept languages. In KR, 2000.
-
(2000)
KR
-
-
Martin, M.1
Gcffner, H.2
-
12
-
-
0346738900
-
Flexible decomposition algorithms for weakly coupled markov decision problems
-
R. Parr. Flexible decomposition algorithms for weakly coupled markov decision problems. In UAI-98, 1998.
-
(1998)
UAI-98
-
-
Parr, R.1
-
13
-
-
1542342765
-
Direct value-approximation for factored MDPs
-
D. Schuurmans and R. Patrascu. Direct value-approximation for factored MDPs. In NIPS-14, 2001.
-
(2001)
NIPS-14
-
-
Schuurmans, D.1
Patrascu, R.2
-
14
-
-
0000273218
-
Generalized polynomial approximations in Markovian decision processes
-
P. Schweitzer and A. Seidmann. Generalized polynomial approximations in Markovian decision processes. / of Mathematical Analysis and Applications, 110:568-582, 1985.
-
(1985)
Of Mathematical Analysis and Applications
, vol.110
, pp. 568-582
-
-
Schweitzer, P.1
Seidmann, A.2
-
16
-
-
0042496037
-
Discovering structure in multiple learning tasks: The TC algorithm
-
S. Thrun and J. O'Sullivan. Discovering structure in multiple learning tasks: The TC algorithm. In 1CML-96, 1996.
-
(1996)
1CML-96
-
-
Thrun, S.1
O'Sullivan, J.2
-
17
-
-
13444310066
-
Inductive policy selection for first-order MDPs
-
S. W. Yoon, A. Fern, and B. Givan. Inductive policy selection for first-order MDPs. In UAI-02, 2002.
-
(2002)
UAI-02
-
-
Yoon, S.W.1
Fern, A.2
Givan, B.3
|