-
2
-
-
0012352653
-
Approximating value trees in structured dynamic programming
-
C. Boutilier and R. Dearden. Approximating value trees in structured dynamic programming. In Proc. ICML, pages 54-62, 1996.
-
(1996)
Proc. ICML
, pp. 54-62
-
-
Boutilier, C.1
Dearden, R.2
-
3
-
-
0001811022
-
Prioritized goal decomposition of Markov decision processes: Towards a synthesis of classical and decision theoretic planning
-
C. Boutilier, R.I. Brafman, and C. Geib. Prioritized goal decomposition of Markov decision processes: Towards a synthesis of classical and decision theoretic planning. In Proc. UAI, pages 24-32, 1998.
-
(1998)
Proc. UAI
, pp. 24-32
-
-
Boutilier, C.1
Brafman, R.I.2
Geib, C.3
-
5
-
-
84880686930
-
Tractable inference for complex stochastic processes
-
X. Boyen and D. Koller. Tractable inference for complex stochastic processes. In Proc. UAI, 1998.
-
Proc. UAI, 1998
-
-
Boyen, X.1
Koller, D.2
-
6
-
-
0000746330
-
Model reduction techniques for computing approximately optimal solutions for Markov decision processes
-
T. Dean, R. Givan, and S. Leach. Model reduction techniques for computing approximately optimal solutions for Markov decision processes. In Proc. UAI, 1997.
-
Proc. UAI, 1997
-
-
Dean, T.1
Givan, R.2
Leach, S.3
-
7
-
-
84880694195
-
Stable function approximation in dynamic programming
-
G J. Gordon. Stable function approximation in dynamic programming. In Proc. ICML, pages 261-268, 1995.
-
(1995)
Proc. ICML
, pp. 261-268
-
-
Gordon, G.J.1
-
10
-
-
0031632806
-
Solving very large weakly coupled Markov decision processes
-
N. Meuleau, M. Hauskrecht, K-E. Kim, L. Peshkin, L.P. Kaelbling, T. Dean, and C. Boutilier. Solving very large weakly coupled Markov decision processes. In Proc. AAA/, pages 165-172, 1998.
-
(1998)
Proc. AAA
, pp. 165-172
-
-
Meuleau, N.1
Hauskrecht, M.2
Kim, K.-E.3
Peshkin, L.4
Kaelbling, L.P.5
Dean, T.6
Boutilier, C.7
-
12
-
-
84899022377
-
How to dynamically merge Markov decision processes
-
S.P. Singh and D. Cohn. How to dynamically merge Markov decision processes. In NIPS 10, pages 1057-1063, 1998.
-
(1998)
NIPS 10
, pp. 1057-1063
-
-
Singh, S.P.1
Cohn, D.2
-
14
-
-
0000672258
-
Improved switching among temporally abstract actions
-
To appear
-
R.S. Sutton, S. Singh, D. Precup, and B. Ravindran. Improved switching among temporally abstract actions. In NIPS 12, 1999. To appear.
-
(1999)
NIPS 12
-
-
Sutton, R.S.1
Singh, S.2
Precup, D.3
Ravindran, B.4
-
15
-
-
0002313852
-
Scaling up average reward reinforcement learning by approximating the domain models and the value function
-
P. Tadepalli and D. Ok. Scaling up average reward reinforcement learning by approximating the domain models and the value function. In Proc. ICML, 1996.
-
Proc. ICML, 1996
-
-
Tadepalli, P.1
Ok, D.2
-
16
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
J. D. Tsitsiklis and B. Van Roy. Feature-based methods for large scale dynamic programming. Machine Learning, 22(1):59-94, January 1996. (Pubitemid 126724363)
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
|