-
3
-
-
0034248853
-
Stochastic dynamic programming with factored representations
-
Boutilier, C.; Dearden, R.; and Goldszmidt, M. 2000. Stochastic dynamic programming with factored representations. Artificial Intelligence 121(1-2):49-107.
-
(2000)
Artificial Intelligence
, vol.121
, Issue.1-2
, pp. 49-107
-
-
Boutilier, C.1
Dearden, R.2
Goldszmidt, M.3
-
7
-
-
0000423702
-
Robust combination of local controllers
-
Breese, J., and Koller, D., eds., Seattle, WA: Morgan Kaufmann
-
Guestrin, C., and Ormoneit, D. 2001. Robust combination of local controllers. In Breese, J., and Koller, D., eds., Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI-01), 178-185. Seattle, WA: Morgan Kaufmann.
-
(2001)
Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI-01)
, pp. 178-185
-
-
Guestrin, C.1
Ormoneit, D.2
-
8
-
-
0006419533
-
Hierarchical solution of Markov decision processes using macro-actions
-
Cooper, G. F., and Moral, S., eds., Morgan Kaufmann
-
Hauskrecht, M.; Meuleau, N.; Boutilier, C.; Kaelbling, L. P.; and Dean, T. 1998. Hierarchical solution of Markov decision processes using macro-actions. In Cooper, G. F., and Moral, S., eds., Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98). Morgan Kaufmann.
-
(1998)
Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98)
-
-
Hauskrecht, M.1
Meuleau, N.2
Boutilier, C.3
Kaelbling, L.P.4
Dean, T.5
-
11
-
-
84880677563
-
Efficient reinforcement learning in factored MDPs
-
Dean, T., ed., Stockholm, Sweden: Morgan Kaufmann
-
Kearns, M., and Koller, D. 1999. Efficient reinforcement learning in factored MDPs. In Dean, T., ed., Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99), 740-747. Stockholm, Sweden: Morgan Kaufmann.
-
(1999)
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99)
, pp. 740-747
-
-
Kearns, M.1
Koller, D.2
-
17
-
-
84880688141
-
Multi-value-functions: Efficient automatic action hierarchies for multiple goal MDPs
-
Dean, T., ed., Stockholm, Sweden: Morgan Kaufmann
-
Moore, A. W.; Baird, L. C.; and Kaelbling, L. 1999. Multi-value-functions: Efficient automatic action hierarchies for multiple goal MDPs. In Dean, T., ed., Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99). Stockholm, Sweden: Morgan Kaufmann.
-
(1999)
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99)
-
-
Moore, A.W.1
Baird, L.C.2
Kaelbling, L.3
-
18
-
-
0001205548
-
Complexity of finite-horizon markov decision process problems
-
Mundhenk, M.; Goldsmith, J.; Lusena, C.; and Allender, E. 2000. Complexity of finite-horizon markov decision process problems. Journal of the ACM 47(4):681-720.
-
(2000)
Journal of the ACM
, vol.47
, Issue.4
, pp. 681-720
-
-
Mundhenk, M.1
Goldsmith, J.2
Lusena, C.3
Allender, E.4
-
19
-
-
0346738900
-
Flexible decomposition algorithms for weakly coupled Markov decision problems
-
Cooper, G. F., and Moral, S., eds., Morgan Kaufmann
-
Parr, R. 1998a. Flexible decomposition algorithms for weakly coupled Markov decision problems. In Cooper, G. F., and Moral, S., eds., Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98). Morgan Kaufmann.
-
(1998)
Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98)
-
-
Parr, R.1
-
21
-
-
0003392384
-
-
Ph.D. Dissertation, University of Massachusetts, Amherst, Department of Computer Science
-
Precup, D. 2000. Temporal Abstraction in Reinforcement Learning. Ph.D. Dissertation, University of Massachusetts, Amherst, Department of Computer Science.
-
(2000)
Temporal Abstraction in Reinforcement Learning
-
-
Precup, D.1
-
23
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Sutton, R. S.; Precup, D.; and Singh, S. 1999. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112:181-211.
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
|