-
3
-
-
0346942368
-
Decision-Theoretic Planning: Structural Assumptions and Computational Leverage
-
C. Boutilier, T. Dean, and S. Hanks. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage. Journal of Artificial Intelligence Research, 11:1-94, 1999. (Pubitemid 129628760)
-
(1999)
Journal of Artificial Intelligence Research
, vol.11
, pp. 1-94
-
-
Boutilier, C.1
Dean, T.2
Hanks, S.3
-
4
-
-
0002278788
-
Hierarchical reinforcement learning with the maxq value function decomposition
-
T. G. Dietterich. Hierarchical reinforcement learning with the maxq value function decomposition. Artificial Intelligence Research, 13:227-303, 2000.
-
(2000)
Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
5
-
-
0000746330
-
Model Reduction Techniques for Computing Approximately Optimal Solutions for Markov Decision Processes
-
San Francisco, CA, Morgan Kaufmann Publishers
-
R. Givan, T. Dean, and S. Leach. Model Reduction Techniques for Computing Approximately Optimal Solutions for Markov Decision Processes. In Proceedings of the 13th Annual Conference on Uncertainty in Artificial Intelligence (UAI-97), pages 124-131, San Francisco, CA, 1997. Morgan Kaufmann Publishers.
-
(1997)
Proceedings of the 13th Annual Conference on Uncertainty in Artificial Intelligence (UAI-97)
, pp. 124-131
-
-
Givan, R.1
Dean, T.2
Leach, S.3
-
6
-
-
29344435556
-
Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies
-
AAAI
-
S. Goel and M. Huber. Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies. In Proceedings of the 16th International FLAIRS Conference, pages 346-350. AAAI, 2003.
-
(2003)
Proceedings of the 16th International FLAIRS Conference
, pp. 346-350
-
-
Goel, S.1
Huber, M.2
-
7
-
-
0038178323
-
Solving Factored MDPs using Non-Homogeneous Partitions
-
K. Kim and T. Dean. Solving Factored MDPs using Non-Homogeneous Partitions. Artificial Intelligence, 147:225-251, 2003.
-
(2003)
Artificial Intelligence
, vol.147
, pp. 225-251
-
-
Kim, K.1
Dean, T.2
-
8
-
-
84880718755
-
Concurrent hierarchical reinforcement learning
-
Bhaskara Marthi, Stuart Russell, David Latham, and Carlos Guestrin. Concurrent hierarchical reinforcement learning. In International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, 2005.
-
International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, 2005
-
-
Marthi, B.1
Russell, S.2
Latham, D.3
Guestrin, C.4
-
9
-
-
56049119180
-
Transfer leraning with an ensemble of background tasks
-
Zvika Marx, Michael T. Rosenstein, and Leslie Pack Kaelbling. Transfer leraning with an ensemble of background tasks. In NIPS 2005 Workshop on Transfer Learning, Whistler, Canada, 2005.
-
NIPS 2005 Workshop on Transfer Learning, Whistler, Canada, 2005
-
-
Marx, Z.1
Rosenstein, M.T.2
Kaelbling, L.P.3
-
10
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
DOI 10.1016/S0004-3702(99)00052-1
-
R.S. Sutton, D. Precup, and S. Singh. Between MDPs and Semi-MDPs: Learning, Planning, and Representing Knowledge at Multiple Temporal Scales. Artificial Intelligence, 112:181-211, 1999. (Pubitemid 32079890)
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
-
11
-
-
27544473171
-
Behavior transfer for value-function-based reinforcement learning
-
Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, New York, NY, July ACM Press
-
Matthew E. Taylor and Peter Stone. Behavior transfer for value-function-based reinforcement learning. In Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, New York, NY, July 2005. ACM Press.
-
(2005)
The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems
, pp. 53-59
-
-
Taylor, M.E.1
Stone, P.2
|