-
1
-
-
0003989214
-
Hierarchical control and learning for markov decision processes,
-
Ph.D. dissertation, University of California, Berkeley, CA
-
R. Parr, "Hierarchical control and learning for markov decision processes," Ph.D. dissertation, University of California, Berkeley, CA, 1998.
-
(1998)
-
-
Parr, R.1
-
2
-
-
84942867726
-
An overview of maxq hierarchical reinforcement learning
-
T. G. Dietterich, "An overview of maxq hierarchical reinforcement learning," Lecture Notes in Computer Science, vol. 1864, 2000.
-
(2000)
Lecture Notes in Computer Science
, vol.1864
-
-
Dietterich, T.G.1
-
3
-
-
0007907759
-
Emergent hierarchical control structures: Learning reactive / hierarchical relationships in reinforcement environments
-
B. Digney, "Emergent hierarchical control structures: Learning reactive / hierarchical relationships in reinforcement environments," in Proceedings of the Fourth Conference on the Simulation of Adaptive Behavior, 1996.
-
(1996)
Proceedings of the Fourth Conference on the Simulation of Adaptive Behavior
-
-
Digney, B.1
-
6
-
-
0034272032
-
Bounded-parameter markov decision processes
-
R. Givan, S. Leach, and T. Dean, "Bounded-parameter markov decision processes," Artificial Intelligence, vol. 122, no. 1-2, pp. 71-109, 2000.
-
(2000)
Artificial Intelligence
, vol.122
, Issue.1-2
, pp. 71-109
-
-
Givan, R.1
Leach, S.2
Dean, T.3
-
7
-
-
0346942368
-
Decision-theoretic planning: Structural assumptions and computational leverage
-
C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," Journal of Artificial Intelligence Research, vol. 11, pp. 1-94, 1999.
-
(1999)
Journal of Artificial Intelligence Research
, vol.11
, pp. 1-94
-
-
Boutilier, C.1
Dean, T.2
Hanks, S.3
-
8
-
-
0033170372
-
Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales
-
R. Sutton, D. Precup, and S. Singh, "Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales," Artificial Intelligence, vol. 112, pp. 181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.1
Precup, D.2
Singh, S.3
-
9
-
-
0038178323
-
Solving factored MDPs using non-homogeneous partitions
-
K. Kim and T. Dean, "Solving factored MDPs using non-homogeneous partitions," Artificial Intelligence, vol. 147, pp. 225-251, 2003.
-
(2003)
Artificial Intelligence
, vol.147
, pp. 225-251
-
-
Kim, K.1
Dean, T.2
-
10
-
-
0038517214
-
Equivalence notions and model minimization in markov decision processes
-
T. Dean, R. Givan, and M. Greig, "Equivalence notions and model minimization in markov decision processes," in Special issue on planning with uncertainty and incomplete information, 2003, pp. 163-223.
-
(2003)
Special issue on planning with uncertainty and incomplete information
, pp. 163-223
-
-
Dean, T.1
Givan, R.2
Greig, M.3
|