-
1
-
-
0028572333
-
Using abstractions for decision theoretic planning with time constraints
-
AAAI
-
C. Boutilier and R. Dearden. Using abstractions for decision theoretic planning with time constraints. In Proceedings of the AAAI-94, pages 1016-1022. AAAI, 1994.
-
(1994)
Proceedings of the AAAI-94
, pp. 1016-1022
-
-
Boutilier, C.1
Dearden, R.2
-
4
-
-
0031370386
-
Model minimization in markov decision processes
-
AAAI
-
Thomas Dean and Robert Givan. Model minimization in markov decision processes. In Proceedings of AAAI-97, pages 106-111. AAAI, 1997.
-
(1997)
Proceedings of AAAI-97
, pp. 106-111
-
-
Dean, T.1
Givan, R.2
-
7
-
-
0034272032
-
Bounded-parameter markov decision processes
-
Robert Givan, Sonia Leach, and Thomas Dean. Bounded-parameter markov decision processes. Artificial Intelligence, 122:71-109, 2000.
-
(2000)
Artificial Intelligence
, vol.122
, pp. 71-109
-
-
Givan, R.1
Leach, S.2
Dean, T.3
-
8
-
-
0141763163
-
Symmetry groups and translation invariant representations of markov processes
-
J. Glover. Symmetry groups and translation invariant representations of markov processes. The Annals of Probability, 19(2):562-586, 1991.
-
(1991)
The Annals of Probability
, vol.19
, Issue.2
, pp. 562-586
-
-
Glover, J.1
-
10
-
-
0000148778
-
Iba. A heuristic approach to the discovery of macro-operators
-
Glenn A. Iba. A heuristic approach to the discovery of macro-operators. Machine Learning, 3:285-317, 1989.
-
(1989)
Machine Learning
, vol.3
, pp. 285-317
-
-
Glenn, A.1
-
11
-
-
0014604028
-
A note on the iterative decomposition of finite automata
-
J. R. Jump. A note on the iterative decomposition of finite automata. Information and Control, 15:424-435, 1969.
-
(1969)
Information and Control
, vol.15
, pp. 424-435
-
-
Jump, J.R.1
-
12
-
-
0026222347
-
Bisimulation through probabilistic testing
-
K. G. Larsen and A. Skou. Bisimulation through probabilistic testing. Information and Computation, 94(1):1-28, 1991.
-
(1991)
Information and Computation
, vol.94
, Issue.1
, pp. 1-28
-
-
Larsen, K.G.1
Skou, A.2
-
16
-
-
33745919581
-
Reinforcement Learning
-
MIT Press, Cambridge, MA
-
Richard S. Sutton and Andrew G. Barto. Reinforcement Learning. An Introduction. MIT Press, Cambridge, MA, 1998.
-
(1998)
An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
18
-
-
0004049893
-
-
PhD thesis, Cambridge University, Cambridge, England
-
C. J. C. H. Watkins. Learning from delayed rewards. PhD thesis, Cambridge University, Cambridge, England, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.1
-
19
-
-
33645390656
-
Symmetry in markov decision processes and its implications for single agent and multi agent learning
-
San Francisco, CA, Morgan Kaufmann
-
M. Zinkevich and T. Balch. Symmetry in markov decision processes and its implications for single agent and multi agent learning. In Proceedings of the 18th International Conference on Machine Learning, pages 632-640, San Francisco, CA, 2001. Morgan Kaufmann.
-
(2001)
Proceedings of the 18Th International Conference on Machine Learning
, pp. 632-640
-
-
Zinkevich, M.1
Balch, T.2
|