-
2
-
-
0033170372
-
Sutton R.S., Precup D., and Singh S. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112 (1-2) (1999) 181-211
-
3
-
-
14344251007
-
-
M.R. James, S. Singh, Learning and discovery of predictive state representations in dynamical systems with reset, in: ICML 2004, Dpt of Computer Science and Engineering, University of Michigan, Ann Arbor, 2004, pp. 417-424
-
-
-
-
4
-
-
1942516880
-
-
R. Munos, Error bounds for approximate policy iteration, in: International Conference on Machine Learning ICML 2003, Centre de Mathématiques Appliquées, Ecole Polytechnique, Palaiseau, France, 2003, pp. 560-567
-
-
-
-
5
-
-
41249085418
-
-
A. McCallum, Reinforcement learning with selective perception and hidden state, Ph.D. Thesis, 1996
-
-
-
-
6
-
-
84880771557
-
-
B. Ravindran, A.G. Barto, SMDP homomorphisms: An algebraic approach to abstraction in semi-Markov decision processes, in: IJCAI 2003, AAAI Press edition, Dpt of Computer Science, University of Massachusetts, Amherst, 2003, pp. 1011-1016
-
-
-
-
7
-
-
0038517214
-
Givan R., Dean T., and Greig M. Equivalence notions and model minimization in Markov decision processes. Artificial Intelligence 147 (1-2) (2003) 163-223
-
8
-
-
0002278788
-
Dietterich T.G. State abstraction in MAXQ hierarchical reinforcement learning. Artificial Intelligence Research 13 (2000) 227-303
-
10
-
-
0344666744
-
-
M. Ricordeau, Q-concept-learning: Generalization with concept lattice representation in reinforcement learning, in: I.C. Society (Ed.), International Conference on Tools with Artificial Intelligence, ICTAI 03, Lirmm, Montpellier, 2003, pp. 316-323
-
-
-
-
14
-
-
84861810840
-
-
M. Liquière, J. Sallantin, Structural machine learning with Galois lattice and graphs, in: M.K. Ed (Ed.), ICML 1998, Lirmm, Montpellier, Morgan Kaufmann Ed, 1998, pp. 305-313
-
-
-
-
15
-
-
41249095897
-
-
R. Munos, Finite-element methods with local triangulation refinement for continuous reinforcement learning problems, 1997
-
-
-
-
16
-
-
41249095783
-
-
A. McCallum, Efficiently inducing features of conditional random fields, in: Conference on Uncertainty in Artificial Intelligence, UAI, 2003, Dpt of Computer Science, University of Massachusetts, Amherst, 2003
-
-
-