-
1
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
A. Barto, S. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Artificial Intelligence, Special Volume on Computational Research on Interaction and Agency, 72(1):81-138, 1995.
-
(1995)
Artificial Intelligence, Special Volume on Computational Research on Interaction and Agency
, vol.72
, Issue.1
, pp. 81-138
-
-
Barto, A.1
Bradtke, S.2
Singh, S.3
-
4
-
-
85150714688
-
Reinforcement learning methods for continuous-time Markov decision problems
-
S. Bradtke and M. Duff. Reinforcement learning methods for continuous-time Markov decision problems. Advances in Neural Information Processing Systems, 7:393-400, 1995.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 393-400
-
-
Bradtke, S.1
Duff, M.2
-
8
-
-
84990553353
-
A model for reasoning about persistence and causation
-
T. Dean and K. Kanazawa. A model for reasoning about persistence and causation. Computational Intelligence, 5(3): 142-150, 1989.
-
(1989)
Computational Intelligence
, vol.5
, Issue.3
, pp. 142-150
-
-
Dean, T.1
Kanazawa, K.2
-
10
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
T. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000a.
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.1
-
12
-
-
0007907759
-
Emergent hierarchical control structures: Learning reactive/hierarchical relationships in reinforcement environments
-
B. Digney. Emergent hierarchical control structures: Learning reactive/hierarchical relationships in reinforcement environments. From animals to animals, 4:363-372, 1996.
-
(1996)
From Animals to Animals
, vol.4
, pp. 363-372
-
-
Digney, B.1
-
15
-
-
2842560201
-
Strips: A new approach to the application of theorem proving to problem solving
-
R. Fikes and N. Nilsson. Strips: A New Approach to the Application of Theorem Proving to Problem Solving. Artificial Intelligence, 2:189-208, 1971.
-
(1971)
Artificial Intelligence
, vol.2
, pp. 189-208
-
-
Fikes, R.1
Nilsson, N.2
-
18
-
-
0023365727
-
Statecharts: A visual formalism for complex systems
-
D. Harel. Statecharts: A visual formalism for complex systems. Science of Computer Programming, 8:231-274, 1987.
-
(1987)
Science of Computer Programming
, vol.8
, pp. 231-274
-
-
Harel, D.1
-
19
-
-
0006419533
-
Hierarchical solution of Markov decision processes using macro-actions
-
M. Hauskrecht, N. Meuleau, L. Kaelbling, T. Dean, and C. Boutilier. Hierarchical Solution of Markov Decision Processes using Macro-actions. Uncertainty in Artificial Intelligence, 14:220-229, 1998.
-
(1998)
Uncertainty in Artificial Intelligence
, vol.14
, pp. 220-229
-
-
Hauskrecht, M.1
Meuleau, N.2
Kaelbling, L.3
Dean, T.4
Boutilier, C.5
-
22
-
-
0002956570
-
Spudd: Stochastic planning using decision diagrams
-
J. Hoey, R. St-Aubin, A. Hu, and C. Boutilier. Spudd: Stochastic Planning using Decision Diagrams. Proceedings of Uncertainty in Artificial Intelligence, 15:279-288, 1999.
-
(1999)
Proceedings of Uncertainty in Artificial Intelligence
, vol.15
, pp. 279-288
-
-
Hoey, J.1
St-Aubin, R.2
Hu, A.3
Boutilier, C.4
-
26
-
-
14344250635
-
Dynamic abstraction in reinforcement learning via clustering
-
S. Mannor, I. Menache, A. Hoze, and U. Klein. Dynamic abstraction in reinforcement learning via clustering. Proceedings of the International Conference on Machine Learning, 21:560-567, 2004.
-
(2004)
Proceedings of the International Conference on Machine Learning
, vol.21
, pp. 560-567
-
-
Mannor, S.1
Menache, I.2
Hoze, A.3
Klein, U.4
-
34
-
-
32844454706
-
-
Ph.D. Thesis, Department of Computer Science, University of Massachusetts, Amherst, USA
-
B. Ravindran. An Algebraic Approach to Abstraction in Reinforcement Learning. Ph.D. Thesis, Department of Computer Science, University of Massachusetts, Amherst, USA, 2004.
-
(2004)
An Algebraic Approach to Abstraction in Reinforcement Learning
-
-
Ravindran, B.1
-
38
-
-
0033170372
-
Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
R. Sutton, D. Precup, and S. Singh. Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.1
Precup, D.2
Singh, S.3
|