-
1
-
-
0016556021
-
A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
-
J. S. Albus. A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Journal of Dynamic Systems, Measurement and Control, pages 220-227, 1975.
-
(1975)
Journal of Dynamic Systems, Measurement and Control
, pp. 220-227
-
-
Albus, J.S.1
-
2
-
-
0034248853
-
Stochastic dynamic programming with factored representations
-
C. Boutilier, R. Dearden, and M. Goldszmidt. Stochastic dynamic programming with factored representations. Artificial Intelligence, 121(1-2):49-107, 2000.
-
(2000)
Artificial Intelligence
, vol.121
, Issue.1-2
, pp. 49-107
-
-
Boutilier, C.1
Dearden, R.2
Goldszmidt, M.3
-
3
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
G. Tesauro, D. S. Touretzky, and T. Leen, editors
-
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In G. Tesauro, D. S. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 369-376, 1995.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 369-376
-
-
Boyan, J.A.1
Moore, A.W.2
-
4
-
-
32144437265
-
Manifold representations for value-function approximation
-
San Jose, California, USA
-
R. Glaubius and W. D. Smart. Manifold representations for value-function approximation. In Working Notes of the Workshop on Markov Decision Processes, AAAI 2004, San Jose, California, USA, 2004.
-
(2004)
Working Notes of the Workshop on Markov Decision Processes, AAAI 2004
-
-
Glaubius, R.1
Smart, W.D.2
-
8
-
-
12744263996
-
Hierarchical optimal control of MDPs
-
A. McGovern, D. Precup, B. Ravindran, S. Singh, and R. S. Sutton. Hierarchical optimal control of MDPs. In Proceedings of the Tenth Yale Workshop on Adaptive and Learning Systems, pages 186-191, 1998.
-
(1998)
Proceedings of the Tenth Yale Workshop on Adaptive and Learning Systems
, pp. 186-191
-
-
McGovern, A.1
Precup, D.2
Ravindran, B.3
Singh, S.4
Sutton, R.S.5
-
9
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less real time
-
A. Moore and C. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13:103-130, 1993.
-
(1993)
Machine Learning
, vol.13
, pp. 103-130
-
-
Moore, A.1
Atkeson, C.2
-
13
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors
-
R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems, volume 8, pages 1038-1044, 1996.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1044
-
-
Sutton, R.S.1
-
15
-
-
0001046225
-
Practical issues in temporal difference learning
-
G. J. Tesauro. Practical issues in temporal difference learning. Machine Learning, 8(3/4):257-277, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 257-277
-
-
Tesauro, G.J.1
|