-
1
-
-
0016556021
-
A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
-
Albus, J.S. (1975). A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Dynamic Systems, Measurement and Control, 97, 220-227.
-
(1975)
Dynamic Systems, Measurement and Control
, vol.97
, pp. 220-227
-
-
Albus, J.S.1
-
2
-
-
0031074521
-
Locally weighted learning
-
Atkeson, C.G., Schaal, S., & Moore, A.W. (1997). Locally weighted learning. Artificial Intelligence Review, 11, 11-73.
-
(1997)
Artificial Intelligence Review
, vol.11
, pp. 11-73
-
-
Atkeson, C.G.1
Schaal, S.2
Moore, A.W.3
-
3
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Barto, A.G., Sutton, R.S., & AndersOn, C.W. (1983). Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, SMC-13, 834-846.
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.SMC-13
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
5
-
-
0010878888
-
-
(Technical Report IRIDIA-94-14). Université Libre de Bruxelles
-
Caironi, P.V.C., & Dorigo, M. (1994). Training Q-agents (Technical Report IRIDIA-94-14). Université Libre de Bruxelles.
-
(1994)
Training Q-agents
-
-
Caironi, P.V.C.1
Dorigo, M.2
-
6
-
-
0007512578
-
Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning
-
Cichosz, P. (1995). Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning. Journal of Artificial Intelligence Research, 2, 287-318.
-
(1995)
Journal of Artificial Intelligence Research
, vol.2
, pp. 287-318
-
-
Cichosz, P.1
-
7
-
-
0347763086
-
Supervised learning with growing cell structures
-
J. Cowan, G. Tesauro, & J. Alspector (Eds.), San Mateo, CA: Morgan Kaufmann
-
Fritzke, B. (1994). Supervised learning with growing cell structures. In J. Cowan, G. Tesauro, & J. Alspector (Eds.), Advances in neural information processing systems (Vol.6, pp. 255-262). San Mateo, CA: Morgan Kaufmann.
-
(1994)
Advances in Neural Information Processing Systems
, vol.6
, pp. 255-262
-
-
Fritzke, B.1
-
8
-
-
0029751419
-
The effect of representation and knowledge on goal-directed exploration with reinforcement learning algorithms
-
Koenig, S., & Simmons, R.G. (1996). The effect of representation and knowledge on goal-directed exploration with reinforcement learning algorithms. Machine Learning, 22, 228-250.
-
(1996)
Machine Learning
, vol.22
, pp. 228-250
-
-
Koenig, S.1
Simmons, R.G.2
-
11
-
-
0000955979
-
Incremental multi-step Q-learning
-
Peng, J., & Williams, R. (1996). Incremental multi-step Q-learning. Machine Learning, 22, 283-290.
-
(1996)
Machine Learning
, vol.22
, pp. 283-290
-
-
Peng, J.1
Williams, R.2
-
13
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
Singh, S., & Sutton, R. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22, 123-158.
-
(1996)
Machine Learning
, vol.22
, pp. 123-158
-
-
Singh, S.1
Sutton, R.2
-
14
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R.S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
15
-
-
0000723997
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Cambridge, MA: MIT Press
-
Sutton, R.S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Advances in neural information processing systems, (Vol. 8, pp. 1033-1045). Cambridge, MA: MIT Press.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1033-1045
-
-
Sutton, R.S.1
-
16
-
-
2542485629
-
Practical issues in temporal difference learning
-
D.S., Lippman, J.E. Moody, & D.S Touretzky (Eds.), San Mateo, CA: Morgan Kaufmann
-
Tesauro, G. (1992). Practical issues in temporal difference learning. In D.S., Lippman, J.E. Moody, & D.S Touretzky (Eds.), Advances in neural information processing systems (Vol. 4, pp. 259-266). San Mateo, CA: Morgan Kaufmann.
-
(1992)
Advances in Neural Information Processing Systems
, vol.4
, pp. 259-266
-
-
Tesauro, G.1
|