1. Barto, A. G., R. S. Sutton, and C. W. Anderson. 1983. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans. Systems, Man, and Cybernetics 13:835-846.
2. Cichosz, P. 1995. Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning. Journal of Artificial Intelligence Research 2:287-318.
3. Cichosz, P. 1997. Reinforcement Learning by Truncating Temporal Differences. Ph.D. thesis, Warsaw University of Technology, Department of Electronics and Information Technology.
7. Lin, Long-Ji. 1993. Reinforcement Learning for Robots Using Neural Networks. Ph.D. thesis, Carnegie-Mellon University, School of Computer Science, Pittsburgh, PA.
8. Mahadevan, S. and J. Connell. 1992. Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence 55:311-365.
11. Singh, S. P. and R. S. Sutton. 1996. Reinforcement learning with replacing eligibility traces. Machine Learning 22:123-158.
12. Sutton, R. S. 1984. Temporal Credit Assignment in Reinforcement Learning. Ph.D. thesis, University of Massachusetts, Department of Computer and Information Science, Amherst, MA.
13. Sutton, R. S. 1988. Learning to predict by the methods of temporal differences. Machine Learning 3:9-44.
14. Sutton, R. S. 1996. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems 8, pp. 1038-1044. Cambridge, MA: MIT Press. Morgan Kaufmann.
15. Sutton, R. S. and S. P. Singh. 1994. On step-size and bias in temporal-difference learning. In Proc. Eighth Yale Workshop on Adaptive and Learning Systems. Center for Systems Science, Yale University, New Haven, CT, pp. 31-36.
16. Tesauro, G. 1992. Practical issues in temporal difference learning. Machine Learning 8:257-277.