-
1
-
-
0004049893
-
Learning from delayed rewards,
-
Ph.D. dissertation, University of Cambridge, England
-
C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, University of Cambridge, England, 1989.
-
(1989)
-
-
Watkins, C.1
-
3
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
April
-
D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, vol. 6, pp. 503-556, April 2005.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
4
-
-
84880694195
-
Stable function approximation in dynamic programming
-
A. Prieditis and S. Russell, Eds. San Francisco, CA: Morgan Kaufmann
-
G. J. Gordon, "Stable function approximation in dynamic programming," in Proceedings of the Twelfth International Conference on Machine Learning, A. Prieditis and S. Russell, Eds. San Francisco, CA: Morgan Kaufmann, 1995, pp. 261-268.
-
(1995)
Proceedings of the Twelfth International Conference on Machine Learning
, pp. 261-268
-
-
Gordon, G.J.1
-
5
-
-
85153965130
-
Reinforcement learning with soft state aggregation
-
G. Tesauro, D. Touretzky, and T. Leen, Eds, The MIT Press
-
S. P. Singh, T. Jaakkola, and M. I. Jordan, "Reinforcement learning with soft state aggregation," in Advances in Neural Information Processing Systems : Proceedings of the 1994 conference, G. Tesauro, D. Touretzky, and T. Leen, Eds., vol. 7. The MIT Press, 1995, pp. 361-368.
-
(1995)
Advances in Neural Information Processing Systems : Proceedings of the 1994 conference
, vol.7
, pp. 361-368
-
-
Singh, S.P.1
Jaakkola, T.2
Jordan, M.I.3
-
6
-
-
0036832956
-
Kernel-based reinforcement learning
-
D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
7
-
-
33646398129
-
Neural fitted q iteration - first experiences with a data efficient neural reinforcement learning method
-
M. Riedmiller, "Neural fitted q iteration - first experiences with a data efficient neural reinforcement learning method," in Proceedings of the Sixteenth European Conference on Machine Learning, Porto, Portugal, 2005, pp. 317-328.
-
(2005)
Proceedings of the Sixteenth European Conference on Machine Learning, Porto, Portugal
, pp. 317-328
-
-
Riedmiller, M.1
-
9
-
-
0036832950
-
Technical update: Least-squares temporal difference learning
-
J. A Boyan, "Technical update: Least-squares temporal difference learning," Machine Learning, vol. 49, no. 2-3, pp. 233-246, 2002.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 233-246
-
-
Boyan, J.A.1
-
10
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds, The MIT Press
-
R. S. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," in Proceedings of the International Conference on Advances in Neural Information Processing Systems, D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds., vol. 8. The MIT Press, 1996, pp. 1038-1044.
-
(1996)
Proceedings of the International Conference on Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1044
-
-
Sutton, R.S.1
-
11
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
S. P. Singh and R. S. Sutton, "Reinforcement learning with replacing eligibility traces," Machine Learning, vol. 22, no. 1-3, pp. 123-158, 1996.
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 123-158
-
-
Singh, S.P.1
Sutton, R.S.2
-
12
-
-
0038595393
-
-
Carnegie Mellon University, Pittsburgh, PA 15213, Tech. Rep. CMU-CS95-103, January
-
G. J. Gordon, "Stable function approximation in dynamic programming," Carnegie Mellon University, Pittsburgh, PA 15213, Tech. Rep. CMU-CS95-103, January 1995.
-
(1995)
Stable function approximation in dynamic programming
-
-
Gordon, G.J.1
-
14
-
-
0344961876
-
Reinforcement learning on explicitly specified time-scales
-
R. Schoknecht and M. Riedmiller, "Reinforcement learning on explicitly specified time-scales," Neural Computing, vol. 12, no. 2, pp. 61-80, 2003.
-
(2003)
Neural Computing
, vol.12
, Issue.2
, pp. 61-80
-
-
Schoknecht, R.1
Riedmiller, M.2
|