-
1
-
-
0001133021
-
Generalization in reinforcement learning: Safely approximating the value function
-
Morgan Kaufmann
-
J. Boyan and A. Moore. Generalization in reinforcement learning: Safely approximating the value function. In Proceedings of Neural Information Processings Systems 7. Morgan Kaufmann, 1995. http://www.cs.cmu.edu/~awm/papers. html.
-
(1995)
Proceedings of Neural Information Processings Systems
, vol.7
-
-
Boyan, J.1
Moore, A.2
-
2
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Cambridge, MA, The MIT Press
-
J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: Safely approximating the value function. In G. Tesauro, D. S. Touretzky, and T. K. Leen, editors, Advances in Neural Information Processing Systems 7, pages 369-376, Cambridge, MA, 1995. The MIT Press.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 369-376
-
-
Boyan, J.A.1
Moore, A.W.2
-
3
-
-
84942854203
-
An algorithmic description of XCS
-
P. L. Lanzi, W. Stolzmann, and S. W. Wilson, editors, volume 1996 of LNAI, Springer-Verlag, Berlin
-
M. V. Butz and S. W. Wilson. An Algorithmic Description of XCS. In P. L. Lanzi, W. Stolzmann, and S. W. Wilson, editors, Advances in Learning Classifier Systems, volume 1996 of LNAI, pages 253-272. Springer-Verlag, Berlin, 2001.
-
(2001)
Advances in Learning Classifier Systems
, pp. 253-272
-
-
Butz, M.V.1
Wilson, S.W.2
-
4
-
-
27144461284
-
Extending xcsf beyond linear approximation
-
Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign
-
P. L. Lanzi, D. Loiacono, S. W Wilson, and D. E. Goldberg. Extending xcsf beyond linear approximation. Technical Report 2005006, Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign, 2005.
-
(2005)
Technical Report 2005006
-
-
Lanzi, P.L.1
Loiacono, D.2
Wilson, S.W.3
Goldberg, D.E.4
-
5
-
-
27144463204
-
Generalization in the xcsf classifier system: Analysis, improvement, and extension
-
Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign
-
P. L. Lanzi, D. Loiacono, S. W. Wilson, and D. E. Goldberg. Generalization in the xcsf classifier system: Analysis, improvement, and extension. Technical Report 2005012, Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign, 2005.
-
(2005)
Technical Report 2005012
-
-
Lanzi, P.L.1
Loiacono, D.2
Wilson, S.W.3
Goldberg, D.E.4
-
6
-
-
27144520367
-
Xcs with computable prediction for the learning of boolean functions
-
Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign
-
P. L. Lanzi, D. Loiacono, S. W. Wilson, and D. E. Goldberg. Xcs with computable prediction for the learning of boolean functions. Technical Report 2005007, Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign, 2005.
-
(2005)
Technical Report 2005007
-
-
Lanzi, P.L.1
Loiacono, D.2
Wilson, S.W.3
Goldberg, D.E.4
-
7
-
-
27144520367
-
Xcs with computable prediction in multistep environments
-
Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign
-
P. L. Lanzi, D. Loiacono, S. W. Wilson, and D. E. Goldberg. Xcs with computable prediction in multistep environments. Technical Report 2005008, Illinois Genetic Algorithms Laboratory - University of Illinois at Urbana-Champaign, 2005.
-
(2005)
Technical Report 2005008
-
-
Lanzi, P.L.1
Loiacono, D.2
Wilson, S.W.3
Goldberg, D.E.4
-
8
-
-
84898960655
-
A convergent form of approximate policy iteration
-
S. T. S. Becker and K. Obermayer, editors, Cambridge, MA, MIT Press
-
T. J. Perkins and D. Precup. A convergent form of approximate policy iteration. In S. T. S. Becker and K. Obermayer, editors, Advances in Neural Information Processing Systems 15, pages 1595-1602, Cambridge, MA, 2003. MIT Press.
-
(2003)
Advances in Neural Information Processing Systems
, vol.15
, pp. 1595-1602
-
-
Perkins, T.J.1
Precup, D.2
-
9
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, The MIT Press, Cambridge, MA
-
R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 1038-1044. The MIT Press, Cambridge, MA., 1996.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1044
-
-
Sutton, R.S.1
-
11
-
-
0000985504
-
TD-gammon, a self-teaching backgammon program, achieves master-level play
-
G. Tesauro. TD-gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.
-
(1994)
Neural Computation
, vol.6
, Issue.2
, pp. 215-219
-
-
Tesauro, G.1
-
12
-
-
0003270924
-
Issues in using function approximation for reinforcement learning
-
M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors, Hillsdale, NJ, Lawrence Erlbaum.
-
S. Thrun and A. Schwartz. Issues in Using Function Approximation for Reinforcement Learning. In M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors, Proceedings of the 1993 Connectionist Models Summer School, Hillsdale, NJ, 1993. Lawrence Erlbaum.
-
(1993)
Proceedings of the 1993 Connectionist Models Summer School
-
-
Thrun, S.1
Schwartz, A.2
-
14
-
-
27144534351
-
-
chapter Neurocomputing: Foundation of Research, The MIT Press, Cambridge
-
B. Widrow and M. E. Hoff. Adaptive Switching Circuits, chapter Neurocomputing: Foundation of Research, pages 126-134. The MIT Press, Cambridge, 1988.
-
(1988)
Adaptive Switching Circuits
, pp. 126-134
-
-
Widrow, B.1
Hoff, M.E.2
-
15
-
-
0001387704
-
Classifier fitness based on accuracy
-
S. W. Wilson. Classifier Fitness Based on Accuracy. Evolutionary Computation, 3(2):149-175, 1995. http://prediction-dynamics.com/.
-
(1995)
Evolutionary Computation
, vol.3
, Issue.2
, pp. 149-175
-
-
Wilson, S.W.1
-
16
-
-
0042142404
-
Function approximation with a classifier system
-
L. S. et al., editor, San Francisco, California, USA, 7-11 July Morgan Kaufmann
-
S. W. Wilson. Function approximation with a classifier system. In L. S. et al., editor, Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), pages 974-981, San Francisco, California, USA, 7-11 July 2001. Morgan Kaufmann.
-
(2001)
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001)
, pp. 974-981
-
-
Wilson, S.W.1
-
17
-
-
84942897286
-
Mining oblique data with XCS
-
Springer-Verlag, Apr.
-
S. W. Wilson. Mining Oblique Data with XCS. volume 1996 of Lecture notes in Computer Science, pages 158-174. Springer-Verlag, Apr. 2001.
-
(2001)
Lecture Notes in Computer Science
, vol.1996
, pp. 158-174
-
-
Wilson, S.W.1
-
18
-
-
27144549349
-
Classifiers that approximate functions
-
S. W. Wilson. Classifiers that approximate functions. Journal of Natural Computating, 1(2-3):211-234, 2002.
-
(2002)
Journal of Natural Computating
, vol.1
, Issue.2-3
, pp. 211-234
-
-
Wilson, S.W.1
-
19
-
-
35048875188
-
Classifier systems for continuous pay-off environments
-
K. D. et al., editor, Genetic and Evolutionary Computation - GECCO-2004, Part II, 103 Seattle, WA, USA, 26-30 June Springer-Verlag
-
S. W. Wilson. Classifier systems for continuous pay-off environments. In K. D. et al., editor, Genetic and Evolutionary Computation - GECCO-2004, Part II, volume 3103 of Lecture Notes in Computer Science, pages 824-835, Seattle, WA, USA, 26-30 June 2004. Springer-Verlag.
-
(2004)
Lecture Notes in Computer Science
, pp. 824-835
-
-
Wilson, S.W.1
|