-
1
-
-
85151728371
-
Residual algorithms: Reinforcement learning with function approximation
-
Morgan Kaufmann Publishers, July
-
L. C. Baird. Residual algorithms: Reinforcement learning with function approximation. In Machine Learning: Proceeding of the Twelfth International Conference. Morgan Kaufmann Publishers, July 1995.
-
(1995)
Machine Learning: Proceeding of the Twelfth International Conference
-
-
Baird, L.C.1
-
2
-
-
0005977690
-
-
PhD thesis, School of Computer Science. Carnigie Mellon University, Pittsburgh, PA 15213
-
L. C. Baird. Reinforcement Learning Through Gradient Descent. PhD thesis, School of Computer Science. Carnigie Mellon University, Pittsburgh, PA 15213, 1999.
-
(1999)
Reinforcement Learning Through Gradient Descent
-
-
Baird, L.C.1
-
3
-
-
35248841035
-
Limits in long path learning with XCS
-
E. Cantú-Paz, J. A. Foster, K. Deb, D. Davis, R. Roy, U.-M. O'Reilly, H.-G. Beyer, R. Standish, G. Kendall, S. Wilson, M. Harman, J. Wegener, D. Dasgupta, M. A. Potter, A. C. Schultz, K. Dowsland, N. Jonoska, and J. Miller, editors, Genetic and Evolutionary Computation - GECCO-2003. Springer-Verlag
-
A. Barry. Limits in Long Path Learning with XCS. In E. Cantú-Paz, J. A. Foster, K. Deb, D. Davis, R. Roy, U.-M. O'Reilly, H.-G. Beyer, R. Standish, G. Kendall, S. Wilson, M. Harman, J. Wegener, D. Dasgupta, M. A. Potter, A. C. Schultz, K. Dowsland, N. Jonoska, and J. Miller, editors, Genetic and Evolutionary Computation - GECCO-2003, volume 2724 of LNCS, pages 1832-1843. Springer-Verlag, 2003.
-
(2003)
LNCS
, vol.2724
, pp. 1832-1843
-
-
Barry, A.1
-
5
-
-
1542293640
-
The stability of long action chains in XCS
-
A. M. Barry. The Stability of Long Action Chains in XCS. Journal of Soft Computing, 6(3-4):183-199, 2002.
-
(2002)
Journal of Soft Computing
, vol.6
, Issue.3-4
, pp. 183-199
-
-
Barry, A.M.1
-
7
-
-
34249944702
-
Gradient descent methods in learning classifier systems: Improving XCS performance in multistep problems
-
Illinois Genetic Algorithms Laboratory, December
-
M. V. Butz, D. E. Goldberg, and P. L. Lanzi. Gradient Descent Methods in Learning Classifier Systems: Improving XCS Performance in Multistep Problems. Technical Report 2003028, Illinois Genetic Algorithms Laboratory, December 2004.
-
(2004)
Technical Report 2003028
-
-
Butz, M.V.1
Goldberg, D.E.2
Lanzi, P.L.3
-
8
-
-
0042142970
-
How XCS evolves accurate classifiers
-
L. Spector, E. D. Goodman, A. Wu, W. B. Langdon, H.-M. Voigt, M. Gen, S. Sen, M. Dorigo, S. Pezeshk, M. H. Garzon, and E. Burke, editors. Morgan Kaufmann
-
M. V. Butz, T. Kovacs, P. L. Lanzi, and S. W. Wilson. How XCS Evolves Accurate Classifiers. In L. Spector, E. D. Goodman, A. Wu, W. B. Langdon, H.-M. Voigt, M. Gen, S. Sen, M. Dorigo, S. Pezeshk, M. H. Garzon, and E. Burke, editors, GECCO-2001: Proceedings of the Genetic and Evolutionary Computation Conference, pages 927-934. Morgan Kaufmann, 2001.
-
(2001)
GECCO-2001: Proceedings of the Genetic and Evolutionary Computation Conference
, pp. 927-934
-
-
Butz, M.V.1
Kovacs, T.2
Lanzi, P.L.3
Wilson, S.W.4
-
9
-
-
0012816981
-
An algorithmic description of XCS
-
Illinois Genetic Algorithms Laboratory
-
M. V. Butz and S. W. Wilson. An Algorithmic Description of XCS. Technical Report 2000017, Illinois Genetic Algorithms Laboratory, 2000.
-
(2000)
Technical Report 2000017
-
-
Butz, M.V.1
Wilson, S.W.2
-
12
-
-
32444434562
-
Learning and bucket brigade dynamics in classifier systems
-
M. Compiani, D. Montanari, R. Serra, and P. Simonini. Learning and Bucket Brigade Dynamics in Classifier Systems. Special issue of Physica D (Vol. 42), 42:202-212, 1990.
-
(1990)
Special Issue of Physica D (Vol. 42)
, vol.42
, pp. 202-212
-
-
Compiani, M.1
Montanari, D.2
Serra, R.3
Simonini, P.4
-
13
-
-
84949206029
-
An incremental multiplexer problem and its uses in classifier system research
-
P. L. Lanzi, W. Stolzmann, and S. W. Wilson, editors, Advances in Learning Classifier Systems. Springer-Verlag, Berlin
-
L. Davis, C. Fu, and S. W. Wilson. An Incremental Multiplexer Problem and Its Uses in Classifier System Research. In P. L. Lanzi, W. Stolzmann, and S. W. Wilson, editors, Advances in Learning Classifier Systems, volume 2321 of LNAI, pages 23-31. Springer-Verlag, Berlin, 2002.
-
(2002)
LNAI
, vol.2321
, pp. 23-31
-
-
Davis, L.1
Fu, C.2
Wilson, S.W.3
-
15
-
-
0002878183
-
Properties of the bucket brigade
-
J. J. Grefenstette, editor. Lawrence Erlbaum Associates: Pittsburgh, PA, July
-
J. H. Holland. Properties of the Bucket Brigade. In J. J. Grefenstette, editor, Proceedings of the 1st International Conference on Genetic Algorithms and their Applications (ICGA85), pages 1-7. Lawrence Erlbaum Associates: Pittsburgh, PA, July 1985.
-
(1985)
Proceedings of the 1st International Conference on Genetic Algorithms and Their Applications (ICGA85)
, pp. 1-7
-
-
Holland, J.H.1
-
16
-
-
0003707420
-
-
MIT Press, Cambridge
-
J. H. Holland, K. J. Holyoak, R. E. Nisbett, and P, R. Thagard. Induction: Processes of Inference, Learning, and Discovery. MIT Press, Cambridge, 1986.
-
(1986)
Induction: Processes of Inference, Learning, and Discovery
-
-
Holland, J.H.1
Holyoak, K.J.2
Nisbett, R.E.3
Thagard, P.R.4
-
17
-
-
0001666176
-
Cognitive systems based on adaptive algorithms
-
D. A. Waterman and F. Hayes-Roth, editors. New York: Academic Press
-
J. H. Holland and J. S. Reitman. Cognitive systems based on adaptive algorithms. In D. A. Waterman and F. Hayes-Roth, editors, Pattern-directed Inference Systems. New York: Academic Press, 1978.
-
(1978)
Pattern-directed Inference Systems
-
-
Holland, J.H.1
Reitman, J.S.2
-
19
-
-
0004199321
-
-
Master's thesis, School of Computer Science, University of Birmingham, Birmingham, U.K.
-
T. Kovacs. Evolving Optimal Populations with XCS Classifier Systems. Master's thesis, School of Computer Science, University of Birmingham, Birmingham, U.K., 1996.
-
(1996)
Evolving Optimal Populations with XCS Classifier Systems
-
-
Kovacs, T.1
-
21
-
-
4444225146
-
Learning classifier systems from a reinforcement learning perspective
-
Dipartimento di Elettronica e Informazione, Politecnico di Milano
-
P. L. Lanzi. Learning Classifier Systems from a Reinforcement Learning Perspective. Technical Report 00-03, Dipartimento di Elettronica e Informazione, Politecnico di Milano, 2000.
-
(2000)
Technical Report
, Issue.3
-
-
Lanzi, P.L.1
-
23
-
-
0003068303
-
Bucket brigade performance: I. Long sequences of classifiers
-
J. J. Grefenstette, editor, Cambridge, MA, July. Lawrence Erlbaum Associates
-
R. L. Riolo. Bucket Brigade Performance: I. Long Sequences of Classifiers. In J. J. Grefenstette, editor, Proceedings of the 2nd International Conference on Genetic Algorithms (ICGA87), pages 184-195, Cambridge, MA, July 1987. Lawrence Erlbaum Associates.
-
(1987)
Proceedings of the 2nd International Conference on Genetic Algorithms (ICGA87)
, pp. 184-195
-
-
Riolo, R.L.1
-
24
-
-
85153965130
-
Reinforcement learning with soft state aggregation
-
G. Tesauro, D. Touretzky, and T. Leen, editors. The MIT Press
-
S. P. Singh, T. Jaakkola, and M. I. Jordan. Reinforcement Learning with Soft State Aggregation. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 361-368. The MIT Press, 1995.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 361-368
-
-
Singh, S.P.1
Jaakkola, T.2
Jordan, M.I.3
-
25
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
S. P. Singh and R. S. Sutton. Reinforcement Learning with Replacing Eligibility Traces. Machine Learning, 22(1-3):123-158, 1996.
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 123-158
-
-
Singh, S.P.1
Sutton, R.S.2
-
26
-
-
0033738767
-
Classifier systems in combat: Two-sided learning of maneuvers for advanced fighter aircraft
-
R. E. Smith, B. A. Dike, R. K. Mehra, B. Ravichandran, and A. El-Fallah. Classifier Systems in Combat: Two-sided Learning of Maneuvers for Advanced Fighter Aircraft. Computer Methods in Applied Mechanics and Engineering, 186(2-4):421-437, 2000.
-
(2000)
Computer Methods in Applied Mechanics and Engineering
, vol.186
, Issue.2-4
, pp. 421-437
-
-
Smith, R.E.1
Dike, B.A.2
Mehra, R.K.3
Ravichandran, B.4
El-Fallah, A.5
-
27
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
R. Sutton. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. Advances in Neural Information Processing Systems, 8:1038-1044, 1996.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1044
-
-
Sutton, R.1
-
31
-
-
0004049893
-
-
PhD thesis, University of Cambridge, Psychology Department
-
C. J. Watkins. Learning from delayed rewards. PhD thesis, University of Cambridge, Psychology Department, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.1
-
33
-
-
0001387704
-
Classifier fitness based on accuracy
-
S. W. Wilson. Classifier Fitness Based on Accuracy. Evolutionary Computation, 3(2):149-175, 1995.
-
(1995)
Evolutionary Computation
, vol.3
, Issue.2
, pp. 149-175
-
-
Wilson, S.W.1
-
34
-
-
0000648788
-
Generalization in the XCS classifier system
-
J. R. Koza, W. Banzhaf, K. Chellapilla, K. Deb, M. Dorigo, D. B. Fogel, M. H. Garzon, D. E. Goldberg, H. Iba, and R. Riolo, editors. Morgan Kaufmann
-
S. W. Wilson. Generalization in the XCS Classifier System. In J. R. Koza, W. Banzhaf, K. Chellapilla, K. Deb, M. Dorigo, D. B. Fogel, M. H. Garzon, D. E. Goldberg, H. Iba, and R. Riolo, editors, Genetic Programming 1998: Proceedings of the Third Annual Conference, pages 665-674. Morgan Kaufmann, 1998.
-
(1998)
Genetic Programming 1998: Proceedings of the Third Annual Conference
, pp. 665-674
-
-
Wilson, S.W.1
-
35
-
-
32444442041
-
Mining oblique data with XCS
-
University of Illinois at Urbana-Champaign
-
S. W. Wilson. Mining Oblique Data with XCS. Technical Report 2000028, University of Illinois at Urbana-Champaign, 2000.
-
(2000)
Technical Report 2000028
-
-
Wilson, S.W.1
|