메뉴 건너뛰기




Volumn , Issue , 2005, Pages 1851-1858

XCS with eligibility traces

Author keywords

Eligibility Traces; LCS; Q Learning; Temporal Difference Learning; XCS

Indexed keywords

COMPUTATION THEORY; COMPUTATIONAL METHODS; COMPUTER SCIENCE; GENETIC ALGORITHMS; ROBUSTNESS (CONTROL SYSTEMS);

EID: 32444435131     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1068009.1068322     Document Type: Conference Paper
Times cited : (9)

References (35)
  • 1
    • 85151728371 scopus 로고
    • Residual algorithms: Reinforcement learning with function approximation
    • Morgan Kaufmann Publishers, July
    • L. C. Baird. Residual algorithms: Reinforcement learning with function approximation. In Machine Learning: Proceeding of the Twelfth International Conference. Morgan Kaufmann Publishers, July 1995.
    • (1995) Machine Learning: Proceeding of the Twelfth International Conference
    • Baird, L.C.1
  • 2
    • 0005977690 scopus 로고    scopus 로고
    • PhD thesis, School of Computer Science. Carnigie Mellon University, Pittsburgh, PA 15213
    • L. C. Baird. Reinforcement Learning Through Gradient Descent. PhD thesis, School of Computer Science. Carnigie Mellon University, Pittsburgh, PA 15213, 1999.
    • (1999) Reinforcement Learning Through Gradient Descent
    • Baird, L.C.1
  • 3
    • 35248841035 scopus 로고    scopus 로고
    • Limits in long path learning with XCS
    • E. Cantú-Paz, J. A. Foster, K. Deb, D. Davis, R. Roy, U.-M. O'Reilly, H.-G. Beyer, R. Standish, G. Kendall, S. Wilson, M. Harman, J. Wegener, D. Dasgupta, M. A. Potter, A. C. Schultz, K. Dowsland, N. Jonoska, and J. Miller, editors, Genetic and Evolutionary Computation - GECCO-2003. Springer-Verlag
    • A. Barry. Limits in Long Path Learning with XCS. In E. Cantú-Paz, J. A. Foster, K. Deb, D. Davis, R. Roy, U.-M. O'Reilly, H.-G. Beyer, R. Standish, G. Kendall, S. Wilson, M. Harman, J. Wegener, D. Dasgupta, M. A. Potter, A. C. Schultz, K. Dowsland, N. Jonoska, and J. Miller, editors, Genetic and Evolutionary Computation - GECCO-2003, volume 2724 of LNCS, pages 1832-1843. Springer-Verlag, 2003.
    • (2003) LNCS , vol.2724 , pp. 1832-1843
    • Barry, A.1
  • 5
    • 1542293640 scopus 로고    scopus 로고
    • The stability of long action chains in XCS
    • A. M. Barry. The Stability of Long Action Chains in XCS. Journal of Soft Computing, 6(3-4):183-199, 2002.
    • (2002) Journal of Soft Computing , vol.6 , Issue.3-4 , pp. 183-199
    • Barry, A.M.1
  • 7
    • 34249944702 scopus 로고    scopus 로고
    • Gradient descent methods in learning classifier systems: Improving XCS performance in multistep problems
    • Illinois Genetic Algorithms Laboratory, December
    • M. V. Butz, D. E. Goldberg, and P. L. Lanzi. Gradient Descent Methods in Learning Classifier Systems: Improving XCS Performance in Multistep Problems. Technical Report 2003028, Illinois Genetic Algorithms Laboratory, December 2004.
    • (2004) Technical Report 2003028
    • Butz, M.V.1    Goldberg, D.E.2    Lanzi, P.L.3
  • 8
    • 0042142970 scopus 로고    scopus 로고
    • How XCS evolves accurate classifiers
    • L. Spector, E. D. Goodman, A. Wu, W. B. Langdon, H.-M. Voigt, M. Gen, S. Sen, M. Dorigo, S. Pezeshk, M. H. Garzon, and E. Burke, editors. Morgan Kaufmann
    • M. V. Butz, T. Kovacs, P. L. Lanzi, and S. W. Wilson. How XCS Evolves Accurate Classifiers. In L. Spector, E. D. Goodman, A. Wu, W. B. Langdon, H.-M. Voigt, M. Gen, S. Sen, M. Dorigo, S. Pezeshk, M. H. Garzon, and E. Burke, editors, GECCO-2001: Proceedings of the Genetic and Evolutionary Computation Conference, pages 927-934. Morgan Kaufmann, 2001.
    • (2001) GECCO-2001: Proceedings of the Genetic and Evolutionary Computation Conference , pp. 927-934
    • Butz, M.V.1    Kovacs, T.2    Lanzi, P.L.3    Wilson, S.W.4
  • 9
    • 0012816981 scopus 로고    scopus 로고
    • An algorithmic description of XCS
    • Illinois Genetic Algorithms Laboratory
    • M. V. Butz and S. W. Wilson. An Algorithmic Description of XCS. Technical Report 2000017, Illinois Genetic Algorithms Laboratory, 2000.
    • (2000) Technical Report 2000017
    • Butz, M.V.1    Wilson, S.W.2
  • 13
    • 84949206029 scopus 로고    scopus 로고
    • An incremental multiplexer problem and its uses in classifier system research
    • P. L. Lanzi, W. Stolzmann, and S. W. Wilson, editors, Advances in Learning Classifier Systems. Springer-Verlag, Berlin
    • L. Davis, C. Fu, and S. W. Wilson. An Incremental Multiplexer Problem and Its Uses in Classifier System Research. In P. L. Lanzi, W. Stolzmann, and S. W. Wilson, editors, Advances in Learning Classifier Systems, volume 2321 of LNAI, pages 23-31. Springer-Verlag, Berlin, 2002.
    • (2002) LNAI , vol.2321 , pp. 23-31
    • Davis, L.1    Fu, C.2    Wilson, S.W.3
  • 17
    • 0001666176 scopus 로고
    • Cognitive systems based on adaptive algorithms
    • D. A. Waterman and F. Hayes-Roth, editors. New York: Academic Press
    • J. H. Holland and J. S. Reitman. Cognitive systems based on adaptive algorithms. In D. A. Waterman and F. Hayes-Roth, editors, Pattern-directed Inference Systems. New York: Academic Press, 1978.
    • (1978) Pattern-directed Inference Systems
    • Holland, J.H.1    Reitman, J.S.2
  • 19
    • 0004199321 scopus 로고    scopus 로고
    • Master's thesis, School of Computer Science, University of Birmingham, Birmingham, U.K.
    • T. Kovacs. Evolving Optimal Populations with XCS Classifier Systems. Master's thesis, School of Computer Science, University of Birmingham, Birmingham, U.K., 1996.
    • (1996) Evolving Optimal Populations with XCS Classifier Systems
    • Kovacs, T.1
  • 21
    • 4444225146 scopus 로고    scopus 로고
    • Learning classifier systems from a reinforcement learning perspective
    • Dipartimento di Elettronica e Informazione, Politecnico di Milano
    • P. L. Lanzi. Learning Classifier Systems from a Reinforcement Learning Perspective. Technical Report 00-03, Dipartimento di Elettronica e Informazione, Politecnico di Milano, 2000.
    • (2000) Technical Report , Issue.3
    • Lanzi, P.L.1
  • 23
    • 0003068303 scopus 로고
    • Bucket brigade performance: I. Long sequences of classifiers
    • J. J. Grefenstette, editor, Cambridge, MA, July. Lawrence Erlbaum Associates
    • R. L. Riolo. Bucket Brigade Performance: I. Long Sequences of Classifiers. In J. J. Grefenstette, editor, Proceedings of the 2nd International Conference on Genetic Algorithms (ICGA87), pages 184-195, Cambridge, MA, July 1987. Lawrence Erlbaum Associates.
    • (1987) Proceedings of the 2nd International Conference on Genetic Algorithms (ICGA87) , pp. 184-195
    • Riolo, R.L.1
  • 24
    • 85153965130 scopus 로고
    • Reinforcement learning with soft state aggregation
    • G. Tesauro, D. Touretzky, and T. Leen, editors. The MIT Press
    • S. P. Singh, T. Jaakkola, and M. I. Jordan. Reinforcement Learning with Soft State Aggregation. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems, volume 7, pages 361-368. The MIT Press, 1995.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 361-368
    • Singh, S.P.1    Jaakkola, T.2    Jordan, M.I.3
  • 25
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • S. P. Singh and R. S. Sutton. Reinforcement Learning with Replacing Eligibility Traces. Machine Learning, 22(1-3):123-158, 1996.
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 27
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • R. Sutton. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. Advances in Neural Information Processing Systems, 8:1038-1044, 1996.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.1
  • 31
    • 0004049893 scopus 로고
    • PhD thesis, University of Cambridge, Psychology Department
    • C. J. Watkins. Learning from delayed rewards. PhD thesis, University of Cambridge, Psychology Department, 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.J.1
  • 33
    • 0001387704 scopus 로고
    • Classifier fitness based on accuracy
    • S. W. Wilson. Classifier Fitness Based on Accuracy. Evolutionary Computation, 3(2):149-175, 1995.
    • (1995) Evolutionary Computation , vol.3 , Issue.2 , pp. 149-175
    • Wilson, S.W.1
  • 34
    • 0000648788 scopus 로고    scopus 로고
    • Generalization in the XCS classifier system
    • J. R. Koza, W. Banzhaf, K. Chellapilla, K. Deb, M. Dorigo, D. B. Fogel, M. H. Garzon, D. E. Goldberg, H. Iba, and R. Riolo, editors. Morgan Kaufmann
    • S. W. Wilson. Generalization in the XCS Classifier System. In J. R. Koza, W. Banzhaf, K. Chellapilla, K. Deb, M. Dorigo, D. B. Fogel, M. H. Garzon, D. E. Goldberg, H. Iba, and R. Riolo, editors, Genetic Programming 1998: Proceedings of the Third Annual Conference, pages 665-674. Morgan Kaufmann, 1998.
    • (1998) Genetic Programming 1998: Proceedings of the Third Annual Conference , pp. 665-674
    • Wilson, S.W.1
  • 35
    • 32444442041 scopus 로고    scopus 로고
    • Mining oblique data with XCS
    • University of Illinois at Urbana-Champaign
    • S. W. Wilson. Mining Oblique Data with XCS. Technical Report 2000028, University of Illinois at Urbana-Champaign, 2000.
    • (2000) Technical Report 2000028
    • Wilson, S.W.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.