메뉴 건너뛰기




Volumn 2, Issue , 2006, Pages 1321-1328

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

Author keywords

Empirical study; Genetic algorithms; Machine learning; Performance analysis

Indexed keywords

GENETIC ALGORITHMS; NEURAL NETWORKS; PUBLIC POLICY;

EID: 33750259111     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1143997.1144202     Document Type: Conference Paper
Times cited : (82)

References (28)
  • 2
    • 85150714688 scopus 로고
    • Reinforcement learning methods for continuous-time Markov decision problems
    • G. Tesauro, D. Touretzky, and T. Leen, editors, San Mateo, CA, Morgan Kaufmann
    • S. J. Bradtke and M. O. Duff. Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, and T. Leen, editors, Advances in Neural Information Processing Systems 7, pages 393-400, San Mateo, CA, 1995. Morgan Kaufmann.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 393-400
    • Bradtke, S.J.1    Duff, M.O.2
  • 4
    • 21244457900 scopus 로고    scopus 로고
    • Robust nonlinear control through neuroevolution
    • Department of Computer Sciences, University of Texas at Austin
    • F. Gomez and R. Miikkulainen. Robust nonlinear control through neuroevolution. Technical Report AI02-292, Department of Computer Sciences, University of Texas at Austin, 2002.
    • (2002) Technical Report , vol.AI02-292
    • Gomez, F.1    Miikkulainen, R.2
  • 6
    • 0012329219 scopus 로고    scopus 로고
    • A comparison between cellular encoding and direct encoding for genetic neural networks
    • J. R. Koza, D. E. Goldberg, D. B. Fogel, and R. L. Riolo, editors, MIT Press
    • F. Gruau, D. Whitley, and L. Pyeatt. A comparison between cellular encoding and direct encoding for genetic neural networks. In J. R. Koza, D. E. Goldberg, D. B. Fogel, and R. L. Riolo, editors, Genetic Programming 1996: Proceedings of the First Annual Conference, pages 81-89. MIT Press, 1996.
    • (1996) Genetic Programming 1996: Proceedings of the First Annual Conference , pp. 81-89
    • Gruau, F.1    Whitley, D.2    Pyeatt, L.3
  • 7
    • 4344663737 scopus 로고    scopus 로고
    • Genetic programming and multi-agent layered learning by reinforcements
    • New York,NY, July Morgan Kaufmann
    • W. H. Hsu and S. M. Gustafson. Genetic programming and multi-agent layered learning by reinforcements. In Genetic and Evolutionary Computation Conference, pages 764-771, New York,NY, July 2002. Morgan Kaufmann.
    • (2002) Genetic and Evolutionary Computation Conference , pp. 764-771
    • Hsu, W.H.1    Gustafson, S.M.2
  • 9
    • 0002318273 scopus 로고    scopus 로고
    • Efficient reinforcement learning through symbiotic evolution
    • D. E. Moriarty and R. Miikkulainen. Efficient reinforcement learning through symbiotic evolution. Machine Learning, 22:11-32, 1996.
    • (1996) Machine Learning , vol.22 , pp. 11-32
    • Moriarty, D.E.1    Miikkulainen, R.2
  • 12
    • 0034153728 scopus 로고    scopus 로고
    • Cooperative coevolution: An architecture for evolving coadapted subcomponents
    • M. A. Potter and K. A. De Jong. Cooperative coevolution: An architecture for evolving coadapted subcomponents. Evolutionary Computation, 8:1-29, 2000.
    • (2000) Evolutionary Computation , vol.8 , pp. 1-29
    • Potter, M.A.1    De Jong, K.A.2
  • 13
    • 0001355838 scopus 로고
    • Radial basis functions for multivariate interpolation: A review
    • J. C. Mason and M. G. Cox, editors, Oxford, Clarendon Press
    • M. J. D. Powell. Radial basis functions for multivariate interpolation: A review. In J. C. Mason and M. G. Cox, editors, Algorithms for Approximation, pages 143-167, Oxford, 1987. Clarendon Press.
    • (1987) Algorithms for Approximation , pp. 143-167
    • Powell, M.J.D.1
  • 15
    • 1542329420 scopus 로고
    • Genetic set recombination and its application to neural network topology optimization
    • N. J. Radcliffe. Genetic set recombination and its application to neural network topology optimization. Neural computing and applications, 1(1):67-90, 1993.
    • (1993) Neural Computing and Applications , vol.1 , Issue.1 , pp. 67-90
    • Radcliffe, N.J.1
  • 16
    • 0003636089 scopus 로고
    • On-line Q-learning using connectionist systems
    • Engineering Department, Cambridge University
    • G. A. Rummery and M. Niranjan. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG-RT 116, Engineering Department, Cambridge University, 1994.
    • (1994) Technical Report CUED/F-INFENG-RT 116
    • Rummery, G.A.1    Niranjan, M.2
  • 17
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replaceing eligibility traces
    • S. P. Singh and R. S. Sutton. Reinforcement learning with replaceing eligibility traces. Machine Learning, 22:123-158, 1996.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 19
    • 0036594106 scopus 로고    scopus 로고
    • Evolving neural networks through augmenting topologies
    • K. O. Stanley and R. Miikkulainen. Evolving neural networks through augmenting topologies. Evolutionary Computation, 10(2):99-127, 2002.
    • (2002) Evolutionary Computation , vol.10 , Issue.2 , pp. 99-127
    • Stanley, K.O.1    Miikkulainen, R.2
  • 22
    • 33746857797 scopus 로고    scopus 로고
    • Keepaway soccer: From machine learning testbed to benchmark
    • I. Noda, A. Jacoff, A. Bredenfeld, and Y. Takahashi, editors, Springer Verlag, Berlin, To appear
    • P. Stone, G. Kuhlmann, M. E. Taylor, and Y. Liu. Keepaway soccer: From machine learning testbed to benchmark. In I. Noda, A. Jacoff, A. Bredenfeld, and Y. Takahashi, editors, RoboCup-2005: Robot Soccer World Cup IX. Springer Verlag, Berlin, 2006. To appear.
    • (2006) RoboCup-2005: Robot Soccer World Cup IX
    • Stone, P.1    Kuhlmann, G.2    Taylor, M.E.3    Liu, Y.4
  • 23
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCup-soccer keepaway
    • P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3): 165-188, 2005.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 24
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • R. S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems 8, pages 1038-1044, 1996.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.S.1
  • 26
    • 21244469857 scopus 로고    scopus 로고
    • Evolving keepaway soccer players through task decomposition
    • S. Whiteson, N. Kohl, R. Miikkulainen, and P. Stone. Evolving keepaway soccer players through task decomposition. Machine Learning, 59(1):5-30, 2005.
    • (2005) Machine Learning , vol.59 , Issue.1 , pp. 5-30
    • Whiteson, S.1    Kohl, N.2    Miikkulainen, R.3    Stone, P.4
  • 27
    • 33646714634 scopus 로고    scopus 로고
    • Evolutionary function approximation for reinforcement learning
    • To appear
    • S. Whiteson and P. Stone. Evolutionary function approximation for reinforcement learning. Journal of Machine Learning Research, 2006. To appear.
    • (2006) Journal of Machine Learning Research
    • Whiteson, S.1    Stone, P.2
  • 28
    • 0033362601 scopus 로고    scopus 로고
    • Evolving artificial neural networks
    • X. Yao. Evolving artificial neural networks. Proceedings of the IEEE, 87(9):1423-1447, 1999.
    • (1999) Proceedings of the IEEE , vol.87 , Issue.9 , pp. 1423-1447
    • Yao, X.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.