메뉴 건너뛰기




Volumn 2, Issue , 2007, Pages 1675-1678

Temporal difference and policy search methods for reinforcement learning: An empirical comparison

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARK DOMAINS; LEARNING METHODS; POLICY SEARCH METHODS; REINFORCEMENT LEARNINGS;

EID: 36348987166     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (12)

References (17)
  • 1
    • 21244457900 scopus 로고    scopus 로고
    • Robust nonlinear control through neuroevolution
    • Technical Report AI02-292
    • Gomez, F., and Miikkulainen, R. 2002. Robust nonlinear control through neuroevolution. Technical Report AI02-292.
    • (2002)
    • Gomez, F.1    Miikkulainen, R.2
  • 3
    • 4344663737 scopus 로고    scopus 로고
    • Genetic programming and multi-agent layered learning by reinforcements
    • New York.NY: Morgan Kaufmann
    • Hsu, W. H., and Gustafson, S. M. 2002. Genetic programming and multi-agent layered learning by reinforcements. In Genetic and Evolutionary Computation Conference, 764-771. New York.NY: Morgan Kaufmann.
    • (2002) Genetic and Evolutionary Computation Conference , pp. 764-771
    • Hsu, W.H.1    Gustafson, S.M.2
  • 5
    • 0002318273 scopus 로고    scopus 로고
    • Efficient reinforcement learning through symbiotic evolution
    • Moriarty, D. E., and Miikkulainen, R. 1996. Efficient reinforcement learning through symbiotic evolution. Machine Learning 22:11-32.
    • (1996) Machine Learning , vol.22 , pp. 11-32
    • Moriarty, D.E.1    Miikkulainen, R.2
  • 7
    • 0003636089 scopus 로고
    • On-line Q-learning using connectionist systems
    • Engineering Department, Cambridge University
    • Rummery, G. A., and Niranjan, M. 1994. On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG-RT 116, Engineering Department, Cambridge University.
    • (1994) Technical Report CUED/F-INFENG-RT , vol.116
    • Rummery, G.A.1    Niranjan, M.2
  • 8
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • Singh, S. P., and Sutton, R. S. 1996. Reinforcement learning with replacing eligibility traces. Machine Learning 22:123-158.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 9
    • 0036594106 scopus 로고    scopus 로고
    • Evolving neural networks through augmenting topologies
    • Stanley, K. O., and Miikkulainen, R. 2002. Evolving neural networks through augmenting topologies. Evolutionary Computation 10(2):99-127.
    • (2002) Evolutionary Computation , vol.10 , Issue.2 , pp. 99-127
    • Stanley, K.O.1    Miikkulainen, R.2
  • 11
    • 37249034293 scopus 로고    scopus 로고
    • Keepaway soccer: From machine learning testbed to benchmark
    • Noda, I, Jacoff, A, Bredenfeld, A, and Takahashi, Y, eds, Berlin: Springer Verlag
    • Stone, P.; Kuhlmann, G.; Taylor, M. E.; and Liu, Y. 2006. Keepaway soccer: From machine learning testbed to benchmark. In Noda, I.; Jacoff, A.; Bredenfeld, A.; and Takahashi, Y., eds, RoboCup-2005: Robot Soccer World Cup IX, volume 4020. Berlin: Springer Verlag. 93-105.
    • (2006) RoboCup-2005: Robot Soccer World Cup IX , vol.4020 , pp. 93-105
    • Stone, P.1    Kuhlmann, G.2    Taylor, M.E.3    Liu, Y.4
  • 12
    • 27544506565 scopus 로고    scopus 로고
    • Reinforcement learning for RoboCup-soccer keepaway
    • Stone, P.; Sutton, R. S.; and Kuhlmann, G. 2005. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior 13(3):165-188.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 14
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • Sutton, R. S. 1996. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems 8, 1038-1044.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.S.1
  • 17
    • 21244469857 scopus 로고    scopus 로고
    • Evolving keepaway soccer players through task decomposition
    • Whiteson, S.; Kohl, N.; Miikkulainen, R.; and Stone, P. 2005, Evolving keepaway soccer players through task decomposition. Machine Learning 59(1):5-30.
    • (2005) Machine Learning , vol.59 , Issue.1 , pp. 5-30
    • Whiteson, S.1    Kohl, N.2    Miikkulainen, R.3    Stone, P.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.