메뉴 건너뛰기




Volumn 20, Issue 1, 2005, Pages 63-90

Evolutionary game theory and multi-agent reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

EVOLUTIONARY ALGORITHMS; LEARNING SYSTEMS; MULTI AGENT SYSTEMS;

EID: 28544446213     PISSN: 02698889     EISSN: 14698005     Source Type: Journal    
DOI: 10.1017/S026988890500041X     Document Type: Article
Times cited : (101)

References (61)
  • 2
    • 0020970738 scopus 로고
    • Neuron-like adaptive elements that can solve difficult learning control problems
    • Barto, A, Sutton, R and Anderson, C, 1983, Neuron-like adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man and Cybernetics 13(5), 834-846.
    • (1983) IEEE Transactions on Systems, Man and Cybernetics , vol.13 , Issue.5 , pp. 834-846
    • Barto, A.1    Sutton, R.2    Anderson, C.3
  • 8
    • 0031281590 scopus 로고    scopus 로고
    • Learning through reinforcement and replicator dynamics
    • November
    • Borgers, T and Sarin, R, 1997, Learning through reinforcement and replicator dynamics. Journal of Economic Theory 77(1), November.
    • (1997) Journal of Economic Theory , vol.77 , Issue.1
    • Borgers, T.1    Sarin, R.2
  • 12
    • 0003860985 scopus 로고    scopus 로고
    • Princeton, NJ: Princeton University Press
    • Gintis, CM, 2000, Game Theory Evolving. Princeton, NJ: Princeton University Press.
    • (2000) Game Theory Evolving
    • Gintis, C.M.1
  • 27
    • 34548719708 scopus 로고
    • The logic of animal conflict
    • Maynard Smith, J and Price, GR, 1973, The logic of animal conflict. Nature 146, 15-18.
    • (1973) Nature , vol.146 , pp. 15-18
    • Maynard Smith, J.1    Price, G.R.2
  • 43
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Boston, MA: Kluwer Academic
    • Sutton, RS, 1988, Learning to predict by the methods of temporal differences. Machine Learning, vol. 3. Boston, MA: Kluwer Academic, pp. 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 47
    • 0000502181 scopus 로고
    • On the behavior of finite automata in random media
    • Tsetlin, ML, 1962, On the behavior of finite automata in random media. Automation and Remote Control 22, 1210-1219.
    • (1962) Automation and Remote Control , vol.22 , pp. 1210-1219
    • Tsetlin, M.L.1
  • 49
    • 27144547178 scopus 로고
    • Asynchronous stochastic approximation and Q-learning
    • Laboratory for Information and Decision Systems and the Operation Research Center, MIT, Cambridge, MA
    • Tsitsiklis, JN, 1993, Asynchronous stochastic approximation and Q-learning. Internal Report, Laboratory for Information and Decision Systems and the Operation Research Center, MIT, Cambridge, MA.
    • (1993) Internal Report
    • Tsitsiklis, J.N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.