Volume 12, Issue 1, 2006, Pages 115-153

An evolutionary dynamical analysis of multi-agent learning in iterated games

Author keywords

COllective INtelligence; Evolutionary Game Theory; Iterated games; Multi agent systems; Reinforcement learning

EID: 31344450384     PISSN: 13872532     EISSN: 15737454     Source Type: Journal    
DOI: 10.1007/s10458-005-3783-9     Document Type: Review
Times cited : (135)

References (56)
  • 2
    • 0141988716
    • Recent advances in hierarchical reinforcement learning
    • A. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Discrete-Event Syst. J. vol. 13, pp. 41-77, 2003.
    • (2003) Discrete-event Syst. J. , vol.13 , pp. 41-77
    • Barto, A.1    Mahadevan, S.2
  • 5
    • 0031281590
    • Reinforcement and replicator dynamics
    • T. Börgers and R. Sarin, "Reinforcement and replicator dynamics," J. Econ. Theory, vol. 77, no.1, pp. 1-14, 1997.
    • (1997) J. Econ. Theory , vol.77 , Issue.1 , pp. 1-14
    • Börgers, T.1    Sarin, R.2
  • 7
    • 0001491619
    • A mathematical model for simple learning
    • R. R. Bush and F. Mosteller, "A Mathematical Model for Simple Learning," The Psychol. Rev. vol. 58, pp. 15-18, 1951.
    • (1951) The Psychol. Rev. , vol.58 , pp. 15-18
    • Bush, R.R.1    Mosteller, F.2
  • 10
    • 0000742255
    • A stochastic learning model of economic behaviour
    • J. G. Cross, "A stochastic learning model of economic behaviour," Quart. J. Econ., vol. 87, no.5, pp. 239-266, 1973.
    • (1973) Quart. J. Econ. , vol.87 , Issue.5 , pp. 239-266
    • Cross, J.G.1
  • 21
    • 0012286079
    • An algorithm for distributed reinforcement learning in cooperative multi-agent systems
    • Morgan Kaufmann: San Francisco, CA
    • M. Lauer and M. Riedmiller, "An algorithm for distributed reinforcement learning in cooperative multi-agent systems," in Proc. 17th International Conf. on Machine Learning, Morgan Kaufmann: San Francisco, CA, pp. 535-542, 2000.
    • (2000) Proc. 17th International Conf. on Machine Learning , pp. 535-542
    • Lauer, M.1    Riedmiller, M.2
  • 24
    • 34548719708
    • The logic of animal conflict
    • J. Maynard Smith and G. R. Price, "The logic of animal conflict," Nature, vol. 246, no. 2, pp. 15-18, 1973.
    • (1973) Nature , vol.246 , Issue.2 , pp. 15-18
    • Smith, J.M.1    Price, G.R.2
  • 27
    • 84948131383
    • Social agents playing a periodical policy
    • Proceedings of the 12th European Conference on Machine Learning, Springer
    • A. Nowé, J. Parent, and K. Verbeeck, "Social agents playing a periodical policy," in Proceedings of the 12th European Conference on Machine Learning, Volume 2176 of Lecture Notes in Artificial Intelligence, Springer, pp. 382-393, 2001.
    • (2001) Lecture Notes in Artificial Intelligence , vol.2176 , pp. 382-393
    • Nowé, A.1    Parent, J.2    Verbeeck, K.3
  • 30
    • 3142772701
    • Adaptive load balancing of parallel applications with social reinforcement learning on heterogeneous systems
    • to appear
    • J. Parent, K. Verbeeck, A. Nowé, K. Steenhaut, J. Lemeire, and E. Dirkx, "Adaptive load balancing of parallel applications with social reinforcement learning on heterogeneous systems," J. Sci. Program. 2004. to appear.
    • (2004) J. Sci. Program.
    • Parent, J.1    Verbeeck, K.2    Nowé, A.3    Steenhaut, K.4    Lemeire, J.5    Dirkx, E.6
  • 31
    • 31344476554
    • An evolutionary game-theoretic comparison of two double-auction market designs
    • Workshop on Agent Mediated Electronic Commerce VI: Theories for Engineering of Distributed Mechanisms and Systems (AMEC'04), Springer
    • S. Phelps, S. Parsons, and P. McBurney, "An evolutionary game-theoretic comparison of two double-auction market designs," in Workshop on Agent Mediated Electronic Commerce VI: Theories for Engineering of Distributed Mechanisms and Systems (AMEC'04), Volume 2531 of Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2004.
    • (2004) Lecture Notes in Artificial Intelligence , vol.2531 , pp. 109-118
    • Phelps, S.1    Parsons, S.2    McBurney, P.3
  • 35
    • 0034661690
    • Evolution of biological information
    • T. D. Schneider, "Evolution of biological information," J. Nucl. Acid Res. vol. 28, no. 14, pp. 2794-2799, 2000.
    • (2000) J. Nucl. Acid Res. , vol.28 , Issue.14 , pp. 2794-2799
    • Schneider, T.D.1
  • 38
    • 22944450534
    • Collective INtelligence with sequence of actions
    • 14th European Conference on Machine Learning, Springer
    • P. J. 't Hoen and S. M. Bohte, "Collective INtelligence with sequence of actions," in 14th European Conference on Machine Learning, Volume 2837 of Lecture Notes in Artificial Intelligence, Springer, 2003.
    • (2003) Lecture Notes in Artificial Intelligence , vol.2837
    • 'T Hoen, P.J.1    Bohte, S.M.2
  • 42
    • 0036894214
    • Varieties of learning automata: An overview
    • P. S. Sastry and M. A. L. Thathachar, "Varieties of Learning Automata: An Overview," IEEE Trans. Sys. Man Cybernet, vol. 32, no. 6, pp. 323-334, 2002.
    • (2002) IEEE Trans. Sys. Man Cybernet , vol.32 , Issue.6 , pp. 323-334
    • Sastry, P.S.1    Thathachar, M.A.L.2
  • 43
    • 0028497630
    • Asynchronous stochastic approximation and Q-learning
    • J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learn, vol. 16, pp. 185-202, 1994.
    • (1994) Machine Learn , vol.16 , pp. 185-202
    • Tsitsiklis, J.N.1
  • 46
    • 9444229990
    • Extended replicator dynamics as a key to reinforcement learning in multi-agent systems
    • Proceedings of the 14th European Conference on Machine Learning (ECML), Springer
    • K. Tuyls, D. Heytens, A. Nowé, and B. Manderick, "Extended Replicator Dynamics as a Key to Reinforcement Learning in Multi-Agent Systems," in Proceedings of the 14th European Conference on Machine Learning (ECML), Volume 2837 of Lecture Notes in Artificial Intelligence, Springer, 2003.
    • (2003) Lecture Notes in Artificial Intelligence , vol.2837
    • Tuyls, K.1    Heytens, D.2    Nowé, A.3    Manderick, B.4
  • 48
    • 31344438454
    • An evolutionary game theoretic perspective on learning in multi-agent systems
    • Kluwer Academic Publishers
    • K. Tuyls, A. Nowé, T. Lenaerts, and B. Manderick, "An evolutionary game theoretic perspective on learning in multi-agent systems," in Synthese, Section Knowledge, Rationality and Action, Kluwer Academic Publishers, 2004, vol. 139, no. 2, pp. 297-330.
    • (2004) Synthese, Section Knowledge, Rationality and Action , vol.139 , Issue.2 , pp. 297-330
    • Tuyls, K.1    Nowé, A.2    Lenaerts, T.3    Manderick, B.4
  • 50
    • 31344463262
    • Homo egualis reinforcement learning agents for load balancing
    • Proceedings of the 1st NASA Workshop on Radical Agent Concepts, Springer
    • K. Verbeeck, A. Nowé, and J. Parent, "Homo egualis reinforcement learning agents for load balancing," in Proceedings of the 1st NASA Workshop on Radical Agent Concepts, Volume 2564 of Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2002.
    • (2002) Lecture Notes in Artificial Intelligence , vol.2564 , pp. 109-118
    • Verbeeck, K.1    Nowé, A.2    Parent, J.3
  • 56
    • 0032691530
    • General principles of learning-based multi-agent systems
    • Oren Etzioni, Jörg P. Müller, and Jeffrey M. Bradshaw (eds.), ACM Press: Seattle, WA, USA
    • David H. Wolpert, Kevin R. Wheeler, and Kagan Tumer, "General principles of learning-based multi-agent systems," in Oren Etzioni, Jörg P. Müller, and Jeffrey M. Bradshaw (eds.), Proceedings of the Third International Conference on Autonomous Agents (Agents'99), ACM Press: Seattle, WA, USA, pp. 77-83, 1999.
    • (1999) Proceedings of the Third International Conference on Autonomous Agents (Agents'99) , pp. 77-83
    • Wolpert, D.H.1    Wheeler, K.R.2    Tumer, K.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.