SCOPUS Information Retrieval Platform

Autonomous Agents and Multi-Agent Systems

Volume 12, Issue 1, 2006, Pages 115-153

An evolutionary dynamical analysis of multi-agent learning in iterated games

(3) Tuyls, Karl a; 't Hoen, Pieter Jan b; Vanschoenwinkel, Bram c

a HASSELT UNIVERSITY (Belgium)

b CWI (Netherlands)

c VRIJE UNIVERSITEIT BRUSSEL (Belgium)

Author keywords

COllective INtelligence; Evolutionary Game Theory; Iterated games; Multi agent systems; Reinforcement learning

Indexed keywords

EID: 31344450384 PISSN: 13872532 EISSN: 15737454 Source Type: Journal
DOI: 10.1007/s10458-005-3783-9 Document Type: Review

Times cited : (135)

References (56)

1
- 1142280919
- Adaptive policy gradient in multiagent learning
- B. Banerjee and J. Peng, "Adaptive policy gradient in multiagent learning," in Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2003.
- (2003) Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
- Banerjee, B.¹ Peng, J.²

2
- 0141988716
- Recent advances in hierarchical reinforcement learning
- A. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Discrete-Event Syst. J. vol. 13, pp. 41-77, 2003.
- (2003) Discrete-event Syst. J. , vol.13 , pp. 41-77
- Barto, A.¹ Mahadevan, S.²

3
- 0011917212
- PhD thesis, University of Karlsruhe
- A. L. C. Bazzan, A game-theoretic approach to coordination of traffic signal agents, PhD thesis, University of Karlsruhe, 1997.
- (1997) A Game-theoretic Approach to Coordination of Traffic Signal Agents
- Bazzan, A.L.C.¹

4
- 1142293055
- Transition independent decentralized Markov decision problem
- R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman, "Transition independent decentralized Markov decision problem," in Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2003.
- (2003) Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

5
- 0031281590
- Reinforcement and replicator dynamics
- T. Börgers and R. Sarin, "Reinforcement and replicator dynamics," J. Econ. Theory, vol. 77, no.1, pp. 1-14, 1997.
- (1997) J. Econ. Theory , vol.77 , Issue.1 , pp. 1-14
- Börgers, T.¹ Sarin, R.²

6
- 0004245022
- The University of Chicago Press
- R. Boyd and P. J. Richerson, Culture and the Evolutionary Process, The University of Chicago Press, 1985.
- (1985) Culture and the Evolutionary Process
- Boyd, R.¹ Richerson, P.J.²

7
- 0001491619
- A mathematical model for simple learning
- R. R. Bush and F. Mosteller, "A Mathematical Model for Simple Learning," The Psychol. Rev. vol. 58, pp. 15-18, 1951.
- (1951) The Psychol. Rev. , vol.58 , pp. 15-18
- Bush, R.R.¹ Mosteller, F.²

8
- 0003781528
- Wiley: New York
- R. R. Bush and F. Mosteller, Stochastic Models for Learning, Wiley: New York, 1955.
- (1955) Stochastic Models for Learning
- Bush, R.R.¹ Mosteller, F.²

9
- 0031630561
- The dynamics of reinforcement learning in cooperative multi-agent systems
- C. Claus and G. Boutilier, "The Dynamics of Reinforcement Learning in Cooperative Multi-Agent Systems," in Proceedings of the 15th International Conference on Artificial Intelligence, pp. 746-752, 1998.
- (1998) Proceedings of the 15th International Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, G.²

10
- 0000742255
- A stochastic learning model of economic behaviour
- J. G. Cross, "A stochastic learning model of economic behaviour," Quart. J. Econ., vol. 87, no.5, pp. 239-266, 1973.
- (1973) Quart. J. Econ. , vol.87 , Issue.5 , pp. 239-266
- Cross, J.G.¹

11
- 0003860985
- Princeton University Press
- C. M. Gintis, Game Theory Evolving, Princeton University Press, 2000.
- (2000) Game Theory Evolving
- Gintis, C.M.¹

12
- 0036931075
- Dispersion games: General definitions and some specific learning results
- T. Grenager, R. Powers, and Y. Shoham, "Dispersion games: general definitions and some specific learning results," in Proceedings of the Eighteenth National Conference on Artificial Intelligence AAAI 02, 2002.
- (2002) Proceedings of the Eighteenth National Conference on Artificial Intelligence AAAI 02
- Grenager, T.¹ Powers, R.² Shoham, Y.³

13
- 84880803349
- Generalizing plans to new environments in relational MDPs
- C. Guestrin, D. Koller, C. Gearhart, and N. Kanodia, "Generalizing plans to new environments in relational MDPs," in International Joint Conference on Artificial Intelligence (IJCAI-03), 2003.
- (2003) International Joint Conference on Artificial Intelligence (IJCAI-03)
- Guestrin, C.¹ Koller, D.² Gearhart, C.³ Kanodia, N.⁴

14
- 0003779190
- Academic Press, Inc
- M. W. Hirsch and S. Smale, Differential Equations, Dynamical Systems and Linear Algebra, Academic Press, Inc., 1974.
- (1974) Differential Equations, Dynamical Systems and Linear Algebra
- Hirsch, M.W.¹ Smale, S.²

15
- 0003532627
- Cambridge University Press
- J. Hofbauer and K. Sigmund, Evolutionary Games and Population Dynamics, Cambridge University Press, 1998.
- (1998) Evolutionary Games and Population Dynamics
- Hofbauer, J.¹ Sigmund, K.²

16
- 31344439253
- Multiagent reinforcement learning in stochastic games
- J. Hu and M. P. Wellman, "Multiagent reinforcement learning in stochastic games," in Internal Report from the Laboratory for Information and Decision Systems and the Operation Research Center, 1999.
- (1999) Internal Report from the Laboratory for Information and Decision Systems and the Operation Research Center
- Hu, J.¹ Wellman, M.P.²

17
- 1142268755
- Multi-agent learning in extensive games with complete information
- P. Huang and K. Sycara, "Multi-agent Learning in Extensive Games with complete information," in Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2003.
- (2003) Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
- Huang, P.¹ Sycara, K.²

18
- 9444236608
- On no-regret learning, fictitious play and Nash equilibrium
- Cambridge University Press
- C. Jafari, A. Greenwald, D. Gondek, and G. Ercal, "On no-regret learning, fictitious play and Nash equilibrium," in Proceedings of the Eighteenth International Conference on Machine Learning (ICML), Cambridge University Press, pp. 223-226, 2001.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning (ICML) , pp. 223-226
- Jafari, C.¹ Greenwald, A.² Gondek, D.³ Ercal, G.⁴

19
- 22944491105
- Performance model for large scale multiagent systems
- H. Jung and M. Tambe, "Performance model for large scale multiagent systems," in Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2003.
- (2003) Proceedings of the Third International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
- Jung, H.¹ Tambe, M.²

20
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement Learning: A Survey," J. Artif. Intell. Res. vol. 4, pp. 237-285, 1996.
- (1996) J. Artif. Intell. Res. , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

21
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- Morgan Kaufmann: San Francisco, CA
- M. Lauer and M. Riedmiller, "An algorithm for distributed reinforcement learning in cooperative multi-agent systems," in Proc. 17th International Conf. on Machine Learning Morgan Kaufmann: San Francisco, CA, pp. 535-542, 2000.
- (2000) Proc. 17th International Conf. on Machine Learning , pp. 535-542
- Lauer, M.¹ Riedmiller, M.²

22
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Cambridge University Press
- M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proceedings of the Eleventh International Conference on Machine Learning, Cambridge University Press, pp. 157-163, 1994.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

23
- 0004018184
- Cambridge University Press
- J. Maynard Smith, Evolution and the Theory of Games, Cambridge University Press, 1982.
- (1982) Evolution and the Theory of Games
- Smith, J.M.¹

24
- 34548719708
- The logic of animal conflict
- J. Maynard Smith, and G. R. Price, "The logic of animal conflict," Nature, vol. 146, no. 2, pp. 15-18, 1973.
- (1973) Nature , vol.146 , Issue.2 , pp. 15-18
- Smith, J.M.¹ Price, G.R.²

25
- 0016082525
- Learning automata: A survey
- K. Narendra and M. Thathachar, "Learning automata: A survey," IEEE Trans. Syst. Man Cybernet, vol. 14, no.5, pp. 323-334, 1974.
- (1974) IEEE Trans. Syst. Man Cybernet , vol.14 , Issue.5 , pp. 323-334
- Narendra, K.¹ Thathachar, M.²

26
- 0003891507
- Prentice-Hall
- K. Narendra and M. Thathachar, Learning Automata: An Introduction, Prentice-Hall, 1989.
- (1989) Learning Automata: An Introduction
- Narendra, K.¹ Thathachar, M.²

27
- 84948131383
- Social agents playing a periodical policy
- Proceedings of the 12th European Conference on Machine Learning, Springer
- A. Nowé, J. Parent, and K. Verbeeck, "Social agents playing a periodical policy," in Proceedings of the 12th European Conference on Machine Learning, Volume 2176 of Lecture Notes in Artificial Intelligence, Springer, pp. 382-393, 2001.
- (2001) Lecture Notes in Artificial Intelligence , vol.2176 , pp. 382-393
- Nowé, A.¹ Parent, J.² Verbeeck, K.³

28
- 4544335718
- Run the GAMUT: A comprehensive approach to evaluating game-theoretic algorithms
- E. Nudelman, J. Wortman, K. Leyton-Brown, and Y. Shoham, "Run the GAMUT: A comprehensive approach to evaluating game-theoretic algorithms," in Proceedings of the Fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2004.
- (2004) Proceedings of the Fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
- Nudelman, E.¹ Wortman, J.² Leyton-Brown, K.³ Shoham, Y.⁴

29
- 0003427725
- MIT Press
- M. J. Osborne and A. Rubinstein, A Course in Game Theory, MIT Press, 1994.
- (1994) A Course in Game Theory
- Osborne, M.J.¹ Rubinstein, A.²

30
- 3142772701
- Adaptive load balancing of parallel applications with social reinforcement learning on heterogeneous systems
- to appear
- J. Parent, K. Verbeeck, A. Nowé, K. Steenhaut, J. Lemeire, and E. Dirkx, "Adaptive load balancing of parallel applications with social reinforcement learning on heterogeneous systems," J. Sci. Program., 2004, to appear.
- (2004) J. Sci. Program.
- Parent, J.¹ Verbeeck, K.² Nowé, A.³ Steenhaut, K.⁴ Lemeire, J.⁵ Dirkx, E.⁶

31
- 31344476554
- An evolutionary game-theoretic comparison of two double-auction market designs
- Workshop on Agent Mediated Electronic Commerce VI: Theories for Engineering of Distributed Mechanisms and Systems (AMEC'04), Springer
- S. Phelps, S. Parsons, and P. McBurney, "An evolutionary game-theoretic comparison of two double-auction market designs," in Workshop on Agent Mediated Electronic Commerce VI: Theories for Engineering of Distributed Mechanisms and Systems (AMEC'04), Volume 2531 of Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2004.
- (2004) Lecture Notes in Artificial Intelligence , vol.2531 , pp. 109-118
- Phelps, S.¹ Parsons, S.² McBurney, P.³

32
- 32844460921
- New criteria and a new algorithm for learning in multi-agent system
- R. Powers and Y. Shoham, "New criteria and a new algorithm for learning in multi-agent system," in Proceedings of Eighteenth Annual Conference on Neural Information Processing Systems (NIPS), 2004.
- (2004) Proceedings of Eighteenth Annual Conference on Neural Information Processing Systems (NIPS)
- Powers, R.¹ Shoham, Y.²

33
- 1142305723
- Cambridge University Press
- F.V. Redondo, Game Theory and Economics, Cambridge University Press, 2001.
- (2001) Game Theory and Economics
- Redondo, F.V.¹

34
- 0004151788
- MIT Press: Cambridge, MA
- L. Samuelson, Evolutionary Games and Equilibrium Selection, MIT Press: Cambridge, MA, 1997.
- (1997) Evolutionary Games and Equilibrium Selection
- Samuelson, L.¹

35
- 0034661690
- Evolution of biological information
- T. D. Schneider, "Evolution of biological information," J. Nucl. Acid Res. vol. 28, no. 14, pp. 2794-2799, 2000.
- (2000) J. Nucl. Acid Res. , vol.28 , Issue.14 , pp. 2794-2799
- Schneider, T.D.¹

36
- 1142293590
- Institute for Theoretical physics: Köln Euroland
- D. Stauffer, Life Love and Death: Models of Biological Reproduction and Aging, Institute for Theoretical physics: Köln Euroland, 1999.
- (1999) Life Love and Death: Models of Biological Reproduction and Aging
- Stauffer, D.¹

37
- 0004102479
- MIT Press: Cambridge MA
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. MIT Press: Cambridge MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

38
- 22944450534
- Collective INtelligence with sequence of actions
- 14th European Conference on Machine Learning, Springer
- P. J. 't Hoen and S. M. Bohte, "Collective INtelligence with sequence of actions," in 14th European Conference on Machine Learning, Volume 2837 of Lecture Notes in Artificial Intelligence, Springer, 2003.
- (2003) Lecture Notes in Artificial Intelligence , vol.2837
- 'T Hoen, P.J.¹ Bohte, S.M.²

39
- 31344446978
- Collective INtelligence with task assignment
- forthcoming
- P. J. 't Hoen and S. M. Bohte, "Collective INtelligence with task assignment," in Proceedings of the Workshop on Collectives and the Design of Complex Systems (CDOCS03), forthcoming.
- Proceedings of the Workshop on Collectives and the Design of Complex Systems (CDOCS03)
- 'T Hoen, P.J.¹ Bohte, S.M.²

40
- 31344467088
- Springer
- Also available as Technical Report SEN-E0315, Lecture Notes in Artificial Intelligence, Springer, 2003.
- (2003) Technical Report SEN-E0315, Lecture Notes in Artificial Intelligence

41
- 22944478374
- Analyzing multi-agent reinforcement learning using evolutionary dynamics
- Springer
- P. J. 't Hoen and K. Tuyls, "Analyzing multi-agent reinforcement learning using evolutionary dynamics," in Proceedings of the 15th European Conference on Machine Learning (ECML), Lecture Notes in Artificial Intelligence, Springer, 2004.
- (2004) Proceedings of the 15th European Conference on Machine Learning (ECML), Lecture Notes in Artificial Intelligence
- 'T Hoen, P.J.¹ Tuyls, K.²

42
- 0036894214
- Varieties of learning automata: An overview
- P. S. Sastry and M. A. L. Thathachar, "Varieties of Learning Automata: An Overview," IEEE Trans. Sys. Man Cybernet, vol. 32, no. 6, pp. 323-334, 2002.
- (2002) IEEE Trans. Sys. Man Cybernet , vol.32 , Issue.6 , pp. 323-334
- Sastry, P.S.¹ Thathachar, M.A.L.²

43
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learn, vol. 16, pp. 185-202, 1994.
- (1994) Machine Learn , vol.16 , pp. 185-202
- Tsitsiklis, J.N.¹

44
- 85158118268
- Collective INtelligence and Braess' paradox
- Austin, August
- K. Tumer and D. Wolpert, "Collective INtelligence and Braess' Paradox," in Proceedings of the Sixteenth National Conference on Artificial Intelligence, Austin, pp. 104-109, August, 2000.
- (2000) Proceedings of the Sixteenth National Conference on Artificial Intelligence , pp. 104-109
- Tumer, K.¹ Wolpert, D.²

45
- 4344576577
- Ph.D. dissertation, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium
- K. Tuyls, Learning in Multi-Agent Systems, An Evolutionary Game Theoretic Approach, Ph.D. dissertation, Computational Modeling Lab, Vrije Universiteit Brussel, Belgium, 2004.
- (2004) Learning in Multi-agent Systems, an Evolutionary Game Theoretic Approach
- Tuyls, K.¹

46
- 9444229990
- Extended replicator dynamics as a key to reinforcement learning in multi-agent systems
- Proceedings of the 14th European Conference on Machine Learning (ECML), Springer
- K. Tuyls, D. Heytens, A. Nowé, and B. Manderick, "Extended Replicator Dynamics as a Key to Reinforcement Learning in Multi-Agent Systems," in Proceedings of the 14th European Conference on Machine Learning (ECML), Volume 2837 of Lecture Notes in Artificial Intelligence, Springer, 2003.
- (2003) Lecture Notes in Artificial Intelligence , vol.2837
- Tuyls, K.¹ Heytens, D.² Nowé, A.³ Manderick, B.⁴

47
- 1142305721
- Towards a relation between learning agents and evolutionary dynamics
- Cambridge University Press
- K. Tuyls, T. Lenaerts, K. Verbeeck, S. Maes and B. Manderick, "Towards a relation between learning agents and evolutionary dynamics", in Proceedings of the Belgian-Dutch Conference on Artificial Intelligence (BNAIC 2002), Cambridge University Press, pp. 223-226, 2002.
- (2002) Proceedings of the Belgian-Dutch Conference on Artificial Intelligence (BNAIC 2002) , pp. 223-226
- Tuyls, K.¹ Lenaerts, T.² Verbeeck, K.³ Maes, S.⁴ Manderick, B.⁵

48
- 31344438454
- An evolutionary game theoretic perspective on learning in multi-agent systems
- Kluwer Academic Publishers
- K. Tuyls, A. Nowe, T. Lenaerts, and B. Manderick, "An evolutionary game theoretic perspective on learning in multi-agent systems," in Synthese, Section Knowledge, Rationality and Action, Kluwer Academic Publishers, 2004, vol. 139, no. 2, pp. 297-330.
- (2004) Synthese, Section Knowledge, Rationality and Action , vol.139 , Issue.2 , pp. 297-330
- Tuyls, K.¹ Nowe, A.² Lenaerts, T.³ Manderick, B.⁴

49
- 1142268235
- A selection-mutation model for Q-learning in multi-agent systems
- The ACM International Conference Proceedings Series
- K. Tuyls, K. Verbeeck, and T. Lenaerts, "A Selection-Mutation model for Q-learning in Multi-Agent Systems," in Proceedings of the Third International Conference on Autonomous Agents and Multi-agent Systems (AAMAS), The ACM International Conference Proceedings Series, 2003.
- (2003) Proceedings of the Third International Conference on Autonomous Agents and Multi-agent Systems (AAMAS)
- Tuyls, K.¹ Verbeeck, K.² Lenaerts, T.³

50
- 31344463262
- Homo egualis reinforcement learning agents for load balancing
- Proceedings of the 1st NASA Workshop on Radical Agent Concepts, Springer
- K. Verbeeck, A. Nowé, and J. Parent, "Homo egualis reinforcement learning agents for load balancing," in Proceedings of the 1st NASA Workshop on Radical Agent Concepts, Volume 2564 of Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2002.
- (2002) Lecture Notes in Artificial Intelligence , vol.2564 , pp. 109-118
- Verbeeck, K.¹ Nowé, A.² Parent, J.³

51
- 84884079276
- Princeton University Press
- J. von Neumann and O. Morgenstern, Theory of Games and Economic Behavior, Princeton University Press, 1944.
- (1944) Theory of Games and Economic Behavior
- Von Neumann, J.¹ Morgenstern, O.²

52
- 1842545000
- Analyzing complex strategic interactions in multi-agent games
- Springer
- W. E. Walsh, R. Das, G. Tesauro, and J. O. Kephart, "Analyzing complex strategic interactions in multi-agent games," in Proceedings of the Eighteenth National Conference on Artificial Intelligence (AAAI-02) Workshop on Game Theoretic and Decision Theoretic Agents, Lecture Notes in Artificial Intelligence, Springer, pp. 109-118, 2002.
- (2002) Proceedings of the Eighteenth National Conference on Artificial Intelligence (AAAI-02) Workshop on Game Theoretic and Decision Theoretic Agents, Lecture Notes in Artificial Intelligence , pp. 109-118
- Walsh, W.E.¹ Das, R.² Tesauro, G.³ Kephart, J.O.⁴

53
- 34249833101
- Q-learning
- C. Watkins and P. Dayan, "Q-learning", Machine Learn., vol. 8, pp. 279-292, 1992.
- (1992) Machine Learn. , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

54
- 0004202581
- MIT Press
- J. W. Weibull, Evolutionary Game Theory, MIT Press, 1996.
- (1996) Evolutionary Game Theory
- Weibull, J.W.¹

55
- 84899033169
- Using collective INtelligence to route internet traffic
- Denver
- David H. Wolpert, Kagan Tumer, and Jeremy Frank, "Using Collective INtelligence to route internet traffic," in Advances in Neural Information Processing Systems-II, Denver, pp. 952-958, 1998.
- (1998) Advances in Neural Information Processing Systems-II , pp. 952-958
- Wolpert, D.H.¹ Tumer, K.² Frank, J.³

56
- 0032691530
- General principles of learning-based multi-agent systems
- Oren Etzioni and Jörg P. Müller and Jeffrey M. Bradshaw (ed.), ACM Press: Seattle, WA, USA
- David H. Wolpert, Kevin R. Wheeler, and Kagan Tumer, "General Principles of learning-based multi-agent systems," in Oren Etzioni, Jörg P. Müller, and Jeffrey M. Bradshaw (eds.), Proceedings of the Third International Conference on Autonomous Agents (Agents'99), ACM Press: Seattle, WA, USA, pp. 77-83, 1999.
- (1999) Proceedings of the Third International Conference on Autonomous Agents (Agents'99) , pp. 77-83
- Wolpert, D.H.¹ Wheeler, K.R.² Tumer, K.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.