SCOPUS Information Retrieval Platform

Synthese

Volume 139, Issue 2, 2004, Pages 297-330

An Evolutionary Game Theoretic perspective on Learning in Multi-Agent Systems

(4) Tuyls, Karl (a); Nowe, Ann (a); Lenaerts, Tom (a); Manderick, Bernard (a)

(a) VRIJE UNIVERSITEIT BRUSSEL (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 31344438454
PISSN: 00397857
EISSN: 15730964
Source Type: Journal
DOI: 10.1023/B:SYNT.0000024908.89191.f1
Document Type: Review

Times cited : (15)

References (41)

1
- 33751170995
- Learning to behave socially and avoid the Braess paradox in a commuting scenario
- Melbourne Australia
- Bazzan A. L. C. and Franziska Klugl: 2003, 'Learning to Behave Socially and Avoid the Braess Paradox in a Commuting Scenario', in Proceedings of the First International Workshop on Evolutionary Game Theory for Learning in MAS, Melbourne Australia.
- (2003) Proceedings of the First International Workshop on Evolutionary Game Theory for Learning in MAS
- Bazzan, A.L.C.¹ Klugl, F.²

2
- 0011917212
- Ph. D. thesis, University of Karlsruhe
- Bazzan A. L. C.: 1997, A Game-Theoretic Approach to Coordination of Traffic Signal Agents, Ph. D. thesis, University of Karlsruhe.
- (1997) A Game-Theoretic Approach to Coordination of Traffic Signal Agents
- Bazzan, A.L.C.¹

3
- 0031281590
- Learning through reinforcement and replicator dynamics
- Börgers, T. and R. Sarin: 1997, 'Learning through Reinforcement and Replicator Dynamics', Journal of Economic Theory 77(1).
- (1997) Journal of Economic Theory , vol.77 , Issue.1
- Börgers, T.¹ Sarin, R.²

4
- 34250513249
- Über ein Paradoxon aus der Verkehrsplanung
- Braess D.: 1968, 'Über ein Paradoxon aus der Verkehrsplanung', Unternehmensforschung 12, 258.
- (1968) Unternehmensforschung , vol.12 , pp. 258
- Braess, D.¹

5
- 0003781528
- Wiley, New York
- Bush, R. R. and F. Mosteller: 1955, Stochastic Models for Learning, Wiley, New York.
- (1955) Stochastic Models for Learning
- Bush, R.R.¹ Mosteller, F.²

6
- 0031630561
- The dynamics of reinforcement learning in cooperative multi-agent systems
- Claus, C. and C. Boutilier: 1998, 'The Dynamics of Reinforcement Learning in Cooperative Multi-Agent Systems', in Proceedings of the 15th International Conference on Artificial Intelligence, pp. 746-752.
- (1998) Proceedings of the 15th International Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

7
- 84892320972
- Learning TOMs: Convergence to non-myopic equilibria
- Melbourne, Australia
- Ghosh, A. and S. Sen: 2003, 'Learning TOMs: Convergence to Non-Myopic Equilibria', in Proceedings of the First International Workshop on Evolutionary Game Theory for Learning in MAS, Melbourne, Australia.
- (2003) Proceedings of the First International Workshop on Evolutionary Game Theory for Learning in MAS
- Ghosh, A.¹ Sen, S.²

8
- 0003860985
- University Press, Princeton
- Gintis, C. M.: 2000, Game Theory Evolving, University Press, Princeton.
- (2000) Game Theory Evolving
- Gintis, C.M.¹

9
- 0003779190
- Academic Press, Inc.
- Hirsch, M. W. and S. Smale: 1974, Differential Equations, Dynamical Systems and Linear Algebra, Academic Press, Inc.
- (1974) Differential Equations, Dynamical Systems and Linear Algebra
- Hirsch, M.W.¹ Smale, S.²

10
- 0003532627
- Cambridge University Press
- Hofbauer, J. and K. Sigmund: 1998, Evolutionary Games and Population Dynamics, Cambridge University Press.
- (1998) Evolutionary Games and Population Dynamics
- Hofbauer, J.¹ Sigmund, K.²

11
- 1642321450
- Cambridge University Press
- Hu, J. and M. P. Wellman: 1998, Multiagent Reinforcement Learning in Stochastic Games, Cambridge University Press.
- (1998) Multiagent Reinforcement Learning in Stochastic Games
- Hu, J.¹ Wellman, M.P.²

12
- 9444236608
- On no-regret learning, fictitious play, and Nash equilibrium
- Jafari, C., A. Greenwald, D. Gondek, and G. Ercal: 2001, 'On No-Regret Learning, Fictitious Play, and Nash Equilibrium', in Proceedings of the Eighteenth International Conference on Machine Learning, pp. 223-226.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 223-226
- Jafari, C.¹ Greenwald, A.² Gondek, D.³ Ercal, G.⁴

13
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., M. L. Littman, and A. W. Moore: 1996, 'Reinforcement Learning: A Survey', Journal of Artificial Intelligence Research.
- (1996) Journal of Artificial Intelligence Research
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

14
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman, M. L.: 1994, 'Markov Games as a Framework for Multi-Agent Reinforcement Learning', Proceedings of the Eleventh International Conference on Machine Learning, pp. 157-163.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

15
- 0012327484
- Using eligibility traces to find the best memoryless policy in a partially observable Markov process
- San Francisco
- Loch, J. and S. Singh: 1998, 'Using Eligibility Traces to Find the Best Memoryless Policy in a Partially Observable Markov Process', Proceedings of the Fifteenth International Conference on Machine Learning, San Francisco.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning
- Loch, J.¹ Singh, S.²

16
- 33751180011
- A roadmap for agent based computing
- Luck, M., P. McBurney, and C. Preist: 2003, 'A Roadmap for Agent Based Computing', AgentLink, Network of Excellence.
- (2003) AgentLink, Network of Excellence
- Luck, M.¹ McBurney, P.² Preist, C.³

17
- 0004018184
- Cambridge University Press
- Maynard-Smith, J.: 1982, Evolution and the Theory of Games, Cambridge University Press.
- (1982) Evolution and the Theory of Games
- Maynard-Smith, J.¹

18
- 34548719708
- The logic of animal conflict
- Maynard Smith, J. and G. R. Price: 1973, 'The Logic of Animal Conflict', Nature 246, 15-18.
- (1973) Nature , vol.246 , pp. 15-18
- Maynard Smith, J.¹ Price, G.R.²

19
- 0003891507
- Prentice-Hall
- Narendra, K. and M. Thathachar: 1989, Learning Automata: An Introduction, Prentice-Hall.
- (1989) Learning Automata: An Introduction
- Narendra, K.¹ Thathachar, M.²

20
- 84948131383
- Social agents playing a periodical policy
- Nowé, A., J. Parent, and K. Verbeeck: 2001, 'Social Agents Playing a Periodical Policy', in Proceedings of the 12th European Conference on Machine Learning, pp. 382-393.
- (2001) Proceedings of the 12th European Conference on Machine Learning , pp. 382-393
- Nowé, A.¹ Parent, J.² Verbeeck, K.³

21
- 0011847654
- Distributed reinforcement learning, loadbased routing a case study
- Stockholm, Sweden
- Nowé A. and K. Verbeeck: 1999, 'Distributed Reinforcement learning, Loadbased Routing a Case Study', Notes of the Neural, Symbolic and Reinforcement Methods for Sequence Learning Workshop at IJCAI99, Stockholm, Sweden.
- (1999) Notes of the Neural, Symbolic and Reinforcement Methods for Sequence Learning Workshop at IJCAI99
- Nowé, A.¹ Verbeeck, K.²

22
- 84884079276
- Princeton University Press, Princeton
- von Neumann, J. and O. Morgenstern: 1944, Theory of Games and Economic Behaviour, Princeton University Press, Princeton.
- (1944) Theory of Games and Economic Behaviour
- Von Neumann, J.¹ Morgenstern, O.²

23
- 0003427725
- MIT Press, Cambridge, MA
- Osborne, J. O. and A. Rubinstein: 1994, A Course in Game Theory, MIT Press, Cambridge, MA.
- (1994) A Course in Game Theory
- Osborne, J.O.¹ Rubinstein, A.²

24
- 0012331004
- An analysis of direct reinforcement learning in non-markovian domains
- San Francisco
- Pendrith, M. D. and M. J. McGarity: 1998, 'An Analysis of Direct Reinforcement Learning in Non-Markovian Domains', in Proceedings of the Fifteenth International Conference on Machine Learning, San Francisco.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning
- Pendrith, M.D.¹ McGarity, M.J.²

25
- 56449099734
- On the existence of fixed points for Q-learning and sarsa in partially observable domains
- Perkins, T. J. and M. D. Pendrith: 2002, 'On the Existence of Fixed Points for Q-Learning and Sarsa in Partially Observable Domains', in Proceedings of the International Conference on Machine Learning (ICML02).
- (2002) Proceedings of the International Conference on Machine Learning (ICML02)
- Perkins, T.J.¹ Pendrith, M.D.²

26
- 1142305723
- Cambridge University Press
- Redondo, F. V.: 2001, Game Theory and Economics, Cambridge University Press.
- (2001) Game Theory and Economics
- Redondo, F.V.¹

27
- 33751186226
- Robocup project
- Robocup project: 2003, 'The Official Robocup Website' at www.robocup.org, Robocup.
- (2003) Robocup

28
- 0004151788
- MIT Press, Cambridge, MA
- Samuelson, L.: 1997, Evolutionary Games and Equilibrium Selection, MIT Press, Cambridge, MA.
- (1997) Evolutionary Games and Equilibrium Selection
- Samuelson, L.¹

29
- 0034661690
- Evolution of biological information
- Schneider, T. D.: 2000, 'Evolution of Biological Information', Journal of Nucleic Acids Research 28, 2794-2799.
- (2000) Journal of Nucleic Acids Research , vol.28 , pp. 2794-2799
- Schneider, T.D.¹

30
- 1142293590
- Institute for Theoretical Physics, Koln, Euroland
- Stauffer, D.: 1999, Life, Love and Death: Models of Biological Reproduction and Aging, Institute for Theoretical Physics, Koln, Euroland.
- (1999) Life, Love and Death: Models of Biological Reproduction and Aging
- Stauffer, D.¹

31
- 0003401114
- MIT Press, Cambridge, MA
- Stone P.: 2000, Layered Learning in Multi-Agent Systems, MIT Press, Cambridge, MA.
- (2000) Layered Learning in Multi-agent Systems
- Stone, P.¹

32
- 0004102479
- MIT Press, Cambridge, MA
- Sutton, R. S. and A. G. Barto: 1998, Reinforcement Learning: An Introduction, MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

33
- 27144547178
- Asynchronous stochastic approximation and Q-learning
- MIT Press, Cambridge, MA
- Tsitsiklis, J. N.: 1993, 'Asynchronous Stochastic Approximation and Q-Learning', Internal Report from the Laboratory for Information and Decision Systems and the Operation Research Center, MIT Press, Cambridge, MA.
- (1993) Internal Report from the Laboratory for Information and Decision Systems and the Operation Research Center
- Tsitsiklis, J.N.¹

34
- 1142305721
- Towards a relation between learning agents and evolutionary dynamics
- KU Leuven, Belgium
- Tuyls, K., T. Lenaerts, K. Verbeeck, S. Maes, and B. Manderick: 2002, 'Towards a Relation between Learning Agents and Evolutionary Dynamics', in Proceedings of the Belgium-Netherlands Artificial Intelligence Conference 2002 (BNAIC), KU Leuven, Belgium.
- (2002) Proceedings of the Belgium-Netherlands Artificial Intelligence Conference 2002 (BNAIC)
- Tuyls, K.¹ Lenaerts, T.² Verbeeck, K.³ Maes, S.⁴ Manderick, B.⁵

35
- 8344263004
- On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor
- Lecture Notes in Artificial Intelligence, Multi-Agent Systems and Applications III, (Central and Eastern European conference on Multi-Agent Systems 2003), Prague, 16-18 June 2003, Czech Republic
- Tuyls, K., K. Verbeeck, and S. Maes: 2003a, 'On a Dynamical Analysis of Reinforcement Learning in Games: Emergence of Occam's Razor', Lecture Notes in Artificial Intelligence, Multi-Agent Systems and Applications III, Lecture Notes in AI 2691, (Central and Eastern European conference on Multi-Agent Systems 2003), Prague, 16-18 June 2003, Czech Republic.
- (2003) Lecture Notes in AI , vol.2691
- Tuyls, K.¹ Verbeeck, K.² Maes, S.³

36
- 26444437242
- A selection-mutation model for Q-learning in multi-agent systems
- Melbourne, 14-18 July 2003, Australia
- Tuyls, K., K. Verbeeck, and T. Lenaerts: 2003b, 'A Selection-Mutation Model for Q-Learning in Multi-Agent Systems', in The ACM International Conference Proceedings Series, Autonomous Agents and Multi-Agent Systems 2003, Melbourne, 14-18 July 2003, Australia.
- (2003) The ACM International Conference Proceedings Series, Autonomous Agents and Multi-agent Systems 2003
- Tuyls, K.¹ Verbeeck, K.² Lenaerts, T.³

37
- 9444229990
- Extended replicator dynamics as a key to reinforcement learning in multi-agent systems
- Cavtat-Dubrovnik, 22-26 September 2003, Croatia
- Tuyls, K., D. Heytens, A. Nowe, and B. Manderick: 2003c, 'Extended Replicator Dynamics as a Key to Reinforcement Learning in Multi-Agent Systems', Proceedings of the European Conference on Machine Learning'03, Lecture Notes in Artificial Intelligence, Cavtat-Dubrovnik, 22-26 September 2003, Croatia.
- (2003) Proceedings of the European Conference on Machine Learning'03, Lecture Notes in Artificial Intelligence
- Tuyls, K.¹ Heytens, D.² Nowe, A.³ Manderick, B.⁴

38
- 33645057735
- MIT Press, Cambridge, MA
- Weibull, J. W.: 1996, Evolutionary Game Theory, MIT Press, Cambridge, MA.
- (1996) Evolutionary Game Theory
- Weibull, J.W.¹

39
- 33751163387
- What have we learned from evolutionary game theory so far?
- Weibull, J. W.: 1998, 'What Have We Learned from Evolutionary Game Theory so Far?', Stockholm School of Economics and I.U.I., May 7, 1998.
- (1998) Stockholm School of Economics and I.U.I., May 7, 1998
- Weibull, J.W.¹

40
- 0003744207
- Gerard Weiss (ed.), MIT Press, Cambridge, MA
- Weiss, G.: 1999, in Gerard Weiss (ed.), Multiagent Systems. A Modern Approach to Distributed Artificial Intelligence, MIT Press, Cambridge, MA.
- (1999) Multiagent Systems. A Modern Approach to Distributed Artificial Intelligence
- Weiss, G.¹

41
- 0004285157
- John Wiley & Sons, Chichester, England
- Wooldridge, M.: 2002, An Introduction to MultiAgent Systems, John Wiley & Sons, Chichester, England.
- (2002) An Introduction to MultiAgent Systems
- Wooldridge, M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.