SCOPUS 정보 검색 플랫폼

Knowledge Engineering Review

Volumn 20, Issue 1, 2005, Pages 63-90

Evolutionary game theory and multi-agent reinforcement learning

(2) Tuyls, Karl a Nowé, Ann b

a MAASTRICHT UNIVERSITY (Netherlands)

b VRIJE UNIVERSITEIT BRUSSEL (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

EVOLUTIONARY ALGORITHMS; LEARNING SYSTEMS; MULTI AGENT SYSTEMS;

EVOLUTIONARY GAME THEORY; REINFORCEMENT LEARNING;

GAME THEORY;

EID: 28544446213 PISSN: 02698889 EISSN: 14698005 Source Type: Journal
DOI: 10.1017/S026988890500041X Document Type: Article

Times cited : (101)

References (61)

1
- 0344154963
- Strategy Learning with multilayer connectionist representations
- Anderson, CW, 1987, Strategy Learning with multilayer connectionist representations. In Proceedings of the 4th International Conference on Machine Learning, pp. 103-114.
- (1987) Proceedings of the 4th International Conference on Machine Learning , pp. 103-114
- Anderson, C.W.¹

2
- 0020970738
- Neuron-like adaptive elements that can solve difficult learning control problems
- Barto, A, Sutton, R and Anderson, C, 1983, Neuron-like adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man and Cybernetics 13(5), 834-846.
- (1983) IEEE Transactions on Systems, Man and Cybernetics , vol.13 , Issue.5 , pp. 834-846
- Barto, A.¹ Sutton, R.² Anderson, C.³

3
- 33751170995
- Learning to behave socially and avoid the Braess paradox in a commuting scenario
- Bazzan, ALC and Klugl, F, 2003, Learning to behave socially and avoid the Braess paradox in a commuting scenario. In Proceedings of the 1st International Workshop on Evolutionary Game Theory for Learning in MAS, Melbourne Australia, 14 July 2003.
- (2003) Proceedings of the 1st International Workshop on Evolutionary Game Theory for Learning in MAS, Melbourne Australia, 14 July 2003
- Bazzan, A.L.C.¹ Klugl, F.²

4
- 0011917212
- PhD thesis. University of Karlsruhe
- Bazzan, ALC, 1997, A game-theoretic approach to coordination of traffic signal agents. PhD thesis. University of Karlsruhe.
- (1997) A Game-theoretic Approach to Coordination of Traffic Signal Agents
- Bazzan, A.L.C.¹

5
- 0004009767
- Princeton, NJ: Princeton University Press
- Bellman, RE and Dreyfuss SE, 1962, Applied Dynamical Programming. Princeton, NJ: Princeton University Press.
- (1962) Applied Dynamical Programming
- Bellman, R.E.¹ Dreyfuss, S.E.²

6
- 55949131592
- Reading, MA: Academic Press
- Bertsekas, DP, 1976, Dynamic Programming and Stochastic Control (Mathematics in Science and Engineering, 125). Reading, MA: Academic Press.
- (1976) Dynamic Programming and Stochastic Control (Mathematics in Science and Engineering, 125) , vol.125
- Bertsekas, D.P.¹

7
- 0003070025
- Nash equilibrium and evolution by imitation
- Arrow, K et al. (ed.). London: MacMillan
- Bjornerstedt, J and Weibull, J, 1995, Nash equilibrium and evolution by imitation. In Arrow, K et al. (ed.). The Rational Foundations of Economic Behavior. London: MacMillan.
- (1995) The Rational Foundations of Economic Behavior
- Bjornerstedt, J.¹ Weibull, J.²

8
- 0031281590
- Learning through reinforcement and replicator dynamics
- November
- Borgers, T and Sarin, R, 1997, Learning through reinforcement and replicator dynamics. Journal of Economic Theory 77(1), November.
- (1997) Journal of Economic Theory , vol.77 , Issue.1
- Borgers, T.¹ Sarin, R.²

9
- 0003781528
- New York: Wiley
- Bush, RR and Mosteller, F, 1955, Stochastic Models for Learning. New York: Wiley.
- (1955) Stochastic Models for Learning
- Bush, R.R.¹ Mosteller, F.²

10
- 0028564629
- Acting optimally in partially observable stochastic domains
- Cassandra, AR, Kaelbling, LP. and Littman, ML, 1994, Acting optimally in partially observable stochastic domains. In Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA.
- (1994) Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA
- Cassandra, A.R.¹ Kaelbling, L.P.² Littman, M.L.³

11
- 0031630561
- The dynamics of reinforcement learning in cooperative multi-agent systems
- Claus, C and Boutilier, C, 1998, The dynamics of reinforcement learning in cooperative multi-agent systems. In Proceedings of the 15th International Conference on Artificial Intelligence, pp. 746-752.
- (1998) Proceedings of the 15th International Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

12
- 0003860985
- Princeton, NJ: Princeton University Press
- Gintis, CM, 2000, Game Theory Evolving. Princeton, NJ: Princeton University Press.
- (2000) Game Theory Evolving
- Gintis, C.M.¹

13
- 28544446120
- Menlo Park, CA: AAAI Press
- Grenager, T, Powers, R and Shoham, Y, 2002, Dispersion Games: General Definitions and Some Specific Learning Results. Menlo Park, CA: AAAI Press.
- (2002) Dispersion Games: General Definitions and Some Specific Learning Results
- Grenager, T.¹ Powers, R.² Shoham, Y.³

14
- 0003779190
- Reading, MA: Academic Press
- Hirsch, MW and Smale, S, 1974, Differential Equations, Dynamical Systems and Linear Algebra. Reading, MA: Academic Press.
- (1974) Differential Equations, Dynamical Systems and Linear Algebra
- Hirsch, M.W.¹ Smale, S.²

15
- 22944478374
- Engineering multi-agent reinforcement learning using evolutionary dynamics
- Berlin: Springer
- Hoen, PJ and Tuyls, K, 2004, Engineering multi-agent reinforcement learning using evolutionary dynamics. In Proceedings of the 15th European Conference on Machine Learning (ECML'04) (Lecture Notes in Artificial Intelligence, 3201), Pisa, Italy, 20-24 September 2004. Berlin: Springer.
- (2004) Proceedings of the 15th European Conference on Machine Learning (ECML'04) (Lecture Notes in Artificial Intelligence, 3201), Pisa, Italy, 20-24 September 2004 , vol.3201
- Hoen, P.J.¹ Tuyls, K.²

16
- 0003532627
- Cambridge: Cambridge University Press
- Hofbauer, J and Sigmund, K, 1998, Evolutionary Games and Population Dynamics. Cambridge: Cambridge University Press.
- (1998) Evolutionary Games and Population Dynamics
- Hofbauer, J.¹ Sigmund, K.²

17
- 1642321450
- Cambridge: Cambridge University Press
- Hu, J and Welhnan, MO, 1999, Multiagent Reinforcement Learning in Stochastic Games. Cambridge: Cambridge University Press.
- (1999) Multiagent Reinforcement Learning in Stochastic Games
- Hu, J.¹ Welhnan, M.O.²

18
- 9444236608
- On no-regret learning, fictitious play, and Nash equilibrium
- Jafari, C, Greenwald, A, Gondek, D and Ercal, G, 2001, On no-regret learning, fictitious play, and Nash equilibrium. In Proceedings of the 18th International Conference on Machine Learning, pp. 223-226.
- (2001) Proceedings of the 18th International Conference on Machine Learning , pp. 223-226
- Jafari, C.¹ Greenwald, A.² Gondek, D.³ Ercal, G.⁴

19
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, LP, Littman, ML and Moore, AW, Reinforcement learning: a survey. Journal of Artificial Intelligence Research.
- Journal of Artificial Intelligence Research
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

20
- 28544435259
- Menlo Park, CA: AAAI Press
- Kapetanakis, S and Kudenko, D, 2002, Reinforcement Learning of Coordination in Cooperative Multi-agent Systems. Menlo Park, CA: AAAI Press.
- (2002) Reinforcement Learning of Coordination in Cooperative Multi-agent Systems
- Kapetanakis, S.¹ Kudenko, D.²

21
- 28544448043
- PhD dissertation, University of York
- Kapetanakis, S, 2004, Independent learning of coordination in cooperative single-stage games. PhD dissertation, University of York.
- (2004) Independent Learning of Coordination in Cooperative Single-stage Games
- Kapetanakis, S.¹

22
- 28544436546
- PhD dissertation, Helsinki University of Technology
- Kononen, V, 2004, Multiagent reinforcement learning in Markov games: asymmetric and symmetric approaches. PhD dissertation, Helsinki University of Technology.
- (2004) Multiagent Reinforcement Learning in Markov Games: Asymmetric and Symmetric Approaches
- Kononen, V.¹

23
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- Lauer, M and Riedmiller, M, 2000, An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In Proceedings of the 17th International Conference on Machine Learning.
- (2000) Proceedings of the 17th International Conference on Machine Learning
- Lauer, M.¹ Riedmiller, M.²

24
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Littman, ML, 1994, Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the 11th International Conference on Machine Learning, pp. 157-163.
- (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

25
- 0012327484
- Using eligibility traces to find the best memoryless policy in a partially observable Markov process
- Loch, J and Singh, S, 1998, Using eligibility traces to find the best memoryless policy in a partially observable Markov process. In Proceedings of the 15th International Conference on Machine Learning, San Francisco, CA.
- (1998) Proceedings of the 15th International Conference on Machine Learning, San Francisco, CA
- Loch, J.¹ Singh, S.²

26
- 0004018184
- Cambridge: Cambridge University Press
- Maynard-Smith, J, 1982, Evolution and the Theory of Games. Cambridge: Cambridge University Press.
- (1982) Evolution and the Theory of Games
- Maynard-Smith, J.¹

27
- 34548719708
- The logic of animal conflict
- Maynard Smith, J and Price, GR, 1973, The logic of animal conflict. Nature 146, 15-18.
- (1973) Nature , vol.146 , pp. 15-18
- Maynard Smith, J.¹ Price, G.R.²

28
- 32844460364
- Towards a Pareto optimal solution in general-sum games
- Mukherjee, R and Sen, S, 2001, Towards a Pareto optimal solution in general-sum games. Working Notes of the 5th Conference on Autonomous Agents, pp. 21-28.
- (2001) Working Notes of the 5th Conference on Autonomous Agents , pp. 21-28
- Mukherjee, R.¹ Sen, S.²

29
- 0016082525
- Learning automata: A survey
- Narendra, K and Thathacher, M, 1974, Learning automata: a survey. IEEE Transactions on Systems, Man, and Cybernetics 14, 323-334.
- (1974) IEEE Transactions on Systems, Man, and Cybernetics , vol.14 , pp. 323-334
- Narendra, K.¹ Thathacher, M.²

30
- 0003891507
- Englewood Cliffs, NJ: Prentice-Hall
- Narendra, K and Thathacher, M, 1989, Learning Automata: An Introduction. Englewood Cliffs, NJ: Prentice-Hall.
- (1989) Learning Automata: An Introduction
- Narendra, K.¹ Thathacher, M.²

31
- 84948131383
- Social agents playing a periodical policy
- Nowé, A, Parent, J and Verbeeck, K, 2001, Social agents playing a periodical policy. In Proceedings of the 12th European Conference on Machine Learning, pp. 382-393.
- (2001) Proceedings of the 12th European Conference on Machine Learning , pp. 382-393
- Nowé, A.¹ Parent, J.² Verbeeck, K.³

32
- 0011847654
- Distributed reinforcement learning, loadbased routing a case study
- Nowé, A and Verbeeck, K, 1999, Distributed reinforcement learning, loadbased routing a case study. Notes of the Neural, Symbolic and Reinforcement Methods for Sequence Learning Workshop at IJCAI99, Stockholm, Sweden.
- (1999) Notes of the Neural, Symbolic and Reinforcement Methods for Sequence Learning Workshop at IJCAI99, Stockholm, Sweden
- Nowé, A.¹ Verbeeck, K.²

33
- 84884079276
- Princeton, NJ: Princeton University Press
- von Neumann, J and Morgenstern, O, 1944, Theory of Games and Economic Behaviour. Princeton, NJ: Princeton University Press.
- (1944) Theory of Games and Economic Behaviour
- Von Neumann, J.¹ Morgenstern, O.²

34
- 0003427725
- Cambridge, MA: MIT Press
- Osborne JO and Rubinstein, A, 1994, A Course in Game Theory. Cambridge, MA: MIT Press.
- (1994) A Course in Game Theory
- Osborne, J.O.¹ Rubinstein, A.²

35
- 28544433644
- Computational Modeling Lab, Vrije Universiteit Brussel
- Maarten, P, 2003, A Study of Reinforcement Learning Techniques for Cooperative Multi-agent Systems. Computational Modeling Lab, Vrije Universiteit Brussel.
- (2003) A Study of Reinforcement Learning Techniques for Cooperative Multi-agent Systems
- Maarten, P.¹

36
- 0012331004
- An analysis of direct reinforcement learning in non-Markovian domains
- Pendrith MD and McGarity MJ, 1998, An analysis of direct reinforcement learning in non-Markovian domains. In Proceedings of the 15th International Conference on Machine Learning, San Francisco, CA.
- (1998) Proceedings of the 15th International Conference on Machine Learning, San Francisco, CA
- Pendrith, M.D.¹ McGarity, M.J.²

37
- 56449099734
- On the existence of fixed points for Q-learning and Sarsa in partially observable domains
- Perkins TJ and Pendrith MD, 2002, On the existence of fixed points for Q-learning and Sarsa in partially observable domains. In Proceedings of the International Conference on Machine Learning (ICML02).
- (2002) Proceedings of the International Conference on Machine Learning (ICML02)
- Perkins, T.J.¹ Pendrith, M.D.²

38
- 1142305723
- Cambridge: Cambridge University Press
- Redondo, FV, 2001, Game Theory and Economics. Cambridge: Cambridge University Press.
- (2001) Game Theory and Economics
- Redondo, F.V.¹

39
- 0004151788
- Cambridge, MA: MIT Press
- Samuelson, L, 1997, Evolutionary Games and Equilibrium Selection. Cambridge, MA: MIT Press.
- (1997) Evolutionary Games and Equilibrium Selection
- Samuelson, L.¹

40
- 0034661690
- Evolution of biological information
- Schneider, TD, 2000, Evolution of biological information. Journal of Nucleic Acids Research 28, 2794-2799.
- (2000) Journal of Nucleic Acids Research , vol.28 , pp. 2794-2799
- Schneider, T.D.¹

41
- 1142293590
- Institute for Theoretical Physics, Köln, Euroland
- Stauffer, D, 1999, Life, Love and Death: Models of Biological Reproduction and Aging. Institute for Theoretical Physics, Köln, Euroland.
- (1999) Life, Love and Death: Models of Biological Reproduction and Aging
- Stauffer, D.¹

42
- 28544444638
- Towards a hardware implementation of reinforcement learning for call admission control in networks for integrated services
- Steenhaut, K, Nowé, A, Fakir, M and Dirkx, E, 1997, Towards a hardware implementation of reinforcement learning for call admission control in networks for integrated services. In Proceedings of the International Workshop on Applications of Neural Networks and other Intelligent Techniques to Telecommunications, 3, Melbourne.
- (1997) Proceedings of the International Workshop on Applications of Neural Networks and Other Intelligent Techniques to Telecommunications, 3, Melbourne
- Steenhaut, K.¹ Nowé, A.² Fakir, M.³ Dirkx, E.⁴

43
- 33847202724
- Learning to predict by the methods of temporal differences
- Boston, MA: Kluwer Academic
- Sutton, RS, 1988, Learning to predict by the methods of temporal differences. Machine Learning, vol. 3. Boston, MA: Kluwer Academic, pp. 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

44
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, RS and Barto, AG, 1998, Reinforcement Learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

45
- 0003401114
- Cambridge, MA: MIT Press
- Stone P, 2000, Layered Learning in Multi-agent Systems. Cambridge, MA: MIT Press.
- (2000) Layered Learning in Multi-agent Systems
- Stone, P.¹

46
- 0036894214
- Varieties of learning automata: An overview
- Thathacher, MAL and Sastry, PS, 2002, Varieties of learning automata: an overview. IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics 32(6).
- (2002) IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics , vol.32 , Issue.6
- Thathacher, M.A.L.¹ Sastry, P.S.²

47
- 0000502181
- On the behavior of finite automata in random media
- Tsetlin, ML, 1962, On the behavior of finite automata in random media. Automation and Remote Control 22, 1210-1219.
- (1962) Automation and Remote Control , vol.22 , pp. 1210-1219
- Tsetlin, M.L.¹

48
- 0004162272
- New York: Academic
- Tsetlin, ML, 1973, Theory and Modeling of Biological Systems. New York: Academic.
- (1973) Theory and Modeling of Biological Systems
- Tsetlin, M.L.¹

49
- 27144547178
- Asynchronous stochastic approximation and Q-learning
- Laboratory for Information and Decision Systems and the Operation Research Center, MIT, Cambridge, MA
- Tsitsiklis, JN, 1993, Asynchronous stochastic approximation and Q-learning. Internal Report, Laboratory for Information and Decision Systems and the Operation Research Center, MIT, Cambridge, MA.
- (1993) Internal Report
- Tsitsiklis, J.N.¹

50
- 1142305721
- Towards a relation between learning agents and evolutionary dynamics
- Belgium: KU Leuven
- Tuyls, K, Lenaerts, T, Verbeeck, K, Maes, S and Manderick, B, 2002, Towards a relation between learning agents and evolutionary dynamics. In Proceedings of the Belgium-Netherlands Artificial Intelligence Conference 2002 (BNAIC). Belgium: KU Leuven.
- (2002) Proceedings of the Belgium-Netherlands Artificial Intelligence Conference 2002 (BNAIC)
- Tuyls, K.¹ Lenaerts, T.² Verbeeck, K.³ Maes, S.⁴ Manderick, B.⁵

51
- 8344263004
- On a dynamical analysis of reinforcement learning in games: Emergence of Occam's Razor
- Berlin, Springer
- Tuyls, K, Verbeeck, K and Maes, S, On a dynamical analysis of reinforcement learning in games: emergence of Occam's Razor. Multi-agent Systems and Applications III (Central and Eastern European conference on Multi- Agent Systems 2003), Prague, 16-18 June 2003, Czech Republic (Lecture Notes in Artificial Intelligence, 2691). Berlin, Springer.
- Multi-agent Systems and Applications III (Central and Eastern European Conference on Multi- agent Systems 2003), Prague, 16-18 June 2003, Czech Republic (Lecture Notes in Artificial Intelligence, 2691) , vol.2691
- Tuyls, K.¹ Verbeeck, K.² Maes, S.³

52
- 26444437242
- A selection-mutation model for Q-leaming in multi-agent systems
- New York: ACM Press
- Tuyls, K, Verbeeck, K and Lenaerts, T, 2003, A selection-mutation model for Q-leaming in multi-agent systems. In The ACM International Conference Proceedings Series, Autonomous Agents and Multi-agent Systems 2003, Melbourne, Australia 14-18 July 2003. New York: ACM Press.
- (2003) The ACM International Conference Proceedings Series, Autonomous Agents and Multi-agent Systems 2003, Melbourne, Australia 14-18 July 2003
- Tuyls, K.¹ Verbeeck, K.² Lenaerts, T.³

53
- 9444229990
- Extended replicator dynamics as a key to reinforcement learning in multi-agent systems
- Berlin: Springer
- Tuyls, K, Heytens, D, Nowé, A and Manderick, B, 2003, Extended replicator dynamics as a key to reinforcement learning in multi-agent systems. In Proceedings of the European Conference on Machine Learning '03 (Lecture Notes in Artificial Intelligence), Cavtat-Dubrovnik, Croatia 22-26 September 2003. Berlin: Springer.
- (2003) Proceedings of the European Conference on Machine Learning '03 (Lecture Notes in Artificial Intelligence), Cavtat-Dubrovnik, Croatia 22-26 September 2003
- Tuyls, K.¹ Heytens, D.² Nowé, A.³ Manderick, B.⁴

54
- 84943265381
- Learning to reach the Pareto optimal Nash equilibrium as a team
- Berlin: Springer
- Verbeeck, K, Nowé, A, Lenaerts, T and Parent, J, 2002, Learning to reach the Pareto optimal Nash equilibrium as a team. In Proceedings of the 15th Australian Joint Conference on Artificial Intelligence (Lecture Notes in Artificial Intelligence, 2557). Berlin: Springer, pp. 407-418.
- (2002) Proceedings of the 15th Australian Joint Conference on Artificial Intelligence (Lecture Notes in Artificial Intelligence, 2557) , vol.2557 , pp. 407-418
- Verbeeck, K.¹ Nowé, A.² Lenaerts, T.³ Parent, J.⁴

55
- 34249833101
- Q-learning
- Watkins, C and Dayan, P, 1992, Q-learning. Machine Learning 8(3). 279-292.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

56
- 0004202581
- Cambridge, MA: MIT Press
- Weibull, JW, Evolutionary Game Theory. Cambridge, MA: MIT Press.
- Evolutionary Game Theory
- Weibull, J.W.¹

57
- 0003057852
- Stockhohn School of Economics and I.U.I, 7 May 1998
- Weibull, JW, 1998, What we have learned from evolutionary game theory so far? Stockhohn School of Economics and I.U.I, 7 May 1998.
- (1998) What We Have Learned from Evolutionary Game Theory so Far?
- Weibull, J.W.¹

58
- 0003744207
- Weiss, G (ed.). Cambridge, MA: MIT Press
- Weiss, G, 1999, In Weiss, G (ed.), Multiagent Systems. A Modern Approach to Distributed Artificial Intelligence. Cambridge, MA: MIT Press.
- (1999) Multiagent Systems. A Modern Approach to Distributed Artificial Intelligence
- Weiss, G.¹

59
- 84899033169
- Using collective intelligence to route internet traffic
- Wolpert, DH, Turner, K and Frank, J, 1998, Using collective intelligence to route internet traffic. Advances in Neural Information Processing Systems, Denver, CO, 1998, pp. 952-958.
- (1998) Advances in Neural Information Processing Systems, Denver, CO, 1998 , pp. 952-958
- Wolpert, D.H.¹ Turner, K.² Frank, J.³

60
- 0032691530
- General principles of learning-based multi-agent systems
- New York: ACM Press
- Wolpert, DH, Wheller, KR and Tumer, K, 1999, General principles of learning-based multi-agent systems. In Proceedings of the 3rd International Conference on Autonomous Agents (Agents'99), Seattle, W A. New York: ACM Press.
- (1999) Proceedings of the 3rd International Conference on Autonomous Agents (Agents'99), Seattle, WA
- Wolpert, D.H.¹ Wheller, K.R.² Tumer, K.³

61
- 0004285157
- Chichester: Wiley
- Wooldridge, M, 2002, An Introduction to MultiAgent Systems. Chichester: Wiley.
- (2002) An Introduction to MultiAgent Systems
- Wooldridge, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.