SCOPUS 정보 검색 플랫폼

9th International Conference on Control, Automation, Robotics and Vision, 2006, ICARCV '06

Volumn , Issue , 2006, Pages

Multi-agent reinforcement learning: A survey

(3) Buşoniu, Lucian a Babuška, Robert a De Schutter, Bart a

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

Author keywords

Distributed control; Game theory; Multi agent systems; Reinforcement learning

Indexed keywords

ALGORITHMS; BEHAVIORAL RESEARCH; DISTRIBUTED PARAMETER CONTROL SYSTEMS; ECONOMICS; GAME THEORY; MULTI AGENT SYSTEMS; ROBOTICS; TELECOMMUNICATION;

DISTRIBUTED CONTROL; MULTI-AGENT LEARNING;

REINFORCEMENT LEARNING;

EID: 34547192059 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICARCV.2006.345353 Document Type: Conference Paper

Times cited : (107)

References (37)

1
- 0004102479
- Cambridge, US: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, US: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 4544279348
- Computer Science Dept, Stanford University, California, US, Tech. Rep, 16 May
- Y. Shoham, R. Powers, and T. Grenager, "Multi-agent reinforcement learning: A critical survey," Computer Science Dept., Stanford University, California, US, Tech. Rep., 16 May 2003.
- (2003) Multi-agent reinforcement learning: A critical survey
- Shoham, Y.¹ Powers, R.² Grenager, T.³

3
- 28544446213
- Evolutionary game theory and multi-agent reinforcement learning
- K. Tuyls and A. Nowé, "Evolutionary game theory and multi-agent reinforcement learning," The Knowledge Engineering Review, vol. 20, no. 1, pp. 63-90, 2005.
- (2005) The Knowledge Engineering Review , vol.20 , Issue.1 , pp. 63-90
- Tuyls, K.¹ Nowé, A.²

4
- 34547200119
- G. Chalkiadakis, Multiagent reinforcement learning: Stochastic games with multiple learning players, Dept. of Computer Science, University of Toronto, Canada, Tech. Rep., 25 March 2003, URL: http://www.cs.toronto.edu/~gehalk/DepthReport/DepthReport.ps.
- G. Chalkiadakis, "Multiagent reinforcement learning: Stochastic games with multiple learning players," Dept. of Computer Science, University of Toronto, Canada, Tech. Rep., 25 March 2003, URL: http://www.cs.toronto.edu/~gehalk/DepthReport/DepthReport.ps.

5
- 22944447799
- Multiagent learning in the presence of agents with limitations,
- Ph.D. dissertation, Computer Science Dept, Carnegie Mellon University, Pittsburgh, US, May
- M. Bowling, "Multiagent learning in the presence of agents with limitations," Ph.D. dissertation, Computer Science Dept., Carnegie Mellon University, Pittsburgh, US, May 2003.
- (2003)
- Bowling, M.¹

6
- 26444601262
- Cooperative multi-agent learning: The state of the art
- November
- L. Panait and S. Luke, "Cooperative multi-agent learning: The state of the art," Autonomous Agents and Multi-Agent Systems, vol. 11, no. 3, pp. 387-434, November 2005.
- (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
- Panait, L.¹ Luke, S.²

7
- 34249833101
- Technical note: Q-learning
- C. J. C. H. Watkins and P. Dayan, "Technical note: Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

8
- 0036531878
- Multiagent learning using a variable learning rate
- M. Bowling and M. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
- (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

9
- 1942421183
- AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
- Washington, US, 21-24 August
- V. Conitzer and T. Sandholm, "AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents," in Proceedings Twentieth International Conference on Machine Learning (ICML-03), Washington, US, 21-24 August 2003, pp. 83-90.
- (2003) Proceedings Twentieth International Conference on Machine Learning (ICML-03) , pp. 83-90
- Conitzer, V.¹ Sandholm, T.²

10
- 84899027977
- Convergence and no-regret in multiagent learning
- Vancouver, Canada, 13-18 December
- M. Bowling, "Convergence and no-regret in multiagent learning," in Advances in Neural Information Processing Systems 17 (NIPS-04), Vancouver, Canada, 13-18 December 2004, pp. 209-216.
- (2004) Advances in Neural Information Processing Systems 17 (NIPS-04) , pp. 209-216
- Bowling, M.¹

11
- 0001547175
- Value-function reinforcement learning in Markov games
- M. L. Littman, "Value-function reinforcement learning in Markov games," Journal of Cognitive Systems Research, vol. 2, pp. 55-66, 2001.
- (2001) Journal of Cognitive Systems Research , vol.2 , pp. 55-66
- Littman, M.L.¹

12
- 84898936075
- New criteria and a new algorithm for learning in multi-agent systems
- Vancouver, Canada
- R. Powers and Y Shoham, "New criteria and a new algorithm for learning in multi-agent systems," in Advances in Neural Information Processing Systems 17 (NIPS-04), Vancouver, Canada, 2004, pp. 1089-1096.
- (2004) Advances in Neural Information Processing Systems 17 (NIPS-04) , pp. 1089-1096
- Powers, R.¹ Shoham, Y.²

13
- 4644369748
- Nash Q-learning for general-sum stochastic games
- J. Hu and M. P. Wellman, "Nash Q-learning for general-sum stochastic games," Journal of Machine Learning Research, vol. 4, pp. 1039-1069, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.P.²

14
- 1942517280
- Correlated-Q learning
- Washington, US, 21-24 August
- A. Greenwald and K. Hall, "Correlated-Q learning," in Proceedings Twentieth International Conference on Machine Learning (ICML-03), Washington, US, 21-24 August 2003, pp. 242-249.
- (2003) Proceedings Twentieth International Conference on Machine Learning (ICML-03) , pp. 242-249
- Greenwald, A.¹ Hall, K.²

15
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- US, 29 June, 2 July
- M. Lauer and M. Riedmiller, "An algorithm for distributed reinforcement learning in cooperative multi-agent systems," in Proceedings Seventeenth International Conference on Machine Learning (ICML-00), Stanford University, US, 29 June - 2 July 2000, pp. 535-542.
- (2000) Proceedings Seventeenth International Conference on Machine Learning (ICML-00), Stanford University , pp. 535-542
- Lauer, M.¹ Riedmiller, M.²

16
- 0002500351
- Planning, learning and coordination in multiagent decision processes
- De Zeeuwse Stromen, The Netherlands, 17-20 March
- C. Boutilier, "Planning, learning and coordination in multiagent decision processes," in Proceedings Sixth Conference on Theoretical Aspects of Rationality and Knowledge (TARK-96), De Zeeuwse Stromen, The Netherlands, 17-20 March 1996, pp. 195-210.
- (1996) Proceedings Sixth Conference on Theoretical Aspects of Rationality and Knowledge (TARK-96) , pp. 195-210
- Boutilier, C.¹

17
- 34547156217
- M. T. J. Spaan, N. Vlassis, and F. C. A. Groen, High level coordination of agents based on multiagent Markov decision processes with roles, in Workshop on Cooperative Robotics, 2002 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-02), Lausanne, Switzerland, 1 October 2002, pp. 66-73.
- M. T. J. Spaan, N. Vlassis, and F. C. A. Groen, "High level coordination of agents based on multiagent Markov decision processes with roles," in Workshop on Cooperative Robotics, 2002 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-02), Lausanne, Switzerland, 1 October 2002, pp. 66-73.

18
- 4544236179
- Coordinated reinforcement learning
- Sydney, Australia, 8-12 July
- C. Guestrin, M. G. Lagoudakis, and R. Parr, "Coordinated reinforcement learning," in Proceedings Nineteenth International Conference on Machine Learning (ICML-02), Sydney, Australia, 8-12 July 2002, pp. 227-234.
- (2002) Proceedings Nineteenth International Conference on Machine Learning (ICML-02) , pp. 227-234
- Guestrin, C.¹ Lagoudakis, M.G.² Parr, R.³

19
- 12244304892
- Non-communicative multi-robot coordination in dynamic environment
- J. R. Kok, M. T. J. Spaan, and N. Vlassis, "Non-communicative multi-robot coordination in dynamic environment," Robotics and Autonomous Systems, vol. 50, no. 2-3, pp. 99-114, 2005.
- (2005) Robotics and Autonomous Systems , vol.50 , Issue.2-3 , pp. 99-114
- Kok, J.R.¹ Spaan, M.T.J.² Vlassis, N.³

20
- 34547198492
- N. Vlassis, A concise introduction to multiagent systems and distributed AI, University of Amsterdam, The Netherlands, Tech. Rep., September 2003, URL: http://www.science.uva.nl/~vlassis/cimasdai/cimasdai.pdf.
- N. Vlassis, "A concise introduction to multiagent systems and distributed AI," University of Amsterdam, The Netherlands, Tech. Rep., September 2003, URL: http://www.science.uva.nl/~vlassis/cimasdai/cimasdai.pdf.

21
- 4544220380
- Hierarchical reinforcement learning in communication-mediated multiagent coordination
- New York, US, 19-23 August
- F. Fischer, M. Rovatsos, and G. Weiss, "Hierarchical reinforcement learning in communication-mediated multiagent coordination," in Proceedings 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-04), New York, US, 19-23 August 2004, pp. 1334-1335.
- (2004) Proceedings 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-04) , pp. 1334-1335
- Fischer, F.¹ Rovatsos, M.² Weiss, G.³

22
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Madison, US, 26-30 July
- C. Claus and C. Boutilier, "The dynamics of reinforcement learning in cooperative multiagent systems," in Proceedings 15th National Conference on Artificial Intelligence and 10th Conference on Innovative Applications of Artificial Intelligence (AAAI/IAAI-98), Madison, US, 26-30 July 1998, pp. 746-752.
- (1998) Proceedings 15th National Conference on Artificial Intelligence and 10th Conference on Innovative Applications of Artificial Intelligence (AAAI/IAAI-98) , pp. 746-752
- Claus, C.¹ Boutilier, C.²

23
- 0036932299
- Reinforcement learning of coordination in cooperative multi-agent systems
- Menlo Park, US, 28 July, 1 August
- S. Kapetanakis and D. Kudenko, "Reinforcement learning of coordination in cooperative multi-agent systems," in Proceedings 18th National Conference on Artificial Intelligence and 14th Conference on Innovative Applications of Artificial Intelligence (AAAI/IAAl-02), Menlo Park, US, 28 July - 1 August 2002, pp. 326-331.
- (2002) Proceedings 18th National Conference on Artificial Intelligence and 14th Conference on Innovative Applications of Artificial Intelligence (AAAI/IAAl-02) , pp. 326-331
- Kapetanakis, S.¹ Kudenko, D.²

24
- 67649405225
- Reinforcement learning to play an optimal Nash equilibrium in team Markov games
- Vancouver, Canada, 9-14 December
- X. Wang and T. Sandholm, "Reinforcement learning to play an optimal Nash equilibrium in team Markov games," in Advances in Neural. Information Processing Systems 15 (NIPS-02), Vancouver, Canada, 9-14 December 2002, pp. 1571-1578.
- (2002) Advances in Neural. Information Processing Systems 15 (NIPS-02) , pp. 1571-1578
- Wang, X.¹ Sandholm, T.²

25
- 2642545776
- Opponent modeling in multi-agent systems
- G. Weiß and S. Sen, Eds. Springer Verlag
- D. Carmel and S. Markovitch, "Opponent modeling in multi-agent systems," in Adaptation and Learning in Multi-Agent Systems, G. Weiß and S. Sen, Eds. Springer Verlag, 1996, pp. 40-52.
- (1996) Adaptation and Learning in Multi-Agent Systems , pp. 40-52
- Carmel, D.¹ Markovitch, S.²

26
- 84898941549
- Extending Q-leaming to general adaptive multi-agent systems
- Vancouver and Whistler, Canada, 8-13 December
- G. Tesauro, "Extending Q-leaming to general adaptive multi-agent systems," in Advances in Neural Information Processing Systems 16 (NIPS-03), Vancouver and Whistler, Canada, 8-13 December 2003.
- (2003) Advances in Neural Information Processing Systems 16 (NIPS-03)
- Tesauro, G.¹

27
- 0001644761
- Nash convergence of gradient dynamics in general-sum games
- San Francisco, US, 30 June, 3 July
- S. Singh, M. Kearns, and Y. Mansour, "Nash convergence of gradient dynamics in general-sum games," in Proceedings 16th Conference on Uncertainty in Artificial Intelligence (UAI-00), San Francisco, US, 30 June - 3 July 2000, pp. 541-548.
- (2000) Proceedings 16th Conference on Uncertainty in Artificial Intelligence (UAI-00) , pp. 541-548
- Singh, S.¹ Kearns, M.² Mansour, Y.³

28
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- Washington, US, 21-24 August
- M. Zinkevich, "Online convex programming and generalized infinitesimal gradient ascent," in Proceedings Twentieth International Conference on Machine Learning (ICML-03), Washington, US, 21-24 August 2003, pp. 928-936.
- (2003) Proceedings Twentieth International Conference on Machine Learning (ICML-03) , pp. 928-936
- Zinkevich, M.¹

29
- 0028555752
- Learning to coordinate without sharing information
- Seattle, US, 31 July, 4 August
- S. Sen, M. Sekaran, and J. Hale, "Learning to coordinate without sharing information," in Proceedings 12th National Conference on Artificial Intelligence (AAAI-94), Seattle, US, 31 July - 4 August 1994, pp. 426431.
- (1994) Proceedings 12th National Conference on Artificial Intelligence (AAAI-94) , pp. 426431
- Sen, S.¹ Sekaran, M.² Hale, J.³

30
- 84949949419
- Learning in multi-robot systems
- G. Weiß and S. Sen, Eds. Springer Verlag
- M. J. Mataric̀, "Learning in multi-robot systems," in Adaptation and Learning in Multi-Agent Systems, G. Weiß and S. Sen, Eds. Springer Verlag, 1996, pp. 152-163.
- (1996) Adaptation and Learning in Multi-Agent Systems , pp. 152-163
- Mataric̀, M.J.¹

31
- 85156187730
- Improving elevator performance using reinforcement learning
- R. H. Crites and A. G. Barto, "Improving elevator performance using reinforcement learning," in Advances in Neural Information Processing Systems, vol. 8, 1996, pp. 1017-1023.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.²

32
- 78649701299
- Asymmetric multiagent reinforcement learning
- Halifax, Canada, 13-17 October
- V. Könönen, "Asymmetric multiagent reinforcement learning," in Proceedings IEEEAVTC International Conference on Intelligent Agent Technology (IAT-03), Halifax, Canada, 13-17 October 2003, pp. 336-342.
- (2003) Proceedings IEEEAVTC International Conference on Intelligent Agent Technology (IAT-03) , pp. 336-342
- Könönen, V.¹

33
- 4544231144
- Best-response multiagent learning in non-stationary environments
- New York, US, 19-23 August
- M. Weinberg and J. S. Rosenschein, "Best-response multiagent learning in non-stationary environments," in Proceedings 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-04), New York, US, 19-23 August 2004, pp. 506-513.
- (2004) Proceedings 3rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-04) , pp. 506-513
- Weinberg, M.¹ Rosenschein, J.S.²

34
- 1142280919
- Adaptive policy gradient in multiagent learning
- Melbourne, Australia, 14-18 July
- B. Banerjee and J. Peng, "Adaptive policy gradient in multiagent learning," in Proceedings 2nd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-03), Melbourne, Australia, 14-18 July 2003, pp. 686-692.
- (2003) Proceedings 2nd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-03) , pp. 686-692
- Banerjee, B.¹ Peng, J.²

35
- 0036355732
- A multiagent reinforcement learning algorithm using extended optimal response
- Bologna, Italy, 15-19 July
- N. Suematsu and A. Hayashi, "A multiagent reinforcement learning algorithm using extended optimal response," in Proceedings 1st International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-02), Bologna, Italy, 15-19 July 2002, pp. 370-377.
- (2002) Proceedings 1st International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS-02) , pp. 370-377
- Suematsu, N.¹ Hayashi, A.²

36
- 0342683320
- A general method for multi-agent reinforcement learning in unrestricted environments
- Stanford University, US, March 25-27
- J. Schmidhuber, "A general method for multi-agent reinforcement learning in unrestricted environments," in Working Notes AAAI Symposium on Adaptation, Co-evolution and Learning in Multiagent Systems, Stanford University, US, March 25-27 1996, pp. 84-87.
- (1996) Working Notes AAAI Symposium on Adaptation, Co-evolution and Learning in Multiagent Systems , pp. 84-87
- Schmidhuber, J.¹

37
- 23144455713
- Learning in multiagent systems: An introduction from a game-theoretic perspective
- Springer Verlag, August
- J. M. Vidal, "Learning in multiagent systems: An introduction from a game-theoretic perspective," in Adaptive Agents: Lecture Notes in Artificial Intelligence. Springer Verlag, August 2003, vol. 2636, pp. 202-215.
- (2003) Adaptive Agents: Lecture Notes in Artificial Intelligence , vol.2636 , pp. 202-215
- Vidal, J.M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.