메뉴 건너뛰기




Volumn , Issue , 2006, Pages

Multi-agent reinforcement learning: A survey

Author keywords

Distributed control; Game theory; Multi agent systems; Reinforcement learning

Indexed keywords

ALGORITHMS; BEHAVIORAL RESEARCH; DISTRIBUTED PARAMETER CONTROL SYSTEMS; ECONOMICS; GAME THEORY; MULTI AGENT SYSTEMS; ROBOTICS; TELECOMMUNICATION;

EID: 34547192059     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICARCV.2006.345353     Document Type: Conference Paper
Times cited : (103)

References (37)
  • 3
    • 28544446213 scopus 로고    scopus 로고
    • Evolutionary game theory and multi-agent reinforcement learning
    • K. Tuyls and A. Nowé, "Evolutionary game theory and multi-agent reinforcement learning," The Knowledge Engineering Review, vol. 20, no. 1, pp. 63-90, 2005.
    • (2005) The Knowledge Engineering Review , vol.20 , Issue.1 , pp. 63-90
    • Tuyls, K.1    Nowé, A.2
  • 4
    • 34547200119 scopus 로고    scopus 로고
    • G. Chalkiadakis, Multiagent reinforcement learning: Stochastic games with multiple learning players, Dept. of Computer Science, University of Toronto, Canada, Tech. Rep., 25 March 2003, URL: http://www.cs.toronto.edu/~gehalk/DepthReport/DepthReport.ps.
    • G. Chalkiadakis, "Multiagent reinforcement learning: Stochastic games with multiple learning players," Dept. of Computer Science, University of Toronto, Canada, Tech. Rep., 25 March 2003, URL: http://www.cs.toronto.edu/~gehalk/DepthReport/DepthReport.ps.
  • 5
    • 22944447799 scopus 로고    scopus 로고
    • Multiagent learning in the presence of agents with limitations,
    • Ph.D. dissertation, Computer Science Dept, Carnegie Mellon University, Pittsburgh, US, May
    • M. Bowling, "Multiagent learning in the presence of agents with limitations," Ph.D. dissertation, Computer Science Dept., Carnegie Mellon University, Pittsburgh, US, May 2003.
    • (2003)
    • Bowling, M.1
  • 6
    • 26444601262 scopus 로고    scopus 로고
    • Cooperative multi-agent learning: The state of the art
    • November
    • L. Panait and S. Luke, "Cooperative multi-agent learning: The state of the art," Autonomous Agents and Multi-Agent Systems, vol. 11, no. 3, pp. 387-434, November 2005.
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 7
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • C. J. C. H. Watkins and P. Dayan, "Technical note: Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
    • (1992) Machine Learning , vol.8 , pp. 279-292
    • Watkins, C.J.C.H.1    Dayan, P.2
  • 8
    • 0036531878 scopus 로고    scopus 로고
    • Multiagent learning using a variable learning rate
    • M. Bowling and M. Veloso, "Multiagent learning using a variable learning rate," Artificial Intelligence, vol. 136, no. 2, pp. 215-250, 2002.
    • (2002) Artificial Intelligence , vol.136 , Issue.2 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 9
    • 1942421183 scopus 로고    scopus 로고
    • AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
    • Washington, US, 21-24 August
    • V. Conitzer and T. Sandholm, "AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents," in Proceedings Twentieth International Conference on Machine Learning (ICML-03), Washington, US, 21-24 August 2003, pp. 83-90.
    • (2003) Proceedings Twentieth International Conference on Machine Learning (ICML-03) , pp. 83-90
    • Conitzer, V.1    Sandholm, T.2
  • 10
    • 84899027977 scopus 로고    scopus 로고
    • Convergence and no-regret in multiagent learning
    • Vancouver, Canada, 13-18 December
    • M. Bowling, "Convergence and no-regret in multiagent learning," in Advances in Neural Information Processing Systems 17 (NIPS-04), Vancouver, Canada, 13-18 December 2004, pp. 209-216.
    • (2004) Advances in Neural Information Processing Systems 17 (NIPS-04) , pp. 209-216
    • Bowling, M.1
  • 11
    • 0001547175 scopus 로고    scopus 로고
    • Value-function reinforcement learning in Markov games
    • M. L. Littman, "Value-function reinforcement learning in Markov games," Journal of Cognitive Systems Research, vol. 2, pp. 55-66, 2001.
    • (2001) Journal of Cognitive Systems Research , vol.2 , pp. 55-66
    • Littman, M.L.1
  • 13
    • 4644369748 scopus 로고    scopus 로고
    • Nash Q-learning for general-sum stochastic games
    • J. Hu and M. P. Wellman, "Nash Q-learning for general-sum stochastic games," Journal of Machine Learning Research, vol. 4, pp. 1039-1069, 2003.
    • (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
    • Hu, J.1    Wellman, M.P.2
  • 17
    • 34547156217 scopus 로고    scopus 로고
    • M. T. J. Spaan, N. Vlassis, and F. C. A. Groen, High level coordination of agents based on multiagent Markov decision processes with roles, in Workshop on Cooperative Robotics, 2002 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-02), Lausanne, Switzerland, 1 October 2002, pp. 66-73.
    • M. T. J. Spaan, N. Vlassis, and F. C. A. Groen, "High level coordination of agents based on multiagent Markov decision processes with roles," in Workshop on Cooperative Robotics, 2002 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-02), Lausanne, Switzerland, 1 October 2002, pp. 66-73.
  • 19
    • 12244304892 scopus 로고    scopus 로고
    • Non-communicative multi-robot coordination in dynamic environment
    • J. R. Kok, M. T. J. Spaan, and N. Vlassis, "Non-communicative multi-robot coordination in dynamic environment," Robotics and Autonomous Systems, vol. 50, no. 2-3, pp. 99-114, 2005.
    • (2005) Robotics and Autonomous Systems , vol.50 , Issue.2-3 , pp. 99-114
    • Kok, J.R.1    Spaan, M.T.J.2    Vlassis, N.3
  • 20
    • 34547198492 scopus 로고    scopus 로고
    • N. Vlassis, A concise introduction to multiagent systems and distributed AI, University of Amsterdam, The Netherlands, Tech. Rep., September 2003, URL: http://www.science.uva.nl/~vlassis/cimasdai/cimasdai.pdf.
    • N. Vlassis, "A concise introduction to multiagent systems and distributed AI," University of Amsterdam, The Netherlands, Tech. Rep., September 2003, URL: http://www.science.uva.nl/~vlassis/cimasdai/cimasdai.pdf.
  • 24
    • 67649405225 scopus 로고    scopus 로고
    • Reinforcement learning to play an optimal Nash equilibrium in team Markov games
    • Vancouver, Canada, 9-14 December
    • X. Wang and T. Sandholm, "Reinforcement learning to play an optimal Nash equilibrium in team Markov games," in Advances in Neural. Information Processing Systems 15 (NIPS-02), Vancouver, Canada, 9-14 December 2002, pp. 1571-1578.
    • (2002) Advances in Neural. Information Processing Systems 15 (NIPS-02) , pp. 1571-1578
    • Wang, X.1    Sandholm, T.2
  • 25
    • 2642545776 scopus 로고    scopus 로고
    • Opponent modeling in multi-agent systems
    • G. Weiß and S. Sen, Eds. Springer Verlag
    • D. Carmel and S. Markovitch, "Opponent modeling in multi-agent systems," in Adaptation and Learning in Multi-Agent Systems, G. Weiß and S. Sen, Eds. Springer Verlag, 1996, pp. 40-52.
    • (1996) Adaptation and Learning in Multi-Agent Systems , pp. 40-52
    • Carmel, D.1    Markovitch, S.2
  • 26
    • 84898941549 scopus 로고    scopus 로고
    • Extending Q-leaming to general adaptive multi-agent systems
    • Vancouver and Whistler, Canada, 8-13 December
    • G. Tesauro, "Extending Q-leaming to general adaptive multi-agent systems," in Advances in Neural Information Processing Systems 16 (NIPS-03), Vancouver and Whistler, Canada, 8-13 December 2003.
    • (2003) Advances in Neural Information Processing Systems 16 (NIPS-03)
    • Tesauro, G.1
  • 30
    • 84949949419 scopus 로고    scopus 로고
    • Learning in multi-robot systems
    • G. Weiß and S. Sen, Eds. Springer Verlag
    • M. J. Mataric̀, "Learning in multi-robot systems," in Adaptation and Learning in Multi-Agent Systems, G. Weiß and S. Sen, Eds. Springer Verlag, 1996, pp. 152-163.
    • (1996) Adaptation and Learning in Multi-Agent Systems , pp. 152-163
    • Mataric̀, M.J.1
  • 37
    • 23144455713 scopus 로고    scopus 로고
    • Learning in multiagent systems: An introduction from a game-theoretic perspective
    • Springer Verlag, August
    • J. M. Vidal, "Learning in multiagent systems: An introduction from a game-theoretic perspective," in Adaptive Agents: Lecture Notes in Artificial Intelligence. Springer Verlag, August 2003, vol. 2636, pp. 202-215.
    • (2003) Adaptive Agents: Lecture Notes in Artificial Intelligence , vol.2636 , pp. 202-215
    • Vidal, J.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.