메뉴 건너뛰기




Volumn 5863 LNCS, Issue PART 1, 2009, Pages 570-579

Learning cooperative behaviours in multiagent reinforcement learning

Author keywords

Context dependent multiagent SARSA; Learning cooperative behaviours; Multiagent reinforcement learning

Indexed keywords

2-D SPACE; CONTEXT DEPENDENT; EMPIRICAL RESULTS; GRID SIZE; LOGICAL SOLUTION; MULTI-AGENT; MULTI-AGENT REINFORCEMENT LEARNING; PARTIALLY OBSERVABLE ENVIRONMENTS; PROBLEM FORMULATION; SARSA ALGORITHM; SARSA LEARNING; SINGLE-AGENT; STATE SPACE;

EID: 76649120218     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-10677-4_65     Document Type: Conference Paper
Times cited : (5)

References (18)
  • 2
    • 0003863106 scopus 로고    scopus 로고
    • An analysis of stochastic game theory for multiagent reinforcement learning
    • Technical report
    • Bowling, M., Velosa, M.: An analysis of stochastic game theory for multiagent reinforcement learning. Technical report, Carnegie Mellon University (2000), http://www.cs.ualberta.ca/~bowling/papers/00tr.pdf
    • (2000)
    • Bowling, M.1    Velosa, M.2
  • 4
    • 0030674885 scopus 로고    scopus 로고
    • Cooperative mobile robotics: Antecedents and directions
    • Cao, Y.U., Fukunaga, A.S., Kahng, A.B.: Cooperative mobile robotics: Antecedents and directions. Autonomous Robotics 4, 1-23 (1997)
    • (1997) Autonomous Robotics , vol.4 , pp. 1-23
    • Cao, Y.U.1    Fukunaga, A.S.2    Kahng, A.B.3
  • 5
    • 58049168069 scopus 로고    scopus 로고
    • Optimal formation reconfiguration control of multiple UCAVs using improved particle swarm optimisation
    • Duan, H.B., Ma, G.J., Luo, D.L.: Optimal formation reconfiguration control of multiple UCAVs using improved particle swarm optimisation. Bionic Engineering 5(4), 340-347 (2009)
    • (2009) Bionic Engineering , vol.5 , Issue.4 , pp. 340-347
    • Duan, H.B.1    Ma, G.J.2    Luo, D.L.3
  • 6
  • 7
    • 0001547175 scopus 로고    scopus 로고
    • Value-function reinforcement learning in markov games
    • Littman, M.L.: Value-function reinforcement learning in markov games. Journal of Cognitive Systems Research 2, 67-79 (2001)
    • (2001) Journal of Cognitive Systems Research , vol.2 , pp. 67-79
    • Littman, M.L.1
  • 8
    • 0030647149 scopus 로고    scopus 로고
    • Reinforcement learning in multi-robot domain
    • Matarić, M.J.: Reinforcement learning in multi-robot domain. Autonomous Robots 4, 73-83 (1997)
    • (1997) Autonomous Robots , vol.4 , pp. 73-83
    • Matarić, M.J.1
  • 9
    • 0002797521 scopus 로고    scopus 로고
    • Learning in behaviour-based multi-robot systems: Policies, models, and other agents
    • Matarić, M.J.: Learning in behaviour-based multi-robot systems: policies, models, and other agents. Journal of Cognitive Systems Research 2, 81-93 (2001)
    • (2001) Journal of Cognitive Systems Research , vol.2 , pp. 81-93
    • Matarić, M.J.1
  • 12
    • 26444601262 scopus 로고    scopus 로고
    • Cooperative multi-agent learning: The state of the art
    • Panait, L., Luke, S.: Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems 11(3), 387-434 (2005)
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 14
    • 4544279348 scopus 로고    scopus 로고
    • Multiagent reinforcement learning: A critical survey
    • Technical report, Standford University
    • Shoham, Y., Powers, R.: Multiagent reinforcement learning: A critical survey. Technical report, Standford University (2003), http://multiagent. stanford.edu/papers/MALearning-ACriticalSurvey-2003-0516.pdf
    • (2003)
    • Shoham, Y.1    Powers, R.2
  • 18
    • 76649143996 scopus 로고    scopus 로고
    • Yang, E.F., Gu, D.B.: Multiagent reinforcement learning for multi-robot systems: A survey. Technical report, The University of Essex (2004), http://cswww.essex.ac.uk/technical-report/2004/cs
    • Yang, E.F., Gu, D.B.: Multiagent reinforcement learning for multi-robot systems: A survey. Technical report, The University of Essex (2004), http://cswww.essex.ac.uk/technical-report/2004/cs


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.