메뉴 건너뛰기




Volumn 215, Issue , 2010, Pages 367-372

The dynamics of multi-agent reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

EVOLUTIONARY ALGORITHMS; GAME THEORY; LEARNING ALGORITHMS; MULTI AGENT SYSTEMS; PROCESS CONTROL; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

EID: 77956017208     PISSN: 09226389     EISSN: 18798314     Source Type: Book Series    
DOI: 10.3233/978-1-60750-606-5-367     Document Type: Conference Paper
Times cited : (5)

References (18)
  • 2
    • 0141965747 scopus 로고    scopus 로고
    • The complexity of decentralized control of markov decision processes
    • D. Bernstein, S. Zilberstein, and N. Immerman, 'The complexity of decentralized control of markov decision processes', in UAI, (2000).
    • (2000) UAI
    • Bernstein, D.1    Zilberstein, S.2    Immerman, N.3
  • 4
    • 84880818689 scopus 로고    scopus 로고
    • Simultaneous adversarial multi-robot learning
    • M. Bowling and M. Veloso, 'Simultaneous adversarial multi-robot learning', in IJCAI, (2003).
    • (2003) IJCAI
    • Bowling, M.1    Veloso, M.2
  • 6
    • 0036531878 scopus 로고    scopus 로고
    • Multiagent learning using a variable learning rate
    • Michael Bowling and Manuela Veloso, 'Multiagent learning using a variable learning rate', Artificial Intelligence, 136, 215-250, (2002).
    • (2002) Artificial Intelligence , vol.136 , pp. 215-250
    • Bowling, M.1    Veloso, M.2
  • 8
    • 77956030012 scopus 로고    scopus 로고
    • Ph.D. dissertation, Imperial College (University of London), December
    • L. Dickens, Learning to Act Stochastically, Ph.D. dissertation, Imperial College (University of London), December 2009.
    • (2009) Learning to Act Stochastically
    • Dickens, L.1
  • 12
    • 85149834820 scopus 로고
    • Markov games as a framework for multi-agent reinforcement learning
    • M. Littman, 'Markov games as a framework for multi-agent reinforcement learning', in ICML, (1994).
    • (1994) ICML
    • Littman, M.1
  • 13
    • 0004223233 scopus 로고
    • Ph.D. dissertation, Princeton University
    • John Nash, Non-Cooperative Games, Ph.D. dissertation, Princeton University, 1950.
    • (1950) Non-Cooperative Games
    • Nash, J.1
  • 14
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor-critic
    • J. Peters and S. Schaal, 'Natural Actor-Critic', Neurocomputing, 71(7- 9), (2008).
    • (2008) Neurocomputing , vol.71 , Issue.7-9
    • Peters, J.1    Schaal, S.2
  • 15
    • 2142812536 scopus 로고
    • Learning without state- estimation in partially observable markovian decision processes
    • S. Singh, T. Jaakkola, and M. Jordan, 'Learning without State- Estimation in Partially Observable Markovian Decision Processes', in ICML, (1994).
    • (1994) ICML
    • Singh, S.1    Jaakkola, T.2    Jordan, M.3
  • 17
    • 31344450384 scopus 로고    scopus 로고
    • An evolutionary dynamical analysis of multi-agent learning in iterated games
    • K. Tuyls, P. J. Hoen, and B. Vanschoenwinkel, 'An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games', Autonomous Agents and Multi-Agent Systems, 12(1), 115-153, (2006).
    • (2006) Autonomous Agents and Multi-Agent Systems , vol.12 , Issue.1 , pp. 115-153
    • Tuyls, K.1    Hoen, P.J.2    Vanschoenwinkel, B.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.