SCOPUS 정보 검색 플랫폼

Frontiers in Artificial Intelligence and Applications

Volumn 215, Issue , 2010, Pages 367-372

The dynamics of multi-agent reinforcement learning

(3) Dickens, Luke a Broda, Krysia a Russo, Alessandra a

a IMPERIAL COLLEGE LONDON (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

EVOLUTIONARY ALGORITHMS; GAME THEORY; LEARNING ALGORITHMS; MULTI AGENT SYSTEMS; PROCESS CONTROL; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

CONTROL PROCESS; INFINITE HORIZONS; MULTI AGENT; MULTI-AGENT REINFORCEMENT LEARNING; MULTIAGENT CONTROL; NASH EQUILIBRIA; NON DETERMINISM; PARTIAL STATE; PRESSURE-FIELD; REINFORCEMENT LEARNING ALGORITHMS;

RANDOM PROCESSES;

EID: 77956017208 PISSN: 09226389 EISSN: 18798314 Source Type: Book Series
DOI: 10.3233/978-1-60750-606-5-367 Document Type: Conference Paper

Times cited : (5)

References (18)

1
- 0013535965
- Infinite-horizon policy-gradient estimation
- J. Baxter and P. Bartlett, 'Infinite-Horizon Policy-Gradient Estimation', Journal of Artificial Intelligence Research, 15, (2001).
- (2001) Journal of Artificial Intelligence Research , vol.15
- Baxter, J.¹ Bartlett, P.²

2
- 0141965747
- The complexity of decentralized control of markov decision processes
- D. Bernstein, S. Zilberstein, and N. Immerman, 'The complexity of decentralized control of markov decision processes', in UAI, (2000).
- (2000) UAI
- Bernstein, D.¹ Zilberstein, S.² Immerman, N.³

3
- 85162049326
- Incremental natural actor-critic algorithms
- S. Bhatnagar, R. Sutton, M. Ghavamzadeh, and M. Lee, 'Incremental Natural Actor-Critic Algorithms', in NIPS, (2008).
- (2008) NIPS
- Bhatnagar, S.¹ Sutton, R.² Ghavamzadeh, M.³ Lee, M.⁴

4
- 84880818689
- Simultaneous adversarial multi-robot learning
- M. Bowling and M. Veloso, 'Simultaneous adversarial multi-robot learning', in IJCAI, (2003).
- (2003) IJCAI
- Bowling, M.¹ Veloso, M.²

5
- 27344450680
- Existence of multiagent equilibria with limited agents
- M. Bowling and M. Veloso, 'Existence of Multiagent Equilibria with Limited Agents', Journal of Artificial Intelligence Research, 22, (2004).
- (2004) Journal of Artificial Intelligence Research , vol.22
- Bowling, M.¹ Veloso, M.²

6
- 0036531878
- Multiagent learning using a variable learning rate
- Michael Bowling and Manuela Veloso, 'Multiagent learning using a variable learning rate', Artificial Intelligence, 136, 215-250, (2002).
- (2002) Artificial Intelligence , vol.136 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

7
- 34548339146
- POMDP solution methods: A survey
- University of Toronto
- D. Braziunas, 'POMDP solution methods: a survey', Technical report, Department of Computer Science, University of Toronto, (2003).
- (2003) Technical Report, Department of Computer Science
- Braziunas, D.¹

8
- 77956030012
- Ph.D. dissertation, Imperial College (University of London), December
- L. Dickens, Learning to Act Stochastically, Ph.D. dissertation, Imperial College (University of London), December 2009.
- (2009) Learning to Act Stochastically
- Dickens, L.¹

9
- 0004247096
- MIT Press
- D. Fudenberg and D. K. Levine, The Theory of Learning in Games, MIT Press, 1998.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

10
- 0029679044
- Reinforcement learning: A survey
- L. Kaelbling, M. Littman, and A. Moore, 'Reinforcement learning: A survey', Journal of Artificial Intelligence Research, 4, (1996).
- (1996) Journal of Artificial Intelligence Research , vol.4
- Kaelbling, L.¹ Littman, M.² Moore, A.³

11
- 4043069840
- On actor-critic algorithms
- V. R. Konda and J. N. Tsitsiklis, 'On actor-critic algorithms', SIAM J. Control Optim., 42(4), (2003).
- (2003) SIAM J. Control Optim. , vol.42 , Issue.4
- Konda, V.R.¹ Tsitsiklis, J.N.²

12
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- M. Littman, 'Markov games as a framework for multi-agent reinforcement learning', in ICML, (1994).
- (1994) ICML
- Littman, M.¹

13
- 0004223233
- Ph.D. dissertation, Princeton University
- John Nash, Non-Cooperative Games, Ph.D. dissertation, Princeton University, 1950.
- (1950) Non-Cooperative Games
- Nash, J.¹

14
- 40649106649
- Natural actor-critic
- J. Peters and S. Schaal, 'Natural Actor-Critic', Neurocomputing, 71(7- 9), (2008).
- (2008) Neurocomputing , vol.71 , Issue.7-9
- Peters, J.¹ Schaal, S.²

15
- 2142812536
- Learning without state- estimation in partially observable markovian decision processes
- S. Singh, T. Jaakkola, and M. Jordan, 'Learning without State- Estimation in Partially Observable Markovian Decision Processes', in ICML, (1994).
- (1994) ICML
- Singh, S.¹ Jaakkola, T.² Jordan, M.³

16
- 0004102479
- MIT Press, Cambridge, MA
- R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

17
- 31344450384
- An evolutionary dynamical analysis of multi-agent learning in iterated games
- K. Tuyls, P. J. Hoen, and B. Vanschoenwinkel, 'An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games', Autonomous Agents and Multi-Agent Systems, 12(1), 115-153, (2006).
- (2006) Autonomous Agents and Multi-Agent Systems , vol.12 , Issue.1 , pp. 115-153
- Tuyls, K.¹ Hoen, P.J.² Vanschoenwinkel, B.³

18
- 57749114040
- Cyclic equilibria in markov games
- M. Zinkevich, A. Greenwald, and M. Littman, 'Cyclic Equilibria in Markov Games', in NIPS, (2005).
- (2005) NIPS
- Zinkevich, M.¹ Greenwald, A.² Littman, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.