SCOPUS Information Search Platform

Volume 12, Issue 4, 2017, Pages

Multiagent cooperation and competition with deep reinforcement learning

(8) Tampuu, Ardi a; Matiisen, Tambet a; Kodelja, Dorian a,c; Kuzovkin, Ilya a; Korjus, Kristjan a; Aru, Juhan b; Aru, Jaan a; Vicente, Raul a

a UNIVERSITY OF TARTU (Estonia)

b ETH ZURICH (Switzerland)

c DIF (France)

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIOR; COMPETITION; HUMAN; REINFORCEMENT; VIDEO GAME; ALGORITHM; COOPERATION; GAME; HUMAN RELATION; LEARNING; PHYSIOLOGY; REWARD; SOCIAL BEHAVIOR;

ALGORITHMS; COOPERATIVE BEHAVIOR; GAME THEORY; HUMANS; INTERPERSONAL RELATIONS; LEARNING; REINFORCEMENT (PSYCHOLOGY); REWARD; SOCIAL BEHAVIOR;

EID: 85017018413 PISSN: None EISSN: 19326203 Source Type: Journal
DOI: 10.1371/journal.pone.0172395 Document Type: Article

Times cited : (807)

References (30)

1
- 0004102479
- MIT press Cambridge
- Sutton RS, Barto AG. Reinforcement learning: An introduction. MIT press Cambridge; 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 84924485966
- Cambridge University Press
- Poole DL, Mackworth AK. Artificial Intelligence: foundations of computational agents. Cambridge University Press; 2010.
- (2010) Artificial Intelligence: Foundations of Computational Agents
- Poole, D.L.¹ Mackworth, A.K.²

3
- 40949147745
- A comprehensive survey of multiagent reinforcement learning
- Busoniu L, Babuska R, De Schutter B. A comprehensive survey of multiagent reinforcement learning. Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on. 2008;38(2):156-172. https://doi.org/10.1109/TSMCC.2007.913919
- (2008) Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

4
- 84884097753
- Princeton University Press
- Sumpter DJ. Collective animal behavior. Princeton University Press; 2010.
- (2010) Collective Animal Behavior
- Sumpter, D.J.¹

5
- 84924356033
- John Wiley & Sons
- Schwartz HM. Multi-Agent Machine Learning: A Reinforcement Approach. John Wiley & Sons; 2014.
- (2014) Multi-agent Machine Learning: A Reinforcement Approach
- Schwartz, H.M.¹

6
- 84855819164
- Finite-time stability of multi-agent system in disturbed environment
- Wang L, Sun S, Xia C. Finite-time stability of multi-agent system in disturbed environment. Nonlinear Dynamics. 2012;67(3):2009-2016. https://doi.org/10.1007/s11071-011-0125-0
- (2012) Nonlinear Dynamics , vol.67 , Issue.3 , pp. 2009-2016
- Wang, L.¹ Sun, S.² Xia, C.³

7
- 84924051598
- Human-level control through deep reinforcement learning
- 25719670
- Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, et al. Human-level control through deep reinforcement learning. Nature. 2015;518(7540):529-533. https://doi.org/10.1038/nature14236 PMID: 25719670
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶

8
- 84883060087
- Evolving large-scale neural networks for vision-based reinforcement learning
- ACM
- Koutnik J, Cuccu G, Schmidhuber J, Gomez F. Evolving large-scale neural networks for vision-based reinforcement learning. In: Proceedings of the 15th annual conference on Genetic and evolutionary computation. ACM; 2013. p. 1061-1068.
- (2013) Proceedings of the 15th Annual Conference on Genetic and Evolutionary Computation , pp. 1061-1068
- Koutnik, J.¹ Cuccu, G.² Schmidhuber, J.³ Gomez, F.⁴

9
- 84904867557
- arXiv preprint arXiv:1312.5602
- Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, et al. Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602. 2013;.
- (2013) Playing Atari with Deep Reinforcement Learning
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Graves, A.⁴ Antonoglou, I.⁵ Wierstra, D.⁶

10
- 0003673017
- DTIC Document
- Lin LJ. Reinforcement learning for robots using neural networks. DTIC Document; 1993.
- (1993) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.J.¹

11
- 84980041049
- arXiv preprint arXiv:1511.05952
- Schaul T, Quan J, Antonoglou I, Silver D. Prioritized Experience Replay. arXiv preprint arXiv:1511.05952. 2015;.
- (2015) Prioritized Experience Replay
- Schaul, T.¹ Quan, J.² Antonoglou, I.³ Silver, D.⁴

12
- 85152198941
- Multi-agent reinforcement learning: Independent vs. Cooperative agents
- Tan M. Multi-agent reinforcement learning: Independent vs. cooperative agents. In: Proceedings of the tenth international conference on machine learning; 1993. p. 330-337.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 330-337
- Tan, M.¹

13
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus C, Boutilier C. The dynamics of reinforcement learning in cooperative multiagent systems. In: AAAI/IAAI; 1998. p. 746-752.
- (1998) AAAI/IAAI , pp. 746-752
- Claus, C.¹ Boutilier, C.²

14
- 0004049893
- University of Cambridge England
- Watkins CJCH. Learning from delayed rewards. University of Cambridge England; 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

15
- 34249833101
- Q-learning
- Watkins CJ, Dayan P. Q-learning. Machine learning. 1992;8(3-4):279-292. https://doi.org/10.1023/A:1022676722315
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.¹ Dayan, P.²

16
- 84863970766
- Mott B, Anthony S. Stella: a multiplatform Atari 2600 VCS emulator; 2003.
- (2003) Stella: A Multiplatform Atari 2600 VCS Emulator
- Mott, B.¹ Anthony, S.²

17
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro G. Temporal difference learning and TD-Gammon. Communications of the ACM. 1995;38(3):58-68. https://doi.org/10.1145/203330.203343
- (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

18
- 84891544852
- Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play
- IEEE
- van der Ree M, Wiering M. Reinforcement learning in the game of Othello: learning against a fixed opponent and learning from self-play. In: Adaptive Dynamic Programming And Reinforcement Learning (ADPRL), 2013 IEEE Symposium on. IEEE; 2013. p. 108-115.
- (2013) Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2013 IEEE Symposium on , pp. 108-115
- Van Der Ree, M.¹ Wiering, M.²

19
- 84963949906
- Mastering the game of Go with deep neural networks and tree search
- 26819042
- Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, et al. Mastering the game of Go with deep neural networks and tree search. Nature. 2016;529(7587):484-489. https://doi.org/10.1038/nature16961 PMID: 26819042
- (2016) Nature , vol.529 , Issue.7587 , pp. 484-489
- Silver, D.¹ Huang, A.² Maddison, C.J.³ Guez, A.⁴ Sifre, L.⁵ Van Den Driessche, G.⁶

20
- 85161998941
- Double Q-learning
- Hasselt HV. Double Q-learning. In: Advances in Neural Information Processing Systems; 2010. p. 2613-2621.
- (2010) Advances in Neural Information Processing Systems , pp. 2613-2621
- Hasselt, H.V.¹

21
- 84962006941
- arXiv preprint arXiv:1412.6806
- Springenberg JT, Dosovitskiy A, Brox T, Riedmiller M. Striving for simplicity: The all convolutional net. arXiv preprint arXiv:1412.6806. 2014;.
- (2014) Striving for Simplicity: The All Convolutional Net
- Springenberg, J.T.¹ Dosovitskiy, A.² Brox, T.³ Riedmiller, M.⁴

22
- 84975760699
- Using goal-driven deep learning models to understand sensory cortex
- 26906502
- Yamins DL, DiCarlo JJ. Using goal-driven deep learning models to understand sensory cortex. Nature neuroscience. 2016;19(3):356-365. https://doi.org/10.1038/nn.4244 PMID: 26906502
- (2016) Nature Neuroscience , vol.19 , Issue.3 , pp. 356-365
- Yamins, D.L.¹ DiCarlo, J.J.²

23
- 84936878573
- Deep neural networks reveal a gradient in the complexity of neural representations across the ventral stream
- 26157000
- Güçlü U, van Gerven MA. Deep neural networks reveal a gradient in the complexity of neural representations across the ventral stream. The Journal of Neuroscience. 2015;35(27):10005-10014. https://doi.org/10.1523/JNEUROSCI.5023-14.2015 PMID: 26157000
- (2015) The Journal of Neuroscience , vol.35 , Issue.27 , pp. 10005-10014
- Güçlü, U.¹ Van Gerven, M.A.²

24
- 84937959846
- Recurrent models of visual attention
- Mnih V, Heess N, Graves A, et al. Recurrent models of visual attention. In: Advances in Neural Information Processing Systems; 2014. p. 2204-2212.
- (2014) Advances in Neural Information Processing Systems , pp. 2204-2212
- Mnih, V.¹ Heess, N.² Graves, A.³

25
- 84965143740
- End-to-end memory networks
- Sukhbaatar S, Weston J, Fergus R, et al. End-to-end memory networks. In: Advances in Neural Information Processing Systems; 2015. p. 2431-2439.
- (2015) Advances in Neural Information Processing Systems , pp. 2431-2439
- Sukhbaatar, S.¹ Weston, J.² Fergus, R.³

26
- 85009983859
- arXiv preprint arXiv:1511.09249
- Schmidhuber J. On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models. arXiv preprint arXiv:1511.09249. 2015;.
- (2015) On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models
- Schmidhuber, J.¹

27
- 84979258646
- arXiv preprint arXiv:1602.02672
- Foerster JN, Assael YM, de Freitas N, Whiteson S. Learning to communicate to solve riddles with deep distributed recurrent q-networks. arXiv preprint arXiv:1602.02672. 2016;.
- (2016) Learning to Communicate to Solve Riddles with Deep Distributed Recurrent q-networks
- Foerster, J.N.¹ Assael, Y.M.² De Freitas, N.³ Whiteson, S.⁴

28
- 84922231667
- OUP Oxford
- Skyrms B. Signals: Evolution, learning, and information. OUP Oxford; 2010.
- (2010) Signals: Evolution, Learning, and Information
- Skyrms, B.¹

29
- 47949112218
- Do conventions need to be common knowledge?
- Binmore K. Do conventions need to be common knowledge? Topoi. 2008;27(1-2):17-27. https://doi.org/10.1007/s11245-008-9033-4
- (2008) Topoi , vol.27 , Issue.1-2 , pp. 17-27
- Binmore, K.¹

30
- 85016937915
- John Wiley & Sons
- Lewis D. Convention: A philosophical study. John Wiley & Sons; 2008.
- (2008) Convention: A Philosophical Study
- Lewis, D.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.