SCOPUS Information Search Platform

Advances in Neural Information Processing Systems

Volume , Issue , 2016, Pages 2145-2153

Learning to communicate with deep multi-agent reinforcement learning

Authors (4): Foerster, Jakob N. (a); Assael, Yannis M. (a); De Freitas, Nando (a, b, c); Whiteson, Shimon (a)

a UNIVERSITY OF OXFORD (United Kingdom)

b CANADIAN INSTITUTE FOR ADVANCED RESEARCH (Canada)

c DEEPMIND (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

DEEP NEURAL NETWORKS; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING;

AGENT LEARNING; COMPLEX ENVIRONMENTS; COMPUTER VISION PROBLEMS; DECENTRALISED; ENGINEERING INNOVATIONS; MULTI-AGENT REINFORCEMENT LEARNING; MULTIPLE AGENTS; PARTIAL OBSERVABILITY;

DEEP LEARNING;

EID: 85019195482; PISSN: 10495258; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited : (1592)

References (26)

1
- 52249098423
- Optimal and approximate Q-value functions for decentralized POMDPs
- F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32:289-353, 2008.
- (2008) JAIR , vol.32 , pp. 289-353
- Oliehoek, F.A.¹ Spaan, M.T.J.² Vlassis, N.³

2
- 84962082047
- Multi-agent reinforcement learning as a rehearsal for decentralized planning
- L. Kraemer and B. Banerjee. Multi-agent reinforcement learning as a rehearsal for decentralized planning. Neurocomputing, 190:82-94, 2016.
- (2016) Neurocomputing , vol.190 , pp. 82-94
- Kraemer, L.¹ Banerjee, B.²

3
- 84924051598
- Human-level control through deep reinforcement learning
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis. Human-level control through deep reinforcement learning. Nature, 518(7540):529-533, 2015.
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰ Petersen, S.¹¹ Beattie, C.¹² Sadik, A.¹³ Antonoglou, I.¹⁴ King, H.¹⁵ Kumaran, D.¹⁶ Wierstra, D.¹⁷ Legg, S.¹⁸ Hassabis, D.¹⁹

4
- 85152198941
- Multi-agent reinforcement learning: Independent vs. Cooperative agents
- M. Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In ICML, 1993.
- (1993) ICML
- Tan, M.¹

5
- 84868340899
- QueryPOMDP: POMDP-based communication in multiagent systems
- F. S. Melo, M. Spaan, and S. J. Witwicki. QueryPOMDP: POMDP-based communication in multiagent systems. In Multi-Agent Systems, pages 189-204. 2011.
- (2011) Multi-Agent Systems , pp. 189-204
- Melo, F.S.¹ Spaan, M.² Witwicki, S.J.³

6
- 26444601262
- Cooperative multi-agent learning: The state of the art
- L. Panait and S. Luke. Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems, 11(3):387-434, 2005.
- (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
- Panait, L.¹ Luke, S.²

7
- 70349288184
- Learning of communication codes in multi-agent reinforcement learning problem
- T. Kasai, H. Tenmoto, and A. Kamiya. Learning of communication codes in multi-agent reinforcement learning problem. In IEEE Soft Computing in Industrial Applications, pages 1-6, 2008.
- (2008) IEEE Soft Computing in Industrial Applications , pp. 1-6
- Kasai, T.¹ Tenmoto, H.² Kamiya, A.³

8
- 7044224227
- Learning communication for multi-agent systems
- Springer
- C. L. Giles and K. C. Jim. Learning communication for multi-agent systems. In Innovative Concepts for Agent-Based Systems, pages 377-390. Springer, 2002.
- (2002) Innovative Concepts for Agent-Based Systems , pp. 377-390
- Giles, C.L.¹ Jim, K.C.²

9
- 84965100881
- arXiv preprint arXiv:1502.04623
- K. Gregor, I. Danihelka, A. Graves, and D. Wierstra. Draw: A recurrent neural network for image generation. arXiv preprint arXiv:1502.04623, 2015.
- (2015) Draw: A Recurrent Neural Network for Image Generation
- Gregor, K.¹ Danihelka, I.² Graves, A.³ Wierstra, D.⁴

10
- 85015299171
- arXiv preprint arXiv:1605.07736
- S. Sukhbaatar, A. Szlam, and R. Fergus. Learning multiagent communication with backpropagation. arXiv preprint arXiv:1605.07736, 2016.
- (2016) Learning Multiagent Communication with Backpropagation
- Sukhbaatar, S.¹ Szlam, A.² Fergus, R.³

11
- 84988920420
- arXiv preprint arXiv:1602.02830
- M. Courbariaux and Y. Bengio. BinaryNet: Training deep neural networks with weights and activations constrained to +1 or -1. arXiv preprint arXiv:1602.02830, 2016.
- (2016) BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1
- Courbariaux, M.¹ Bengio, Y.²

12
- 79961219393
- Discovering binary codes for documents by learning deep generative models
- G. Hinton and R. Salakhutdinov. Discovering binary codes for documents by learning deep generative models. Topics in Cognitive Science, 3(1):74-91, 2011.
- (2011) Topics in Cognitive Science , vol.3 , Issue.1 , pp. 74-91
- Hinton, G.¹ Salakhutdinov, R.²

13
- 0003420416
- MIT Press
- R. S. Sutton and A. G. Barto. Introduction to reinforcement learning. MIT Press, 1998.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

14
- 84998600495
- arXiv preprint arXiv:1511.08779
- A. Tampuu, T. Matiisen, D. Kodelja, I. Kuzovkin, K. Korjus, J. Aru, J. Aru, and R. Vicente. Multiagent cooperation and competition with deep reinforcement learning. arXiv preprint arXiv:1511.08779, 2015.
- (2015) Multiagent Cooperation and Competition with Deep Reinforcement Learning
- Tampuu, A.¹ Matiisen, T.² Kodelja, D.³ Kuzovkin, I.⁴ Korjus, K.⁵ Aru, J.⁶ Aru, J.⁷ Vicente, R.⁸

15
- 84924111881
- Cambridge University Press, New York
- Y. Shoham and K. Leyton-Brown. Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations. Cambridge University Press, New York, 2009.
- (2009) Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations
- Shoham, Y.¹ Leyton-Brown, K.²

16
- 84908189741
- arXiv preprint 1401.8074
- E. Zawadzki, A. Lipson, and K. Leyton-Brown. Empirically evaluating multiagent learning algorithms. arXiv preprint 1401.8074, 2014.
- (2014) Empirically Evaluating Multiagent Learning Algorithms
- Zawadzki, E.¹ Lipson, A.² Leyton-Brown, K.³

17
- 85015404662
- arXiv preprint arXiv:1507.06527
- M. Hausknecht and P. Stone. Deep recurrent Q-learning for partially observable MDPs. arXiv preprint arXiv:1507.06527, 2015.
- (2015) Deep Recurrent Q-learning for Partially Observable MDPs
- Hausknecht, M.¹ Stone, P.²

18
- 84959861546
- arXiv preprint arXiv:1506.08941
- K. Narasimhan, T. Kulkarni, and R. Barzilay. Language understanding for text-based games using deep reinforcement learning. arXiv preprint arXiv:1506.08941, 2015.
- (2015) Language Understanding for Text-based Games Using Deep Reinforcement Learning
- Narasimhan, K.¹ Kulkarni, T.² Barzilay, R.³

19
- 84893343292
- Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
- T. Tieleman and G. Hinton. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4(2), 2012.
- (2012) COURSERA: Neural Networks for Machine Learning , vol.4 , Issue.2
- Tieleman, T.¹ Hinton, G.²

20
- 84943799837
- arXiv preprint arXiv:1409.1259
- K. Cho, B. van Merriënboer, D. Bahdanau, and Y. Bengio. On the properties of neural machine translation: Encoder-decoder approaches. arXiv preprint arXiv:1409.1259, 2014.
- (2014) On the Properties of Neural Machine Translation: Encoder-decoder Approaches
- Cho, K.¹ Van Merriënboer, B.² Bahdanau, D.³ Bengio, Y.⁴

21
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8):1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

22
- 84939821078
- arXiv preprint arXiv:1412.3555
- J. Chung, C. Gulcehre, K. Cho, and Y. Bengio. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555, 2014.
- (2014) Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
- Chung, J.¹ Gulcehre, C.² Cho, K.³ Bengio, Y.⁴

23
- 84969584486
- Batch normalization: Accelerating deep network training by reducing internal covariate shift
- S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, pages 448-456, 2015.
- (2015) ICML , pp. 448-456
- Ioffe, S.¹ Szegedy, C.²

24
- 84860550938
- Technical report, OCF, UC Berkeley
- W. Wu. 100 prisoners and a lightbulb. Technical report, OCF, UC Berkeley, 2002.
- (2002) 100 Prisoners and a Lightbulb
- Wu, W.¹

25
- 0032203257
- Gradient-based learning applied to document recognition
- Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
- (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

26
- 33746382692
- How did language go discrete?
- M. Tallerman, editor, chapter 3. Oxford University Press
- M. Studdert-Kennedy. How did language go discrete? In M. Tallerman, editor, Language Origins: Perspectives on Evolution, chapter 3. Oxford University Press, 2005.
- (2005) Language Origins: Perspectives on Evolution
- Studdert-Kennedy, M.¹

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.