SCOPUS Information Search Platform

Advances in Neural Information Processing Systems

Volume , Issue , 2016, Pages 2252-2260

Learning multiagent communication with backpropagation

(3) Sukhbaatar, Sainbayar (a); Szlam, Arthur (b); Fergus, Rob (b)

(a) POLYTECHNIC UNIVERSITY (United States)

(b) FACEBOOK AI RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BACKPROPAGATION; MULTI AGENT SYSTEMS;

COOPERATIVE TASKS; MULTI-AGENT COMMUNICATIONS; MULTIPLE AGENTS; NEURAL MODELING;

COOPERATIVE COMMUNICATION;

EID: 85018860957 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (1184)

References (37)

1
- 40949147745
- A comprehensive survey of multiagent reinforcement learning
- L. Busoniu, R. Babuska, and B. De Schutter. A comprehensive survey of multiagent reinforcement learning. Systems, Man, and Cybernetics, IEEE Transactions on, 38(2):156-172, 2008.
- (2008) Systems, Man, and Cybernetics, IEEE Transactions On , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

2
- 84871781883
- An overview of recent progress in the study of distributed multi-agent coordination
- Y. Cao, W. Yu, W. Ren, and G. Chen. An overview of recent progress in the study of distributed multi-agent coordination. IEEE Transactions on Industrial Informatics, 9(1):427-438, 2013.
- (2013) IEEE Transactions on Industrial Informatics , vol.9 , Issue.1 , pp. 427-438
- Cao, Y.¹ Yu, W.² Ren, W.³ Chen, G.⁴

3
- 0032208335
- Elevator group control using multiple reinforcement learning agents
- R. H. Crites and A. G. Barto. Elevator group control using multiple reinforcement learning agents. Machine Learning, 33(2):235-262, 1998.
- (1998) Machine Learning , vol.33 , Issue.2 , pp. 235-262
- Crites, R.H.¹ Barto, A.G.²

4
- 84979258646
- Learning to communicate to solve riddles with deep distributed recurrent Q-networks
- J. N. Foerster, Y. M. Assael, N. de Freitas, and S. Whiteson. Learning to communicate to solve riddles with deep distributed recurrent Q-networks. arXiv, abs/1602.02672, 2016.
- (2016) arXiv, abs/1602.02672
- Foerster, J.N.¹ Assael, Y.M.² De Freitas, N.³ Whiteson, S.⁴

5
- 0034207091
- Probabilistic approach to collaborative multi-robot localization
- D. Fox, W. Burgard, H. Kruppa, and S. Thrun. Probabilistic approach to collaborative multi-robot localization. Autonomous Robots, 8(3):325-344, 2000.
- (2000) Autonomous Robots , vol.8 , Issue.3 , pp. 325-344
- Fox, D.¹ Burgard, W.² Kruppa, H.³ Thrun, S.⁴

6
- 7044224227
- Learning communication for multi-agent systems
- Springer
- C. L. Giles and K. C. Jim. Learning communication for multi-agent systems. In Innovative Concepts for Agent Based Systems, pages 377-390. Springer, 2002.
- (2002) Innovative Concepts for Agent Based Systems , pp. 377-390
- Giles, C.L.¹ Jim, K.C.²

7
- 0012296128
- Multiagent planning with factored MDPs
- C. Guestrin, D. Koller, and R. Parr. Multiagent planning with factored MDPs. In NIPS, 2001.
- (2001) NIPS
- Guestrin, C.¹ Koller, D.² Parr, R.³

8
- 84937779024
- Deep learning for real-time atari game play using offline monte-carlo tree search planning
- X. Guo, S. Singh, H. Lee, R. L. Lewis, and X. Wang. Deep learning for real-time atari game play using offline monte-carlo tree search planning. In NIPS, 2014.
- (2014) NIPS
- Guo, X.¹ Singh, S.² Lee, H.³ Lewis, R.L.⁴ Wang, X.⁵

9
- 85083953090
- Neural GPUs learn algorithms
- L. Kaiser and I. Sutskever. Neural gpus learn algorithms. In ICLR, 2016.
- (2016) ICLR
- Kaiser, L.¹ Sutskever, I.²

10
- 70349288184
- Learning of communication codes in multi-agent reinforcement learning problem
- T. Kasai, H. Tenmoto, and A. Kamiya. Learning of communication codes in multi-agent reinforcement learning problem. IEEE Conference on Soft Computing in Industrial Applications, pages 1-6, 2008.
- (2008) IEEE Conference on Soft Computing in Industrial Applications , pp. 1-6
- Kasai, T.¹ Tenmoto, H.² Kamiya, A.³

11
- 85083951076
- Adam: A method for stochastic optimization
- D. Kingma and J. Ba. Adam: A method for stochastic optimization. In ICLR, 2015.
- (2015) ICLR
- Kingma, D.¹ Ba, J.²

12
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- M. Lauer and M. A. Riedmiller. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In ICML, 2000.
- (2000) ICML
- Lauer, M.¹ Riedmiller, M.A.²

13
- 84979924150
- End-to-end training of deep visuomotor policies
- S. Levine, C. Finn, T. Darrell, and P. Abbeel. End-to-end training of deep visuomotor policies. Journal of Machine Learning Research, 17(39):1-40, 2016.
- (2016) Journal of Machine Learning Research , vol.17 , Issue.39 , pp. 1-40
- Levine, S.¹ Finn, C.² Darrell, T.³ Abbeel, P.⁴

14
- 85018937234
- Gated graph sequence neural networks
- Y. Li, D. Tarlow, M. Brockschmidt, and R. Zemel. Gated graph sequence neural networks. In ICLR, 2015.
- (2015) ICLR
- Li, Y.¹ Tarlow, D.² Brockschmidt, M.³ Zemel, R.⁴

15
- 0001547175
- Value-function reinforcement learning in Markov games
- M. L. Littman. Value-function reinforcement learning in markov games. Cognitive Systems Research, 2(1):55-66, 2001.
- (2001) Cognitive Systems Research , vol.2 , Issue.1 , pp. 55-66
- Littman, M.L.¹

16
- 85083951314
- Move evaluation in go using deep convolutional neural networks
- C. J. Maddison, A. Huang, I. Sutskever, and D. Silver. Move evaluation in go using deep convolutional neural networks. In ICLR, 2015.
- (2015) ICLR
- Maddison, C.J.¹ Huang, A.² Sutskever, I.³ Silver, D.⁴

17
- 85028097976
- Coordination of communication in robot teams by reinforcement learning
- D. Maravall, J. De Lope, and R. Domínguez. Coordination of communication in robot teams by reinforcement learning. Robotics and Autonomous Systems, 61(7):661-666, 2013.
- (2013) Robotics and Autonomous Systems , vol.61 , Issue.7 , pp. 661-666
- Maravall, D.¹ De Lope, J.² Domínguez, R.³

18
- 0030647149
- Reinforcement learning in the multi-robot domain
- M. Matarić. Reinforcement learning in the multi-robot domain. Autonomous Robots, 4(1):73-83, 1997.
- (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 73-83
- Matarić, M.¹

19
- 84868340899
- Querypomdp: Pomdp-based communication in multiagent systems
- F. S. Melo, M. Spaan, and S. J. Witwicki. Querypomdp: Pomdp-based communication in multiagent systems. In Multi-Agent Systems, pages 189-204, 2011.
- (2011) Multi-Agent Systems , pp. 189-204
- Melo, F.S.¹ Spaan, M.² Witwicki, S.J.³

20
- 84924051598
- Human-level control through deep reinforcement learning
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, D. Wierstra, S. Legg, and D. Hassabis. Human-level control through deep reinforcement learning. Nature, 518(7540):529-533, 2015.
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰ Petersen, S.¹¹ Beattie, C.¹² Sadik, A.¹³ Wierstra, D.¹⁴ Legg, S.¹⁵ Hassabis, D.¹⁶

21
- 64149119332
- Consensus and cooperation in networked multi-agent systems
- R. Olfati-Saber, J. Fax, and R. Murray. Consensus and cooperation in networked multi-agent systems. Proceedings of the IEEE, 95(1):215-233, 2007.
- (2007) Proceedings of the IEEE , vol.95 , Issue.1 , pp. 215-233
- Olfati-Saber, R.¹ Fax, J.² Murray, R.³

22
- 0020276268
- Reverend bayes on inference engines: A distributed hierarchical approach
- J. Pearl. Reverend bayes on inference engines: A distributed hierarchical approach. In AAAI, 1982.
- (1982) AAAI
- Pearl, J.¹

23
- 58649113008
- The graph neural network model
- F. Scarselli, M. Gori, A. C. Tsoi, M. Hagenbuchner, and G. Monfardini. The graph neural network model. IEEE Trans. Neural Networks, 20(1):61-80, 2009.
- (2009) IEEE Trans. Neural Networks , vol.20 , Issue.1 , pp. 61-80
- Scarselli, F.¹ Gori, M.² Tsoi, A.C.³ Hagenbuchner, M.⁴ Monfardini, G.⁵

24
- 84963949906
- Mastering the game of go with deep neural networks and tree search
- D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. Van Den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, et al. Mastering the game of go with deep neural networks and tree search. Nature, 529(7587):484-489, 2016.
- (2016) Nature , vol.529 , Issue.7587 , pp. 484-489
- Silver, D.¹ Huang, A.² Maddison, C.J.³ Guez, A.⁴ Sifre, L.⁵ Van Den Driessche, G.⁶ Schrittwieser, J.⁷ Antonoglou, I.⁸ Panneershelvam, V.⁹ Lanctot, M.¹⁰

25
- 0031648211
- Towards collaborative and adversarial learning: A case study in robotic soccer
- P. Stone and M. Veloso. Towards collaborative and adversarial learning: A case study in robotic soccer. International Journal of Human Computer Studies, (48), 1998.
- (1998) International Journal of Human Computer Studies , Issue.48
- Stone, P.¹ Veloso, M.²

26
- 85018863429
- Mazebase: A sandbox for learning from games
- S. Sukhbaatar, A. Szlam, G. Synnaeve, S. Chintala, and R. Fergus. Mazebase: A sandbox for learning from games. CoRR, abs/1511.07401, 2015.
- (2015) CoRR
- Sukhbaatar, S.¹ Szlam, A.² Synnaeve, G.³ Chintala, S.⁴ Fergus, R.⁵

27
- 84965143740
- End-to-end memory networks
- S. Sukhbaatar, A. Szlam, J. Weston, and R. Fergus. End-to-end memory networks. NIPS, 2015.
- (2015) NIPS
- Sukhbaatar, S.¹ Szlam, A.² Weston, J.³ Fergus, R.⁴

28
- 0003420416
- Introduction to Reinforcement Learning
- MIT Press
- R. S. Sutton and A. G. Barto. Introduction to Reinforcement Learning. MIT Press, 1998.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

29
- 84998600495
- Multiagent cooperation and competition with deep reinforcement learning
- A. Tampuu, T. Matiisen, D. Kodelja, I. Kuzovkin, K. Korjus, J. Aru, and R. Vicente. Multiagent cooperation and competition with deep reinforcement learning. arXiv:1511.08779, 2015.
- (2015) arXiv:1511.08779
- Tampuu, A.¹ Matiisen, T.² Kodelja, D.³ Kuzovkin, I.⁴ Korjus, K.⁵ Aru, J.⁶ Vicente, R.⁷

30
- 85152198941
- Multi-agent reinforcement learning: Independent vs. Cooperative agents
- M. Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In ICML, 1993.
- (1993) ICML
- Tan, M.¹

31
- 84893343292
- Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
- T. Tieleman and G. Hinton. Lecture 6.5-RmsProp: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 2012.
- (2012) COURSERA: Neural Networks for Machine Learning
- Tieleman, T.¹ Hinton, G.²

32
- 84878551320
- Efficient distributed reinforcement learning through agreement
- P. Varshavskaya, L. P. Kaelbling, and D. Rus. Distributed Autonomous Robotic Systems 8, chapter Efficient Distributed Reinforcement Learning through Agreement, pages 367-378. 2009.
- (2009) Distributed Autonomous Robotic Systems , vol.8 , pp. 367-378
- Varshavskaya, P.¹ Kaelbling, L.P.² Rus, D.³

33
- 67649405225
- Reinforcement learning to play an optimal nash equilibrium in team Markov games
- X. Wang and T. Sandholm. Reinforcement learning to play an optimal nash equilibrium in team markov games. In NIPS, pages 1571-1578, 2002.
- (2002) NIPS , pp. 1571-1578
- Wang, X.¹ Sandholm, T.²

34
- 85083951707
- Towards ai-complete question answering: A set of prerequisite toy tasks
- J. Weston, A. Bordes, S. Chopra, and T. Mikolov. Towards ai-complete question answering: A set of prerequisite toy tasks. In ICLR, 2016.
- (2016) ICLR
- Weston, J.¹ Bordes, A.² Chopra, S.³ Mikolov, T.⁴

35
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- R. J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. In Machine Learning, pages 229-256, 1992.
- (1992) Machine Learning , pp. 229-256
- Williams, R.J.¹

36
- 84999008900
- Dynamic memory networks for visual and textual question answering
- C. Xiong, S. Merity, and R. Socher. Dynamic memory networks for visual and textual question answering. ICML, 2016.
- (2016) ICML
- Xiong, C.¹ Merity, S.² Socher, R.³

37
- 84899453582
- Coordinating multi-agent reinforcement learning with limited communication
- C. Zhang and V. Lesser. Coordinating multi-agent reinforcement learning with limited communication. In Proc. AAMAS, pages 1101-1108, 2013.
- (2013) Proc. AAMAS , pp. 1101-1108
- Zhang, C.¹ Lesser, V.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.