메뉴 건너뛰기




Volumn , Issue , 2016, Pages 2145-2153

Learning to communicate with deep multi-agent reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

DEEP NEURAL NETWORKS; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING;

EID: 85019195482     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1592)

References (26)
  • 1
    • 52249098423 scopus 로고    scopus 로고
    • Optimal and approximate Q-value functions for decentralized POMDPs
    • F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32:289-353, 2008.
    • (2008) JAIR , vol.32 , pp. 289-353
    • Oliehoek, F.A.1    Spaan, M.T.J.2    Vlassis, N.3
  • 2
    • 84962082047 scopus 로고    scopus 로고
    • Multi-agent reinforcement learning as a rehearsal for decentralized planning
    • L. Kraemer and B. Banerjee. Multi-agent reinforcement learning as a rehearsal for decentralized planning. Neurocomputing, 190:82-94, 2016.
    • (2016) Neurocomputing , vol.190 , pp. 82-94
    • Kraemer, L.1    Banerjee, B.2
  • 4
    • 85152198941 scopus 로고
    • Multi-agent reinforcement learning: Independent vs. Cooperative agents
    • M. Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In ICML, 1993.
    • (1993) ICML
    • Tan, M.1
  • 5
    • 84868340899 scopus 로고    scopus 로고
    • QueryPOMDP: POMDP-based communication in multiagent systems
    • F. S. Melo, M. Spaan, and S. J. Witwicki. QueryPOMDP: POMDP-based communication in multiagent systems. In Multi-Agent Systems, pages 189-204. 2011.
    • (2011) Multi-Agent Systems , pp. 189-204
    • Melo, F.S.1    Spaan, M.2    Witwicki, S.J.3
  • 6
    • 26444601262 scopus 로고    scopus 로고
    • Cooperative multi-agent learning: The state of the art
    • L. Panait and S. Luke. Cooperative multi-agent learning: The state of the art. Autonomous Agents and Multi-Agent Systems, 11(3):387-434, 2005.
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 12
    • 79961219393 scopus 로고    scopus 로고
    • Discovering binary codes for documents by learning deep generative models
    • G. Hinton and R. Salakhutdinov. Discovering binary codes for documents by learning deep generative models. Topics in Cognitive Science, 3(1):74-91, 2011.
    • (2011) Topics in Cognitive Science , vol.3 , Issue.1 , pp. 74-91
    • Hinton, G.1    Salakhutdinov, R.2
  • 19
    • 84893343292 scopus 로고    scopus 로고
    • Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude
    • T. Tieleman and G. Hinton. Lecture 6.5-rmsprop: Divide the gradient by a running average of its recent magnitude. COURSERA: Neural Networks for Machine Learning, 4(2), 2012.
    • (2012) COURSERA: Neural Networks for Machine Learning , vol.4 , Issue.2
    • Tieleman, T.1    Hinton, G.2
  • 23
    • 84969584486 scopus 로고    scopus 로고
    • Batch normalization: Accelerating deep network training by reducing internal covariate shift
    • S. Ioffe and C. Szegedy. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In ICML, pages 448-456, 2015.
    • (2015) ICML , pp. 448-456
    • Ioffe, S.1    Szegedy, C.2
  • 25
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278-2324, 1998.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • LeCun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 26
    • 33746382692 scopus 로고    scopus 로고
    • How did language go discrete?
    • M. Tallerman, editor chapter 3. Oxford University Press
    • M. Studdert-Kennedy. How did language go discrete? In M. Tallerman, editor, Language Origins: Perspectives on Evolution, chapter 3. Oxford University Press, 2005.
    • (2005) Language Origins: Perspectives on Evolution
    • Studdert-Kennedy, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.