메뉴 건너뛰기




Volumn 12, Issue 4, 2017, Pages

Multiagent cooperation and competition with deep reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

BEHAVIOR; COMPETITION; HUMAN; REINFORCEMENT; VIDEO GAME; ALGORITHM; COOPERATION; GAME; HUMAN RELATION; LEARNING; PHYSIOLOGY; REWARD; SOCIAL BEHAVIOR;

EID: 85017018413     PISSN: None     EISSN: 19326203     Source Type: Journal    
DOI: 10.1371/journal.pone.0172395     Document Type: Article
Times cited : (807)

References (30)
  • 6
    • 84855819164 scopus 로고    scopus 로고
    • Finite-time stability of multi-agent system in disturbed environment
    • Wang L, Sun S, Xia C. Finite-time stability of multi-agent system in disturbed environment. Nonlinear Dynamics. 2012;67(3):2009-2016. https://doi.org/10.1007/s11071-011-0125-0
    • (2012) Nonlinear Dynamics , vol.67 , Issue.3 , pp. 2009-2016
    • Wang, L.1    Sun, S.2    Xia, C.3
  • 7
    • 84924051598 scopus 로고    scopus 로고
    • Human-level control through deep reinforcement learning
    • 25719670
    • Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, et al. Human-level control through deep reinforcement learning. Nature. 2015;518(7540):529-533. https://doi.org/10.1038/nature14236 PMID: 25719670
    • (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
    • Mnih, V.1    Kavukcuoglu, K.2    Silver, D.3    Rusu, A.A.4    Veness, J.5    Bellemare, M.G.6
  • 13
    • 0031630561 scopus 로고    scopus 로고
    • The dynamics of reinforcement learning in cooperative multiagent systems
    • Claus C, Boutilier C. The dynamics of reinforcement learning in cooperative multiagent systems. In: AAAI/IAAI; 1998. p. 746-752.
    • (1998) AAAI/IAAI , pp. 746-752
    • Claus, C.1    Boutilier, C.2
  • 15
    • 34249833101 scopus 로고
    • Q-learning
    • Watkins CJ, Dayan P. Q-learning. Machine learning. 1992;8(3-4):279-292. https://doi.org/10.1023/A:1022676722315
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, C.J.1    Dayan, P.2
  • 17
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-Gammon
    • Tesauro G. Temporal difference learning and TD-Gammon. Communications of the ACM. 1995;38(3):58-68. https://doi.org/10.1145/203330.203343
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 19
    • 84963949906 scopus 로고    scopus 로고
    • Mastering the game of Go with deep neural networks and tree search
    • 26819042
    • Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, et al. Mastering the game of Go with deep neural networks and tree search. Nature. 2016;529(7587):484-489. https://doi.org/10. 1038/nature16961 PMID: 26819042
    • (2016) Nature , vol.529 , Issue.7587 , pp. 484-489
    • Silver, D.1    Huang, A.2    Maddison, C.J.3    Guez, A.4    Sifre, L.5    Van Den Driessche, G.6
  • 22
    • 84975760699 scopus 로고    scopus 로고
    • Using goal-driven deep learning models to understand sensory cortex
    • 26906502
    • Yamins DL, DiCarlo JJ. Using goal-driven deep learning models to understand sensory cortex. Nature neuroscience. 2016;19(3):356-365. https://doi.org/10.1038/nn. 4244 PMID: 26906502
    • (2016) Nature Neuroscience , vol.19 , Issue.3 , pp. 356-365
    • Yamins, D.L.1    DiCarlo, J.J.2
  • 23
    • 84936878573 scopus 로고    scopus 로고
    • Deep neural networks reveal a gradient in the complexity of neural representations across the ventral stream
    • 26157000
    • Guçlu U, van Gerven MA. Deep neural networks reveal a gradient in the complexity of neural representations across the ventral stream. The Journal of Neuroscience. 2015;35(27):10005-10014. https://doi.org/10.1523/JNEUROSCI.5023-14.2015 PMID: 26157000
    • (2015) The Journal of Neuroscience , vol.35 , Issue.27 , pp. 10005-10014
    • Guçlu, U.1    Van Gerven, M.A.2
  • 29
    • 47949112218 scopus 로고    scopus 로고
    • Do conventions need to be common knowledge?
    • Binmore K. Do conventions need to be common knowledge? Topoi. 2008;27(1-2):17-27. https://doi.org/10.1007/s11245-008-9033-4
    • (2008) Topoi , vol.27 , Issue.1-2 , pp. 17-27
    • Binmore, K.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.