메뉴 건너뛰기




Volumn , Issue , 2018, Pages

Emergent complexity via multi-agent competition

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; REINFORCEMENT LEARNING;

EID: 85083954226     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (183)

References (34)
  • 3
    • 1942470793 scopus 로고    scopus 로고
    • Multitask learning
    • Springer
    • Rich Caruana. Multitask learning. In Learning to learn, pp. 95–133. Springer, 1998.
    • (1998) Learning to Learn , pp. 95-133
    • Caruana, R.1
  • 14
    • 85018878907 scopus 로고    scopus 로고
    • Stein variational gradient descent: A general purpose Bayesian inference algorithm
    • Qiang Liu and Dilin Wang. Stein variational gradient descent: A general purpose bayesian inference algorithm. In Advances In Neural Information Processing Systems, pp. 2378–2386, 2016.
    • (2016) Advances in Neural Information Processing Systems , pp. 2378-2386
    • Liu, Q.1    Wang, D.2
  • 16
    • 84857861863 scopus 로고    scopus 로고
    • Independent reinforcement learners in cooperative Markov games: A survey regarding coordination problems
    • Laetitia Matignon, Guillaume J Laurent, and Nadine Le Fort-Piat. Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems. The Knowledge Engineering Review, 27(1):1–31, 2012.
    • (2012) The Knowledge Engineering Review , vol.27 , Issue.1 , pp. 1-31
    • Matignon, L.1    Laurent, G.J.2    Le Fort-Piat, N.3
  • 19
    • 85062216559 scopus 로고    scopus 로고
    • OpenAI. OpenAI Dota 2 1v1 bot, 2017. URL https://openai.com/the-international/.
    • (2017) OpenAI Dota 2 1v1 Bot
  • 20
    • 26444601262 scopus 로고    scopus 로고
    • Cooperative multi-agent learning: The state of the art
    • Liviu Panait and Sean Luke. Cooperative multi-agent learning: The state of the art. Autonomous agents and multi-agent systems, 11(3):387–434, 2005.
    • (2005) Autonomous Agents and Multi-Agent Systems , vol.11 , Issue.3 , pp. 387-434
    • Panait, L.1    Luke, S.2
  • 31
    • 0029276036 scopus 로고
    • Temporal difference learning and td-gammon
    • Gerald Tesauro. Temporal difference learning and td-gammon. Communications of the ACM, 38(3): 58–68, 1995.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 34
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Ronald J Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning, 8(3-4):229–256, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 229-256
    • Williams, R.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.