-
2
-
-
84858960516
-
A survey of monte carlo tree search methods
-
Browne, Cameron B, Powley, Edward, Whitehouse, Daniel, Lucas, Simon M, Cowling, Peter, Rohlfshagen, Philipp, Tavener, Stephen, Perez, Diego, Samothrakis, Spyridon, Colton, Simon, et al. A survey of monte carlo tree search methods. Computational Intelligence and AI in Games, IEEE Transactions on, 4(1):1–43, 2012.
-
(2012) Computational Intelligence and AI in Games, IEEE Transactions on, vol. 4, issue 1, pp. 1-43
-
-
Browne, C.B.
Powley, E.
Whitehouse, D.
Lucas, S.M.
Cowling, P.
Rohlfshagen, P.
Tavener, S.
Perez, D.
Samothrakis, S.
Colton, S.
-
5
-
-
78951484236
-
Fuego: an open-source framework for board games and go engine based on monte carlo tree search
-
Enzenberger, Markus, Müller, Martin, Arneson, Broderick, and Segal, Richard. Fuego: an open-source framework for board games and go engine based on monte carlo tree search. Computational Intelligence and AI in Games, IEEE Transactions on, 2(4):259–270, 2010.
-
(2010) Computational Intelligence and AI in Games, IEEE Transactions on, vol. 2, issue 4, pp. 259-270
-
-
Enzenberger, M.
Müller, M.
Arneson, B.
Segal, R.
-
6
-
-
84954187031
-
Adaptive playouts in monte-carlo tree search with policy-gradient reinforcement learning
-
Springer
-
Graf, Tobias and Platzner, Marco. Adaptive playouts in monte-carlo tree search with policy-gradient reinforcement learning. In Advances in Computer Games, pp. 1–11. Springer, 2015.
-
(2015) Advances in Computer Games, pp. 1-11
-
-
Graf, T.
Platzner, M.
-
7
-
-
84958589374
-
-
arXiv preprint
-
He, Kaiming, Zhang, Xiangyu, Ren, Shaoqing, and Sun, Jian. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385, 2015.
-
(2015) Deep Residual Learning for Image Recognition
-
-
He, K.
Zhang, X.
Ren, S.
Sun, J.
-
8
-
-
33750293964
-
Bandit based monte-carlo planning
-
Springer
-
Kocsis, Levente and Szepesvári, Csaba. Bandit based monte-carlo planning. In Machine Learning: ECML 2006, pp. 282–293. Springer, 2006.
-
(2006) Machine Learning: ECML 2006, pp. 282-293
-
-
Kocsis, L.
Szepesvári, C.
-
9
-
-
84898938510
-
Actor-critic algorithms
-
Konda, Vijay R and Tsitsiklis, John N. Actor-critic algorithms. In NIPS, volume 13, pp. 1008–1014, 1999.
-
(1999) NIPS, vol. 13, pp. 1008-1014
-
-
Konda, V.R.
Tsitsiklis, J.N.
-
10
-
-
85083951314
-
-
Maddison, Chris J, Huang, Aja, Sutskever, Ilya, and Silver, David. Move evaluation in go using deep convolutional neural networks. 2015.
-
(2015) Move Evaluation in Go Using Deep Convolutional Neural Networks
-
-
Maddison, C.J.
Huang, A.
Sutskever, I.
Silver, D.
-
11
-
-
0031682491
-
Evolving neural networks to play go
-
Richards, Norman, Moriarty, David E, and Miikkulainen, Risto. Evolving neural networks to play go. Applied Intelligence, 8(1):85–96, 1998.
-
(1998) Applied Intelligence, vol. 8, issue 1, pp. 85-96
-
-
Richards, N.
Moriarty, D.E.
Miikkulainen, R.
-
12
-
-
0000433333
-
Temporal difference learning of position evaluation in the game of go
-
Schraudolph, Nicol N, Dayan, Peter, and Sejnowski, Terrence J. Temporal difference learning of position evaluation in the game of go. Advances in Neural Information Processing Systems, pp. 817–817, 1994.
-
(1994) Advances in Neural Information Processing Systems, pp. 817
-
-
Schraudolph, N.N.
Dayan, P.
Sejnowski, T.J.
-
14
-
-
84963949906
-
Mastering the game of go with deep neural networks and tree search
-
Silver, David, Huang, Aja, Maddison, Chris J., Guez, Arthur, Sifre, Laurent, van den Driessche, George, Schrittwieser, Julian, Antonoglou, Ioannis, Panneershelvam, Veda, Lanctot, Marc, Dieleman, Sander, Grewe, Dominik, Nham, John, Kalchbrenner, Nal, Sutskever, Ilya, Lillicrap, Timothy, Leach, Madeleine, Kavukcuoglu, Koray, Graepel, Thore, and Hassabis, Demis. Mastering the game of go with deep neural networks and tree search. Nature, 2016.
-
(2016) Nature
-
-
Silver, D.
Huang, A.
Maddison, C.J.
Guez, A.
Sifre, L.
Van Den Driessche, G.
Schrittwieser, J.
Antonoglou, I.
Panneershelvam, V.
Lanctot, M.
Dieleman, S.
Grewe, D.
Nham, J.
Kalchbrenner, N.
Sutskever, I.
Lillicrap, T.
Leach, M.
Kavukcuoglu, K.
Graepel, T.
Hassabis, D.
-
15
-
-
52049104037
-
Mimicking go experts with convolutional neural networks
-
Springer
-
Sutskever, Ilya and Nair, Vinod. Mimicking go experts with convolutional neural networks. In Artificial Neural Networks-ICANN 2008, pp. 101–110. Springer, 2008.
-
(2008) Artificial Neural Networks-ICANN 2008, pp. 101-110
-
-
Sutskever, I.
Nair, V.
-
16
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
Citeseer
-
Sutton, Richard S, McAllester, David A, Singh, Satinder P, Mansour, Yishay, et al. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pp. 1057–1063. Citeseer, 1999.
-
(1999) NIPS, vol. 99, pp. 1057-1063
-
-
Sutton, R.S.
McAllester, D.A.
Singh, S.P.
Mansour, Y.