SCOPUS 정보 검색 플랫폼

4th International Conference on Learning Representations, ICLR 2016 - Conference Track Proceedings

Volumn , Issue , 2016, Pages

Better computer go player with neural network and long-term prediction

(2) Tian, Yuandong a Zhu, Yan a

a FACEBOOK AI RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

BUDGET CONTROL; NEURAL NETWORKS; PATTERN MATCHING;

BRANCHING FACTORS; CONVOLUTIONAL NEURAL NETWORK; EVALUATION FUNCTION; LONG-TERM GOALS; LONG-TERM PREDICTION; MONTE CARLO TREE SEARCH (MCTS); SEARCH TECHNIQUE; STATE OF THE ART;

DEEP NEURAL NETWORKS;

EID: 85083953106 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (43)

References (16)

1
- 85071028545
- Baudis, Petr and Gailly, Jean-loup. Pachi: State of the art open source go program. pp. 2438, 2012.
- (2012) Pachi: State of the Art Open Source Go Program , pp. 2438
- Baudis, P.¹ Gailly, J.-L.²

2
- 84858960516
- A survey of monte carlo tree search methods
- Browne, Cameron B, Powley, Edward, Whitehouse, Daniel, Lucas, Simon M, Cowling, Peter, Rohlf-shagen, Philipp, Tavener, Stephen, Perez, Diego, Samothrakis, Spyridon, Colton, Simon, et al. A survey of monte carlo tree search methods. Computational Intelligence and AI in Games, IEEE Transactions on, 4(1):1–43, 2012.
- (2012) Computational Intelligence and AI in Games, IEEE Transactions on , vol.4 , Issue.1 , pp. 1-43
- Browne, C.B.¹ Powley, E.² Whitehouse, D.³ Lucas, S.M.⁴ Cowling, P.⁵ Rohlfshagen, P.⁶ Tavener, S.⁷ Perez, D.⁸ Samothrakis, S.⁹ Colton, S.¹⁰

3
- 84969920322
- Training deep convolutional neural networks to play go
- Clark, Christopher and Storkey, Amos. Training deep convolutional neural networks to play go. In Proceedings of the 32nd International Conference on Machine Learning (ICML-15), pp. 1766–1774, 2015.
- (2015) Proceedings of the 32nd International Conference on Machine Learning (ICML-15) , pp. 1766-1774
- Clark, C.¹ Storkey, A.²

4
- 0003582853
- Enzenberger, Markus. The integration of a priori knowledge into a go playing neural network. URL: http://www. markus-enzenberger. de/neurogo. html, 1996.
- (1996) The Integration of A Priori Knowledge into A Go Playing Neural Network
- Enzenberger, M.¹

5
- 78951484236
- Fuegoan open-source framework for board games and go engine based on monte carlo tree search
- Enzenberger, Markus, Müller, Martin, Arneson, Broderick, and Segal, Richard. Fuegoan open-source framework for board games and go engine based on monte carlo tree search. Computational Intelligence and AI in Games, IEEE Transactions on, 2(4):259–270, 2010.
- (2010) Computational Intelligence and AI in Games, IEEE Transactions on , vol.2 , Issue.4 , pp. 259-270
- Enzenberger, M.¹ Müller, M.² Arneson, B.³ Segal, R.⁴

6
- 84954187031
- Adaptive playouts in monte-carlo tree search with policy-gradient reinforcement learning
- Springer
- Graf, Tobias and Platzner, Marco. Adaptive playouts in monte-carlo tree search with policy-gradient reinforcement learning. In Advances in Computer Games, pp. 1–11. Springer, 2015.
- (2015) Advances in Computer Games , pp. 1-11
- Graf, T.¹ Platzner, M.²

7
- 84958589374
- arXiv preprint
- He, Kaiming, Zhang, Xiangyu, Ren, Shaoqing, and Sun, Jian. Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385, 2015.
- (2015) Deep Residual Learning for Image Recognition
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

8
- 33750293964
- Bandit based monte-carlo planning
- Springer
- Kocsis, Levente and Szepesvári, Csaba. Bandit based monte-carlo planning. In Machine Learning: ECML 2006, pp. 282–293. Springer, 2006.
- (2006) Machine Learning: ECML 2006 , pp. 282-293
- Kocsis, L.¹ Szepesvári, C.²

9
- 84898938510
- Actor-critic algorithms
- Konda, Vijay R and Tsitsiklis, John N. Actor-critic algorithms. In NIPS, volume 13, pp. 1008–1014, 1999.
- (1999) NIPS , vol.13 , pp. 1008-1014
- Konda, V.R.¹ Tsitsiklis, J.N.²

10
- 85083951314
- Maddison, Chris J, Huang, Aja, Sutskever, Ilya, and Silver, David. Move evaluation in go using deep convolutional neural networks. 2015.
- (2015) Move Evaluation in Go Using Deep Convolutional Neural Networks
- Maddison, C.J.¹ Huang, A.² Sutskever, I.³ Silver, D.⁴

11
- 0031682491
- Evolving neural networks to play go
- Richards, Norman, Moriarty, David E, and Miikkulainen, Risto. Evolving neural networks to play go. Applied Intelligence, 8(1):85–96, 1998.
- (1998) Applied Intelligence , vol.8 , Issue.1 , pp. 85-96
- Richards, N.¹ Moriarty, D.E.² Miikkulainen, R.³

12
- 0000433333
- Temporal difference learning of position evaluation in the game of go
- Schraudolph, Nicol N, Dayan, Peter, and Sejnowski, Terrence J. Temporal difference learning of position evaluation in the game of go. Advances in Neural Information Processing Systems, pp. 817–817, 1994.
- (1994) Advances in Neural Information Processing Systems , pp. 817
- Schraudolph, N.N.¹ Dayan, P.² Sejnowski, T.J.³

13
- 78951495405
- Doctor of philosophy, University of Alberta
- Silver, David. Reinforcement learning and simulation-based search. Doctor of philosophy, University of Alberta, 2009.
- (2009) Reinforcement Learning and Simulation-Based Search
- Silver, D.¹

14
- 84963949906
- Mastering the game of go with deep neural networks and tree search
- Silver, David, Huang, Aja, Maddison, Chris J., Guez, Arthur, Sifre, Laurent, van den Driessche, George, Schrittwieser, Julian, Antonoglou, Ioannis, Panneershelvam, Veda, Lanctot, Marc, Diele-man, Sander, Grewe, Dominik, Nham, John, Kalchbrenner, Nal, Sutskever, Ilya, Lillicrap, Timothy, Leach, Madeleine, Kavukcuoglu, Koray, Graepel, Thore, and Hassabis, Demis. Mastering the game of go with deep neural networks and tree search. Nature, 2016.
- (2016) Nature
- Silver, D.¹ Huang, A.² Maddison, C.J.³ Guez, A.⁴ Sifre, L.⁵ Van Den Driessche, G.⁶ Schrittwieser, J.⁷ Antonoglou, I.⁸ Panneershelvam, V.⁹ Lanctot, M.¹⁰ Dieleman, S.¹¹ Grewe, D.¹² Nham, J.¹³ Kalchbrenner, N.¹⁴ Sutskever, I.¹⁵ Lillicrap, T.¹⁶ Leach, M.¹⁷ Kavukcuoglu, K.¹⁸ Graepel, T.¹⁹ Hassabis, D.²⁰ more..

15
- 52049104037
- Mimicking go experts with convolutional neural networks
- Springer
- Sutskever, Ilya and Nair, Vinod. Mimicking go experts with convolutional neural networks. In Artificial Neural Networks-ICANN 2008, pp. 101–110. Springer, 2008.
- (2008) Artificial Neural Networks-ICANN 2008 , pp. 101-110
- Sutskever, I.¹ Nair, V.²

16
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- Citeseer
- Sutton, Richard S, McAllester, David A, Singh, Satinder P, Mansour, Yishay, et al. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pp. 1057–1063. Citeseer, 1999.
- (1999) NIPS , vol.99 , pp. 1057-1063
- Sutton, R.S.¹ McAllester, D.A.² Singh, S.P.³ Mansour, Y.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.