메뉴 건너뛰기




Volumn , Issue , 2016, Pages

Better computer go player with neural network and long-term prediction

Author keywords

[No Author keywords available]

Indexed keywords

BUDGET CONTROL; NEURAL NETWORKS; PATTERN MATCHING;

EID: 85083953106     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (43)

References (16)
  • 6
    • 84954187031 scopus 로고    scopus 로고
    • Adaptive playouts in monte-carlo tree search with policy-gradient reinforcement learning
    • Springer
    • Graf, Tobias and Platzner, Marco. Adaptive playouts in monte-carlo tree search with policy-gradient reinforcement learning. In Advances in Computer Games, pp. 1–11. Springer, 2015.
    • (2015) Advances in Computer Games , pp. 1-11
    • Graf, T.1    Platzner, M.2
  • 8
    • 33750293964 scopus 로고    scopus 로고
    • Bandit based monte-carlo planning
    • Springer
    • Kocsis, Levente and Szepesvári, Csaba. Bandit based monte-carlo planning. In Machine Learning: ECML 2006, pp. 282–293. Springer, 2006.
    • (2006) Machine Learning: ECML 2006 , pp. 282-293
    • Kocsis, L.1    Szepesvári, C.2
  • 9
    • 84898938510 scopus 로고    scopus 로고
    • Actor-critic algorithms
    • Konda, Vijay R and Tsitsiklis, John N. Actor-critic algorithms. In NIPS, volume 13, pp. 1008–1014, 1999.
    • (1999) NIPS , vol.13 , pp. 1008-1014
    • Konda, V.R.1    Tsitsiklis, J.N.2
  • 15
    • 52049104037 scopus 로고    scopus 로고
    • Mimicking go experts with convolutional neural networks
    • Springer
    • Sutskever, Ilya and Nair, Vinod. Mimicking go experts with convolutional neural networks. In Artificial Neural Networks-ICANN 2008, pp. 101–110. Springer, 2008.
    • (2008) Artificial Neural Networks-ICANN 2008 , pp. 101-110
    • Sutskever, I.1    Nair, V.2
  • 16
    • 84898939480 scopus 로고    scopus 로고
    • Policy gradient methods for reinforcement learning with function approximation
    • Citeseer
    • Sutton, Richard S, McAllester, David A, Singh, Satinder P, Mansour, Yishay, et al. Policy gradient methods for reinforcement learning with function approximation. In NIPS, volume 99, pp. 1057–1063. Citeseer, 1999.
    • (1999) NIPS , vol.99 , pp. 1057-1063
    • Sutton, R.S.1    McAllester, D.A.2    Singh, S.P.3    Mansour, Y.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.