메뉴 건너뛰기




Volumn , Issue , 2013, Pages 108-115

Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL AGENTS; LEARNING AGENTS; MULTI-LAYER PERCEPTRONS; OTHELLO; Q-LEARNING; SELF-PLAY; TD-LEARNING;

EID: 84891544852     PISSN: 23251824     EISSN: 23251867     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2013.6614996     Document Type: Conference Paper
Times cited : (56)

References (18)
  • 3
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-Gammon
    • G. Tesauro, "Temporal difference learning and TD-Gammon," Communications of the ACM, vol. 38, pp. 58-68, 1995.
    • (1995) Communications of the ACM , vol.38 , pp. 58-68
    • Tesauro, G.1
  • 7
    • 21844502480 scopus 로고
    • Discovering complex othello strategies through evolutionary neural networks
    • D. Moriarty and R. Miikkulainen, "Discovering complex othello strategies through evolutionary neural networks," Connection Science, vol. 7, no. 3, pp. 195-210, 1995.
    • (1995) Connection Science , vol.7 , Issue.3 , pp. 195-210
    • Moriarty, D.1    Miikkulainen, R.2
  • 9
    • 84876914496 scopus 로고    scopus 로고
    • Neural-fitted td-learning for playing othello with structured neural networks
    • S. van den Dries and M. Wiering, "Neural-fitted td-learning for playing othello with structured neural networks," IEEE Transactions on Neural Networks and Learning Systems, vol. 23, no. 11, pp. 1701-1713, 2012.
    • (2012) IEEE Transactions on Neural Networks and Learning Systems , vol.23 , Issue.11 , pp. 1701-1713
    • Van Den Dries, S.1    Wiering, M.2
  • 10
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 11
    • 34249833101 scopus 로고
    • Q-learning
    • C. Watkins and P. Dayan, "Q-learning," Machine learning, vol. 8, no. 3, pp. 279-292, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
    • Watkins, C.1    Dayan, P.2
  • 12
    • 35349027192 scopus 로고    scopus 로고
    • Application of reinforcement learning to the game of othello
    • N. van Eck and M. van Wezel, "Application of reinforcement learning to the game of othello," Computers &Operations Research, vol. 35, no. 6, pp. 1999-2017, 2008.
    • (2008) Computers &Operations Research , vol.35 , Issue.6 , pp. 1999-2017
    • Van Eck, N.1    Van Wezel, M.2
  • 14
    • 82655164054 scopus 로고    scopus 로고
    • Self-play and using an expert to learn to play backgammon with temporal difference learning
    • M. Wiering, "Self-play and using an expert to learn to play backgammon with temporal difference learning," Journal of Intelligent Learning Systems and Applications, vol. 2, no. 2, pp. 57-68, 2010.
    • (2010) Journal of Intelligent Learning Systems and Applications , vol.2 , Issue.2 , pp. 57-68
    • Wiering, M.1
  • 15
    • 84904342386 scopus 로고    scopus 로고
    • The evolution of strong othello programs
    • R. Nakatsu and J. Hoshino, Eds. Kluwer
    • M. Buro, "The evolution of strong othello programs," in Entertainment Computing-Technology and Applications, R. Nakatsu and J. Hoshino, Eds. Kluwer, 2003, pp. 81-88.
    • (2003) Entertainment Computing-Technology and Applications , pp. 81-88
    • Buro, M.1
  • 16
    • 0000249150 scopus 로고
    • Statistical feature combination for the evaluation of game positions
    • -, "Statistical feature combination for the evaluation of game positions," Journal of Artificial Intelligence Research, vol. 3, pp. 373-382, 1995.
    • (1995) Journal of Artificial Intelligence Research , vol.3 , pp. 373-382
    • Buro, M.1
  • 18
    • 0033342921 scopus 로고    scopus 로고
    • Strategy acquisition for the game othello based on reinforcement learning
    • T. Yoshioka and S. Ishii, "Strategy acquisition for the game othello based on reinforcement learning," IEICE Transactions on Information and Systems, vol. 82, no. 12, pp. 1618-1626, 1999.
    • (1999) IEICE Transactions on Information and Systems , vol.82 , Issue.12 , pp. 1618-1626
    • Yoshioka, T.1    Ishii, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.