SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2013, Pages 108-115

Reinforcement learning in the game of Othello: Learning against a fixed opponent and learning from self-play

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL AGENTS; LEARNING AGENTS; MULTI-LAYER PERCEPTRONS; OTHELLO; Q-LEARNING; SELF-PLAY; TD-LEARNING;

ARTIFICIAL INTELLIGENCE; DYNAMIC PROGRAMMING; REINFORCEMENT LEARNING;

LEARNING ALGORITHMS;

EID: 84891544852 PISSN: 23251824 EISSN: 23251867 Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2013.6614996 Document Type: Conference Paper

Times cited : (56)

References (18)

1
- 0004102479
- The MIT press, Cambridge MA, A Bradford Book
- R. Sutton and A. Barto, Reinforcement learning: An introduction. The MIT press, Cambridge MA, A Bradford Book, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

2
- 84873574800
- Springer
- M. Wiering and M. van Ottelo, Eds., Reinforcement Learning: State-ofthe-art. Springer, 2012.
- (2012) Reinforcement Learning: State-ofthe-art
- Wiering, M.¹ Van Ottelo, M.²

3
- 0029276036
- Temporal difference learning and TD-Gammon
- G. Tesauro, "Temporal difference learning and TD-Gammon," Communications of the ACM, vol. 38, pp. 58-68, 1995.
- (1995) Communications of the ACM , vol.38 , pp. 58-68
- Tesauro, G.¹

4
- 0003215153
- Learning to play the game of chess
- S. Thrun, "Learning to play the game of chess," Advances in Neural Information Processing Systems, vol. 7, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7
- Thrun, S.¹

6
- 0000433333
- Temporal difference learning of position evaluation in the game of go
- N. Schraudolph, P. Dayan, and T. Sejnowski, "Temporal difference learning of position evaluation in the game of go," Advances in Neural Information Processing Systems, pp. 817-817, 1994.
- (1994) Advances in Neural Information Processing Systems , pp. 817-817
- Schraudolph, N.¹ Dayan, P.² Sejnowski, T.³

7
- 21844502480
- Discovering complex othello strategies through evolutionary neural networks
- D. Moriarty and R. Miikkulainen, "Discovering complex othello strategies through evolutionary neural networks," Connection Science, vol. 7, no. 3, pp. 195-210, 1995.
- (1995) Connection Science , vol.7 , Issue.3 , pp. 195-210
- Moriarty, D.¹ Miikkulainen, R.²

8
- 70349278434
- Learning to play othello with n-tuple systems
- S. Lucas, "Learning to play othello with n-tuple systems," Australian Journal of Intelligent Information Processing, vol. 4, pp. 1-20, 2008.
- (2008) Australian Journal of Intelligent Information Processing , vol.4 , pp. 1-20
- Lucas, S.¹

9
- 84876914496
- Neural-fitted td-learning for playing othello with structured neural networks
- S. van den Dries and M. Wiering, "Neural-fitted td-learning for playing othello with structured neural networks," IEEE Transactions on Neural Networks and Learning Systems, vol. 23, no. 11, pp. 1701-1713, 2012.
- (2012) IEEE Transactions on Neural Networks and Learning Systems , vol.23 , Issue.11 , pp. 1701-1713
- Van Den Dries, S.¹ Wiering, M.²

10
- 33847202724
- Learning to predict by the methods of temporal differences
- R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

11
- 34249833101
- Q-learning
- C. Watkins and P. Dayan, "Q-learning," Machine learning, vol. 8, no. 3, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

12
- 35349027192
- Application of reinforcement learning to the game of othello
- N. van Eck and M. van Wezel, "Application of reinforcement learning to the game of othello," Computers &Operations Research, vol. 35, no. 6, pp. 1999-2017, 2008.
- (2008) Computers &Operations Research , vol.35 , Issue.6 , pp. 1999-2017
- Van Eck, N.¹ Van Wezel, M.²

13
- 0003636089
- Technical Report, University of Cambridge, Department of Engineering
- G. Rummery and M. Niranjan, On-line Q-learning using connectionist systems. Technical Report, University of Cambridge, Department of Engineering, 1994.
- (1994) On-line Q-learning Using Connectionist Systems
- Rummery, G.¹ Niranjan, M.²

14
- 82655164054
- Self-play and using an expert to learn to play backgammon with temporal difference learning
- M. Wiering, "Self-play and using an expert to learn to play backgammon with temporal difference learning," Journal of Intelligent Learning Systems and Applications, vol. 2, no. 2, pp. 57-68, 2010.
- (2010) Journal of Intelligent Learning Systems and Applications , vol.2 , Issue.2 , pp. 57-68
- Wiering, M.¹

16
- 0000249150
- Statistical feature combination for the evaluation of game positions
- -, "Statistical feature combination for the evaluation of game positions," Journal of Artificial Intelligence Research, vol. 3, pp. 373-382, 1995.
- (1995) Journal of Artificial Intelligence Research , vol.3 , pp. 373-382
- Buro, M.¹

17
- 45149102912
- Temporal difference learning versus coevolution for acquiring othello position evaluation
- S. Lucas and T. Runarsson, "Temporal difference learning versus coevolution for acquiring othello position evaluation," in Computational Intelligence and Games, 2006 IEEE Symposium on, 2006, pp. 52-59.
- (2006) Computational Intelligence and Games, 2006 IEEE Symposium on , pp. 52-59
- Lucas, S.¹ Runarsson, T.²

18
- 0033342921
- Strategy acquisition for the game othello based on reinforcement learning
- T. Yoshioka and S. Ishii, "Strategy acquisition for the game othello based on reinforcement learning," IEICE Transactions on Information and Systems, vol. 82, no. 12, pp. 1618-1626, 1999.
- (1999) IEICE Transactions on Information and Systems , vol.82 , Issue.12 , pp. 1618-1626
- Yoshioka, T.¹ Ishii, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.