SCOPUS 정보 검색 플랫폼

2008 IEEE Symposium on Computational Intelligence and Games, CIG 2008

Volumn , Issue , 2008, Pages 205-211

An othello evaluation function based on temporal difference learning using probability of winning

(4) Osaki, Yasuhiro a Shibahara, Kazutomo a Tajima, Yasuhiro a Kotani, Yoshiyuki a

a TOKYO UNIVERSITY OF AGRICULTURE AND TECHNOLOGY (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

EVALUATION FUNCTION; MONTE CARLO SIMULATION; OTHELLO; REINFORCEMENT LEARNING METHOD; TEACHING EVALUATION; TEMPORAL DIFFERENCE LEARNING; TERMINAL POSITION; TESTING ENVIRONMENT; WINNING PROBABILITY;

ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; EDUCATION; GAME THEORY; MONTE CARLO METHODS; REINFORCEMENT;

PROBABILITY;

EID: 70349275749 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/CIG.2008.5035641 Document Type: Conference Paper

Times cited : (12)

References (9)

1
- 0029276036
- Temporal difference learning and TD-gammon
- Gerald Tesauro,"Temporal difference learning and TD-gammon", Communications of the ACM, 38(3):pp.58-68,(1994).
- (1994) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

2
- 0004870198
- Experiments in parameter learning using temporal differences
- (June)
- Jonathan Baxter, Andrew Tridgell, Lex Weaver,"Experiments in Parameter Learning Using Temporal Differences", ICGA Journal (June):pp.84-99,(1998).
- (1998) ICGA Journal , pp. 84-99
- Baxter, J.¹ Tridgell, A.² Weaver, L.³

3
- 0003420416
- MIT Press
- Richard Sutton, Andrew Barto,"Introduction to Reinforcement Learn-ing",MIT Press, 1998.
- (1998) Introduction to Reinforcement Learning
- Sutton, R.¹ Barto, A.²

4
- 33847202724
- Learning to predict by the methods of temporal differences
- Richard Sutton,"Learning to predict by the methods of temporal diffe-rences",Machine Learning(3):pp.9-44,(1988).
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.¹

5
- 34548711715
- Temporal difference learning of an othello evaluation function for a small neural network with shared weights
- Edward P.Manning,"Temporal Difference Learning of an Othello Eva-luation Function for a Small Neural Network with Shared Weights", IEEE Symposium on Computational Intelligence and Games (2007): pp.216-223.
- (2007) IEEE Symposium on Computational Intelligence and Games , pp. 216-223
- Manning, E.P.¹

6
- 45149102912
- Temporal difference learning versus co-evolution for acquiring othello position evaluation
- Simon M.Lucas, Thomas P.Runarsson,"Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation", IEEE Symposium on Computational Intelligence and Games (2006): pp, 52-59.
- (2006) IEEE Symposium on Computational Intelligence and Games , pp. 52-59
- Lucas, S.M.¹ Runarsson, T.P.²

7
- 34548796964
- M.buro, "LOGISTELLO-a strong learning othello program" http://www.cs.ualbeta.ca/~mburo/ps/log-overview.ps.gz.
- LOGISTELLO - A Strong Learning Othello Program
- Buro, M.¹

8
- 34250659969
- Modification of UCT with patterns in Monte-Carlo Go
- Sylvain Gelly, Yizao Wang, Remi Munos, Olivier Teytaud, "Modification of UCT with Patterns in Monte-Carlo Go", RR-6062-INRIA (2006):pp.1-19.
- (2006) RR-6062-INRIA , pp. 1-19
- Gelly, S.¹ Wang, Y.² Munos, R.³ Teytaud, O.⁴

9
- 70349287633
- Computing Elo ratings of move patterns in the game of go
- Remi Coulom,"Computing Elo Ratings of Move Patterns in the Game of Go", Proceeding. of the 6th International Conference on Computers and Games (2007):pp.113-124.
- (2007) Proceeding. of the 6th International Conference on Computers and Games , pp. 113-124
- Coulom, R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.