메뉴 건너뛰기




Volumn , Issue , 2008, Pages 205-211

An othello evaluation function based on temporal difference learning using probability of winning

Author keywords

[No Author keywords available]

Indexed keywords

EVALUATION FUNCTION; MONTE CARLO SIMULATION; OTHELLO; REINFORCEMENT LEARNING METHOD; TEACHING EVALUATION; TEMPORAL DIFFERENCE LEARNING; TERMINAL POSITION; TESTING ENVIRONMENT; WINNING PROBABILITY;

EID: 70349275749     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CIG.2008.5035641     Document Type: Conference Paper
Times cited : (12)

References (9)
  • 1
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-gammon
    • Gerald Tesauro,"Temporal difference learning and TD-gammon", Communications of the ACM, 38(3):pp.58-68,(1994).
    • (1994) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 2
    • 0004870198 scopus 로고    scopus 로고
    • Experiments in parameter learning using temporal differences
    • (June)
    • Jonathan Baxter, Andrew Tridgell, Lex Weaver,"Experiments in Parameter Learning Using Temporal Differences", ICGA Journal (June):pp.84-99,(1998).
    • (1998) ICGA Journal , pp. 84-99
    • Baxter, J.1    Tridgell, A.2    Weaver, L.3
  • 4
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Richard Sutton,"Learning to predict by the methods of temporal diffe-rences",Machine Learning(3):pp.9-44,(1988).
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.1
  • 5
    • 34548711715 scopus 로고    scopus 로고
    • Temporal difference learning of an othello evaluation function for a small neural network with shared weights
    • Edward P.Manning,"Temporal Difference Learning of an Othello Eva-luation Function for a Small Neural Network with Shared Weights", IEEE Symposium on Computational Intelligence and Games (2007): pp.216-223.
    • (2007) IEEE Symposium on Computational Intelligence and Games , pp. 216-223
    • Manning, E.P.1
  • 6
    • 45149102912 scopus 로고    scopus 로고
    • Temporal difference learning versus co-evolution for acquiring othello position evaluation
    • Simon M.Lucas, Thomas P.Runarsson,"Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation", IEEE Symposium on Computational Intelligence and Games (2006): pp, 52-59.
    • (2006) IEEE Symposium on Computational Intelligence and Games , pp. 52-59
    • Lucas, S.M.1    Runarsson, T.P.2
  • 8
    • 34250659969 scopus 로고    scopus 로고
    • Modification of UCT with patterns in Monte-Carlo Go
    • Sylvain Gelly, Yizao Wang, Remi Munos, Olivier Teytaud, "Modification of UCT with Patterns in Monte-Carlo Go", RR-6062-INRIA (2006):pp.1-19.
    • (2006) RR-6062-INRIA , pp. 1-19
    • Gelly, S.1    Wang, Y.2    Munos, R.3    Teytaud, O.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.