Volume , Issue , 2001, Pages 529-534

Temporal difference learning applied to a high-performance game-playing program

Author keywords

[No Author keywords available]

Indexed keywords

EVALUATION FUNCTION; GAME-PLAYING PROGRAMS; MAN MACHINES; TD-LEARNING; TEMPORAL DIFFERENCE LEARNING; WORLD CLASS

EID: 0038145011     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (75)

References (16)
  • 2
    • 0004870198
    • Experiments in parameter learning using temporal differences
    • J. Baxter, A. Tridgell, and L. Weaver. Experiments in parameter learning using temporal differences. ICCA Journal, 21(2):84-99, 1998.
    • (1998) ICCA Journal , vol.21 , Issue.2 , pp. 84-99
    • Baxter, J.1    Tridgell, A.2    Weaver, L.3
  • 3
    • 0002882372
    • KnightCap: A chess program that learns by combining TD(λ) with game-tree search
    • J. Baxter, A. Tridgell, and L. Weaver. KnightCap: A chess program that learns by combining TD(λ) with game-tree search. ICML, pages 28-36, 1998.
    • (1998) ICML , pp. 28-36
    • Baxter, J.1    Tridgell, A.2    Weaver, L.3
  • 4
    • 0034275416
    • Learning to play chess using temporal differences
    • J. Baxter, A. Tridgell, and L. Weaver. Learning to play chess using temporal differences. Machine Learning, 40(3):243-263, 2000.
    • (2000) Machine Learning , vol.40 , Issue.3 , pp. 243-263
    • Baxter, J.1    Tridgell, A.2    Weaver, L.3
  • 5
    • 0004502426
    • Learning piece values using temporal differences
    • D. Beal. Learning piece values using temporal differences. ICCA Journal, 20(3):147-151, 1997.
    • (1997) ICCA Journal , vol.20 , Issue.3 , pp. 147-151
    • Beal, D.1
  • 8
    • 0000249150
    • Statistical feature combination for the evaluation of game positions
    • M. Buro. Statistical feature combination for the evaluation of game positions. JAIR, 3:373-382, 1995.
    • (1995) JAIR , vol.3 , pp. 373-382
    • Buro, M.1
  • 9
    • 84880892796
    • Improving heuristic mini-max search by supervised learning
    • To appear
    • M. Buro. Improving heuristic mini-max search by supervised learning. Artificial Intelligence, 2001. To appear.
    • (2001) Artificial Intelligence
    • Buro, M.1
  • 10
    • 65849284789
    • Automatic feature generation for problem solving systems
    • T. Fawcett and P. Utgoff. Automatic feature generation for problem solving systems. ICML, pages 144-153, 1992.
    • (1992) ICML , pp. 144-153
    • Fawcett, T.1    Utgoff, P.2
  • 14
    • 0029276036
    • Temporal difference learning and TD-Gammon
    • G. Tesauro. Temporal difference learning and TD-Gammon. CACM, 38(3):58-68, 1995.
    • (1995) CACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.