Volume 122, Issue 1, 2000, Pages 3-21

Temporal difference learning for heuristic search and game playing

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; GAME THEORY; HEURISTIC METHODS; RANDOM PROCESSES

EID: 0033632826     PISSN: 00200255     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0020-0255(99)00093-6     Document Type: Article
Times cited: 13

References (15)
  • 1
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • R.S. Sutton, Learning to predict by the methods of temporal differences, Machine Learning 3 (1988) 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 2
    • 85031550944
    • Temporal coherence and prediction decay in temporal difference learning
    • Department of Computer Science, Queen Mary and Westfield College, University of London
    • D.F. Beal, M.C. Smith, Temporal coherence and prediction decay in temporal difference learning, Technical Report no. 756, Department of Computer Science, Queen Mary and Westfield College, University of London, 1998.
    • (1998) Technical Report , vol.756
    • Beal, D.F.1    Smith, M.C.2
  • 3
  • 4
    • 0004867273
    • Evaluation tuning for computer chess: Linear discriminant methods
    • T.S. Anantharaman, Evaluation tuning for computer chess: linear discriminant methods, International Computer Chess Association Journal 20 (4) (1997) 224-242.
    • (1997) International Computer Chess Association Journal , vol.20 , Issue.4 , pp. 224-242
    • Anantharaman, T.S.1
  • 5
    • 0001046225
    • Practical issues in temporal difference learning
    • G. Tesauro, Practical issues in temporal difference learning, Machine Learning 8 (1992) 257-277.
    • (1992) Machine Learning , vol.8 , pp. 257-277
    • Tesauro, G.1
  • 6
    • 0000985504
    • TD-Gammon, a self-teaching backgammon program, achieves master level play
    • G. Tesauro, TD-Gammon, a self-teaching backgammon program, achieves master level play, Neural Computation 6 (2) (1994) 215-220.
    • (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-220
    • Tesauro, G.1
  • 8
    • 0000430514
    • The convergence of TD(λ) for general λ
    • P. Dayan, The convergence of TD(λ) for general λ, Machine Learning 8 (1992) 341-362.
    • (1992) Machine Learning , vol.8 , pp. 341-362
    • Dayan, P.1
  • 10
    • 0024137490
    • Increased rates of convergence through learning rate adaptation
    • R.A. Jacobs, Increased rates of convergence through learning rate adaptation, Neural Networks 1 (1988) 295-307.
    • (1988) Neural Networks , vol.1 , pp. 295-307
    • Jacobs, R.A.1
  • 13
    • 0342295641
    • available from many sources, including
    • M. Mutz, Gnu Shogi v1.2p03, 1994 (available from many sources, including ftp://ftp.uni-passau.de/pub/local/shogi).
    • (1994) Gnu Shogi V1.2p03
    • Mutz, M.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.