-
1
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
R.S. Sutton, Learning to predict by the methods of temporal differences, Machine Learning 3 (1988) 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
2
-
-
85031550944
-
Temporal coherence and prediction decay in temporal difference learning
-
Department of Computer Science, Queen Mary and Westfield College, University of London
-
D.F. Beal, M.C. Smith, Temporal coherence and prediction decay in temporal difference learning, Technical Report no. 756, Department of Computer Science, Queen Mary and Westfield College, University of London, 1998.
-
(1998)
Technical Report
, vol.756
-
-
Beal, D.F.1
Smith, M.C.2
-
3
-
-
0007943864
-
Machine learning in computer chess: The next generation
-
J. Fürnkranz, Machine learning in computer chess: the next generation, International Computer Chess Association Journal 19 (3) (1996) 147-161.
-
(1996)
International Computer Chess Association Journal
, vol.19
, Issue.3
, pp. 147-161
-
-
Fürnkranz, J.1
-
4
-
-
0004867273
-
Evaluation tuning for computer chess: Linear discriminant methods
-
T.S. Anantharaman, Evaluation tuning for computer chess: linear discriminant methods, International Computer Chess Association Journal 20 (4) (1997) 224-242.
-
(1997)
International Computer Chess Association Journal
, vol.20
, Issue.4
, pp. 224-242
-
-
Anantharaman, T.S.1
-
5
-
-
0001046225
-
Practical issues in temporal difference learning
-
G. Tesauro, Practical issues in temporal difference learning, Machine Learning 8 (1992) 257-277.
-
(1992)
Machine Learning
, vol.8
, pp. 257-277
-
-
Tesauro, G.1
-
6
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program, achieves master level play
-
G. Tesauro, TD-Gammon, a self-teaching backgammon program, achieves master level play, Neural Computation 6 (2) (1994) 215-220.
-
(1994)
Neural Computation
, vol.6
, Issue.2
, pp. 215-220
-
-
Tesauro, G.1
-
8
-
-
0000430514
-
The convergence of TD (λ) for general λ
-
P. Dayan, The convergence of TD (λ) for general λ, Machine Learning 8 (1992) 341-362.
-
(1992)
Machine Learning
, vol.8
, pp. 341-362
-
-
Dayan, P.1
-
10
-
-
0024137490
-
Increased rates of convergence through learning rate adaptation
-
R.A. Jacobs, Increased rates of convergence through learning rate adaptation, Neural Networks 1 (1988) 295-307.
-
(1988)
Neural Networks
, vol.1
, pp. 295-307
-
-
Jacobs, R.A.1
-
12
-
-
0000238476
-
Natural developments in game research: From chess to shogi to go
-
H. Matsubara, H. Iida, R. Grimbergen, Natural developments in game research: from chess to shogi to go, International Computer Chess Association Journal 19 (2) (1996) 103-112.
-
(1996)
International Computer Chess Association Journal
, vol.19
, Issue.2
, pp. 103-112
-
-
Matsubara, H.1
Iida, H.2
Grimbergen, R.3
-
13
-
-
0342295641
-
-
available from many sources, including
-
M. Mutz, Gnu Shogi v1.2p03, 1994 (available from many sources, including ftp://ftp.uni-passau.de/pub/local/shogi).
-
(1994)
Gnu Shogi V1.2p03
-
-
Mutz, M.1
|