-
1
-
-
0029679044
-
-
1996.
-
L.P. Kaclbling, M.L. Littman, and A.W. Moore, "Reinforcement learning: A survey," J. Artificial Intell. Res., vol.4, pp.237-285, 1996.
-
M.L. Littman, and A.W. Moore, "Reinforcement Learning: a Survey," J. Artificial Intell. Res., Vol.4, Pp.237-285
-
-
Kaclbling, L.P.1
-
3
-
-
85027101010
-
-
1998.
-
J. Baxter, A. Tridgell, and L. Weaver, "A chess program that learns by combining TD(A) with game-tree search," Proc. 15th Int. Conf. Machine Learning, pp.28-36, 1998.
-
A. Tridgell, and L. Weaver, "A Chess Program that Learns by Combining TD(A) with Game-tree Search," Proc. 15th Int. Conf. Machine Learning, Pp.28-36
-
-
Baxter, J.1
-
4
-
-
85027114828
-
-
1994.
-
N.N. Schraudolph, P. Dayan, and T.J. Sejnowski, "Temporal difference learning of position evaluation in the game of Go," Adv. Neural Inf. Proc. Syst., vol.6, pp.817-824, 1994.
-
P. Dayan, and T.J. Sejnowski, "Temporal Difference Learning of Position Evaluation in the Game of Go," Adv. Neural Inf. Proc. Syst., Vol.6, Pp.817-824
-
-
Schraudolph, N.N.1
-
6
-
-
58049085683
-
-
1989.
-
J. Moody and C.J. Darken, "Fast learning in networks of locally-tuned processing units," Neural Computation, vol.1, pp.281-294, 1989.
-
"Fast Learning in Networks of Locally-tuned Processing Units," Neural Computation, Vol.1, Pp.281-294
-
-
Moody, J.1
Darken, C.J.2
-
7
-
-
85027113272
-
-
1998.
-
T. Yoshioka, S. Ishii, and M. Ito, "Strategy acquisition of the game "Othello" based on reinforcement learning," Int. Conf. Neural Info. Proc., pp.841-844, 1998.
-
S. Ishii, and M. Ito, "Strategy Acquisition of the Game "Othello" Based on Reinforcement Learning," Int. Conf. Neural Info. Proc., Pp.841-844
-
-
Yoshioka, T.1
-
8
-
-
79957749002
-
-
1995.
-
M.E. Harmon, L.C. Baird, and A.H. Klopf, "Reinforcement learning applied to a differential game," Adaptive Behavior, vol.4, no.l, 1995.
-
L.C. Baird, and A.H. Klopf, "Reinforcement Learning Applied to a Differential Game," Adaptive Behavior, Vol.4, No.l
-
-
Harmon, M.E.1
-
9
-
-
34249833101
-
-
1992.
-
C.J.C.H. Watkins and P. Dayan, "Q-Learning," Machine Learning, vol.8, pp.279-292, 1992.
-
"Q-Learning," Machine Learning, Vol.8, Pp.279-292
-
-
Watkins, C.J.C.1
Dayan, P.2
-
11
-
-
85027132093
-
-
1996.
-
A. Leouski and P. Utgoff, "What a neural network can learn about Othello," Technical Report, 90-10, University of Massachusetts, Amherst, 1996.
-
"What a Neural Network Can Learn about Othello," Technical Report, 90-10, University of Massachusetts, Amherst
-
-
Leouski, A.1
Utgoff, P.2
-
13
-
-
0025419413
-
-
1990.
-
K.F. Lee and S. Mahajan, "The development of a world class Othello program," Artificial Intelligence, vol.43, pp.21-36, 1990.
-
"The Development of a World Class Othello Program," Artificial Intelligence, Vol.43, Pp.21-36
-
-
Lee, K.F.1
Mahajan, S.2
-
15
-
-
85027135771
-
-
1995.
-
J.A. Boyan and A.W. Moore, "Generalization in reinforcement learning: Safely approximation the value function," Advances in Neural Information Processing Systems 7, pp.369-376, MIT Press, 1995.
-
"Generalization in Reinforcement Learning: Safely Approximation the Value Function," Advances in Neural Information Processing Systems 7, Pp.369-376, MIT Press
-
-
Boyan, J.A.1
Moore, A.W.2
-
16
-
-
0038501238
-
-
1996.
-
S. Schaal and C.C. Atkeson, "From isolation to cooperation: An alternative view of a system of experts," Advances in Neural Information Processing Systems 8, pp.605-611, MIT Press, 1996.
-
"From Isolation to Cooperation: an Alternative View of a System of Experts," Advances in Neural Information Processing Systems 8, Pp.605-611, MIT Press
-
-
Schaal, S.1
Atkeson, C.C.2
-
17
-
-
0032312876
-
-
1998.
-
J. Morimoto and K. Doya, "Reinforcement learning of dynamic motor sequence: learning to stand up," Proc. IEEE/RSJ Int. Conf. Intell. Robots & Syst., vol.3, pp.17211726, 1998.
-
"Reinforcement Learning of Dynamic Motor Sequence: Learning to Stand Up," Proc. IEEE/RSJ Int. Conf. Intell. Robots & Syst., Vol.3, Pp.17211726
-
-
Morimoto, J.1
Doya, K.2
-
18
-
-
0002997066
-
-
1999.
-
M. Sato and S. Ishii, "Reinforcement learning based on online EM algorithm," Advances in Neural Information Processing Systems 11, pp.1052-1058, MIT Press, 1999.
-
"Reinforcement Learning Based on Online EM Algorithm," Advances in Neural Information Processing Systems 11, Pp.1052-1058, MIT Press
-
-
Sato, M.1
Ishii, S.2
-
20
-
-
85027197301
-
-
1998.
-
M. Sato and S. Ishii, "On-line EM algorithm for mixture of local experts," Proc. Fifth Int. Conf. Neural Inf. Proc., vol.3, pp.1397-1401, 1998.
-
"On-line EM Algorithm for Mixture of Local Experts," Proc. Fifth Int. Conf. Neural Inf. Proc., Vol.3, Pp.1397-1401
-
-
Sato, M.1
Ishii, S.2
|