-
1
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program, achieves master-level play
-
G. Tesauro, "TD-Gammon, a self-teaching backgammon program, achieves master-level play," Neural Comput., vol. 6. no. 2, pp. 215-219, 1994.
-
(1994)
Neural Comput.
, vol.6
, Issue.2
, pp. 215-219
-
-
Tesauro, G.1
-
3
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
A. G. Barto, S. J. Bradtke, and S. P. Singh, "Learning to act using real-time dynamic programming," Artif. Intell., vol. 72, no. 1, pp. 81-138, 1995.
-
(1995)
Artif. Intell.
, vol.72
, Issue.1
, pp. 81-138
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
4
-
-
0000929496
-
Multiagent reinforcement learning: Theoretical framework and an algorithm
-
Madison, WI, July
-
J. Hu and M. P. Wellman, "Multiagent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Machine Learning. Madison, WI, July 1998, pp. 242-250.
-
(1998)
Proc. 15th Int. Conf. Machine Learning
, pp. 242-250
-
-
Hu, J.1
Wellman, M.P.2
-
5
-
-
85152198941
-
Multi-agent reinforcement learning: Independent vs. cooperative agents
-
Amherst, MA
-
M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in Proc. 10th Int. Conf. Machine Learning, Amherst, MA, 1993, pp. 330-337.
-
(1993)
Proc. 10th Int. Conf. Machine Learning
, pp. 330-337
-
-
Tan, M.1
-
8
-
-
85012688561
-
-
Princeton, NJ: Princeton Univ. Press
-
R. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
9
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, pp. 237-285, 1996.
-
(1996)
J. Artif. Intell. Res.
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
11
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less real time
-
A. W. Moore and C. G. Atkeson, "Prioritized sweeping: Reinforcement learning with less data and less real time," Mach. Learn., vol. 13, pp. 103-130, 1993.
-
(1993)
Mach. Learn.
, vol.13
, pp. 103-130
-
-
Moore, A.W.1
Atkeson, C.G.2
-
13
-
-
84988772855
-
The Internet fish construction kit
-
Santa Clara, CA, Apr
-
B. A. LaMacchia, "The internet fish construction kit," in Proc. 6th World Wide Web Conf., Santa Clara, CA, Apr. 1997, pp. 277-288.
-
(1997)
Proc. 6th World Wide Web Conf.
, pp. 277-288
-
-
Lamacchia, B.A.1
-
14
-
-
0141514593
-
Information retrieval in distributed hypertexts
-
New York, Oct
-
P. De Bra, G. J. Houben, Y. Kornatzky, and R. Post, ''Information retrieval in distributed hypertexts," presented at the Proc. Intelligent Multimedia Retrieval Systems Management, New York, Oct. 1994.
-
(1994)
Proc. Intelligent Multimedia Retrieval Systems Management
-
-
De Bra, P.1
Houben, G.J.2
Kornatzky, Y.3
Post, R.4
|