-
3
-
-
0002882372
-
Knightcap: A chess program that learns by combining TD(A) with game-tree search
-
Madison, WI: Morgan Kaufmann
-
Baxter, J., Trigdell, A., & Weaver, L. (1998). Knightcap: A chess program that learns by combining TD(A) with game-tree search. Proceedings of the Fifteenth International Conference on Machine Learning (pp. 28-36). Madison, WI: Morgan Kaufmann.
-
(1998)
Proceedings of the Fifteenth International Conference on Machine Learning
, pp. 28-36
-
-
Baxter, J.1
Trigdell, A.2
Weaver, L.3
-
4
-
-
84880891360
-
Symbolic dynamic programming for first order MDPs
-
Seattle, Washington
-
Boutilier, C., Reiter, R., & Price, B. (2001). Symbolic dynamic programming for first order MDPs. Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (pp. 690-697). Seattle, Washington.
-
(2001)
Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence
, pp. 690-697
-
-
Boutilier, C.1
Reiter, R.2
Price, B.3
-
5
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
7
-
-
84948172455
-
Speeding up relational reinforcement learning through the use of an incremental first order decision tree learning
-
Freiburg, Germany
-
Driessens, K., Ramon, J., & Blockeel, H. (2001). Speeding up relational reinforcement learning through the use of an incremental first order decision tree learning. Proceedings of the Twelfth European Conference on Machine Learning (pp. 97-108). Freiburg, Germany.
-
(2001)
Proceedings of the Twelfth European Conference on Machine Learning
, pp. 97-108
-
-
Driessens, K.1
Ramon, J.2
Blockeel, H.3
-
8
-
-
0035312760
-
Relational reinforcement learning
-
Dzeroski, S., Raedt, L. D., & Driessens, K. (2001). Relational reinforcement learning. Machine Learning, 43, 7-52.
-
(2001)
Machine Learning
, vol.43
, pp. 7-52
-
-
Dzeroski, S.1
Raedt, L.D.2
Driessens, K.3
-
10
-
-
13444258086
-
Learning domain-specific control knowledge from random walks
-
Whistler, British Columbia
-
Fern, A., Yoon, S. W., & Givan, R. (2004). Learning domain-specific control knowledge from random walks. Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (pp. 191-199). Whistler, British Columbia.
-
(2004)
Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling
, pp. 191-199
-
-
Fern, A.1
Yoon, S.W.2
Givan, R.3
-
11
-
-
84880803349
-
Generalizing plans to new environments in relational mdps
-
Acapulco, Mexico
-
Guestrin, C., Koller, D., Gearhart, C., & Kanodia, N. (2003). Generalizing plans to new environments in relational mdps. Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence (pp. 1003-1010). Acapulco, Mexico.
-
(2003)
Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence
, pp. 1003-1010
-
-
Guestrin, C.1
Koller, D.2
Gearhart, C.3
Kanodia, N.4
-
12
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L. P., Littman, M. L., & Moore, A. W. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
13
-
-
84898646291
-
Chess neighborhoods, function combination, and reinforcement learning
-
Hamamatsu, Japan
-
Levinson, R., & Weber, R. (2000). Chess neighborhoods, function combination, and reinforcement learning. Proceedings of the Second International Conference on Computers and Games (pp. 133-150). Hamamatsu, Japan.
-
(2000)
Proceedings of the Second International Conference on Computers and Games
, pp. 133-150
-
-
Levinson, R.1
Weber, R.2
-
14
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
New Brunswick, NJ: Morgan Kaufmann
-
Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. Proceedings of the Eleventh International Conference on Machine Learning (pp. 157-163). New Brunswick, NJ: Morgan Kaufmann.
-
(1994)
Proceedings of the Eleventh International Conference on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
16
-
-
0033570798
-
A unified analysis of value-function-based reinforcement learning algorithms
-
Szepesvari, C., & Littman, M. (1999). A unified analysis of value-function-based reinforcement learning algorithms. Neural Computation, 11, 2017-2060.
-
(1999)
Neural Computation
, vol.11
, pp. 2017-2060
-
-
Szepesvari, C.1
Littman, M.2
-
17
-
-
26944455336
-
Relational reinforcement learning: An overview
-
Banff, Alberta
-
Tadepalli, P., Givan, R., & Driessens, K. (2004). Relational reinforcement learning: An overview. Proceedings of the ICML'04 Workshop on Relational Reinforcement Learning (pp. 1-9). Banff, Alberta.
-
(2004)
Proceedings of the ICML'04 Workshop on Relational Reinforcement Learning
, pp. 1-9
-
-
Tadepalli, P.1
Givan, R.2
Driessens, K.3
-
18
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program
-
Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program. Neural Computation, 6, 215-219.
-
(1994)
Neural Computation
, vol.6
, pp. 215-219
-
-
Tesauro, G.1
|