-
1
-
-
0002192119
-
Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
-
D. Chapman and L. P. Kaelbling. Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proceedings of IJCAI-91, 1991.
-
Proceedings of IJCAI-91, 1991
-
-
Chapman, D.1
Kaelbling, L.P.2
-
4
-
-
33745615420
-
-
PhD thesis, Department of Computer Science, K.U. Leuven
-
K. Driessens. Relational Reinforcement Learning. PhD thesis, Department of Computer Science, K.U. Leuven, 2004.
-
(2004)
Relational Reinforcement Learning
-
-
Driessens, K.1
-
5
-
-
0035312760
-
Relational reinforcement learning
-
April
-
S. Dzeroski, L. De Raedt, and K. Driessens. Relational reinforcement learning. Machine Learning, 43(1/2):5-52, April 2001.
-
(2001)
Machine Learning
, vol.43
, Issue.1-2
, pp. 5-52
-
-
Dzeroski, S.1
De Raedt, L.2
Driessens, K.3
-
6
-
-
33749683139
-
The thing that we tried didn't work very well: Deictic representation in reinforcement learning
-
S. Finney, N. Gardiol, L. Kaelbling, and T. Oates. The thing that we tried didn't work very well : Deictic representation in reinforcement learning. In Proceedings of the 18th Annual Conference on Uncertainty in Artificial Intelligence, pages 154-161, 2002.
-
(2002)
Proceedings of the 18th Annual Conference on Uncertainty in Artificial Intelligence
, pp. 154-161
-
-
Finney, S.1
Gardiol, N.2
Kaelbling, L.3
Oates, T.4
-
8
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
L.P. Kaebling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2):99-134, 1998.
-
(1998)
Artificial Intelligence
, vol.101
, Issue.1-2
, pp. 99-134
-
-
Kaebling, L.P.1
Littman, M.L.2
Cassandra, A.R.3
-
11
-
-
33750741792
-
Exploiting relational structure to understand publication patterns in high-energy physics
-
December
-
A. McGovern, L. Friedland, M. Hay, B. Gallagher, A. Fast, J. Neville, and D. Jensen. Exploiting relational structure to understand publication patterns in high-energy physics. SIGKDD Explorations, 5(2):165-172, December 2003.
-
(2003)
SIGKDD Explorations
, vol.5
, Issue.2
, pp. 165-172
-
-
McGovern, A.1
Friedland, L.2
Hay, M.3
Gallagher, B.4
Fast, A.5
Neville, J.6
Jensen, D.7
-
12
-
-
23044531386
-
Learning a tsume-go heuristic with TILDE
-
Proceedings of CG2000, the Second International Conference on Computers and Games, Springer-Verlag
-
J. Ramon, T. Francis, and H. Blockeel. Learning a tsume-go heuristic with TILDE. In Proceedings of CG2000, the Second International Conference on Computers and Games, volume 2063 of Lecture Notes in Computer Science, pages 151-169. Springer-Verlag, 2001.
-
(2001)
Lecture Notes in Computer Science
, vol.2063
, pp. 151-169
-
-
Ramon, J.1
Francis, T.2
Blockeel, H.3
-
14
-
-
22644449927
-
A study of two probabilistic methods for searching large spaces with ILP
-
A Srinivasan. A study of two probabilistic methods for searching large spaces with ILP. Data Mining and Knowledge Discovery, 3(1):95-123, 1999.
-
(1999)
Data Mining and Knowledge Discovery
, vol.3
, Issue.1
, pp. 95-123
-
-
Srinivasan, A.1
-
17
-
-
0031246271
-
Decision Tree Induction Based on Efficient Tree Restructuring
-
P. E. Utgoff, N.C. Berkman, and J. A. Clouse. Decision tree induction based on efficient tree restructuring. Machine Learning, 29:5-44, 1997. (Pubitemid 127507172)
-
(1997)
Machine Learning
, vol.29
, Issue.1
, pp. 5-44
-
-
Utgoff, P.E.1
Berkman, N.C.2
Clouse, J.A.3
-
18
-
-
37249061374
-
A survey of reinforcement learning in relational domains
-
Technical Report TR-CTIT-05-31, ISSN 1381-3625
-
M. van Otterlo. A survey of reinforcement learning in relational domains. Technical Report TR-CTIT-05-31, CTIT Technical Report Series, ISSN 1381-3625, 2005.
-
(2005)
CTIT Technical Report Series
-
-
Van Otterlo, M.1
|