-
1
-
-
0029679044
-
Reinforcement learning: A survey
-
L. Kaelbling, M. Littman, and A. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.1
Littman, M.2
Moore, A.3
-
3
-
-
27644480261
-
Integrating relevance feedback techniques for image retrieval using reinforcement learning
-
P.-Y. Yin, B. Bhanu, K.-C. Chang, and A. Dong, "Integrating relevance feedback techniques for image retrieval using reinforcement learning," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1536-1551, 2005.
-
(2005)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.27
, Issue.10
, pp. 1536-1551
-
-
Yin, P.-Y.1
Bhanu, B.2
Chang, K.-C.3
Dong, A.4
-
4
-
-
33645712749
-
LEAD: A methodology for learning efficient approaches to medical diagnosis
-
S. Fakih and T. Das, "LEAD: a methodology for learning efficient approaches to medical diagnosis," IEEE Transactions on Information Technology in Biomedicine, vol. 10, no. 2, pp. 220-228, 2006.
-
(2006)
IEEE Transactions on Information Technology in Biomedicine
, vol.10
, Issue.2
, pp. 220-228
-
-
Fakih, S.1
Das, T.2
-
5
-
-
33646492472
-
An efficient dynamic algorithm for maintaining all-pairs shortest paths in stochastic networks
-
S. Misra and B. Oommen, "An efficient dynamic algorithm for maintaining all-pairs shortest paths in stochastic networks," IEEE Transactions on Computers, vol. 55, no. 6, pp. 686-702, 2006.
-
(2006)
IEEE Transactions on Computers
, vol.55
, Issue.6
, pp. 686-702
-
-
Misra, S.1
Oommen, B.2
-
6
-
-
0035312760
-
Relational reinforcement learning
-
S. Džeroski, L. De Raedt, and K. Driessens, "Relational reinforcement learning," Machine Learning, vol. 43, pp. 7-52, 2001.
-
(2001)
Machine Learning
, vol.43
, pp. 7-52
-
-
Džeroski, S.1
De Raedt, L.2
Driessens, K.3
-
7
-
-
37249061374
-
A survey of reinforcement learning in relational domains,
-
1381-3625, July 2005
-
M. Van Otterlo, "A survey of reinforcement learning in relational domains," CTIT Technical Report Series ISSN 1381-3625, July 2005.
-
CTIT Technical Report Series ISSN
-
-
Van Otterlo, M.1
-
9
-
-
13444310066
-
Inductive policy selection for first-order MDPs
-
S. Yoon, A. Fern, and R. Givan, "Inductive policy selection for first-order MDPs," in UAI'02, 2002.
-
(2002)
UAI'02
-
-
Yoon, S.1
Fern, A.2
Givan, R.3
-
10
-
-
40949103747
-
Symbolic learning for adaptive agents
-
J. Cole, K. Lloyd, and K. Ng, "Symbolic learning for adaptive agents," in The Annual Partner Conference, Smart Internet Technology Cooperative Research Centre, 2003.
-
(2003)
The Annual Partner Conference, Smart Internet Technology Cooperative Research Centre
-
-
Cole, J.1
Lloyd, K.2
Ng, K.3
-
11
-
-
84880803349
-
Generalizing plans to new environments in relational MDPs
-
C. Guestrin, D. Koller, C. Gearhart, and N. Kanodia, "Generalizing plans to new environments in relational MDPs," in IJCAI'03, 2003.
-
(2003)
IJCAI'03
-
-
Guestrin, C.1
Koller, D.2
Gearhart, C.3
Kanodia, N.4
-
18
-
-
79551514274
-
Relational sequence alignements
-
T. Gärtner, G. C. Garriga, and T. Meinl, Eds, September
-
A. Karwath and K. Kersting, "Relational sequence alignements," in Proceedings of The 4th International Workshop on Mining and Learning with Graphs (MLG '06), T. Gärtner, G. C. Garriga, and T. Meinl, Eds., September 2006.
-
(2006)
Proceedings of The 4th International Workshop on Mining and Learning with Graphs (MLG '06)
-
-
Karwath, A.1
Kersting, K.2
-
21
-
-
0000826543
-
Negation as failure
-
K. Clark, "Negation as failure." in Logic and Data Bases, 1977, pp. 293-322.
-
(1977)
Logic and Data Bases
, pp. 293-322
-
-
Clark, K.1
-
23
-
-
0029333536
-
An algorithm for probabilistic planning
-
N. Kushmerick, S. Hanks, and D. Weld, "An algorithm for probabilistic planning," Artificial Intelligence, vol. 76, pp. 239-286, 1995.
-
(1995)
Artificial Intelligence
, vol.76
, pp. 239-286
-
-
Kushmerick, N.1
Hanks, S.2
Weld, D.3
-
26
-
-
0033165498
-
Reasoning about noisy sensors and effectors in the situation calculus
-
F. Bacchus, J. Y. Halpern, and H. J. Levesque, "Reasoning about noisy sensors and effectors in the situation calculus," Artificial Intelligence, vol. 111, no. 1-2, pp. 171-208, 1999.
-
(1999)
Artificial Intelligence
, vol.111
, Issue.1-2
, pp. 171-208
-
-
Bacchus, F.1
Halpern, J.Y.2
Levesque, H.J.3
-
27
-
-
78651544767
-
-
P. Mateus, A. Pacheco, and J. Pinto, Observations and the probabilistic situation calculus, in Proceedings of the 8th KR, 2002, pp. 327-340.
-
P. Mateus, A. Pacheco, and J. Pinto, "Observations and the probabilistic situation calculus," in Proceedings of the 8th KR, 2002, pp. 327-340.
-
-
-
-
28
-
-
0346942368
-
Decision-theoretic planning: Structural assumptions and computational leverage
-
C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," Journal of Artificial Intelligence Research (JAIR), vol. 11, pp. 1-94, 1999.
-
(1999)
Journal of Artificial Intelligence Research (JAIR)
, vol.11
, pp. 1-94
-
-
Boutilier, C.1
Dean, T.2
Hanks, S.3
-
29
-
-
0031187203
-
The independent choice logic for modelling multiple agents under uncertainty
-
D. Poole, "The independent choice logic for modelling multiple agents under uncertainty," Artificial Intelligence, vol. 94, no. 1-2, pp. 7-56, 1997.
-
(1997)
Artificial Intelligence
, vol.94
, Issue.1-2
, pp. 7-56
-
-
Poole, D.1
-
30
-
-
84880891360
-
Symbolic dynamic programming for first-order MDPs
-
C. Boutilier, R. Reiter, and B. Price, "Symbolic dynamic programming for first-order MDPs," in Proceedings of the 17th IJCAI, 2001, pp. 690-700.
-
(2001)
Proceedings of the 17th IJCAI
, pp. 690-700
-
-
Boutilier, C.1
Reiter, R.2
Price, B.3
-
31
-
-
84880895882
-
Decisino-theoretic, high-level agent programming in the situatino calculus
-
C. Boutilier, R. Reiter, M. Soutchanski, and S. Thrun, "Decisino-theoretic, high-level agent programming in the situatino calculus," in Proceedings of the 17th AAAI/12th IAAI, 2000, pp. 355-362.
-
(2000)
Proceedings of the 17th AAAI/12th IAAI
, pp. 355-362
-
-
Boutilier, C.1
Reiter, R.2
Soutchanski, M.3
Thrun, S.4
-
32
-
-
0027694534
-
Representing action and change by logic programs
-
M. Gelfoun and V. Lifschitz, "Representing action and change by logic programs," Journal of Logical Programming, vol. 17, no. 2-4, pp. 301-321, 1993.
-
(1993)
Journal of Logical Programming
, vol.17
, Issue.2-4
, pp. 301-321
-
-
Gelfoun, M.1
Lifschitz, V.2
-
33
-
-
0036923234
-
Reasoning about actions in a probabilistic setting
-
C. Baral, N. Tran, and L.-C. Tuan, "Reasoning about actions in a probabilistic setting," in Proceedings of the 18th AAAI/14th IAAI, 2002, pp. 507-512.
-
(2002)
Proceedings of the 18th AAAI/14th IAAI
, pp. 507-512
-
-
Baral, C.1
Tran, N.2
Tuan, L.-C.3
-
36
-
-
0029514510
-
The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
-
A. Moore, "The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces," Machine Learning, vol. 21, no. 3, pp. 199-233, 1995.
-
(1995)
Machine Learning
, vol.21
, Issue.3
, pp. 199-233
-
-
Moore, A.1
|