-
1
-
-
84898960325
-
Programmable reinforcement learning agents
-
D. Andre, S. Russell, Programmable reinforcement learning agents, in: Advances in Neural Information Processing Systems (NIPS-01), vol. 13, 2001, pp. 78-85
-
(2001)
Advances in Neural Information Processing Systems (NIPS-01)
, vol.13
, pp. 78-85
-
-
Andre, D.1
Russell, S.2
-
2
-
-
0041547046
-
Reasoning about noisy sensors in the situation calculus
-
Montreal
-
F. Bacchus, J.Y. Halpern, H.J. Levesque, Reasoning about noisy sensors in the situation calculus, in: International Joint Conference on Artificial Intelligence (IJCAI-95), Montreal, 1995, pp. 1933-1940
-
(1995)
International Joint Conference on Artificial Intelligence (IJCAI-95)
, pp. 1933-1940
-
-
Bacchus, F.1
Halpern, J.Y.2
Levesque, H.J.3
-
3
-
-
0033897011
-
Using temporal logics to express search control knowledge for planning
-
Bacchus F., and Kabanza F. Using temporal logics to express search control knowledge for planning. Artificial Intelligence 116 1-2 (2000) 123-191
-
(2000)
Artificial Intelligence
, vol.116
, Issue.1-2
, pp. 123-191
-
-
Bacchus, F.1
Kabanza, F.2
-
4
-
-
0027880685
-
Algebraic decision diagrams and their applications
-
R.I. Bahar, E. Frohm, C. Gaona, G. Hachtel, E. Macii, A. Pardo, F. Somenzi, Algebraic decision diagrams and their applications, in: IEEE/ACM International Conference on CAD, 1993, pp. 428-432
-
(1993)
IEEE/ACM International Conference on CAD
, pp. 428-432
-
-
Bahar, R.I.1
Frohm, E.2
Gaona, C.3
Hachtel, G.4
Macii, E.5
Pardo, A.6
Somenzi, F.7
-
5
-
-
0002700781
-
Learning to act using real-time dynamic programming
-
Tech. Rep. UM-CS-1993-002, U. Mass. Amherst
-
A.G. Barto, S.J. Bradtke, S.P. Singh, Learning to act using real-time dynamic programming, Tech. Rep. UM-CS-1993-002, U. Mass. Amherst, 1993
-
(1993)
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
6
-
-
85012688561
-
-
Princeton University Press, Princeton, NJ
-
Bellman R.E. Dynamic Programming (1957), Princeton University Press, Princeton, NJ
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
9
-
-
0001657540
-
Fast planning through graph analysis
-
Montreal
-
A.L. Blum, M.L. Furst, Fast planning through graph analysis, in: IJCAI 95, Montreal, 1995, pp. 1636-1642
-
(1995)
IJCAI 95
, pp. 1636-1642
-
-
Blum, A.L.1
Furst, M.L.2
-
10
-
-
60549099880
-
-
B. Bonet, H. Geffner, mGPT: A probabilistic planner based on heuristic search, in: Online Proceedings for The Probabilistic Planning Track of IPC-04: http://www.cs.rutgers.edu/~mlittman/topics/ipc04-pt/proceedings/, 2004
-
B. Bonet, H. Geffner, mGPT: A probabilistic planner based on heuristic search, in: Online Proceedings for The Probabilistic Planning Track of IPC-04: http://www.cs.rutgers.edu/~mlittman/topics/ipc04-pt/proceedings/, 2004
-
-
-
-
11
-
-
84880685295
-
Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning
-
Nagoya
-
C. Boutilier, R.I. Brafman, C. Geib, Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning, in: International Joint Conference on Artificial Intelligence (IJCAI-97). Nagoya, 1997, pp. 1156-1162
-
(1997)
International Joint Conference on Artificial Intelligence (IJCAI-97)
, pp. 1156-1162
-
-
Boutilier, C.1
Brafman, R.I.2
Geib, C.3
-
13
-
-
0000675721
-
Context-specific independence in Bayesian networks
-
Portland, OR
-
C. Boutilier, N. Friedman, M. Goldszmidt, D. Koller, Context-specific independence in Bayesian networks, in: Uncertainty in Artificial Intelligence (UAI-96), Portland, OR, 1996, pp. 115-123
-
(1996)
Uncertainty in Artificial Intelligence (UAI-96)
, pp. 115-123
-
-
Boutilier, C.1
Friedman, N.2
Goldszmidt, M.3
Koller, D.4
-
14
-
-
84880891360
-
Symbolic dynamic programming for first-order MDPs
-
Seattle
-
C. Boutilier, R. Reiter, B. Price, Symbolic dynamic programming for first-order MDPs, in: International Joint Conference on Artificial Intelligence (IJCAI-01), Seattle, 2001, pp. 690-697
-
(2001)
International Joint Conference on Artificial Intelligence (IJCAI-01)
, pp. 690-697
-
-
Boutilier, C.1
Reiter, R.2
Price, B.3
-
15
-
-
60549115923
-
-
C. Boutilier, R. Reiter, M. Soutchanski, S. Thrun, Decision-theoretic, high-level agent programming in the situation calculus, in: AAAI-00, Austin, TX, 2000, pp. 355-362
-
C. Boutilier, R. Reiter, M. Soutchanski, S. Thrun, Decision-theoretic, high-level agent programming in the situation calculus, in: AAAI-00, Austin, TX, 2000, pp. 355-362
-
-
-
-
18
-
-
0024084964
-
Generalized subsumption and its application to induction and redundancy
-
Buntine W. Generalized subsumption and its application to induction and redundancy. Artificial Intelligence 36 (1988) 375-399
-
(1988)
Artificial Intelligence
, vol.36
, pp. 375-399
-
-
Buntine, W.1
-
19
-
-
0348090400
-
The linear programming approach to approximate dynamic programming
-
de Farias D., and Roy B.V. The linear programming approach to approximate dynamic programming. Operations Research 51 6 (2003) 850-865
-
(2003)
Operations Research
, vol.51
, Issue.6
, pp. 850-865
-
-
de Farias, D.1
Roy, B.V.2
-
20
-
-
84880750109
-
Lifted first-order probabilistic inference
-
IJCAI, Edinburgh, UK
-
R. de Salvo Braz, E. Amir, D. Roth, Lifted first-order probabilistic inference, in: 19th International Joint Conference on Artificial Intelligence (IJCAI-2005), Edinburgh, UK, 2005, pp. 1319-1325
-
(2005)
19th International Joint Conference on Artificial Intelligence
, pp. 1319-1325
-
-
de Salvo Braz, R.1
Amir, E.2
Roth, D.3
-
21
-
-
33750728007
-
MPE and partial inversion in lifted probabilistic variable elimination
-
Boston, USA
-
R. de Salvo Braz, E. Amir, D. Roth, MPE and partial inversion in lifted probabilistic variable elimination, in: National Conference on Artificial Intelligence (AAAI-06), Boston, USA, 2006
-
(2006)
National Conference on Artificial Intelligence (AAAI-06)
-
-
de Salvo Braz, R.1
Amir, E.2
Roth, D.3
-
22
-
-
0030697013
-
Abstraction and approximate decision-theoretic planning
-
Dearden R., and Boutilier C. Abstraction and approximate decision-theoretic planning. Artificial Intelligence 89 12 (1997) 219-283
-
(1997)
Artificial Intelligence
, vol.89
, Issue.12
, pp. 219-283
-
-
Dearden, R.1
Boutilier, C.2
-
23
-
-
0033188982
-
Bucket elimination: A unifying framework for reasoning
-
Dechter R. Bucket elimination: A unifying framework for reasoning. Artificial Intelligence 113 (1999) 41-85
-
(1999)
Artificial Intelligence
, vol.113
, pp. 41-85
-
-
Dechter, R.1
-
27
-
-
13444258086
-
-
A. Fern, S. Yoon, R. Givan, Learning domain-specific control knowledge from random walks, in: International Conference on Planning and Scheduling (ICAPS-04), June 2004, pp. 191-199
-
A. Fern, S. Yoon, R. Givan, Learning domain-specific control knowledge from random walks, in: International Conference on Planning and Scheduling (ICAPS-04), June 2004, pp. 191-199
-
-
-
-
28
-
-
84862952608
-
Extending DTGolog with options
-
IJCAI, Acapulco, Mexico
-
A. Ferrein, C. Fritz, G. Lakemeyer, Extending DTGolog with options, in: 18th International Joint Conference on Artificial Intelligence (IJCAI-2003), Acapulco, Mexico, 2003, pp. 144-151
-
(2003)
18th International Joint Conference on Artificial Intelligence
, pp. 144-151
-
-
Ferrein, A.1
Fritz, C.2
Lakemeyer, G.3
-
29
-
-
2842560201
-
STRIPS: A new approach to the application of theorem proving to problem solving
-
Fikes R.E., and Nilsson N.J. STRIPS: A new approach to the application of theorem proving to problem solving. AI Journal 2 (1971) 189-208
-
(1971)
AI Journal
, vol.2
, pp. 189-208
-
-
Fikes, R.E.1
Nilsson, N.J.2
-
30
-
-
84898930526
-
Envelope-based planning in relational MDPs
-
Vancouver, CA
-
N.H. Gardiol, L.P. Kaelbling, Envelope-based planning in relational MDPs, in: Advances in Neural Information Processing Systems 16 (NIPS-03), Vancouver, CA, 2004, pp. 1040-1046
-
(2004)
Advances in Neural Information Processing Systems 16 (NIPS-03)
, pp. 1040-1046
-
-
Gardiol, N.H.1
Kaelbling, L.P.2
-
31
-
-
33748273074
-
Graph kernels and Gaussian processes for relational reinforcement learning
-
Gartner T., Driessens K., and Ramon J. Graph kernels and Gaussian processes for relational reinforcement learning. Machine Learning Journal (MLJ) 64 (2006) 91-119
-
(2006)
Machine Learning Journal (MLJ)
, vol.64
, pp. 91-119
-
-
Gartner, T.1
Driessens, K.2
Ramon, J.3
-
32
-
-
60549114463
-
-
A. Gerevini, B. Bonet, B. Givan Eds, Lake District, UK
-
A. Gerevini, B. Bonet, B. Givan (Eds.), Online Proceedings for The Fifth International Planning Competition IPC-05: http://www.ldc.usb.ve/bonet/ipc5/docs/ipc-2006-booklet.pdf.gz, Lake District, UK, 2006
-
(2006)
Online Proceedings for The Fifth International Planning Competition IPC-05
-
-
-
33
-
-
44449170889
-
Exploiting first-order regression in inductive policy selection
-
Banff, Canada
-
C. Gretton, S. Thiebaux, Exploiting first-order regression in inductive policy selection, in: Uncertainty in Artificial Intelligence (UAI-04), Banff, Canada, 2004, pp. 217-225
-
(2004)
Uncertainty in Artificial Intelligence (UAI-04)
, pp. 217-225
-
-
Gretton, C.1
Thiebaux, S.2
-
34
-
-
29344475738
-
Solving factored MDPs with continuous and discrete variables
-
C. Guestrin, M. Hauskrecht, B. Kveton, Solving factored MDPs with continuous and discrete variables, in: 20th Conference on Uncertainty in Artificial Intelligence, 2004, pp. 235-242
-
(2004)
20th Conference on Uncertainty in Artificial Intelligence
, pp. 235-242
-
-
Guestrin, C.1
Hauskrecht, M.2
Kveton, B.3
-
35
-
-
84880803349
-
Generalizing plans to new environments in relational MDPs
-
IJCAI, Acapulco, Mexico
-
C. Guestrin, D. Koller, C. Gearhart, N. Kanodia, Generalizing plans to new environments in relational MDPs, in: 18th International Joint Conference on Artificial Intelligence (IJCAI-2003), Acapulco, Mexico, 2003, pp. 1003-1010
-
(2003)
18th International Joint Conference on Artificial Intelligence
, pp. 1003-1010
-
-
Guestrin, C.1
Koller, D.2
Gearhart, C.3
Kanodia, N.4
-
37
-
-
84898970468
-
Linear program approximations for factored continuous-state Markov decision processes
-
M. Hauskrecht, B. Kveton, Linear program approximations for factored continuous-state Markov decision processes, in: Advances in Neural Information Processing Systems 16, 2004, pp. 895-902
-
(2004)
in: Advances in Neural Information Processing Systems
, vol.16
, pp. 895-902
-
-
Hauskrecht, M.1
Kveton, B.2
-
38
-
-
0002956570
-
Stochastic planning using decision diagrams
-
SPUDD:, Stockholm
-
J. Hoey, R. St-Aubin, A. Hu, C. Boutilier, SPUDD: Stochastic planning using decision diagrams, in: Uncertainty in Artificial Intelligence (UAI-99), Stockholm, 1999, pp. 279-288
-
(1999)
Uncertainty in Artificial Intelligence (UAI-99)
, pp. 279-288
-
-
Hoey, J.1
St-Aubin, R.2
Hu, A.3
Boutilier, C.4
-
42
-
-
33744496081
-
A heuristic search algorithm for solving first-order MDPs
-
Edinburgh, Scotland
-
E. Karabaev, O. Skvortsova, A heuristic search algorithm for solving first-order MDPs, in: Uncertainty in Artificial Intelligence (UAI-05), Edinburgh, Scotland, 2005, pp. 292-299
-
(2005)
Uncertainty in Artificial Intelligence (UAI-05)
, pp. 292-299
-
-
Karabaev, E.1
Skvortsova, O.2
-
43
-
-
14344249892
-
Bellman goes relational
-
ACM Press
-
K. Kersting, M. van Otterlo, L. de Raedt, Bellman goes relational, in: International Conference on Machine Learning (ICML-04), ACM Press, 2004, pp. 465-472
-
(2004)
International Conference on Machine Learning (ICML-04)
, pp. 465-472
-
-
Kersting, K.1
van Otterlo, M.2
de Raedt, L.3
-
44
-
-
0033189384
-
Learning action strategies for planning domains
-
Khardon R. Learning action strategies for planning domains. Artificial Intelligence 113 1-2 (1999) 125-148
-
(1999)
Artificial Intelligence
, vol.113
, Issue.1-2
, pp. 125-148
-
-
Khardon, R.1
-
45
-
-
0032649290
-
Learning to take actions
-
Khardon R. Learning to take actions. Machine Learning 35 1 (1999) 57-90
-
(1999)
Machine Learning
, vol.35
, Issue.1
, pp. 57-90
-
-
Khardon, R.1
-
48
-
-
0031126053
-
GOLOG: A logic programming language for dynamic domains
-
Levesque H.J., Reiter R., Lespérance Y., Lin F., and Scherl R. GOLOG: A logic programming language for dynamic domains. Journal of Logic Programming 31 1-3 (1997) 59-83
-
(1997)
Journal of Logic Programming
, vol.31
, Issue.1-3
, pp. 59-83
-
-
Levesque, H.J.1
Reiter, R.2
Lespérance, Y.3
Lin, F.4
Scherl, R.5
-
51
-
-
29344433509
-
Samuel meets Amarel: Automating value function approximation using global state space analysis
-
Pittsburgh
-
S. Mahadevan, Samuel meets Amarel: Automating value function approximation using global state space analysis, in: National Conference on Artificial Intelligence (AAAI-05), Pittsburgh, 2005, pp. 1000-1005
-
(2005)
National Conference on Artificial Intelligence (AAAI-05)
, pp. 1000-1005
-
-
Mahadevan, S.1
-
52
-
-
0004030536
-
Situations, actions and causal laws, Tech. rep., Stanford University, 1963, reprinted
-
Minsky M. (Ed), MIT Press, Cambridge, MA
-
McCarthy J. Situations, actions and causal laws, Tech. rep., Stanford University, 1963, reprinted. In: Minsky M. (Ed). Semantic Information Processing (1968), MIT Press, Cambridge, MA 410-417
-
(1968)
Semantic Information Processing
, pp. 410-417
-
-
McCarthy, J.1
-
53
-
-
0031632806
-
Solving very large weakly coupled Markov decision processes
-
Madison, WI
-
N. Meuleau, M. Hauskrecht, K.-E. Kim, L. Peshkin, L.P. Kaelbling, T. Dean, C. Boutilier, Solving very large weakly coupled Markov decision processes, in: National Conference on Artificial Intelligence (AAAI-98), Madison, WI, 1998, pp. 165-172
-
(1998)
National Conference on Artificial Intelligence (AAAI-98)
, pp. 165-172
-
-
Meuleau, N.1
Hauskrecht, M.2
Kim, K.-E.3
Peshkin, L.4
Kaelbling, L.P.5
Dean, T.6
Boutilier, C.7
-
54
-
-
33749555400
-
-
Ph.D. thesis, Univesität Karlsruhe TH, Karlsruhe, Germany, January
-
B. Motik, Reasoning in description logics using resolution and deductive databases, Ph.D. thesis, Univesität Karlsruhe (TH), Karlsruhe, Germany, January 2006
-
(2006)
Reasoning in description logics using resolution and deductive databases
-
-
Motik, B.1
-
55
-
-
0141596576
-
Policy invariance under reward transformations: theory and application to reward shaping
-
Morgan Kaufmann, San Francisco, CA
-
Ng A.Y., Harada D., and Russell S. Policy invariance under reward transformations: theory and application to reward shaping. Proc. 16th International Conf. on Machine Learning (1999), Morgan Kaufmann, San Francisco, CA 278-287
-
(1999)
Proc. 16th International Conf. on Machine Learning
, pp. 278-287
-
-
Ng, A.Y.1
Harada, D.2
Russell, S.3
-
56
-
-
84898956770
-
Reinforcement learning with hierarchies of machines
-
Jordan M.M.K., and Solla S. (Eds), MIT Press, Cambridge, MA
-
Parr R., and Russell S. Reinforcement learning with hierarchies of machines. In: Jordan M.M.K., and Solla S. (Eds). Advances in Neural Information Processing Systems 10 (1998), MIT Press, Cambridge, MA 1043-1049
-
(1998)
Advances in Neural Information Processing Systems 10
, pp. 1043-1049
-
-
Parr, R.1
Russell, S.2
-
57
-
-
0036927202
-
Greedy linear value-approximation for factored Markov decision processes
-
Edmonton
-
R. Patrascu, P. Poupart, D. Schuurmans, C. Boutilier, C. Guestrin, Greedy linear value-approximation for factored Markov decision processes, in: National Conference on Artificial Intelligence (AAAI-02), Edmonton, 2002, pp. 285-291
-
(2002)
National Conference on Artificial Intelligence (AAAI-02)
, pp. 285-291
-
-
Patrascu, R.1
Poupart, P.2
Schuurmans, D.3
Boutilier, C.4
Guestrin, C.5
-
59
-
-
0031187203
-
The independent choice logic for modelling multiple agents under uncertainty
-
Poole D. The independent choice logic for modelling multiple agents under uncertainty. Artificial Intelligence 94 1-2 (1997) 7-56
-
(1997)
Artificial Intelligence
, vol.94
, Issue.1-2
, pp. 7-56
-
-
Poole, D.1
-
61
-
-
0036923210
-
Piecewise linear value function approximation for factored MDPs
-
Edmonton
-
P. Poupart, C. Boutilier, R. Patrascu, D. Schuurmans, Piecewise linear value function approximation for factored MDPs, in: National Conference on Artificial Intelligence (AAAI-02), Edmonton, 2002, pp. 292-299
-
(2002)
National Conference on Artificial Intelligence (AAAI-02)
, pp. 292-299
-
-
Poupart, P.1
Boutilier, C.2
Patrascu, R.3
Schuurmans, D.4
-
63
-
-
0002048328
-
The frame problem in the situation calculus: A simple solution (sometimes) and a completeness result for goal regression
-
Lifschitz V. (Ed), Academic Press, San Diego, CA
-
Reiter R. The frame problem in the situation calculus: A simple solution (sometimes) and a completeness result for goal regression. In: Lifschitz V. (Ed). Artificial Intelligence and Mathematical Theory of Computation (Papers in Honor of John McCarthy) (1991), Academic Press, San Diego, CA 359-380
-
(1991)
Artificial Intelligence and Mathematical Theory of Computation (Papers in Honor of John McCarthy)
, pp. 359-380
-
-
Reiter, R.1
-
65
-
-
0036327027
-
The design and implementation of vampire
-
Riazanov A., and Voronkov A. The design and implementation of vampire. AI Communications 15 2 (2002) 91-110
-
(2002)
AI Communications
, vol.15
, Issue.2
, pp. 91-110
-
-
Riazanov, A.1
Voronkov, A.2
-
67
-
-
60549107148
-
-
S. Sanner, First-order decision-theoretic planning in structured relational environments, Ph.D. thesis, University of Toronto, Toronto, ON, Canada, March 2008
-
S. Sanner, First-order decision-theoretic planning in structured relational environments, Ph.D. thesis, University of Toronto, Toronto, ON, Canada, March 2008
-
-
-
-
68
-
-
72949112166
-
Approximate linear programming for first-order MDPs
-
Edinburgh, Scotland
-
S. Sanner, C. Boutilier, Approximate linear programming for first-order MDPs, in: Uncertainty in Artificial Intelligence (UAI-05), Edinburgh, Scotland, 2005, pp. 509-517
-
(2005)
Uncertainty in Artificial Intelligence (UAI-05)
, pp. 509-517
-
-
Sanner, S.1
Boutilier, C.2
-
75
-
-
26944499565
-
Approximate policy construction using decision diagrams
-
APRICODD:, Denver
-
R. St-Aubin, J. Hoey, C. Boutilier, APRICODD: Approximate policy construction using decision diagrams, in: Advances in Neural Information Processing 13 (NIPS-00), Denver, 2000, pp. 1089-1095
-
(2000)
Advances in Neural Information Processing 13 (NIPS-00)
, pp. 1089-1095
-
-
St-Aubin, R.1
Hoey, J.2
Boutilier, C.3
-
77
-
-
33744462367
-
Decision-theoretic planning with non-Markovian rewards
-
Thiebaux S., Gretton C., Slaney J., Price D., and Kabanza F. Decision-theoretic planning with non-Markovian rewards. Journal of Artificial Intelligence Research 25 (January 2006) 17-74
-
(2006)
Journal of Artificial Intelligence Research
, vol.25
, pp. 17-74
-
-
Thiebaux, S.1
Gretton, C.2
Slaney, J.3
Price, D.4
Kabanza, F.5
-
78
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
Tsitsiklis J.N., and Van Roy B. Feature-based methods for large scale dynamic programming. Machine Learning 22 (1996) 59-94
-
(1996)
Machine Learning
, vol.22
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
80
-
-
84880869367
-
First order decision diagrams for relational MDPs
-
Hyderabad, India
-
C. Wang, S. Joshi, R. Khardon, First order decision diagrams for relational MDPs, in: Twentieth International Joint Conference on Artificial Intelligence (IJCAI-07), Hyderabad, India, 2007, pp. 1095-1100
-
(2007)
Twentieth International Joint Conference on Artificial Intelligence (IJCAI-07)
, pp. 1095-1100
-
-
Wang, C.1
Joshi, S.2
Khardon, R.3
-
84
-
-
13444310066
-
Inductive policy selection for first-order Markov decision processes
-
Edmonton
-
S. Yoon, A. Fern, R. Givan, Inductive policy selection for first-order Markov decision processes, in: Uncertainty in Artificial Intelligence (UAI-02), Edmonton, 2002, pp. 569-576
-
(2002)
Uncertainty in Artificial Intelligence (UAI-02)
, pp. 569-576
-
-
Yoon, S.1
Fern, A.2
Givan, R.3
-
85
-
-
60549105503
-
-
S. Yoon, A. Fern, R. Givan, Learning reactive policies for probabilistic planning domains, in: Online Proceedings for The Probabilistic Planning Track of IPC-04: http://www.cs.rutgers.edu/mlittman/topics/ipc04-pt/proceedings/, 2004
-
S. Yoon, A. Fern, R. Givan, Learning reactive policies for probabilistic planning domains, in: Online Proceedings for The Probabilistic Planning Track of IPC-04: http://www.cs.rutgers.edu/mlittman/topics/ipc04-pt/proceedings/, 2004
-
-
-
-
86
-
-
29344443330
-
-
S. Yoon, A. Fern, R. Givan, Learning measures of progress for planning domains, in: 20th National Conference on Artificial Intelligence, July 2005, pp. 1217-1222
-
S. Yoon, A. Fern, R. Givan, Learning measures of progress for planning domains, in: 20th National Conference on Artificial Intelligence, July 2005, pp. 1217-1222
-
-
-
-
87
-
-
33744466799
-
Approximate policy iteration with a policy language bias: Learning to solve relational Markov decision processes
-
Yoon S., Fern A., and Givan R. Approximate policy iteration with a policy language bias: Learning to solve relational Markov decision processes. Journal of Artificial Intelligence Research (JAIR) 25 (2006) 85-118
-
(2006)
Journal of Artificial Intelligence Research (JAIR)
, vol.25
, pp. 85-118
-
-
Yoon, S.1
Fern, A.2
Givan, R.3
-
88
-
-
58349118462
-
A baseline for probabilistic planning
-
FF-Replan
-
S. Yoon, A. Fern, R. Givan, FF-Replan: A baseline for probabilistic planning, in: 17th International Conference on Automated Planning and Scheduling (ICAPS-07), 2007, pp. 352-359
-
(2007)
17th International Conference on Automated Planning and Scheduling (ICAPS-07)
, pp. 352-359
-
-
Yoon, S.1
Fern, A.2
Givan, R.3
|