-
3
-
-
0346942368
-
Decision-theoretic planning: Structural assumptions and computational leverage
-
Boutilier, C., Dean, T., & Hanks, S. (1999). Decision-theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 11, 1-94.
-
(1999)
Journal of Artificial Intelligence Research
, vol.11
, pp. 1-94
-
-
Boutilier, C.1
Dean, T.2
Hanks, S.3
-
4
-
-
84880891360
-
Symbolic dynamic programming for first-order MDPs
-
Boutilier, C., Reiter, R., & Price, B. (2001). Symbolic dynamic programming for first-order MDPs. In Proc. of the Int. Conf. on Artificial Intelligence (IJCAI), pp. 690-700.
-
(2001)
Proc. of the Int. Conf. on Artificial Intelligence (IJCAI
, pp. 690-700
-
-
Boutilier, C.1
Reiter, R.2
Price, B.3
-
5
-
-
60549101047
-
The factored policy-gradient planner
-
Buffet, O., & Aberdeen, D. (2009). The factored policy-gradient planner. Artificial Intelligence Journal, 173(5-6), 722-747.
-
(2009)
Artificial Intelligence Journal
, vol.173
, Issue.5-6
, pp. 722-747
-
-
Buffet, O.1
Aberdeen, D.2
-
7
-
-
84880882489
-
Online learning and exploiting relational models in reinforcement learning
-
Croonenborghs, T., Ramon, J., Blockeel, H., & Bruynooghe, M. (2007). Online learning and exploiting relational models in reinforcement learning. In Proc. of the Int. Conf. on Artificial Intelligence (IJCAI), pp. 726-731.
-
(2007)
Proc. of the Int. Conf. on Artificial Intelligence (IJCAI
, pp. 726-731
-
-
Croonenborghs, T.1
Ramon, J.2
Blockeel, H.3
Bruynooghe, M.4
-
8
-
-
38349168404
-
Probabilistic planning via heuristic forward search and weighted model counting
-
Domshlak, C., & Hoffmann, J. (2007). Probabilistic planning via heuristic forward search and weighted model counting. Journal of Artificial Intelligence Research, 30, 565-620.
-
(2007)
Journal of Artificial Intelligence Research
, vol.30
, pp. 565-620
-
-
Domshlak, C.1
Hoffmann, J.2
-
9
-
-
33748273074
-
Graph kernels and Gaussian processes for relational reinforcement learning
-
Driessens, K., Ramon, J., & G̈artner, T. (2006). Graph kernels and Gaussian processes for relational reinforcement learning. Machine Learning, 64(1-3), 91-119.
-
(2006)
Machine Learning
, vol.64
, Issue.1-3
, pp. 91-119
-
-
Driessens, K.1
Ramon, J.2
G̈artner, T.3
-
10
-
-
0035312760
-
Relational reinforcement learning
-
DOI 10.1023/A:1007694015589
-
Ďzeroski, S., de Raedt, L., & Driessens, K. (2001). Relational reinforcement learning. Machine Learning, 43, 7-52. (Pubitemid 32286614)
-
(2001)
Machine Learning
, vol.43
, Issue.1-2
, pp. 7-52
-
-
Dzeroski, S.1
De Raedt, L.2
Driessens, K.3
-
11
-
-
33744466799
-
Approximate policy iteration with a policy language bias: Solving relational markov decision processes
-
Fern, A., Yoon, S., & Givan, R. (2006). Approximate policy iteration with a policy language bias: solving relational markov decision processes. Journal of Artificial Intelligence Research, 25(1), 75-118.
-
(2006)
Journal of Artificial Intelligence Research
, vol.25
, Issue.1
, pp. 75-118
-
-
Fern, A.1
Yoon, S.2
Givan, R.3
-
14
-
-
70350633871
-
-
Tech. rep. MIT-CSAIL-TR-2008-050, MIT CS & AI Lab, Cambridge, MA
-
Gardiol, N. H., & Kaelbling, L. P. (2008). Adaptive envelope MDPs for relational equivalence-based planning. Tech. rep. MIT-CSAIL-TR-2008-050, MIT CS & AI Lab, Cambridge, MA.
-
(2008)
Adaptive Envelope MDPs for Relational Equivalence-based Planning
-
-
Gardiol, N.H.1
Kaelbling, L.P.2
-
17
-
-
78651497647
-
Conscious thought as simulation of behaviour and perception
-
Grush, R. (2004). Conscious thought as simulation of behaviour and perception. Behaviorial and brain sciences, 27, 377-442.
-
(2004)
Behaviorial and Brain Sciences
, vol.27
, pp. 377-442
-
-
Grush, R.1
-
19
-
-
0036605946
-
Conscious thought as simulation of behaviour and perception
-
Hesslow, G. (2002). Conscious thought as simulation of behaviour and perception. Trends in Cognitive Science, 6(6), 242-247.
-
(2002)
Trends in Cognitive Science
, vol.6
, Issue.6
, pp. 242-247
-
-
Hesslow, G.1
-
20
-
-
0036377352
-
The FF planning system: Fast plan generation through heuristic search
-
Hoffmann, J., & Nebel, B. (2001). The FF planning system: Fast plan generation through heuristic search. Journal of Artificial Intelligence Research, 14, 253-302.
-
(2001)
Journal of Artificial Intelligence Research
, vol.14
, pp. 253-302
-
-
Hoffmann, J.1
Nebel, B.2
-
24
-
-
78651481478
-
Generalized first-order decision diagrams for first-order MDPs
-
Joshi, S., Kersting, K., & Khardon, R. (2009). Generalized first-order decision diagrams for first-order MDPs. In Proc. of the Int. Conf. on Artificial Intelligence (IJCAI), pp. 1916-1921.
-
(2009)
Proc. of the Int. Conf. on Artificial Intelligence (IJCAI
, pp. 1916-1921
-
-
Joshi, S.1
Kersting, K.2
Khardon, R.3
-
26
-
-
0036832951
-
A sparse sampling algorithm for near-optimal planning in large Markov decision processes
-
Kearns, M. J., Mansour, Y., & Ng, A. Y. (2002). A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning, 49(2-3), 193-208.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 193-208
-
-
Kearns, M.J.1
Mansour, Y.2
Ng, A.Y.3
-
27
-
-
56449088242
-
Non-parametric policy gradients: A unified treatment of propositional and relational domains
-
Kersting, K., & Driessens, K. (2008). Non-parametric policy gradients: A unified treatment of propositional and relational domains. In Proc. of the Int. Conf. on Machine Learning (ICML), pp. 456-463.
-
(2008)
Proc. of the Int. Conf. on Machine Learning (ICML
, pp. 456-463
-
-
Kersting, K.1
Driessens, K.2
-
28
-
-
14344249892
-
Bellman goes relational
-
Kersting, K., van Otterlo, M., & de Raedt, L. (2004). Bellman goes relational. In Proc. of the Int. Conf. on Machine Learning (ICML), pp. 465-472.
-
(2004)
Proc. of the Int. Conf. on Machine Learning (ICML
, pp. 465-472
-
-
Kersting, K.1
Van Otterlo, M.2
De Raedt, L.3
-
30
-
-
0029333536
-
An algorithm for probabilistic planning
-
Kushmerick, N., Hanks, S., & Weld, D. (1995). An algorithm for probabilistic planning. Artificial Intelligence, 78(1-2), 239-286.
-
(1995)
Artificial Intelligence
, vol.78
, Issue.1-2
, pp. 239-286
-
-
Kushmerick, N.1
Hanks, S.2
Weld, D.3
-
31
-
-
58849089265
-
Using classical planners to solve nondeterministic planning problems
-
Kuter, U., Nau, D. S., Reisner, E., & Goldman, R. P. (2008). Using classical planners to solve nondeterministic planning problems. In Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS), pp. 190-197.
-
(2008)
Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS
, pp. 190-197
-
-
Kuter, U.1
Nau, D.S.2
Reisner, E.3
Goldman, R.P.4
-
34
-
-
78651506090
-
-
ICAPS-Workshop International Planning Competition: Past, Present and Future
-
Little, I., & Thíebaux, S. (2007). Probabilistic planning vs replanning. In ICAPS-Workshop International Planning Competition: Past, Present and Future.
-
(2007)
Probabilistic Planning Vs Replanning
-
-
Little, I.1
Thíebaux, S.2
-
35
-
-
11544375673
-
The computational complexity of probabilistic planning
-
Littman, M. L., Goldsmith, J., & Mundhenk, M. (1997). The computational complexity of probabilistic planning. Journal of Artificial Intelligence Research, 9, 1-36.
-
(1997)
Journal of Artificial Intelligence Research
, vol.9
, pp. 1-36
-
-
Littman, M.L.1
Goldsmith, J.2
Mundhenk, M.3
-
38
-
-
34748875246
-
Learning symbolic models of stochastic domains
-
Pasula, H. M., Zettlemoyer, L. S., & Kaelbling, L. P. (2007). Learning symbolic models of stochastic domains. Journal of Artificial Intelligence Research, 29, 309-352.
-
(2007)
Journal of Artificial Intelligence Research
, vol.29
, pp. 309-352
-
-
Pasula, H.M.1
Zettlemoyer, L.S.2
Kaelbling, L.P.3
-
41
-
-
60549103706
-
Practical solution techniques for first-order MDPs
-
Sanner, S., & Boutilier, C. (2009). Practical solution techniques for first-order MDPs. Artificial Intelligence, 173(5-6), 748-788.
-
(2009)
Artificial Intelligence
, vol.173
, Issue.5-6
, pp. 748-788
-
-
Sanner, S.1
Boutilier, C.2
-
42
-
-
0024038570
-
Probabilistic inference and influence diagrams
-
Shachter, R. (1988). Probabilistic inference and influence diagrams. Operations Research, 36, 589-605.
-
(1988)
Operations Research
, vol.36
, pp. 589-605
-
-
Shachter, R.1
-
45
-
-
33749234798
-
Probabilistic inference for solving discrete and continuous state Markov decision processes
-
Toussaint, M., & Storkey, A. (2006). Probabilistic inference for solving discrete and continuous state Markov decision processes. In Proc. of the Int. Conf. on Machine Learning (ICML), pp. 945-952.
-
(2006)
Proc. of the Int. Conf. on Machine Learning (ICML
, pp. 945-952
-
-
Toussaint, M.1
Storkey, A.2
-
46
-
-
78651507715
-
Expectation-maximization methods for solving (PO)MDPs and optimal control problems
-
Chiappa, S., & Barber, D. (Eds.) Cambridge University Press
-
Toussaint, M., Storkey, A., & Harmeling, S. (2010). Expectation-maximization methods for solving (PO)MDPs and optimal control problems. In Chiappa, S., & Barber, D. (Eds.), Inference and Learning in Dynamic Models. Cambridge University Press.
-
(2010)
Inference and Learning in Dynamic Models
-
-
Toussaint, M.1
Storkey, A.2
Harmeling, S.3
-
48
-
-
78049411914
-
-
Ph.D. thesis, Rutgers, The State University of New Jersey, New Brunswick, NJ
-
Walsh, T. J. (2010). Efficient learning of relational models for sequential decision making. Ph.D. thesis, Rutgers, The State University of New Jersey, New Brunswick, NJ.
-
(2010)
Efficient Learning of Relational Models for Sequential Decision Making
-
-
Walsh, T.J.1
-
49
-
-
44449127194
-
First order decision diagrams for relational MDPs
-
Wang, C., Joshi, S., & Khardon, R. (2008). First order decision diagrams for relational MDPs. Journal of Artificial Intelligence Research, 31, 431-472.
-
(2008)
Journal of Artificial Intelligence Research
, vol.31
, pp. 431-472
-
-
Wang, C.1
Joshi, S.2
Khardon, R.3
-
50
-
-
0032633177
-
Recent advances in AI planning
-
Weld, D. S. (1999). Recent advances in AI planning. AI Magazine, 20(2), 93-123.
-
(1999)
AI Magazine
, vol.20
, Issue.2
, pp. 93-123
-
-
Weld, D.S.1
-
51
-
-
58849135844
-
Stochastic enforced hill-climbing
-
Wu, J.-H., Kalyanam, R., & Givan, R. (2008). Stochastic enforced hill-climbing. In Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS), pp. 396-403.
-
(2008)
Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS
, pp. 396-403
-
-
Wu, J.-H.1
Kalyanam, R.2
Givan, R.3
-
52
-
-
58349118462
-
FF-Replan: A baseline for probabilistic planning
-
Yoon, S. W., Fern, A., & Givan, R. (2007). FF-Replan: A baseline for probabilistic planning. In Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS), pp. 352- 359.
-
(2007)
Proc. of the Int. Conf. on Automated Planning and Scheduling (ICAPS
, pp. 352-359
-
-
Yoon, S.W.1
Fern, A.2
Givan, R.3
-
53
-
-
57749193939
-
Probabilistic planning via determinization in hindsight
-
Yoon, S. W., Fern, A., Givan, R., & Kambhampati, S. (2008). Probabilistic planning via determinization in hindsight. In Proc. of the AAAI Conf. on Artificial Intelligence (AAAI), pp. 1010-1016.
-
(2008)
Proc. of the AAAI Conf. on Artificial Intelligence (AAAI
, pp. 1010-1016
-
-
Yoon, S.W.1
Fern, A.2
Givan, R.3
Kambhampati, S.4
|