Volume 173, Issue 5-6, 2009, Pages 748-788

Practical solution techniques for first-order MDPs

Author keywords

First order logic; MDPs; Planning

Indexed keywords

FORMAL LOGIC; LINEARIZATION; SPECIFICATIONS

EID: 60549103706    PISSN: 0004-3702    EISSN: None    Source Type: Journal
DOI: 10.1016/j.artint.2008.11.003    Document Type: Article
Times cited: 97

References (91)
  • 3 Bacchus, F., and Kabanza, F. Using temporal logics to express search control knowledge for planning. Artificial Intelligence 116(1-2) (2000) 123-191.
  • 5 Barto, A.G., Bradtke, S.J., and Singh, S.P. Learning to act using real-time dynamic programming. Tech. Rep. UM-CS-1993-002, U. Mass. Amherst (1993).
  • 6 Bellman, R.E. Dynamic Programming (1957), Princeton University Press, Princeton, NJ.
  • 9 Blum, A.L., and Furst, M.L. Fast planning through graph analysis. In: IJCAI 95, Montreal (1995) 1636-1642.
  • 10 Bonet, B., and Geffner, H. mGPT: A probabilistic planner based on heuristic search. In: Online Proceedings for the Probabilistic Planning Track of IPC-04: http://www.cs.rutgers.edu/~mlittman/topics/ipc04-pt/proceedings/ (2004).
  • 11 Boutilier, C., Brafman, R.I., and Geib, C. Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning. In: International Joint Conference on Artificial Intelligence (IJCAI-97), Nagoya (1997) 1156-1162.
  • 15 Boutilier, C., Reiter, R., Soutchanski, M., and Thrun, S. Decision-theoretic, high-level agent programming in the situation calculus. In: AAAI-00, Austin, TX (2000) 355-362.
  • 18 Buntine, W. Generalized subsumption and its application to induction and redundancy. Artificial Intelligence 36 (1988) 375-399.
  • 19 de Farias, D., and Van Roy, B. The linear programming approach to approximate dynamic programming. Operations Research 51(6) (2003) 850-865.
  • 22 Dearden, R., and Boutilier, C. Abstraction and approximate decision-theoretic planning. Artificial Intelligence 89(1-2) (1997) 219-283.
  • 23 Dechter, R. Bucket elimination: A unifying framework for reasoning. Artificial Intelligence 113 (1999) 41-85.
  • 27 Fern, A., Yoon, S., and Givan, R. Learning domain-specific control knowledge from random walks. In: International Conference on Planning and Scheduling (ICAPS-04) (June 2004) 191-199.
  • 29 Fikes, R.E., and Nilsson, N.J. STRIPS: A new approach to the application of theorem proving to problem solving. AI Journal 2 (1971) 189-208.
  • 31 Gartner, T., Driessens, K., and Ramon, J. Graph kernels and Gaussian processes for relational reinforcement learning. Machine Learning Journal (MLJ) 64 (2006) 91-119.
  • 33 Gretton, C., and Thiebaux, S. Exploiting first-order regression in inductive policy selection. In: Uncertainty in Artificial Intelligence (UAI-04), Banff, Canada (2004) 217-225.
  • 37 Hauskrecht, M., and Kveton, B. Linear program approximations for factored continuous-state Markov decision processes. In: Advances in Neural Information Processing Systems 16 (2004) 895-902.
  • 44 Khardon, R. Learning action strategies for planning domains. Artificial Intelligence 113(1-2) (1999) 125-148.
  • 45 Khardon, R. Learning to take actions. Machine Learning 35(1) (1999) 57-90.
  • 51 Mahadevan, S. Samuel meets Amarel: Automating value function approximation using global state space analysis. In: National Conference on Artificial Intelligence (AAAI-05), Pittsburgh (2005) 1000-1005.
  • 52 McCarthy, J. Situations, actions and causal laws. Tech. rep., Stanford University (1963); reprinted in: Minsky, M. (Ed.), Semantic Information Processing (1968), MIT Press, Cambridge, MA, 410-417.
  • 55 Ng, A.Y., Harada, D., and Russell, S. Policy invariance under reward transformations: Theory and application to reward shaping. In: Proc. 16th International Conf. on Machine Learning (1999), Morgan Kaufmann, San Francisco, CA, 278-287.
  • 56 Parr, R., and Russell, S. Reinforcement learning with hierarchies of machines. In: Jordan, M.I., Kearns, M.J., and Solla, S.A. (Eds.), Advances in Neural Information Processing Systems 10 (1998), MIT Press, Cambridge, MA, 1043-1049.
  • 59 Poole, D. The independent choice logic for modelling multiple agents under uncertainty. Artificial Intelligence 94(1-2) (1997) 7-56.
  • 63 Reiter, R. The frame problem in the situation calculus: A simple solution (sometimes) and a completeness result for goal regression. In: Lifschitz, V. (Ed.), Artificial Intelligence and Mathematical Theory of Computation (Papers in Honor of John McCarthy) (1991), Academic Press, San Diego, CA, 359-380.
  • 65 Riazanov, A., and Voronkov, A. The design and implementation of Vampire. AI Communications 15(2) (2002) 91-110.
  • 67 Sanner, S. First-order decision-theoretic planning in structured relational environments. Ph.D. thesis, University of Toronto, Toronto, ON, Canada (March 2008).
  • 78 Tsitsiklis, J.N., and Van Roy, B. Feature-based methods for large scale dynamic programming. Machine Learning 22 (1996) 59-94.
  • 84 Yoon, S., Fern, A., and Givan, R. Inductive policy selection for first-order Markov decision processes. In: Uncertainty in Artificial Intelligence (UAI-02), Edmonton (2002) 569-576.
  • 85 Yoon, S., Fern, A., and Givan, R. Learning reactive policies for probabilistic planning domains. In: Online Proceedings for the Probabilistic Planning Track of IPC-04: http://www.cs.rutgers.edu/mlittman/topics/ipc04-pt/proceedings/ (2004).
  • 86 Yoon, S., Fern, A., and Givan, R. Learning measures of progress for planning domains. In: 20th National Conference on Artificial Intelligence (July 2005) 1217-1222.
  • 87 Yoon, S., Fern, A., and Givan, R. Approximate policy iteration with a policy language bias: Learning to solve relational Markov decision processes. Journal of Artificial Intelligence Research (JAIR) 25 (2006) 85-118.


* This information was extracted by KISTI from Elsevier's SCOPUS database.