[1] B.W. Ballard, The *-minimax search procedure for trees containing chance nodes, Artif. Intell. 21 (1983) 327-350.
[2] A.G. Barto, S.J. Bradtke and S.P. Singh, Learning to act using real-time dynamic programming, Artif. Intell. 72 (1995) 81-138.
[3] R.E. Bellman, Dynamic Programming (Princeton University Press, Princeton, NJ, 1957).
[4] D.P. Bertsekas and D.A. Castanon, Adaptive aggregation for infinite horizon dynamic programming, IEEE Trans. Automat. Control 34 (1989) 589-598.
[6] M. Boddy and T.L. Dean, Solving time-dependent planning problems, in: Proceedings IJCAI-89, Detroit, MI (1989) 979-984.
[7] M. Boddy and T.L. Dean, Deliberation scheduling for problem solving in time-constrained environments, Artif. Intell. 67 (1994) 245-285.
[10] C. Boutilier, R. Dearden and M. Goldszmidt, Exploiting structure in policy construction, in: Proceedings IJCAI-95, Montreal, Que. (1995) 1104-1111.
[11] C. Boutilier and D. Poole, Computing optimal policies for partially observable decision processes using compact representations, in: Proceedings AAAI-96, Portland, OR (1996) 1168-1175.
[12] C. Boutilier and M.L. Puterman, Process-oriented planning and average-reward optimality, in: Proceedings IJCAI-95, Montreal, Que. (1995) 1096-1103.
[13] A.R. Cassandra, L.P. Kaelbling and M.L. Littman, Acting optimally in partially observable stochastic domains, in: Proceedings AAAI-94, Seattle, WA (1994) 1023-1028.
[14] D. Chapman and L.P. Kaelbling, Input generalization in delayed reinforcement learning: an algorithm and performance comparisons, in: Proceedings IJCAI-91, Sydney (1991) 726-731.
[15] T.L. Dean, L.P. Kaelbling, J. Kirman and A. Nicholson, Planning with deadlines in stochastic domains, in: Proceedings AAAI-93, Washington, DC (1993) 574-579.
[16] T.L. Dean and K. Kanazawa, A model for reasoning about persistence and causation, Comput. Intell. 5 (1989) 142-150.
[17] D. Draper, S. Hanks and D.S. Weld, A probabilistic model of action for least-commitment planning with information gathering, in: Proceedings Tenth Conference on Uncertainty in Artificial Intelligence, Seattle, WA (1994) 178-186.
[18] M. Drummond, K. Swanson, J. Bresina and R. Levinson, Reaction-first search, in: Proceedings IJCAI-93, Chambery (1993) 1408-1414.
[19] R.E. Fikes and N.J. Nilsson, STRIPS: a new approach to the application of theorem proving to problem solving, Artif. Intell. 2 (1971) 189-208.
[23] E.J. Horvitz, Computation and action under bounded resources, Tech. Rept. KSL-90-76, Stanford University, Stanford, CA (1990).
[29] C.A. Knoblock, Automatically generating abstractions for planning, Artif. Intell. 68 (1994) 243-302.
[30] C.A. Knoblock, J.D. Tenenberg and Q. Yang, Characterizing abstraction hierarchies for planning, in: Proceedings AAAI-91, Anaheim, CA (1991) 692-697.
[31] R.E. Korf, Real-time heuristic search, Artif. Intell. 42 (1990) 189-211.
[32] N. Kushmerick, S. Hanks and D.S. Weld, An algorithm for probabilistic least-commitment planning, in: Proceedings AAAI-94, Seattle, WA (1994) 1073-1078.
[34] M.L. Littman, T.L. Dean and L.P. Kaelbling, On the complexity of solving Markov decision problems, in: Proceedings Eleventh Conference on Uncertainty in Artificial Intelligence, Montreal, Que. (1995) 394-402.
[35] W.S. Lovejoy, A survey of algorithmic methods for partially observed Markov decision processes, Ann. Oper. Res. 28 (1991) 47-66.
[37] A.W. Moore and C.G. Atkeson, The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces, Mach. Learn. (to appear).
[39] R. Parr and S.J. Russell, Approximating optimal policies for partially observable stochastic domains, in: Proceedings IJCAI-95, Montreal, Que. (1995) 1088-1094.
[44] D. Poole, Exploiting the rule structure for decision making within the independent choice logic, in: Proceedings Eleventh Conference on Uncertainty in Artificial Intelligence, Montreal, Que. (1995) 454-463.
[46] M.L. Puterman and M.C. Shin, Modified policy iteration algorithms for discounted Markov decision problems, Manage. Sci. 24 (1978) 1127-1137.
[49] E.D. Sacerdoti, Planning in a hierarchy of abstraction spaces, Artif. Intell. 5 (1974) 115-135.
[50] E.D. Sacerdoti, The nonlinear nature of plans, in: Proceedings IJCAI-75, Tbilisi (1975) 206-214.
[51] M.J. Schoppers, Universal plans for reactive robots in unpredictable environments, in: Proceedings IJCAI-87, Milan (1987) 1039-1046.
[52] P.L. Schweitzer, M.L. Puterman and K.W. Kindle, Iterative aggregation-disaggregation procedures for discounted semi-Markov reward processes, Oper. Res. 33 (1985) 589-605.
[53] S.P. Singh, T. Jaakkola and M.I. Jordan, Reinforcement learning with soft state aggregation, in: S.J. Hanson, J.D. Cowan and C.L. Giles, eds., Advances in Neural Information Processing Systems 7 (Morgan Kaufmann, San Mateo, CA, 1994).
[54] R.D. Smallwood and E.J. Sondik, The optimal control of partially observable Markov processes over a finite horizon, Oper. Res. 21 (1973) 1071-1088.
[55] D.E. Smith and M.A. Peot, Postponing threats in partial-order planning, in: Proceedings AAAI-93, Washington, DC (1993) 500-506.
[56] J. Tash and S.J. Russell, Control strategies for a stochastic planner, in: Proceedings AAAI-94, Seattle, WA (1994) 1079-1085.