SCOPUS Information Retrieval Platform

Journal of Artificial Intelligence Research

Volume 11, 1999, Pages 1-94

Decision-Theoretic Planning: Structural Assumptions and Computational Leverage

(3) Boutilier, Craig (a); Dean, Thomas (b); Hanks, Steve (c)

a University of British Columbia (Canada)

b Brown University (United States)

c University of Washington (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 0346942368 PISSN: 10769757 EISSN: None Source Type: Journal
DOI: 10.1613/jair.575 Document Type: Article

Times cited : (786)

References (158)

1
- 0003436260
- Morgan-Kaufmann, San Mateo
- Allen, J., Hendler, J., & Tate, A. (Eds.). (1990). Readings in Planning. Morgan-Kaufmann, San Mateo.
- (1990) Readings in Planning
- Allen, J.¹ Hendler, J.² Tate, A.³

2
- 50549213583
- Optimal control of Markov decision processes with incomplete state estimation
- Åström, K. J. (1965). Optimal control of Markov decision processes with incomplete state estimation. J. Math. Anal. Appl., 10, 174-205.
- (1965) J. Math. Anal. Appl. , vol.10 , pp. 174-205
- Åström, K.J.¹

3
- 0030352106
- Rewarding behaviors
- Portland, OR
- Bacchus, F., Boutilier, C., & Grove, A. (1996). Rewarding behaviors. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pp. 1160-1167 Portland, OR.
- (1996) Proceedings of the Thirteenth National Conference on Artificial Intelligence , pp. 1160-1167
- Bacchus, F.¹ Boutilier, C.² Grove, A.³

4
- 0031379970
- Structured solution methods for non-Markovian decision processes
- Providence, RI
- Bacchus, F., Boutilier, C., & Grove, A. (1997). Structured solution methods for non-Markovian decision processes. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pp. 112-117 Providence, RI.
- (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence , pp. 112-117
- Bacchus, F.¹ Boutilier, C.² Grove, A.³

5
- 0001951408
- Using temporal logic to control search in a forward chaining planner
- Assisi, Italy
- Bacchus, F., & Kabanza, F. (1995). Using temporal logic to control search in a forward chaining planner. In Proceedings of the Third European Workshop on Planning (EWSP'95) Assisi, Italy. Available via the URL ftp://logos.uwaterloo.ca:/pub/tlplan/tlplan.ps.Z.
- (1995) Proceedings of the Third European Workshop on Planning (EWSP'95)
- Bacchus, F.¹ Kabanza, F.²

6
- 85166380785
- Making forward chaining relevant
- Pittsburgh, PA
- Bacchus, F., & Teh, Y. W. (1998). Making forward chaining relevant. In Proceedings of the Fourth International Conference on AI Planning Systems, pp. 54-61 Pittsburgh, PA.
- (1998) Proceedings of the Fourth International Conference on AI Planning Systems , pp. 54-61
- Bacchus, F.¹ Teh, Y.W.²

7
- 0027880685
- Algebraic decision diagrams and their applications
- IEEE
- Bahar, R. I., Frohm, E. A., Gaona, C. M., Hachtel, G. D., Macii, E., Pardo, A., & Somenzi, F. (1993). Algebraic decision diagrams and their applications. In International Conference on Computer-Aided Design, pp. 188-191. IEEE.
- (1993) International Conference on Computer-Aided Design , pp. 188-191
- Bahar, R.I.¹ Frohm, E.A.² Gaona, C.M.³ Hachtel, G.D.⁴ Macii, E.⁵ Pardo, A.⁶ Somenzi, F.⁷

8
- 0026153773
- Nonmonotonic reasoning in the framework of the situation calculus
- Baker, A. B. (1991). Nonmonotonic reasoning in the framework of the situation calculus. Artificial Intelligence, 49, 5-23.
- (1991) Artificial Intelligence , vol.49 , pp. 5-23
- Baker, A.B.¹

9
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1-2), 81-138.
- (1995) Artificial Intelligence , vol.72 , Issue.1-2 , pp. 81-138
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

10
- 85012688561
- Princeton University Press, Princeton, NJ
- Bellman, R. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ.
- (1957) Dynamic Programming
- Bellman, R.¹

11
- 0024680419
- Adaptive aggregation for infinite horizon dynamic programming
- Bertsekas, D. P., & Castanon, D. A. (1989). Adaptive aggregation for infinite horizon dynamic programming. IEEE Transactions on Automatic Control, 34(6), 589-598.
- (1989) IEEE Transactions on Automatic Control , vol.34 , Issue.6 , pp. 589-598
- Bertsekas, D.P.¹ Castanon, D.A.²

12
- 0004295484
- Prentice-Hall, Englewood Cliffs, NJ
- Bertsekas, D. P. (1987). Dynamic Programming. Prentice-Hall, Englewood Cliffs, NJ.
- (1987) Dynamic Programming
- Bertsekas, D.P.¹

13
- 0003487482
- Athena, Belmont, MA
- Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-dynamic Programming. Athena, Belmont, MA.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

14
- 0000615044
- Discrete dynamic programming
- Blackwell, D. (1962). Discrete dynamic programming. Annals of Mathematical Statistics, 33, 719-726.
- (1962) Annals of Mathematical Statistics , vol.33 , pp. 719-726
- Blackwell, D.¹

15
- 0001657540
- Fast planning through graph analysis
- Montreal, Canada
- Blum, A. L., & Furst, M. L. (1995). Fast planning through graph analysis. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1636-1642 Montreal, Canada.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1636-1642
- Blum, A.L.¹ Furst, M.L.²

16
- 0002098456
- Learning sorting and decision trees with POMDPs
- Madison, WI
- Bonet, B., & Geffner, H. (1998). Learning sorting and decision trees with POMDPs. In Proceedings of the Fifteenth International Conference on Machine Learning, pp. 73-81 Madison, WI.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning , pp. 73-81
- Bonet, B.¹ Geffner, H.²

17
- 0031385389
- A robust and fast action selection mechanism
- Providence, RI
- Bonet, B., Loerincs, G., & Geffner, H. (1997). A robust and fast action selection mechanism. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pp. 714-719 Providence, RI.
- (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence , pp. 714-719
- Bonet, B.¹ Loerincs, G.² Geffner, H.³

18
- 0342843072
- Correlated action effects in decision theoretic regression
- Providence, RI
- Boutilier, C. (1997). Correlated action effects in decision theoretic regression. In Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, pp. 30-37 Providence, RI.
- (1997) Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence , pp. 30-37
- Boutilier, C.¹

19
- 84880685295
- Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning
- Nagoya, Japan
- Boutilier, C., Brafman, R. I., & Geib, C. (1997). Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pp. 1156-1162 Nagoya, Japan.
- (1997) Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence , pp. 1156-1162
- Boutilier, C.¹ Brafman, R.I.² Geib, C.³

20
- 0001811022
- Structured reachability analysis for Markov decision processes
- Madison, WI
- Boutilier, C., Brafman, R. I., & Geib, C. (1998). Structured reachability analysis for Markov decision processes. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 24-32 Madison, WI.
- (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence , pp. 24-32
- Boutilier, C.¹ Brafman, R.I.² Geib, C.³

21
- 0028572333
- Using abstractions for decision-theoretic planning with time constraints
- Seattle, WA
- Boutilier, C., & Dearden, R. (1994). Using abstractions for decision-theoretic planning with time constraints. In Proceedings of the Twelfth National Conference on Artificial Intelligence, pp. 1016-1022 Seattle, WA.
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence , pp. 1016-1022
- Boutilier, C.¹ Dearden, R.²

22
- 0012352653
- Approximating value trees in structured dynamic programming
- Bari, Italy
- Boutilier, C., & Dearden, R. (1996). Approximating value trees in structured dynamic programming. In Proceedings of the Thirteenth International Conference on Machine Learning, pp. 54-62 Bari, Italy.
- (1996) Proceedings of the Thirteenth International Conference on Machine Learning , pp. 54-62
- Boutilier, C.¹ Dearden, R.²

23
- 85166207010
- Exploiting structure in policy construction
- Montreal, Canada
- Boutilier, C., Dearden, R., & Goldszmidt, M. (1995). Exploiting structure in policy construction. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1104-1111 Montreal, Canada.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1104-1111
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

24
- 0346108433
- manuscript
- Boutilier, C., Dearden, R., & Goldszmidt, M. (1999). Stochastic dynamic programming with factored representations, (manuscript).
- (1999) Stochastic Dynamic Programming with Factored Representations
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

25
- 0000675721
- Context-specific independence in Bayesian networks
- Portland, OR
- Boutilier, C., Friedman, N., Goldszmidt, M., & Koller, D. (1996). Context-specific independence in Bayesian networks. In Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence, pp. 115-123 Portland, OR.
- (1996) Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence , pp. 115-123
- Boutilier, C.¹ Friedman, N.² Goldszmidt, M.³ Koller, D.⁴

26
- 84957878011
- The frame problem and Bayesian network action representations
- Toronto
- Boutilier, C., & Goldszmidt, M. (1996). The frame problem and Bayesian network action representations. In Proceedings of the Eleventh Biennial Canadian Conference on Artificial Intelligence, pp. 69-83 Toronto.
- (1996) Proceedings of the Eleventh Biennial Canadian Conference on Artificial Intelligence , pp. 69-83
- Boutilier, C.¹ Goldszmidt, M.²

27
- 0030349220
- Computing optimal policies for partially observable decision processes using compact representations
- Portland, OR
- Boutilier, C., & Poole, D. (1996). Computing optimal policies for partially observable decision processes using compact representations. In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pp. 1168-1175 Portland, OR.
- (1996) Proceedings of the Thirteenth National Conference on Artificial Intelligence , pp. 1168-1175
- Boutilier, C.¹ Poole, D.²

28
- 85168106990
- Process-oriented planning and average-reward optimality
- Montreal, Canada
- Boutilier, C., & Puterman, M. L. (1995). Process-oriented planning and average-reward optimality. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1096-1103 Montreal, Canada.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1096-1103
- Boutilier, C.¹ Puterman, M.L.²

29
- 0031385391
- A heuristic variable-grid solution method for POMDPs
- Providence, RI
- Brafman, R. I. (1997). A heuristic variable-grid solution method for POMDPs. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pp. 727-733 Providence, RI.
- (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence , pp. 727-733
- Brafman, R.I.¹

30
- 0022769976
- Graph-based algorithms for Boolean function manipulation
- Bryant, R. E. (1986). Graph-based algorithms for Boolean function manipulation. IEEE Transactions on Computers, C-35(8), 677-691.
- (1986) IEEE Transactions on Computers , vol.C-35 , Issue.8 , pp. 677-691
- Bryant, R.E.¹

31
- 0028498153
- The computational complexity of propositional STRIPS planning
- Bylander, T. (1994). The computational complexity of propositional STRIPS planning. Artificial Intelligence, 69, 161-204.
- (1994) Artificial Intelligence , vol.69 , pp. 161-204
- Bylander, T.¹

32
- 0003541774
- Wiley, New York
- Caines, P. E. (1988). Linear stochastic systems. Wiley, New York.
- (1988) Linear Stochastic Systems
- Caines, P.E.¹

33
- 0028564629
- Acting optimally in partially observable stochastic domains
- Seattle, WA
- Cassandra, A. R., Kaelbling, L. P., & Littman, M. L. (1994). Acting optimally in partially observable stochastic domains. In Proceedings of the Twelfth National Conference on Artificial Intelligence, pp. 1023-1028 Seattle, WA.
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence , pp. 1023-1028
- Cassandra, A.R.¹ Kaelbling, L.P.² Littman, M.L.³

34
- 0001909869
- Incremental pruning: A simple, fast, exact method for POMDPs
- Providence, RI
- Cassandra, A. R., Littman, M. L., & Zhang, N. L. (1997). Incremental pruning: A simple, fast, exact method for POMDPs. In Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, pp. 54-61 Providence, RI.
- (1997) Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence , pp. 54-61
- Cassandra, A.R.¹ Littman, M.L.² Zhang, N.L.³

35
- 0023381915
- Planning for conjunctive goals
- Chapman, D. (1987). Planning for conjunctive goals. Artificial Intelligence, 32(3), 333-377.
- (1987) Artificial Intelligence , vol.32 , Issue.3 , pp. 333-377
- Chapman, D.¹

36
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
- Sydney, Australia
- Chapman, D., & Kaelbling, L. P. (1991). Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proceedings of the Twelfth International Joint Conference on Artificial Intelligence, pp. 726-731 Sydney, Australia.
- (1991) Proceedings of the Twelfth International Joint Conference on Artificial Intelligence , pp. 726-731
- Chapman, D.¹ Kaelbling, L.P.²

37
- 0001391104
- Decomposition principle for linear programs
- Dantzig, G., & Wolfe, P. (1960). Decomposition principle for linear programs. Operations Research, 8(1), 101-111.
- (1960) Operations Research , vol.8 , Issue.1 , pp. 101-111
- Dantzig, G.¹ Wolfe, P.²

38
- 0003472432
- Benjamin Cummings
- Dean, T., Allen, J., & Aloimonos, Y. (1995). Artificial Intelligence: Theory and Practice. Benjamin Cummings.
- (1995) Artificial Intelligence: Theory and Practice
- Dean, T.¹ Allen, J.² Aloimonos, Y.³

39
- 0031370386
- Model minimization in Markov decision processes
- Providence, RI. AAAI
- Dean, T., & Givan, R. (1997). Model minimization in Markov decision processes. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pp. 106-111 Providence, RI. AAAI.
- (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence , pp. 106-111
- Dean, T.¹ Givan, R.²

40
- 85166375107
- Solving planning problems with large state and action spaces
- Pittsburgh, PA
- Dean, T., Givan, R., & Kim, K.-E. (1998). Solving planning problems with large state and action spaces. In Proceedings of the Fourth International Conference on AI Planning Systems, pp. 102-110 Pittsburgh, PA.
- (1998) Proceedings of the Fourth International Conference on AI Planning Systems , pp. 102-110
- Dean, T.¹ Givan, R.² Kim, K.-E.³

41
- 0000746330
- Model reduction techniques for computing approximately optimal solutions for Markov decision processes
- Providence, RI
- Dean, T., Givan, R., & Leach, S. (1997). Model reduction techniques for computing approximately optimal solutions for Markov decision processes. In Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, pp. 124-131 Providence, RI.
- (1997) Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence , pp. 124-131
- Dean, T.¹ Givan, R.² Leach, S.³

42
- 0027708037
- Planning with deadlines in stochastic domains
- Dean, T., Kaelbling, L., Kirman, J., & Nicholson, A. (1993). Planning with deadlines in stochastic domains. In Proceedings of the Eleventh National Conference on Artificial Intelligence, pp. 574-579.
- (1993) Proceedings of the Eleventh National Conference on Artificial Intelligence , pp. 574-579
- Dean, T.¹ Kaelbling, L.² Kirman, J.³ Nicholson, A.⁴

43
- 0029332887
- Planning under time constraints in stochastic domains
- Dean, T., Kaelbling, L., Kirman, J., & Nicholson, A. (1995). Planning under time constraints in stochastic domains. Artificial Intelligence, 70(1-2), 3-74.
- (1995) Artificial Intelligence , vol.70 , Issue.1-2 , pp. 3-74
- Dean, T.¹ Kaelbling, L.² Kirman, J.³ Nicholson, A.⁴

44
- 84990553353
- A model for reasoning about persistence and causation
- Dean, T., & Kanazawa, K. (1989). A model for reasoning about persistence and causation. Computational Intelligence, 5(3), 142-150.
- (1989) Computational Intelligence , vol.5 , Issue.3 , pp. 142-150
- Dean, T.¹ Kanazawa, K.²

45
- 85168151397
- Decomposition techniques for planning in stochastic domains
- Dean, T., & Lin, S.-H. (1995). Decomposition techniques for planning in stochastic domains. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1121-1127.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1121-1127
- Dean, T.¹ Lin, S.-H.²

46
- 0004240515
- Morgan Kaufmann, San Mateo, California
- Dean, T., & Wellman, M. (1991). Planning and Control. Morgan Kaufmann, San Mateo, California.
- (1991) Planning and Control
- Dean, T.¹ Wellman, M.²

47
- 0004865142
- Integrating planning and execution in stochastic domains
- Washington, DC
- Dearden, R., & Boutilier, C. (1994). Integrating planning and execution in stochastic domains. In Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence, pp. 162-169 Washington, DC.
- (1994) Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence , pp. 162-169
- Dearden, R.¹ Boutilier, C.²

48
- 0030697013
- Abstraction and approximate decision theoretic planning
- Dearden, R., & Boutilier, C. (1997). Abstraction and approximate decision theoretic planning. Artificial Intelligence, 89, 219-283.
- (1997) Artificial Intelligence , vol.89 , pp. 219-283
- Dearden, R.¹ Boutilier, C.²

49
- 0002251094
- Bucket elimination: A unifying framework for probabilistic inference
- Portland, OR
- Dechter, R. (1996). Bucket elimination: A unifying framework for probabilistic inference. In Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence, pp. 211-219 Portland, OR.
- (1996) Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence , pp. 211-219
- Dechter, R.¹

50
- 84880665054
- Mini-buckets: A general scheme for generating approximations in automated reasoning in probabilistic inference
- Nagoya, Japan
- Dechter, R. (1997). Mini-buckets: A general scheme for generating approximations in automated reasoning in probabilistic inference. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pp. 1297-1302 Nagoya, Japan.
- (1997) Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence , pp. 1297-1302
- Dechter, R.¹

51
- 0006464452
- Sur un problème de production et de stockage dans l'aléatoire
- D'Epenoux, F. (1963). Sur un problème de production et de stockage dans l'aléatoire. Management Science, 10, 98-108.
- (1963) Management Science , vol.10 , pp. 98-108
- D'Epenoux, F.¹

52
- 0037919267
- Explanation-based learning and reinforcement learning: A unified approach
- Lake Tahoe, NV
- Dietterich, T. G., & Flann, N. S. (1995). Explanation-based learning and reinforcement learning: A unified approach. In Proceedings of the Twelfth International Conference on Machine Learning, pp. 176-184 Lake Tahoe, NV.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 176-184
- Dietterich, T.G.¹ Flann, N.S.²

53
- 0043046674
- A probabilistic model of action for least-commitment planning with information gathering
- Washington, DC
- Draper, D., Hanks, S., & Weld, D. (1994a). A probabilistic model of action for least-commitment planning with information gathering. In Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence, pp. 178-186 Washington, DC.
- (1994) Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence , pp. 178-186
- Draper, D.¹ Hanks, S.² Weld, D.³

54
- 84943292752
- Probabilistic planning with information gathering and contingent execution
- Draper, D., Hanks, S., & Weld, D. (1994b). Probabilistic planning with information gathering and contingent execution. In Proceedings of the Second International Conference on AI Planning Systems, pp. 31-36.
- (1994) Proceedings of the Second International Conference on AI Planning Systems , pp. 31-36
- Draper, D.¹ Hanks, S.² Weld, D.³

55
- 0002555157
- An approach to planning with incomplete information
- Boston, MA
- Etzioni, O., Hanks, S., Weld, D., Draper, D., Lesh, N., & Williamson, M. (1992). An approach to planning with incomplete information. In Proceedings of the Third International Conference on Principles of Knowledge Representation and Reasoning, pp. 115-125 Boston, MA.
- (1992) Proceedings of the Third International Conference on Principles of Knowledge Representation and Reasoning , pp. 115-125
- Etzioni, O.¹ Hanks, S.² Weld, D.³ Draper, D.⁴ Lesh, N.⁵ Williamson, M.⁶

56
- 0015440625
- Learning and executing generalized robot plans
- Fikes, R., Hart, P., & Nilsson, N. (1972). Learning and executing generalized robot plans. Artificial Intelligence, 3, 251-288.
- (1972) Artificial Intelligence , vol.3 , pp. 251-288
- Fikes, R.¹ Hart, P.² Nilsson, N.³

57
- 2842560201
- STRIPS: A new approach to the application of theorem proving to problem solving
- Fikes, R., & Nilsson, N. J. (1971). STRIPS: A new approach to the application of theorem proving to problem solving. Artificial Intelligence, 2, 189-208.
- (1971) Artificial Intelligence , vol.2 , pp. 189-208
- Fikes, R.¹ Nilsson, N.J.²

58
- 0009210299
- Ph.D. thesis, Stanford University, Stanford
- Finger, J. (1986). Exploiting Constraints in Design Synthesis. Ph.D. thesis, Stanford University, Stanford.
- (1986) Exploiting Constraints in Design Synthesis
- Finger, J.¹

59
- 84945709831
- Algorithm 97 (shortest path)
- Floyd, R. W. (1962). Algorithm 97 (shortest path). Communications of the ACM, 5(6), 345.
- (1962) Communications of the ACM , vol.5 , Issue.6 , pp. 345
- Floyd, R.W.¹

60
- 0344030849
- An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix
- Fox, B. L., & Landi, D. M. (1968). An algorithm for identifying the ergodic subchains and transient states of a stochastic matrix. Communications of the ACM, 11, 619-621.
- (1968) Communications of the ACM , vol.11 , pp. 619-621
- Fox, B.L.¹ Landi, D.M.²

61
- 0004232519
- Halsted Press, New York
- French, S. (1986). Decision Theory. Halsted Press, New York.
- (1986) Decision Theory
- French, S.¹

62
- 0040731126
- Advances in probabilistic reasoning
- Los Angeles, CA
- Geiger, D., & Heckerman, D. (1991). Advances in probabilistic reasoning. In Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence, pp. 118-126 Los Angeles, CA.
- (1991) Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence , pp. 118-126
- Geiger, D.¹ Heckerman, D.²

63
- 84880654869
- Model minimization, regression, and propositional STRIPS planning
- Nagoya, Japan
- Givan, R., & Dean, T. (1997). Model minimization, regression, and propositional STRIPS planning. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pp. 1163-1168 Nagoya, Japan.
- (1997) Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence , pp. 1163-1168
- Givan, R.¹ Dean, T.²

64
- 33644680515
- Bounded-parameter Markov decision processes
- Toulouse, France
- Givan, R., Leach, S., & Dean, T. (1997). Bounded-parameter Markov decision processes. In Proceedings of the Fourth European Conference on Planning (ECP'97), pp. 234-246 Toulouse, France.
- (1997) Proceedings of the Fourth European Conference on Planning (ECP'97) , pp. 234-246
- Givan, R.¹ Leach, S.² Dean, T.³

65
- 0038595408
- Representing uncertainty in simple planners
- Bonn, Germany
- Goldman, R. P., & Boddy, M. S. (1994). Representing uncertainty in simple planners. In Proceedings of the Fourth International Conference on Principles of Knowledge Representation and Reasoning, pp. 238-245 Bonn, Germany.
- (1994) Proceedings of the Fourth International Conference on Principles of Knowledge Representation and Reasoning , pp. 238-245
- Goldman, R.P.¹ Boddy, M.S.²

66
- 0040083848
- Abstracting probabilistic actions
- Washington, DC
- Haddawy, P., & Doan, A. (1994). Abstracting probabilistic actions. In Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence, pp. 270-277 Washington, DC.
- (1994) Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence , pp. 270-277
- Haddawy, P.¹ Doan, A.²

67
- 0032137162
- Utility Models for Goal-Directed Decision-Theoretic Planners
- Haddawy, P., & Hanks, S. (1998). Utility Models for Goal-Directed Decision-Theoretic Planners. Computational Intelligence, 14(3).
- (1998) Computational Intelligence , vol.14 , Issue.3
- Haddawy, P.¹ Hanks, S.²

68
- 85166983383
- Decision-theoretic refinement planning using inheritance abstraction
- Chicago, IL
- Haddawy, P., & Suwandi, M. (1994). Decision-theoretic refinement planning using inheritance abstraction. In Proceedings of the Second International Conference on AI Planning Systems, pp. 266-271 Chicago, IL.
- (1994) Proceedings of the Second International Conference on AI Planning Systems , pp. 266-271
- Haddawy, P.¹ Suwandi, M.²

69
- 0038256822
- Ph.D. thesis 756, Yale University, Department of Computer Science, New Haven, CT
- Hanks, S. (1990). Projecting plans for uncertain worlds. Ph.D. thesis 756, Yale University, Department of Computer Science, New Haven, CT.
- (1990) Projecting Plans for Uncertain Worlds
- Hanks, S.¹

70
- 0028400910
- Modeling a dynamic and uncertain world I: Symbolic and probabilistic reasoning about change
- Hanks, S., & McDermott, D. V. (1994). Modeling a dynamic and uncertain world I: Symbolic and probabilistic reasoning about change. Artificial Intelligence, 66(1), 1-55.
- (1994) Artificial Intelligence , vol.66 , Issue.1 , pp. 1-55
- Hanks, S.¹ McDermott, D.V.²

71
- 0346108430
- AAAI Press, Menlo Park
- Hanks, S., Russell, S., & Wellman, M. (Eds.). (1994). Decision Theoretic Planning: Proceedings of the AAAI Spring Symposium. AAAI Press, Menlo Park.
- (1994) Decision Theoretic Planning: Proceedings of the AAAI Spring Symposium
- Hanks, S.¹ Russell, S.² Wellman, M.³

72
- 0031623876
- Heuristic search in cyclic AND/OR graphs
- Madison, WI
- Hansen, E. A., & Zilberstein, S. (1998). Heuristic search in cyclic AND/OR graphs. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pp. 412-418 Madison, WI.
- (1998) Proceedings of the Fifteenth National Conference on Artificial Intelligence , pp. 412-418
- Hansen, E.A.¹ Zilberstein, S.²

73
- 0031385618
- Incremental methods for computing bounds in partially observable Markov decision processes
- Providence, RI
- Hauskrecht, M. (1997). Incremental methods for computing bounds in partially observable Markov decision processes. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pp. 734-739 Providence, RI.
- (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence , pp. 734-739
- Hauskrecht, M.¹

74
- 0003613101
- Ph.D. thesis, Massachusetts Institute of Technology, Cambridge
- Hauskrecht, M. (1998). Planning and Control in Stochastic Domains with Imperfect Information. Ph.D. thesis, Massachusetts Institute of Technology, Cambridge.
- (1998) Planning and Control in Stochastic Domains with Imperfect Information
- Hauskrecht, M.¹

75
- 0006419533
- Hierarchical solution of Markov decision processes using macro-actions
- Madison, WI
- Hauskrecht, M., Meuleau, N., Kaelbling, L. P., Dean, T., & Boutilier, C. (1998). Hierarchical solution of Markov decision processes using macro-actions. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 220-229 Madison, WI.
- (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence , pp. 220-229
- Hauskrecht, M.¹ Meuleau, N.² Kaelbling, L.P.³ Dean, T.⁴ Boutilier, C.⁵

76
- 0002956570
- SPUDD: Stochastic planning using decision diagrams
- To appear
- Hoey, J., St-Aubin, R., Hu, A., & Boutilier, C. (1999). SPUDD: Stochastic planning using decision diagrams. In Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence Stockholm. To appear.
- (1999) Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence Stockholm
- Hoey, J.¹ St-Aubin, R.² Hu, A.³ Boutilier, C.⁴

77
- 0003644124
- MIT Press, Cambridge, Massachusetts
- Howard, R. A. (1960). Dynamic Programming and Markov Processes. MIT Press, Cambridge, Massachusetts.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

78
- 0000086731
- Influence diagrams
- Howard, R. A., & Matheson, J. E. (Eds.). Strategic Decisions Group, Menlo Park, CA
- Howard, R. A., & Matheson, J. E. (1984). Influence diagrams. In Howard, R. A., & Matheson, J. E. (Eds.), The Principles and Applications of Decision Analysis. Strategic Decisions Group, Menlo Park, CA.
- (1984) The Principles and Applications of Decision Analysis
- Howard, R.A.¹ Matheson, J.E.²

79
- 11544363000
- Refinement planning as a unifying framework for plan synthesis
- Kambhampati, S. (1997). Refinement planning as a unifying framework for plan synthesis. AI Magazine, Summer 1997, 67-97.
- (1997) AI Magazine , vol.SUMMER 1997 , pp. 67-97
- Kambhampati, S.¹

80
- 84880649215
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- To appear
- Kearns, M., Mansour, Y., & Ng, A. Y. (1999). A sparse sampling algorithm for near-optimal planning in large Markov decision processes. In Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence Stockholm. To appear.
- (1999) Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence Stockholm
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

81
- 0004001439
- John Wiley and Sons, New York
- Keeney, R. L., & Raiffa, H. (1976). Decisions with Multiple Objectives: Preferences and Value Tradeoffs. John Wiley and Sons, New York.
- (1976) Decisions with Multiple Objectives: Preferences and Value Tradeoffs
- Keeney, R.L.¹ Raiffa, H.²

82
- 0002874631
- A computational scheme for reasoning in dynamic probabilistic networks
- Stanford
- Kjaerulff, U. (1992). A computational scheme for reasoning in dynamic probabilistic networks. In Proceedings of the Eighth Conference on Uncertainty in AI, pp. 121-129 Stanford.
- (1992) Proceedings of the Eighth Conference on Uncertainty in AI , pp. 121-129
- Kjaerulff, U.¹

83
- 0346108427
- Kluwer, Boston
- Knoblock, C. A. (1993). Generating Abstraction Hierarchies: An Automated Approach to Reducing Search in Planning. Kluwer, Boston.
- (1993) Generating Abstraction Hierarchies: An Automated Approach to Reducing Search in Planning
- Knoblock, C.A.¹

84
- 0039936301
- Characterizing abstraction hierarchies for planning
- Anaheim, CA
- Knoblock, C. A., Tenenberg, J. D., & Yang, Q. (1991). Characterizing abstraction hierarchies for planning. In Proceedings of the Ninth National Conference on Artificial Intelligence, pp. 692-697 Anaheim, CA.
- (1991) Proceedings of the Ninth National Conference on Artificial Intelligence , pp. 692-697
- Knoblock, C.A.¹ Tenenberg, J.D.² Yang, Q.³

85
- 0347369286
- M.Sc. thesis UCB/CSD-92-685, University of California at Berkeley, Computer Science Department
- Koenig, S. (1991). Optimal probabilistic and decision-theoretic planning using Markovian decision theory. M.Sc. thesis UCB/CSD-92-685, University of California at Berkeley, Computer Science Department.
- (1991) Optimal Probabilistic and Decision-theoretic Planning Using Markovian Decision Theory
- Koenig, S.¹

86
- 0002038863
- Real-time search in nondeterministic domains
- Montreal, Canada
- Koenig, S., & Simmons, R. (1995). Real-time search in nondeterministic domains. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1660-1667 Montreal, Canada.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1660-1667
- Koenig, S.¹ Simmons, R.²

87
- 0022045044
- Macro-operators: A weak method for learning
- Korf, R. (1985). Macro-operators: A weak method for learning. Artificial Intelligence, 26, 35-77.
- (1985) Artificial Intelligence , vol.26 , pp. 35-77
- Korf, R.¹

88
- 0025400088
- Real-time heuristic search
- Korf, R. E. (1990). Real-time heuristic search. Artificial Intelligence, 42, 189-211.
- (1990) Artificial Intelligence , vol.42 , pp. 189-211
- Korf, R.E.¹

89
- 0029333536
- An algorithm for probabilistic planning
- Kushmerick, N., Hanks, S., & Weld, D. (1995). An algorithm for probabilistic planning. Artificial Intelligence, 76, 239-286.
- (1995) Artificial Intelligence , vol.76 , pp. 239-286
- Kushmerick, N.¹ Hanks, S.² Weld, D.³

90
- 0016114997
- Decomposition of systems governed by Markov chains
- Kushner, H. J., & Chen, C.-H. (1974). Decomposition of systems governed by Markov chains. IEEE Transactions on Automatic Control, 19(5), 501-507.
- (1974) IEEE Transactions on Automatic Control , vol.19 , Issue.5 , pp. 501-507
- Kushner, H.J.¹ Chen, C.-H.²

91
- 0026998940
- Online minimization of transition systems
- Victoria, BC
- Lee, D., & Yannakakis, M. (1992). Online minimization of transition systems. In Proceedings of 24th Annual ACM Symposium on the Theory of Computing, pp. 264-274 Victoria, BC.
- (1992) Proceedings of 24th Annual ACM Symposium on the Theory of Computing , pp. 264-274
- Lee, D.¹ Yannakakis, M.²

92
- 0000955093
- State constraints revisited
- Lin, F., & Reiter, R. (1994). State constraints revisited. Journal of Logic and Computation, 4(5), 655-678.
- (1994) Journal of Logic and Computation , vol.4 , Issue.5 , pp. 655-678
- Lin, F.¹ Reiter, R.²

93
- 0347369287
- Ph.D. thesis, Department of Computer Science, Brown University
- Lin, S.-H. (1997). Exploiting Structure for Planning and Control. Ph.D. thesis, Department of Computer Science, Brown University.
- (1997) Exploiting Structure for Planning and Control
- Lin, S.-H.¹

94
- 0037919269
- Generating optimal policies for high-level plans with conditional branches and loops
- Lin, S.-H., & Dean, T. (1995). Generating optimal policies for high-level plans with conditional branches and loops. In Proceedings of the Third European Workshop on Planning (EWSP'95), pp. 187-200.
- (1995) Proceedings of the Third European Workshop on Planning (EWSP'95) , pp. 187-200
- Lin, S.-H.¹ Dean, T.²

95
- 0031369472
- Probabilistic propositional planning: Representations and complexity
- Providence, RI
- Littman, M. L. (1997). Probabilistic propositional planning: Representations and complexity. In Proceedings of the Fourteenth National Conference on Artificial Intelligence, pp. 748-754 Providence, RI.
- (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence , pp. 748-754
- Littman, M.L.¹

96
- 0002290970
- On the complexity of solving Markov decision problems
- Montreal, Canada
- Littman, M. L., Dean, T. L., & Kaelbling, L. P. (1995). On the complexity of solving Markov decision problems. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 394-402 Montreal, Canada.
- (1995) Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence , pp. 394-402
- Littman, M.L.¹ Dean, T.L.² Kaelbling, L.P.³

97
- 0003861655
- Ph.D. thesis CS-96-09, Brown University, Department of Computer Science, Providence, RI
- Littman, M. L. (1996). Algorithms for sequential decision making. Ph.D. thesis CS-96-09, Brown University, Department of Computer Science, Providence, RI.
- (1996) Algorithms for Sequential Decision Making
- Littman, M.L.¹

98
- 0000494894
- Computationally feasible bounds for partially observed Markov decision processes
- Lovejoy, W. S. (1991a). Computationally feasible bounds for partially observed Markov decision processes. Operations Research, 39(1), 162-175.
- (1991) Operations Research , vol.39 , Issue.1 , pp. 162-175
- Lovejoy, W.S.¹

99
- 0002679852
- A survey of algorithmic methods for partially observed Markov decision processes
- Lovejoy, W. S. (1991b). A survey of algorithmic methods for partially observed Markov decision processes. Annals of Operations Research, 28, 47-66.
- (1991) Annals of Operations Research , vol.28 , pp. 47-66
- Lovejoy, W.S.¹

100
- 0003488909
- Addison-Wesley, Reading, Massachusetts
- Luenberger, D. G. (1973). Introduction to Linear and Nonlinear Programming. Addison-Wesley, Reading, Massachusetts.
- (1973) Introduction to Linear and Nonlinear Programming
- Luenberger, D.G.¹

101
- 0003881809
- Wiley, New York
- Luenberger, D. G. (1979). Introduction to Dynamic Systems: Theory, Models and Applications. Wiley, New York.
- (1979) Introduction to Dynamic Systems: Theory, Models and Applications
- Luenberger, D.G.¹

102
- 0032596468
- On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems
- Orlando, FL. To appear
- Madani, O., Condon, A., & Hanks, S. (1999). On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems. In Proceedings of the Sixteenth National Conference on Artificial Intelligence Orlando, FL. To appear.
- (1999) Proceedings of the Sixteenth National Conference on Artificial Intelligence
- Madani, O.¹ Condon, A.² Hanks, S.³

103
- 0010853273
- To discount or not to discount in reinforcement learning: A case study in comparing R-learning and Q-learning
- New Brunswick, NJ
- Mahadevan, S. (1994). To discount or not to discount in reinforcement learning: A case study in comparing R-learning and Q-learning. In Proceedings of the Eleventh International Conference on Machine Learning, pp. 164-172 New Brunswick, NJ.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 164-172
- Mahadevan, S.¹

104
- 0342772590
- Systematic nonlinear planning
- Anaheim, CA
- McAllester, D., & Rosenblitt, D. (1991). Systematic nonlinear planning. In Proceedings of the Ninth National Conference on Artificial Intelligence, pp. 634-639 Anaheim, CA.
- (1991) Proceedings of the Ninth National Conference on Artificial Intelligence , pp. 634-639
- McAllester, D.¹ Rosenblitt, D.²

105
- 2342482919
- Instance-based utile distinctions for reinforcement learning with hidden state
- Lake Tahoe, Nevada
- McCallum, R. A. (1995). Instance-based utile distinctions for reinforcement learning with hidden state. In Proceedings of the Twelfth International Conference on Machine Learning, pp. 387-395 Lake Tahoe, Nevada.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 387-395
- McCallum, R.A.¹

106
- 0014638440
- Some philosophical problems from the standpoint of artificial intelligence
- McCarthy, J., & Hayes, P. J. (1969). Some philosophical problems from the standpoint of artificial intelligence. Machine Intelligence, 4, 463-502.
- (1969) Machine Intelligence , vol.4 , pp. 463-502
- McCarthy, J.¹ Hayes, P.J.²

107
- 0031632806
- Solving very large weakly coupled Markov decision processes
- Madison, WI
- Meuleau, N., Hauskrecht, M., Kim, K., Peshkin, L., Kaelbling, L., Dean, T., & Boutilier, C. (1998). Solving very large weakly coupled Markov decision processes. In Proceedings of the Fifteenth National Conference on Artificial Intelligence, pp. 165-172 Madison, WI.
- (1998) Proceedings of the Fifteenth National Conference on Artificial Intelligence , pp. 165-172
- Meuleau, N.¹ Hauskrecht, M.² Kim, K.³ Peshkin, L.⁴ Kaelbling, L.⁵ Dean, T.⁶ Boutilier, C.⁷

108
- 0029514510
- The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces
- Moore, A. W., & Atkeson, C. G. (1995). The parti-game algorithm for variable resolution reinforcement learning in multidimensional state spaces. Machine Learning, 21, 199-234.
- (1995) Machine Learning , vol.21 , pp. 199-234
- Moore, A.W.¹ Atkeson, C.G.²

109
- 0000977910
- The complexity of Markov chain decision processes
- Papadimitriou, C. H., & Tsitsiklis, J. N. (1987). The complexity of Markov chain decision processes. Mathematics of Operations Research, 12(3), 441-450.
- (1987) Mathematics of Operations Research , vol.12 , Issue.3 , pp. 441-450
- Papadimitriou, C.H.¹ Tsitsiklis, J.N.²

110
- 0346738900
- Flexible decomposition algorithms for weakly coupled Markov decision processes
- Madison, WI
- Parr, R. (1998). Flexible decomposition algorithms for weakly coupled Markov decision processes. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 422-430 Madison, WI.
- (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence , pp. 422-430
- Parr, R.¹

111
- 85168129602
- Approximating optimal policies for partially observable stochastic domains
- Montreal
- Parr, R., & Russell, S. (1995). Approximating optimal policies for partially observable stochastic domains. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1088-1094 Montreal.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1088-1094
- Parr, R.¹ Russell, S.²

112
- 84898956770
- Reinforcement learning with hierarchies of machines
- Jordan, M., Kearns, M., & Solla, S. (Eds.). MIT Press, Cambridge
- Parr, R., & Russell, S. (1998). Reinforcement learning with hierarchies of machines. In Jordan, M., Kearns, M., & Solla, S. (Eds.), Advances in Neural Information Processing Systems 10, pp. 1043-1049. MIT Press, Cambridge.
- (1998) Advances in Neural Information Processing Systems 10 , pp. 1043-1049
- Parr, R.¹ Russell, S.²

113
- 0003391330
- Morgan Kaufmann, San Mateo
- Pearl, J. (1988). Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann, San Mateo.
- (1988) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
- Pearl, J.¹

114
- 0000532979
- ADL: Exploring the middle ground between STRIPS and the situation calculus
- Toronto, Canada
- Pednault, E. (1989). ADL: Exploring the middle ground between STRIPS and the situation calculus. In Proceedings of the First International Conference on Principles of Knowledge Representation and Reasoning, pp. 324-332 Toronto, Canada.
- (1989) Proceedings of the First International Conference on Principles of Knowledge Representation and Reasoning , pp. 324-332
- Pednault, E.¹

115
- 0002489296
- UCPOP: A sound, complete, partial order planner for ADL
- Boston, MA
- Penberthy, J. S., & Weld, D. S. (1992). UCPOP: A sound, complete, partial order planner for ADL. In Proceedings of the Third International Conference on Principles of Knowledge Representation and Reasoning, pp. 103-114 Boston, MA.
- (1992) Proceedings of the Third International Conference on Principles of Knowledge Representation and Reasoning , pp. 103-114
- Penberthy, J.S.¹ Weld, D.S.²

116
- 0026992168
- Conditional nonlinear planning
- College Park, MD
- Peot, M., & Smith, D. (1992). Conditional nonlinear planning. In Proceedings of the First International Conference on AI Planning Systems, pp. 189-197 College Park, MD.
- (1992) Proceedings of the First International Conference on AI Planning Systems , pp. 189-197
- Peot, M.¹ Smith, D.²

117
- 0010402034
- Control knowledge to improve plan quality
- Chicago, IL
- Perez, M. A., & Carbonell, J. G. (1994). Control knowledge to improve plan quality. In Proceedings of the Second International Conference on AI Planning Systems, pp. 323-328 Chicago, IL.
- (1994) Proceedings of the Second International Conference on AI Planning Systems , pp. 323-328
- Perez, M.A.¹ Carbonell, J.G.²

118
- 0040831492
- Exploiting the rule structure for decision making within the independent choice logic
- Montreal, Canada
- Poole, D. (1995). Exploiting the rule structure for decision making within the independent choice logic. In Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 454-463 Montreal, Canada.
- (1995) Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence , pp. 454-463
- Poole, D.¹

119
- 0031187203
- The independent choice logic for modelling multiple agents under uncertainty
- Poole, D. (1997a). The independent choice logic for modelling multiple agents under uncertainty. Artificial Intelligence, 94(1-2), 7-56.
- (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 7-56
- Poole, D.¹

120
- 84880668827
- Probabilistic partial evaluation: Exploiting rule structure in probabilistic inference
- Nagoya, Japan
- Poole, D. (1997b). Probabilistic partial evaluation: Exploiting rule structure in probabilistic inference. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence, pp. 1284-1291 Nagoya, Japan.
- (1997) Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence , pp. 1284-1291
- Poole, D.¹

121
- 0006455676
- Context-specific approximation in probabilistic inference
- Madison, WI
- Poole, D. (1998). Context-specific approximation in probabilistic inference. In Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence, pp. 447-454 Madison, WI.
- (1998) Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence , pp. 447-454
- Poole, D.¹

122
- 84957069070
- Theoretical results on reinforcement learning with temporally abstract behaviors
- Chemnitz, Germany
- Precup, D., Sutton, R. S., & Singh, S. (1998). Theoretical results on reinforcement learning with temporally abstract behaviors. In Proceedings of the Tenth European Conference on Machine Learning, pp. 382-393 Chemnitz, Germany.
- (1998) Proceedings of the Tenth European Conference on Machine Learning , pp. 382-393
- Precup, D.¹ Sutton, R.S.² Singh, S.³

123
- 0347999523
- CASSANDRA: Planning for contingencies
- Northwestern University, The Institute for the Learning Sciences
- Pryor, L., & Collins, G. (1993). CASSANDRA: Planning for contingencies. Technical report 41, Northwestern University, The Institute for the Learning Sciences.
- (1993) Technical Report 41
- Pryor, L.¹ Collins, G.²

124
- 0003998452
- John Wiley & Sons, New York
- Puterman, M. L. (1994). Markov Decision Processes. John Wiley & Sons, New York.
- (1994) Markov Decision Processes
- Puterman, M.L.¹

125
- 0037581251
- Modified policy iteration algorithms for discounted Markov decision problems
- Puterman, M. L., & Shin, M. (1978). Modified policy iteration algorithms for discounted Markov decision problems. Management Science, 24, 1127-1137.
- (1978) Management Science , vol.24 , pp. 1127-1137
- Puterman, M.L.¹ Shin, M.²

126
- 0001172487
- Multichain Markov decision processes with a sample-path constraint: A decomposition approach
- Ross, K. W., & Varadarajan, R. (1991). Multichain Markov decision processes with a sample-path constraint: A decomposition approach. Mathematics of Operations Research, 16(1), 195-207.
- (1991) Mathematics of Operations Research , vol.16 , Issue.1 , pp. 195-207
- Ross, K.W.¹ Varadarajan, R.²

127
- 0003584577
- Prentice Hall, Englewood Cliffs, NJ
- Russell, S., & Norvig, P. (1995). Artificial Intelligence: A Modern Approach. Prentice Hall, Englewood Cliffs, NJ.
- (1995) Artificial Intelligence: A Modern Approach
- Russell, S.¹ Norvig, P.²

128
- 0016069798
- Planning in a hierarchy of abstraction spaces
- Sacerdoti, E. D. (1974). Planning in a hierarchy of abstraction spaces. Artificial Intelligence, 5, 115-135.
- (1974) Artificial Intelligence , vol.5 , pp. 115-135
- Sacerdoti, E.D.¹

129
- 85125003135
- The nonlinear nature of plans
- Sacerdoti, E. D. (1975). The nonlinear nature of plans. In Proceedings of the Fourth International Joint Conference on Artificial Intelligence, pp. 206-214.
- (1975) Proceedings of the Fourth International Joint Conference on Artificial Intelligence , pp. 206-214
- Sacerdoti, E.D.¹

130
- 0001871991
- Universal plans for reactive robots in unpredictable environments
- Milan, Italy
- Schoppers, M. J. (1987). Universal plans for reactive robots in unpredictable environments. In Proceedings of the Tenth International Joint Conference on Artificial Intelligence, pp. 1039-1046 Milan, Italy.
- (1987) Proceedings of the Tenth International Joint Conference on Artificial Intelligence , pp. 1039-1046
- Schoppers, M.J.¹

131
- 85152626183
- A reinforcement learning method for maximizing undiscounted rewards
- Amherst, MA
- Schwartz, A. (1993). A reinforcement learning method for maximizing undiscounted rewards. In Proceedings of the Tenth International Conference on Machine Learning, pp. 298-305 Amherst, MA.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 298-305
- Schwartz, A.¹

132
- 0022059617
- Iterative aggregation-disaggregation procedures for discounted semi-Markov reward processes
- Schweitzer, P. L., Puterman, M. L., & Kindle, K. W. (1985). Iterative aggregation-disaggregation procedures for discounted semi-Markov reward processes. Operations Research, 33, 589-605.
- (1985) Operations Research , vol.33 , pp. 589-605
- Schweitzer, P.L.¹ Puterman, M.L.² Kindle, K.W.³

133
- 0022818911
- Evaluating influence diagrams
- Shachter, R. D. (1986). Evaluating influence diagrams. Operations Research, 34(6), 871-882.
- (1986) Operations Research , vol.34 , Issue.6 , pp. 871-882
- Shachter, R.D.¹

134
- 43949170056
- The role of relevance in explanation I: Irrelevance as statistical independence
- Shimony, S. E. (1993). The role of relevance in explanation I: Irrelevance as statistical independence. International Journal of Approximate Reasoning, 8(4), 281-324.
- (1993) International Journal of Approximate Reasoning , vol.8 , Issue.4 , pp. 281-324
- Shimony, S.E.¹

135
- 0005610003
- Probabilistic robot navigation in partially observable environments
- Montreal, Canada
- Simmons, R., & Koenig, S. (1995). Probabilistic robot navigation in partially observable environments. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 1080-1087 Montreal, Canada.
- (1995) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence , pp. 1080-1087
- Simmons, R.¹ Koenig, S.²

136
- 84899022377
- How to dynamically merge Markov decision processes
- MIT Press, Cambridge
- Singh, S. P., & Cohn, D. (1998). How to dynamically merge Markov decision processes. In Advances in Neural Information Processing Systems 10, pp. 1057-1063. MIT Press, Cambridge.
- (1998) Advances in Neural Information Processing Systems 10 , pp. 1057-1063
- Singh, S.P.¹ Cohn, D.²

137
- 85153965130
- Reinforcement learning with soft state aggregation
- Hanson, S. J., Cowan, J. D., & Giles, C. L. (Eds.). Morgan-Kaufmann, San Mateo
- Singh, S. P., Jaakkola, T., & Jordan, M. I. (1994). Reinforcement learning with soft state aggregation. In Hanson, S. J., Cowan, J. D., & Giles, C. L. (Eds.), Advances in Neural Information Processing Systems 7. Morgan-Kaufmann, San Mateo.
- (1994) Advances in Neural Information Processing Systems 7
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

138
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Smallwood, R. D., & Sondik, E. J. (1973). The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21, 1071-1088.
- (1973) Operations Research , vol.21 , pp. 1071-1088
- Smallwood, R.D.¹ Sondik, E.J.²

139
- 0027709265
- Postponing threats in partial-order planning
- Washington, DC
- Smith, D., & Peot, M. (1993). Postponing threats in partial-order planning. In Proceedings of the Eleventh National Conference on Artificial Intelligence, pp. 500-506 Washington, DC.
- (1993) Proceedings of the Eleventh National Conference on Artificial Intelligence , pp. 500-506
- Smith, D.¹ Peot, M.²

140
- 0017943242
- The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
- Sondik, E. J. (1978). The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research, 26, 282-304.
- (1978) Operations Research , vol.26 , pp. 282-304
- Sondik, E.J.¹

141
- 0003328519
- Team-partitioned, opaque-transition reinforcement learning
- Asada, M. (Ed.). Springer Verlag, Berlin
- Stone, P., & Veloso, M. (1999). Team-partitioned, opaque-transition reinforcement learning. In Asada, M. (Ed.), RoboCup-98: Robot Soccer World Cup II. Springer Verlag, Berlin.
- (1999) RoboCup-98: Robot Soccer World Cup II
- Stone, P.¹ Veloso, M.²

142
- 84922015064
- TD models: Modeling the world at a mixture of time scales
- Lake Tahoe, NV
- Sutton, R. S. (1995). TD models: Modeling the world at a mixture of time scales. In Proceedings of the Twelfth International Conference on Machine Learning, pp. 531-539 Lake Tahoe, NV.
- (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 531-539
- Sutton, R.S.¹

143
- 0004102479
- MIT Press, Cambridge, MA
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

144
- 0028576345
- Control strategies for a stochastic planner
- Seattle, WA
- Tash, J., & Russell, S. (1994). Control strategies for a stochastic planner. In Proceedings of the Twelfth National Conference on Artificial Intelligence, pp. 1079-1085 Seattle, WA.
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence , pp. 1079-1085
- Tash, J.¹ Russell, S.²

145
- 0025399873
- Dynamic programming and influence diagrams
- Tatman, J. A., & Shachter, R. D. (1990). Dynamic programming and influence diagrams. IEEE Transactions on Systems, Man, and Cybernetics, 20(2), 365-379.
- (1990) IEEE Transactions on Systems, Man, and Cybernetics , vol.20 , Issue.2 , pp. 365-379
- Tatman, J.A.¹ Shachter, R.D.²

146
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G. J. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6, 215-219.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.J.¹

147
- 0032044899
- A probabilistic approach to concurrent mapping and localization for mobile robots
- Thrun, S., Fox, D., & Burgard, W. (1998). A probabilistic approach to concurrent mapping and localization for mobile robots. Machine Learning, 31, 29-53.
- (1998) Machine Learning , vol.31 , pp. 29-53
- Thrun, S.¹ Fox, D.² Burgard, W.³

148
- 0000277836
- Finding structure in reinforcement learning
- Tesauro, G., Touretzky, D., & Leen, T. (Eds.), Cambridge, MA. MIT Press
- Thrun, S., & Schwartz, A. (1995). Finding structure in reinforcement learning. In Tesauro, G., Touretzky, D., & Leen, T. (Eds.), Advances in Neural Information Processing Systems 7 Cambridge, MA. MIT Press.
- (1995) Advances in Neural Information Processing Systems 7
- Thrun, S.¹ Schwartz, A.²

149
- 0010732426
- Generating conditional plans and programs
- University of Edinburgh
- Warren, D. (1976). Generating conditional plans and programs. In Proceedings of AISB Summer Conference, pp. 344-354 University of Edinburgh.
- (1976) Proceedings of AISB Summer Conference , pp. 344-354
- Warren, D.¹

150
- 34249833101
- Q-learning
- Watkins, C. J. C. H., & Dayan, P. (1992). Q-learning. Machine Learning, 8, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

151
- 0028750404
- An introduction to least commitment planning
- Weld, D. S. (1994). An introduction to least commitment planning. AI Magazine, Winter 1994, 27-61.
- (1994) AI Magazine , vol.WINTER 1994 , pp. 27-61
- Weld, D.S.¹

152
- 0024739631
- Solution procedures for partially observed Markov decision processes
- White III, C. C., & Scherer, W. T. (1989). Solution procedures for partially observed Markov decision processes. Operations Research, 37(5), 791-797.
- (1989) Operations Research , vol.37 , Issue.5 , pp. 791-797
- White III, C.C.¹ Scherer, W.T.²

153
- 0342732841
- Ph.D. thesis 96-06-03, University of Washington, Department of Computer Science and Engineering
- Williamson, M. (1996). A value-directed approach to planning. Ph.D. thesis 96-06-03, University of Washington, Department of Computer Science and Engineering.
- (1996) A Value-directed Approach to Planning
- Williamson, M.¹

154
- 85166968219
- Optimal planning with a goal-directed utility model
- Chicago, IL
- Williamson, M., & Hanks, S. (1994). Optimal planning with a goal-directed utility model. In Proceedings of the Second International Conference on AI Planning Systems, pp. 176-180 Chicago, IL.
- (1994) Proceedings of the Second International Conference on AI Planning Systems , pp. 176-180
- Williamson, M.¹ Hanks, S.²

155
- 0004294784
- Addison-Wesley, Reading, Massachusetts
- Winston, P. H. (1992). Artificial Intelligence, Third Edition. Addison-Wesley, Reading, Massachusetts.
- (1992) Artificial Intelligence, Third Edition
- Winston, P.H.¹

156
- 0003642199
- Springer Verlag
- Yang, Q. (1998). Intelligent Planning: A Decomposition and Abstraction Based Approach. Springer Verlag.
- (1998) Intelligent Planning: A Decomposition and Abstraction Based Approach
- Yang, Q.¹

157
- 85016628903
- A model approximation scheme for planning in partially observable stochastic domains
- Zhang, N. L., & Liu, W. (1997). A model approximation scheme for planning in partially observable stochastic domains. Journal of Artificial Intelligence Research, 7, 199-230.
- (1997) Journal of Artificial Intelligence Research , vol.7 , pp. 199-230
- Zhang, N.L.¹ Liu, W.²

158
- 0000049635
- Exploiting causal independence in Bayesian network inference
- Zhang, N. L., & Poole, D. (1996). Exploiting causal independence in Bayesian network inference. Journal of Artificial Intelligence Research, 5, 301-328.
- (1996) Journal of Artificial Intelligence Research , vol.5 , pp. 301-328
- Zhang, N.L.¹ Poole, D.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.