SCOPUS Information Retrieval Platform

Artificial Intelligence

Volume 121, Issue 1, 2000, Pages 49-107

Stochastic dynamic programming with factored representations

(3) Boutilier, Craig a Dearden, Richard b Goldszmidt, Moisés c

a UNIVERSITY OF TORONTO (Canada)

b UNIVERSITY OF BRITISH COLUMBIA (Canada)

c Stanford University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; COMPUTATIONAL COMPLEXITY; DECISION THEORY; DYNAMIC PROGRAMMING; MARKOV PROCESSES; MATHEMATICAL MODELS; PROBABILITY DISTRIBUTIONS; PROBLEM SOLVING; THEOREM PROVING; TREES (MATHEMATICS);

DYNAMIC BAYESIAN NETWORKS; DYNAMIC PROGRAMMING ALGORITHMS; MARKOV DECISION PROCESSES (MDP);

ARTIFICIAL INTELLIGENCE;

EID: 0034248853 PISSN: 00043702 EISSN: None Source Type: Journal
DOI: 10.1016/S0004-3702(00)00033-3 Document Type: Article

Times cited : (330)

References (84)

1
- 0031073475
- Locally weighted learning for control
- Atkeson C.G., Moore A.W., Schaal S. Locally weighted learning for control. Artificial Intelligence Review. Vol. 11:1997;75-113.
- (1997) Artificial Intelligence Review , vol.11 , pp. 75-113
- Atkeson, C.G.¹ Moore, A.W.² Schaal, S.³

2
- 0027880685
- Algebraic decision diagrams and their applications
- Bahar R.I., Frohm E.A., Gaona C.M., Hachtel G.D., Macii E., Pardo A., Somenzi F. Algebraic decision diagrams and their applications. Proc. International Conference on Computer-Aided Design. 1993;188-191.
- (1993) Proc. International Conference on Computer-Aided Design , pp. 188-191
- Bahar, R.I.¹ Frohm, E.A.² Gaona, C.M.³ Hachtel, G.D.⁴ Macii, E.⁵ Pardo, A.⁶ Somenzi, F.⁷

3
- 85012688561
- Princeton, NJ: Princeton University Press
- Bellman R.E. Dynamic Programming. 1957;Princeton University Press, Princeton, NJ.
- (1957) Dynamic Programming
- Bellman, R.E.¹

4
- 0024680419
- Adaptive aggregation for infinite horizon dynamic programming
- Bertsekas D.P., Castanon D.A. Adaptive aggregation for infinite horizon dynamic programming. IEEE Trans. Automat. Control. Vol. 34:1989;589-598.
- (1989) IEEE Trans. Automat. Control , vol.34 , pp. 589-598
- Bertsekas, D.P.¹ Castanon, D.A.²

5
- 0003565779
- Englewood Cliffs, NJ: Prentice-Hall
- Bertsekas D.P. Dynamic Programming: Deterministic and Stochastic Models. 1987;Prentice-Hall, Englewood Cliffs, NJ.
- (1987) Dynamic Programming: Deterministic and Stochastic Models
- Bertsekas, D.P.¹

6
- 0003487482
- Belmont, MA: Athena
- Bertsekas D.P., Tsitsiklis J.N. Neuro-Dynamic Programming. 1996;Athena, Belmont, MA.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

7
- 0028443644
- Trading accuracy for simplicity in decision trees
- Bohanic M., Bratko I. Trading accuracy for simplicity in decision trees. Machine Learning. Vol. 15:1994;223-250.
- (1994) Machine Learning , vol.15 , pp. 223-250
- Bohanic, M.¹ Bratko, I.²

8
- 0342843072
- Correlated action effects in decision theoretic regression
- Boutilier C. Correlated action effects in decision theoretic regression. Proc. 13th Conference on Uncertainty in Artificial Intelligence Providence, RI. 1997;30-37.
- (1997) Proc. 13th Conference on Uncertainty in Artificial Intelligence Providence, RI , pp. 30-37
- Boutilier, C.¹

9
- 84880685295
- Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning
- Boutilier C., Brafman R.I., Geib C. Prioritized goal decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning. Proc. IJCAI-97, Nagoya, Japan. 1997;1156-1162.
- (1997) Proc. IJCAI-97, Nagoya, Japan , pp. 1156-1162
- Boutilier, C.¹ Brafman, R.I.² Geib, C.³

10
- 0001811022
- Structured reachability analysis for Markov decision processes
- Boutilier C., Brafman R.I., Geib C. Structured reachability analysis for Markov decision processes. Proc. 14th Conference on Uncertainty in Artificial Intelligence, Madison, WI. 1998;24-32.
- (1998) Proc. 14th Conference on Uncertainty in Artificial Intelligence, Madison, WI , pp. 24-32
- Boutilier, C.¹ Brafman, R.I.² Geib, C.³

11
- 0346942368
- Decision theoretic planning: Structural assumptions and computational leverage
- Boutilier C., Dean T., Hanks S. Decision theoretic planning: Structural assumptions and computational leverage. J. Artificial Intelligence Res. Vol. 11:1999;1-94.
- (1999) J. Artificial Intelligence Res. , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

12
- 0028572333
- Using abstractions for decision-theoretic planning with time constraints
- Boutilier C., Dearden R. Using abstractions for decision-theoretic planning with time constraints. Proc. AAAI-94, Seattle, WA. 1994;1016-1022.
- (1994) Proc. AAAI-94, Seattle, WA , pp. 1016-1022
- Boutilier, C.¹ Dearden, R.²

13
- 0012352653
- Approximating value trees in structured dynamic programming
- Boutilier C., Dearden R. Approximating value trees in structured dynamic programming. Proc. 13th International Conference on Machine Learning, Bari, Italy. 1996;54-62.
- (1996) Proc. 13th International Conference on Machine Learning, Bari, Italy , pp. 54-62
- Boutilier, C.¹ Dearden, R.²

14
- 0000675721
- Context-specific independence in Bayesian networks
- Boutilier C., Friedman N., Goldszmidt M., Koller D. Context-specific independence in Bayesian networks. Proc. 12th Conference on Uncertainty in Artificial Intelligence, Portland, OR. 1996;115-123.
- (1996) Proc. 12th Conference on Uncertainty in Artificial Intelligence, Portland, OR , pp. 115-123
- Boutilier, C.¹ Friedman, N.² Goldszmidt, M.³ Koller, D.⁴

15
- 84957878011
- The frame problem and Bayesian network action representations
- Boutilier C., Goldszmidt M. The frame problem and Bayesian network action representations. Proc. 11th Biennial Canadian Conference on Artificial Intelligence, Toronto, Ontario. 1996;69-83.
- (1996) Proc. 11th Biennial Canadian Conference on Artificial Intelligence, Toronto, Ontario , pp. 69-83
- Boutilier, C.¹ Goldszmidt, M.²

16
- 0030349220
- Computing optimal policies for partially observable decision processes using compact representations
- Boutilier C., Poole D. Computing optimal policies for partially observable decision processes using compact representations. Proc. AAAI-96, Portland, OR. 1996;1168-1175.
- (1996) Proc. AAAI-96, Portland, OR , pp. 1168-1175
- Boutilier, C.¹ Poole, D.²

17
- 85168106990
- Process-oriented planning and average-reward optimality
- Boutilier C., Puterman M.L. Process-oriented planning and average-reward optimality. Proc. IJCAI-95, Montreal, Quebec. 1995;1096-1103.
- (1995) Proc. IJCAI-95, Montreal, Quebec , pp. 1096-1103
- Boutilier, C.¹ Puterman, M.L.²

18
- 0001133021
- Generalization in reinforcement learning: Safely approximating the value function
- G. Tesauro, D.S. Touretzky, & T.K. Leen. Cambridge, MA: MIT Press
- Boyan J.A., Moore A.W. Generalization in reinforcement learning: Safely approximating the value function. Tesauro G., Touretzky D.S., Leen T.K. Advances in Neural Information Processing Systems 7. 1995;MIT Press, Cambridge, MA.
- (1995) Advances in Neural Information Processing Systems 7
- Boyan, J.A.¹ Moore, A.W.²

19
- 0022769976
- Graph-based algorithms for Boolean function manipulation
- Bryant R.E. Graph-based algorithms for Boolean function manipulation. IEEE Trans. Comput. Vol. C-35:(8):1986;677-691.
- (1986) IEEE Trans. Comput. , vol.C-35 , Issue.8 , pp. 677-691
- Bryant, R.E.¹

20
- 0025595038
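- Symbolic model checking: 10^20 states and beyond
- Burch J.R., Clarke E.M., McMillan K.L., Dill D.L., Hwang L.J. Symbolic model checking: 10^20 states and beyond. Proc. Conference on Logic in Computer Science. 1990;428-439.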
- (1990) Proc. Conference on Logic in Computer Science , pp. 428-439
- Burch, J.R.¹ Clarke, E.M.² McMillan, K.L.³ Dill, D.L.⁴ Hwang, L.J.⁵

21
- 0028564629
- Acting optimally in partially observable stochastic domains
- Cassandra A.R., Kaelbling L.P., Littman M.L. Acting optimally in partially observable stochastic domains. Proc. AAAI-94, Seattle, WA. 1994;1023-1028.
- (1994) Proc. AAAI-94, Seattle, WA , pp. 1023-1028
- Cassandra, A.R.¹ Kaelbling, L.P.² Littman, M.L.³

22
- 0023381915
- Planning for conjunctive goals
- Chapman D. Planning for conjunctive goals. Artificial Intelligence. Vol. 32:(3):1987;333-377.
- (1987) Artificial Intelligence , vol.32 , Issue.3 , pp. 333-377
- Chapman, D.¹

23
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
- Chapman D., Kaelbling L.P. Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. Proc. IJCAI-91, Sydney, Australia. 1991;726-731.
- (1991) Proc. IJCAI-91, Sydney, Australia , pp. 726-731
- Chapman, D.¹ Kaelbling, L.P.²

24
- 0020900726
- Automatic verification of finite state concurrent systems using temporal logic specifications: A practical approach
- Clarke E.M., Emerson E.A., Sistla A.P. Automatic verification of finite state concurrent systems using temporal logic specifications: A practical approach. Proc. 10th ACM Symposium on Principles of Programming Languages, Austin, TX. 1983;117-126.
- (1983) Proc. 10th ACM Symposium on Principles of Programming Languages, Austin, TX , pp. 117-126
- Clarke, E.M.¹ Emerson, E.A.² Sistla, A.P.³

25
- 0009405929
- Action networks: A framework for reasoning about actions and change under uncertainty
- Darwiche A., Goldszmidt M. Action networks: A framework for reasoning about actions and change under uncertainty. Proc. 10th Conference on Uncertainty in Artificial Intelligence, Seattle, WA. 1994;136-144.
- (1994) Proc. 10th Conference on Uncertainty in Artificial Intelligence, Seattle, WA , pp. 136-144
- Darwiche, A.¹ Goldszmidt, M.²

26
- 0031370386
- Model minimization in Markov decision processes
- Dean T., Givan R. Model minimization in Markov decision processes. Proc. AAAI-97, Providence, RI. 1997;106-111.
- (1997) Proc. AAAI-97, Providence, RI , pp. 106-111
- Dean, T.¹ Givan, R.²

27
- 0000746330
- Model reduction techniques for computing approximately optimal solutions for Markov decision processes
- Dean T., Givan R., Leach S. Model reduction techniques for computing approximately optimal solutions for Markov decision processes. Proc. 13th Conference on Uncertainty in Artificial Intelligence, Providence, RI. 1997;124-131.
- (1997) Proc. 13th Conference on Uncertainty in Artificial Intelligence, Providence, RI , pp. 124-131
- Dean, T.¹ Givan, R.² Leach, S.³

28
- 0027708037
- Planning with deadlines in stochastic domains
- Dean T., Kaelbling L.P., Kirman J., Nicholson A. Planning with deadlines in stochastic domains. Proc. AAAI-93, Washington, DC. 1993;574-579.
- (1993) Proc. AAAI-93, Washington, DC , pp. 574-579
- Dean, T.¹ Kaelbling, L.P.² Kirman, J.³ Nicholson, A.⁴

29
- 84990553353
- A model for reasoning about persistence and causation
- Dean T., Kanazawa K. A model for reasoning about persistence and causation. Computational Intelligence. Vol. 5:(3):1989;142-150.
- (1989) Computational Intelligence , vol.5 , Issue.3 , pp. 142-150
- Dean, T.¹ Kanazawa, K.²

30
- 0030697013
- Abstraction and approximate decision theoretic planning
- Dearden R., Boutilier C. Abstraction and approximate decision theoretic planning. Artificial Intelligence. Vol. 89:1997;219-283.
- (1997) Artificial Intelligence , vol.89 , pp. 219-283
- Dearden, R.¹ Boutilier, C.²

31
- 0001806701
- The MAXQ method for hierarchical reinforcement learning
- Dietterich T.G. The MAXQ method for hierarchical reinforcement learning. Proc. 15th International Conference on Machine Learning, Madison, WI. 1998;118-126.
- (1998) Proc. 15th International Conference on Machine Learning, Madison, WI , pp. 118-126
- Dietterich, T.G.¹

32
- 0037919267
- Explanation-based learning and reinforcement learning: A unified approach
- Dietterich T.G., Flann N.S. Explanation-based learning and reinforcement learning: A unified approach. Proc. 12th International Conference on Machine Learning, Lake Tahoe, CA. 1995;176-184.
- (1995) Proc. 12th International Conference on Machine Learning, Lake Tahoe, CA , pp. 176-184
- Dietterich, T.G.¹ Flann, N.S.²

33
- 0031208987
- Explanation-based learning and reinforcement learning: A unified view
- Dietterich T.G., Flann N.S. Explanation-based learning and reinforcement learning: A unified view. Machine Learning. Vol. 28:(2):1997;169-210.
- (1997) Machine Learning , vol.28 , Issue.2 , pp. 169-210
- Dietterich, T.G.¹ Flann, N.S.²

34
- 0016543936
- Guarded commands, nondeterminacy and formal derivation of programs
- Dijkstra E.W. Guarded commands, nondeterminacy and formal derivation of programs. Comm. ACM. Vol. 18:(8):1975;453-457.
- (1975) Comm. ACM , vol.18 , Issue.8 , pp. 453-457
- Dijkstra, E.W.¹

35
- 0017628839
- Decision theory and artificial intelligence II: The hungry monkey
- Feldman J.A., Sproull R.F. Decision theory and artificial intelligence II: The hungry monkey. Cognitive Sci. Vol. 1:1977;158-192.
- (1977) Cognitive Sci. , vol.1 , pp. 158-192
- Feldman, J.A.¹ Sproull, R.F.²

36
- 2842560201
- STRIPS: A new approach to the application of theorem proving to problem solving
- Fikes R.E., Nilsson N.J. STRIPS: A new approach to the application of theorem proving to problem solving. Artificial Intelligence. Vol. 2:1971;189-208.
- (1971) Artificial Intelligence , vol.2 , pp. 189-208
- Fikes, R.E.¹ Nilsson, N.J.²

37
- 0343860991
- Multi-criteria reinforcement learning
- Gábor Z., Kalmár Z., Szepesvári C. Multi-criteria reinforcement learning. Proc. 15th International Conference on Machine Learning, Madison, WI. 1998;197-205.
- (1998) Proc. 15th International Conference on Machine Learning, Madison, WI , pp. 197-205
- Gábor, Z.¹ Kalmár, Z.² Szepesvári, C.³

38
- 0040731126
- Advances in probabilistic reasoning
- Geiger D., Heckerman D. Advances in probabilistic reasoning. Proc. 7th Conference on Uncertainty in Artificial Intelligence, Los Angeles, CA. 1991;118-126.
- (1991) Proc. 7th Conference on Uncertainty in Artificial Intelligence, Los Angeles, CA , pp. 118-126
- Geiger, D.¹ Heckerman, D.²

39
- 84880654869
- Model minimization, regression, and propositional STRIPS planning
- Givan R., Dean T. Model minimization, regression, and propositional STRIPS planning. Proc. IJCAI-97, Nagoya, Japan. 1997;1163-1168.
- (1997) Proc. IJCAI-97, Nagoya, Japan , pp. 1163-1168
- Givan, R.¹ Dean, T.²

40
- 0028400910
- Modeling a dynamic and uncertain world I: Symbolic and probabilistic reasoning about change
- Hanks S., McDermott D.V. Modeling a dynamic and uncertain world I: Symbolic and probabilistic reasoning about change. Artificial Intelligence. Vol. 66:1994;1-55.
- (1994) Artificial Intelligence , vol.66 , pp. 1-55
- Hanks, S.¹ McDermott, D.V.²

41
- 0038256822
- New Haven, CT: Yale University
- Hanks S.J. Projecting Plans for Uncertain Worlds, Ph.D. Thesis. 1990;Yale University, New Haven, CT.
- (1990) Projecting Plans for Uncertain Worlds, Ph.D. Thesis
- Hanks, S.J.¹

42
- 0003881270
- Englewood Cliffs, NJ: Prentice-Hall
- Hartmanis J., Stearns R.E. Algebraic Structure Theory of Sequential Machines. 1966;Prentice-Hall, Englewood Cliffs, NJ.
- (1966) Algebraic Structure Theory of Sequential Machines
- Hartmanis, J.¹ Stearns, R.E.²

43
- 0002956570
- SPUDD: Stochastic planning using decision diagrams
- Hoey J., St-Aubin R., Hu A., Boutilier C. SPUDD: Stochastic planning using decision diagrams. Proc. 15th Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden. 1999;279-288.
- (1999) Proc. 15th Conference on Uncertainty in Artificial Intelligence, Stockholm, Sweden , pp. 279-288
- Hoey, J.¹ St-Aubin, R.² Hu, A.³ Boutilier, C.⁴

44
- 0003644124
- Cambridge, MA: MIT Press
- Howard R.A. Dynamic Programming and Markov Processes. 1960;MIT Press, Cambridge, MA.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

45
- 0003863029
- R.A. Howard, & J.E. Matheson. Menlo Park, CA: Strategic Decision Group
- Howard R.A., Matheson J.E. Readings on the Principles and Applications of Decision Analysis. 1984;Strategic Decision Group, Menlo Park, CA.
- (1984) Readings on the Principles and Applications of Decision Analysis

46
- 0001815269
- Constructing optimal binary decision trees is NP-complete
- Hyafil L., Rivest R.L. Constructing optimal binary decision trees is NP-complete. Inform. Process. Lett. Vol. 5:1976;15-17.
- (1976) Inform. Process. Lett. , vol.5 , pp. 15-17
- Hyafil, L.¹ Rivest, R.L.²

47
- 0000908087
- Hierarchical reinforcement learning: Preliminary results
- Kaelbling L.P. Hierarchical reinforcement learning: Preliminary results. Proc. 10th International Conference on Machine Learning, Amherst, MA. 1993;167-173.
- (1993) Proc. 10th International Conference on Machine Learning, Amherst, MA , pp. 167-173
- Kaelbling, L.P.¹

48
- 0029679044
- Reinforcement learning: A survey
- Kaelbling L.P., Littman M.L., Moore A.W. Reinforcement learning: A survey. J. Artificial Intelligence Res. Vol. 4:1996;237-285.
- (1996) J. Artificial Intelligence Res. , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

49
- 0004001439
- New York: Wiley
- Keeney R.L., Raiffa H. Decisions with Multiple Objectives: Preferences and Value Trade-offs. 1976;Wiley, New York.
- (1976) Decisions with Multiple Objectives: Preferences and Value Trade-offs
- Keeney, R.L.¹ Raiffa, H.²

50
- 0029333536
- An algorithm for probabilistic planning
- Kushmerick N., Hanks S., Weld D. An algorithm for probabilistic planning. Artificial Intelligence. Vol. 76:1995;239-286.
- (1995) Artificial Intelligence , vol.76 , pp. 239-286
- Kushmerick, N.¹ Hanks, S.² Weld, D.³

51
- 0026998940
- Online minimization of transition systems
- Lee D., Yannakakis M. Online minimization of transition systems. Proc. 24th Annual ACM Symposium on the Theory of Computing (STOC-92), Victoria, BC. 1992;264-274.
- (1992) Proc. 24th Annual ACM Symposium on the Theory of Computing (STOC-92), Victoria, BC , pp. 264-274
- Lee, D.¹ Yannakakis, M.²

52
- 0003861655
- Providence, RI: Brown University, Department of Computer Science
- Littman M.L. Algorithms for sequential decision making, Ph.D. Thesis CS-96-09. 1996;Brown University, Department of Computer Science, Providence, RI.
- (1996) Algorithms for Sequential Decision Making, Ph.D. Thesis CS-96-09
- Littman, M.L.¹

53
- 0002679852
- A survey of algorithmic methods for partially observed Markov decision processes
- Lovejoy W.S. A survey of algorithmic methods for partially observed Markov decision processes. Ann. Oper. Res. Vol. 28:1991;47-66.
- (1991) Ann. Oper. Res. , vol.28 , pp. 47-66
- Lovejoy, W.S.¹

54
- 0002903880
- Systematic nonlinear planning
- McAllester D., Rosenblitt D. Systematic nonlinear planning. Proc. AAAI-91, Anaheim, CA. 1991;634-639.
- (1991) Proc. AAAI-91, Anaheim, CA , pp. 634-639
- McAllester, D.¹ Rosenblitt, D.²

55
- 0014638440
- Some philosophical problems from the standpoint of artificial intelligence
- B. Meltzer, & D. Michie. Edinburgh: Edinburgh University Press
- McCarthy J., Hayes P.J. Some philosophical problems from the standpoint of artificial intelligence. Meltzer B., Michie D. Machine Intelligence 4. 1969;463-502 Edinburgh University Press, Edinburgh.
- (1969) Machine Intelligence 4 , pp. 463-502
- McCarthy, J.¹ Hayes, P.J.²

56
- 0031632806
- Solving very large weakly coupled Markov decision processes
- Meuleau N., Hauskrecht M., Kim K.-E., Peshkin L., Kaelbling L.P., Dean T., Boutilier C. Solving very large weakly coupled Markov decision processes. Proc. AAAI-98, Madison, WI. 1998;165-172.
- (1998) Proc. AAAI-98, Madison, WI , pp. 165-172
- Meuleau, N.¹ Hauskrecht, M.² Kim, K.-E.³ Peshkin, L.⁴ Kaelbling, L.P.⁵ Dean, T.⁶ Boutilier, C.⁷

57
- 0003391330
- San Mateo, CA: Morgan Kaufmann
- Pearl J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. 1988;Morgan Kaufmann, San Mateo, CA.
- (1988) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
- Pearl, J.¹

58
- 0002489296
- UCPOP: A sound, complete, partial order planner for ADL
- Penberthy J.S., Weld D.S. UCPOP: A sound, complete, partial order planner for ADL. Proc. 3rd International Conference on Principles of Knowledge Representation and Reasoning (KR-92), Cambridge, MA. 1992;103-114.
- (1992) Proc. 3rd International Conference on Principles of Knowledge Representation and Reasoning (KR-92), Cambridge, MA , pp. 103-114
- Penberthy, J.S.¹ Weld, D.S.²

59
- 0027702434
- Probabilistic Horn abduction and Bayesian networks
- Poole D. Probabilistic Horn abduction and Bayesian networks. Artificial Intelligence. Vol. 64:(1):1993;81-129.
- (1993) Artificial Intelligence , vol.64 , Issue.1 , pp. 81-129
- Poole, D.¹

60
- 0031187203
- The independent choice logic for modelling multiple agents under uncertainty
- Poole D. The independent choice logic for modelling multiple agents under uncertainty. Artificial Intelligence. Vol. 94:(1-2):1997;7-56.
- (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 7-56
- Poole, D.¹

61
- 84957069070
- Theoretical results on reinforcement learning with temporally abstract behaviors
- Precup D., Sutton R.S., Singh S. Theoretical results on reinforcement learning with temporally abstract behaviors. Proc. 10th European Conference on Machine Learning, Chemnitz, Germany. 1998;382-393.
- (1998) Proc. 10th European Conference on Machine Learning, Chemnitz, Germany , pp. 382-393
- Precup, D.¹ Sutton, R.S.² Singh, S.³

62
- 85102627959
- New York: Wiley
- Puterman M.L. Markov Decision Processes: Discrete Stochastic Dynamic Programming. 1994;Wiley, New York.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

63
- 0037581251
- Modified policy iteration algorithms for discounted Markov decision problems
- Puterman M.L., Shin M.C. Modified policy iteration algorithms for discounted Markov decision problems. Management Science. Vol. 24:1978;1127-1137.
- (1978) Management Science , vol.24 , pp. 1127-1137
- Puterman, M.L.¹ Shin, M.C.²

64
- 0003500248
- San Mateo, CA: Morgan Kaufmann
- Quinlan J.R. C4.5: Programs for Machine Learning. 1993;Morgan Kaufmann, San Mateo, CA.
- (1993) C4.5: Programs for Machine Learning
- Quinlan, J.R.¹

65
- 1442267080
- Learning decision lists
- Rivest R.L. Learning decision lists. Machine Learning. Vol. 2:1987;229-246.
- (1987) Machine Learning , vol.2 , pp. 229-246
- Rivest, R.L.¹

66
- 85125003135
- The nonlinear nature of plans
- Sacerdoti E.D. The nonlinear nature of plans. Proc. IJCAI-75, Tbilisi, Georgia. 1975;206-214.
- (1975) Proc. IJCAI-75, Tbilisi, Georgia , pp. 206-214
- Sacerdoti, E.D.¹

67
- 0001871991
- Universal plans for reactive robots in unpredictable environments
- Schoppers M.J. Universal plans for reactive robots in unpredictable environments. Proc. IJCAI-87, Milan, Italy. 1987;1039-1046.
- (1987) Proc. IJCAI-87, Milan, Italy , pp. 1039-1046
- Schoppers, M.J.¹

68
- 0022059617
- Iterative aggregation-disaggregation procedures for discounted semi-Markov reward processes
- Schweitzer P.L., Puterman M.L., Kindle K.W. Iterative aggregation-disaggregation procedures for discounted semi-Markov reward processes. Oper. Res. Vol. 33:1985;589-605.
- (1985) Oper. Res. , vol.33 , pp. 589-605
- Schweitzer, P.L.¹ Puterman, M.L.² Kindle, K.W.³

69
- 0022818911
- Evaluating influence diagrams
- Shachter R.D. Evaluating influence diagrams. Oper. Res. Vol. 34:(6):1986;871-882.
- (1986) Oper. Res. , vol.34 , Issue.6 , pp. 871-882
- Shachter, R.D.¹

70
- 43949170056
- The role of relevance in explanation I: Irrelevance as statistical independence
- Shimony S.E. The role of relevance in explanation I: Irrelevance as statistical independence. Internat. J. Approx. Reason. Vol. 8:(4):1993;281-324.
- (1993) Internat. J. Approx. Reason. , vol.8 , Issue.4 , pp. 281-324
- Shimony, S.E.¹

71
- 84899022377
- How to dynamically merge Markov decision processes
- Cambridge, MA: MIT Press
- Singh S.P., Cohn D. How to dynamically merge Markov decision processes. Advances in Neural Information Processing Systems 10. 1998;1057-1063 MIT Press, Cambridge, MA.
- (1998) Advances in Neural Information Processing Systems 10 , pp. 1057-1063
- Singh, S.P.¹ Cohn, D.²

72
- 0001027894
- Transfer of learning by composing solutions of elemental sequential tasks
- Singh S.P. Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning. Vol. 8:1992;323-339.
- (1992) Machine Learning , vol.8 , pp. 323-339
- Singh, S.P.¹

73
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Smallwood R.D., Sondik E.J. The optimal control of partially observable Markov processes over a finite horizon. Oper. Res. Vol. 21:1973;1071-1088.
- (1973) Oper. Res. , vol.21 , pp. 1071-1088
- Smallwood, R.D.¹ Sondik, E.J.²

74
- 0027561028
- Structuring conditional relationships in influence diagrams
- Smith J.E., Holtzman S., Matheson J.E. Structuring conditional relationships in influence diagrams. Oper. Res. Vol. 41:(2):1993;280-297.
- (1993) Oper. Res. , vol.41 , Issue.2 , pp. 280-297
- Smith, J.E.¹ Holtzman, S.² Matheson, J.E.³

75
- 0017943242
- The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs
- Sondik E.J. The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Oper. Res. Vol. 26:1978;282-304.
- (1978) Oper. Res. , vol.26 , pp. 282-304
- Sondik, E.J.¹

76
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Sutton R.S. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proc. 7th International Conference on Machine Learning, Austin, TX. 1990;216-224.
- (1990) Proc. 7th International Conference on Machine Learning, Austin, TX , pp. 216-224
- Sutton, R.S.¹

77
- 0004102479
- Cambridge, MA: MIT Press
- Sutton R.S., Barto A.G. Reinforcement Learning: An Introduction. 1998;MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

78
- 0028576345
- Control strategies for a stochastic planner
- Tash J., Russell S. Control strategies for a stochastic planner. Proc. AAAI-94, Seattle, WA. 1994;1079-1085.
- (1994) Proc. AAAI-94, Seattle, WA , pp. 1079-1085
- Tash, J.¹ Russell, S.²

79
- 0025399873
- Dynamic programming and influence diagrams
- Tatman J.A., Shachter R.D. Dynamic programming and influence diagrams. IEEE Trans. Systems Man Cybernet. Vol. 20:(2):1990;365-379.
- (1990) IEEE Trans. Systems Man Cybernet. , vol.20 , Issue.2 , pp. 365-379
- Tatman, J.A.¹ Shachter, R.D.²

80
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro G.J. TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation. Vol. 6:1994;215-219.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.J.¹

81
- 0029752470
- Feature-based methods for large scale dynamic programming
- Tsitsiklis J.N., Van Roy B. Feature-based methods for large scale dynamic programming. Machine Learning. Vol. 22:1996;59-94.
- (1996) Machine Learning , vol.22 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

82
- 0003448318
- University of Massachusetts
- Utgoff P.E. Decision tree induction based on efficient tree restructuring, Technical Report 95-18. 1995;University of Massachusetts.
- (1995) Decision Tree Induction Based on Efficient Tree Restructuring, Technical Report 95-18
- Utgoff, P.E.¹

83
- 0042586698
- Achieving several goals simultaneously
- E. Elcock, & D. Michie. Chichester, England: Ellis Horwood
- Waldinger R. Achieving several goals simultaneously. Elcock E., Michie D. Machine Intelligence 8: Machine Representations of Knowledge. 1977;94-136 Ellis Horwood, Chichester, England.
- (1977) Machine Intelligence 8: Machine Representations of Knowledge , pp. 94-136
- Waldinger, R.¹

84
- 34249833101
- Q-learning
- Watkins C.J.C.H., Dayan P. Q-learning. Machine Learning. Vol. 8:1992;279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.