SCOPUS Information Search Platform

IJCAI International Joint Conference on Artificial Intelligence

Volume 2, 1999, Pages 1332-1339

Computing factored value functions for policies in structured MDPs

Author keywords

[No Author keywords available]

Indexed keywords

ADDITIVE FUNCTION; ADDITIVE VALUE FUNCTIONS; COMPACT REPRESENTATION; DYNAMIC BAYESIAN NETWORKS; LINEAR FUNCTIONS; MARKOV DECISION PROCESSES; PROCESS DESCRIPTIONS; VALUE DETERMINATION;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; MARKOV PROCESSES;

DYNAMIC PROGRAMMING;

EID: 84880688552 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited: 124

References (17)

1
- 0002546896
- Graphical models for preference and utility
- F. Bacchus and A. Grove. Graphical models for preference and utility. In Proc. UAI, 1995.
- Proc. UAI, 1995
- Bacchus, F.¹ Grove, A.²

2
- 0012352653
- Approximating value trees in structured dynamic programming
- C. Boutilier and R. Dearden. Approximating value trees in structured dynamic programming. In Proc. ICML, pages 54-62, 1996.
- (1996) Proc. ICML , pp. 54-62
- Boutilier, C.¹ Dearden, R.²

3
- 0001811022
- Prioritized goal decomposition of Markov decision processes: Towards a synthesis of classical and decision theoretic planning
- C. Boutilier, R.I. Brafman, and C. Geib. Prioritized goal decomposition of Markov decision processes: Towards a synthesis of classical and decision theoretic planning. In Proc. UAI, pages 24-32, 1998.
- (1998) Proc. UAI , pp. 24-32
- Boutilier, C.¹ Brafman, R.I.² Geib, C.³

4
- 0346942368
- Decision theoretic planning: Structural assumptions and computational leverage
- C. Boutilier, T. Dean, and S. Hanks. Decision theoretic planning: Structural assumptions and computational leverage. Journal of Artificial Intelligence Research, 1999.
- (1999) Journal of Artificial Intelligence Research
- Boutilier, C.¹ Dean, T.² Hanks, S.³

5
- 84880686930
- Tractable inference for complex stochastic processes
- X. Boyen and D. Koller. Tractable inference for complex stochastic processes. In Proc. UAI, 1998.
- Proc. UAI, 1998
- Boyen, X.¹ Koller, D.²

6
- 0000746330
- Model reduction techniques for computing approximately optimal solutions for Markov decision processes
- T. Dean, R. Givan, and S. Leach. Model reduction techniques for computing approximately optimal solutions for Markov decision processes. In Proc. UAI, 1997.
- Proc. UAI, 1997
- Dean, T.¹ Givan, R.² Leach, S.³

7
- 84880694195
- Stable function approximation in dynamic programming
- G.J. Gordon. Stable function approximation in dynamic programming. In Proc. ICML, pages 261-268, 1995.
- (1995) Proc. ICML , pp. 261-268
- Gordon, G.J.¹

8
- 0000086731
- Influence diagrams
- Strategic Decisions Group
- R.A. Howard and J.E. Matheson. Influence diagrams. In Readings on the Principles and Applications of Decision Analysis, pages 721-762. Strategic Decisions Group, 1984.
- (1984) Readings on the Principles and Applications of Decision Analysis , pp. 721-762
- Howard, R.A.¹ Matheson, J.E.²

9
- 0004001439
- Wiley
- R.L. Keeney and H. Raiffa. Decisions with Multiple Objectives: Preferences and Value Tradeoffs. Wiley, 1976.
- (1976) Decisions with Multiple Objectives: Preferences and Value Tradeoffs
- Keeney, R.L.¹ Raiffa, H.²

10
- 0031632806
- Solving very large weakly coupled Markov decision processes
- N. Meuleau, M. Hauskrecht, K-E. Kim, L. Peshkin, L.P. Kaelbling, T. Dean, and C. Boutilier. Solving very large weakly coupled Markov decision processes. In Proc. AAAI, pages 165-172, 1998.
- (1998) Proc. AAAI , pp. 165-172
- Meuleau, N.¹ Hauskrecht, M.² Kim, K.-E.³ Peshkin, L.⁴ Kaelbling, L.P.⁵ Dean, T.⁶ Boutilier, C.⁷

11
- 0001577708
- The adjoint Markoff process
- E. Nelson. The adjoint Markoff process. Duke Mathematical Journal, 25, 1958.
- (1958) Duke Mathematical Journal , vol.25
- Nelson, E.¹

12
- 84899022377
- How to dynamically merge Markov decision processes
- S.P. Singh and D. Cohn. How to dynamically merge Markov decision processes. In NIPS 10, pages 1057-1063, 1998.
- (1998) NIPS 10 , pp. 1057-1063
- Singh, S.P.¹ Cohn, D.²

13
- 0003449348
- Academic Press
- G. Strang. Linear Algebra and Its Applications. Academic Press, 1980.
- (1980) Linear Algebra and Its Applications
- Strang, G.¹

14
- 0000672258
- Improved switching among temporally abstract actions
- To appear
- R.S. Sutton, S. Singh, D. Precup, and B. Ravindran. Improved switching among temporally abstract actions. In NIPS 12, 1999. To appear.
- (1999) NIPS 12
- Sutton, R.S.¹ Singh, S.² Precup, D.³ Ravindran, B.⁴

15
- 0002313852
- Scaling up average reward reinforcement learning by approximating the domain models and the value function
- P. Tadepalli and D. Ok. Scaling up average reward reinforcement learning by approximating the domain models and the value function. In Proc. ICML, 1996.
- Proc. ICML, 1996
- Tadepalli, P.¹ Ok, D.²

16
- 0029752470
- Feature-based methods for large scale dynamic programming
- J.N. Tsitsiklis and B. Van Roy. Feature-based methods for large scale dynamic programming. Machine Learning, 22(1):59-94, January 1996. (Pubitemid 126724363)
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

17
- 0003787427
- PhD thesis, Massachusetts Institute of Technology
- B. Van Roy. Learning and Value Function Approximation in Complex Decision Problems. PhD thesis, Massachusetts Institute of Technology, 1998.
- (1998) Learning and Value Function Approximation in Complex Decision Problems
- Van Roy, B.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.