SCOPUS Information Retrieval Platform

ICAPS 2006 - Proceedings, Sixteenth International Conference on Automated Planning and Scheduling

Volume 2006, 2006, Pages 114-120

Solving factored MDPs with exponential-family transition models

(2) Kveton, Branislav a Hauskrecht, Milos b

a University of Pittsburgh (United States)

b University of Pittsburgh (United States)

Author keywords

[No Author keywords available]

Indexed keywords

HYBRID APPROXIMATE LINEAR PROGRAMMING (HALP); MARKOV DECISION PROCESSES (MDP); OPTIMAL VALUE FUNCTIONS; TRANSITION MODELS;

APPROXIMATION THEORY; DISCRETE TIME CONTROL SYSTEMS; HYBRID COMPUTERS; LINEAR PROGRAMMING; MARKOV PROCESSES; MATHEMATICAL MODELS; OPTIMIZATION;

DECISION MAKING;

EID: 33746054938 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (24)

1
- 0037262814
- An introduction to MCMC for machine learning
- Andrieu, C.; de Freitas, N.; Doucet, A.; and Jordan, M. 2003. An introduction to MCMC for machine learning. Machine Learning 50:5-43.
- (2003) Machine Learning , vol.50 , pp. 5-43
- Andrieu, C.¹ De Freitas, N.² Doucet, A.³ Jordan, M.⁴

2
- 84968468700
- Polynomial approximation - A new computational technique in dynamic programming: Allocation processes
- Bellman, R.; Kalaba, R.; and Kotkin, B. 1963. Polynomial approximation - a new computational technique in dynamic programming: Allocation processes. Mathematics of Computation 17(82):155-161.
- (1963) Mathematics of Computation , vol.17 , Issue.82 , pp. 155-161
- Bellman, R.¹ Kalaba, R.² Kotkin, B.³

3
- 0003787146
- Princeton, NJ: Princeton University Press
- Bellman, R. 1957. Dynamic Programming. Princeton, NJ: Princeton University Press.
- (1957) Dynamic Programming
- Bellman, R.¹

4
- 0003487482
- Belmont, MA: Athena Scientific
- Bertsekas, D., and Tsitsiklis, J. 1996. Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

5
- 85166207010
- Exploiting structure in policy construction
- Boutilier, C.; Dearden, R.; and Goldszmidt, M. 1995. Exploiting structure in policy construction. In Proceedings of the 14th International Joint Conference on Artificial Intelligence, 1104-1111.
- (1995) Proceedings of the 14th International Joint Conference on Artificial Intelligence , pp. 1104-1111
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

6
- 3042524845
- Planning under continuous time and resource uncertainty: A challenge for AI
- Bresina, J.; Dearden, R.; Meuleau, N.; Ramakrishnan, S.; Smith, D.; and Washington, R. 2002. Planning under continuous time and resource uncertainty: A challenge for AI. In Proceedings of the 18th Conference on Uncertainty in Artificial Intelligence, 77-84.
- (2002) Proceedings of the 18th Conference on Uncertainty in Artificial Intelligence , pp. 77-84
- Bresina, J.¹ Dearden, R.² Meuleau, N.³ Ramakrishnan, S.⁴ Smith, D.⁵ Washington, R.⁶

7
- 0026206780
- An optimal one-way multigrid algorithm for discrete-time stochastic control
- Chow, C.-S., and Tsitsiklis, J. 1991. An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Transactions on Automatic Control 36(8):898-914.
- (1991) IEEE Transactions on Automatic Control , vol.36 , Issue.8 , pp. 898-914
- Chow, C.-S.¹ Tsitsiklis, J.²

8
- 0348090400
- The linear programming approach to approximate dynamic programming
- de Farias, D. P., and Van Roy, B. 2003. The linear programming approach to approximate dynamic programming. Operations Research 51(6):850-856.
- (2003) Operations Research , vol.51 , Issue.6 , pp. 850-856
- De Farias, D.P.¹ Van Roy, B.²

9
- 5544258192
- On constraint sampling for the linear programming approach to approximate dynamic programming
- de Farias, D. P., and Van Roy, B. 2004. On constraint sampling for the linear programming approach to approximate dynamic programming. Mathematics of Operations Research 29(3):462-478.
- (2004) Mathematics of Operations Research , vol.29 , Issue.3 , pp. 462-478
- De Farias, D.P.¹ Van Roy, B.²

10
- 84990553353
- A model for reasoning about persistence and causation
- Dean, T., and Kanazawa, K. 1989. A model for reasoning about persistence and causation. Computational Intelligence 5:142-150.
- (1989) Computational Intelligence , vol.5 , pp. 142-150
- Dean, T.¹ Kanazawa, K.²

11
- 29344460055
- Dynamic programming for structured continuous Markov decision problems
- Feng, Z.; Dearden, R.; Meuleau, N.; and Washington, R. 2004. Dynamic programming for structured continuous Markov decision problems. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, 154-161.
- (2004) Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence , pp. 154-161
- Feng, Z.¹ Dearden, R.² Meuleau, N.³ Washington, R.⁴

12
- 29344475738
- Solving factored MDPs with continuous and discrete variables
- Guestrin, C.; Hauskrecht, M.; and Kveton, B. 2004. Solving factored MDPs with continuous and discrete variables. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, 235-242.
- (2004) Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence , pp. 235-242
- Guestrin, C.¹ Hauskrecht, M.² Kveton, B.³

13
- 84898970468
- Linear program approximations for factored continuous-state Markov decision processes
- Hauskrecht, M., and Kveton, B. 2004. Linear program approximations for factored continuous-state Markov decision processes. In Advances in Neural Information Processing Systems 16, 895-902.
- (2004) Advances in Neural Information Processing Systems , vol.16 , pp. 895-902
- Hauskrecht, M.¹ Kveton, B.²

14
- 0003598496
- Cambridge, United Kingdom: Cambridge University Press
- Jeffreys, H., and Jeffreys, B. 1988. Methods of Mathematical Physics. Cambridge, United Kingdom: Cambridge University Press.
- (1988) Methods of Mathematical Physics
- Jeffreys, H.¹ Jeffreys, B.²

15
- 84880688552
- Computing factored value functions for policies in structured MDPs
- Koller, D., and Parr, R. 1999. Computing factored value functions for policies in structured MDPs. In Proceedings of the 16th International Joint Conference on Artificial Intelligence, 1332-1339.
- (1999) Proceedings of the 16th International Joint Conference on Artificial Intelligence , pp. 1332-1339
- Koller, D.¹ Parr, R.²

16
- 33746031635
- An MCMC approach to solving hybrid factored MDPs
- Kveton, B., and Hauskrecht, M. 2005. An MCMC approach to solving hybrid factored MDPs. In Proceedings of the 19th International Joint Conference on Artificial Intelligence, 1346-1351.
- (2005) Proceedings of the 19th International Joint Conference on Artificial Intelligence , pp. 1346-1351
- Kveton, B.¹ Hauskrecht, M.²

17
- 77957901577
- Value function approximation with diffusion wavelets and Laplacian eigenfunctions
- Mahadevan, S., and Maggioni, M. 2006. Value function approximation with diffusion wavelets and Laplacian eigenfunctions. In Advances in Neural Information Processing Systems 18, 843-850.
- (2006) Advances in Neural Information Processing Systems , vol.18 , pp. 843-850
- Mahadevan, S.¹ Maggioni, M.²

18
- 29344433509
- Samuel meets Amarel: Automating value function approximation using global state space analysis
- Mahadevan, S. 2005. Samuel meets Amarel: Automating value function approximation using global state space analysis. In Proceedings of the 20th National Conference on Artificial Intelligence, 1000-1005.
- (2005) Proceedings of the 20th National Conference on Artificial Intelligence , pp. 1000-1005
- Mahadevan, S.¹

19
- 0036832953
- Variable resolution discretization in optimal control
- Munos, R., and Moore, A. 2002. Variable resolution discretization in optimal control. Machine Learning 49:291-323.
- (2002) Machine Learning , vol.49 , pp. 291-323
- Munos, R.¹ Moore, A.²

20
- 0036927202
- Greedy linear value-approximation for factored Markov decision processes
- Patrascu, R.; Poupart, P.; Schuurmans, D.; Boutilier, C.; and Guestrin, C. 2002. Greedy linear value-approximation for factored Markov decision processes. In Proceedings of the 18th National Conference on Artificial Intelligence, 285-291.
- (2002) Proceedings of the 18th National Conference on Artificial Intelligence , pp. 285-291
- Patrascu, R.¹ Poupart, P.² Schuurmans, D.³ Boutilier, C.⁴ Guestrin, C.⁵

21
- 85102627959
- New York, NY: John Wiley & Sons
- Puterman, M. 1994. Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York, NY: John Wiley & Sons.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.¹

22
- 0001509947
- Using randomization to break the curse of dimensionality
- Rust, J. 1997. Using randomization to break the curse of dimensionality. Econometrica 65(3):487-516.
- (1997) Econometrica , vol.65 , Issue.3 , pp. 487-516
- Rust, J.¹

23
- 0000273218
- Generalized polynomial approximations in Markovian decision processes
- Schweitzer, P., and Seidmann, A. 1985. Generalized polynomial approximations in Markovian decision processes. Journal of Mathematical Analysis and Applications 110:568-582.
- (1985) Journal of Mathematical Analysis and Applications , vol.110 , pp. 568-582
- Schweitzer, P.¹ Seidmann, A.²

24
- 33746070253
- Ph.D. Dissertation, Massachusetts Institute of Technology
- Van Roy, B. 1998. Planning Under Uncertainty in Complex Structured Environments. Ph.D. Dissertation, Massachusetts Institute of Technology.
- (1998) Planning under Uncertainty in Complex Structured Environments
- Van Roy, B.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.