메뉴 건너뛰기




Volumn 2006, Issue , 2006, Pages 114-120

Solving factored MDPs with exponential-family transition models

Author keywords

[No Author keywords available]

Indexed keywords

HYBRID APPROXIMATE LINEAR PROGRAMMING (HALF); MARKOV DECISION PROCESSES (MDP); OPTIMAL VALUE FUNCTIONS; TRANSITION MODELS;

EID: 33746054938     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (24)
  • 2
    • 84968468700 scopus 로고
    • Polynomial approximation - A new computational technique in dynamic programming: Allocation processes
    • Bellman, R.; Kalaba, R.; and Kolkin, B. 1963. Polynomial approximation - a new computational technique in dynamic programming: Allocation processes. Mathematics of Computation 17(82): 155-161.
    • (1963) Mathematics of Computation , vol.17 , Issue.82 , pp. 155-161
    • Bellman, R.1    Kalaba, R.2    Kolkin, B.3
  • 3
    • 0003787146 scopus 로고
    • Princeton, NJ: Princeton University Press
    • Bellman, R. 1957. Dynamic Programming. Princeton, NJ: Princeton University Press.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 7
    • 0026206780 scopus 로고
    • An optimal one-way multigrid algorithm for discrete-time stochastic control
    • Chow, C.-S., and Tsitsiklis, J. 1991. An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Transactions on Automatic Control 36(8):898-914.
    • (1991) IEEE Transactions on Automatic Control , vol.36 , Issue.8 , pp. 898-914
    • Chow, C.-S.1    Tsitsiklis, J.2
  • 8
    • 0348090400 scopus 로고    scopus 로고
    • The linear programming approach to approximate dynamic programming
    • de Parias, D. P., and Van Roy, B. 2003. The linear programming approach to approximate dynamic programming. Operations Research 51(6):850-856.
    • (2003) Operations Research , vol.51 , Issue.6 , pp. 850-856
    • De Parias, D.P.1    Van Roy, B.2
  • 9
    • 5544258192 scopus 로고    scopus 로고
    • On constraint sampling for the linear programming approach to approximate dynamic programming
    • de Parias, D. P., and Van Roy, B. 2004. On constraint sampling for the linear programming approach to approximate dynamic programming. Mathematics of Operations Research 29(3):462-478.
    • (2004) Mathematics of Operations Research , vol.29 , Issue.3 , pp. 462-478
    • De Parias, D.P.1    Van Roy, B.2
  • 10
    • 84990553353 scopus 로고
    • A model for reasoning about persistence and causation
    • Dean, T., and Kanazawa, K. 1989. A model for reasoning about persistence and causation. Computational Intelligence 5:142-150.
    • (1989) Computational Intelligence , vol.5 , pp. 142-150
    • Dean, T.1    Kanazawa, K.2
  • 13
    • 84898970468 scopus 로고    scopus 로고
    • Linear program approximations for factored continuous-state Markov decision processes
    • Hauskrecht, M., and Kveton, B. 2004. Linear program approximations for factored continuous-state Markov decision processes. In Advances in Neural Information Processing Systems 16, 895-902.
    • (2004) Advances in Neural Information Processing Systems , vol.16 , pp. 895-902
    • Hauskrecht, M.1    Kveton, B.2
  • 17
    • 77957901577 scopus 로고    scopus 로고
    • Value function approximation with diffusion wavelets and Laplacian eigenfunctions
    • Mahadevan, S., and Maggioni, M. 2006. Value function approximation with diffusion wavelets and Laplacian eigenfunctions. In Advances in Neural Information Processing Systems 18, 843-850.
    • (2006) In Advances in Neural Information Processing Systems , vol.18 , pp. 843-850
    • Mahadevan, S.1    Maggioni, M.2
  • 19
    • 0036832953 scopus 로고    scopus 로고
    • Variable resolution discretization in optimal control
    • Munos, R., and Moore, A. 2002. Variable resolution discretization in optimal control. Machine Learning 49:291-323.
    • (2002) Machine Learning , vol.49 , pp. 291-323
    • Munos, R.1    Moore, A.2
  • 22
    • 0001509947 scopus 로고    scopus 로고
    • Using randomization to break the curse of dimensionality
    • Rust, J. 1997. Using randomization to break the curse of dimensionality. Econometrica 65(3):487-516.
    • (1997) Econometrica , vol.65 , Issue.3 , pp. 487-516
    • Rust, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.