



Volume 141, 2006, Pages 590-594

Approximate linear-programming algorithms for graph-based Markov decision processes

Author keywords

[No Author keywords available]

Indexed keywords

GRAPHIC METHODS; LINEAR PROGRAMMING

EID: 84885992951     PISSN: 09226389     EISSN: 18798314     Source Type: Book Series    
DOI: None     Document Type: Article
Times cited : (22)

References (8)
  • 1
    • 0034248853
    • Stochastic dynamic programming with factored representations
    • C. Boutilier, R. Dearden, and M. Goldszmidt, 'Stochastic dynamic programming with factored representations', Artificial Intelligence, 121(1), 49-107, (2000).
    • (2000) Artificial Intelligence , vol.121 , Issue.1 , pp. 49-107
    • Boutilier, C.1    Dearden, R.2    Goldszmidt, M.3
  • 2
    • 33644980075
    • Controlled Markov fields with finite state space on graphs
    • R. K. Chornei, H. Daduna, and P. S. Knopov, 'Controlled Markov fields with finite state space on graphs', Stochastic Models, 21, 847-874, (2005).
    • (2005) Stochastic Models , vol.21 , pp. 847-874
    • Chornei, R.K.1    Daduna, H.2    Knopov, P.S.3
  • 3
    • 5544258192
    • On constraint sampling in the linear programming approach to approximate dynamic programming
    • D. P. de Farias and B. Van Roy, 'On constraint sampling in the linear programming approach to approximate dynamic programming', Math. of Op. Research, 29(3), 462-478, (2004).
    • (2004) Math. of Op. Research , vol.29 , Issue.3 , pp. 462-478
    • De Farias, D.P.1    Van Roy, B.2
  • 6
    • 80052922148
    • Mean field approximation of the policy iteration algorithm for graph-based MDP
    • N. Peyrard and R. Sabbadin, 'Mean field approximation of the policy iteration algorithm for graph-based MDP', in Proceedings ECAI'06, (2006).
    • (2006) Proceedings ECAI'06
    • Peyrard, N.1    Sabbadin, R.2
  • 8
    • 0012252296
    • Tight performance bounds on greedy policies based on imperfect value functions
    • College of Computer Science, Northeastern University, Boston, MA
    • R. J. Williams and L.C.I. Baird, 'Tight performance bounds on greedy policies based on imperfect value functions', Technical report, College of Computer Science, Northeastern University, Boston, MA, (1993).
    • (1993) Technical Report
    • Williams, R.J.1    Baird, L.C.I.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.