메뉴 건너뛰기




Volumn 47, Issue 3-4, 2006, Pages 273-293

Symmetric approximate linear programming for factored MDPs with application to constrained problems

Author keywords

Approximate linear programming; Constrained Markov problems; Dual LP; Markov decision processes; Primal LP formulation

Indexed keywords


EID: 33847310802     PISSN: 10122443     EISSN: None     Source Type: Journal    
DOI: 10.1007/s10472-006-9038-x     Document Type: Conference Paper
Times cited : (7)

References (25)
  • 1
    • 0009459044 scopus 로고    scopus 로고
    • Constrained Markov decision processes with total cost criteria: Occupation measures and primal LP. Methods Models
    • Altman, E.: Constrained Markov decision processes with total cost criteria: occupation measures and primal LP. Methods Models Oper. Res. 43(1), 45-72 (1996)
    • (1996) Oper. Res , vol.43 , Issue.1 , pp. 45-72
    • Altman, E.1
  • 2
    • 1942424978 scopus 로고    scopus 로고
    • Constrained Markov decision processes with total cost criteria: Lagrange approach and dual LP. Methods Models
    • Altman, E.: Constrained Markov decision processes with total cost criteria: Lagrange approach and dual LP. Methods Models Oper. Res. 48, 387-417 (1998)
    • (1998) Oper. Res , vol.48 , pp. 387-417
    • Altman, E.1
  • 3
    • 0000235370 scopus 로고    scopus 로고
    • Altman, E., Shwartz, A.: Adaptive control of constrained Markov chains: criteria and policies. Ann. Oper. Res., special issue on Markov Decision Processes 28, 101-134 (1991)
    • Altman, E., Shwartz, A.: Adaptive control of constrained Markov chains: criteria and policies. Ann. Oper. Res., special issue on Markov Decision Processes 28, 101-134 (1991)
  • 10
    • 0034248853 scopus 로고    scopus 로고
    • Boutilier, C., Dearden, R., Goldszmidt, M.: Stochastic dynamic programming with factored representations. Artif. Intell. 121(1,2), 49-107 (2000)
    • Boutilier, C., Dearden, R., Goldszmidt, M.: Stochastic dynamic programming with factored representations. Artif. Intell. 121(1,2), 49-107 (2000)
  • 11
    • 0348090400 scopus 로고    scopus 로고
    • The linear programming approach to approximate dynamic programming
    • de Farias, D.P., Van Roy, B.: The linear programming approach to approximate dynamic programming. Oper. Res. 51(6), 850-856 (2003)
    • (2003) Oper. Res , vol.51 , Issue.6 , pp. 850-856
    • de Farias, D.P.1    Van Roy, B.2
  • 12
    • 5544258192 scopus 로고    scopus 로고
    • On constraint sampling in the linear programming approach to approximate dynamic programming
    • de Parias, D.P., Van Roy, B.: On constraint sampling in the linear programming approach to approximate dynamic programming. Math. Oper. Res. 29(3), 462-478 (2004)
    • (2004) Math. Oper. Res , vol.29 , Issue.3 , pp. 462-478
    • de Parias, D.P.1    Van Roy, B.2
  • 13
    • 84990553353 scopus 로고
    • A model for reasoning about persistence and causation
    • Dean, T., Kanazawa, K.: A model for reasoning about persistence and causation. Comput. Intell. 5(3). 142-150 (1989)
    • (1989) Comput. Intell , vol.5 , Issue.3 , pp. 142-150
    • Dean, T.1    Kanazawa, K.2
  • 21
    • 0036923210 scopus 로고    scopus 로고
    • Piecewise linear value function approximation for factored MDPs
    • American Association for Artificial Intelligence, Menlo Park, CA
    • Poupart, P., Boutilier, C., Patrascu, R., Schuurmans, D.: Piecewise linear value function approximation for factored MDPs. In: Eighteenth national conference on Artificial Intelligence, pp. 292-299. American Association for Artificial Intelligence, Menlo Park, CA (2002)
    • (2002) Eighteenth national conference on Artificial Intelligence , pp. 292-299
    • Poupart, P.1    Boutilier, C.2    Patrascu, R.3    Schuurmans, D.4
  • 24
    • 0000273218 scopus 로고
    • Generalized polynomial approximations in Markovian decision processes
    • Schweitzer, P., Seidmann, A.: Generalized polynomial approximations in Markovian decision processes. J. Math. Anal. Appl. 110, 568-582 (1985)
    • (1985) J. Math. Anal. Appl , vol.110 , pp. 568-582
    • Schweitzer, P.1    Seidmann, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.