메뉴 건너뛰기




Volumn 37, Issue 2, 2012, Pages 288-300

Distributionally robust markov decision processes

Author keywords

Distributional robustness; Markov decision process; Parameter uncertainty

Indexed keywords

CONFIDENCE LEVELS; CONFIDENCE REGION; DECISION CRITERIONS; DECISION MAKERS; MARKOV DECISION PROCESSES; NESTED SETS; OPTIMAL STRATEGIES; PARAMETER UNCERTAINTY; POLYNOMIAL-TIME; PRIORI INFORMATION; PROBABILISTIC GUARANTEES; ROBUST STRATEGY; TECHNICAL CONDITIONS; UNKNOWN PARAMETERS;

EID: 84861386911     PISSN: 0364765X     EISSN: 15265471     Source Type: Journal    
DOI: 10.1287/moor.1120.0540     Document Type: Article
Times cited : (98)

References (33)
  • 1
    • 0026927427 scopus 로고
    • Perturbation and stability theory for Markov control problems
    • DOI 10.1109/9.159584
    • Abbad M, Filar J (1992) Perturbation and stability theory for Markov control problems. IEEE Trans. Automatic Control 37(9):1415-1420. (Pubitemid 23555846)
    • (1992) IEEE Transactions on Automatic Control , vol.37 , Issue.9 , pp. 1415-1420
    • Abbad Mohammed1    Filar Jerzy, A.2
  • 4
    • 1942450194 scopus 로고    scopus 로고
    • Solving uncertain Markov decision problems
    • Carnegie Mellon University, Pittsburgh
    • Bagnell A, Ng A, Schneider J (2001) Solving uncertain Markov decision problems. Technical Report CMU-RI-TR-01-25, Carnegie Mellon University, Pittsburgh.
    • (2001) Technical Report CMU-RI-TR 01-25
    • Bagnell, A.1    Ng, A.2    Schneider, J.3
  • 5
    • 0004201621 scopus 로고    scopus 로고
    • Cambridge University Press, New York
    • Baron J (2000) Thinking and Deciding (Cambridge University Press, New York).
    • (2000) Thinking and Deciding
    • Baron, J.1
  • 6
    • 0032664938 scopus 로고    scopus 로고
    • Robust solutions of uncertain linear programs
    • Ben-Tal A, Nemirovski A (1999) Robust solutions of uncertain linear programs. Oper. Res. Lett. 25(1):1-13.
    • (1999) Oper. Res. Lett. , vol.25 , Issue.1 , pp. 1-13
    • Ben-Tal, A.1    Nemirovski, A.2
  • 10
    • 33845679809 scopus 로고    scopus 로고
    • On distributionally robust chance-constrained linear programs
    • DOI 10.1007/s10957-006-9084-x
    • Calafiore G, El Ghaoui L (2006) On distributionally robust chance-constrained linear programs. J. Optimization Theory and Appl. 130(1):1-22. (Pubitemid 44951197)
    • (2006) Journal of Optimization Theory and Applications , vol.130 , Issue.1 , pp. 1-22
    • Calafiore, G.C.1    Ghaoui, L.E.2
  • 11
    • 77249117255 scopus 로고    scopus 로고
    • Percentile optimization for Markov decision processes with parameter uncertainty
    • Delage E, Mannor S (2010) Percentile optimization for Markov decision processes with parameter uncertainty. Oper. Res. 58(1):203-213.
    • (2010) Oper. Res. , vol.58 , Issue.1 , pp. 203-213
    • Delage, E.1    Mannor, S.2
  • 12
    • 77953562445 scopus 로고    scopus 로고
    • Distributionally robust optimization under moment uncertainty with applications to data-driven problems
    • Delage E, Ye Y (2010) Distributionally robust optimization under moment uncertainty with applications to data-driven problems. Oper. Res. 58(3):596-612.
    • (2010) Oper. Res. , vol.58 , Issue.3 , pp. 596-612
    • Delage, E.1    Ye, Y.2
  • 13
    • 0019541031 scopus 로고
    • Optimal control of markov chains admitting strong and weak interactions
    • DOI 10.1016/0005-1098(81)90047-9
    • Delebecque F, Quadrat JP (1981) Optimal control of Markov chains admitting strong and weak interactions. Automatica 17(2):281-296. (Pubitemid 11504180)
    • (1981) Automatica , vol.17 , Issue.2 , pp. 281-296
    • Delebecque Francois1    Quadrat Jean Pierre2
  • 14
    • 0005093698 scopus 로고
    • The minimax approach to stochastic programming and an illustrative application
    • Dupacová J (1987) The minimax approach to stochastic programming and an illustrative application. Stochastics 20:73-88.
    • (1987) Stochastics , vol.20 , pp. 73-88
    • Dupacová, J.1
  • 15
    • 0041707484 scopus 로고
    • Elimination of randomization in certain statistical decision procedures and zero-sum two-person games
    • Dvoretzky A, Wald A, Wolfowitz J (1951) Elimination of randomization in certain statistical decision procedures and zero-sum two-person games. Ann. Math. Statist. 22(1):1-21.
    • (1951) Ann. Math. Statist. , vol.22 , Issue.1 , pp. 1-21
    • Dvoretzky, A.1    Wald, A.2    Wolfowitz, J.3
  • 16
    • 34548536224 scopus 로고    scopus 로고
    • Learning under ambiguity
    • DOI 10.1111/j.1467-937X.2007.00464.x
    • Epstein LG, Schneider M (2007) Learning under ambiguity. Rev. Econom. Stud. 74(4):1275-1303. (Pubitemid 47382376)
    • (2007) Review of Economic Studies , vol.74 , Issue.4 , pp. 1275-1303
    • Epstein, L.G.1    Schneider, M.2
  • 17
    • 0001266334 scopus 로고
    • Maxmin expected utility with a non-unique prior
    • Gilboa I, Schmeidler D (1989) Maxmin expected utility with a non-unique prior. J. Math. Econom. 18(2):141-153.
    • (1989) J. Math. Econom. , vol.18 , Issue.2 , pp. 141-153
    • Gilboa, I.1    Schmeidler, D.2
  • 18
    • 77955897551 scopus 로고    scopus 로고
    • Distributionally robust optimization and its tractable approximations
    • Goh J, Sim M (2010) Distributionally robust optimization and its tractable approximations. Oper. Res. 58(4):902-917.
    • (2010) Oper. Res. , vol.58 , Issue.4 , pp. 902-917
    • Goh, J.1    Sim, M.2
  • 20
    • 25444493818 scopus 로고    scopus 로고
    • Robust dynamic programming
    • DOI 10.1287/moor.1040.0129
    • Iyengar GN (2005) Robust dynamic programming. Math. Oper. Res. 30(2):257-280. (Pubitemid 43126832)
    • (2005) Mathematics of Operations Research , vol.30 , Issue.2 , pp. 257-280
    • Iyengar, G.N.1
  • 21
    • 85153140079 scopus 로고
    • Stochastic programming with recourse: Upper bounds and moment problems, a review
    • Academie Verlag, Berlin
    • Kall P (1988) Stochastic programming with recourse: Upper bounds and moment problems, a review. Advances in Mathematical Optimization (Academie-Verlag, Berlin), 86-103.
    • (1988) Advances in Mathematical Optimization , pp. 86-103
    • Kall, P.1
  • 22
    • 0008939363 scopus 로고
    • The theory of infinite games
    • Karlin S (1953) The theory of infinite games. Ann. Math. 58(2):371-401.
    • (1953) Ann. Math. , vol.58 , Issue.2 , pp. 371-401
    • Karlin, S.1
  • 23
    • 0011619060 scopus 로고
    • Maxmin expected utility and weight of evidence
    • Kelsey D (1994) Maxmin expected utility and weight of evidence. Oxford Econom. Papers 46(3):425-444.
    • (1994) Oxford Econom. Papers , vol.46 , Issue.3 , pp. 425-444
    • Kelsey, D.1
  • 24
    • 33847336943 scopus 로고    scopus 로고
    • Bias and variance approximation in value function estimates
    • DOI 10.1287/mnsc.1060.0614
    • Mannor S, Simester D, Sun P, Tsitsiklis JN (2007) Bias and variance approximation in value function estimates. Management Sci. 53(2):308-322. (Pubitemid 46326182)
    • (2007) Management Science , vol.53 , Issue.2 , pp. 308-322
    • Mannor, S.1    Simester, D.2    Sun, P.3    Tsitsiklis, J.N.4
  • 25
    • 14344250395 scopus 로고    scopus 로고
    • Robust control of Markov decision processes with uncertain transition matrices
    • DOI 10.1287/opre.1050.0216
    • Nilim A, El Ghaoui L (2005) Robust control of Markov decision processes with uncertain transition matrices. Oper. Res. 53(5):780-798. (Pubitemid 41525849)
    • (2005) Operations Research , vol.53 , Issue.5 , pp. 780-798
    • Nilim, A.1    Ghaoui, L.E.2
  • 26
    • 34247539626 scopus 로고    scopus 로고
    • Robust mean-covariance solutions for stochastic optimization
    • DOI 10.1287/opre.1060.0353
    • Popescu I (2007) Robust mean-covariance solutions for stochastic optimization. Oper. Res. 55(1):98-112. (Pubitemid 46663476)
    • (2007) Operations Research , vol.55 , Issue.1 , pp. 98-112
    • Popescu, I.1
  • 28
    • 0004267646 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Rockafellar RT (1970) Convex Analysis (Princeton University Press, Princeton, NJ).
    • (1970) Convex Analysis
    • Rockafellar, R.T.1
  • 29
    • 0002693448 scopus 로고
    • A min-max solution of an inventory problem
    • Arrow KJ, Karlin S, Scarf H, eds, Stanford University Press, Stanford, CA
    • Scarf H (1958) A min-max solution of an inventory problem. Arrow KJ, Karlin S, Scarf H, eds. Studies in Mathematical Theory of Inventory and Production (Stanford University Press, Stanford, CA), 201-209.
    • (1958) Studies in Mathematical Theory of Inventory and Production , pp. 201-209
    • Scarf, H.1
  • 30
    • 33644676235 scopus 로고    scopus 로고
    • Worst-case distribution analysis of stochastic programs
    • DOI 10.1007/s10107-005-0680-6
    • Shapiro A (2006) Worst-case distribution analysis of stochastic programs. Math. Programming 107(1):91-96. (Pubitemid 43334015)
    • (2006) Mathematical Programming , vol.107 , Issue.1-2 , pp. 91-96
    • Shapiro, A.1
  • 32
    • 84972513554 scopus 로고
    • On general minimax theorems
    • Sion M (1958) On general minimax theorems. Pacific J. Math. 8(1):171-176.
    • (1958) Pacific J. Math. , vol.8 , Issue.1 , pp. 171-176
    • Sion, M.1
  • 33
    • 0028460403 scopus 로고
    • Markov decision processes with imprecise transition probabilities
    • White III CC, El Deib HK (1992) Markov decision processes with imprecise transition probabilities. Oper. Res. 42(4):739-748.
    • (1992) Oper. Res. , vol.42 , Issue.4 , pp. 739-748
    • White Iii, C.C.1    El Deib, H.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.