메뉴 건너뛰기




Volumn 53, Issue 5, 2005, Pages 780-798

Robust control of Markov decision processes with uncertain transition matrices

Author keywords

Dynamic programming: Markov; Finite state; Game theory; programming: convex; Robustness; statistics: estimation; Uncertainty

Indexed keywords

DYNAMIC PROGRAMMING: MARKOV; FINITE STATE; GAME THEORY; PROGRAMMING: CONVEX; ROBUSTNESS; STATISTICS: ESTIMATION; UNCERTAINTY;

EID: 14344250395     PISSN: 0030364X     EISSN: 15265463     Source Type: Journal    
DOI: 10.1287/opre.1050.0216     Document Type: Article
Times cited : (777)

References (27)
  • 1
    • 0026927427 scopus 로고
    • Perturbation and stability theory for Markov control problems
    • Abbad, M., J. A. Filar. 1992. Perturbation and stability theory for Markov control problems. IEEE Trans. Automatic Control 37 1415-1420.
    • (1992) IEEE Trans. Automatic Control , vol.37 , pp. 1415-1420
    • Abbad, M.1    Filar, J.A.2
  • 2
    • 0026923658 scopus 로고
    • Algorithms for singularly perturbed limiting average Markov control problems
    • Abbad, M., J. Filar, T. Bielecki. 1992. Algorithms for singularly perturbed limiting average Markov control problems. IEEE Trans. Automatic Control 37 1421-1425.
    • (1992) IEEE Trans. Automatic Control , vol.37 , pp. 1421-1425
    • Abbad, M.1    Filar, J.2    Bielecki, T.3
  • 3
    • 0000611954 scopus 로고
    • Zero-sum Markov games and worst-case optimal control of queueing systems
    • Special Issue on Optimization of Queueing Systems
    • Altman, E., A. Hordijk. 1994. Zero-sum Markov games and worst-case optimal control of queueing systems. QUESTA 21(Special Issue on Optimization of Queueing Systems) 415-447.
    • (1994) QUESTA , vol.21 , pp. 415-447
    • Altman, E.1    Hordijk, A.2
  • 5
    • 1942450194 scopus 로고    scopus 로고
    • Solving uncertain Markov decision problems
    • Robotics Institute, Carnegie Mellon University, Pittsburgh, PA
    • Bagnell, J., A. Ng, J. Schneider. 2001. Solving uncertain Markov decision problems. Technical report CMU-RI-TR-01-25, Robotics Institute, Carnegie Mellon University, Pittsburgh, PA.
    • (2001) Technical Report , vol.CMU-RI-TR-01-25
    • Bagnell, J.1    Ng, A.2    Schneider, J.3
  • 10
    • 0000780135 scopus 로고
    • Prior distributions on space of probability measures
    • Ferguson, T. 1974. Prior distributions on space of probability measures. Ann. Statist. 2(4) 615-629.
    • (1974) Ann. Statist. , vol.2 , Issue.4 , pp. 615-629
    • Ferguson, T.1
  • 12
    • 27344445607 scopus 로고    scopus 로고
    • Robust dynamic programming
    • Columbia University, New York
    • Iyengar, G. 2003. Robust dynamic programming. Technical report TR-2002-07, Columbia University, New York.
    • (2003) Technical Report , vol.TR-2002-07
    • Iyengar, G.1
  • 13
    • 27344438007 scopus 로고    scopus 로고
    • Markov decision processes with uncertain transition rates: Sensitivity and robust control
    • Department of ECE, Purdue University, West Lafayette, IN
    • Kalyanasundaram, S., E. Chong, N. Shroff. 2001. Markov decision processes with uncertain transition rates: Sensitivity and robust control. Technical report, Department of ECE, Purdue University, West Lafayette, IN.
    • (2001) Technical Report
    • Kalyanasundaram, S.1    Chong, E.2    Shroff, N.3
  • 17
    • 27344456988 scopus 로고    scopus 로고
    • Robust solution to the Markov decision processes with uncertain transition matrices
    • Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA
    • Nilim, A., L. El Ghaoui. 2002. Robust solution to the Markov decision processes with uncertain transition matrices. Technical report UCB/ERL M02/31, Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA.
    • (2002) Technical Report , vol.UCB-ERL M02-31
    • Nilim, A.1    El Ghaoui, L.2
  • 18
  • 19
    • 0038771347 scopus 로고
    • On zero sum stochastic games with general state space. I
    • Nowak, A. S. 1984. On zero sum stochastic games with general state space. I. Probab. Math. Statist. 4(1) 13-32.
    • (1984) Probab. Math. Statist. , vol.4 , Issue.1 , pp. 13-32
    • Nowak, A.S.1
  • 20
    • 0004256568 scopus 로고
    • Springer-Verlag, New York
    • Pitman, J. 1993. Probability. Springer-Verlag, New York.
    • (1993) Probability
    • Pitman, J.1
  • 23
    • 0015630091 scopus 로고
    • Markov decision processes with uncertain transition probabilities
    • Satia, J. K., R. L. Lave. 1973. Markov decision processes with uncertain transition probabilities. Oper. Res. 21(3) 728-740.
    • (1973) Oper. Res. , vol.21 , Issue.3 , pp. 728-740
    • Satia, J.K.1    Lave, R.L.2
  • 24
    • 0036592138 scopus 로고    scopus 로고
    • Minimax analysis of stochastic problems
    • Shapiro, A., A. J. Kleywegt. 2002. Minimax analysis of stochastic problems. Optim. Methods Software. 17(1) 523-592.
    • (2002) Optim. Methods Software , vol.17 , Issue.1 , pp. 523-592
    • Shapiro, A.1    Kleywegt, A.J.2
  • 26
    • 0028460403 scopus 로고
    • Markov decision processes with imprecise transition probabilities
    • White, C. C., H. K. Eldeib. 1994. Markov decision processes with imprecise transition probabilities. Oper. Res. 42(4) 739-749.
    • (1994) Oper. Res. , vol.42 , Issue.4 , pp. 739-749
    • White, C.C.1    Eldeib, H.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.