SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Conference on Decision and Control

Volumn , Issue , 2012, Pages 6708-6715

Loss bounds for uncertain transition probabilities in Markov decision processes

(2) Mastin, Andrew a Jaillet, Patrick a

a Department of Electrical Engineering and Computer Science (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DYNAMIC PROGRAMMING; MARKOV PROCESSES;

FINITE HORIZONS; INFINITE HORIZONS; MARKOV DECISION PROCESSES; NON NEGATIVES; PRECOMPUTING; TOTAL VARIATION; TRANSITION PROBABILITIES; VALUE FUNCTIONS;

PROBABILITY DISTRIBUTIONS;

EID: 84874269643 PISSN: 07431546 EISSN: 25762370 Source Type: Conference Proceeding
DOI: 10.1109/CDC.2012.6426504 Document Type: Conference Paper

Times cited : (16)

References (22)

1
- 34547984629
- Massachusetts Institute of Technology, Tech. Rep. AD0417150, August
- E. A. Silver, "Markovian decision processes with uncertain transition probabilities or rewards," Massachusetts Institute of Technology, Tech. Rep. AD0417150, August 1963.
- (1963) Markovian Decision Processes with Uncertain Transition Probabilities or Rewards
- Silver, E.A.¹

2
- 0015630091
- Markovian decision processes with uncertain transition probabilities
- J. K. Satia and R. E. Lave Jr., "Markovian decision processes with uncertain transition probabilities," Operations Research, vol. 21, no. 3, pp. 728-740, 1973.
- (1973) Operations Research , vol.21 , Issue.3 , pp. 728-740
- Satia, J.K.¹ Lave Jr., R.E.²

3
- 79957825456
- Multilinear and integer programming for Markov decision processes with imprecise probabilities
- R. S. Filho and F. W. Trevizan, "Multilinear and integer programming for Markov decision processes with imprecise probabilities," in 5th International Symposium on Imprecise Probability: Theories and Applications, 2007.
- (2007) 5th International Symposium on Imprecise Probability: Theories and Applications
- Filho, R.S.¹ Trevizan, F.W.²

4
- 0032399375
- Controlled Markov set-chains with discounting
- M. Kurano, M. Hosaka, Y. Huang, and J. Song, "Controlled Markov set-chains with discounting." J. Appl. Probab., vol. 35, no. 2, pp. 293-302, 1998.
- (1998) J. Appl. Probab. , vol.35 , Issue.2 , pp. 293-302
- Kurano, M.¹ Hosaka, M.² Huang, Y.³ Song, J.⁴

5
- 14344250395
- Robust control of Markov decision processes with uncertain transition matrices
- September-October
- A. Nilim and L. El Ghaoui, "Robust control of Markov decision processes with uncertain transition matrices," Oper. Res., vol. 53, no. 5, pp. 780-798, September-October 2005.
- (2005) Oper. Res. , vol.53 , Issue.5 , pp. 780-798
- Nilim, A.¹ El Ghaoui, L.²

6
- 79953864311
- Efficient solutions to factored MDPs with imprecise transition probabilities
- K. V. Delgado, S. Sanner, and L. N. de Barros, "Efficient solutions to factored MDPs with imprecise transition probabilities," Artificial Intelligence, vol. 175, no. 910, pp. 1498 - 1527, 2011.
- (2011) Artificial Intelligence , vol.175 , Issue.910 , pp. 1498-1527
- Delgado, K.V.¹ Sanner, S.² De Barros, L.N.³

7
- 0028460403
- Markov decision processes with imprecise transition probabilities
- C. C. White III and H. K. Eldeib, "Markov decision processes with imprecise transition probabilities," Operations Research, vol. 42, pp. 739-749, 1994.
- (1994) Operations Research , vol.42 , pp. 739-749
- White III, C.C.¹ Eldeib, H.K.²

8
- 0034272032
- Bounded-parameter Markov decision processes
- R. Givan, S. Leach, and T. Dean, "Bounded-parameter Markov decision processes," Artificial Intelligence, vol. 122, pp. 71-109, 2000.
- (2000) Artificial Intelligence , vol.122 , pp. 71-109
- Givan, R.¹ Leach, S.² Dean, T.³

9
- 0023961231
- Sensitivity analysis in discrete dynamic programming
- February
- W. J. Hopp, "Sensitivity analysis in discrete dynamic programming," J. Optim. Theory Appl., vol. 56, pp. 257-269, February 1988.
- (1988) J. Optim. Theory Appl. , vol.56 , pp. 257-269
- Hopp, W.J.¹

10
- 0031272080
- How does the value function of a Markov decision process depend on the transition probabilities?
- November
- A. Müller, "How does the value function of a Markov decision process depend on the transition probabilities?" Math. Oper. Res., vol. 22, pp. 872-885, November 1997.
- (1997) Math. Oper. Res. , vol.22 , pp. 872-885
- Müller, A.¹

11
- 84874267664
- John Wiley & Sons, Inc.
- C. H. Tan and J. C. Hartman, Sensitivity Analysis and Dynamic Programming. John Wiley & Sons, Inc., 2010.
- (2010) Sensitivity Analysis and Dynamic Programming
- Tan, C.H.¹ Hartman, J.C.²

12
- 84855284034
- Sensitivity analysis in Markov decision processes with uncertain reward parameters
- - "Sensitivity analysis in Markov decision processes with uncertain reward parameters," Journal of Applied Probability, vol. 48, no. 4, pp. 954 - 967, 2011.
- (2011) Journal of Applied Probability , vol.48 , Issue.4 , pp. 954-967
- Tan, C.H.¹ Hartman, J.C.²

13
- 0028497385
- An upper bound on the loss from approximate optimal-value functions
- S. P. Singh and R. C. Yee, "An upper bound on the loss from approximate optimal-value functions," Machine Learning, vol. 16, no. 3, pp. 227-233, 1994.
- (1994) Machine Learning , vol.16 , Issue.3 , pp. 227-233
- Singh, S.P.¹ Yee, R.C.²

14
- 84880649215
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- M. Kearns, Y. Mansour, and A. Y. Ng, "A sparse sampling algorithm for near-optimal planning in large Markov decision processes," in Machine Learning, 1999, pp. 1324-1331.
- (1999) Machine Learning , pp. 1324-1331
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

15
- 84880899936
- Performance analysis of online anticipatory algorithms for large multistage stochastic integer programs
- San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
- L. Mercier and P. Van Hentenryck, "Performance analysis of online anticipatory algorithms for large multistage stochastic integer programs," in Proceedings of the 20th international joint conference on Artifical intelligence, ser. IJCAI'07. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 2007, pp. 1979-1984.
- (2007) Proceedings of the 20th International Joint Conference on Artifical Intelligence, Ser. IJCAI'07 , pp. 1979-1984
- Mercier, L.¹ Van Hentenryck, P.²

16
- 36348948748
- The MIT Press
- P. V. Hentenryck and R. Bent, Online Stochastic Combinatorial Optimization. The MIT Press, 2009.
- (2009) Online Stochastic Combinatorial Optimization
- Hentenryck, P.V.¹ Bent, R.²

17
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

18
- 1942516880
- Error bounds for approximate policy iteration
- R. Munos, "Error bounds for approximate policy iteration," in ICML, 2003, pp. 560-567.
- (2003) ICML , pp. 560-567
- Munos, R.¹

19
- 85162063395
- Error propagation for approximate policy and value iteration
- A. M. Farahmand, R. Munos, and C. Szepesvári, "Error propagation for approximate policy and value iteration," in NIPS, 2010, pp. 568-576.
- (2010) NIPS , pp. 568-576
- Farahmand, A.M.¹ Munos, R.² Szepesvári, C.³

20
- 47349092417
- John Wiley and Sons, Inc.
- W. B. Powell, Approximate Dynamic Programming. John Wiley and Sons, Inc., 2007.
- (2007) Approximate Dynamic Programming
- Powell, W.B.¹

21
- 0026838072
- Multilinear programming: Duality theories
- R. F. Drenick, "Multilinear programming: Duality theories," Journal of Optimization Theory and Applications, vol. 72, pp. 459-486, 1992.
- (1992) Journal of Optimization Theory and Applications , vol.72 , pp. 459-486
- Drenick, R.F.¹

22
- 0003565783
- 3rd ed. Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control, 3rd ed. Athena Scientific, 2007.
- (2007) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.