SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

ACM International Conference Proceeding Series

Volumn 227, Issue , 2007, Pages 225-232

Percentile optimization in uncertain Markov decision processes with application to efficient exploration

(2) Delage, Erick a Mannor, Shie b

a Stanford University (United States)

b MCGILL UNIVERSITY (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

DECISION MAKING; MARKOV PROCESSES; MATHEMATICAL MODELS; OPTIMIZATION; PARAMETER ESTIMATION; UNCERTAIN SYSTEMS;

EXPLORATION STRATEGY; PARAMETER UNCERTAINTY; PERCENTILE OPTIMIZATION;

DYNAMICAL SYSTEMS;

EID: 34547985785 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1273496.1273525 Document Type: Conference Paper

Times cited : (31)

References (19)

1
- 0032207223
- Robust convex optimization
- Ben-Tal, A., & Nemirovski, A. (1998). Robust convex optimization. Mathematics of Operations Research, 23, 769-805.
- (1998) Mathematics of Operations Research , vol.23 , pp. 769-805
- Ben-Tal, A.¹ Nemirovski, A.²

2
- 0041965975
- R-max - a general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R., & Tennenholtz, M. (2003). R-max - a general polynomial time algorithm for near-optimal reinforcement learning. J. of Machine Learning Research., 3, 213-231.
- (2003) J. of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.¹ Tennenholtz, M.²

3
- 33845679809
- On distributionally robust chance-constrained linear programs
- Calafiore, G., & El Ghaoui, L. (2006). On distributionally robust chance-constrained linear programs. Optimization Theory and Applications, 130, 1-22.
- (2006) Optimization Theory and Applications , vol.130 , pp. 1-22
- Calafiore, G.¹ El Ghaoui, L.²

4
- 0002395681
- Chance constrained programming
- Charnes, A., & Cooper, W. (1959). Chance constrained programming. Management Science, 6, 73-79.
- (1959) Management Science , vol.6 , pp. 73-79
- Charnes, A.¹ Cooper, W.²

5
- 1142281527
- Model-based Bayesian exploration
- Dearden, R., Friedman, N., & Andre, D. (1999). Model-based Bayesian exploration. Proc. of Uncertainty in AI (pp. 150-159).
- (1999) Proc. of Uncertainty in AI , pp. 150-159
- Dearden, R.¹ Friedman, N.² Andre, D.³

6
- 0029219995
- Percentile performance criteria for limiting average Markov control problems
- Filar, J., Krass, D., & Ross, K. (1995). Percentile performance criteria for limiting average Markov control problems. IEEE Trans, on Automatic Control, 40, 2-10.
- (1995) IEEE Trans, on Automatic Control , vol.40 , pp. 2-10
- Filar, J.¹ Krass, D.² Ross, K.³

7
- 0004012196
- second edition. Chapman & Hall/CRC
- Gelman, A., Carlin, J., Stern, H., & Rubin, D. (2003). Bayesian data analysis, second edition. Chapman & Hall/CRC.
- (2003) Bayesian data analysis
- Gelman, A.¹ Carlin, J.² Stern, H.³ Rubin, D.⁴

8
- 0034272032
- Boundedparameter Markov decision processes
- Givan, R., Leach, S., & Dean, T. (2000). Boundedparameter Markov decision processes. Artificial Intelligence, 122, 71-109.
- (2000) Artificial Intelligence , vol.122 , pp. 71-109
- Givan, R.¹ Leach, S.² Dean, T.³

9
- 84939003870
- Information value theory
- Howard, R. (1966). Information value theory. IEEE Trans. on Systems Science and Cybernetics, SSC-2, 22-26.
- (1966) IEEE Trans. on Systems Science and Cybernetics, SSC-2 , pp. 22-26
- Howard, R.¹

10
- 25444493818
- Robust dynamic programming
- Iyengar, G. (2005). Robust dynamic programming. Mathematics of Operations Research, 30, 257-280.
- (2005) Mathematics of Operations Research , vol.30 , pp. 257-280
- Iyengar, G.¹

11
- 0012257655
- Near-optimal reinforcement learning in polynomial time
- Kearns, M., & Singh, S. (1998). Near-optimal reinforcement learning in polynomial time. Proc. ICML (pp. 260-268).
- (1998) Proc. ICML , pp. 260-268
- Kearns, M.¹ Singh, S.²

12
- 0041940559
- Applications of second order cone programming
- Lobo, M., Vandenberghe, L., Boyd, S., & Lebret, H. (1998). Applications of second order cone programming. Linear Algebra and its App., 284, 193-228.
- (1998) Linear Algebra and its App , vol.284 , pp. 193-228
- Lobo, M.¹ Vandenberghe, L.² Boyd, S.³ Lebret, H.⁴

13
- 33847336943
- Bias and variance in value function estimation
- Mannor, S., Simester, D., Sun, P., & Tsitsiklis, J. (2007). Bias and variance in value function estimation. Management Science, 53, 308-322.
- (2007) Management Science , vol.53 , pp. 308-322
- Mannor, S.¹ Simester, D.² Sun, P.³ Tsitsiklis, J.⁴

14
- 36248992411
- Convex approximations of chance constrained programs
- Nemirovski, A., & Shapiro, A. (2006). Convex approximations of chance constrained programs. SIAM Journal on Optimization, 17, 969-996.
- (2006) SIAM Journal on Optimization , vol.17 , pp. 969-996
- Nemirovski, A.¹ Shapiro, A.²

15
- 14344250395
- Robust Markov decision processes with uncertain transition matrices
- Nilim, A., & El Ghaoui, L. Robust Markov decision processes with uncertain transition matrices. Operations Research, 53, 780-798.
- Operations Research , vol.53 , pp. 780-798
- Nilim, A.¹ El Ghaoui, L.²

16
- 0003410675
- Kluwer Academic Publishers
- Prékopa, A. (1995). Stochastic programming. Kluwer Academic Publishers.
- (1995) Stochastic programming
- Prékopa, A.¹

17
- 85102627959
- Wiley
- Putterman, M. (1994). Markov decision processes: Discrete stochastic dynamic programming. Wiley.
- (1994) Markov decision processes: Discrete stochastic dynamic programming
- Putterman, M.¹

18
- 34547984629
- Markovian decision processes with uncertain transition probabilities or rewards
- 1, Operations Research Center, MIT
- Silver, E. (1963). Markovian decision processes with uncertain transition probabilities or rewards (Technical Report 1). Operations Research Center, MIT.
- (1963) Technical Report
- Silver, E.¹

19
- 31844432138
- A theoretical analysis of model-based interval estimation
- Støehl, A., & Littman, M. (2005). A theoretical analysis of model-based interval estimation. Proc. ICML (pp. 857-864).
- (2005) Proc. ICML , pp. 857-864
- Støehl, A.¹ Littman, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.