-
2
-
-
0041965975
-
R-max - a general polynomial time algorithm for near-optimal reinforcement learning
-
Brafman, R., & Tennenholtz, M. (2003). R-max - a general polynomial time algorithm for near-optimal reinforcement learning. J. of Machine Learning Research., 3, 213-231.
-
(2003)
J. of Machine Learning Research
, vol.3
, pp. 213-231
-
-
Brafman, R.1
Tennenholtz, M.2
-
4
-
-
0002395681
-
Chance constrained programming
-
Charnes, A., & Cooper, W. (1959). Chance constrained programming. Management Science, 6, 73-79.
-
(1959)
Management Science
, vol.6
, pp. 73-79
-
-
Charnes, A.1
Cooper, W.2
-
6
-
-
0029219995
-
Percentile performance criteria for limiting average Markov control problems
-
Filar, J., Krass, D., & Ross, K. (1995). Percentile performance criteria for limiting average Markov control problems. IEEE Trans, on Automatic Control, 40, 2-10.
-
(1995)
IEEE Trans, on Automatic Control
, vol.40
, pp. 2-10
-
-
Filar, J.1
Krass, D.2
Ross, K.3
-
7
-
-
0004012196
-
-
second edition. Chapman & Hall/CRC
-
Gelman, A., Carlin, J., Stern, H., & Rubin, D. (2003). Bayesian data analysis, second edition. Chapman & Hall/CRC.
-
(2003)
Bayesian data analysis
-
-
Gelman, A.1
Carlin, J.2
Stern, H.3
Rubin, D.4
-
8
-
-
0034272032
-
Boundedparameter Markov decision processes
-
Givan, R., Leach, S., & Dean, T. (2000). Boundedparameter Markov decision processes. Artificial Intelligence, 122, 71-109.
-
(2000)
Artificial Intelligence
, vol.122
, pp. 71-109
-
-
Givan, R.1
Leach, S.2
Dean, T.3
-
11
-
-
0012257655
-
Near-optimal reinforcement learning in polynomial time
-
Kearns, M., & Singh, S. (1998). Near-optimal reinforcement learning in polynomial time. Proc. ICML (pp. 260-268).
-
(1998)
Proc. ICML
, pp. 260-268
-
-
Kearns, M.1
Singh, S.2
-
12
-
-
0041940559
-
Applications of second order cone programming
-
Lobo, M., Vandenberghe, L., Boyd, S., & Lebret, H. (1998). Applications of second order cone programming. Linear Algebra and its App., 284, 193-228.
-
(1998)
Linear Algebra and its App
, vol.284
, pp. 193-228
-
-
Lobo, M.1
Vandenberghe, L.2
Boyd, S.3
Lebret, H.4
-
13
-
-
33847336943
-
Bias and variance in value function estimation
-
Mannor, S., Simester, D., Sun, P., & Tsitsiklis, J. (2007). Bias and variance in value function estimation. Management Science, 53, 308-322.
-
(2007)
Management Science
, vol.53
, pp. 308-322
-
-
Mannor, S.1
Simester, D.2
Sun, P.3
Tsitsiklis, J.4
-
14
-
-
36248992411
-
Convex approximations of chance constrained programs
-
Nemirovski, A., & Shapiro, A. (2006). Convex approximations of chance constrained programs. SIAM Journal on Optimization, 17, 969-996.
-
(2006)
SIAM Journal on Optimization
, vol.17
, pp. 969-996
-
-
Nemirovski, A.1
Shapiro, A.2
-
15
-
-
14344250395
-
Robust Markov decision processes with uncertain transition matrices
-
Nilim, A., & El Ghaoui, L. Robust Markov decision processes with uncertain transition matrices. Operations Research, 53, 780-798.
-
Operations Research
, vol.53
, pp. 780-798
-
-
Nilim, A.1
El Ghaoui, L.2
-
18
-
-
34547984629
-
Markovian decision processes with uncertain transition probabilities or rewards
-
1, Operations Research Center, MIT
-
Silver, E. (1963). Markovian decision processes with uncertain transition probabilities or rewards (Technical Report 1). Operations Research Center, MIT.
-
(1963)
Technical Report
-
-
Silver, E.1
-
19
-
-
31844432138
-
A theoretical analysis of model-based interval estimation
-
Støehl, A., & Littman, M. (2005). A theoretical analysis of model-based interval estimation. Proc. ICML (pp. 857-864).
-
(2005)
Proc. ICML
, pp. 857-864
-
-
Støehl, A.1
Littman, M.2
|