-
1
-
-
78649420293
-
Regret bounds and minimax policies under partial monitoring
-
Audibert, J.-Y., Bubeck, S.: Regret bounds and minimax policies under partial monitoring. Journal of Machine Learning Research 11, 2635-2686 (2010)
-
(2010)
Journal of Machine Learning Research
, vol.11
, pp. 2635-2686
-
-
Audibert, J.-Y.1
Bubeck, S.2
-
3
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning Journal 47(2-3), 235-256 (2002)
-
(2002)
Machine Learning Journal
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
4
-
-
0037709910
-
The non-stochastic multi-armed bandit problem
-
Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.: The non-stochastic multi-armed bandit problem. SIAM Journal on Computing 32(1), 48-77 (2002)
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
6
-
-
38049040954
-
Improved rates for the stochastic continuum-armed bandit problem
-
Bshouty, N.H., Gentile, C. (eds.) COLT. Springer, Heidelberg
-
Auer, P., Ortner, R., Szepesvári, C.: Improved rates for the stochastic continuum-armed bandit problem. In: Bshouty, N.H., Gentile, C. (eds.) COLT. LNCS (LNAI), vol. 4539, pp. 454-468. Springer, Heidelberg (2007)
-
(2007)
LNCS (LNAI)
, vol.4539
, pp. 454-468
-
-
Auer, P.1
Ortner, R.2
Szepesvári, C.3
-
8
-
-
79960128338
-
X-armed bandits
-
Bubeck, S., Munos, R., Stoltz, G., Szepesvári, C.: X-armed bandits. Journal of Machine Learning Research 12, 1655-1695 (2011)
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 1655-1695
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
Szepesvári, C.4
-
10
-
-
67649577204
-
Regret and convergence bounds for immediate-reward reinforcement learning with continuous action spaces
-
Cope, E.: Regret and convergence bounds for immediate-reward reinforcement learning with continuous action spaces. IEEE Transactions on Automatic Control 54(6), 1243-1253 (2009)
-
(2009)
IEEE Transactions on Automatic Control
, vol.54
, Issue.6
, pp. 1243-1253
-
-
Cope, E.1
-
11
-
-
84898072179
-
Stochastic linear optimization under bandit feedback
-
Omnipress
-
Dani, V., Hayes, T.P., Kakade, S.M.: Stochastic linear optimization under bandit feedback. In: Proceedings of the 21st Annual Conference on Learning Theory, pp. 355-366. Omnipress (2008)
-
(2008)
Proceedings of the 21st Annual Conference on Learning Theory
, pp. 355-366
-
-
Dani, V.1
Hayes, T.P.2
Kakade, S.M.3
-
12
-
-
30344439147
-
Optimal algorithms for global optimization in case of unknown Lipschitz constant
-
Horn, M.: Optimal algorithms for global optimization in case of unknown Lipschitz constant. Journal of Complexity 22(1) (2006)
-
(2006)
Journal of Complexity
, vol.22
, Issue.1
-
-
Horn, M.1
-
13
-
-
0027678534
-
Lipschitzian optimization without the Lipschitz constant
-
Jones, D.R., Perttunen, C.D., Stuckman, B.E.: Lipschitzian optimization without the Lipschitz constant. Journal of Optimization Theory and Applications 79(1), 157-181 (1993)
-
(1993)
Journal of Optimization Theory and Applications
, vol.79
, Issue.1
, pp. 157-181
-
-
Jones, D.R.1
Perttunen, C.D.2
Stuckman, B.E.3
-
16
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
Robbins, H.: Some aspects of the sequential design of experiments. Bulletin of the American Mathematics Society 58, 527-535 (1952)
-
(1952)
Bulletin of the American Mathematics Society
, vol.58
, pp. 527-535
-
-
Robbins, H.1
-
17
-
-
84863381440
-
Algorithms for infinitely manyarmed bandits
-
Wang, Y., Audibert, J.Y., Munos, R.: Algorithms for infinitely manyarmed bandits. In: Advances in Neural Information Processing Systems, pp. 1729-1736 (2009)
-
(2009)
Advances in Neural Information Processing Systems
, pp. 1729-1736
-
-
Wang, Y.1
Audibert, J.Y.2
Munos, R.3
|