-
2
-
-
0036568025
-
Finite-time analysis of the multi-armed bandit problem
-
P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multi-armed bandit problem. Machine Learning, 47:235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
4
-
-
84877752876
-
Multiple identifications in multi-armed bandits
-
abs/1205.3181
-
S. Bubeck, T. Wang, and N. Viswanathan. Multiple identifications in multi-armed bandits. CoRR, abs/1205.3181, 2012.
-
(2012)
CoRR
-
-
Bubeck, S.1
Wang, T.2
Viswanathan, N.3
-
6
-
-
33745295134
-
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
-
E. Even-Dar, S. Mannor, and Y. Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006.
-
(2006)
Journal of Machine Learning Research
, vol.7
, pp. 1079-1105
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
8
-
-
85162482585
-
Multi-bandit best arm identification
-
V. Gabillon, M. Ghavamzadeh, A. Lazaric, and S. Bubeck. Multi-bandit best arm identification. In Proceedings of Advances in Neural Information Processing Systems 25, pages 2222-2230, 2011.
-
(2011)
Proceedings of Advances in Neural Information Processing Systems
, vol.25
, pp. 2222-2230
-
-
Gabillon, V.1
Ghavamzadeh, M.2
Lazaric, A.3
Bubeck, S.4
-
9
-
-
84867121052
-
-
PhD thesis, Department of Computer Science, The University of Texas at Austin, Austin, Texas, USA, December. Published as UT Austin Computer Science Technical Report TR-11-41
-
S. Kalyanakrishnan. Learning Methods for Sequential Decision Making with Imperfect Representations. PhD thesis, Department of Computer Science, The University of Texas at Austin, Austin, Texas, USA, December 2011. Published as UT Austin Computer Science Technical Report TR-11-41.
-
(2011)
Learning Methods for Sequential Decision Making with Imperfect Representations
-
-
Kalyanakrishnan, S.1
-
12
-
-
0001923944
-
Hoeffding races: Accelerating model selection search for classification and function approximation
-
O. Maron and A. Moore. Hoeffding races: Accelerating model selection search for classification and function approximation. In Proceedings of Advances in Neural Information Processing Systems 6, pages 59-66, 1993.
-
(1993)
Proceedings of Advances in Neural Information Processing Systems
, vol.6
, pp. 59-66
-
-
Maron, O.1
Moore, A.2
-
15
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
H. Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematics Society, 58:527-535, 1952.
-
(1952)
Bulletin of the American Mathematics Society
, vol.58
, pp. 527-535
-
-
Robbins, H.1
|