-
2
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2): 235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, Issue.2
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
3
-
-
79952624396
-
Pure exploration in finitely armed and continuous armed bandits
-
S. Bubeck, R. Munos, and G. Stoltz. Pure exploration in finitely armed and continuous armed bandits. Theoretical Computer Science 412, 1832-1852, 412: 1832-1852, 2011.
-
(2011)
Theoretical Computer Science 412, 1832-1852
, vol.412
, pp. 1832-1852
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
-
5
-
-
84898949562
-
Kullback-leibler upper confidence bounds for optimal sequential allocation
-
to appear
-
O. Cappé, A. Garivier, O-A. Maillard, R. Munos, and G. Stoltz. Kullback-Leibler upper confidence bounds for optimal sequential allocation. to appear in Annals of Statistics, 2013.
-
(2013)
Annals of Statistics
-
-
Cappé, O.1
Garivier, A.2
Maillard, O.-A.3
Munos, R.4
Stoltz, G.5
-
7
-
-
33745295134
-
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
-
E. Even-Dar, S. Mannor, and Y. Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7: 1079-1105, 2006.
-
(2006)
Journal of Machine Learning Research
, vol.7
, pp. 1079-1105
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
15
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6(1): 4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, Issue.1
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
17
-
-
30044441333
-
The sample complexity of exploration in the multi-armed bandit problem
-
S. Mannor and J. Tsitsiklis. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research, pages 623-648, 2004.
-
(2004)
Journal of Machine Learning Research
, pp. 623-648
-
-
Mannor, S.1
Tsitsiklis, J.2
-
18
-
-
0031069121
-
The racing algorithm: Model selection for lazy learners
-
O. Maron and A. Moore. The racing algorithm: Model selection for lazy learners. Artificial Intelligence Review, 11(1-5): 113-131, 1997.
-
(1997)
Artificial Intelligence Review
, vol.11
, Issue.1-5
, pp. 113-131
-
-
Maron, O.1
Moore, A.2
-
20
-
-
0001395850
-
On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
-
W.R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25: 285-294, 1933.
-
(1933)
Biometrika
, vol.25
, pp. 285-294
-
-
Thompson, W.R.1
|