-
1
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2): 235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, Issue.2
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
2
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1): 48-77, 2002.
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
3
-
-
0028317505
-
Selection in the presence of noise: The design of playoff systems
-
Micah Adler, Peter Gemmell, Mor Harchol-Balter, Richard Karp, and Claire Kenyon. Selection in the presence of noise: The design of playoff systems. In ACM-SIAM Symposium on Discrete Algorithms (SODA), 1994.
-
(1994)
ACM-SIAM Symposium on Discrete Algorithms (SODA)
-
-
Adler, M.1
Gemmell, P.2
Harchol-Balter, M.3
Karp, R.4
Kenyon, C.5
-
5
-
-
0041966002
-
Using confidence bounds for exploitation-exploration trade
-
Peter Auer. Using confidence bounds for exploitation-exploration trade. Journal of Machine Learning Research (JMLR), 3: 397-422, 2003.
-
(2003)
Journal of Machine Learning Research (JMLR)
, vol.3
, pp. 397-422
-
-
Auer, P.1
-
6
-
-
84861596367
-
Robust reductions from ranking to classification
-
Maria-Florina Balcan, Nikhil Bansal, Alina Beygelzimer, Don Coppersmith, John Langford, and Gregory Sorkin. Robust reductions from ranking to classification. In Conference on Learning Theory (COLT), 2007.
-
(2007)
Conference on Learning Theory (COLT)
-
-
Balcan, M.-F.1
Bansal, N.2
Beygelzimer, A.3
Coppersmith, D.4
Langford, J.5
Sorkin, G.6
-
8
-
-
33748442333
-
Regret minimization under partial monitoring
-
Nicolò Cesa-Bianchi, Gábor Lugosi, and Gilles Stoltz. Regret minimization under partial monitoring. Mathematics of Operations Research, 31(3): 562-580, 2006.
-
(2006)
Mathematics of Operations Research
, vol.31
, Issue.3
, pp. 562-580
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
Stoltz, G.3
-
11
-
-
33745295134
-
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
-
Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research (JMLR), 7: 1079-1105, 2006.
-
(2006)
Journal of Machine Learning Research (JMLR)
, vol.7
, pp. 1079-1105
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
12
-
-
4644367942
-
An efficient boosting algorithm for combining preferences
-
Yoav Freund, Raj Iyer, Robert Schapire, and Yoram Singer. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research (JMLR), 4: 933-969, 2003.
-
(2003)
Journal of Machine Learning Research (JMLR)
, vol.4
, pp. 933-969
-
-
Freund, Y.1
Iyer, R.2
Schapire, R.3
Singer, Y.4
-
13
-
-
0028516898
-
Computing with noisy information
-
Uriel Feige, Prabhakar Raghavan, David Peleg, and Eli Upfal. Computing with noisy information. SIAM Journal on Computing, 23(5), 1994.
-
(1994)
SIAM Journal on Computing
, vol.23
, Issue.5
-
-
Feige, U.1
Raghavan, P.2
Peleg, D.3
Upfal, E.4
-
15
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58: 13-30, 1963.
-
(1963)
Journal of the American Statistical Association
, vol.58
, pp. 13-30
-
-
Hoeffding, W.1
-
20
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T. L. Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6: 4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
24
-
-
30044441333
-
The sample complexity of exploration in the multi-armed bandit problem
-
Shie Mannor and John N. Tsitsiklis. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research (JMLR), 5: 623-648, 2004.
-
(2004)
Journal of Machine Learning Research (JMLR)
, vol.5
, pp. 623-648
-
-
Mannor, S.1
Tsitsiklis, J.N.2
-
27
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
Herbert Robbins. Some Aspects of the Sequential Design of Experiments. Bull. Amer. Math. Soc., 58: 527-535, 1952.
-
(1952)
Bull. Amer. Math. Soc.
, vol.58
, pp. 527-535
-
-
Robbins, H.1
|