[2] Peter Auer, Nicolò Cesa-Bianchi, Paul Fischer, Finite-time analysis of the multiarmed bandit problem, Mach. Learn. 47 (2) (2002) 235-256.
[3] Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, Robert Schapire, The nonstochastic multiarmed bandit problem, SIAM J. Comput. 32 (1) (2002) 48-77.
[4] Micah Adler, Peter Gemmell, Mor Harchol-Balter, Richard Karp, Claire Kenyon, Selection in the presence of noise: The design of playoff systems, in: ACM-SIAM Symposium on Discrete Algorithms (SODA), 1994.
[6] Peter Auer, Using confidence bounds for exploitation-exploration trade-offs, J. Mach. Learn. Res. 3 (2003) 397-422.
[7] Maria-Florina Balcan, Nikhil Bansal, Alina Beygelzimer, Don Coppersmith, John Langford, Gregory Sorkin, Robust reductions from ranking to classification, in: Conference on Learning Theory (COLT), 2007.
[9] Nicolò Cesa-Bianchi, Gábor Lugosi, Gilles Stoltz, Regret minimization under partial monitoring, Math. Oper. Res. 31 (3) (2006) 562-580.
[12] Eyal Even-Dar, Shie Mannor, Yishay Mansour, Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems, J. Mach. Learn. Res. 7 (2006) 1079-1105.
[16] Wassily Hoeffding, Probability inequalities for sums of bounded random variables, J. Amer. Statist. Assoc. 58 (1963) 13-30.
[20] T.L. Lai, Herbert Robbins, Asymptotically efficient adaptive allocation rules, Adv. in Appl. Math. 6 (1985) 4-22.
[24] Shie Mannor, John N. Tsitsiklis, The sample complexity of exploration in the multi-armed bandit problem, J. Mach. Learn. Res. 5 (2004) 623-648.
[25] Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, Vanja Josifovski, Bandits for taxonomies: A model-based approach, in: SIAM Conference on Data Mining (SDM), 2007.
[27] Herbert Robbins, Some aspects of the sequential design of experiments, Bull. Amer. Math. Soc. 58 (1952) 527-535.
[28] Yisong Yue, Josef Broder, Robert Kleinberg, Thorsten Joachims, The k-armed dueling bandits problem, in: Conference on Learning Theory (COLT), 2009.