-
1
-
-
0345224411
-
The continuum-armed bandit problem
-
R. Agrawal. The continuum-armed bandit problem. SIAM J. Control and Optimization, 33:1926-1951, 1995.
-
(1995)
SIAM J. Control and Optimization
, vol.33
, pp. 1926-1951
-
-
Agrawal, R.1
-
3
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
4
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2003.
-
(2003)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
5
-
-
38049040954
-
Improved rates for the stochastic continuum-armed bandit problem
-
Springer
-
Peter Auer, Ronald Ortner, and Csaba Szepesvári. Improved rates for the stochastic continuum-armed bandit problem. In Proceedings of the 20th Annual Conference on Learning Theory (COLT 2007), pages 454-468. Springer, 2007.
-
(2007)
Proceedings of the 20th Annual Conference on Learning Theory, (COLT 2007)
, pp. 454-468
-
-
Auer, P.1
Ortner, R.2
Szepesvári, C.3
-
6
-
-
77952027689
-
Online optimization in X-armed bandits
-
Sébastien Bubeck, Rémi Munos, Gilles Stoltz, and Csaba Szepesvári. Online optimization in X-armed bandits. In NIPS, pages 201-208, 2008.
-
(2008)
NIPS
, pp. 201-208
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
Szepesvári, C.4
-
9
-
-
33745295134
-
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
-
Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006.
-
(2006)
Journal of Machine Learning Research
, vol.7
, pp. 1079-1105
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
10
-
-
20744454447
-
Online convex optimization in the bandit setting: Gradient descent without a gradient
-
Society for Industrial and Applied Mathematics Philadelphia, PA, USA
-
Abraham D. Flaxman, Adam T. Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: gradient descent without a gradient. In Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms (SODA 2005), pages 385-394. Society for Industrial and Applied Mathematics Philadelphia, PA, USA, 2005.
-
(2005)
Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA 2005)
, pp. 385-394
-
-
Flaxman, A.D.1
Kalai, A.T.2
McMahan, H.B.3
-
12
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963.
-
(1963)
Journal of the American Statistical Association
, vol.58
, Issue.301
, pp. 13-30
-
-
Hoeffding, W.1
-
13
-
-
84898981061
-
Nearly tight bounds for the continuum-armed bandit problem
-
Lawrence K. Saul, Yair Weiss, and Léon Bottou, editors. MIT Press
-
Robert D. Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In Lawrence K. Saul, Yair Weiss, and Léon Bottou, editors, Advances in Neural Information Processing Systems 17, (NIPS 2005), pages 697-704. MIT Press, 2005a.
-
(2005)
Advances in Neural Information Processing Systems 17, (NIPS 2005)
, pp. 697-704
-
-
Kleinberg, R.D.1
-
15
-
-
57049185311
-
Multi-armed bandits in metric spaces
-
Association for Computing Machinery
-
Robert D. Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Multi-armed bandits in metric spaces. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC 2008), pages 681-690. Association for Computing Machinery, 2008.
-
(2008)
Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC 2008)
, pp. 681-690
-
-
Kleinberg, R.D.1
Slivkins, A.2
Upfal, E.3
-
16
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T. L. Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6(1):4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, Issue.1
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
18
-
-
83055177001
-
The epoch-greedy algorithm for multi-armed bandits with side information
-
John Langford and Tong Zhang. The epoch-greedy algorithm for multi-armed bandits with side information. In NIPS, 2007.
-
(2007)
NIPS
-
-
Langford, J.1
Zhang, T.2
-
20
-
-
15844389867
-
Bandit problems with side observations
-
May
-
Chih-Chun Wang, Sanjeev R. Kulkarni, and H. Vincent Poor. Bandit problems with side observations. IEEE Transactions on Automatic Control, 50(3):338-355, May 2005.
-
(2005)
IEEE Transactions on Automatic Control
, vol.50
, Issue.3
, pp. 338-355
-
-
Wang, C.-C.1
Kulkarni, S.R.2
Poor, H.V.3