-
1
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1): 48-77, 2003.
-
(2003)
SIAM J. Comput.
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
2
-
-
4544345025
-
Adaptive routing with end-to-end feedback: Distributed learning and geometric approaches
-
New York, NY, USA: ACM
-
Baruch Awerbuch and Robert D. Kleinberg. Adaptive routing with end-to-end feedback: distributed learning and geometric approaches. In STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computing, pages 45-53, New York, NY, USA, 2004. ACM.
-
(2004)
STOC '04: Proceedings of the Thirty-sixth Annual ACM Symposium on Theory of Computing
, pp. 45-53
-
-
Awerbuch, B.1
Kleinberg, R.D.2
-
3
-
-
84898060591
-
High-probability bounds for the regret of bandit online linear optimization
-
In submission to COLT 2008
-
P. Bartlett, V. Dani, T. Hayes, S. Kakade, A. Rakhlin, and A. Tewari. High-probability bounds for the regret of bandit online linear optimization, 2008. In submission to COLT 2008.
-
COLT
-
-
Bartlett, P.1
Dani, V.2
Hayes, T.3
Kakade, S.4
Rakhlin, A.5
Tewari, A.6
-
4
-
-
0003878614
-
-
SIAM, Philadelphia
-
A. Ben-Tal and A. Nemirovski. Lectures on Modern Convex Optimization: Analysis, Algorithms, and Engineering Applications, volume 2 of MPS/SIAM Series on Optimization. SIAM, Philadelphia, 2001.
-
(2001)
Lectures on Modern Convex Optimization: Analysis, Algorithms, and Engineering Applications, Volume 2 of MPS/SIAM Series on Optimization
-
-
Ben-Tal, A.1
Nemirovski, A.2
-
6
-
-
85162050055
-
The price of bandit information for online optimization
-
J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors. MIT Press, Cambridge, MA
-
Varsha Dani, Thomas Hayes, and Sham Kakade. The price of bandit information for online optimization. In J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems 20. MIT Press, Cambridge, MA, 2008.
-
(2008)
Advances in Neural Information Processing Systems
, vol.20
-
-
Dani, V.1
Hayes, T.2
Kakade, S.3
-
7
-
-
33244456637
-
Robbing the bandit: Less regret in online geometric optimization against an adaptive adversary
-
New York, NY, USA: ACM
-
Varsha Dani and Thomas P. Hayes. Robbing the bandit: less regret in online geometric optimization against an adaptive adversary. In SODA '06: Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithms, pages 937-943, New York, NY, USA, 2006. ACM.
-
(2006)
SODA '06: Proceedings of the Seventeenth Annual ACM-SIAM Symposium on Discrete Algorithms
, pp. 937-943
-
-
Dani, V.1
Hayes, T.P.2
-
8
-
-
84898069824
-
Correction to 'universal prediction of individual sequences' (Jul 92 1258-1270)
-
Meir Feder, Neri Merhav, and Michael Gutman. Correction to 'universal prediction of individual sequences' (Jul 92 1258-1270). IEEE Transactions on Information Theory, 40(1): 285, 1994.
-
(1994)
IEEE Transactions on Information Theory
, vol.40
, Issue.1
, pp. 285
-
-
Feder, M.1
Merhav, N.2
Gutman, M.3
-
9
-
-
20744454447
-
Online convex optimization in the bandit setting: Gradient descent without a gradient
-
Philadelphia, PA, USA: Society for Industrial and Applied Mathematics
-
Abraham D. Flaxman, Adam Tauman Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: gradient descent without a gradient. In SODA '05: Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms, pages 385-394, Philadelphia, PA, USA, 2005. Society for Industrial and Applied Mathematics.
-
(2005)
SODA '05: Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms
, pp. 385-394
-
-
Flaxman, A.D.1
Kalai, A.T.2
McMahan, H.B.3
-
10
-
-
0033336069
-
Relative loss bounds for single neurons
-
November
-
D. P. Helmbold, J. Kivinen, and M. K. Warmuth. Relative loss bounds for single neurons. IEEE Transactions on Neural Networks, 10(6): 1291-1304, November 1999.
-
(1999)
IEEE Transactions on Neural Networks
, vol.10
, Issue.6
, pp. 1291-1304
-
-
Helmbold, D.P.1
Kivinen, J.2
Warmuth, M.K.3
-
11
-
-
24644463787
-
Efficient algorithms for online decision problems
-
Adam Kalai and Santosh Vempala. Efficient algorithms for online decision problems. Journal of Computer and System Sciences, 71(3): 291-307, 2005.
-
(2005)
Journal of Computer and System Sciences
, vol.71
, Issue.3
, pp. 291-307
-
-
Kalai, A.1
Vempala, S.2
-
12
-
-
0008815681
-
Exponentiated gradient versus gradient descent for linear predictors
-
Jyrki Kivinen and Manfred K. Warmuth. Exponentiated gradient versus gradient descent for linear predictors. Inf. Comput., 132(1): 1-63, 1997.
-
(1997)
Inf. Comput.
, vol.132
, Issue.1
, pp. 1-63
-
-
Kivinen, J.1
Warmuth, M.K.2
-
14
-
-
9444257628
-
Online geometric optimization in the bandit setting against an adaptive adversary
-
H. Brendan McMahan and Avrim Blum. Online geometric optimization in the bandit setting against an adaptive adversary. In COLT, pages 109-123, 2004.
-
(2004)
COLT
, pp. 109-123
-
-
McMahan, H.B.1
Blum, A.2
-
16
-
-
0003254250
-
Interior point polynomial algorithms in convex programming
-
Philadelphia
-
Y. E. Nesterov and A. S. Nemirovskii. Interior Point Polynomial Algorithms in Convex Programming. SIAM, Philadelphia, 1994.
-
(1994)
SIAM
-
-
Nesterov, Y.E.1
Nemirovskii, A.S.2
-
18
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
Herbert Robbins. Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc., 58(5): 527-535, 1952.
-
(1952)
Bull. Amer. Math. Soc.
, vol.58
, Issue.5
, pp. 527-535
-
-
Robbins, H.1
-
19
-
-
35348915372
-
A primal-dual perspective of online learning algorithms
-
Shai Shalev-Shwartz and Yoram Singer. A primal-dual perspective of online learning algorithms. Mach. Learn., 69(2-3): 115-142, 2007.
-
(2007)
Mach. Learn.
, vol.69
, Issue.2-3
, pp. 115-142
-
-
Shalev-Shwartz, S.1
Singer, Y.2
-
20
-
-
3142657664
-
Path kernels and multiplicative updates
-
Eiji Takimoto and Manfred K. Warmuth. Path kernels and multiplicative updates. J. Mach. Learn. Res., 4: 773-818, 2003.
-
(2003)
J. Mach. Learn. Res.
, vol.4
, pp. 773-818
-
-
Takimoto, E.1
Warmuth, M.K.2
-
21
-
-
1942484421
-
Online convex programming and generalized infinitesimal gradient ascent
-
Martin Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML, pages 928-936, 2003.
-
(2003)
ICML
, pp. 928-936
-
-
Zinkevich, M.1