-
3
-
-
84954256963
-
Online least squares estimation with self-normalized processes: An application to bandit problems
-
Yasin Abbasi-Yadkori, Dávid Pál, and Csaba Szepesvari. Online least squares estimation with self-normalized processes: An application to bandit problems. arXiv preprint http://arxiv.org/abs/1102.2670, 2011b.
-
(2011)
arXiv Preprint
-
-
Abbasi-Yadkori, Y.1
Pál, D.2
Szepesvari, C.3
-
6
-
-
62949181077
-
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
-
Jean-Yves Audibert, Rémi Munos, and Csaba Szepesvari. Exploration-exploitation tradeoff using variance estimates in multi-armed bandits. Theoretical Computer Science, 410(19):1876-1902, 2009.
-
(2009)
Theoretical Computer Science
, vol.410
, Issue.19
, pp. 1876-1902
-
-
Audibert, J.1
Munos, R.2
Szepesvari, C.3
-
7
-
-
0041966002
-
Using confidence bounds for exploitation-exploration trade-offs
-
Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3:397-422, 2002.
-
(2002)
Journal of Machine Learning Research
, vol.3
, pp. 397-422
-
-
Auer, P.1
-
8
-
-
0036568025
-
Finite time analysis of the multiarmed bandit problem
-
Peter Auer, Nicolo Cesa-Bianchi, and Paul Fischer. Finite time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002a.
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
9
-
-
0036477185
-
Adaptive and self-confident on-line learning algorithms
-
Peter Auer, Nicolo Cesa-Bianchi, and Claudio Gentile. Adaptive and self-confident on-line learning algorithms. Journal of Computer and System Sciences, 64(1):48-75, 2002b.
-
(2002)
Journal of Computer and System Sciences
, vol.64
, Issue.1
, pp. 48-75
-
-
Auer, P.1
Cesa-Bianchi, N.2
Gentile, C.3
-
13
-
-
84926078662
-
-
Cambridge University Press, New York, NY, USA
-
Nicolò Cesa-Bianchi and Gábor Lugosi. Prediction, Learning, and Games. Cambridge University Press, New York, NY, USA, 2006.
-
(2006)
Prediction, Learning, and Games
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
-
14
-
-
4544304381
-
On the generalization ability of on-line learning algorithms
-
Nicolò Cesa-Bianchi, Alex Conconi, and Claudio Gentile. On the generalization ability of on-line learning algorithms. IEEE Transactions on Information Theory, 50(9):2050-2057, 2004.
-
(2004)
IEEE Transactions on Information Theory
, vol.50
, Issue.9
, pp. 2050-2057
-
-
Cesa-Bianchi, N.1
Conconi, A.2
Gentile, C.3
-
16
-
-
84898072179
-
Stochastic linear optimization under bandit feedback
-
Rocco Servedio and Tong Zhang, editors
-
Varsha Dani, Thomas P. Hayes, and Sham M. Kakade. Stochastic linear optimization under bandit feedback. In Rocco Servedio and Tong Zhang, editors, Proceedings of the 21st Annual Conference on Learning Theory (COLT 2008), pages 355-366, 2008.
-
(2008)
Proceedings of the 21st Annual Conference on Learning Theory (COLT 2008)
, pp. 355-366
-
-
Dani, V.1
Hayes, T.P.2
Kakade, S.M.3
-
17
-
-
56449091064
-
Data-driven online to batch conversions
-
Ofer Dekel and Yoram Singer. Data-driven online to batch conversions. NIPS 2005, 18:267, 2006.
-
(2006)
NIPS 2005
, vol.18
, pp. 267
-
-
Dekel, O.1
Singer, Y.2
-
19
-
-
0002384441
-
On tail probabilities for martingales
-
David A. Freedman. On tail probabilities for martingales. The Annals of Probability, 3(1):100-118, 1975.
-
(1975)
The Annals of Probability
, vol.3
, Issue.1
, pp. 100-118
-
-
Freedman, D.A.1
-
20
-
-
0033280804
-
The robustness of the p-norm algorithms
-
New York, NY, USA, ACM
-
Claudio Gentile and Nick Littlestone. The robustness of the p-norm algorithms. In Proceedings of the twelfth annual conference on Computational learning theory, pages 1-11, New York, NY, USA, 1999. ACM.
-
(1999)
Proceedings of the Twelfth Annual Conference on Computational Learning Theory
, pp. 1-11
-
-
Gentile, C.1
Littlestone, N.2
-
22
-
-
0030661191
-
General convergence results for linear discriminant updates
-
ACM Press
-
Adam J. Grove, Nick Littlestone, and Dale Schuurmans. General convergence results for linear discriminant updates. In Machine Learning, pages 171-183. ACM Press, 1997.
-
(1997)
Machine Learning
, pp. 171-183
-
-
Grove, A.J.1
Littlestone, N.2
Schuurmans, D.3
-
24
-
-
0008815681
-
Exponentiated gradient versus gradient descent for linear predictors
-
January
-
Jyrki Kivinen and Manfred K. Warmuth. Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132(1):1-63, January 1997.
-
(1997)
Information and Computation
, vol.132
, Issue.1
, pp. 1-63
-
-
Kivinen, J.1
Warmuth, M.K.2
-
25
-
-
0018190841
-
Strong consistency of least squares estimates in multiple regression
-
Tze Leung Lai, Herbert Robbins, and Ching Zong Wei. Strong consistency of least squares estimates in multiple regression. Proceedings of the National Academy of Sciences, 75(7):3034-3036, 1979.
-
(1979)
Proceedings of the National Academy of Sciences
, vol.75
, Issue.7
, pp. 3034-3036
-
-
Lai, T.L.1
Robbins, H.2
Wei, C.Z.3
-
27
-
-
77954641643
-
A contextual-bandit approach to personalized news article recommendation
-
ACM
-
Lihong Li, Wei Chu, John Langford, and Robert E. Schapire. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th International Conference on World Wide Web (WWW 2010), pages 661-670. ACM, 2010.
-
(2010)
Proceedings of the 19th International Conference on World Wide Web (WWW 2010)
, pp. 661-670
-
-
Li, L.1
Chu, W.2
Langford, J.3
Schapire, R.E.4
-
28
-
-
34250091945
-
Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
-
Nick Littlestone. Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2(4):285-318, 1988.
-
(1988)
Machine Learning
, vol.2
, Issue.4
, pp. 285-318
-
-
Littlestone, N.1
-
29
-
-
85011913774
-
From on-line to batch learning
-
Association for Computing Machinery, Inc, One Astor Plaza, 1515 Broadway, New York, NY, 10036-5701, USA
-
Nicolas Littlestone. From on-line to batch learning. In Annual Workshop on Computational Learning Theory: Proceedings of the second annual workshop on Computational learning theory (COLT 1989). Association for Computing Machinery, Inc, One Astor Plaza, 1515 Broadway, New York, NY, 10036-5701, USA, 1989.
-
(1989)
Annual Workshop on Computational Learning Theory: Proceedings of the Second Annual Workshop on Computational Learning Theory (COLT 1989)
-
-
Littlestone, N.1
-
30
-
-
30044441333
-
The sample complexity of exploration in the multi-armed bandit problem
-
Shie Mannor and John N. Tsitsiklis. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research, 5:623-648, 2004.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 623-648
-
-
Mannor, S.1
Tsitsiklis, J.N.2
-
31
-
-
56449108844
-
Empirical Bernstein stopping
-
Andrew McCallum and Sam Roweis, editors
-
Volodymyr Mnih, Csaba Szepesvari, and Jean-Yves Audibert. Empirical Bernstein stopping. In Andrew McCallum and Sam Roweis, editors, Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), pages 672-679, 2008.
-
(2008)
Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008)
, pp. 672-679
-
-
Mnih, V.1
Szepesvari, C.2
Audibert, J.3
-
34
-
-
0035413537
-
Competitive on-line statistics
-
Vladimir Vovk. Competitive on-line statistics. International Statistical Review, 69:213-248, 2001.
-
(2001)
International Statistical Review
, vol.69
, pp. 213-248
-
-
Vovk, V.1
-
35
-
-
79958846996
-
Exploring compact reinforcement-learning representations with linear regression
-
AUAI Press
-
Thomas J. Walsh, István Szita, Carlos Diuk, and Michael L. Littman. Exploring compact reinforcement-learning representations with linear regression. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI 2009), pages 591-598. AUAI Press, 2009.
-
(2009)
Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI 2009)
, pp. 591-598
-
-
Walsh, T.J.1
Szita, I.2
Diuk, C.3
Littman, M.L.4