-
1
-
-
33750733956
-
Hannan consistency in on-line learning in case of unbounded losses under partial monitoring
-
Springer
-
C. Allenberg, P. Auer, L. Györfi, and G. Ottucsák. Hannan consistency in on-line learning in case of unbounded losses under partial monitoring. In ALT, volume 4264 of Lecture Notes in Computer Science, pages 229-243. Springer, 2006.
-
(2006)
ALT, Volume 4264 of Lecture Notes in Computer Science
, pp. 229-243
-
-
Allenberg, C.1
Auer, P.2
Györfi, L.3
Ottucsák, G.4
-
2
-
-
62949181077
-
Exploration-exploitation trade-off using variance estimates in multi-armed bandits
-
J.-Y. Audibert, R. Munos, and Cs. Szepesvári. Exploration- exploitation trade-off using variance estimates in multi-armed bandits. Theoretical Computer Science, 410:1876-1902, 2009.
-
(2009)
Theoretical Computer Science
, vol.410
, pp. 1876-1902
-
-
Audibert, J.-Y.1
Munos, R.2
Szepesvári, Cs.3
-
3
-
-
0041966002
-
Using confidence bounds for exploitation-exploration trade-offs
-
P. Auer. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3:397-422, 2002.
-
(2002)
Journal of Machine Learning Research
, vol.3
, pp. 397-422
-
-
Auer, P.1
-
4
-
-
0029513526
-
Gambling in a rigged casino: The adversarial multi-armed bandit problem
-
IEEE Computer Society Press
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. Schapire. Gambling in a rigged casino: the adversarial multi-armed bandit problem. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pages 322-331. IEEE Computer Society Press, 1995.
-
(1995)
Proceedings of the 36th Annual Symposium on Foundations of Computer Science
, pp. 322-331
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
5
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning Journal, 47(2-3):235-256, 2002a.
-
(2002)
Machine Learning Journal
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
6
-
-
0037709910
-
The non-stochastic multi-armed bandit problem
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. Schapire. The non-stochastic multi-armed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002b.
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
7
-
-
84972574511
-
Weighted sums of certain dependent random variables
-
K. Azuma. Weighted sums of certain dependent random variables. Tohoku Mathematical Journal, 19:357-367, 1967.
-
(1967)
Tohoku Mathematical Journal
, vol.19
, pp. 357-367
-
-
Azuma, K.1
-
8
-
-
0033285751
-
Analysis of two gradient-based algorithms for on-line regression
-
N. Cesa-Bianchi. Analysis of two gradient-based algorithms for on-line regression. Journal of Computer and System Sciences, 59(3):392-411, 1999.
-
(1999)
Journal of Computer and System Sciences
, vol.59
, Issue.3
, pp. 392-411
-
-
Cesa-Bianchi, N.1
-
10
-
-
0031140246
-
How to use expert advice
-
N. Cesa-Bianchi, Y. Freund, D. P. Helmbold, D. Haussler, R. E. Schapire, and M. K. Warmuth. How to use expert advice. Journal of the ACM, 44(3):427-485, 1997.
-
(1997)
Journal of the ACM
, vol.44
, Issue.3
, pp. 427-485
-
-
Cesa-Bianchi, N.1
Freund, Y.2
Helmbold, D.P.3
Haussler, D.4
Schapire, R.E.5
Warmuth, M.K.6
-
12
-
-
0002384441
-
On tail probabilities for martingales
-
D. A. Freedman. On tail probabilities for martingales. The Annals of Probability, 3:100-118, 1975.
-
(1975)
The Annals of Probability
, vol.3
, pp. 100-118
-
-
Freedman, D.A.1
-
13
-
-
33644897321
-
Adaptive routing using expert advice
-
A. György and G. Ottucsák. Adaptive routing using expert advice. Computer Journal-Oxford, 49(2):180-189, 2006.
-
(2006)
Computer Journal-Oxford
, vol.49
, Issue.2
, pp. 180-189
-
-
György, A.1
Ottucsák, G.2
-
15
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58:13-30, 1963.
-
(1963)
Journal of the American Statistical Association
, vol.58
, pp. 13-30
-
-
Hoeffding, W.1
-
16
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
H. Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematics Society, 58:527-535, 1952.
-
(1952)
Bulletin of the American Mathematics Society
, vol.58
, pp. 527-535
-
-
Robbins, H.1
|