-
1
-
-
0029513526
-
Gambling in a rigged casino: The adversial multi-armed bandit problem
-
Washington, DC, USA, Oct. IEEE Computer Society Press, Los Alamitos, CA
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. Gambling in a rigged casino: the adversial multi-armed bandit problem. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science, FOCS 1995, pages 322-331, Washington, DC, USA, Oct. 1995. IEEE Computer Society Press, Los Alamitos, CA.
-
(1995)
Proceedings of the 36th Annual Symposium on Foundations of Computer Science, FOCS 1995
, pp. 322-331
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
2
-
-
84972545864
-
An analog of the minimax theorem for vector payoffs
-
D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific Journal of Mathematics, 6:1-8, 1956.
-
(1956)
Pacific Journal of Mathematics
, vol.6
, pp. 1-8
-
-
Blackwell, D.1
-
3
-
-
0031140246
-
How to use expert advice
-
N. Cesa-Bianchi, Y. Freund, D. P. Helmbold, D. Haussler, R. Schapire, and M. K. Warmuth. How to use expert advice. Journal of the ACM, 44(3)-.427-485, 1997.
-
(1997)
Journal of the ACM
, vol.44
, Issue.3
, pp. 427-485
-
-
Cesa-Bianchi, N.1
Freund, Y.2
Helmbold, D.P.3
Haussler, D.4
Schapire, R.5
Warmuth, M.K.6
-
5
-
-
20544462399
-
Minimizing regret with label efficient prediction
-
June
-
N. Cesa-Bianchi, G. Lugosi, and G. Stoltz. Minimizing regret with label efficient prediction. IEEE Trans. Inform. Theory, 11-51:2152-2162, June 2005.
-
(2005)
IEEE Trans. Inform. Theory
, vol.11
, Issue.51
, pp. 2152-2162
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
Stoltz, G.3
-
6
-
-
26944464957
-
Improved second-order bounds for prediction with expert advice
-
N. Cesa-Bianchi, Y. Mansour, and G. Stoltz. Improved second-order bounds for prediction with expert advice. In COLT 2005, pages 217-232, 2005.
-
(2005)
COLT 2005
, pp. 217-232
-
-
Cesa-Bianchi, N.1
Mansour, Y.2
Stoltz, G.3
-
8
-
-
0000315398
-
Local convergence of martingales and the law of large numbers
-
Y. S. Chow. Local convergence of martingales and the law of large numbers. Annals of Mathematical Statistics, 36:552-558, 1965.
-
(1965)
Annals of Mathematical Statistics
, vol.36
, pp. 552-558
-
-
Chow, Y.S.1
-
9
-
-
0013175742
-
Strategies for sequential prediction of stationary time series
-
M. Dror, P. L'Ecuyer, and F. Szidarovszky, editors, Kluwer Academic Publishers
-
L. Györfi and G. Lugosi. Strategies for sequential prediction of stationary time series. In M. Dror, P. L'Ecuyer, and F. Szidarovszky, editors, Modelling Uncertainty: An Examination of its Theory, Methods and Applications, pages 225-248. Kluwer Academic Publishers, 2001.
-
(2001)
Modelling Uncertainty: An Examination of its Theory, Methods and Applications
, pp. 225-248
-
-
Györfi, L.1
Lugosi, G.2
-
11
-
-
33644897321
-
Adaptive routing using expert advice
-
A. György and Gy. Ottucsák. Adaptive routing using expert advice. The Computer Journal, 49(2):180-189, 2006.
-
(2006)
The Computer Journal
, vol.49
, Issue.2
, pp. 180-189
-
-
György, A.1
Ottucsák, Gy.2
-
12
-
-
0001976283
-
Approximation to bayes risk in repeated plays
-
M. Dresher, A. Tucker, and P. Wolfe, editors . Princeton University Press
-
J. Hannan. Approximation to bayes risk in repeated plays. In M. Dresher, A. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games, volume 3, pages 97-139. Princeton University Press, 1957.
-
(1957)
Contributions to the Theory of Games
, vol.3
, pp. 97-139
-
-
Hannan, J.1
-
13
-
-
0242684983
-
A simple adaptive procedure leading to correlated equilibrium
-
S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometria, 68(5):181-200, 2002.
-
(2002)
Econometria
, vol.68
, Issue.5
, pp. 181-200
-
-
Hart, S.1
Mas-Colell, A.2
-
15
-
-
33646515747
-
Defensive universal learning with experts
-
Singapore, Springer, Berlin
-
J. Poland and M. Hutter. Defensive universal learning with experts. In Proc. 16th International Conf. on Algorithmic Learning Theory, ALT 2005, pages 356-370, Singapore, 2005. Springer, Berlin.
-
(2005)
Proc. 16th International Conf. on Algorithmic Learning Theory, ALT 2005
, pp. 356-370
-
-
Poland, J.1
Hutter, M.2
|