-
1
-
-
0029513526
-
Gambling in a rigged casino: The adversarial multi-armed bandit problem
-
Los Alamitos, CA, IEEE Computer Society Press
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. Gambling in a rigged casino: The adversarial multi-armed bandit problem. In Proc. 36th Annual Symposium on Foundations of Computer Science (FOCS 1995), pages 322-331, Los Alamitos, CA, 1995. IEEE Computer Society Press.
-
(1995)
Proc. 36th Annual Symposium on Foundations of Computer Science (FOCS 1995)
, pp. 322-331
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
3
-
-
0344412405
-
Adaptive and self-confident on-line learning algorithms
-
Morgan Kaufmann, San Francisco
-
P. Auer and C. Gentile. Adaptive and self-confident on-line learning algorithms. In. Proc. 13th Conference on Computational Learning Theory, pages 107-117. Morgan Kaufmann, San Francisco, 2000.
-
(2000)
Proc. 13th Conference on Computational Learning Theory
, pp. 107-117
-
-
Auer, P.1
Gentile, C.2
-
4
-
-
0031140246
-
How to use expert advice
-
N. Cesa-Bianchi, Y. Freund, D. Haussler, D. Helmbold, R. Schapire, and M. K. Warmuth. How to use expert advice. Journal of the ACM, 44(3):427-485, 1997.
-
(1997)
Journal of the ACM
, vol.44
, Issue.3
, pp. 427-485
-
-
Cesa-Bianchi, N.1
Freund, Y.2
Haussler, D.3
Helmbold, D.4
Schapire, R.5
Warmuth, M.K.6
-
5
-
-
0031211090
-
A decision-theoretic generalization of on-line learning and an application to boosting
-
Y. Freund and R. E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139, 1997.
-
(1997)
Journal of Computer and System Sciences
, vol.55
, Issue.1
, pp. 119-139
-
-
Freund, Y.1
Schapire, R.E.2
-
6
-
-
0344875562
-
The robustness of the p-norm algorithm
-
C. Gentile. The robustness of the p-norm algorithm. Machine Learning, 53(3):265-299, 2003.
-
(2003)
Machine Learning
, vol.53
, Issue.3
, pp. 265-299
-
-
Gentile, C.1
-
7
-
-
0001976283
-
Approximation to Bayes risk in repeated plays
-
M. Dresher, A. W. Tucker, and P. Wolfe, editors, Princeton University Press
-
J. Hannan. Approximation to Bayes risk in repeated plays. In M. Dresher, A. W. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games 3, pages 97-139. Princeton University Press, 1957.
-
(1957)
Contributions to the Theory of Games
, vol.3
, pp. 97-139
-
-
Hannan, J.1
-
8
-
-
0043268070
-
Convergence and loss bounds for Bayesian sequence prediction
-
M. Hutter. Convergence and loss bounds for Bayesian sequence prediction. IEEE Trans. on Information Theory, 49(8):2061-2067, 2003a. URL http://arxiv.org/abs/cs.LG/0301014.
-
(2003)
IEEE Trans. on Information Theory
, vol.49
, Issue.8
, pp. 2061-2067
-
-
Hutter, M.1
-
9
-
-
4644374039
-
Optimality of universal Bayesian prediction for general loss and alphabet
-
M. Hutter. Optimality of universal Bayesian prediction for general loss and alphabet. Journal of Machine Learning Research, 4:971-1000, 2003b. URL http://arxiv.org/abs/cs.LG/0311014.
-
(2003)
Journal of Machine Learning Research
, vol.4
, pp. 971-1000
-
-
Hutter, M.1
-
10
-
-
21844431776
-
Online prediction - Bayes versus experts
-
July. Presented at the EU PASCAL Workshop on Learning Theoretic and Bayesian Inductive Principles (LTBIP-2004)
-
M. Hutter. Online prediction - Bayes versus experts. Technical report, July 2004a. URL http://www.idsia.ch/~marcus/ai/bayespea.htm. Presented at the EU PASCAL Workshop on Learning Theoretic and Bayesian Inductive Principles (LTBIP-2004).
-
(2004)
Technical Report
-
-
Hutter, M.1
-
12
-
-
21844436185
-
Prediction with expert advice by following the perturbed leader for general weights
-
Proc. 15th International Conf. on Algorithmic Learning Theory (ALT-2004), Padova, Springer, Berlin
-
M. Hutter and J. Poland. Prediction with expert advice by following the perturbed leader for general weights. In Proc. 15th International Conf. on Algorithmic Learning Theory (ALT-2004), volume 3244 of LNAI, pages 279-293, Padova, 2004. Springer, Berlin. URLhttp://arxiv.org/abs/cs.LG/0405043.
-
(2004)
LNAI
, vol.3244
, pp. 279-293
-
-
Hutter, M.1
Poland, J.2
-
13
-
-
10444233821
-
Efficient algorithms for online decision
-
Lecture Notes in Artificial Intelligence, Berlin, Springer
-
A. Kalai and S. Vempala. Efficient algorithms for online decision. In Proc. 16th Annual Conference on Learning Theory (COLT-2003), Lecture Notes in Artificial Intelligence, pages 506-521, Berlin, 2003. Springer.
-
(2003)
Proc. 16th Annual Conference on Learning Theory (COLT-2003)
, pp. 506-521
-
-
Kalai, A.1
Vempala, S.2
-
16
-
-
0001035413
-
On the method of bounded differences
-
London Mathematical Society Lecture Notes Series
-
C. McDiarmid. On the method of bounded differences. Surveys in Combinatorics, 141, London Mathematical Society Lecture Notes Series: 148-188, 1989.
-
(1989)
Surveys in Combinatorics
, vol.141
, pp. 148-188
-
-
McDiarmid, C.1
-
17
-
-
9444257628
-
Online geometric optimization in the bandit setting against an adaptive adversary
-
17th Annual Conference on Learning Theory (COLT), Springer
-
H. B. McMahan and A. Blum. Online geometric optimization in the bandit setting against an adaptive adversary. In 17th Annual Conference on Learning Theory (COLT), volume 3120 of LNCS, pages 109-123. Springer, 2004.
-
(2004)
LNCS
, vol.3120
, pp. 109-123
-
-
McMahan, H.B.1
Blum, A.2
-
21
-
-
3543088062
-
How to better use expert advice
-
R. Yaroshinsky, R. El-Yaniv, and S. Seiden. How to better use expert advice. Machine Learning, 55 (3):271-309, 2004.
-
(2004)
Machine Learning
, vol.55
, Issue.3
, pp. 271-309
-
-
Yaroshinsky, R.1
El-Yaniv, R.2
Seiden, S.3
|