-
1
-
-
84898073179
-
Beating the adaptive bandit with high probability
-
J. Abernethy and A. Rakhlin. Beating the adaptive bandit with high probability. In COLT, 2009.
-
(2009)
COLT
-
-
Abernethy, J.1
Rakhlin, A.2
-
2
-
-
0024089489
-
Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost
-
R. Agrawal, M.V. Hegde, and D. Teneketzis. Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost. IEEE Transactions on Automatic Control, 33(10):899-906, 1988.
-
(1988)
IEEE Transactions on Automatic Control
, vol.33
, Issue.10
, pp. 899-906
-
-
Agrawal, R.1
Hegde, M.V.2
Teneketzis, D.3
-
4
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002.
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
6
-
-
79960128338
-
X-armed bandits
-
S. Bubeck, R. Munos, G. Stoltz, and C. Szepesvári. X-armed bandits. Journal of Machine Learning Research, 12:1655-1695, 2011.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 1655-1695
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
Szepesvári, C.4
-
9
-
-
33847624608
-
Improved second-order bounds for prediction with expert advice
-
N. Cesa-Bianchi, Y. Mansour, and G. Stoltz. Improved second-order bounds for prediction with expert advice. Machine Learning, 66(2-3):321-352, 2007.
-
(2007)
Machine Learning
, vol.66
, Issue.2-3
, pp. 321-352
-
-
Cesa-Bianchi, N.1
Mansour, Y.2
Stoltz, G.3
-
11
-
-
0031211090
-
A decision-theoretic generalization of on-line learning and an application to boosting
-
Y. Freund and R.E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139, 1997.
-
(1997)
Journal of Computer and System Sciences
, vol.55
, Issue.1
, pp. 119-139
-
-
Freund, Y.1
Schapire, R.E.2
-
13
-
-
10944236938
-
A survey on the bandit problem with switching costs
-
T. Jun. A survey on the bandit problem with switching costs. De Economist, 152:513-541, 2004.
-
(2004)
De Economist
, vol.152
, pp. 513-541
-
-
Jun, T.1
-
18
-
-
0036649565
-
Sequential strategies for loss functions with memory
-
N. Merhav, E. Ordentlich, G. Seroussi, and M.J. Weinberger. Sequential strategies for loss functions with memory. IEEE Transactions on Information Theory, 48(7):1947-1958, 2002.
-
(2002)
IEEE Transactions on Information Theory
, vol.48
, Issue.7
, pp. 1947-1958
-
-
Merhav, N.1
Ordentlich, E.2
Seroussi, G.3
Weinberger, M.J.4
-
20
-
-
77953539718
-
Online regret bounds for Markov decision processes with deterministic transitions
-
R. Ortner. Online regret bounds for Markov decision processes with deterministic transitions. Theoretical Computer Science, 411(29-30):2684-2695, 2010.
-
(2010)
Theoretical Computer Science
, vol.411
, Issue.29-30
, pp. 2684-2695
-
-
Ortner, R.1