-
1
-
-
0345224411
-
The continuum-armed bandit problem
-
Agrawal, R. The continuum-armed bandit problem. SIAM J. Control Optim., 33:1926-1951, 1995.
-
(1995)
SIAM J. Control Optim.
, vol.33
, pp. 1926-1951
-
-
Agrawal, R.1
-
2
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
Auer, P., Cesa-Bianchi, N., and Fischer, P. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, 2002a. (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
3
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Auer, P., Cesa-Bianchi, N., Freund, Y., and Schapire, R. E. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002b.
-
(2002)
SIAM J. Comput.
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
4
-
-
80053457129
-
Improved rates for the stochastic continuum-armed bandit problem
-
Auer, P., Ortner, R., and Szepesvári, C. Improved rates for the stochastic continuum-armed bandit problem. In Proc. COLT, 2007.
-
Proc. COLT, 2007
-
-
Auer, P.1
Ortner, R.2
Szepesvári, C.3
-
5
-
-
0037703398
-
Online oblivious routing
-
Bansal, N., Blum, A., Chawla, S., and Meyerson, A. Online oblivious routing. In Proc. of SPAA, 2003.
-
Proc. of SPAA, 2003
-
-
Bansal, N.1
Blum, A.2
Chawla, S.3
Meyerson, A.4
-
6
-
-
0037740018
-
Online learning in online auctions
-
Blum, A., Kumar, V., Rudra, A., and Wu, F. Online learning in online auctions. In Symp. on Discrete Alg., pp. 202-204, 2003.
-
(2003)
Symp. on Discrete Alg.
, pp. 202-204
-
-
Blum, A.1
Kumar, V.2
Rudra, A.3
Wu, F.4
-
7
-
-
77956276996
-
Online optimization in X-armed bandits
-
Bubeck, S., Munos, R., Stoltz, G., and Szepesvári, C. Online optimization in X-armed bandits. In Proc. NIPS, 2008.
-
Proc. NIPS, 2008
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
Szepesvári, C.4
-
9
-
-
84898078771
-
Fast and optimal prediction on a labeled tree
-
Cesa-Bianchi, N., Gentile, C., and Vitale, F. Fast and optimal prediction on a labeled tree. In Proc. COLT, 2009a.
-
Proc. COLT, 2009a
-
-
Cesa-Bianchi, N.1
Gentile, C.2
Vitale, F.3
-
10
-
-
80053457403
-
Learning unknown graphs
-
Cesa-Bianchi, N., Gentile, C., and Vitale, F. Learning unknown graphs. In Proceedings of Algorithmic Learning Theory, 2009b.
-
Proceedings of Algorithmic Learning Theory, 2009b
-
-
Cesa-Bianchi, N.1
Gentile, C.2
Vitale, F.3
-
11
-
-
67649577204
-
Regret and convergence bounds for a class of continuum-armed bandit problems
-
Cope, E. W. Regret and convergence bounds for a class of continuum-armed bandit problems. IEEE Trans. Automat. Control, 54(6):1243-1253, 2009.
-
(2009)
IEEE Trans. Automat. Control
, vol.54
, Issue.6
, pp. 1243-1253
-
-
Cope, E.W.1
-
12
-
-
9444277556
-
PAC bounds for multi-armed bandit and Markov decision processes
-
Even-Dar, E., Mannor, S., and Mansour, Y. PAC bounds for multi-armed bandit and Markov decision processes. In Proc. COLT, 2002.
-
Proc. COLT, 2002
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
13
-
-
80053443906
-
On upper-confidence bound policies for non-stationary bandit problems
-
Garivier, A. and Moulines, E. On upper-confidence bound policies for non-stationary bandit problems. In Proc. EWRL, 2008.
-
Proc. EWRL, 2008
-
-
Garivier, A.1
Moulines, E.2
-
14
-
-
70449882757
-
Multi-armed bandit, dynamic environments and meta-bandits
-
Hartland, C., Gelly, S., Baskiotis, N., Teytaud, O., and Sebag, M. Multi-armed bandit, dynamic environments and meta-bandits. In Workshops of NIPS, 2006.
-
Workshops of NIPS, 2006
-
-
Hartland, C.1
Gelly, S.2
Baskiotis, N.3
Teytaud, O.4
Sebag, M.5
-
15
-
-
84968496713
-
Sequential minimax search for a maximum
-
Kiefer, J. Sequential minimax search for a maximum. Proc. Amer. Math. Soc., 4(3):502-506, 1953.
-
(1953)
Proc. Amer. Math. Soc.
, vol.4
, Issue.3
, pp. 502-506
-
-
Kiefer, J.1
-
16
-
-
84898981061
-
Nearly tight bounds for the continuum-armed bandit problem
-
Kleinberg, R. Nearly tight bounds for the continuum-armed bandit problem. In Proc. NIPS, 2004.
-
Proc. NIPS, 2004
-
-
Kleinberg, R.1
-
17
-
-
79960116894
-
Multi-armed bandits in metric spaces
-
Kleinberg, R., Slivkins, A., and Upfal, E. Multi-armed bandits in metric spaces. In Proc. STOC, 2008.
-
Proc. STOC, 2008
-
-
Kleinberg, R.1
Slivkins, A.2
Upfal, E.3
-
18
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai, T. L. and Robbins, H. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
21
-
-
80053448127
-
Multi-armed bandit problems with dependent arms
-
Pandey, S., Chakrabarti, D., and Agarwal, D. Multi-armed bandit problems with dependent arms. In Proc. ICML, 2007.
-
Proc. ICML, 2007
-
-
Pandey, S.1
Chakrabarti, D.2
Agarwal, D.3
-
23
-
-
73849094380
-
Online ad auctions
-
Varian, H. R. Online ad auctions. American Economic Review, 99(2):430âǍŞ-434, 2009.
-
(2009)
American Economic Review
, vol.99
, Issue.2
, pp. 430-434
-
-
Varian, H.R.1
-
25
-
-
71149111013
-
Piecewise-stationary bandit problems with side observations
-
Yu, J. Y. and Mannor, S. Piecewise-stationary bandit problems with side observations. In Proc. ICML, 2009.
-
Proc. ICML, 2009
-
-
Yu, J.Y.1
Mannor, S.2
|