-
1
-
-
84860610530
-
Optimal algorithms for online convex optimization with multi-point bandit feedback
-
Agarwal, Alekh, Dekel, Ofer, and Xiao, Lin. Optimal algorithms for online convex optimization with multi-point bandit feedback. In Conference on Learning Theory (COLT), 2010.
-
Conference on Learning Theory (COLT), 2010
-
-
Agarwal, A.1
Dekel, O.2
Xiao, L.3
-
2
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Auer, Peter, Cesa-Bianchi, Nicolò, Freund, Yoav, and Schapire, Robert. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002.
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.4
-
3
-
-
33745295134
-
Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
-
Even-Dar, Eyal, Mannor, Shie, and Mansour, Yishay. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research (JMLR), 7:1079-1105, 2006.
-
(2006)
Journal of Machine Learning Research (JMLR)
, vol.7
, pp. 1079-1105
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
4
-
-
0028516898
-
Computing with noisy information
-
Feige, Uriel, Raghavan, Prabhakar, Peleg, David, and Upfal, Eli. Computing with noisy information. SIAM Journal on Computing, 23(5):1001-1018, 1994.
-
(1994)
SIAM Journal on Computing
, vol.23
, Issue.5
, pp. 1001-1018
-
-
Feige, U.1
Raghavan, P.2
Peleg, D.3
Upfal, E.4
-
6
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai, T. L. and Robbins, Herbert. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
7
-
-
30044441333
-
The sample complexity of exploration in the multi-armed bandit problem
-
Mannor, Shie and Tsitsiklis, John N. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research (JMLR), 5: 623-648, 2004.
-
(2004)
Journal of Machine Learning Research (JMLR)
, vol.5
, pp. 623-648
-
-
Mannor, S.1
Tsitsiklis, J.N.2
-
9
-
-
67650085898
-
How does clickthrough data reflect retrieval quality?
-
Radlinski, Filip, Kurup, Madhu, and Joachims, Thorsten. How does clickthrough data reflect retrieval quality? In ACM Conference on Information and Knowledge Management (CIKM), 2008.
-
ACM Conference on Information and Knowledge Management (CIKM), 2008
-
-
Radlinski, F.1
Kurup, M.2
Joachims, T.3
-
11
-
-
84898077397
-
The k-armed dueling bandits problem
-
Yue, Yisong, Broder, Josef, Kleinberg, Robert, and Joachims, Thorsten. The k-armed dueling bandits problem. In Conference on Learning Theory (COLT), 2009.
-
Conference on Learning Theory (COLT), 2009
-
-
Yue, Y.1
Broder, J.2
Kleinberg, R.3
Joachims, T.4
|