-
1
-
-
84898972474
-
Contextual bandit learning under the realizability assumption
-
A. Agarwal, M. Dud́ik, S. Kale, J. Langford, and R. E. Schapire. Contextual bandit learning under the realizability assumption. In AISTATS, 2012.
-
(2012)
AISTATS
-
-
Agarwal, A.1
Dud́ik, M.2
Kale, S.3
Langford, J.4
Schapire, R.E.5
-
3
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002.
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
5
-
-
80053154335
-
Efficient optimal learning for contextual bandits
-
M. Dud́ik, D. Hsu, S. Kale, N. Karampatziakis, J. Langford, L. Reyzin, and T. Zhang. Efficient optimal learning for contextual bandits. In UAI, pages 169-178, 2011.
-
(2011)
UAI
, pp. 169-178
-
-
Dud́ik, M.1
Hsu, D.2
Kale, S.3
Karampatziakis, N.4
Langford, J.5
Reyzin, L.6
Zhang, T.7
-
6
-
-
77956543367
-
Web-scale bayesian click-through rate prediction for sponsored search advertising in microsoft's bing search engine
-
T. Graepel, J. Q. Candela, T. Borchert, and R. Herbrich. Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing search engine. In ICML, pages 13-20, 2010.
-
(2010)
ICML
, pp. 13-20
-
-
Graepel, T.1
Candela, J.Q.2
Borchert, T.3
Herbrich, R.4
-
7
-
-
78549244167
-
Solving two-armed bernoulli bandit problems using a bayesian learning automaton
-
O.-C. Granmo. Solving two-armed bernoulli bandit problems using a bayesian learning automaton. Int'l Journal of Intellient Computing and Cybernetics, 3(2):207-234, 2010.
-
(2010)
Int'l Journal of Intellient Computing and Cybernetics
, vol.3
, Issue.2
, pp. 207-234
-
-
Granmo, O.-C.1
-
8
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
10
-
-
84860647553
-
Simulation studies in optimistic bayesian sampling in contextual-bandit problems
-
Univ. of Bristol
-
B. C. May and D.S. Leslie. Simulation studies in optimistic Bayesian sampling in contextual-bandit problems. Technical Report 11:02, Dept. of Mathematics, Univ. of Bristol, 2011.
-
(2011)
Technical Report 11:02, Dept. of Mathematics
-
-
May, B.C.1
Leslie, D.S.2
-
11
-
-
84860620509
-
Optimistic bayesian sampling in contextual-bandit problems
-
Univ. of Bristol
-
B. C. May, N. Korda, A. Lee, and D.S. Leslie. Optimistic Bayesian sampling in contextual-bandit problems. Technical Report 11:01, Dept. of Mathematics, Univ. of Bristol, 2011.
-
(2011)
Technical Report 11:01, Dept. of Mathematics
-
-
May, B.C.1
Korda, N.2
Lee, A.3
Leslie, D.S.4
-
13
-
-
0001395850
-
On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
-
W. R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
-
(1933)
Biometrika
, vol.25
, Issue.3-4
, pp. 285-294
-
-
Thompson, W.R.1
|