-
1
-
-
0029513526
-
Gambling in a rigged casino: The adversarial multi-armed bandit problem
-
IEEE Computer Society Press, Los Alamitos, CA
-
AUER, P., CESA-BIANCHI, N., FREUND, Y. and SCHAPIRE, R. E. (1995). Gambling in a rigged casino: the adversarial multi-armed bandit problem. In 36th Annual Symposium on Foundations of Computer Science 322-331. IEEE Computer Society Press, Los Alamitos, CA.
-
(1995)
36th Annual Symposium on Foundations of Computer Science
, pp. 322-331
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
2
-
-
0031534756
-
Bandit problems with infinitely many arms
-
BERRY, D. A., CHEN, R. W., ZAME, A., HEATH, D. C. and SHEPP, L. A. (1997). Bandit problems with infinitely many arms. Ann. Statist. 25 2103-2116.
-
(1997)
Ann. Statist.
, vol.25
, pp. 2103-2116
-
-
Berry, D.A.1
Chen, R.W.2
Zame, A.3
Heath, D.C.4
Shepp, L.A.5
-
4
-
-
0000492892
-
Minimum contrast estimators on sieves: Exponential bounds and rates of convergence
-
BIRGÉ, L. and MASSART, P. (1998). Minimum contrast estimators on sieves: exponential bounds and rates of convergence. Bernoulli 4 329-375.
-
(1998)
Bernoulli
, vol.4
, pp. 329-375
-
-
Birgé, L.1
Massart, P.2
-
5
-
-
0013218879
-
Covariate models for Bernoulli bandits
-
CLAYTON, M. K. (1989). Covariate models for Bernoulli bandits. Sequential Anal. 8 405-426.
-
(1989)
Sequential Anal.
, vol.8
, pp. 405-426
-
-
Clayton, M.K.1
-
7
-
-
21844511932
-
On the strong universal consistency of nearest neighbor regression function estimates
-
DEVROYE, L., GYÖRFI, L., KRZYZAK, A. and LUGOSI, G. (1994). On the strong universal consistency of nearest neighbor regression function estimates. Ann. Statist. 22 1371-1385.
-
(1994)
Ann. Statist.
, vol.22
, pp. 1371-1385
-
-
Devroye, L.1
Györfi, L.2
Krzyzak, A.3
Lugosi, G.4
-
11
-
-
0043114226
-
Rational learning: Finding a balance between utility and efficiency
-
Springer, New York
-
GRATCH, J., DEJONG, G. and YANG, Y. (1994). Rational learning: finding a balance between utility and efficiency. Selecting Models from Data: Artificial Intelligence and Statistics. Lecture Notes in Statist. 89 11-20. Springer, New York.
-
(1994)
Selecting Models from Data: Artificial Intelligence and Statistics. Lecture Notes in Statist.
, vol.89
, pp. 11-20
-
-
Gratch, J.1
DeJong, G.2
Yang, Y.3
-
12
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
LAI, T. L. and ROBBINS, H. (1985). Asymptotically efficient adaptive allocation rules. Adv. in Appl. Math. 6 4-22.
-
(1985)
Adv. in Appl. Math.
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
13
-
-
0029344133
-
Machine learning and nonparametric bandit theory
-
LAI, T. L. and YAKOWITZ, S. (1995). Machine learning and nonparametric bandit theory. IEEE Trans. Automat. Control 40 1199-1209.
-
(1995)
IEEE Trans. Automat. Control
, vol.40
, pp. 1199-1209
-
-
Lai, T.L.1
Yakowitz, S.2
-
14
-
-
0030489341
-
Histogram regression estimation using data-dependent partitions
-
NOBEL, A. (1996). Histogram regression estimation using data-dependent partitions. Ann. Statist. 24 1084-1105.
-
(1996)
Ann. Statist.
, vol.24
, pp. 1084-1105
-
-
Nobel, A.1
-
16
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
ROBBINS, H. (1952). Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc. 58 527-535.
-
(1952)
Bull. Amer. Math. Soc.
, vol.58
, pp. 527-535
-
-
Robbins, H.1
-
17
-
-
0000017483
-
One-armed bandit problems with covariates
-
SARKAR, J. (1991). One-armed bandit problems with covariates. Ann. Statist. 19 1978-2002.
-
(1991)
Ann. Statist.
, vol.19
, pp. 1978-2002
-
-
Sarkar, J.1
-
18
-
-
0000388992
-
Consistent nonparametric regression
-
STONE, C. J. (1977). Consistent nonparametric regression. Ann. Statist. 5 595-620.
-
(1977)
Ann. Statist.
, vol.5
, pp. 595-620
-
-
Stone, C.J.1
-
20
-
-
0001631327
-
A one-armed bandit problem with a concomitant variable
-
WOODROOFE, M. (1979). A one-armed bandit problem with a concomitant variable. J. Amer. Statist. Assoc. 74 799-806.
-
(1979)
J. Amer. Statist. Assoc.
, vol.74
, pp. 799-806
-
-
Woodroofe, M.1