-
1
-
-
0000854435
-
Adaptive treatment allocation and the multi-armed bandit problem
-
T. L. Lai, "Adaptive treatment allocation and the multi-armed bandit problem," Ann. Stat., vol. 15, no. 3, pp. 1091-1114, 1987.
-
(1987)
Ann. Stat
, vol.15
, Issue.3
, pp. 1091-1114
-
-
Lai, T.L.1
-
2
-
-
0002955623
-
A dynamic allocation index for the sequential design of experiments
-
J. Gani, Ed. Amsterdam: North-Holland
-
J. Gittins and D. M. Jones, "A dynamic allocation index for the sequential design of experiments," in Progress in Statistics, J. Gani, Ed. Amsterdam: North-Holland, 1974, pp. 241-266.
-
(1974)
Progress in Statistics
, pp. 241-266
-
-
Gittins, J.1
Jones, D.M.2
-
3
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T. L. Lai and H. Robbins, "Asymptotically efficient adaptive allocation rules," Adv. Appl. Math., vol. 6, pp. 4-22, 1985.
-
(1985)
Adv. Appl. Math
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
4
-
-
0001395850
-
On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
-
W. R. Thompson, "On the likelihood that one unknown probability exceeds another in view of the evidence of two samples," Biometrika, vol. 25, pp. 285-294, 1933.
-
(1933)
Biometrika
, vol.25
, pp. 285-294
-
-
Thompson, W.R.1
-
5
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol. 58, pp. 527-535, 1952.
-
(1952)
Bull. Amer. Math. Soc
, vol.58
, pp. 527-535
-
-
Robbins, H.1
-
7
-
-
0001492860
-
Contributions to the "two-armed bandit" problem
-
D. Feldman, "Contributions to the "two-armed bandit" problem," Ann. Math. Stat., vol. 33, pp. 847-856, 1962.
-
(1962)
Ann. Math. Stat
, vol.33
, pp. 847-856
-
-
Feldman, D.1
-
8
-
-
0010948196
-
Further contributions to the "two-armed bandit" problem
-
R. Keener, "Further contributions to the "two-armed bandit" problem," Ann. Stat., vol. 13, no. 1, pp. 418-422, 1985.
-
(1985)
Ann. Stat
, vol.13
, Issue.1
, pp. 418-422
-
-
Keener, R.1
-
10
-
-
0000532482
-
Response surface bandits
-
J. Ginebra and M. K. Clayton, "Response surface bandits," J. Roy. Stat. Soc. B, vol. 57, no. 4, pp. 771-784, 1995.
-
(1995)
J. Roy. Stat. Soc. B
, vol.57
, Issue.4
, pp. 771-784
-
-
Ginebra, J.1
Clayton, M.K.2
-
14
-
-
0009943101
-
Incomplete learning from endogenous data in dynamic allocation
-
M. Brezzi and T. L. Lai, "Incomplete learning from endogenous data in dynamic allocation," Econometrica, vol. 68, no. 6, pp. 1511-1516, 2000.
-
(2000)
Econometrica
, vol.68
, Issue.6
, pp. 1511-1516
-
-
Brezzi, M.1
Lai, T.L.2
-
15
-
-
0000024577
-
On dynamic programming with unbounded rewards
-
S. A. Lippman, "On dynamic programming with unbounded rewards," Management Sci., vol. 21, no. 11, pp. 1225-1233, 1975.
-
(1975)
Management Sci
, vol.21
, Issue.11
, pp. 1225-1233
-
-
Lippman, S.A.1
|