-
1
-
-
84966203785
-
"Some aspects of the sequential design of experiments"
-
H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol. 58, pp. 527-535, 1952.
-
(1952)
Bull. Amer. Math. Soc.
, vol.58
, pp. 527-535
-
-
Robbins, H.1
-
2
-
-
0035207353
-
"Learning while searching for the best alternative"
-
K. Adam, "Learning while searching for the best alternative," J. Econ. Theory, vol. 101, pp. 252-280, 2001.
-
(2001)
J. Econ. Theory
, vol.101
, pp. 252-280
-
-
Adam, K.1
-
3
-
-
0001400331
-
"A Bernoulli two-armed bandit"
-
Jun
-
D. A. Berry, "A Bernoulli two-armed bandit," Ann. Math. Stat., vol. 43, no. 3, pp. 871-897, Jun. 1972.
-
(1972)
Ann. Math. Stat.
, vol.43
, Issue.3
, pp. 871-897
-
-
Berry, D.A.1
-
6
-
-
0000169010
-
"Bandit processes and dynamic allocation indices"
-
J. C. Gittins, "Bandit processes and dynamic allocation indices," J. Royal Stat. Soc. B, vol. 41, no. 2, pp. 148-177, 1979.
-
(1979)
J. Royal Stat. Soc. B
, vol.41
, Issue.2
, pp. 148-177
-
-
Gittins, J.C.1
-
7
-
-
0018709825
-
"A dynamic allocation index for the discounted multiarmed bandit problem"
-
Dec
-
J. C. Gittins, "A dynamic allocation index for the discounted multiarmed bandit problem," Biometrika, vol. 66, no. 3, pp. 561-565, Dec. 1979.
-
(1979)
Biometrika
, vol.66
, Issue.3
, pp. 561-565
-
-
Gittins, J.C.1
-
8
-
-
0001732282
-
"Asymptotically optimal allocation of treatments in sequential experiments"
-
T. J. Santner and A. C. Tamhane, New York: MarcelDekker
-
T. L. Lai and H. Robbins, "Asymptotically optimal allocation of treatments in sequential experiments," in Design of Experiments: Ranking and Selection, T. J. Santner and A. C. Tamhane, Eds. New York: MarcelDekker, 1984.
-
(1984)
Design of Experiments: Ranking and Selection
-
-
Lai, T.L.1
Robbins, H.2
-
9
-
-
0002899547
-
"Asymptotically efficient allocation rules"
-
T. L. Lai and H. Robbins, "Asymptotically efficient allocation rules," Adv. Appl. Math., vol. 6, no. 1, pp. 4-22, 1985.
-
(1985)
Adv. Appl. Math.
, vol.6
, Issue.1
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
10
-
-
0029344133
-
"Machine learning and nonparametric bandit theory"
-
Jul
-
T. L. Lai and S. Yakowitz, "Machine learning and nonparametric bandit theory," IEEE Trans. Autom. Control, vol. 40, no. 7, pp. 1199-1209, Jul. 1995.
-
(1995)
IEEE Trans. Autom. Control
, vol.40
, Issue.7
, pp. 1199-1209
-
-
Lai, T.L.1
Yakowitz, S.2
-
11
-
-
0024089489
-
"Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost"
-
Oct
-
R. Agrawal, M. V. Hegde, and D. Teneketzis, "Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost," IEEE Trans. Autom. Control, vol. 33, no. 10, pp. 899-906, Oct. 1988.
-
(1988)
IEEE Trans. Autom. Control
, vol.33
, Issue.10
, pp. 899-906
-
-
Agrawal, R.1
Hegde, M.V.2
Teneketzis, D.3
-
12
-
-
0024626787
-
"Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: Finite parameter space"
-
Mar
-
R. Agrawal, D. Teneketzis, and V. Anantharam, "Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: Finite parameter space," IEEE Trans. Autom. Control, vol. 34, no. 3, pp. 258-267, Mar. 1989.
-
(1989)
IEEE Trans. Autom. Control
, vol.34
, Issue.3
, pp. 258-267
-
-
Agrawal, R.1
Teneketzis, D.2
Anantharam, V.3
-
13
-
-
0024886640
-
"Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space"
-
Dec
-
R. Agrawal, D. Teneketzis, and V. Anantharam, "Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space," IEEE Trans. Autom. Control, vol. 34, no. 12, pp. 1249-1259, Dec. 1989.
-
(1989)
IEEE Trans. Autom. Control
, vol.34
, Issue.12
, pp. 1249-1259
-
-
Agrawal, R.1
Teneketzis, D.2
Anantharam, V.3
-
14
-
-
0023453059
-
"Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. rewards"
-
Nov
-
V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. rewards," IEEE Trans. Autom. Control, vol. AC-32, no. 11, pp. 968-976, Nov. 1987.
-
(1987)
IEEE Trans. Autom. Control
, vol.AC-32
, Issue.11
, pp. 968-976
-
-
Anantharam, V.1
Varaiya, P.2
Walrand, J.3
-
15
-
-
0023450663
-
"Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part II: Markovian rewards"
-
Nov
-
V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part II: Markovian rewards," IEEE Trans. Autom. Control, vol. AC-32, no. 11, pp. 977-982, Nov. 1987.
-
(1987)
IEEE Trans. Autom. Control
, vol.AC-32
, Issue.11
, pp. 977-982
-
-
Anantharam, V.1
Varaiya, P.2
Walrand, J.3
-
16
-
-
0029047314
-
"Sequential choice from several populations"
-
Sep
-
M. N. Katehakis and H. Robbins, "Sequential choice from several populations," in Proc. Nat. Acad. Sci., vol. 92, Sep. 1995, pp. 8584-8585.
-
(1995)
Proc. Nat. Acad. Sci.
, vol.92
, pp. 8584-8585
-
-
Katehakis, M.N.1
Robbins, H.2
-
17
-
-
0034171759
-
"Finite-time lower bounds for the two-armed bandit problem"
-
Apr
-
S. R. Kulkarni and G. Lugosi, "Finite-time lower bounds for the two-armed bandit problem," IEEE Trans. Autom. Control, vol. 45, no. 4, pp. 711-714, Apr. 2000.
-
(2000)
IEEE Trans. Autom. Control
, vol.45
, Issue.4
, pp. 711-714
-
-
Kulkarni, S.R.1
Lugosi, G.2
-
18
-
-
0013218879
-
"Covariate models for Bernoulli bandits"
-
M. K. Clayton, "Covariate models for Bernoulli bandits," Seq. Anal. vol. 8, no. 4, pp. 405-426, 1989.
-
(1989)
Seq. Anal.
, vol.8
, Issue.4
, pp. 405-426
-
-
Clayton, M.K.1
-
19
-
-
0242460275
-
"On bandit problems with side observations and learn-ability"
-
Sep
-
S. R. Kulkarni, "On bandit problems with side observations and learn-ability," in Proc. 31st Allerton Conf. Communications, Control, Computing, Sep. 1993, pp. 83-92.
-
(1993)
Proc. 31st Allerton Conf. Communications, Control, Computing
, pp. 83-92
-
-
Kulkarni, S.R.1
-
20
-
-
0000017483
-
"One-armed bandit problems with covariates"
-
J. Sarkar, "One-armed bandit problems with covariates," Ann. Statist., vol. 19, no. 4, pp. 1978-2002, 1991.
-
(1991)
Ann. Statist.
, vol.19
, Issue.4
, pp. 1978-2002
-
-
Sarkar, J.1
-
21
-
-
0001631327
-
"A one-armed bandit problem with a concomitant variable"
-
Dec
-
M. Woodroofe, "A one-armed bandit problem with a concomitant variable," J. Amer. Stat. Assoc., vol. 74, no. 368, pp. 799-806, Dec. 1979.
-
(1979)
J. Amer. Stat. Assoc.
, vol.74
, Issue.368
, pp. 799-806
-
-
Woodroofe, M.1
-
22
-
-
0242628745
-
"Optimal allocations in sequential tests involving two populations with covariates"
-
T. Zoubeidi, "Optimal allocations in sequential tests involving two populations with covariates," Commun. Statist.: Theory Meth., vol. 23, no. 4, pp. 1215-1225, 1994.
-
(1994)
Commun. Statist.: Theory Meth.
, vol.23
, Issue.4
, pp. 1215-1225
-
-
Zoubeidi, T.1
|