SCOPUS Information Search Platform

IEEE Transactions on Automatic Control

Volume 50, Issue 3, 2005, Pages 338-355

Bandit problems with side observations

Authors (3): Wang, Chih Chun (a); Kulkarni, Sanjeev R (a); Poor, H Vincent (a)

(a) Princeton University (United States)

Author keywords

Adaptive; Allocation rule; Asymptotic; Efficient; Inferior sampling time; Side information; Two armed bandit

Indexed keywords

ALGORITHMS; ASYMPTOTIC STABILITY; DECISION THEORY; OPTIMAL CONTROL SYSTEMS; RANDOM PROCESSES; ALLOCATION RULE; INFERIOR SAMPLING TIME; SIDE INFORMATION; TWO ARMED BANDIT; ADAPTIVE CONTROL SYSTEMS

EID: 15844389867 PISSN: 00189286 EISSN: None Source Type: Journal
DOI: 10.1109/TAC.2005.844079 Document Type: Article

Times cited: 122

References (24)

1
- 84966203785
- "Some aspects of the sequential design of experiments"
- H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol. 58, pp. 527-535, 1952.
- (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-535
- Robbins, H.¹

2
- 0035207353
- "Learning while searching for the best alternative"
- K. Adam, "Learning while searching for the best alternative," J. Econ. Theory, vol. 101, pp. 252-280, 2001.
- (2001) J. Econ. Theory , vol.101 , pp. 252-280
- Adam, K.¹

3
- 0001400331
- "A Bernoulli two-armed bandit"
- Jun
- D. A. Berry, "A Bernoulli two-armed bandit," Ann. Math. Stat., vol. 43, no. 3, pp. 871-897, Jun. 1972.
- (1972) Ann. Math. Stat. , vol.43 , Issue.3 , pp. 871-897
- Berry, D.A.¹

4
- 0003758390
- Philadelphia, PA: SIAM
- H. Chernoff, Sequential Analysis and Optimal Design. Philadelphia, PA: SIAM, 1972.
- (1972) Sequential Analysis and Optimal Design
- Chernoff, H.¹

5
- 0004268444
- New York: Marcel Dekker
- B. Ghosh and P. K. Sen, Handbook of Sequential Analysis. New York: Marcel Dekker, 1991.
- (1991) Handbook of Sequential Analysis
- Ghosh, B.¹ Sen, P.K.²

6
- 0000169010
- "Bandit processes and dynamic allocation indices"
- J. C. Gittins, "Bandit processes and dynamic allocation indices," J. Royal Stat. Soc. B, vol. 41, no. 2, pp. 148-177, 1979.
- (1979) J. Royal Stat. Soc. B , vol.41 , Issue.2 , pp. 148-177
- Gittins, J.C.¹

7
- 0018709825
- "A dynamic allocation index for the discounted multiarmed bandit problem"
- Dec
- J. C. Gittins, "A dynamic allocation index for the discounted multiarmed bandit problem," Biometrika, vol. 66, no. 3, pp. 561-565, Dec. 1979.
- (1979) Biometrika , vol.66 , Issue.3 , pp. 561-565
- Gittins, J.C.¹

8
- 0001732282
- "Asymptotically optimal allocation of treatments in sequential experiments"
- T. J. Santner and A. C. Tamhane, New York: Marcel Dekker
- T. L. Lai and H. Robbins, "Asymptotically optimal allocation of treatments in sequential experiments," in Design of Experiments: Ranking and Selection, T. J. Santner and A. C. Tamhane, Eds. New York: Marcel Dekker, 1984.
- (1984) Design of Experiments: Ranking and Selection
- Lai, T.L.¹ Robbins, H.²

9
- 0002899547
- "Asymptotically efficient allocation rules"
- T. L. Lai and H. Robbins, "Asymptotically efficient allocation rules," Adv. Appl. Math., vol. 6, no. 1, pp. 4-22, 1985.
- (1985) Adv. Appl. Math. , vol.6 , Issue.1 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

10
- 0029344133
- "Machine learning and nonparametric bandit theory"
- Jul
- T. L. Lai and S. Yakowitz, "Machine learning and nonparametric bandit theory," IEEE Trans. Autom. Control, vol. 40, no. 7, pp. 1199-1209, Jul. 1995.
- (1995) IEEE Trans. Autom. Control , vol.40 , Issue.7 , pp. 1199-1209
- Lai, T.L.¹ Yakowitz, S.²

11
- 0024089489
- "Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost"
- Oct
- R. Agrawal, M. V. Hegde, and D. Teneketzis, "Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost," IEEE Trans. Autom. Control, vol. 33, no. 10, pp. 899-906, Oct. 1988.
- (1988) IEEE Trans. Autom. Control , vol.33 , Issue.10 , pp. 899-906
- Agrawal, R.¹ Hegde, M.V.² Teneketzis, D.³

12
- 0024626787
- "Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: Finite parameter space"
- Mar
- R. Agrawal, D. Teneketzis, and V. Anantharam, "Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: Finite parameter space," IEEE Trans. Autom. Control, vol. 34, no. 3, pp. 258-267, Mar. 1989.
- (1989) IEEE Trans. Autom. Control , vol.34 , Issue.3 , pp. 258-267
- Agrawal, R.¹ Teneketzis, D.² Anantharam, V.³

13
- 0024886640
- "Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space"
- Dec
- R. Agrawal, D. Teneketzis, and V. Anantharam, "Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space," IEEE Trans. Autom. Control, vol. 34, no. 12, pp. 1249-1259, Dec. 1989.
- (1989) IEEE Trans. Autom. Control , vol.34 , Issue.12 , pp. 1249-1259
- Agrawal, R.¹ Teneketzis, D.² Anantharam, V.³

14
- 0023453059
- "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. rewards"
- Nov
- V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. rewards," IEEE Trans. Autom. Control, vol. AC-32, no. 11, pp. 968-976, Nov. 1987.
- (1987) IEEE Trans. Autom. Control , vol.AC-32 , Issue.11 , pp. 968-976
- Anantharam, V.¹ Varaiya, P.² Walrand, J.³

15
- 0023450663
- "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part II: Markovian rewards"
- Nov
- V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part II: Markovian rewards," IEEE Trans. Autom. Control, vol. AC-32, no. 11, pp. 977-982, Nov. 1987.
- (1987) IEEE Trans. Autom. Control , vol.AC-32 , Issue.11 , pp. 977-982
- Anantharam, V.¹ Varaiya, P.² Walrand, J.³

16
- 0029047314
- "Sequential choice from several populations"
- Sep
- M. N. Katehakis and H. Robbins, "Sequential choice from several populations," in Proc. Nat. Acad. Sci., vol. 92, Sep. 1995, pp. 8584-8585.
- (1995) Proc. Nat. Acad. Sci. , vol.92 , pp. 8584-8585
- Katehakis, M.N.¹ Robbins, H.²

17
- 0034171759
- "Finite-time lower bounds for the two-armed bandit problem"
- Apr
- S. R. Kulkarni and G. Lugosi, "Finite-time lower bounds for the two-armed bandit problem," IEEE Trans. Autom. Control, vol. 45, no. 4, pp. 711-714, Apr. 2000.
- (2000) IEEE Trans. Autom. Control , vol.45 , Issue.4 , pp. 711-714
- Kulkarni, S.R.¹ Lugosi, G.²

18
- 0013218879
- "Covariate models for Bernoulli bandits"
- M. K. Clayton, "Covariate models for Bernoulli bandits," Seq. Anal., vol. 8, no. 4, pp. 405-426, 1989.
- (1989) Seq. Anal. , vol.8 , Issue.4 , pp. 405-426
- Clayton, M.K.¹

19
- 0242460275
- "On bandit problems with side observations and learnability"
- Sep
- S. R. Kulkarni, "On bandit problems with side observations and learnability," in Proc. 31st Allerton Conf. Communications, Control, Computing, Sep. 1993, pp. 83-92.
- (1993) Proc. 31st Allerton Conf. Communications, Control, Computing , pp. 83-92
- Kulkarni, S.R.¹

20
- 0000017483
- "One-armed bandit problems with covariates"
- J. Sarkar, "One-armed bandit problems with covariates," Ann. Statist., vol. 19, no. 4, pp. 1978-2002, 1991.
- (1991) Ann. Statist. , vol.19 , Issue.4 , pp. 1978-2002
- Sarkar, J.¹

21
- 0001631327
- "A one-armed bandit problem with a concomitant variable"
- Dec
- M. Woodroofe, "A one-armed bandit problem with a concomitant variable," J. Amer. Stat. Assoc., vol. 74, no. 368, pp. 799-806, Dec. 1979.
- (1979) J. Amer. Stat. Assoc. , vol.74 , Issue.368 , pp. 799-806
- Woodroofe, M.¹

22
- 0242628745
- "Optimal allocations in sequential tests involving two populations with covariates"
- T. Zoubeidi, "Optimal allocations in sequential tests involving two populations with covariates," Commun. Statist.: Theory Meth., vol. 23, no. 4, pp. 1215-1225, 1994.
- (1994) Commun. Statist.: Theory Meth. , vol.23 , Issue.4 , pp. 1215-1225
- Zoubeidi, T.¹

23
- 0004102205
- New York: Wiley
- J. A. Bucklew, Large Deviation Techniques in Decision, Simulation, and Estimation. New York: Wiley, 1990.
- (1990) Large Deviation Techniques in Decision, Simulation, and Estimation
- Bucklew, J.A.¹

24
- 0003836047
- New York, NY: Springer-Verlag
- A. Dembo and O. Zeitouni, Large Deviation Techniques and Applications. New York, NY: Springer-Verlag, 1998.
- (1998) Large Deviation Techniques and Applications
- Dembo, A.¹ Zeitouni, O.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.