Volume 50, Issue 3, 2005, Pages 338-355

Bandit problems with side observations

Author keywords

Adaptive; Allocation rule; Asymptotic; Efficient; Inferior sampling time; Side information; Two armed bandit

Indexed keywords

ALGORITHMS; ASYMPTOTIC STABILITY; DECISION THEORY; OPTIMAL CONTROL SYSTEMS; RANDOM PROCESSES

EID: 15844389867     PISSN: 00189286     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAC.2005.844079     Document Type: Article
Times cited : (120)

References (24)
  • 1
    • 84966203785
    • "Some aspects of the sequential design of experiments"
    • H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol. 58, pp. 527-535, 1952.
    • (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-535
    • Robbins, H.1
  • 2
    • 0035207353
    • "Learning while searching for the best alternative"
    • K. Adam, "Learning while searching for the best alternative," J. Econ. Theory, vol. 101, pp. 252-280, 2001.
    • (2001) J. Econ. Theory , vol.101 , pp. 252-280
    • Adam, K.1
  • 3
    • 0001400331
    • "A Bernoulli two-armed bandit"
    • D. A. Berry, "A Bernoulli two-armed bandit," Ann. Math. Stat., vol. 43, no. 3, pp. 871-897, Jun. 1972.
    • (1972) Ann. Math. Stat. , vol.43 , Issue.3 , pp. 871-897
    • Berry, D.A.1
  • 6
    • 0000169010
    • "Bandit processes and dynamic allocation indices"
    • J. C. Gittins, "Bandit processes and dynamic allocation indices," J. Royal Stat. Soc. B, vol. 41, no. 2, pp. 148-177, 1979.
    • (1979) J. Royal Stat. Soc. B , vol.41 , Issue.2 , pp. 148-177
    • Gittins, J.C.1
  • 7
    • 0018709825
    • "A dynamic allocation index for the discounted multiarmed bandit problem"
    • J. C. Gittins, "A dynamic allocation index for the discounted multiarmed bandit problem," Biometrika, vol. 66, no. 3, pp. 561-565, Dec. 1979.
    • (1979) Biometrika , vol.66 , Issue.3 , pp. 561-565
    • Gittins, J.C.1
  • 8
    • 0001732282
    • "Asymptotically optimal allocation of treatments in sequential experiments"
    • T. J. Santner and A. C. Tamhane, New York: Marcel Dekker
    • T. L. Lai and H. Robbins, "Asymptotically optimal allocation of treatments in sequential experiments," in Design of Experiments: Ranking and Selection, T. J. Santner and A. C. Tamhane, Eds. New York: Marcel Dekker, 1984.
    • (1984) Design of Experiments: Ranking and Selection
    • Lai, T.L.1    Robbins, H.2
  • 9
    • 0002899547
    • "Asymptotically efficient allocation rules"
    • T. L. Lai and H. Robbins, "Asymptotically efficient allocation rules," Adv. Appl. Math., vol. 6, no. 1, pp. 4-22, 1985.
    • (1985) Adv. Appl. Math. , vol.6 , Issue.1 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 10
    • 0029344133
    • "Machine learning and nonparametric bandit theory"
    • T. L. Lai and S. Yakowitz, "Machine learning and nonparametric bandit theory," IEEE Trans. Autom. Control, vol. 40, no. 7, pp. 1199-1209, Jul. 1995.
    • (1995) IEEE Trans. Autom. Control , vol.40 , Issue.7 , pp. 1199-1209
    • Lai, T.L.1    Yakowitz, S.2
  • 11
    • 0024089489
    • "Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost"
    • R. Agrawal, M. V. Hegde, and D. Teneketzis, "Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost," IEEE Trans. Autom. Control, vol. 33, no. 10, pp. 899-906, Oct. 1988.
    • (1988) IEEE Trans. Autom. Control , vol.33 , Issue.10 , pp. 899-906
    • Agrawal, R.1    Hegde, M.V.2    Teneketzis, D.3
  • 12
    • 0024626787
    • "Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: Finite parameter space"
    • R. Agrawal, D. Teneketzis, and V. Anantharam, "Asymptotically efficient adaptive allocation schemes for controlled i.i.d. processes: Finite parameter space," IEEE Trans. Autom. Control, vol. 34, no. 3, pp. 258-267, Mar. 1989.
    • (1989) IEEE Trans. Autom. Control , vol.34 , Issue.3 , pp. 258-267
    • Agrawal, R.1    Teneketzis, D.2    Anantharam, V.3
  • 13
    • 0024886640
    • "Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space"
    • R. Agrawal, D. Teneketzis, and V. Anantharam, "Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space," IEEE Trans. Autom. Control, vol. 34, no. 12, pp. 1249-1259, Dec. 1989.
    • (1989) IEEE Trans. Autom. Control , vol.34 , Issue.12 , pp. 1249-1259
    • Agrawal, R.1    Teneketzis, D.2    Anantharam, V.3
  • 14
    • 0023453059
    • "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. rewards"
    • V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. rewards," IEEE Trans. Autom. Control, vol. AC-32, no. 11, pp. 968-976, Nov. 1987.
    • (1987) IEEE Trans. Autom. Control , vol.AC-32 , Issue.11 , pp. 968-976
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 15
    • 0023450663
    • "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part II: Markovian rewards"
    • V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part II: Markovian rewards," IEEE Trans. Autom. Control, vol. AC-32, no. 11, pp. 977-982, Nov. 1987.
    • (1987) IEEE Trans. Autom. Control , vol.AC-32 , Issue.11 , pp. 977-982
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 16
    • 0029047314
    • "Sequential choice from several populations"
    • M. N. Katehakis and H. Robbins, "Sequential choice from several populations," in Proc. Nat. Acad. Sci., vol. 92, Sep. 1995, pp. 8584-8585.
    • (1995) Proc. Nat. Acad. Sci. , vol.92 , pp. 8584-8585
    • Katehakis, M.N.1    Robbins, H.2
  • 17
    • 0034171759
    • "Finite-time lower bounds for the two-armed bandit problem"
    • S. R. Kulkarni and G. Lugosi, "Finite-time lower bounds for the two-armed bandit problem," IEEE Trans. Autom. Control, vol. 45, no. 4, pp. 711-714, Apr. 2000.
    • (2000) IEEE Trans. Autom. Control , vol.45 , Issue.4 , pp. 711-714
    • Kulkarni, S.R.1    Lugosi, G.2
  • 18
    • 0013218879
    • "Covariate models for Bernoulli bandits"
    • M. K. Clayton, "Covariate models for Bernoulli bandits," Seq. Anal., vol. 8, no. 4, pp. 405-426, 1989.
    • (1989) Seq. Anal. , vol.8 , Issue.4 , pp. 405-426
    • Clayton, M.K.1
  • 20
    • 0000017483
    • "One-armed bandit problems with covariates"
    • J. Sarkar, "One-armed bandit problems with covariates," Ann. Statist., vol. 19, no. 4, pp. 1978-2002, 1991.
    • (1991) Ann. Statist. , vol.19 , Issue.4 , pp. 1978-2002
    • Sarkar, J.1
  • 21
    • 0001631327
    • "A one-armed bandit problem with a concomitant variable"
    • M. Woodroofe, "A one-armed bandit problem with a concomitant variable," J. Amer. Stat. Assoc., vol. 74, no. 368, pp. 799-806, Dec. 1979.
    • (1979) J. Amer. Stat. Assoc. , vol.74 , Issue.368 , pp. 799-806
    • Woodroofe, M.1
  • 22
    • 0242628745
    • "Optimal allocations in sequential tests involving two populations with covariates"
    • T. Zoubeidi, "Optimal allocations in sequential tests involving two populations with covariates," Commun. Statist.: Theory Meth., vol. 23, no. 4, pp. 1215-1225, 1994.
    • (1994) Commun. Statist.: Theory Meth. , vol.23 , Issue.4 , pp. 1215-1225
    • Zoubeidi, T.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.