메뉴 건너뛰기




Volumn 57, Issue 3, 2011, Pages 1707-1713

A note on performance limitations in bandit problems with side information

Author keywords

Allocation rule; inferior sampling rate; lower bound; side information; two armed bandit

Indexed keywords

ALLOCATION RULE; INFERIOR SAMPLING RATE; LOWER BOUND; SIDE INFORMATION; TWO-ARMED BANDIT;

EID: 79951890373     PISSN: 00189448     EISSN: None     Source Type: Journal    
DOI: 10.1109/TIT.2011.2104450     Document Type: Article
Times cited : (10)

References (20)
  • 1
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • DOI 10.1023/A:1013689704352, Computational Learning Theory
    • P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite time analysis of the multiarmed bandit problem," Mach. Learn., vol. 47, pp. 235-256, 2002. (Pubitemid 34126111)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 2
    • 0023453059 scopus 로고
    • Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-part I: I.i.d. rewards
    • V. Anantharam, P. Varaiya, and J. Warland, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-part I: I.i.d. rewards," IEEE Trans. Autom. Control, vol. 34, pp. 968-976, 1987a. (Pubitemid 18521625)
    • (1987) IEEE Transactions on Automatic Control , vol.AC-32 , Issue.11 , pp. 968-976
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 3
    • 0023450663 scopus 로고
    • Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple playspart II: Markovian rewards
    • V. Anantharam, P. Varaiya, and J. Warland, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple playspart II: Markovian rewards," IEEE Trans. Autom. Control, vol. 34, pp. 977-982, 1987b. (Pubitemid 18521626)
    • (1987) IEEE Transactions on Automatic Control , vol.AC-32 , Issue.11 , pp. 977-982
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 5
    • 0013218879 scopus 로고
    • Covariate models for bernoulli bandits
    • M. K. Clayton, "Covariate models for bernoulli bandits," Sequential Anal., vol. 8, pp. 405-426, 1989.
    • (1989) Sequential Anal. , vol.8 , pp. 405-426
    • Clayton, M.K.1
  • 7
    • 70049095891 scopus 로고    scopus 로고
    • Woodroofe's one armed bandit problem revisited
    • A. Goldenshluger and A. Zeevi, "Woodroofe's one armed bandit problem revisited," Ann. Appl. Probab., vol. 19, pp. 1603-1633, 2009.
    • (2009) Ann. Appl. Probab. , vol.19 , pp. 1603-1633
    • Goldenshluger, A.1    Zeevi, A.2
  • 8
    • 0034171759 scopus 로고    scopus 로고
    • Finite time lower bounds for the two-armed bandit problem
    • S. Kulkarni and G. Lugosi, "Finite time lower bounds for the two-armed bandit problem," IEEE Trans. Autom. Control, vol. 45, pp. 711-714, 2000.
    • (2000) IEEE Trans. Autom. Control , vol.45 , pp. 711-714
    • Kulkarni, S.1    Lugosi, G.2
  • 9
    • 0001732282 scopus 로고
    • Asymptotically optimal allocation of treatments in sequential experiments
    • New York: Dekker
    • T. L. Lai and H. Robbins, "Asymptotically optimal allocation of treatments in sequential experiments," in Design of Experiments. New York: Dekker, 1984, pp. 127-142.
    • (1984) Design of Experiments , pp. 127-142
    • Lai, T.L.1    Robbins, H.2
  • 10
    • 0002899547 scopus 로고
    • Asymptotically efficient allocation rules
    • T. L. Lai and H. Robbins, "Asymptotically efficient allocation rules," Adv. Appl. Math., vol. 6, pp. 4-22, 1985.
    • (1985) Adv. Appl. Math. , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 11
    • 0000854435 scopus 로고
    • Adaptive treatment allocation and the multiarmed bandit problem
    • T. L. Lai, "Adaptive treatment allocation and the multiarmed bandit problem," Ann. Statist., vol. 15, pp. 1091-1114, 1987.
    • (1987) Ann. Statist. , vol.15 , pp. 1091-1114
    • Lai, T.L.1
  • 12
    • 0029344133 scopus 로고
    • Machine learning and nonparametric bandit theory
    • Jul.
    • T. L. Lai and S.Yakowitz, "Machine learning and nonparametric bandit theory," IEEE Trans. Autom. Control, vol. 40, no. 7, pp. 1199-1209, Jul. 1995.
    • (1995) IEEE Trans. Autom. Control , vol.40 , Issue.7 , pp. 1199-1209
    • Lai, T.L.1    Yakowitz, S.2
  • 14
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol. 55, pp. 527-535, 1952.
    • (1952) Bull. Amer. Math. Soc. , vol.55 , pp. 527-535
    • Robbins, H.1
  • 15
    • 0000017483 scopus 로고
    • One-armed bandit problems with covariates
    • J. Sarkar, "One-armed bandit problems with covariates," Ann. Statist., vol. 19, pp. 1978-2002, 1991.
    • (1991) Ann. Statist. , vol.19 , pp. 1978-2002
    • Sarkar, J.1
  • 17
  • 18
    • 0001631327 scopus 로고
    • A one-armed bandit problem with a concomitant variable
    • M.Woodroofe, "A one-armed bandit problem with a concomitant variable," J. Amer. Statist. Assoc., vol. 74, pp. 799-806, 1979.
    • (1979) J. Amer. Statist. Assoc. , vol.74 , pp. 799-806
    • Woodroofe, M.1
  • 19
    • 0006030678 scopus 로고
    • Sequential allocation with covariates
    • M. Woodroofe, "Sequential allocation with covariates," Sankhya Ser., vol. 44, pp. 403-414, 1982.
    • (1982) Sankhya Ser. , vol.44 , pp. 403-414
    • Woodroofe, M.1
  • 20
    • 0036108219 scopus 로고    scopus 로고
    • Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
    • DOI 10.1214/aos/1015362186
    • Y. Yang and D. Zhu, "Randomized allocation with nonparametric estimation for a multiarmed bandit problem with covariates," Ann. Statis., vol. 30, pp. 100-121, 2002. (Pubitemid 37095370)
    • (2002) Annals of Statistics , vol.30 , Issue.1 , pp. 100-121
    • Yang, Y.1    Zhu, D.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.