메뉴 건너뛰기




Volumn 30, Issue 1, 2002, Pages 100-121

Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates

Author keywords

Concomitant variable; Multi armed bandits; Nonparametric regression; Randomized allocation; Sequential allocation

Indexed keywords


EID: 0036108219     PISSN: 00905364     EISSN: None     Source Type: Journal    
DOI: 10.1214/aos/1015362186     Document Type: Article
Times cited : (95)

References (20)
  • 4
    • 0000492892 scopus 로고    scopus 로고
    • Minimum contrast estimators on sieves: Exponential bounds and rates of convergence
    • BIRGÉ, L. and MASSART, P. (1998). Minimum contrast estimators on sieves: exponential bounds and rates of convergence. Bernoulli 4 329-375.
    • (1998) Bernoulli , vol.4 , pp. 329-375
    • Birgé, L.1    Massart, P.2
  • 5
    • 0013218879 scopus 로고
    • Covariate models for Bernoulli bandits
    • CLAYTON, M. K. (1989). Covariate models for Bernoulli bandits. Sequential Anal. 8 405-426.
    • (1989) Sequential Anal. , vol.8 , pp. 405-426
    • Clayton, M.K.1
  • 6
    • 0001844070 scopus 로고
    • 1 error of partitioning estimates of a regression function
    • (F. Konecny, J. Mogyoródi and W. Wertz, eds.) Akadémiai Kiadó, Budapest
    • 1 error of partitioning estimates of a regression function. In Proceedings of the Fourth Pannonian Symposium on Mathematical Statistics (F. Konecny, J. Mogyoródi and W. Wertz, eds.) 67-76. Akadémiai Kiadó, Budapest.
    • (1985) Proceedings of the Fourth Pannonian Symposium on Mathematical Statistics , pp. 67-76
    • Devroye, L.1    Györfi, L.2
  • 7
    • 21844511932 scopus 로고
    • On the strong universal consistency of nearest neighbor regression function estimates
    • DEVROYE, L., GYÖRFI, L., KRZYZAK, A. and LUGOSI, G. (1994). On the strong universal consistency of nearest neighbor regression function estimates. Ann. Statist. 22 1371-1385.
    • (1994) Ann. Statist. , vol.22 , pp. 1371-1385
    • Devroye, L.1    Györfi, L.2    Krzyzak, A.3    Lugosi, G.4
  • 12
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • LAI, T. L. and ROBBINS, H. (1985). Asymptotically efficient adaptive allocation rules. Adv. In Appl. Math. 6 4-22.
    • (1985) Adv. In Appl. Math. , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 13
    • 0029344133 scopus 로고
    • Machine learning and nonparametric bandit theory
    • LAI, T. L. and YAKOWITZ, S. (1995). Machine learning and nonparametric bandit theory. IEEE Trans. Automat. Control 40 1199-1209.
    • (1995) IEEE Trans. Automat. Control , vol.40 , pp. 1199-1209
    • Lai, T.L.1    Yakowitz, S.2
  • 14
    • 0030489341 scopus 로고    scopus 로고
    • Histogram regression estimation using data-dependent partitions
    • NOBEL, A. (1996). Histogram regression estimation using data-dependent partitions. Ann. Statist. 24 1084-1105.
    • (1996) Ann. Statist. , vol.24 , pp. 1084-1105
    • Nobel, A.1
  • 16
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • ROBBINS, H. (1952). Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc. 58 527-535.
    • (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-535
    • Robbins, H.1
  • 17
    • 0000017483 scopus 로고
    • One-armed bandit problems with covariates
    • SARKAR, J. (1991). One-armed bandit problems with covariates. Ann. Statist. 19 1978-2002.
    • (1991) Ann. Statist. , vol.19 , pp. 1978-2002
    • Sarkar, J.1
  • 18
    • 0000388992 scopus 로고
    • Consistent nonparametric regression
    • STONE, C. S. (1977). Consistent nonparametric regression. Ann. Statist. 5 595-620.
    • (1977) Ann. Statist. , vol.5 , pp. 595-620
    • Stone, C.S.1
  • 20
    • 0001631327 scopus 로고
    • A one-armed bandit problem with a concomitant variable
    • WOODROOFE, M. (1979). A one-armed bandit problem with a concomitant variable. J. Amer. Statist. Assoc. 74 799-806.
    • (1979) J. Amer. Statist. Assoc. , vol.74 , pp. 799-806
    • Woodroofe, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.