



Volume , Issue , 2010, Pages 511-518

Efficient selection of multiple bandit arms: Theory and practice

Author keywords

[No Author keywords available]

Indexed keywords

EMPIRICAL COMPARISON; MULTI ARMED BANDIT; PAC BOUNDS; SAMPLING ALGORITHM; STOCHASTIC OPTIMIZATION ALGORITHM; SUBSET SELECTION; THEORETICAL BASIS; THEORY AND PRACTICE;

EID: 77956526578     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (108)

References (15)
  • 2
    • 61349118110
    • Efficient simulation budget allocation for selecting an optimal subset
    • Chen, Chun-Hung, He, Donghai, Fu, Michael, and Lee, Loo Hay. Efficient simulation budget allocation for selecting an optimal subset. INFORMS Journal on Computing, 20(4):579-595, 2008.
    • (2008) INFORMS Journal on Computing , vol.20 , Issue.4 , pp. 579-595
    • Chen, C.-H.1    He, D.2    Fu, M.3    Lee, L.H.4
  • 4
    • 33745295134
    • Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
    • Even-Dar, Eyal, Mannor, Shie, and Mansour, Yishay. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Mach. Learn. Research, 7: 1079-1105, 2006.
    • (2006) Journal of Mach. Learn. Research , vol.7 , pp. 1079-1105
    • Even-Dar, E.1    Mannor, S.2    Mansour, Y.3
  • 5
    • 71149099672
    • Hoeffding and Bernstein races for selecting policies in evolutionary direct policy search
    • ACM
    • Heidrich-Meisner, Verena and Igel, Christian. Hoeffding and Bernstein races for selecting policies in evolutionary direct policy search. In Proc. ICML 2009, pp. 401-408. ACM, 2009.
    • (2009) Proc. ICML 2009 , pp. 401-408
    • Heidrich-Meisner, V.1    Igel, C.2
  • 7
    • 0346963698
    • A fully sequential procedure for indifference-zone selection in simulation
    • Kim, Seong-Hee and Nelson, Barry L. A fully sequential procedure for indifference-zone selection in simulation. ACM Transactions on Modeling and Computer Simulation, 11(3):251-273, 2001.
    • (2001) ACM Transactions on Modeling and Computer Simulation , vol.11 , Issue.3 , pp. 251-273
    • Kim, S.-H.1    Nelson, B.L.2
  • 8
    • 0001640560
    • A procedure for selecting a subset of size m containing the l best of k independent normal populations, with applications to simulation
    • Koenig, Lloyd W. and Law, Averill M. A procedure for selecting a subset of size m containing the l best of k independent normal populations, with applications to simulation. Communications in Statistics. Simulation and Computation, 14(3):719-734, 1985.
    • (1985) Communications in Statistics. Simulation and Computation , vol.14 , Issue.3 , pp. 719-734
    • Koenig, L.W.1    Law, A.M.2
  • 9
    • 33749817930
    • Active model selection
    • AUAI Press
    • Madani, Omid, Lizotte, Daniel J., and Greiner, Russell. Active model selection. In Proc. UAI 2004, pp. 357-365. AUAI Press, 2004.
    • (2004) Proc. UAI 2004 , pp. 357-365
    • Madani, O.1    Lizotte, D.J.2    Greiner, R.3
  • 10
    • 30044441333
    • The sample complexity of exploration in the multi-armed bandit problem
    • Mannor, Shie and Tsitsiklis, John N. The sample complexity of exploration in the multi-armed bandit problem. Journal of Mach. Learn. Research, 5: 623-648, 2004.
    • (2004) Journal of Mach. Learn. Research , vol.5 , pp. 623-648
    • Mannor, S.1    Tsitsiklis, J.N.2
  • 11
    • 0031069121
    • The racing algorithm: Model selection for lazy learners
    • Maron, Oded and Moore, Andrew W. The racing algorithm: Model selection for lazy learners. Artificial Intelligence Review, 11(1-5):193-225, 1997.
    • (1997) Artificial Intelligence Review , vol.11 , Issue.1-5 , pp. 193-225
    • Maron, O.1    Moore, A.W.2
  • 12
    • 21444438033
    • Genetic algorithms, selection schemes, and the varying effects of noise
    • Miller, Brad L. and Goldberg, David E. Genetic algorithms, selection schemes, and the varying effects of noise. Evolutionary Computation, 4(2):113-131, 1996.
    • (1996) Evolutionary Computation , vol.4 , Issue.2 , pp. 113-131
    • Miller, B.L.1    Goldberg, D.E.2
  • 14
    • 33745783272
    • Integrating techniques from statistical ranking into evolutionary algorithms
    • volume 3907 of LNCS, Springer
    • Schmidt, Christian, Branke, Jürgen, and Chick, Stephen E. Integrating techniques from statistical ranking into evolutionary algorithms. In Appl. of Evolutionary Comp., volume 3907 of LNCS, pp. 752-763. Springer, 2006.
    • (2006) Appl. of Evolutionary Comp. , pp. 752-763
    • Schmidt, C.1    Branke, J.2    Chick, S.E.3
  • 15
    • 33750226914
    • On-line evolutionary computation for reinforcement learning in stochastic domains
    • ACM
    • Whiteson, Shimon and Stone, Peter. On-line evolutionary computation for reinforcement learning in stochastic domains. In Proc. GECCO 2006, pp. 1577-1584. ACM, 2006.
    • (2006) Proc. GECCO 2006 , pp. 1577-1584
    • Whiteson, S.1    Stone, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.