메뉴 건너뛰기




Volumn , Issue , 2011, Pages 181-194

Successive reduction of arms in multi-armed bandits

Author keywords

[No Author keywords available]

Indexed keywords

ONLINE SYSTEMS; SAMPLING;

EID: 84881535815     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/978-1-4471-2318-7_13     Document Type: Conference Paper
Times cited : (5)

References (20)
  • 1
    • 77951164997 scopus 로고    scopus 로고
    • Explore/exploit schemes for web content optimization
    • D. Agarwal, B-C. Chen, and P. Elango. Explore/exploit schemes for web content optimization. In ICDM, 2009
    • (2009) ICDM
    • Agarwal, D.1    Chen, B.-C.2    Elango, P.3
  • 2
    • 0000616723 scopus 로고
    • Sample mean based index policies with o(log n) regret for multi-armed bandit problem
    • November
    • R. Agrawal. Sample mean based index policies with o(log n) regret for multi-armed bandit problem. Advances in Applied Probability., 27:1054-1078, November 1995
    • (1995) Advances in Applied Probability , vol.27 , pp. 1054-1078
    • Agrawal, R.1
  • 3
    • 0036568025 scopus 로고    scopus 로고
    • Finite time analysis of multi-armed bandit problem
    • P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite time analysis of multi-armed bandit problem. Machine Learning, 27(2-3):235-256, 2002
    • (2002) Machine Learning , vol.27 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 8
    • 77956543367 scopus 로고    scopus 로고
    • Web-scale bayesian click-Through rate prediction for sponsored search advertising in microsofts bing search engine
    • T. Graepel, J. Q. Candela, T. Borchert, and R. Herbrich. Web-scale bayesian click-Through rate prediction for sponsored search advertising in microsofts bing search engine. In ICML, 2010
    • (2010) ICML
    • Graepel, T.1    Candela, J.Q.2    Borchert, T.3    Herbrich, R.4
  • 9
    • 84881520361 scopus 로고    scopus 로고
    • The bayesian learning automaton empirical evaluation with two-armed bernoulli bandit problems
    • O-C. Granmo. The bayesian learning automaton empirical evaluation with two-armed bernoulli bandit problems. Research and Development in Intelligent Systems XXV, pages 235-248, 2009
    • (2009) Research and Development in Intelligent Systems XXV , pp. 235-248
    • Granmo, O.-C.1
  • 10
    • 78549244167 scopus 로고    scopus 로고
    • Solving two-armed bernoulli bandit problems using a bayesian learning automaton
    • O-C. Granmo. Solving two-armed bernoulli bandit problems using a bayesian learning automaton. International Journal of Intelligent Computing and Cybernetics, 2(3):207-234, 2010
    • (2010) International Journal of Intelligent Computing and Cybernetics , vol.2 , Issue.3 , pp. 207-234
    • Granmo, O.-C.1
  • 12
    • 0004093274 scopus 로고
    • of Wiley Series in Probability and Mathematical Statistics, chapter 4. John Wiley and Sons, Inc., second edition
    • S. S. Gupta and S. Panchapakesan. Multiple Decision Procedures, volume 1 of Wiley Series in Probability and Mathematical Statistics, chapter 4, pages 59-93. John Wiley and Sons, Inc., second edition, 1979
    • (1979) Multiple Decision Procedures , vol.1 , pp. 59-93
    • Gupta, S.S.1    Panchapakesan, S.2
  • 13
    • 84947403595 scopus 로고
    • Probability inequalities for sums of bounded random variables
    • W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963
    • (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
    • Hoeffding, W.1
  • 14
    • 84864970677 scopus 로고    scopus 로고
    • Best arm identification in multi-armed bandits
    • Jean-Yves, S. Bubeck, and R. Munos. Best arm identification in multi-armed bandits. In COLT, 2010
    • (2010) COLT
    • Bubeck, J.S.1    Munos, R.2
  • 16
    • 0001923944 scopus 로고
    • Hoeffding races: Accelerating model selection search for classification and function approximation
    • O. Maron and A. W. Moore. Hoeffding races: Accelerating model selection search for classification and function approximation. In NIPS, 1994
    • (1994) NIPS
    • Maron, O.1    Moore, A.W.2
  • 18
    • 77956297112 scopus 로고    scopus 로고
    • A generic solution to multiarmed bernoulli bandit problems based on random sampling from sibling conjugate priors
    • T. Norheim, T. Brdland, O-C. Granmo, and B. J. Oommen. A generic solution to multiarmed bernoulli bandit problems based on random sampling from sibling conjugate priors. In ICAART, 2010
    • (2010) ICAART
    • Norheim, T.1    Brdland, T.2    Granmo, O.-C.3    Oommen, B.J.4
  • 19
    • 0001395850 scopus 로고
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • W. R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 1933
    • (1933) Biometrika
    • Thompson, W.R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.