메뉴 건너뛰기




Volumn 22, Issue , 2012, Pages 592-600

On Bayesian upper confidence bounds for bandit problems

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; BAYESIAN NETWORKS; STOCHASTIC SYSTEMS;

EID: 84954519509     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Conference Paper
Times cited : (271)

References (18)
  • 1
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • P. Auer, N. Cesa-Bianchi, P. Fischer. Finite-time analysis of the multiarmed bandit problem Machine Learning 47, 235-256, 2002.
    • (2002) Machine Learning , vol.47 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 3
    • 0030159874 scopus 로고    scopus 로고
    • Optimal adaptive policies for sequential allocation problems
    • A.N. Burnetas and M.N. Katehakis. Optimal adaptive policies for sequential allocation problems. in Advances in Applied Mathematics, 17 (2): 122-142, 1996.
    • (1996) Advances in Applied Mathematics , vol.17 , Issue.2 , pp. 122-142
    • Burnetas, A.N.1    Katehakis, M.N.2
  • 10
    • 84898077171 scopus 로고    scopus 로고
    • An asymptotically optimal bandit algorithm for bounded support models
    • T. Kalai and M. Mohri, editors
    • J. Honda and A. Takemura. An asymptotically optimal bandit algorithm for bounded support models. In T. Kalai and M. Mohri, editors, Conference On Learning Theory COLT, 2010.
    • (2010) Conference on Learning Theory COLT
    • Honda, J.1    Takemura, A.2
  • 11
    • 0000854435 scopus 로고
    • Adaptive treatment allocation and the multi-armed bandit problem
    • T.L. Lai. Adaptive treatment allocation and the multi-armed bandit problem. In Annals of Statistics 15 (3): 1091-1114, 1987.
    • (1987) Annals of Statistics , vol.15 , Issue.3 , pp. 1091-1114
    • Lai, T.L.1
  • 12
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • T.L. Lai, H. Robbins. Asymptotically efficient adaptive allocation rules. In Advances in Applied Mathematics 6 (1): 4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , Issue.1 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 15
    • 84874038864 scopus 로고    scopus 로고
    • A finite-time analysis of Multi-armed bandits problems with Kullback-Leibler Divergence
    • O. Maillard, R. Munos, G. Stoltz. A finite-time analysis of Multi-armed bandits problems with Kullback- Leibler Divergence In Conference On Learning Theory COLT , 2011.
    • (2011) Conference on Learning Theory COLT
    • Maillard, O.1    Munos, R.2    Stoltz, G.3
  • 18
    • 0001395850 scopus 로고
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • W.R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. In Biometrika 25: 285-294, 1933.
    • (1933) Biometrika , vol.25 , pp. 285-294
    • Thompson, W.R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.