Volume , Issue , 2013, Pages

Thompson sampling for 1-dimensional exponential family bandits

Author keywords

[No Author keywords available]

Indexed keywords

FISHER INFORMATION MATRIX;

EID: 84898959192     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (150)

References (17)
  • 4
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2):235-256, 2002.
    • (2002) Machine Learning , vol.47 , Issue.2 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 7
    • 84898949562
    • Kullback-Leibler upper confidence bounds for optimal sequential allocation
    • O. Cappé, A. Garivier, O-A. Maillard, R. Munos, and G. Stoltz. Kullback-Leibler upper confidence bounds for optimal sequential allocation. Annals of Statistics, 41(3):1516-1541, 2013.
    • (2013) Annals of Statistics , vol.41 , Issue.3 , pp. 1516-1541
    • Cappé, O.1    Garivier, A.2    Maillard, O.-A.3    Munos, R.4    Stoltz, G.5
  • 9
    • 84898077171
    • An asymptotically optimal bandit algorithm for bounded support models
    • J. Honda and A. Takemura. An asymptotically optimal bandit algorithm for bounded support models. In Conference On Learning Theory (COLT), 2010.
    • (2010) Conference on Learning Theory (COLT)
    • Honda, J.1    Takemura, A.2
  • 10
    • 84873751778
    • An invariant form for prior probability in estimation problem
    • H. Jeffreys. An invariant form for prior probability in estimation problem. Proceedings of the Royal Society of London, 186:453-461, 1946.
    • (1946) Proceedings of the Royal Society of London , vol.186 , pp. 453-461
    • Jeffreys, H.1
  • 12
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6(1):4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , Issue.1 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 15
    • 0001395850
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • W.R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25:285-294, 1933.
    • (1933) Biometrika , vol.25 , pp. 285-294
    • Thompson, W.R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.