메뉴 건너뛰기




Volumn 4754 LNAI, Issue , 2007, Pages 150-165

Tuning bandit algorithms in stochastic environments

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; PARAMETER ESTIMATION; POLYNOMIALS; PROBLEM SOLVING; RISK ANALYSIS;

EID: 38149013086     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-75225-7_15     Document Type: Conference Paper
Times cited : (145)

References (9)
  • 1
    • 0000616723 scopus 로고
    • Sample mean based index policies with O(log n) regret for the multiarmed bandit problem
    • Agrawal, R.: Sample mean based index policies with O(log n) regret for the multiarmed bandit problem. Advances in Applied Probability 27, 1054-1078 (1995)
    • (1995) Advances in Applied Probability , vol.27 , pp. 1054-1078
    • Agrawal, R.1
  • 2
    • 38149098629 scopus 로고    scopus 로고
    • Variance estimates and exploration function in multi-armed bandit
    • 07-31, Certis, Ecole des Ponts
    • Audibert, J.-Y., Munos, R., Szepesvári,Cs.: Variance estimates and exploration function in multi-armed bandit. Research report 07-31, Certis - Ecole des Ponts (2007), http://cermics.enpc.fr/~audibert/RR0731.pdf
    • (2007) Research report
    • Audibert, J.-Y.1    Munos, R.2    Szepesvári, C.3
  • 3
    • 0036568025 scopus 로고    scopus 로고
    • Finite time analysis of the multiarmed bandit problem
    • Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite time analysis of the multiarmed bandit problem. Machine Learning 47(2-3), 235-256 (2002)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 6
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T.L., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985)
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 8
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • Robbins, H.: Some aspects of the sequential design of experiments. Bulletin of the American Mathematics Society 58, 527-535 (1952)
    • (1952) Bulletin of the American Mathematics Society , vol.58 , pp. 527-535
    • Robbins, H.1
  • 9
    • 0001395850 scopus 로고
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • Thompson, W.R.: On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25, 285-294 (1933)
    • (1933) Biometrika , vol.25 , pp. 285-294
    • Thompson, W.R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.