메뉴 건너뛰기




Volumn 1, Issue , 2012, Pages 655-662

PAC subset selection in stochastic multi-armed bandits

Author keywords

[No Author keywords available]

Indexed keywords

FORMAL ANALYSIS; LOWER BOUNDS; MULTI ARMED BANDIT; REGRET MINIMIZATION; SAMPLE COMPLEXITY; SAMPLE COMPLEXITY BOUNDS; SAMPLING ALGORITHM; SUBSET SELECTION;

EID: 84867131498     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (387)

References (8)
  • 1
    • 84864970677 scopus 로고    scopus 로고
    • Best arm identification in multi-armed bandits
    • Omnipress
    • Audibert, Jean-Yves, Bubeck, Sébastien, and Munos, Rémi. Best arm identification in multi-armed bandits. In Proc. COLT 2010, pp. 41-53. Omnipress, 2010.
    • (2010) Proc. COLT 2010 , pp. 41-53
    • Audibert, J.-Y.1    Bubeck, S.2    Munos, R.3
  • 2
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • Auer, Peter, Cesa-Bianchi, Nicolò, and Fischer, Paul. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 3
    • 33745295134 scopus 로고    scopus 로고
    • Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
    • Even-Dar, Eyal, Mannor, Shie, and Mansour, Yishay. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. JMLR, 7:1079-1105, 2006.
    • (2006) JMLR , vol.7 , pp. 1079-1105
    • Even-Dar, E.1    Mannor, S.2    Mansour, Y.3
  • 5
    • 77956526578 scopus 로고    scopus 로고
    • Efficient selection of multiple bandit arms: Theory and practice
    • Omnipress
    • Kalyanakrishnan, Shivaram and Stone, Peter. Efficient selection of multiple bandit arms: Theory and practice. In Proc. ICML 2010, pp. 511-518. Omnipress, 2010.
    • (2010) Proc. ICML 2010 , pp. 511-518
    • Kalyanakrishnan, S.1    Stone, P.2
  • 6
    • 30044441333 scopus 로고    scopus 로고
    • The sample complexity of exploration in the multi-armed bandit problem
    • Mannor, Shie and Tsitsiklis, John N. The sample complexity of exploration in the multi-armed bandit problem. JMLR, 5:623-648, 2004.
    • (2004) JMLR , vol.5 , pp. 623-648
    • Mannor, S.1    Tsitsiklis, J.N.2
  • 8
    • 33750731783 scopus 로고
    • Sequential PAC learning
    • ACM
    • Schuurmans, Dale and Greiner, Russell. Sequential PAC learning. In Proc. COLT 1995, pp. 377-384. ACM, 1995.
    • (1995) Proc. COLT 1995 , pp. 377-384
    • Schuurmans, D.1    Greiner, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.