메뉴 건너뛰기




Volumn 4, Issue , 2012, Pages 3212-3220

Best arm identification: A unified approach to fixed budget and fixed confidence

Author keywords

[No Author keywords available]

Indexed keywords

COMMON STRUCTURES; FIXED BUDGET; MULTI ARMED BANDIT; PERFORMANCE BOUNDS; UNIFIED APPROACH;

EID: 84877730309     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (305)

References (15)
  • 2
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multi-armed bandit problem
    • P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multi-armed bandit problem. Machine Learning, 47:235-256, 2002.
    • (2002) Machine Learning , vol.47 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 4
    • 84877752876 scopus 로고    scopus 로고
    • Multiple identifications in multi-armed bandits
    • abs/1205.3181
    • S. Bubeck, T. Wang, and N. Viswanathan. Multiple identifications in multi-armed bandits. CoRR, abs/1205.3181, 2012.
    • (2012) CoRR
    • Bubeck, S.1    Wang, T.2    Viswanathan, N.3
  • 6
    • 33745295134 scopus 로고    scopus 로고
    • Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
    • E. Even-Dar, S. Mannor, and Y. Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006.
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
    • Even-Dar, E.1    Mannor, S.2    Mansour, Y.3
  • 9
    • 84867121052 scopus 로고    scopus 로고
    • PhD thesis, Department of Computer Science, The University of Texas at Austin, Austin, Texas, USA, December. Published as UT Austin Computer Science Technical Report TR-11-41
    • S. Kalyanakrishnan. Learning Methods for Sequential Decision Making with Imperfect Representations. PhD thesis, Department of Computer Science, The University of Texas at Austin, Austin, Texas, USA, December 2011. Published as UT Austin Computer Science Technical Report TR-11-41.
    • (2011) Learning Methods for Sequential Decision Making with Imperfect Representations
    • Kalyanakrishnan, S.1
  • 12
    • 0001923944 scopus 로고
    • Hoeffding races: Accelerating model selection search for classification and function approximation
    • O. Maron and A. Moore. Hoeffding races: Accelerating model selection search for classification and function approximation. In Proceedings of Advances in Neural Information Processing Systems 6, pages 59-66, 1993.
    • (1993) Proceedings of Advances in Neural Information Processing Systems , vol.6 , pp. 59-66
    • Maron, O.1    Moore, A.2
  • 15


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.