메뉴 건너뛰기




Volumn , Issue , 2011, Pages 241-248

Beat the mean bandit

Author keywords

[No Author keywords available]

Indexed keywords

ONLINE LEARNING;

EID: 80053438514     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (130)

References (11)
  • 3
    • 33745295134 scopus 로고    scopus 로고
    • Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
    • Even-Dar, Eyal, Mannor, Shie, and Mansour, Yishay. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research (JMLR), 7:1079-1105, 2006.
    • (2006) Journal of Machine Learning Research (JMLR) , vol.7 , pp. 1079-1105
    • Even-Dar, E.1    Mannor, S.2    Mansour, Y.3
  • 6
    • 0002899547 scopus 로고
    • Asymptotically efficient adaptive allocation rules
    • Lai, T. L. and Robbins, Herbert. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 7
    • 30044441333 scopus 로고    scopus 로고
    • The sample complexity of exploration in the multi-armed bandit problem
    • Mannor, Shie and Tsitsiklis, John N. The sample complexity of exploration in the multi-armed bandit problem. Journal of Machine Learning Research (JMLR), 5: 623-648, 2004.
    • (2004) Journal of Machine Learning Research (JMLR) , vol.5 , pp. 623-648
    • Mannor, S.1    Tsitsiklis, J.N.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.