SCOPUS 정보 검색 플랫폼

Volumn 4, Issue , 2012, Pages 3212-3220

Best arm identification: A unified approach to fixed budget and fixed confidence

a INRIA (France)

Author keywords

[No Author keywords available]

Indexed keywords

COMMON STRUCTURES; FIXED BUDGET; MULTI ARMED BANDIT; PERFORMANCE BOUNDS; UNIFIED APPROACH;

ALGORITHMS;

BUDGET CONTROL;

EID: 84877730309 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (305)

References (15)

1
- 84864970677
- Best arm identification in multi-armed bandits
- J.-Y. Audibert, S. Bubeck, and R. Munos. Best arm identification in multi-armed bandits. In Proceedings of the Twenty-Third Annual Conference on Learning Theory, pages 41-53, 2010.
- (2010) Proceedings of the Twenty-Third Annual Conference on Learning Theory , pp. 41-53
- Audibert, J.-Y.¹ Bubeck, S.² Munos, R.³

2
- 0036568025
- Finite-time analysis of the multi-armed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multi-armed bandit problem. Machine Learning, 47:235-256, 2002.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 77952070805
- Pure exploration in multi-armed bandit problems
- S. Bubeck, R. Munos, and G. Stoltz. Pure exploration in multi-armed bandit problems. In Proceedings of the Twentieth International Conference on Algorithmic Learning Theory, pages 23-37, 2009.
- (2009) Proceedings of the Twentieth International Conference on Algorithmic Learning Theory , pp. 23-37
- Bubeck, S.¹ Munos, R.² Stoltz, G.³

5
- 80053148209
- Active learning for developing personalized treatment
- K. Deng, J. Pineau, and S. Murphy. Active learning for developing personalized treatment. In Proceedings of the Twenty-Seventh International Conference on Uncertainty in Artificial Intelligence, pages 161-168, 2011.
- (2011) Proceedings of the Twenty-Seventh International Conference on Uncertainty in Artificial Intelligence , pp. 161-168
- Deng, K.¹ Pineau, J.² Murphy, S.³

6
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- E. Even-Dar, S. Mannor, and Y. Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
- Even-Dar, E.¹ Mannor, S.² Mansour, Y.³

7
- 84877727727
- Technical report 00747005, October
- V. Gabillon, M. Ghavamzadeh, and A. Lazaric. Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence. Technical report 00747005, October 2012.
- (2012) Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence
- Gabillon, V.¹ Ghavamzadeh, M.² Lazaric, A.³

8
- 85162482585
- Multi-bandit best arm identification
- V. Gabillon, M. Ghavamzadeh, A. Lazaric, and S. Bubeck. Multi-bandit best arm identification. In Proceedings of Advances in Neural Information Processing Systems 25, pages 2222-2230, 2011.
- (2011) Proceedings of Advances in Neural Information Processing Systems , vol.25 , pp. 2222-2230
- Gabillon, V.¹ Ghavamzadeh, M.² Lazaric, A.³ Bubeck, S.⁴

9
- 84867121052
- PhD thesis, Department of Computer Science, The University of Texas at Austin, Austin, Texas, USA, December. Published as UT Austin Computer Science Technical Report TR-11-41
- S. Kalyanakrishnan. Learning Methods for Sequential Decision Making with Imperfect Representations. PhD thesis, Department of Computer Science, The University of Texas at Austin, Austin, Texas, USA, December 2011. Published as UT Austin Computer Science Technical Report TR-11-41.
- (2011) Learning Methods for Sequential Decision Making with Imperfect Representations
- Kalyanakrishnan, S.¹

10
- 77956526578
- Efficient selection of multiple bandit arms: Theory and practice
- S. Kalyanakrishnan and P. Stone. Efficient selection of multiple bandit arms: Theory and practice. In Proceedings of the Twenty-Seventh International Conference on Machine Learning, pages 511-518, 2010.
- (2010) Proceedings of the Twenty-Seventh International Conference on Machine Learning , pp. 511-518
- Kalyanakrishnan, S.¹ Stone, P.²

11
- 84867131498
- Pac subset selection in stochastic multiarmed bandits
- S. Kalyanakrishnan, A. Tewari, P. Auer, and P. Stone. Pac subset selection in stochastic multiarmed bandits. In Proceedings of the Twentieth International Conference on Machine Learning, 2012.
- (2012) Proceedings of the Twentieth International Conference on Machine Learning
- Kalyanakrishnan, S.¹ Tewari, A.² Auer, P.³ Stone, P.⁴

12
- 0001923944
- Hoeffding races: Accelerating model selection search for classification and function approximation
- O. Maron and A. Moore. Hoeffding races: Accelerating model selection search for classification and function approximation. In Proceedings of Advances in Neural Information Processing Systems 6, pages 59-66, 1993.
- (1993) Proceedings of Advances in Neural Information Processing Systems , vol.6 , pp. 59-66
- Maron, O.¹ Moore, A.²

13
- 84898061133
- Empirical bernstein bounds and sample-variance penalization
- A. Maurer and M. Pontil. Empirical bernstein bounds and sample-variance penalization. In 22th annual conference on learning theory, 2009.
- (2009) 22th Annual Conference on Learning Theory
- Maurer, A.¹ Pontil, M.²

14
- 56449108844
- Empirical Bernstein stopping
- V. Mnih, Cs. Szepesvári, and J.-Y. Audibert. Empirical Bernstein stopping. In Proceedings of the Twenty-Fifth International Conference on Machine Learning, pages 672-679, 2008.
- (2008) Proceedings of the Twenty-Fifth International Conference on Machine Learning , pp. 672-679
- Mnih, V.¹ Szepesvári, Cs.² Audibert, J.-Y.³

15
- 84966203785
- Some aspects of the sequential design of experiments
- H. Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematics Society, 58:527-535, 1952.
- (1952) Bulletin of the American Mathematics Society , vol.58 , pp. 527-535
- Robbins, H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.