SCOPUS Information Retrieval Platform

Volume , Issue , 2011, Pages

Multi-bandit best arm identification

a INRIA (France)

Author keywords

[No Author keywords available]

Indexed keywords

ALLOCATION STRATEGY; PERFORMANCE; PROBABILITIES OF ERROR; SMALL GAPS; SYNTHETIC PROBLEM; UPPER BOUND;

EID: 85162482585 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (106)

References (13)

1
- 84864970677
- Best arm identification in multi-armed bandits
- J.-Y. Audibert, S. Bubeck, and R. Munos. Best arm identification in multi-armed bandits. In Proceedings of the Twenty-Third Annual Conference on Learning Theory, pages 41-53, 2010.
- (2010) Proceedings of the Twenty-Third Annual Conference on Learning Theory , pp. 41-53
- Audibert, J.-Y.¹ Bubeck, S.² Munos, R.³

3
- 0036568025
- Finite-time analysis of the multi-armed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multi-armed bandit problem. Machine Learning, 47:235-256, 2002.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

4
- 77952070805
- Pure exploration in multi-armed bandit problems
- S. Bubeck, R. Munos, and G. Stoltz. Pure exploration in multi-armed bandit problems. In Proceedings of the Twentieth International Conference on Algorithmic Learning Theory, pages 23-37, 2009.
- (2009) Proceedings of the Twentieth International Conference on Algorithmic Learning Theory , pp. 23-37
- Bubeck, S.¹ Munos, R.² Stoltz, G.³

5
- 80052250780
- Active learning for personalizing treatment
- K. Deng, J. Pineau, and S. Murphy. Active learning for personalizing treatment. In IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2011.
- (2011) IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
- Deng, K.¹ Pineau, J.² Murphy, S.³

6
- 48349140736
- Rollout sampling approximate policy iteration
- C. Dimitrakakis and M. Lagoudakis. Rollout sampling approximate policy iteration. Machine Learning Journal, 72(3):157-171, 2008.
- (2008) Machine Learning Journal , vol.72 , Issue.3 , pp. 157-171
- Dimitrakakis, C.¹ Lagoudakis, M.²

7
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
- Even-Dar, E.¹ Mannor, S.² Mansour, Y.³

8
- 85162463938
- Technical Report 00632523, INRIA
- V. Gabillon, M. Ghavamzadeh, A. Lazaric, and S. Bubeck. Multi-bandit best arm identification. Technical Report 00632523, INRIA, 2011.
- (2011) Multi-bandit Best Arm Identification
- Gabillon, V.¹ Ghavamzadeh, M.² Lazaric, A.³ Bubeck, S.⁴

9
- 1942420814
- Reinforcement learning as classification: Leveraging modern classifiers
- M. Lagoudakis and R. Parr. Reinforcement learning as classification: Leveraging modern classifiers. In Proceedings of the Twentieth International Conference on Machine Learning, pages 424-431, 2003.
- (2003) Proceedings of the Twentieth International Conference on Machine Learning , pp. 424-431
- Lagoudakis, M.¹ Parr, R.²

10
- 0001923944
- Hoeffding races: Accelerating model selection search for classification and function approximation
- O. Maron and A. Moore. Hoeffding races: Accelerating model selection search for classification and function approximation. In Proceedings of Advances in Neural Information Processing Systems 6, 1993.
- (1993) Proceedings of Advances in Neural Information Processing Systems , vol.6
- Maron, O.¹ Moore, A.²

11
- 84898061133
- Empirical Bernstein bounds and sample-variance penalization
- A. Maurer and M. Pontil. Empirical Bernstein bounds and sample-variance penalization. In 22nd Annual Conference on Learning Theory, 2009.
- (2009) 22nd Annual Conference on Learning Theory
- Maurer, A.¹ Pontil, M.²

12
- 56449108844
- Empirical Bernstein stopping
- V. Mnih, Cs. Szepesvári, and J.-Y. Audibert. Empirical Bernstein stopping. In Proceedings of the Twenty-Fifth International Conference on Machine Learning, pages 672-679, 2008.
- (2008) Proceedings of the Twenty-Fifth International Conference on Machine Learning , pp. 672-679
- Mnih, V.¹ Szepesvári, Cs.² Audibert, J.-Y.³

13
- 84966203785
- Some aspects of the sequential design of experiments
- H. Robbins. Some aspects of the sequential design of experiments. Bulletin of the American Mathematical Society, 58:527-535, 1952.
- (1952) Bulletin of the American Mathematical Society , vol.58 , pp. 527-535
- Robbins, H.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.