SCOPUS 정보 검색 플랫폼

Volumn , Issue PART 3, 2013, Pages 2275-2283

Almost optimal exploration in multi-armed bandits

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING SYSTEMS;

LARGE-SCALE APPLICATIONS; LOWER AND UPPER BOUNDS; LOWER BOUNDS; MULTI ARMED BANDIT; PROBLEM PARAMETERS; UPPER BOUND;

ALGORITHMS;

EID: 84897478950 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (244)

References (17)

1
- 77951164997
- Explore/exploit schemes for web content optimization
- Agarwal, D., Chen, B., and Elango, P. Explore/exploit schemes for web content optimization. In Proc. Ninth IEEE International Conference on Data Mining (ICDM'2009), pp. 1-10, 2009.
- (2009) Proc. Ninth IEEE International Conference on Data Mining (ICDM'2009) , pp. 1-10
- Agarwal, D.¹ Chen, B.² Elango, P.³

2
- 84864970677
- Best arm identification in multi-armed bandits
- Audibert, J.Y., Bubeck, S., and Munos, R. Best arm identification in multi-armed bandits. In COLT, pp. 41-53, 2010.
- (2010) COLT , pp. 41-53
- Audibert, J.Y.¹ Bubeck, S.² Munos, R.³

3
- 77957337199
- UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem
- Auer, P. and Ortner, R. UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem. Periodica Mathematica Hungarica, 61(1-2):55-65, 2010.
- (2010) Periodica Mathematica Hungarica , vol.61 , Issue.1-2 , pp. 55-65
- Auer, P.¹ Ortner, R.²

6
- 84897498871
- Multiple identifications in multi-armed bandits
- Bubeck, S., Wang, T., and Viswanathan, N. Multiple identifications in multi-armed bandits. In Proceedings of the 30th International Conference on Machine Learning, 2013.
- Proceedings of the 30th International Conference on Machine Learning, 2013
- Bubeck, S.¹ Wang, T.² Viswanathan, N.³

7
- 84863406076
- Mortal multi-armed bandits
- Chakrabarti, D., Kumar, R., Radlinski, F., and Upfal, E. Mortal multi-armed bandits. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS'2008), pp. 273-280, 2008.
- (2008) Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS'2008) , pp. 273-280
- Chakrabarti, D.¹ Kumar, R.² Radlinski, F.³ Upfal, E.⁴

9
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Even-Dar, E., Mannor, S., and Mansour, Y. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. The Journal of Machine Learning Research, 7:1079-1105, 2006. (Pubitemid 43938989)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
- Even-Bar, E.¹ Mannor, S.² Mansour, Y.³

10
- 85162482585
- Multi-bandit best arm identification
- Gabillon, V., Ghavamzadeh, M., Lazaric, A., and Bubeck, S. Multi-bandit best arm identification. In Advances in Neural Information Processing Systems 24, pp. 2222-2230. 2011.
- (2011) Advances in Neural Information Processing Systems 24 , pp. 2222-2230
- Gabillon, V.¹ Ghavamzadeh, M.² Lazaric, A.³ Bubeck, S.⁴

11
- 84877730309
- Best arm identification: A unified approach to fixed budget and fixed confidence
- Gabillon, V., Ghavamzadeh, M., and Lazaric, A. Best arm identification: A unified approach to fixed budget and fixed confidence. In Advances in Neural Information Processing Systems 25, pp. 3221-3229, 2012.
- (2012) Advances in Neural Information Processing Systems 25 , pp. 3221-3229
- Gabillon, V.¹ Ghavamzadeh, M.² Lazaric, A.³

12
- 77956526578
- Efficient selection of multiple bandit arms: Theory and practice
- Kalyanakrishnan, S. and Stone, P. Efficient selection of multiple bandit arms: Theory and practice. In Proceedings of the 27th International Conference on Machine Learning (ICML 2010), pp. 511-518, 2010.
- (2010) Proceedings of the 27th International Conference on Machine Learning (ICML 2010) , pp. 511-518
- Kalyanakrishnan, S.¹ Stone, P.²

13
- 84867131498
- PAC subset selection in stochastic multi-armed bandits
- Kalyanakrishnan, S., Tewari, A., Auer, P., and Stone, P. PAC subset selection in stochastic multi-armed bandits. In Proceedings of the 29th International Conference on Machine Learning, ICML 2012, 2012.
- (2012) Proceedings of the 29th International Conference on Machine Learning, ICML 2012
- Kalyanakrishnan, S.¹ Tewari, A.² Auer, P.³ Stone, P.⁴

14
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, Tze Leung and Robbins, Herbert. Asymptotically efficient adaptive allocation rules. Advances in applied mathematics, 6(1):4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , Issue.1 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

15
- 30044441333
- The sample complexity of exploration in the multi-armed bandit problem
- Mannor, S. and Tsitsiklis, J.N. The sample complexity of exploration in the multi-armed bandit problem. The Journal of Machine Learning Research, 5:623-648, 2004.
- (2004) The Journal of Machine Learning Research , vol.5 , pp. 623-648
- Mannor, S.¹ Tsitsiklis, J.N.²

16
- 70049106076
- Bandits for taxonomies: A model based approach
- Pandey, S., Agarwal, D., Chakrabarti, D., and Josifovski, V. Bandits for taxonomies: A model based approach. In In Proceedings of the SIAM International Conference on Data Mining. SDM, 2007.
- Proceedings of the SIAM International Conference on Data Mining. SDM, 2007
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.