SCOPUS Information Search Platform

Volume 8188 LNAI, Issue PART 1, 2013, Pages 241-256

Greedy confidence pursuit: A pragmatic approach to multi-bandit optimization

Author keywords

[No Author keywords available]

Indexed keywords

BASELINE METHODS; HIGH CONFIDENCE; MULTI ARMED BANDIT; PRACTICAL PROBLEMS; PURSUIT PROBLEMS; LEARNING SYSTEMS; OPTIMIZATION; PROBLEM SOLVING

EID: 84886556435
PISSN: 03029743
EISSN: 16113349
Source Type: Book Series
DOI: 10.1007/978-3-642-40988-2_16
Document Type: Conference Paper

Times cited : (1)

References (16)

1
- 84886540275
- Analysis of Thompson sampling for the multi-armed bandit problem
- Agrawal, S., Goyal, N.: Analysis of Thompson sampling for the multi-armed bandit problem. In: COLT (2012)
- (2012) COLT
- Agrawal, S.¹ Goyal, N.²

2
- 84864970677
- Best arm identification in multi-armed bandits
- Audibert, J.-Y., Bubeck, S., Munos, R.: Best arm identification in multi-armed bandits. In: COLT (2010)
- (2010) COLT
- Audibert, J.-Y.¹ Bubeck, S.² Munos, R.³

3
- 0004218171
- Bandit Problems
- Berry, D.A., Fristedt, B.: Bandit Problems. Chapman and Hall Ltd. (1985)
- (1985) Chapman and Hall Ltd.
- Berry, D.A.¹ Fristedt, B.²

5
- 85162416700
- An empirical evaluation of Thompson sampling
- Chapelle, O., Li, L.: An empirical evaluation of Thompson sampling. In: Advances in Neural Information Processing Systems (2011)
- (2011) Advances in Neural Information Processing Systems
- Chapelle, O.¹ Li, L.²

6
- 80052250780
- Active learning for personalizing treatment
- Deng, K., Pineau, J., Murphy, S.: Active learning for personalizing treatment. In: IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (2011)
- IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (2011)
- Deng, K.¹ Pineau, J.² Murphy, S.³

7
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Even-Dar, E., Mannor, S., Mansour, Y.: Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research 7, 1079-1105 (2006)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
- Even-Dar, E.¹ Mannor, S.² Mansour, Y.³

8
- 85162482585
- Multi-bandit best arm identification
- Gabillon, V., Ghavamzadeh, M., Lazaric, A., Bubeck, S.: Multi-bandit best arm identification. In: Advances in Neural Information Processing Systems (2011)
- (2011) Advances in Neural Information Processing Systems
- Gabillon, V.¹ Ghavamzadeh, M.² Lazaric, A.³ Bubeck, S.⁴

9
- 77956526578
- Efficient selection of multiple bandit arms: Theory and practice
- Kalyanakrishnan, S., Stone, P.: Efficient selection of multiple bandit arms: Theory and practice. In: International Conference on Machine Learning (2010)
- International Conference on Machine Learning (2010)
- Kalyanakrishnan, S.¹ Stone, P.²

10
- 84867131498
- PAC subset selection in stochastic multi-armed bandits
- Kalyanakrishnan, S., Tewari, A., Auer, P., Stone, P.: PAC subset selection in stochastic multi-armed bandits. In: International Conference on Machine Learning (2012)
- International Conference on Machine Learning (2012)
- Kalyanakrishnan, S.¹ Tewari, A.² Auer, P.³ Stone, P.⁴

11
- 85029696856
- Open problem: Regret bounds for Thompson sampling
- Li, L., Chapelle, O.: Open problem: Regret bounds for Thompson sampling. In: COLT (2012)
- (2012) COLT
- Li, L.¹ Chapelle, O.²

12
- 77953631184
- The budgeted multi-armed bandit problem
- Madani, O., Lizotte, D.J., Greiner, R.: The budgeted multi-armed bandit problem. In: COLT (2004)
- (2004) COLT
- Madani, O.¹ Lizotte, D.J.² Greiner, R.³

13
- 30044441333
- The sample complexity of exploration in the multiarmed bandit problem
- Mannor, S., Tsitsiklis, J.N.: The sample complexity of exploration in the multiarmed bandit problem. Journal of Machine Learning Research 5, 623-648 (2004)
- (2004) Journal of Machine Learning Research , vol.5 , pp. 623-648
- Mannor, S.¹ Tsitsiklis, J.N.²

14
- 84898961155
- Learning to optimize via posterior sampling
- Russo, D., Van Roy, B.: Learning to optimize via posterior sampling. arXiv:1301.2609v1 [cs.LG] (2013)
- (2013) arXiv:1301.2609v1 [cs.LG]
- Russo, D.¹ Van Roy, B.²

15
- 78650505735
- A modern Bayesian look at the multi-armed bandit
- Scott, S.L.: A modern Bayesian look at the multi-armed bandit. Applied Stochastic Models in Business and Industry 26, 639-658 (2010)
- (2010) Applied Stochastic Models in Business and Industry , vol.26 , pp. 639-658
- Scott, S.L.¹

16
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- Thompson, W.R.: On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25(3-4), 285-294 (1933)
- (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
- Thompson, W.R.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.