SCOPUS 정보 검색 플랫폼

Journal of Machine Learning Research

Volumn 23, Issue , 2012, Pages

Analysis of thompson sampling for the multi-armed bandit problem

(2) Agrawal, Shipra a Goyal, Navin a

a MICROSOFT RESEARCH (United States)

Author keywords

Bayesian algorithm; Multi armed bandit; Online learning; Thompson sampling

Indexed keywords

DECISION THEORY; LEARNING ALGORITHMS; OPTIMIZATION; PROBABILITY; STOCHASTIC SYSTEMS;

BAYESIAN ALGORITHMS; CONSTANT FACTORS; EXPLORATION/EXPLOITATION; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; ONLINE LEARNING; SEQUENTIAL DECISIONS; THOMPSON SAMPLINGS;

STATISTICS;

EID: 84874084136 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Conference Paper

Times cited : (425)

References (14)

1
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

2
- 85162416700
- An empirical evaluation of thompson sampling
- O. Chapelle and L. Li. An empirical evaluation of thompson sampling. In NIPS, 2011.
- (2011) NIPS
- Chapelle, O.¹ Li, L.²

3
- 84863920694
- The KL-UCB algorithm for bounded stochastic bandits and beyond
- A. Garivier and O. Capṕe. The KL-UCB algorithm for bounded stochastic bandits and beyond. In Conference on Learning Theory (COLT), 2011.
- (2011) Conference on Learning Theory (COLT)
- Garivier, A.¹ Capṕe, O.²

4
- 84891584370
- Wiley Interscience Series in Systems and Optimization. John Wiley and Son
- J. C. Gittins. Multi-armed Bandit Allocation Indices. Wiley Interscience Series in Systems and Optimization. John Wiley and Son, 1989.
- (1989) Multi-armed Bandit Allocation Indices
- Gittins, J.C.¹

5
- 77956543367
- Web-scale bayesian click-through rate prediction for sponsored search advertising in microsoft's bing search engine
- T. Graepel, J. Q. Candela, T. Borchert, and R. Herbrich. Web-scale bayesian click-through rate prediction for sponsored search advertising in microsoft's bing search engine. In ICML, pages 13-20, 2010.
- (2010) ICML , pp. 13-20
- Graepel, T.¹ Candela, J.Q.² Borchert, T.³ Herbrich, R.⁴

6
- 78549244167
- Solving two-armed bernoulli bandit problems using a bayesian learning automaton
- O.-C. Granmo. Solving two-armed bernoulli bandit problems using a bayesian learning automaton. International Journal of Intelligent Computing and Cybernetics (IJICC), 3(2):207-234, 2010.
- (2010) International Journal of Intelligent Computing and Cybernetics (IJICC) , vol.3 , Issue.2 , pp. 207-234
- Granmo, O.-C.¹

7
- 0011027964
- Monotone convergence of binomial probabilities and a generalization of ramanujan's equation
- K. Jogdeo and S. M. Samuels. Monotone Convergence of Binomial Probabilities and A Generalization of Ramanujan's equation. The Annals of Mathematical Statistics, (4):1191-1195, 1968.
- (1968) The Annals of Mathematical Statistics , Issue.4 , pp. 1191-1195
- Jogdeo, K.¹ Samuels, S.M.²

8
- 84867888879
- On bayesian upper confidence bounds for bandit problems
- E. Kaufmann, O. Capṕe, and A. Garivier. On bayesian upper confidence bounds for bandit problems. In Fifteenth International Conference on Artificial Intelligence and Statistics (AISTAT), 2012.
- (2012) Fifteenth International Conference on Artificial Intelligence and Statistics (AISTAT)
- Kaufmann, E.¹ Capṕe, O.² Garivier, A.³

9
- 0002899547
- Asymptotically efficient adaptive allocation rules
- T. L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

10
- 84874038864
- Finite-time analysis of multi-armed bandits problems with kullback-leibler divergences
- O.-A. Maillard, R.Munos, and G. Stoltz. Finite-time analysis of multi-armed bandits problems with kullback-leibler divergences. In Conference on Learning Theory (COLT), 2011.
- (2011) Conference on Learning Theory (COLT)
- Maillard, O.-A.¹ Munos, R.² Stoltz, G.³

11
- 84860647553
- Simulation studies in optimistic bayesian sampling in contextual-bandit problems
- University of Bristol
- B. C. May and D. S. Leslie. Simulation studies in optimistic bayesian sampling in contextual-bandit problems. Technical Report 11:02, Statistics Group, Department of Mathematics, University of Bristol, 2011.
- (2011) Technical Report 11:02, Statistics Group, Department of Mathematics
- May, B.C.¹ Leslie, D.S.²

12
- 84860620509
- Optimistic bayesian sampling in contextual-bandit problems
- University of Bristol
- B. C. May, N. Korda, A. Lee, and D. S. Leslie. Optimistic bayesian sampling in contextual-bandit problems. Technical Report 11:01, Statistics Group, Department of Mathematics, University of Bristol, 2011.
- (2011) Technical Report 11:01, Statistics Group, Department of Mathematics
- May, B.C.¹ Korda, N.² Lee, A.³ Leslie, D.S.⁴

13
- 78650505735
- A modern bayesian look at the multi-armed bandit
- S. Scott. A modern bayesian look at the multi-armed bandit. Applied Stochastic Models in Business and Industry, 26:639-658, 2010.
- (2010) Applied Stochastic Models in Business and Industry , vol.26 , pp. 639-658
- Scott, S.¹

14
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- W. R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
- (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
- Thompson, W.R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.