SCOPUS 정보 검색 플랫폼

Journal of Machine Learning Research

Volumn 23, Issue , 2012, Pages

Open problem: Regret bounds for thompson sampling

(2) Li, Lihong a Chapelle, Olivier a,b

a YAHOO RESEARCH (United States)

b Criteo (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; SOFTWARE ENGINEERING;

BALANCING EXPLORATION AND EXPLOITATIONS; BAYESIAN APPROACHES; EMPIRICAL STUDIES; FINDING SOLUTIONS; PARAMETRIC MODELS; REAL-WORLD PROBLEM; REGRET BOUNDS; THOMPSON SAMPLINGS;

BAYESIAN NETWORKS;

EID: 84900037074 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Conference Paper

Times cited : (1)

References (13)

1
- 84898972474
- Contextual bandit learning under the realizability assumption
- A. Agarwal, M. Dud́ik, S. Kale, J. Langford, and R. E. Schapire. Contextual bandit learning under the realizability assumption. In AISTATS, 2012.
- (2012) AISTATS
- Agarwal, A.¹ Dud́ik, M.² Kale, S.³ Langford, J.⁴ Schapire, R.E.⁵

2
- 84900015267
- CoRR, abs/1111.1797
- S. Agrawal and N. Goyal. Analysis of Thompson sampling for the multi-armed bandit problem. CoRR, abs/1111.1797, 2011.
- (2011) Analysis of Thompson Sampling for the Multi-armed Bandit Problem
- Agrawal, S.¹ Goyal, N.²

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002.
- (2002) SIAM Journal on Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

4
- 84875736450
- An empirical evaluation of thompson sampling
- O. Chapelle and L. Li. An empirical evaluation of Thompson sampling. In Advances in Neural Information Processing Systems 24, pages 2249-2257, 2012.
- (2012) Advances in Neural Information Processing Systems , vol.24 , pp. 2249-2257
- Chapelle, O.¹ Li, L.²

5
- 80053154335
- Efficient optimal learning for contextual bandits
- M. Dud́ik, D. Hsu, S. Kale, N. Karampatziakis, J. Langford, L. Reyzin, and T. Zhang. Efficient optimal learning for contextual bandits. In UAI, pages 169-178, 2011.
- (2011) UAI , pp. 169-178
- Dud́ik, M.¹ Hsu, D.² Kale, S.³ Karampatziakis, N.⁴ Langford, J.⁵ Reyzin, L.⁶ Zhang, T.⁷

6
- 77956543367
- Web-scale bayesian click-through rate prediction for sponsored search advertising in microsoft's bing search engine
- T. Graepel, J. Q. Candela, T. Borchert, and R. Herbrich. Web-scale Bayesian click-through rate prediction for sponsored search advertising in Microsoft's Bing search engine. In ICML, pages 13-20, 2010.
- (2010) ICML , pp. 13-20
- Graepel, T.¹ Candela, J.Q.² Borchert, T.³ Herbrich, R.⁴

7
- 78549244167
- Solving two-armed bernoulli bandit problems using a bayesian learning automaton
- O.-C. Granmo. Solving two-armed bernoulli bandit problems using a bayesian learning automaton. Int'l Journal of Intellient Computing and Cybernetics, 3(2):207-234, 2010.
- (2010) Int'l Journal of Intellient Computing and Cybernetics , vol.3 , Issue.2 , pp. 207-234
- Granmo, O.-C.¹

8
- 0002899547
- Asymptotically efficient adaptive allocation rules
- T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

9
- 85162018594
- The epoch-greedy algorithm for contextual multi-armed bandits
- J. Langford and T. Zhang. The epoch-greedy algorithm for contextual multi-armed bandits. In Advances in Neural Information Processing Systems 20, pages 1096-1103, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 1096-1103
- Langford, J.¹ Zhang, T.²

10
- 84860647553
- Simulation studies in optimistic bayesian sampling in contextual-bandit problems
- Univ. of Bristol
- B. C. May and D.S. Leslie. Simulation studies in optimistic Bayesian sampling in contextual-bandit problems. Technical Report 11:02, Dept. of Mathematics, Univ. of Bristol, 2011.
- (2011) Technical Report 11:02, Dept. of Mathematics
- May, B.C.¹ Leslie, D.S.²

11
- 84860620509
- Optimistic bayesian sampling in contextual-bandit problems
- Univ. of Bristol
- B. C. May, N. Korda, A. Lee, and D.S. Leslie. Optimistic Bayesian sampling in contextual-bandit problems. Technical Report 11:01, Dept. of Mathematics, Univ. of Bristol, 2011.
- (2011) Technical Report 11:01, Dept. of Mathematics
- May, B.C.¹ Korda, N.² Lee, A.³ Leslie, D.S.⁴

12
- 78650505735
- A modern bayesian look at the multi-armed bandit
- S. Scott. A modern Bayesian look at the multi-armed bandit. Applied Stochastic Models in Business and Industry, 26:639-658, 2010.
- (2010) Applied Stochastic Models in Business and Industry , vol.26 , pp. 639-658
- Scott, S.¹

13
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- W. R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
- (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
- Thompson, W.R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.