SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

31st International Conference on Machine Learning, ICML 2014

Volumn 5, Issue , 2014, Pages 3611-3619

Taming the monster: A fast and simple algorithm for contextual bandits

(6) Agarwal, Alekh a Hsu, Daniel b Kale, Satyen c Langford, John a Li, Lihong a Schapire, Robert E d

a MICROSOFT RESEARCH (United States)

b Och Spine at New York Presbyterian Hospitals (United States)

c YAHOO RESEARCH (United States)

d PRINCETON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS;

CONTEXTUAL BANDITS; COST-SENSITIVE CLASSIFICATION; LEARNING PROBLEM; OPTIMAL REGRET; PROOF OF CONCEPT; SIMPLE ALGORITHM; STATISTICAL PERFORMANCE;

LEARNING ALGORITHMS;

EID: 84919787147 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (213)

References (19)

1
- 84919804450
- Taming the monster: A fast and simple algorithm for contextual bandits
- abs/1402.0555
- Agarwal, Alekh, Hsu, Daniel, Kale, Satyen, Langford, John, Li, Lihong, and Schapire, Robert E. Taming the monster: A fast and simple algorithm for contextual bandits. CoRR, abs/1402.0555, 2014.
- (2014) CoRR
- Alekh, A.¹ Daniel, H.² Satyen, K.³ John, L.⁴ Lihong, L.⁵ Schapire Robert, E.⁶

2
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- Auer, Peter. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3:397-422, 2002.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 397-422
- Peter, A.¹

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- Auer, Peter, Cesa-Bianchi, Nicolo, Freund, Yoav, and Schapire, Robert E. The nonstochastic multiarmed bandit problem. SIAM Journal of Computing, 32(l):48-77, 2002.
- (2002) SIAM Journal of Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

4
- 70350664424
- The offset tree for learning with partial labels
- Beygelzimer, Alina and Langford, John. The offset tree for learning with partial labels. In KDD, 2009.
- (2009) KDD
- Beygelzimer, A.¹ Langford, J.²

5
- 80053144086
- Contextual bandit algorithms with supervised learning guarantees
- Beygelzimer, AUna, Langford, John, Li, Lihong, Reyzin, Lev, and Schapire, Robert E. Contextual bandit algorithms with supervised learning guarantees. In AISTATS, 2011.
- (2011) AISTATS
- Beygelzimer, A.¹ Langford, J.² Li, L.³ Reyzin, L.⁴ Schapire, R.E.⁵

6
- 0033280893
- Beating the holdout: Bounds for k-fold and progressive cross-validation
- Blum, Avrim, Kalai, Adam, and Langford, John. Beating the holdout: Bounds for k-fold and progressive cross-validation. In COLT, 1999.
- (1999) COLT
- Blum, A.¹ Kalai, A.² Langford, J.³

7
- 85162416700
- An empirical evaluation of Thompson sampling
- Chapelle, Olivier and Li, Lihong. An empirical evaluation of Thompson sampling. In NIPS, 2011.
- (2011) NIPS
- Chapelle, O.¹ Li, L.²

8
- 84860620518
- Contextual bandits with linear payoff functions
- Chu, Wei, Li, Lihong, Reyzin, Lev, and Schapire, Robert E. Contextual bandits with linear payoff functions. In AISTATS, 2011.
- (2011) AISTATS
- Chu, W.¹ Li, L.² Reyzin, L.³ Schapire, R.E.⁴

9
- 80053154335
- Efficient optimal learning for contextual bandits
- Dudik, Miroslav, Hsu, Daniel, Kale, Satyen, Karampatzi-akis, Nikos, Langford, John, Reyzin, Lev, and Zhang, Tong. Efficient optimal learning for contextual bandits. In UAI, 2011a.
- (2011) UAI
- Dudik, M.¹ Hsu, D.² Kale, S.³ Karampatzi-Akis, N.⁴ Langford, J.⁵ Reyzin, L.⁶ Zhang, T.⁷

10
- 80053456223
- Doubly robust policy evaluation and learning
- Dudik, Miroslav, Langford, John, and Li, Lihong. Doubly robust policy evaluation and learning. In ICML, 2011b.
- (2011) ICML
- Dudik, M.¹ Langford, J.² Li, L.³

11
- 0031122905
- Predicting neariy as well as the best pruning of a decision tree
- Helmbold, David P. and Schapire, Robert E. Predicting neariy as well as the best pruning of a decision tree. Machine Learning, 27(l):51-68, 1997.
- (1997) Machine Learning , vol.27 , Issue.1 , pp. 51-68
- Helmbold, D.P.¹ Schapire, R.E.²

12
- 84919804447
- January
- Langford, John. Interactive machine learning, January 2014. URL http://hunch.net/-jl/projects/interactive/index.html.
- (2014) Interactive Machine Learning
- Langford, J.¹

13
- 77956144722
- The epoch-greedy algorithm for contextual multi-armed bandits
- Langford, John and Zhang, Tong. The epoch-greedy algorithm for contextual multi-armed bandits. In NIPS, 2007.
- (2007) NIPS
- Langford, J.¹ Zhang, T.²

14
- 84876811202
- Rev I: A new benchmark collection for text categorization research
- Lewis, David D, Yang, Yiming, Rose, Tony G, and Li, Fan. Rev I: A new benchmark collection for text categorization research. The Journal of Machine Learning Research, 5:361-397, 2004.
- (2004) The Journal of Machine Learning Research , vol.5 , pp. 361-397
- Lewis, D.D.¹ Yang, Y.² Rose, T.G.³ Li, F.⁴

15
- 84919804446
- Generalized Thompson sampling for contextual bandits
- abs/1310.7163
- Li, Lihong. Generalized Thompson sampling for contextual bandits. CoRR, abs/1310.7163, 2013.
- (2013) CoRR
- Li, L.¹

16
- 77954641643
- A contextual-bandit approach to personalized news article recommendation
- Li, Lihong, Chu, Wei, Langford, John, and Schapire, Robert E. A contextual-bandit approach to personalized news article recommendation. In WWW, 2010.
- (2010) WWW
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

17
- 84898068653
- Tighter bounds for multi-armed bandits with expert advice
- McMahan, H. Brendan and Streeter, Matthew. Tighter bounds for multi-armed bandits with expert advice. In COLT, 2009.
- (2009) COLT
- McMahan, H.B.¹ Streeter, M.²

18
- 0004102479
- MIT Press
- Sutton, Richard S. and Barto, Andrew G. Reinforcement learning, an introduction. MIT Press, 1998.
- (1998) Reinforcement Learning, An Introduction
- Sutton, R.S.¹ Barto, A.G.²

19
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- Thompson, William R. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
- (1933) Biometrika , vol.25 , Issue.3-4 , pp. 285-294
- Thompson, W.R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.