SCOPUS Information Search Platform

Volume , Issue , 2011, Pages 169-178

Efficient optimal learning for contextual bandits

Author keywords

[No Author keywords available]

Indexed keywords

CLASSIFICATION RULES; CONTEXTUAL BANDITS; COST SENSITIVE CLASSIFICATIONS; FEEDBACK DELAY; ON-LINE SETTING; OPTIMAL REGRET; RUNNING TIME;

ARTIFICIAL INTELLIGENCE;

EID: 80053154335 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (204)

References (16)

1
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. Journal of Machine Learning Research, 3:397-422, 2002.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 397-422
- Auer, P.¹

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002b.
- (2002) SIAM Journal on Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

4
- 78650085692
- Adaptive online gradient descent
- P. L. Bartlett, E. Hazan, and A. Rakhlin. Adaptive online gradient descent. In NIPS, 2007.
- (2007) NIPS
- Bartlett, P.L.¹ Hazan, E.² Rakhlin, A.³

5
- 80053156815
- Error correcting tournaments
- Alina Beygelzimer, John Langford, and Pradeep Ravikumar. Error correcting tournaments. In ALT, 2009.
- (2009) ALT
- Beygelzimer, A.¹ Langford, J.² Ravikumar, P.³

6
- 80053144086
- Contextual bandit algorithms with supervised learning guarantees
- Alina Beygelzimer, John Langford, Lihong Li, Lev Reyzin, and Robert E. Schapire. Contextual bandit algorithms with supervised learning guarantees. In AISTATS, 2011.
- (2011) AISTATS
- Beygelzimer, A.¹ Langford, J.² Li, L.³ Reyzin, L.⁴ Schapire, R.E.⁵

7
- 33745295134
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Eyal Even-Dar, Shie Mannor, and Yishay Mansour. Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems. Journal of Machine Learning Research, 7:1079-1105, 2006. (Pubitemid 43938989)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1079-1105
- Even-Dar, E.¹ Mannor, S.² Mansour, Y.³

8
- 0002384441
- On tail probabilities for martingales
- David A. Freedman. On tail probabilities for martingales. Annals of Probability, 3(1):100-118, 1975.
- (1975) Annals of Probability , vol.3 , Issue.1 , pp. 100-118
- Freedman, D.A.¹

9
- 0031211090
- A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
- Y. Freund and R. E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1): 119-139, 1997. (Pubitemid 127433398)
- (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
- Freund, Y.¹ Schapire, R.E.²

10
- 84864059297
- From batch to transductive online learning
- Sham M. Kakade and Adam Kalai. From batch to transductive online learning. In NIPS, 2005.
- (2005) NIPS
- Kakade, S.M.¹ Kalai, A.²

12
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Tze Leung Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

13
- 80052488062
- Slow learners are fast
- J. Langford, A. Smola, and M. Zinkevich. Slow learners are fast. In NIPS, 2009.
- (2009) NIPS
- Langford, J.¹ Smola, A.² Zinkevich, M.³

14
- 77956144722
- The epoch-greedy algorithm for contextual multi-armed bandits
- John Langford and Tong Zhang. The epoch-greedy algorithm for contextual multi-armed bandits. In NIPS, 2007.
- (2007) NIPS
- Langford, J.¹ Zhang, T.²

15
- 84972513554
- On general minimax theorems
- Maurice Sion. On general minimax theorems. Pacific J. Math., 8(1):171-176, 1958.
- (1958) Pacific J. Math. , vol.8 , Issue.1 , pp. 171-176
- Sion, M.¹

16
- 77956501313
- Gaussian process optimization in the bandit setting: No regret and experimental design
- Niranjan Srinivas, Andreas Krause, Sham Kakade, and Matthias Seeger. Gaussian process optimization in the bandit setting: No regret and experimental design. In ICML, 2010.
- (2010) ICML
- Srinivas, N.¹ Krause, A.² Kakade, S.³ Seeger, M.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.