SCOPUS Information Retrieval Platform

Volume , Issue PART 1, 2013, Pages 588-596

Large-scale bandit problems and KWIK learning

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; SOFTWARE ENGINEERING;

ACTION SPACES; BANDIT PROBLEMS; IMPOSSIBILITY RESULTS; MULTI ARMED BANDIT;

SUPERVISED LEARNING;

EID: 84897541181 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (18)

1
- 80053151768
- Graphical models for bandit problems
- Amin, K., Kearns, M., and Syed, U. Graphical models for bandit problems. In Proceedings of the 27th Annual Conference on Uncertainty in Artificial Intelligence (UAI), 2011a.
- Proceedings of the 27th Annual Conference on Uncertainty in Artificial Intelligence (UAI), 2011a
- Amin, K.¹ Kearns, M.² Syed, U.³

2
- 84897500833
- Bandits, query learning, and the haystack dimension
- Amin, K., Kearns, M., and Syed, U. Bandits, query learning, and the haystack dimension. In Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011b.
- Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011b
- Amin, K.¹ Kearns, M.² Syed, U.³

3
- 38049040954
- Improved rates for the stochastic continuum-armed bandit problem
- Auer, Peter, Ortner, Ronald, and Szepesvári, Csaba. Improved rates for the stochastic continuum-armed bandit problem. In 20th Conference on Learning Theory (COLT), pp. 454-468, 2007.
- (2007) 20th Conference on Learning Theory (COLT) , pp. 454-468
- Auer, P.¹ Ortner, R.² Szepesvári, C.³

4
- 70350664424
- The offset tree for learning with partial labels
- Beygelzimer, Alina and Langford, John. The offset tree for learning with partial labels. In KDD, pp. 129-138, 2009.
- (2009) KDD , pp. 129-138
- Beygelzimer, A.¹ Langford, J.²

5
- 80053144086
- Contextual bandit algorithms with supervised learning guarantees
- Beygelzimer, Alina, Langford, John, Li, Lihong, Reyzin, Lev, and Schapire, Robert E. Contextual bandit algorithms with supervised learning guarantees. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS), 2011.
- Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS), 2011
- Beygelzimer, A.¹ Langford, J.² Li, L.³ Reyzin, L.⁴ Schapire, R.E.⁵

6
- 77952027689
- Online optimization in X-armed bandits
- Bubeck, Sébastien, Munos, Rémi, Stoltz, Gilles, and Szepesvári, Csaba. Online optimization in X-armed bandits. In NIPS, pp. 201-208, 2008.
- (2008) NIPS , pp. 201-208
- Bubeck, S.¹ Munos, R.² Stoltz, G.³ Szepesvári, C.⁴

8
- 77956144722
- The epoch-greedy algorithm for contextual multi-armed bandits
- Langford, John and Zhang, Tong. The epoch-greedy algorithm for contextual multi-armed bandits. In Advances in Neural Information Processing Systems 20 (NIPS), 2007.
- (2007) Advances in Neural Information Processing Systems 20 (NIPS)
- Langford, J.¹ Zhang, T.²

9
- 78649496546
- Reducing reinforcement learning to KWIK online regression
- Li, L. and Littman, M.L. Reducing reinforcement learning to KWIK online regression. Annals of Mathematics and Artificial Intelligence, 58(3):217-237, 2010.
- (2010) Annals of Mathematics and Artificial Intelligence , vol.58 , Issue.3 , pp. 217-237
- Li, L.¹ Littman, M.L.²

10
- 56449122733
- Knows what it knows: A framework for self-aware learning
- Li, L., Littman, M.L., and Walsh, T.J. Knows what it knows: a framework for self-aware learning. In Proceedings of the 25th International Conference on Machine Learning (ICML), pp. 568-575, 2008.
- (2008) Proceedings of the 25th International Conference on Machine Learning (ICML) , pp. 568-575
- Li, L.¹ Littman, M.L.² Walsh, T.J.³

11
- 79958797519
- Knows what it knows: A framework for self-aware learning
- Li, L., Littman, M.L., Walsh, T.J., and Strehl, A.L. Knows what it knows: a framework for self-aware learning. Machine Learning, 82(3):399-443, 2011.
- (2011) Machine Learning , vol.82 , Issue.3 , pp. 399-443
- Li, L.¹ Littman, M.L.² Walsh, T.J.³ Strehl, A.L.⁴

12
- 77954641643
- A contextual-bandit approach to personalized news article recommendation
- Li, Lihong, Chu, Wei, Langford, John, and Schapire, Robert E. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th International World Wide Web Conference, 2010.
- Proceedings of the 19th International World Wide Web Conference, 2010
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

13
- 84898452145
- Contextual multi-armed bandits
- Lu, Tyler, Pal, David, and Pal, Martin. Contextual multi-armed bandits. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), 2010.
- Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), 2010
- Lu, T.¹ Pal, D.² Pal, M.³

14
- 85162044870
- Trading off mistakes and don't-know predictions
- Sayedi, A., Zadimoghaddam, M., and Blum, A. Trading off mistakes and don't-know predictions. In NIPS, 2010.
- (2010) NIPS
- Sayedi, A.¹ Zadimoghaddam, M.² Blum, A.³

15
- 84892931731
- Contextual bandits with similarity information
- Slivkins, Aleksandrs. Contextual bandits with similarity information. In Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011.
- Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011
- Slivkins, A.¹

16
- 77955832538
- Online linear regression and its application to model-based reinforcement learning
- Strehl, Alexander and Littman, Michael L. Online linear regression and its application to model-based reinforcement learning. In Advances in Neural Information Processing Systems 20 (NIPS), 2007.
- (2007) Advances in Neural Information Processing Systems 20 (NIPS)
- Strehl, A.¹ Littman, M.L.²

18
- 84863381440
- Algorithms for infinitely many-armed bandits
- Wang, Yizao, Audibert, Jean-Yves, and Munos, Rémi. Algorithms for infinitely many-armed bandits. In Advances in Neural Information Processing Systems 21 (NIPS), pp. 1729-1736, 2008.
- (2008) Advances in Neural Information Processing Systems 21 (NIPS) , pp. 1729-1736
- Wang, Y.¹ Audibert, J.-Y.² Munos, R.³

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.