-
1
-
-
80053151768
-
Graphical models for bandit problems
-
Amin, K., Kearns, M., and Syed, U. Graphical models for bandit problems. In Proceedings of the 27th Annual Conference on Uncertainty in Artificial Intelligence (UAI), 2011a.
-
Proceedings of the 27th Annual Conference on Uncertainty in Artificial Intelligence (UAI), 2011a
-
-
Amin, K.1
Kearns, M.2
Syed, U.3
-
2
-
-
84897500833
-
Bandits, query learning, and the haystack dimension
-
Amin, K., Kearns, M., and Syed, U. Bandits, query learning, and the haystack dimension. In Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011b.
-
Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011b
-
-
Amin, K.1
Kearns, M.2
Syed, U.3
-
3
-
-
38049040954
-
Improved rates for the stochastic continuum-armed bandit problem
-
Auer, Peter, Ortner, Ronald, and Szepesvári, Csaba. Improved rates for the stochastic continuum-armed bandit problem. In 20th Conference on Learning Theory (COLT), pp. 454-468, 2007.
-
(2007)
In 20th Conference on Learning Theory (COLT)
, pp. 454-468
-
-
Auer, P.1
Ortner, R.2
Szepesvári, C.3
-
4
-
-
70350664424
-
The offset tree for learning with partial labels
-
Beygelzimer, Alina and Langford, John. The offset tree for learning with partial labels. In KDD, pp. 129-138, 2009.
-
(2009)
KDD
, pp. 129-138
-
-
Beygelzimer, A.1
Langford, J.2
-
5
-
-
80053144086
-
Contextual bandit algorithms with supervised learning guarantees
-
Beygelzimer, Alina, Langford, John, Li, Lihong, Reyzin, Lev, and Schapire, Robert E. Contextual bandit algorithms with supervised learning guarantees. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS), 2011.
-
Proceedings of the 14th International Conference on Artificial Intelligence and Statistics (AISTATS), 2011
-
-
Beygelzimer, A.1
Langford, J.2
Li, L.3
Reyzin, L.4
Schapire, R.E.5
-
6
-
-
77952027689
-
Online optimization in X-armed bandits
-
Bubeck, Sébastien, Munos, Rémi, Stoltz, Gilles, and Szepesvári, Csaba. Online optimization in X-armed bandits. In NIPS, pp. 201-208, 2008.
-
(2008)
NIPS
, pp. 201-208
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
Szepesvári, C.4
-
7
-
-
57049185311
-
Multi-armed bandits in metric spaces
-
Kleinberg, Robert, Slivkins, Aleksandrs, and Upfal, Eli. Multi-armed bandits in metric spaces. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC), pp. 681-690, New York, NY, USA, 2008. ACM. ISBN 978-1-60558-047-0. doi: http://doi.acm.org/10.1145/1374376.1374475.
-
(2008)
Proceedings of the 40th Annual ACM Symposium on Theory of Computing (STOC)
, pp. 681-690
-
-
Kleinberg, R.1
Slivkins, A.2
Upfal, E.3
-
10
-
-
56449122733
-
Knows what it knows: A framework for self-aware learning
-
Li, L., Littman, M.L., and Walsh, T.J. Knows what it knows: a framework for self-aware learning. In Proceedings of the 25th International Conference on Machine Learning (ICML), pp. 568-575, 2008.
-
(2008)
Proceedings of the 25th International Conference on Machine Learning (ICML)
, pp. 568-575
-
-
Li, L.1
Littman, M.L.2
Walsh, T.J.3
-
11
-
-
79958797519
-
Knows what it knows: A framework for self-aware learning
-
Li, L., Littman, M.L., Walsh, T.J., and Strehl, A.L. Knows what it knows: a framework for self-aware learning. Machine Learning, 82(3):399-443, 2011.
-
(2011)
Machine Learning
, vol.82
, Issue.3
, pp. 399-443
-
-
Li, L.1
Littman, M.L.2
Walsh, T.J.3
Strehl, A.L.4
-
12
-
-
77954641643
-
A contextual-bandit approach to personalized news article recommendation
-
Li, Lihong, Chu, Wei, Langford, John, and Schapire, Robert E. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th International World Wide Web Conference, 2010.
-
Proceedings of the 19th International World Wide Web Conference, 2010
-
-
Li, L.1
Chu, W.2
Langford, J.3
Schapire, R.E.4
-
13
-
-
84898452145
-
Contextual multi-armed bandits
-
Lu, Tyler, Pal, David, and Pal, Martin. Contextual multi-armed bandits. In Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), 2010.
-
Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), 2010
-
-
Lu, T.1
Pal, D.2
Pal, M.3
-
14
-
-
85162044870
-
Trading off mistakes and don't-know predictions
-
Sayedi, A., Zadimoghaddam, M., and Blum, A. Trading off mistakes and don't-know predictions. In NIPS, 2010.
-
(2010)
NIPS
-
-
Sayedi, A.1
Zadimoghaddam, M.2
Blum, A.3
-
17
-
-
79958846996
-
Exploring compact reinforcement-learning representations with linear regression
-
AUAI Press
-
Walsh, T.J., Szita, I., Diuk, C., and Littman, M.L. Exploring compact reinforcement-learning representations with linear regression. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI), pp. 591-598. AUAI Press, 2009.
-
(2009)
Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (UAI)
, pp. 591-598
-
-
Walsh, T.J.1
Szita, I.2
Diuk, C.3
Littman, M.L.4
-
18
-
-
84863381440
-
Algorithms for infinitely many-armed bandits
-
Wang, Yizao, Audibert, Jean-Yves, and Munos, Rémi. Algorithms for infinitely many-armed bandits. In Advances in Neural Information Processing Systems 21 (NIPS), pp. 1729-1736, 2008.
-
(2008)
Advances in Neural Information Processing Systems 21 (NIPS)
, pp. 1729-1736
-
-
Wang, Y.1
Audibert, J.-Y.2
Munos, R.3