-
1
-
-
0345224411
-
The continuum-armed bandit problem
-
Rajeev Agrawal. The continuum-armed bandit problem. SIAM J. Control Optim., 33(6):1926-1951, 1995.
-
(1995)
SIAM J. Control Optim.
, vol.33
, Issue.6
, pp. 1926-1951
-
-
Agrawal, R.1
-
2
-
-
0023453059
-
Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. Rewards
-
Venkatachalam Anantharam, Pravin Varaiya, and Jean Walrand. Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: i.i.d. rewards. IEEE Transactions on Automatic Control, AC-32(11):968-976, 1987.
-
(1987)
IEEE Transactions on Automatic Control
, vol.AC-32
, Issue.11
, pp. 968-976
-
-
Anantharam, V.1
Varaiya, P.2
Walrand, J.3
-
5
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002a.
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
6
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002b.
-
(2002)
SIAM J. Comput.
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
9
-
-
33847255926
-
Dynamic assortment with demand learning for seasonal consumer goods
-
Felipe Caro and Jérémie Gallien. Dynamic assortment with demand learning for seasonal consumer goods. Management Science, 53:276-292, 2007.
-
(2007)
Management Science
, vol.53
, pp. 276-292
-
-
Caro, F.1
Gallien, J.2
-
11
-
-
84897515317
-
Combinatorial multi-armed bandit: General framework, results, and applications
-
Wei Chen, Yajun Wang, and Yang Yuan. Combinatorial multi-armed bandit: General framework, results, and applications. In Proceedings of the 30th International Conference on Machine Learning (ICML), 2013.
-
(2013)
Proceedings of the 30th International Conference on Machine Learning (ICML)
-
-
Chen, W.1
Wang, Y.2
Yuan, Y.3
-
13
-
-
84867858040
-
Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
-
Yi Gai, Bhaskar Krishnamachari, and Rahul Jain. Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations. IEEE/ACM Transactions on Networking, 20, 2012.
-
(2012)
IEEE/ACM Transactions on Networking
, vol.20
-
-
Gai, Y.1
Krishnamachari, B.2
Jain, R.3
-
17
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963.
-
(1963)
Journal of the American Statistical Association
, vol.58
, Issue.301
, pp. 13-30
-
-
Hoeffding, W.1
-
18
-
-
80053454111
-
Playing games with approximation algorithms
-
Sham M. Kakade, Adam Tauman Kalai, and Katrina Ligett. Playing games with approximation algorithms. SIAM Journal on Computing, 39(3):1088-1106, 2009.
-
(2009)
SIAM Journal on Computing
, vol.39
, Issue.3
, pp. 1088-1106
-
-
Kakade, S.M.1
Kalai, A.T.2
Ligett, K.3
-
22
-
-
84923299004
-
Matroid bandits: Fast combinatorial optimization with learning
-
Branislav Kveton, Zheng Wen, Azin Ashkan, Hoda Eydgahi, and Brian Eriksson. Matroid bandits: Fast combinatorial optimization with learning. In Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence (UAI), 2014.
-
(2014)
Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence (UAI)
-
-
Kveton, B.1
Wen, Z.2
Ashkan, A.3
Eydgahi, H.4
Eriksson, B.5
-
23
-
-
84965179449
-
Tight regret bounds for stochastic combinatorial semi-bandits
-
to appear, with arxiv version arXiv:1410.0949
-
Branislav Kveton, Zheng Wen, Azin Ashkan, and Csaba Szepesvári. Tight regret bounds for stochastic combinatorial semi-bandits. In Proceedings of the 18th International Conference on Artificial Intelligence and Statistics, 2015. to appear, with arxiv version arXiv:1410.0949.
-
(2015)
Proceedings of the 18th International Conference on Artificial Intelligence and Statistics
-
-
Kveton, B.1
Wen, Z.2
Ashkan, A.3
Szepesvári, C.4
-
24
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Tze Leung Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
25
-
-
84919902752
-
Combinatorial partial monitoring game with linear feedback and its applications
-
Tian Lin, Bruno Abrahao, Robert Kleinberg, John C. S. Lui, and Wei Chen. Combinatorial partial monitoring game with linear feedback and its applications. In Proceedings of the 31st International Conference on Machine Learning (ICML), 2014.
-
(2014)
Proceedings of the 31st International Conference on Machine Learning (ICML)
-
-
Lin, T.1
Abrahao, B.2
Kleinberg, R.3
Lui John, C.S.4
Chen, W.5
-
26
-
-
80051636024
-
Logarithmic weak regret of non-bayesian restless multi-armed bandit
-
Haoyang Liu, Keqin Liu, and Qing Zhao. Logarithmic weak regret of non-bayesian restless multi-armed bandit. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2011.
-
(2011)
Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Liu, H.1
Liu, K.2
Zhao, Q.3
-
30
-
-
0000095809
-
An analysis of the approximations for maximizing submodular set functions
-
G. L. Nemhauser, L. A. Wolsey, and M. L. Fisher. An analysis of the approximations for maximizing submodular set functions. Mathematical Programming, 14(1):265-294, 1978.
-
(1978)
Mathematical Programming
, vol.14
, Issue.1
, pp. 265-294
-
-
Nemhauser, G.L.1
Wolsey, L.A.2
Fisher, M.L.3
|