-
1
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T. Lai and H. Robbins, "Asymptotically efficient adaptive allocation rules," Advances in Applied Mathematics, vol. 6, no. 1, pp. 4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, Issue.1
, pp. 4-22
-
-
Lai, T.1
Robbins, H.2
-
2
-
-
0023450663
-
Asymptotically efficient allocation rules for the multi-armed bandit problem with multiple plays - Part ii: Markovian rewards
-
V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multi-armed bandit problem with multiple plays - part ii: Markovian rewards," IEEE Transactions on Automatic Control, vol. 32, no. 11, pp. 977-982, 1987.
-
(1987)
IEEE Transactions on Automatic Control
, vol.32
, Issue.11
, pp. 977-982
-
-
Anantharam, V.1
Varaiya, P.2
Walrand, J.3
-
3
-
-
0000616723
-
Sample mean based index policies with (O(log n)) regret for the multi-armed bandit problem
-
R. Agrawal, "Sample mean based index policies with (O(log n)) regret for the multi-armed bandit problem," Advances in Applied Probability, Vol. 27, No. 4, pp. 1054-1078, 1995.
-
(1995)
Advances in Applied Probability
, vol.27
, Issue.4
, pp. 1054-1078
-
-
Agrawal, R.1
-
4
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Machine Learning, vol. 47, no. 2, pp. 235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, Issue.2
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
5
-
-
84867858040
-
Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
-
to appear
-
Y. Gai, B. Krishnamachari, and R. Jain, "Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations," IEEE/ACM Trans. on Networking, to appear, 2012.
-
(2012)
IEEE/ACM Trans. on Networking
-
-
Gai, Y.1
Krishnamachari, B.2
Jain, R.3
-
6
-
-
79953827701
-
Distributed learning in multi-armed bandit with multiple players
-
November
-
K. Liu and Q. Zhao, "Distributed learning in multi-armed bandit with multiple players," IEEE Transactions on Signal Processing, vol. 58, pp. 5667-5681, November, 2010.
-
(2010)
IEEE Transactions on Signal Processing
, vol.58
, pp. 5667-5681
-
-
Liu, K.1
Zhao, Q.2
-
7
-
-
79953194834
-
Distributed algorithms for learning and cognitive medium access with logarithmic regret
-
April
-
A. Anandkumar, N. Michael, A. Tang, and A. Swami, "Distributed algorithms for learning and cognitive medium access with logarithmic regret," IEEE JSAC on Advances in Cognitive Radio Networking and Communications, April, 2011.
-
(2011)
IEEE JSAC on Advances in Cognitive Radio Networking and Communications
-
-
Anandkumar, A.1
Michael, N.2
Tang, A.3
Swami, A.4
-
8
-
-
34249831790
-
Auction algorithms for network flow problems: A tutorial introduction
-
D. P. Bertsekas, "Auction algorithms for network flow problems: A tutorial introduction," Computational Optimization and Applications, vol. 1, pp. 7-66, 1992.
-
(1992)
Computational Optimization and Applications
, vol.1
, pp. 7-66
-
-
Bertsekas, D.P.1
-
10
-
-
84874259376
-
-
Submitted, June
-
D. Kalathil, N. Nayyar, and R. Jain, "Decentralized learning for multi-player multi-armed bandits," Submitted, available at : http://arxiv.org/abs/1206.3582, June, 2012.
-
(2012)
Decentralized Learning for Multi-player Multi-armed Bandits
-
-
Kalathil, D.1
Nayyar, N.2
Jain, R.3
|