-
1
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol. 58, no. 5, pp. 527-535, 1952.
-
(1952)
Bull. Amer. Math. Soc
, vol.58
, Issue.5
, pp. 527-535
-
-
Robbins, H.1
-
3
-
-
61449109791
-
Multi-armed bandit problems
-
NewYork, NY, USA: Springer-Verlag
-
A. Mahajan and D. Teneketzis, , A. O. Hero, III, D. A. Castanon, D. Cochran, and K. Kastella, Eds., "Multi-armed bandit problems," in Foundations and Applications of Sensor Management. NewYork, NY, USA: Springer-Verlag, 2007.
-
(2007)
Foundations and Applications of Sensor Management
-
-
Mahajan, A.1
Teneketzis, D.2
Hero Iii., A.O.3
Castanon, D.A.4
Cochran, D.5
Kastella Eds, K.6
-
4
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T. Lai and H. Robbins, "Asymptotically efficient adaptive allocation rules," Adv. Appl. Math., vol. 6, no. 1, pp. 4-22, 1985.
-
(1985)
Adv. Appl. Math
, vol.6
, Issue.1
, pp. 4-22
-
-
Lai, T.1
Robbins, H.2
-
5
-
-
0000616723
-
Sample mean based index policies with regret for the multi-armed bandit problem
-
R. Agrawal, "Sample mean based index policies with regret for the multi-armed bandit problem," Adv. Appl. Probab., vol. 27, pp. 1054-1078, 1995.
-
(1995)
Adv. Appl. Probab
, vol.27
, pp. 1054-1078
-
-
Agrawal, R.1
-
6
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Mach. Learn., vol. 47, pp. 235-256, 2002. (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
7
-
-
84873932839
-
Learning in a changing world: Restless multi-armed bandit with unknown dynamics
-
Mar.
-
H. Liu, K. Liu, and Q. Zhao, "Learning in a changing world: Restless multi-armed bandit with unknown dynamics," IEEE Trans. Inf. Theory, vol. 59, no. 3, pp. 1902-1916, Mar. 2013.
-
(2013)
IEEE Trans. Inf. Theory
, vol.59
, Issue.3
, pp. 1902-1916
-
-
Liu, H.1
Liu, K.2
Zhao, Q.3
-
11
-
-
79953827701
-
Distributed learning in multi-armed bandit with multiple players
-
Nov.
-
K. Liu and Q. Zhao, "Distributed learning in multi-armed bandit with multiple players," IEEE Trans. Signal Process., vol. 58, no. 11, pp. 5667-5681, Nov. 2010.
-
(2010)
IEEE Trans. Signal Process
, vol.58
, Issue.11
, pp. 5667-5681
-
-
Liu, K.1
Zhao, Q.2
-
12
-
-
79953194834
-
Distributed algorithms for learning and cognitive medium access with logarithmic regret
-
Mar.
-
A. Anandkumar, N. Michael, A. K. Tang, and A. Swami, "Distributed algorithms for learning and cognitive medium access with logarithmic regret," IEEE J. Sel. Areas Commun., vol. 29, no. 4, pp. 731-745, Mar. 2011.
-
(2011)
IEEE J. Sel. Areas Commun
, vol.29
, Issue.4
, pp. 731-745
-
-
Anandkumar, A.1
Michael, N.2
Tang, A.K.3
Swami, A.4
-
13
-
-
84857218599
-
Decentralized online learning algorithms for opportunistic spectrum access
-
Houston, TX, USA, Dec
-
Y. Gai and B. Krishnamachari, "Decentralized online learning algorithms for opportunistic spectrum access," in Proc. IEEE Global Commun. Conf. (GLOBECOM '11), Houston, TX, USA, Dec. 2011.
-
(2011)
Proc. IEEE Global Commun. Conf. (GLOBECOM '11)
-
-
Gai, Y.1
Krishnamachari, B.2
-
15
-
-
84874251645
-
Decentralized learning for multiplayer multi-armed bandits
-
Dec
-
D. Kalathil, N. Nayyar, and R. Jain, "Decentralized learning for multiplayer multi-armed bandits," in Proc. IEEE Conf. Decision Control (CDC), Dec. 2012, pp. 3960-3965.
-
(2012)
Proc. IEEE Conf. Decision Control (CDC)
, pp. 3960-3965
-
-
Kalathil, D.1
Nayyar, N.2
Jain, R.3
-
16
-
-
84874251645
-
Decentralized learning for multiplayer multi-armed bandits
-
[Online]. Available submitted for publication
-
D. Kalathil, N. Nayyar, and R. Jain, "Decentralized learning for multiplayer multi-armed bandits," IEEE Trans. Inf. Theory Apr. 2012 [Online]. Available: http://arxiv.org/abs/1206.3582, submitted for publication
-
(2012)
IEEE Trans. Inf. Theory Apr
-
-
Kalathil, D.1
Nayyar, N.2
Jain, R.3
-
17
-
-
84863956678
-
Online learning of rested and restless bandits
-
Aug.
-
C. Tekin and M. Liu, "Online learning of rested and restless bandits," IEEE Trans. Inf. Theory, vol. 58, no. 8, pp. 5588-5611, Aug. 2012.
-
(2012)
IEEE Trans. Inf. Theory
, vol.58
, Issue.8
, pp. 5588-5611
-
-
Tekin, C.1
Liu, M.2
-
18
-
-
80051623306
-
The non-Bayesian restless multi-armed bandit: A case of near-logarithmic regret
-
May
-
W. Dai, Y. Gai, B. Krishnamachari, and Q. Zhao, "The non-Bayesian restless multi-armed bandit: A case of near-logarithmic regret," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), May 2011, pp. 2940-2943.
-
(2011)
Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
, pp. 2940-2943
-
-
Dai, W.1
Gai, Y.2
Krishnamachari, B.3
Zhao, Q.4
-
19
-
-
84861588214
-
Approximately optimal adaptive learning in opportunistic spectrum access
-
Orlando, FL, USA, Mar
-
C. Tekin and M. Liu, "Approximately optimal adaptive learning in opportunistic spectrum access," in Proc. Int. Conf. Comput. Commun. (INFOCOM), Orlando, FL, USA, Mar. 2012.
-
(2012)
Proc. Int. Conf. Comput. Commun. (INFOCOM)
-
-
Tekin, C.1
Liu, M.2
-
20
-
-
84856091352
-
Adaptive learning of uncontrolled restless bandits with logarithmic regret
-
Sep
-
C. Tekin and M. Liu, "Adaptive learning of uncontrolled restless bandits with logarithmic regret," in Proc. Allerton Conf. Commun. , Control, Comput., Sep. 2011.
-
(2011)
Proc. Allerton Conf. Commun. , Control, Comput
-
-
Tekin, C.1
Liu, M.2
-
21
-
-
84867858040
-
Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
-
Oct.
-
Y. Gai, B. Krishnamachari, and R. Jain, "Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations," IEEE/ACM Trans. Netw., vol. 20, no. 5, pp. 1466-1478, Oct. 2012.
-
(2012)
IEEE/ACM Trans. Netw
, vol.20
, Issue.5
, pp. 1466-1478
-
-
Gai, Y.1
Krishnamachari, B.2
Jain, R.3
-
22
-
-
35448960376
-
Online linear optimization and adaptive routing
-
DOI 10.1016/j.jcss.2007.04.016, PII S0022000007000621, Learning Theory 2004
-
B. Awerbuch and R. Kleinberg, "Online linear optimization and adaptive routing," J. Comput. Syst. Sci., pp. 97-114, 2008. (Pubitemid 47625408)
-
(2008)
Journal of Computer and System Sciences
, vol.74
, Issue.1
, pp. 97-114
-
-
Awerbuch, B.1
Kleinberg, R.2
-
23
-
-
84867967913
-
Online learning for combinatorial network optimization with restless Markovian rewards
-
Y. Gai, B. Krishnamachari, and M. Liu, "Online learning for combinatorial network optimization with restless Markovian rewards," in Proc. 9th Annu. IEEE Conf. Sens., Mesh, Ad Hoc Commun. Networks (SECON), 2012.
-
(2012)
Proc. 9th Annu. IEEE Conf. Sens., Mesh, Ad Hoc Commun. Networks (SECON)
-
-
Gai, Y.1
Krishnamachari, B.2
Liu, M.3
-
24
-
-
84860425852
-
Locally sub-Gaussian random variable and the strong law of large numbers
-
P. Chareka, O. Chareka, and S. Kennendy, "Locally sub-Gaussian random variable and the strong law of large numbers," Atlantic Electron. J. Math., vol. 1, no. 1, pp. 75-81, 2006.
-
(2006)
Atlantic Electron. J. Math
, vol.1
, Issue.1
, pp. 75-81
-
-
Chareka, P.1
Chareka, O.2
Kennendy, S.3
-
25
-
-
0345224411
-
The continuum-armed bandit problem
-
Nov
-
R. Agrawal, "The continuum-armed bandit problem," SIAM J. Control Optimiz., vol. 33, no. 6, pp. 1926-1951, Nov. 1995.
-
(1995)
SIAM J. Control Optimiz
, vol.33
, Issue.6
, pp. 1926-1951
-
-
Agrawal, R.1
-
26
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
Mar
-
W. Hoeffding, "Probability inequalities for sums of bounded random variables," J. Amer. Statist. Assoc., vol. 58, no. 301, pp. 13-30, Mar. 1963.
-
(1963)
J. Amer. Statist. Assoc
, vol.58
, Issue.301
, pp. 13-30
-
-
Hoeffding, W.1
-
27
-
-
0012972085
-
On the best constant in Marcinkiewicz-Zygmund inequality
-
DOI 10.1016/S0167-7152(01)00015-3, PII S0167715201000153
-
Y. Ren andH. Liang, "On the best constant in Marcinkiewicz-Zygmund inequality," Statist. Probab. Lett., vol. 53, pp. 227-233, Jun. 2001. (Pubitemid 33623382)
-
(2001)
Statistics and Probability Letters
, vol.53
, Issue.3
, pp. 227-233
-
-
Ren, Y.-F.1
Liang, H.-Y.2
|