[1] K. Liu and Q. Zhao, "Distributed learning in multi-armed bandit with multiple players," IEEE Trans. Signal Process., vol. 58, no. 11, pp. 5667-5681, 2010.
[2] C. Tekin and M. Liu, "Online learning in decentralized multi-user spectrum access with synchronized explorations," in Proc. IEEE MILCOM, 2012, pp. 1-6.
[3] R. Kleinberg, A. Slivkins, and E. Upfal, "Multi-armed bandits in metric spaces," in Proc. 40th Annu. ACM Symp. Theory Comput., 2008, pp. 681-690.
[4] S. Bubeck, R. Munos, G. Stoltz, and C. Szepesvari, "X-armed bandits," J. Mach. Learn. Res., vol. 12, pp. 1655-1695, 2011.
[5] A. Slivkins, "Contextual bandits with similarity information," in Proc. 24th Annu. Conf. Learn. Theory (COLT), Jun. 2011, vol. 19, pp. 679-702.
[6] M. Dudik, D. Hsu, S. Kale, N. Karampatziakis, J. Langford, L. Reyzin, and T. Zhang, "Efficient optimal learning for contextual bandits," 2011, arXiv preprint arXiv:1106.2369 [Online]. Available: http://arxiv.org/abs/1106.2369
[7] J. Langford and T. Zhang, "The epoch-greedy algorithm for contextual multi-armed bandits," Adv. Neural Inf. Process. Syst., vol. 20, pp. 1096-1103, 2007.
[8] W. Chu, L. Li, L. Reyzin, and R. E. Schapire, "Contextual bandits with linear payoff functions," in Proc. 14th Int. Conf. Artif. Intell. Statist. (AISTATS), Apr. 2011, vol. 15, pp. 208-214.
[9] L. Li, W. Chu, J. Langford, and R. E. Schapire, "A contextual-bandit approach to personalized news article recommendation," in Proc. 19th Int. Conf. World Wide Web, 2010, pp. 661-670.
[10] P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Mach. Learn., vol. 47, pp. 235-256, 2002.
[11] K. Crammer and C. Gentile, "Multiclass classification with bandit feedback using adaptive regularization," Mach. Learn., vol. 90, no. 3, pp. 347-383, 2013.
[12] A. Anandkumar, N. Michael, and A. Tang, "Opportunistic spectrum access with multiple players: Learning under competition," in Proc. IEEE INFOCOM, Mar. 2010.
[13] C. Tekin and M. Liu, "Online learning of rested and restless bandits," IEEE Trans. Inf. Theory, vol. 58, no. 8, pp. 5588-5611, 2012.
[14] H. Liu, K. Liu, and Q. Zhao, "Learning in a changing world: Restless multiarmed bandit with unknown dynamics," IEEE Trans. Inf. Theory, vol. 59, no. 3, pp. 1902-1916, 2013.
[15] R. Stranders, L. Tran-Thanh, F. M. D. Fave, A. Rogers, and N. R. Jennings, "DCOPs and bandits: Exploration and exploitation in decentralised coordination," in Proc. 11th Int. Conf. Autonom. Agents Multiagent Syst., vol. 1, 2012, pp. 289-296.
[16] Y. Gai, B. Krishnamachari, and R. Jain, "Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations," IEEE/ACM Trans. Netw., vol. 20, no. 5, pp. 1466-1478, 2012.
[17] S. S. Ram, A. Nedic, and V. V. Veeravalli, "Distributed stochastic subgradient projection algorithms for convex optimization," J. Optim. Theory Appl., vol. 147, no. 3, pp. 516-545, 2010.
[18] F. Yan, S. Sundaram, S. Vishwanathan, and Y. Qi, "Distributed autonomous online learning: Regrets and intrinsic privacy-preserving properties," IEEE Trans. Knowl. Data Eng., vol. 25, no. 11, pp. 2483-2493, 2013.
[19] M. Raginsky, N. Kiarashi, and R. Willett, "Decentralized online convex programming with local information," in Proc. Amer. Control Conf. (ACC), 2011, pp. 5363-5369.
[20] C. Tekin, S. Zhang, and M. van der Schaar, "Distributed online learning in social recommender systems," IEEE J. Sel. Topics Signal Process., vol. 8, no. 4, pp. 638-652, Aug. 2014.
[21] H. Liu, K. Liu, and Q. Zhao, "Learning in a changing world: Non-Bayesian restless multi-armed bandit," Univ. of California, Davis, Tech. Rep., 2010.
[22] R. Ortner, "Exploiting similarity information in reinforcement learning," in Proc. 2nd ICAART, 2010, pp. 203-210.
[23] E. Chlebus, "An approximate formula for a partial sum of the divergent p-series," Appl. Math. Lett., vol. 22, no. 5, pp. 732-737, 2009.