SCOPUS Information Search Platform

Volume 63, Issue 14, 2015, Pages 3740-3754

Distributed online learning via cooperative contextual bandits

(2) Tekin, Cem ᵃ; Van Der Schaar, Mihaela ᵃ

ᵃ UNIVERSITY OF CALIFORNIA (United States)

Author keywords

Contextual bandits; cooperative learning; distributed learning; multi user bandits; multi user learning; online learning

Indexed keywords

ALGORITHMS; BIG DATA; DATA MINING; E-LEARNING; LEARNING ALGORITHMS; ONLINE SYSTEMS; SENSOR NETWORKS; SOCIAL NETWORKING (ONLINE);

CONTEXTUAL BANDITS; COOPERATIVE LEARNING; DISTRIBUTED LEARNING; MULTI-USER; ONLINE LEARNING;

LEARNING SYSTEMS;

EID: 84933576393 PISSN: 1053587X EISSN: None Source Type: Journal
DOI: 10.1109/TSP.2015.2430837 Document Type: Article

Times cited: 57

References (23)

1
- 79953827701
- Distributed learning in multi-armed bandit with multiple players
- K. Liu and Q. Zhao, "Distributed learning in multi-armed bandit with multiple players," IEEE Trans. Signal Process., vol. 58, no. 11, pp. 5667-5681, 2010.
- (2010) IEEE Trans. Signal Process. , vol.58 , Issue.11 , pp. 5667-5681
- Liu, K.¹ Zhao, Q.²

2
- 84874320633
- Online learning in decentralized multi-user spectrum access with synchronized explorations
- C. Tekin and M. Liu, "Online learning in decentralized multi-user spectrum access with synchronized explorations," in Proc. IEEE MILCOM, 2012, pp. 1-6.
- (2012) Proc. IEEE MILCOM , pp. 1-6
- Tekin, C.¹ Liu, M.²

3
- 57049185311
- Multi-armed bandits in metric spaces
- R. Kleinberg, A. Slivkins, and E. Upfal, "Multi-armed bandits in metric spaces," in Proc. 40th Annu. ACM Symp. Theory Comput., 2008, pp. 681-690.
- (2008) Proc. 40th Annu. ACM Symp. Theory Comput. , pp. 681-690
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

4
- 79960128338
- X-armed bandits
- S. Bubeck, R. Munos, G. Stoltz, and C. Szepesvari, "X-armed bandits," J. Mach. Learn. Res., vol. 12, pp. 1655-1695, 2011.
- (2011) J. Mach. Learn. Res. , vol.12 , pp. 1655-1695
- Bubeck, S.¹ Munos, R.² Stoltz, G.³ Szepesvari, C.⁴

5
- 84874058621
- Contextual bandits with similarity information
- A. Slivkins, "Contextual bandits with similarity information," in Proc. 24th Annu. Conf. Learn. Theory (COLT), Jun. 2011, vol. 19, pp. 679-702.
- (2011) Proc. 24th Annu. Conf. Learn. Theory (COLT) , vol.19 , pp. 679-702
- Slivkins, A.¹

6
- 84889587330
- Efficient optimal learning for contextual bandits
- M. Dudik, D. Hsu, S. Kale, N. Karampatziakis, J. Langford, L. Reyzin, and T. Zhang, "Efficient optimal learning for contextual bandits," 2011, ArXiv preprint arXiv:1106.2369 [Online]. Available: http://arxiv.org/abs/1106.2369
- (2011) Efficient Optimal Learning for Contextual Bandits
- Dudik, M.¹ Hsu, D.² Kale, S.³ Karampatziakis, N.⁴ Langford, J.⁵ Reyzin, L.⁶ Zhang, T.⁷

7
- 85162018594
- The epoch-greedy algorithm for contextual multi-armed bandits
- J. Langford and T. Zhang, "The epoch-greedy algorithm for contextual multi-armed bandits," Adv. Neural Inf. Process. Syst., vol. 20, pp. 1096-1103, 2007.
- (2007) Adv. Neural Inf. Process. Syst. , vol.20 , pp. 1096-1103
- Langford, J.¹ Zhang, T.²

8
- 84862295531
- Contextual bandits with linear payoff functions
- W. Chu, L. Li, L. Reyzin, and R. E. Schapire, "Contextual bandits with linear payoff functions," in Proc. 14th Int. Conf. Artif. Intell. Statist. (AISTATS), Apr. 2011, vol. 15, pp. 208-214.
- (2011) Proc. 14th Int. Conf. Artif. Intell. Statist. (AISTATS) , vol.15 , pp. 208-214
- Chu, W.¹ Li, L.² Reyzin, L.³ Schapire, R.E.⁴

9
- 77954641643
- A contextual-bandit approach to personalized news article recommendation
- L. Li, W. Chu, J. Langford, and R. E. Schapire, "A contextual-bandit approach to personalized news article recommendation," in Proc. 19th Int. Conf. World Wide Web, 2010, pp. 661-670.
- (2010) Proc. 19th Int. Conf. World Wide Web , pp. 661-670
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

10
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Mach. Learn., vol. 47, pp. 235-256, 2002.
- (2002) Mach. Learn. , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

11
- 84874710652
- Multiclass classification with bandit feedback using adaptive regularization
- K. Crammer and C. Gentile, "Multiclass classification with bandit feedback using adaptive regularization," Mach. Learn., vol. 90, no. 3, pp. 347-383, 2013.
- (2013) Mach. Learn. , vol.90 , Issue.3 , pp. 347-383
- Crammer, K.¹ Gentile, C.²

12
- 77953320021
- Opportunistic spectrum access with multiple players: Learning under competition
- A. Anandkumar, N. Michael, and A. Tang, "Opportunistic spectrum access with multiple players: Learning under competition," in Proc. IEEE INFOCOM, Mar. 2010.
- (2010) Proc. IEEE INFOCOM
- Anandkumar, A.¹ Michael, N.² Tang, A.³

13
- 84863956678
- Online learning of rested and restless bandits
- C. Tekin and M. Liu, "Online learning of rested and restless bandits," IEEE Trans. Inf. Theory, vol. 58, no. 8, pp. 5588-5611, 2012.
- (2012) IEEE Trans. Inf. Theory , vol.58 , Issue.8 , pp. 5588-5611
- Tekin, C.¹ Liu, M.²

14
- 84873932839
- Learning in a changing world: Restless multiarmed bandit with unknown dynamics
- H. Liu, K. Liu, and Q. Zhao, "Learning in a changing world: Restless multiarmed bandit with unknown dynamics," IEEE Trans. Inf. Theory, vol. 59, no. 3, pp. 1902-1916, 2013.
- (2013) IEEE Trans. Inf. Theory , vol.59 , Issue.3 , pp. 1902-1916
- Liu, H.¹ Liu, K.² Zhao, Q.³

15
- 84899449536
- DCOPs and bandits: Exploration and exploitation in decentralised coordination
- R. Stranders, L. Tran-Thanh, F. M. D. Fave, A. Rogers, and N. R. Jennings, "DCOPs and bandits: exploration and exploitation in decentralised coordination," in Proc. 11th Int. Conf. Autonom. Agents Multiagent Syst.-Volume 1, 2012, pp. 289-296.
- (2012) Proc. 11th Int. Conf. Autonom. Agents Multiagent Syst. , vol.1 , pp. 289-296
- Stranders, R.¹ Tran-Thanh, L.² Fave, F.M.D.³ Rogers, A.⁴ Jennings, N.R.⁵

16
- 84867858040
- Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
- Y. Gai, B. Krishnamachari, and R. Jain, "Combinatorial network optimization with unknown variables: multi-armed bandits with linear rewards and individual observations," IEEE/ACM Trans. Netw., vol. 20, no. 5, pp. 1466-1478, 2012.
- (2012) IEEE/ACM Trans. Netw. , vol.20 , Issue.5 , pp. 1466-1478
- Gai, Y.¹ Krishnamachari, B.² Jain, R.³

17
- 78049361018
- Distributed stochastic subgradient projection algorithms for convex optimization
- S. S. Ram, A. Nedic, and V. V. Veeravalli, "Distributed stochastic subgradient projection algorithms for convex optimization," J. Optim. Theory Appl., vol. 147, no. 3, pp. 516-545, 2010.
- (2010) J. Optim. Theory Appl. , vol.147 , Issue.3 , pp. 516-545
- Ram, S.S.¹ Nedic, A.² Veeravalli, V.V.³

18
- 84884765296
- Distributed autonomous online learning: Regrets and intrinsic privacy-preserving properties
- F. Yan, S. Sundaram, S. Vishwanathan, and Y. Qi, "Distributed autonomous online learning: regrets and intrinsic privacy-preserving properties," IEEE Trans. Knowl. Data Eng., vol. 25, no. 11, pp. 2483-2493, 2013.
- (2013) IEEE Trans. Knowl. Data Eng. , vol.25 , Issue.11 , pp. 2483-2493
- Yan, F.¹ Sundaram, S.² Vishwanathan, S.³ Qi, Y.⁴

19
- 80053163611
- Decentralized online convex programming with local information
- M. Raginsky, N. Kiarashi, and R. Willett, "Decentralized online convex programming with local information," in Proc. Amer. Control Conf. (ACC), 2011, pp. 5363-5369.
- (2011) Proc. Amer. Control Conf. (ACC) , pp. 5363-5369
- Raginsky, M.¹ Kiarashi, N.² Willett, R.³

20
- 84904749323
- Distributed online learning in social recommender systems
- C. Tekin, S. Zhang, and M. van der Schaar, "Distributed online learning in social recommender systems," IEEE J. Sel. Topics Signal Process, vol. 8, no. 4, pp. 638-652, Aug. 2014.
- (2014) IEEE J. Sel. Topics Signal Process , vol.8 , Issue.4 , pp. 638-652
- Tekin, C.¹ Zhang, S.² Van Der Schaar, M.³

21
- 80051629493
- Learning in a changing world: Non-Bayesian restless multi-armed bandit
- H. Liu, K. Liu, and Q. Zhao, "Learning in a changing world: Non-Bayesian restless multi-armed bandit," Univ. of California-Davis, Tech. Rep., 2010.
- (2010) Learning in a Changing World: Non-Bayesian Restless Multi-Armed Bandit
- Liu, H.¹ Liu, K.² Zhao, Q.³

22
- 77956284023
- Exploiting similarity information in reinforcement learning
- R. Ortner, "Exploiting similarity information in reinforcement learning," in Proc. 2nd ICAART, 2010, pp. 203-210.
- (2010) Proc. 2nd ICAART , pp. 203-210
- Ortner, R.¹

23
- 62249098440
- An approximate formula for a partial sum of the divergent p-series
- E. Chlebus, "An approximate formula for a partial sum of the divergent p-series," Appl. Math. Lett., vol. 22, no. 5, pp. 732-737, 2009.
- (2009) Appl. Math. Lett. , vol.22 , Issue.5 , pp. 732-737
- Chlebus, E.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.