메뉴 건너뛰기




Volumn 63, Issue 14, 2015, Pages 3740-3754

Distributed online learning via cooperative contextual bandits

Author keywords

Contextual bandits; cooperative learning; distributed learning; multi user bandits; multi user learning; online learning

Indexed keywords

ALGORITHMS; BIG DATA; DATA MINING; E-LEARNING; LEARNING ALGORITHMS; ONLINE SYSTEMS; SENSOR NETWORKS; SOCIAL NETWORKING (ONLINE);

EID: 84933576393     PISSN: 1053587X     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSP.2015.2430837     Document Type: Article
Times cited : (57)

References (23)
  • 1
    • 79953827701 scopus 로고    scopus 로고
    • Distributed learning in multi-Armed bandit with multiple players
    • K. Liu and Q. Zhao, "Distributed learning in multi-Armed bandit with multiple players," IEEE Trans. Signal Process., vol. 58, no. 11, pp. 5667-5681, 2010.
    • (2010) IEEE Trans. Signal Process. , vol.58 , Issue.11 , pp. 5667-5681
    • Liu, K.1    Zhao, Q.2
  • 2
    • 84874320633 scopus 로고    scopus 로고
    • Online learning in decentralized multi-user spectrum access with synchronized explorations
    • C. Tekin and M. Liu, "Online learning in decentralized multi-user spectrum access with synchronized explorations," in Proc. IEEEMILCOM, 2012, pp. 1-6.
    • (2012) Proc. IEEEMILCOM , pp. 1-6
    • Tekin, C.1    Liu, M.2
  • 5
    • 84874058621 scopus 로고    scopus 로고
    • Contextual bandits with similarity information
    • Jun.
    • A. Slivkins, "Contextual bandits with similarity information," in Proc. 24th Annu. Conf. Learn. Theory (COLT), Jun. 2011, vol. 19, pp. 679-702.
    • (2011) Proc. 24th Annu. Conf. Learn. Theory (COLT) , vol.19 , pp. 679-702
    • Slivkins, A.1
  • 7
    • 85162018594 scopus 로고    scopus 로고
    • The epoch-greedy algorithm for contextual multi-Armed bandits
    • J. Langford and T. Zhang, "The epoch-greedy algorithm for contextual multi-Armed bandits," Adv. Neural Inf. Process. Syst., vol. 20, pp. 1096-1103, 2007.
    • (2007) Adv. Neural Inf. Process. Syst. , vol.20 , pp. 1096-1103
    • Langford, J.1    Zhang, T.2
  • 10
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Mach. Learn., vol. 47, pp. 235-256, 2002.
    • (2002) Mach. Learn. , vol.47 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 11
    • 84874710652 scopus 로고    scopus 로고
    • Multiclass classification with bandit feedback using adaptive regularization
    • K. Crammer and C. Gentile, "Multiclass classification with bandit feedback using adaptive regularization," Mach. Learn., vol. 90, no. 3, pp. 347-383, 2013.
    • (2013) Mach. Learn. , vol.90 , Issue.3 , pp. 347-383
    • Crammer, K.1    Gentile, C.2
  • 12
    • 77953320021 scopus 로고    scopus 로고
    • Opportunistic spectrum access with multiple players: Learning under competition
    • Mar.
    • A. Anandkumar, N. Michael, and A. Tang, "Opportunistic spectrum access with multiple players: Learning under competition," in Proc. IEEE INFOCOM, Mar. 2010.
    • (2010) Proc. IEEE INFOCOM
    • Anandkumar, A.1    Michael, N.2    Tang, A.3
  • 13
    • 84863956678 scopus 로고    scopus 로고
    • Online learning of rested and restless bandits
    • C. Tekin and M. Liu, "Online learning of rested and restless bandits," IEEE Trans. Inf. Theory, vol. 58, no. 8, pp. 5588-5611, 2012.
    • (2012) IEEE Trans. Inf. Theory , vol.58 , Issue.8 , pp. 5588-5611
    • Tekin, C.1    Liu, M.2
  • 14
    • 84873932839 scopus 로고    scopus 로고
    • Learning in a changing world: Restless multiarmed bandit with unknown dynamics
    • H. Liu, K. Liu, and Q. Zhao, "Learning in a changing world: Restless multiarmed bandit with unknown dynamics," IEEE Trans. Inf. Theory, vol. 59, no. 3, pp. 1902-1916, 2013.
    • (2013) IEEE Trans. Inf. Theory , vol.59 , Issue.3 , pp. 1902-1916
    • Liu, H.1    Liu, K.2    Zhao, Q.3
  • 16
    • 84867858040 scopus 로고    scopus 로고
    • Combinatorial network optimization with unknown variables: Multi-Armed bandits with linear rewards and individual observations
    • Y. Gai, B. Krishnamachari, and R. Jain, "Combinatorial network optimization with unknown variables: multi-Armed bandits with linear rewards and individual observations," IEEE/ACM Trans. Netw., vol. 20, no. 5, pp. 1466-1478, 2012.
    • (2012) IEEE/ACM Trans. Netw. , vol.20 , Issue.5 , pp. 1466-1478
    • Gai, Y.1    Krishnamachari, B.2    Jain, R.3
  • 17
    • 78049361018 scopus 로고    scopus 로고
    • Distributed stochastic subgradient projection algorithms for convex optimization
    • S. S. Ram, A. Nedic, and V. V. Veeravalli, "Distributed stochastic subgradient projection algorithms for convex optimization," J. Optim. Theory Appl., vol. 147, no. 3, pp. 516-545, 2010.
    • (2010) J. Optim. Theory Appl. , vol.147 , Issue.3 , pp. 516-545
    • Ram, S.S.1    Nedic, A.2    Veeravalli, V.V.3
  • 18
    • 84884765296 scopus 로고    scopus 로고
    • Distributed autonomous online learning: Regrets and intrinsic privacy-preserving properties
    • F. Yan, S. Sundaram, S. Vishwanathan, and Y. Qi, "Distributed autonomous online learning: regrets and intrinsic privacy-preserving properties," IEEE Trans. Knowl. Data Eng., vol. 25, no. 11, pp. 2483-2493, 2013.
    • (2013) IEEE Trans. Knowl. Data Eng. , vol.25 , Issue.11 , pp. 2483-2493
    • Yan, F.1    Sundaram, S.2    Vishwanathan, S.3    Qi, Y.4
  • 19
    • 80053163611 scopus 로고    scopus 로고
    • Decentralized online convex programming with local information
    • M. Raginsky, N. Kiarashi, and R. Willett, "Decentralized online convex programming with local information," in Proc. Amer. Control Conf. (ACC), 2011, pp. 5363-5369.
    • Proc. Amer. Control Conf. (ACC) , vol.2011 , pp. 5363-5369
    • Raginsky, M.1    Kiarashi, N.2    Willett, R.3
  • 20
    • 84904749323 scopus 로고    scopus 로고
    • Distributed online learning in social recommender systems
    • Aug.
    • C. Tekin, S. Zhang, and M. van der Schaar, "Distributed online learning in social recommender systems," IEEE J. Sel. Topics Signal Process, vol. 8, no. 4, pp. 638-652, Aug. 2014.
    • (2014) IEEE J. Sel. Topics Signal Process , vol.8 , Issue.4 , pp. 638-652
    • Tekin, C.1    Zhang, S.2    Schaar Der M.Van3
  • 22
    • 77956284023 scopus 로고    scopus 로고
    • Exploiting similarity information in reinforcement learning
    • R. Ortner, "Exploiting similarity information in reinforcement learning," in Proc. 2nd ICAART, 2010, pp. 203-210.
    • (2010) Proc. 2nd ICAART , pp. 203-210
    • Ortner, R.1
  • 23
    • 62249098440 scopus 로고    scopus 로고
    • An approximate formula for a partial sum of the divergent p-series
    • E. Chlebus, "An approximate formula for a partial sum of the divergent p-series," Appl. Math. Lett., vol. 22, no. 5, pp. 732-737, 2009.
    • (2009) Appl. Math. Lett. , vol.22 , Issue.5 , pp. 732-737
    • Chlebus, E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.