SCOPUS Information Retrieval Platform

Volume 17, Issue , 2016, Pages

Combinatorial multi-armed bandit and its extension to probabilistically triggered arms

(4) Chen, Wei (a); Wang, Yajun (a); Yuan, Yang (b); Wang, Qinshi (c)

Author keywords

Combinatorial multi armed bandit; Online advertising; Online learning; Social influence maximization; Upper confidence bound

Indexed keywords

ALGORITHMS; APPROXIMATION ALGORITHMS; E-LEARNING; ECONOMIC AND SOCIAL EFFECTS; LEARNING ALGORITHMS; MARKETING; MULTI ARMED BANDIT; ONLINE ADVERTISING; ONLINE LEARNING; SOCIAL INFLUENCE; UPPER CONFIDENCE BOUND; PROBABILITY DISTRIBUTIONS

EID: 84979937519 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (266)

References (36)

1
- 0345224411
- The continuum-armed bandit problem
- Rajeev Agrawal. The continuum-armed bandit problem. SIAM J. Control Optim., 33(6):1926-1951, 1995.
- (1995) SIAM J. Control Optim. , vol.33 , Issue.6 , pp. 1926-1951
- Agrawal, R.¹

2
- 0023453059
- Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: I.i.d. Rewards
- Venkatachalam Anantharam, Pravin Varaiya, and Jean Walrand. Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays - Part I: i.i.d. rewards. IEEE Transactions on Automatic Control, AC-32(11):968-976, 1987.
- (1987) IEEE Transactions on Automatic Control , vol.AC-32 , Issue.11 , pp. 968-976
- Anantharam, V.¹ Varaiya, P.² Walrand, J.³

3
- 84898079018
- Minimax policies for adversarial and stochastic bandits
- Jean-Yves Audibert, Sébastien Bubeck, and Gábor Lugosi. Minimax policies for adversarial and stochastic bandits. In Proceedings of the 22nd Annual Conference on Learning Theory (COLT), 2009.
- (2009) Proceedings of the 22nd Annual Conference on Learning Theory (COLT)
- Audibert, J.-Y.¹ Bubeck, S.² Lugosi, G.³

4
- 84954283323
- Minimax policies for combinatorial prediction games
- Jean-Yves Audibert, Sébastien Bubeck, and Gábor Lugosi. Minimax policies for combinatorial prediction games. In Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011.
- (2011) Proceedings of the 24th Annual Conference on Learning Theory (COLT)
- Audibert, J.-Y.¹ Bubeck, S.² Lugosi, G.³

5
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002a.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

6
- 0037709910
- The nonstochastic multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002b.
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

7
- 0004181906
- Chapman and Hall
- Donald A. Berry and Bert Fristedt. Bandit Problems: Sequential Allocation of Experiments. Chapman and Hall, 1985.
- (1985) Bandit Problems: Sequential Allocation of Experiments
- Berry, D.A.¹ Fristedt, B.²

8
- 84887470752
- Towards minimax policies for online linear optimization with bandit feedback
- Sébastien Bubeck, Nicolò Cesa-Bianchi, and Sham M. Kakade. Towards minimax policies for online linear optimization with bandit feedback. In Proceedings of the 25th Annual Conference on Learning Theory (COLT), 2012.
- (2012) Proceedings of the 25th Annual Conference on Learning Theory (COLT)
- Bubeck, S.¹ Cesa-Bianchi, N.² Kakade, S.M.³

9
- 33847255926
- Dynamic assortment with demand learning for seasonal consumer goods
- Felipe Caro and Jérémie Gallien. Dynamic assortment with demand learning for seasonal consumer goods. Management Science, 53:276-292, 2007.
- (2007) Management Science , vol.53 , pp. 276-292
- Caro, F.¹ Gallien, J.²

10
- 84898061746
- Combinatorial bandits
- Nicolò Cesa-Bianchi and Gábor Lugosi. Combinatorial bandits. In Proceedings of the 22nd Conference on Learning Theory, 2009.
- (2009) Proceedings of the 22nd Conference on Learning Theory
- Cesa-Bianchi, N.¹ Lugosi, G.²

11
- 84897515317
- Combinatorial multi-armed bandit: General framework, results, and applications
- Wei Chen, Yajun Wang, and Yang Yuan. Combinatorial multi-armed bandit: General framework, results, and applications. In Proceedings of the 30th International Conference on Machine Learning (ICML), 2013.
- (2013) Proceedings of the 30th International Conference on Machine Learning (ICML)
- Chen, W.¹ Wang, Y.² Yuan, Y.³

12
- 77953180719
- Learning multiuser channel allocations in cognitive radio networks: A combinatorial multi-armed bandit formulation
- Yi Gai, Bhaskar Krishnamachari, and Rahul Jain. Learning multiuser channel allocations in cognitive radio networks: A combinatorial multi-armed bandit formulation. In Proceedings of IEEE Symposium on New Frontiers in Dynamic Spectrum Access Networks (DySPAN), 2010.
- (2010) Proceedings of IEEE Symposium on New Frontiers in Dynamic Spectrum Access Networks (DySPAN)
- Gai, Y.¹ Krishnamachari, B.² Jain, R.³

13
- 84867858040
- Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
- Yi Gai, Bhaskar Krishnamachari, and Rahul Jain. Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations. IEEE/ACM Transactions on Networking, 20, 2012.
- (2012) IEEE/ACM Transactions on Networking , vol.20
- Gai, Y.¹ Krishnamachari, B.² Jain, R.³

14
- 84863920694
- The KL-UCB algorithm for bounded stochastic bandits and beyond
- Aurélien Garivier and Olivier Cappé. The KL-UCB algorithm for bounded stochastic bandits and beyond. In Proceedings of the 24th Annual Conference on Learning Theory (COLT), 2011.
- (2011) Proceedings of the 24th Annual Conference on Learning Theory (COLT)
- Garivier, A.¹ Cappé, O.²

15
- 84919793884
- Thompson sampling for complex online problems
- Aditya Gopalan, Shie Mannor, and Yishay Mansour. Thompson sampling for complex online problems. In Proceedings of the 31st International Conference on Machine Learning (ICML), 2014.
- (2014) Proceedings of the 31st International Conference on Machine Learning (ICML)
- Gopalan, A.¹ Mannor, S.² Mansour, Y.³

16
- 80053442097
- Online submodular minimization
- Elad Hazan and Satyen Kale. Online submodular minimization. In Proceedings of the 23rd Annual Conference on Neural Information Processing Systems (NIPS), 2009.
- (2009) Proceedings of the 23rd Annual Conference on Neural Information Processing Systems (NIPS)
- Hazan, E.¹ Kale, S.²

17
- 84947403595
- Probability inequalities for sums of bounded random variables
- Wassily Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963.
- (1963) Journal of the American Statistical Association , vol.58 , Issue.301 , pp. 13-30
- Hoeffding, W.¹

18
- 80053454111
- Playing games with approximation algorithms
- Sham M. Kakade, Adam Tauman Kalai, and Katrina Ligett. Playing games with approximation algorithms. SIAM Journal on Computing, 39(3):1088-1106, 2009.
- (2009) SIAM Journal on Computing , vol.39 , Issue.3 , pp. 1088-1106
- Kakade, S.M.¹ Kalai, A.T.² Ligett, K.³

19
- 33747172362
- Maximizing the spread of influence through a social network
- David Kempe, Jon M. Kleinberg, and Éva Tardos. Maximizing the spread of influence through a social network. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pages 137-146, 2003.
- (2003) Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) , pp. 137-146
- Kempe, D.¹ Kleinberg, J.M.² Tardos, É.³

20
- 57049185311
- Multi-armed bandits in metric spaces
- Robert Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Multi-armed bandits in metric spaces. In ACM Symposium on Theory of Computing (STOC), 2008.
- (2008) ACM Symposium on Theory of Computing (STOC)
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

21
- 84898981061
- Nearly tight bounds for the continuum-armed bandit problem
- Robert D. Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In Proceedings of the 17th Annual Conference on Neural Information Processing Systems (NIPS), 2004.
- (2004) Proceedings of the 17th Annual Conference on Neural Information Processing Systems (NIPS)
- Kleinberg, R.D.¹

22
- 84923299004
- Matroid bandits: Fast combinatorial optimization with learning
- Branislav Kveton, Zheng Wen, Azin Ashkan, Hoda Eydgahi, and Brian Eriksson. Matroid bandits: Fast combinatorial optimization with learning. In Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence (UAI), 2014.
- (2014) Proceedings of the 30th Conference on Uncertainty in Artificial Intelligence (UAI)
- Kveton, B.¹ Wen, Z.² Ashkan, A.³ Eydgahi, H.⁴ Eriksson, B.⁵

23
- 84965179449
- Tight regret bounds for stochastic combinatorial semi-bandits
- to appear, with arxiv version arXiv:1410.0949
- Branislav Kveton, Zheng Wen, Azin Ashkan, and Csaba Szepesvári. Tight regret bounds for stochastic combinatorial semi-bandits. In Proceedings of the 18th International Conference on Artificial Intelligence and Statistics, 2015. to appear, with arxiv version arXiv:1410.0949.
- (2015) Proceedings of the 18th International Conference on Artificial Intelligence and Statistics
- Kveton, B.¹ Wen, Z.² Ashkan, A.³ Szepesvári, C.⁴

24
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Tze Leung Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

25
- 84919902752
- Combinatorial partial monitoring game with linear feedback and its applications
- Tian Lin, Bruno Abrahao, Robert Kleinberg, John C. S. Lui, and Wei Chen. Combinatorial partial monitoring game with linear feedback and its applications. In Proceedings of the 31st International Conference on Machine Learning (ICML), 2014.
- (2014) Proceedings of the 31st International Conference on Machine Learning (ICML)
- Lin, T.¹ Abrahao, B.² Kleinberg, R.³ Lui, J.C.S.⁴ Chen, W.⁵

26
- 80051636024
- Logarithmic weak regret of non-Bayesian restless multi-armed bandit
- Haoyang Liu, Keqin Liu, and Qing Zhao. Logarithmic weak regret of non-Bayesian restless multi-armed bandit. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2011.
- (2011) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
- Liu, H.¹ Liu, K.² Zhao, Q.³

27
- 84866935969
- Adaptive shortest-path routing under unknown and stochastically varying link states
- Keqin Liu and Qing Zhao. Adaptive shortest-path routing under unknown and stochastically varying link states. In Proceedings of the 10th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt), 2012.
- (2012) Proceedings of the 10th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt)
- Liu, K.¹ Zhao, Q.²

28
- 85162303757
- From bandits to experts: On the value of side-observations
- Shie Mannor and Ohad Shamir. From bandits to experts: On the value of side-observations. In Proceedings of the 25th Annual Conference on Neural Information Processing Systems (NIPS), 2011.
- (2011) Proceedings of the 25th Annual Conference on Neural Information Processing Systems (NIPS)
- Mannor, S.¹ Shamir, O.²

29
- 33746398102
- Cambridge University Press
- Michael Mitzenmacher and Eli Upfal. Probability and Computing. Cambridge University Press, 2005.
- (2005) Probability and Computing
- Mitzenmacher, M.¹ Upfal, E.²

30
- 0000095809
- An analysis of the approximations for maximizing submodular set functions
- G. L. Nemhauser, L. A. Wolsey, and M. L. Fisher. An analysis of the approximations for maximizing submodular set functions. Mathematical Programming, 14(1):265-294, 1978.
- (1978) Mathematical Programming , vol.14 , Issue.1 , pp. 265-294
- Nemhauser, G.L.¹ Wolsey, L.A.² Fisher, M.L.³

31
- 84959887930
- Contextual combinatorial bandit and its application on diversified online recommendation
- Lijing Qin, Shouyuan Chen, and Xiaoyan Zhu. Contextual combinatorial bandit and its application on diversified online recommendation. In Proceedings of the 2014 SIAM International Conference on Data Mining (SDM), 2014.
- (2014) Proceedings of the 2014 SIAM International Conference on Data Mining (SDM)
- Qin, L.¹ Chen, S.² Zhu, X.³

32
- 56449088596
- Learning diverse rankings with multi-armed bandits
- Filip Radlinski, Robert Kleinberg, and Thorsten Joachims. Learning diverse rankings with multi-armed bandits. In Proceedings of the 25th International Conference on Machine Learning (ICML), 2008.
- (2008) Proceedings of the 25th International Conference on Machine Learning (ICML)
- Radlinski, F.¹ Kleinberg, R.² Joachims, T.³

33
- 85047019092
- An online algorithm for maximizing submodular functions
- Matthew Streeter and Daniel Golovin. An online algorithm for maximizing submodular functions. In Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS), 2008.
- (2008) Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS)
- Streeter, M.¹ Golovin, D.²

34
- 79957966922
- Online learning of assignments
- Matthew Streeter, Daniel Golovin, and Andreas Krause. Online learning of assignments. In Proceedings of the 23rd Annual Conference on Neural Information Processing Systems (NIPS), 2009.
- (2009) Proceedings of the 23rd Annual Conference on Neural Information Processing Systems (NIPS)
- Streeter, M.¹ Golovin, D.² Krause, A.³

35
- 0004102479
- MIT Press
- Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

36
- 0003422462
- Springer
- Vijay V. Vazirani. Approximation Algorithms. Springer, 2004.
- (2004) Approximation Algorithms
- Vazirani, V.V.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.