SCOPUS Information Search Platform

Volume , Issue PART 1, 2013, Pages 151-159

Combinatorial multi-armed bandit: General framework, results and applications

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION ALGORITHMS; LEARNING SYSTEMS; MARKETING; OPTIMIZATION;

CONSTANT FACTORS; MAXIMUM COVERAGE; MULTI ARMED BANDIT; NEW APPLICATIONS; ONLINE ADVERTISING; REWARD FUNCTION; SOCIAL INFLUENCE; VIRAL MARKETING;

PROBABILITY DISTRIBUTIONS;

EID: 84897515317 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited: 695

References (25)

1
- 0023453059
- Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays Part I: I.I.D. Rewards; Part II: Markovian rewards
- Anantharam, V., Varaiya, P., and Walrand, J. Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays Part I: i.i.d. rewards; Part II: Markovian rewards. IEEE Transactions on Automatic Control, AC-32(11):968-982, 1987a.
- (1987) IEEE Transactions on Automatic Control , vol.AC-32 , Issue.11 , pp. 968-982
- Anantharam, V.¹ Varaiya, P.² Walrand, J.³

2
- 84898079018
- Minimax policies for adversarial and stochastic bandits
- Audibert, J.-Y., and Bubeck, S. Minimax policies for adversarial and stochastic bandits. In COLT, 2009.
- (2009) COLT
- Audibert, J.-Y.¹ Bubeck, S.²

3
- 84954283323
- Minimax policies for combinatorial prediction games
- Audibert, J.-Y., Bubeck, S., and Lugosi, G. Minimax policies for combinatorial prediction games. In COLT, 2011.
- (2011) COLT
- Audibert, J.-Y.¹ Bubeck, S.² Lugosi, G.³

5
- 0037709910
- The nonstochastic multiarmed bandit problem
- Auer, P., Cesa-Bianchi, N., Freund, Y., and Schapire, R. E. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002b.
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

6
- 0004218171
- Bandit problems
- Berry, D. and Fristedt, B. Bandit problems. Chapman and Hall, 1985.
- (1985) Bandit Problems
- Berry, D.¹ Fristedt, B.²

7
- 84887470752
- Towards Minimax Policies for Online Linear Optimization with Bandit Feedback
- Bubeck, S., Cesa-Bianchi, N., and Kakade, S. M. Towards Minimax Policies for Online Linear Optimization with Bandit Feedback. In COLT, 2012.
- (2012) COLT
- Bubeck, S.¹ Cesa-Bianchi, N.² Kakade, S.M.³

9
- 79959599117
- Combinatorial bandits
- Cesa-Bianchi, N. and Lugosi, G. Combinatorial bandits.
- Cesa-Bianchi, N.¹ Lugosi, G.²

10
- 77953180719
- Learning multiuser channel allocations in cognitive radio networks: A combinatorial multi-armed bandit formulation
- Gai, Y., Krishnamachari, B., and Jain, R. Learning multiuser channel allocations in cognitive radio networks: A combinatorial multi-armed bandit formulation. In DySPAN, 2010.
- (2010) DySPAN
- Gai, Y.¹ Krishnamachari, B.² Jain, R.³

11
- 84867858040
- Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
- Gai, Y., Krishnamachari, B., and Jain, R. Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations. IEEE/ACM Transactions on Networking, 20, 2012.
- (2012) IEEE/ACM Transactions on Networking , vol.20
- Gai, Y.¹ Krishnamachari, B.² Jain, R.³

12
- 84863920694
- The KL-UCB algorithm for bounded stochastic bandits and beyond
- Garivier, A. and Cappé, O. The KL-UCB algorithm for bounded stochastic bandits and beyond. In COLT, 2011.
- (2011) COLT
- Garivier, A.¹ Cappé, O.²

13
- 80053442097
- Online submodular minimization
- Hazan, E. and Kale, S. Online submodular minimization. In NIPS, 2009.
- (2009) NIPS
- Hazan, E.¹ Kale, S.²

14
- 80053454111
- Playing games with approximation algorithms
- Kakade, S. M., Kalai, A. T., and Ligett, K. Playing games with approximation algorithms. SIAM Journal on Computing, 39(3):1088-1106, 2009.
- (2009) SIAM Journal on Computing , vol.39 , Issue.3 , pp. 1088-1106
- Kakade, S.M.¹ Kalai, A.T.² Ligett, K.³

15
- 33747172362
- Maximizing the spread of influence through a social network
- Kempe, D., Kleinberg, J. M., and Tardos, E. Maximizing the spread of influence through a social network. In KDD, 2003.
- (2003) KDD
- Kempe, D.¹ Kleinberg, J.M.² Tardos, E.³

16
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, T. L. and Robbins, H. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

17
- 80051636024
- Logarithmic weak regret of non-bayesian restless multi-armed bandit
- Liu, H., Liu, K., and Zhao, Q. Logarithmic weak regret of non-bayesian restless multi-armed bandit. In ICASSP, 2011.
- (2011) ICASSP
- Liu, H.¹ Liu, K.² Zhao, Q.³

18
- 84897490597
- Adaptive shortest-path routing under unknown and stochastically varying link states
- Liu, K. and Zhao, Q. Adaptive shortest-path routing under unknown and stochastically varying link states. Arxiv preprint arXiv:1201.4906, 2012.
- (2012) Adaptive Shortest-path Routing under Unknown and Stochastically Varying Link States
- Liu, K.¹ Zhao, Q.²

19
- 85162303757
- From Bandits to Experts: On the Value of Side-Observations
- Mannor, S., and Shamir, O. From Bandits to Experts: On the Value of Side-Observations. In NIPS, 2011.
- (2011) NIPS
- Mannor, S.¹ Shamir, O.²

20
- 0000095809
- An analysis of the approximations for maximizing submodular set functions
- Nemhauser, G., Wolsey, L., and Fisher, M. An analysis of the approximations for maximizing submodular set functions. Mathematical Programming, 14:265-294, 1978.
- (1978) Mathematical Programming , vol.14 , pp. 265-294
- Nemhauser, G.¹ Wolsey, L.² Fisher, M.³

21
- 56449088596
- Learning diverse rankings with multi-armed bandits
- Radlinski, F., Kleinberg, R., and Joachims, T. Learning diverse rankings with multi-armed bandits. In ICML, 2008.
- (2008) ICML
- Radlinski, F.¹ Kleinberg, R.² Joachims, T.³

22
- 79957966922
- Online learning of assignments
- Streeter, M., Golovin, D., and Krause, A. Online learning of assignments. In NIPS, 2009.
- (2009) NIPS
- Streeter, M.¹ Golovin, D.² Krause, A.³

23
- 85047019092
- An online algorithm for maximizing submodular functions
- Streeter, M. and Golovin, D. An online algorithm for maximizing submodular functions. In NIPS, 2008.
- (2008) NIPS
- Streeter, M.¹ Golovin, D.²

24
- 0004102479
- Reinforcement learning, an introduction
- Sutton, R. and Barto, A. Reinforcement learning, an introduction. MIT Press, 1998.
- (1998) Reinforcement Learning, An Introduction
- Sutton, R.¹ Barto, A.²

25
- 0003422462
- Approximation algorithms
- Vazirani, V. V. Approximation algorithms. Springer, 2004.
- (2004) Approximation Algorithms
- Vazirani, V.V.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.