SCOPUS 정보 검색 플랫폼

Proceedings - Annual IEEE Symposium on Foundations of Computer Science, FOCS

Volumn , Issue , 2013, Pages 207-216

Bandits with knapsacks (extended abstract)

(3) Badanidiyuru, Ashwinkumar a Kleinberg, Robert a Slivkins, Aleksandrs b

a School of Operations Research and Information Engineering (United States)

Author keywords

Dynamic ad allocation; Dynamic pricing; Dynamic procurement; Exploration exploitation tradeoff; Multi armed bandits; Regret; Stochastic packing

Indexed keywords

DYNAMIC PRICING; EXPLORATION EXPLOITATIONS; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; POLY-LOGARITHMIC FACTORS; PRIMAL DUAL ALGORITHMS; REGRET; STOCHASTIC INTEGER PROGRAMMING;

COMPUTER SCIENCE; INTEGER PROGRAMMING; OPTIMIZATION; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

ALGORITHMS;

EID: 84893451322 PISSN: 02725428 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/FOCS.2013.30 Document Type: Conference Paper

Times cited : (278)

References (33)

1
- 0000248624
- Multi-armed bandits and the gittins index
- P. Whittle, "Multi-armed bandits and the Gittins index, " J. Royal Statistical Society, Series B, vol. 42, no. 2, pp. 143- 149, 1980.
- (1980) J. Royal Statistical Society, Series B , vol.42 , Issue.2 , pp. 143-149
- Whittle, P.¹

2
- 0032628612
- The complexity of optimal queuing network control
- C. H. Papadimitriou and J. N. Tsitsiklis, "The complexity of optimal queuing network control, " Math. Oper. Res., vol. 24, no. 2, pp. 293-305, 1999.
- (1999) Math. Oper. Res. , vol.24 , Issue.2 , pp. 293-305
- Papadimitriou, C.H.¹ Tsitsiklis, J.N.²

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- preliminary version in 36th IEEE FOCS, 1995
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire, "The nonstochastic multiarmed bandit problem." SIAM J. Comput., vol. 32, no. 1, pp. 48-77, 2002, preliminary version in 36th IEEE FOCS, 1995.
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

4
- 84871887590
- Blind network revenue management
- O. Besbes and A. J. Zeevi, "Blind network revenue management, " Operations Research, vol. 60, no. 6, pp. 1537-1550, 2012.
- (2012) Operations Research , vol.60 , Issue.6 , pp. 1537-1550
- Besbes, O.¹ Zeevi, A.J.²

5
- 84863507274
- Learning on a budget: Posted price mechanisms for online procurement
- A. Badanidiyuru, R. Kleinberg, and Y. Singer, "Learning on a budget: posted price mechanisms for online procurement, " in 13th ACM EC, 2012, pp. 128-145.
- (2012) 13th ACM EC , pp. 128-145
- Badanidiyuru, A.¹ Kleinberg, R.² Singer, Y.³

6
- 84893485049
- Adaptive crowdsourcing algorithms for the bandit survey problem
- I. Abraham, O. Alonso, V. Kandylas, and A. Slivkins, "Adaptive crowdsourcing algorithms for the bandit survey problem, " in 26th COLT, 2013.
- (2013) 26th COLT
- Abraham, I.¹ Alonso, O.² Kandylas, V.³ Slivkins, A.⁴

7
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- preliminary version in 15th ICML, 1998
- P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem." Machine Learning, vol. 47, no. 2-3, pp. 235-256, 2002, preliminary version in 15th ICML, 1998.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

8
- 35148838877
- The weighted majority algorithm
- N. Littlestone and M. K. Warmuth, "The weighted majority algorithm, " Information and Computation, vol. 108, no. 2, pp. 212-260, 1994.
- (1994) Information and Computation , vol.108 , Issue.2 , pp. 212-260
- Littlestone, N.¹ Warmuth, M.K.²

9
- 0031211090
- A decision-theoretic generalization of on-line learning and an application to boosting
- Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting, " Journal of Computer and System Sciences, vol. 55, no. 1, pp. 119-139, 1997.
- (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
- Freund, Y.¹ Schapire, R.E.²

10
- 70350251174
- Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms
- O. Besbes and A. Zeevi, "Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms, " Operations Research, vol. 57, pp. 1407-1420, 2009.
- (2009) Operations Research , vol.57 , pp. 1407-1420
- Besbes, O.¹ Zeevi, A.²

11
- 84863515685
- Dynamic pricing with limited supply
- M. Babaioff, S. Dughmi, R. Kleinberg, and A. Slivkins, "Dynamic pricing with limited supply, " in 13th ACM EC, 2012.
- (2012) 13th ACM EC
- Babaioff, M.¹ Dughmi, S.² Kleinberg, R.³ Slivkins, A.⁴

12
- 0037740018
- Online learning in online auctions
- A. Blum, V. Kumar, A. Rudra, and F. Wu, "Online learning in online auctions, " in 14th ACM-SIAM SODA, 2003, pp. 202- 204.
- (2003) 14th ACM-SIAM SODA , pp. 202-204
- Blum, A.¹ Kumar, V.² Rudra, A.³ Wu, F.⁴

13
- 0345412655
- The value of knowing a demand curve: Bounds on regret for online posted-price auctions
- R. Kleinberg and T. Leighton, "The value of knowing a demand curve: Bounds on regret for online posted-price auctions." in 44th IEEE FOCS, 2003, pp. 594-605.
- (2003) 44th IEEE FOCS , pp. 594-605
- Kleinberg, R.¹ Leighton, T.²

14
- 84893043989
- Truthful incentives in crowdsourcing tasks using regret minimization mechanisms
- A. Singla and A. Krause, "Truthful incentives in crowdsourcing tasks using regret minimization mechanisms, " in 22nd WWW, 2013, pp. 1167-1178.
- (2013) 22nd WWW , pp. 1167-1178
- Singla, A.¹ Krause, A.²

15
- 84898072179
- Stochastic linear optimization under bandit feedback
- V. Dani, T. P. Hayes, and S. Kakade, "Stochastic Linear Optimization under Bandit Feedback, " in 21th COLT, 2008, pp. 355-366.
- (2008) 21th COLT , pp. 355-366
- Dani, V.¹ Hayes, T.P.² Kakade, S.³

16
- 57049185311
- Multi-Armed bandits in metric spaces
- R. Kleinberg, A. Slivkins, and E. Upfal, "Multi-Armed Bandits in Metric Spaces, " in 40th ACM STOC, 2008, pp. 681- 690.
- (2008) 40th ACM STOC , pp. 681-690
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

17
- 0002899547
- Asymptotically efficient adaptive allocations rules
- T. L. Lai and H. Robbins, "Asymptotically efficient adaptive allocations rules, " Adv. in Appl. Math., vol. 6, pp. 4-22, 1985.
- (1985) Adv. in Appl. Math. , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

18
- 84874045238
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- S. Bubeck and N. Cesa-Bianchi, "Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems, " Foundations and Trends in Machine Learning, vol. 5, no. 1, pp. 1-122, 2012.
- (2012) Foundations and Trends in Machine Learning , vol.5 , Issue.1 , pp. 1-122
- Bubeck, S.¹ Cesa-Bianchi, N.²

19
- 84858998073
- Multi-armed bandits with metric switching costs
- S. Guha and K. Munagala, "Multi-armed Bandits with Metric Switching Costs, " in 36th ICALP, 2007, pp. 496-507.
- (2007) 36th ICALP , pp. 496-507
- Guha, S.¹ Munagala, K.²

20
- 84863332792
- Approximation algorithms for correlated knapsacks and nonmartingale bandits
- A. Gupta, R. Krishnaswamy, M. Molinaro, and R. Ravi, "Approximation algorithms for correlated knapsacks and nonmartingale bandits, " in 52nd IEEE FOCS, 2011, pp. 827-836.
- (2011) 52nd IEEE FOCS , pp. 827-836
- Gupta, A.¹ Krishnaswamy, R.² Molinaro, M.³ Ravi, R.⁴

21
- 84893465912
- Ph.D. dissertation, University of Southampton
- L. Tran-Thanh, "Budget-limited multi-armed bandits, " Ph.D. dissertation, University of Southampton, 2012.
- (2012) Budget-limited Multi-armed Bandits
- Tran-Thanh, L.¹

22
- 77958583895
- Ε-first policies for budget-limited multiarmed bandits
- L. Tran-Thanh, A. Chapman, E. M. de Cote, A. Rogers, and N. R. Jennings, "ε-first policies for budget-limited multiarmed bandits, " in 24th AAAI, 2010, pp. 1211-1216.
- (2010) 24th AAAI , pp. 1211-1216
- Tran-Thanh, L.¹ Chapman, A.² De Cote, E.M.³ Rogers, A.⁴ Jennings, N.R.⁵

23
- 84868281643
- Knapsack based optimal policies for budget-limited multiarmed bandits
- L. Tran-Thanh, A. Chapman, A. Rogers, and N. R. Jennings, "Knapsack based optimal policies for budget-limited multiarmed bandits, " in 26th AAAI, 2012, pp. 1134-1140.
- (2012) 26th AAAI , pp. 1134-1140
- Tran-Thanh, L.¹ Chapman, A.² Rogers, A.³ Jennings, N.R.⁴

24
- 84876049382
- Regret minimization for reserve prices in second-price auctions
- N. Cesa-Bianchi, C. Gentile, and Y. Mansour, "Regret minimization for reserve prices in second-price auctions, " in ACMSIAM SODA, 2013.
- (2013) Acmsiam Soda
- Cesa-Bianchi, N.¹ Gentile, C.² Mansour, Y.³

25
- 76749148680
- The AdWords problem: Online keyword matching with budgeted bidders under random permutations
- N. R. Devanur and T. P. Hayes, "The AdWords problem: Online keyword matching with budgeted bidders under random permutations, " in 10th ACM EC, 2009, pp. 71-78.
- (2009) 10th ACM EC , pp. 71-78
- Devanur, N.R.¹ Hayes, T.P.²

26
- 78249253781
- technical report, Available from arXiv at
- S. Agrawal, Z. Wang, and Y. Ye, "A dynamic nearoptimal algorithm for online linear programming, " 2009, technical report. Available from arXiv at http://arxiv.org/abs/0911.2974.
- (2009) A Dynamic Nearoptimal Algorithm for Online Linear Programming
- Agrawal, S.¹ Wang, Z.² Ye, Y.³

27
- 79959593176
- Near optimal online algorithms and fast approximation algorithms for resource allocation problems
- N. R. Devanur, K. Jain, B. Sivan, and C. A. Wilkens, "Near optimal online algorithms and fast approximation algorithms for resource allocation problems, " in 12th ACM EC, 2011, pp. 29-38.
- (2011) 12th ACM EC , pp. 29-38
- Devanur, N.R.¹ Jain, K.² Sivan, B.³ Wilkens, C.A.⁴

28
- 78249289102
- Online stochastic packing applied to display ad allocation
- J. Feldman, M. Henzinger, N. Korula, V. S. Mirrokni, and C. Stein, "Online stochastic packing applied to display ad allocation, " in 18th ESA, 2010, pp. 182-194.
- (2010) 18th ESA , pp. 182-194
- Feldman, J.¹ Henzinger, M.² Korula, N.³ Mirrokni, V.S.⁴ Stein, C.⁵

29
- 84883758510
- Geometry of online packing linear programs
- M. Molinaro and R. Ravi, "Geometry of online packing linear programs, " in 39th ICALP, 2012, pp. 701-713.
- (2012) 39th ICALP , pp. 701-713
- Molinaro, M.¹ Ravi, R.²

30
- 84871023469
- The multiplicative weights update method: A meta-algorithm and applications
- S. Arora, E. Hazan, and S. Kale, "The multiplicative weights update method: A meta-algorithm and applications, " Theory of Computing, vol. 8, no. 1, pp. 121-164, 2012.
- (2012) Theory of Computing , vol.8 , Issue.1 , pp. 121-164
- Arora, S.¹ Hazan, E.² Kale, S.³

31
- 42549097902
- Faster and simpler algorithms for multicommodity flow and other fractional packing problems
- N. Garg and J. Könemann, "Faster and simpler algorithms for multicommodity flow and other fractional packing problems, " SIAM J. Computing, vol. 37, no. 2, pp. 630-652, 2007.
- (2007) SIAM J. Computing , vol.37 , Issue.2 , pp. 630-652
- Garg, N.¹ Könemann, J.²

32
- 0000126694
- Fast approximation algorithms for fractional packing and covering problems
- S. A. Plotkin, D. B. Shmoys, and E. Tardos, "Fast approximation algorithms for fractional packing and covering problems, " Mathematics of Operations Research, vol. 20, pp. 257-301, 1995.
- (1995) Mathematics of Operations Research , vol.20 , pp. 257-301
- Plotkin, S.A.¹ Shmoys, D.B.² Tardos, E.³

33
- 70049106076
- Bandits for taxonomies: A Model-based Approach
- S. Pandey, D. Agarwal, D. Chakrabarti, and V. Josifovski, "Bandits for Taxonomies: A Model-based Approach, " in SDM, 2007.
- (2007) SDM
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.