2013, Pages 207-216

Bandits with knapsacks (extended abstract)

Author keywords

Dynamic ad allocation; Dynamic pricing; Dynamic procurement; Exploration exploitation tradeoff; Multi armed bandits; Regret; Stochastic packing

Indexed keywords

DYNAMIC PRICING; EXPLORATION EXPLOITATIONS; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; POLY-LOGARITHMIC FACTORS; PRIMAL DUAL ALGORITHMS; REGRET; STOCHASTIC INTEGER PROGRAMMING;

EID: 84893451322     PISSN: 02725428     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/FOCS.2013.30     Document Type: Conference Paper
Times cited: 266

References (33)
  • 1
    • 0000248624
    • Multi-armed bandits and the Gittins index
    • P. Whittle, "Multi-armed bandits and the Gittins index," J. Royal Statistical Society, Series B, vol. 42, no. 2, pp. 143-149, 1980.
    • (1980) J. Royal Statistical Society, Series B , vol.42 , Issue.2 , pp. 143-149
    • Whittle, P.1
  • 2
    • 0032628612
    • The complexity of optimal queuing network control
    • C. H. Papadimitriou and J. N. Tsitsiklis, "The complexity of optimal queuing network control," Math. Oper. Res., vol. 24, no. 2, pp. 293-305, 1999.
    • (1999) Math. Oper. Res. , vol.24 , Issue.2 , pp. 293-305
    • Papadimitriou, C.H.1    Tsitsiklis, J.N.2
  • 3
    • 0037709910
    • The nonstochastic multiarmed bandit problem
    • preliminary version in 36th IEEE FOCS, 1995
    • P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire, "The nonstochastic multiarmed bandit problem," SIAM J. Comput., vol. 32, no. 1, pp. 48-77, 2002, preliminary version in 36th IEEE FOCS, 1995.
    • (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
    • Auer, P.1    Cesa-Bianchi, N.2    Freund, Y.3    Schapire, R.E.4
  • 4
    • 84871887590
    • Blind network revenue management
    • O. Besbes and A. J. Zeevi, "Blind network revenue management," Operations Research, vol. 60, no. 6, pp. 1537-1550, 2012.
    • (2012) Operations Research , vol.60 , Issue.6 , pp. 1537-1550
    • Besbes, O.1    Zeevi, A.J.2
  • 5
    • 84863507274
    • Learning on a budget: Posted price mechanisms for online procurement
    • A. Badanidiyuru, R. Kleinberg, and Y. Singer, "Learning on a budget: posted price mechanisms for online procurement," in 13th ACM EC, 2012, pp. 128-145.
    • (2012) 13th ACM EC , pp. 128-145
    • Badanidiyuru, A.1    Kleinberg, R.2    Singer, Y.3
  • 6
    • 84893485049
    • Adaptive crowdsourcing algorithms for the bandit survey problem
    • I. Abraham, O. Alonso, V. Kandylas, and A. Slivkins, "Adaptive crowdsourcing algorithms for the bandit survey problem," in 26th COLT, 2013.
    • (2013) 26th COLT
    • Abraham, I.1    Alonso, O.2    Kandylas, V.3    Slivkins, A.4
  • 7
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • preliminary version in 15th ICML, 1998
    • P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Machine Learning, vol. 47, no. 2-3, pp. 235-256, 2002, preliminary version in 15th ICML, 1998.
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 9
    • 0031211090
    • A decision-theoretic generalization of on-line learning and an application to boosting
    • Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting," Journal of Computer and System Sciences, vol. 55, no. 1, pp. 119-139, 1997.
    • (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
    • Freund, Y.1    Schapire, R.E.2
  • 10
    • 70350251174
    • Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms
    • O. Besbes and A. Zeevi, "Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms," Operations Research, vol. 57, pp. 1407-1420, 2009.
    • (2009) Operations Research , vol.57 , pp. 1407-1420
    • Besbes, O.1    Zeevi, A.2
  • 13
    • 0345412655
    • The value of knowing a demand curve: Bounds on regret for online posted-price auctions
    • R. Kleinberg and T. Leighton, "The value of knowing a demand curve: Bounds on regret for online posted-price auctions," in 44th IEEE FOCS, 2003, pp. 594-605.
    • (2003) 44th IEEE FOCS , pp. 594-605
    • Kleinberg, R.1    Leighton, T.2
  • 14
    • 84893043989
    • Truthful incentives in crowdsourcing tasks using regret minimization mechanisms
    • A. Singla and A. Krause, "Truthful incentives in crowdsourcing tasks using regret minimization mechanisms," in 22nd WWW, 2013, pp. 1167-1178.
    • (2013) 22nd WWW , pp. 1167-1178
    • Singla, A.1    Krause, A.2
  • 15
    • 84898072179
    • Stochastic linear optimization under bandit feedback
    • V. Dani, T. P. Hayes, and S. Kakade, "Stochastic Linear Optimization under Bandit Feedback," in 21st COLT, 2008, pp. 355-366.
    • (2008) 21st COLT , pp. 355-366
    • Dani, V.1    Hayes, T.P.2    Kakade, S.3
  • 16
    • 57049185311
    • Multi-armed bandits in metric spaces
    • R. Kleinberg, A. Slivkins, and E. Upfal, "Multi-Armed Bandits in Metric Spaces," in 40th ACM STOC, 2008, pp. 681-690.
    • (2008) 40th ACM STOC , pp. 681-690
    • Kleinberg, R.1    Slivkins, A.2    Upfal, E.3
  • 17
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • T. L. Lai and H. Robbins, "Asymptotically efficient adaptive allocation rules," Adv. in Appl. Math., vol. 6, pp. 4-22, 1985.
    • (1985) Adv. in Appl. Math. , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 18
    • 84874045238
    • Regret analysis of stochastic and nonstochastic multi-armed bandit problems
    • S. Bubeck and N. Cesa-Bianchi, "Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems," Foundations and Trends in Machine Learning, vol. 5, no. 1, pp. 1-122, 2012.
    • (2012) Foundations and Trends in Machine Learning , vol.5 , Issue.1 , pp. 1-122
    • Bubeck, S.1    Cesa-Bianchi, N.2
  • 19
    • 84858998073
    • Multi-armed bandits with metric switching costs
    • S. Guha and K. Munagala, "Multi-armed Bandits with Metric Switching Costs," in 36th ICALP, 2007, pp. 496-507.
    • (2007) 36th ICALP , pp. 496-507
    • Guha, S.1    Munagala, K.2
  • 20
    • 84863332792
    • Approximation algorithms for correlated knapsacks and nonmartingale bandits
    • A. Gupta, R. Krishnaswamy, M. Molinaro, and R. Ravi, "Approximation algorithms for correlated knapsacks and nonmartingale bandits," in 52nd IEEE FOCS, 2011, pp. 827-836.
    • (2011) 52nd IEEE FOCS , pp. 827-836
    • Gupta, A.1    Krishnaswamy, R.2    Molinaro, M.3    Ravi, R.4
  • 23
    • 84868281643
    • Knapsack based optimal policies for budget-limited multiarmed bandits
    • L. Tran-Thanh, A. Chapman, A. Rogers, and N. R. Jennings, "Knapsack based optimal policies for budget-limited multiarmed bandits," in 26th AAAI, 2012, pp. 1134-1140.
    • (2012) 26th AAAI , pp. 1134-1140
    • Tran-Thanh, L.1    Chapman, A.2    Rogers, A.3    Jennings, N.R.4
  • 24
    • 84876049382
    • Regret minimization for reserve prices in second-price auctions
    • N. Cesa-Bianchi, C. Gentile, and Y. Mansour, "Regret minimization for reserve prices in second-price auctions," in ACM-SIAM SODA, 2013.
    • (2013) ACM-SIAM SODA
    • Cesa-Bianchi, N.1    Gentile, C.2    Mansour, Y.3
  • 25
    • 76749148680
    • The AdWords problem: Online keyword matching with budgeted bidders under random permutations
    • N. R. Devanur and T. P. Hayes, "The AdWords problem: Online keyword matching with budgeted bidders under random permutations," in 10th ACM EC, 2009, pp. 71-78.
    • (2009) 10th ACM EC , pp. 71-78
    • Devanur, N.R.1    Hayes, T.P.2
  • 27
    • 79959593176
    • Near optimal online algorithms and fast approximation algorithms for resource allocation problems
    • N. R. Devanur, K. Jain, B. Sivan, and C. A. Wilkens, "Near optimal online algorithms and fast approximation algorithms for resource allocation problems," in 12th ACM EC, 2011, pp. 29-38.
    • (2011) 12th ACM EC , pp. 29-38
    • Devanur, N.R.1    Jain, K.2    Sivan, B.3    Wilkens, C.A.4
  • 28
  • 29
    • 84883758510
    • Geometry of online packing linear programs
    • M. Molinaro and R. Ravi, "Geometry of online packing linear programs," in 39th ICALP, 2012, pp. 701-713.
    • (2012) 39th ICALP , pp. 701-713
    • Molinaro, M.1    Ravi, R.2
  • 30
    • 84871023469
    • The multiplicative weights update method: A meta-algorithm and applications
    • S. Arora, E. Hazan, and S. Kale, "The multiplicative weights update method: A meta-algorithm and applications," Theory of Computing, vol. 8, no. 1, pp. 121-164, 2012.
    • (2012) Theory of Computing , vol.8 , Issue.1 , pp. 121-164
    • Arora, S.1    Hazan, E.2    Kale, S.3
  • 31
    • 42549097902
    • Faster and simpler algorithms for multicommodity flow and other fractional packing problems
    • N. Garg and J. Könemann, "Faster and simpler algorithms for multicommodity flow and other fractional packing problems," SIAM J. Computing, vol. 37, no. 2, pp. 630-652, 2007.
    • (2007) SIAM J. Computing , vol.37 , Issue.2 , pp. 630-652
    • Garg, N.1    Könemann, J.2
  • 32
    • 0000126694
    • Fast approximation algorithms for fractional packing and covering problems
    • S. A. Plotkin, D. B. Shmoys, and E. Tardos, "Fast approximation algorithms for fractional packing and covering problems," Mathematics of Operations Research, vol. 20, pp. 257-301, 1995.
    • (1995) Mathematics of Operations Research , vol.20 , pp. 257-301
    • Plotkin, S.A.1    Shmoys, D.B.2    Tardos, E.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.