-
1
-
-
0000248624
-
Multi-armed bandits and the Gittins index
-
P. Whittle, "Multi-armed bandits and the Gittins index," J. Royal Statistical Society, Series B, vol. 42, no. 2, pp. 143-149, 1980.
-
(1980)
J. Royal Statistical Society, Series B
, vol.42
, Issue.2
, pp. 143-149
-
-
Whittle, P.1
-
2
-
-
0032628612
-
The complexity of optimal queuing network control
-
C. H. Papadimitriou and J. N. Tsitsiklis, "The complexity of optimal queuing network control," Math. Oper. Res., vol. 24, no. 2, pp. 293-305, 1999.
-
(1999)
Math. Oper. Res.
, vol.24
, Issue.2
, pp. 293-305
-
-
Papadimitriou, C.H.1
Tsitsiklis, J.N.2
-
3
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
preliminary version in 36th IEEE FOCS, 1995
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire, "The nonstochastic multiarmed bandit problem," SIAM J. Comput., vol. 32, no. 1, pp. 48-77, 2002, preliminary version in 36th IEEE FOCS, 1995.
-
(2002)
SIAM J. Comput.
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
4
-
-
84871887590
-
Blind network revenue management
-
O. Besbes and A. J. Zeevi, "Blind network revenue management," Operations Research, vol. 60, no. 6, pp. 1537-1550, 2012.
-
(2012)
Operations Research
, vol.60
, Issue.6
, pp. 1537-1550
-
-
Besbes, O.1
Zeevi, A.J.2
-
5
-
-
84863507274
-
Learning on a budget: Posted price mechanisms for online procurement
-
A. Badanidiyuru, R. Kleinberg, and Y. Singer, "Learning on a budget: posted price mechanisms for online procurement," in 13th ACM EC, 2012, pp. 128-145.
-
(2012)
13th ACM EC
, pp. 128-145
-
-
Badanidiyuru, A.1
Kleinberg, R.2
Singer, Y.3
-
6
-
-
84893485049
-
Adaptive crowdsourcing algorithms for the bandit survey problem
-
I. Abraham, O. Alonso, V. Kandylas, and A. Slivkins, "Adaptive crowdsourcing algorithms for the bandit survey problem," in 26th COLT, 2013.
-
(2013)
26th COLT
-
-
Abraham, I.1
Alonso, O.2
Kandylas, V.3
Slivkins, A.4
-
7
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
preliminary version in 15th ICML, 1998
-
P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Machine Learning, vol. 47, no. 2-3, pp. 235-256, 2002, preliminary version in 15th ICML, 1998.
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
8
-
-
35148838877
-
The weighted majority algorithm
-
N. Littlestone and M. K. Warmuth, "The weighted majority algorithm," Information and Computation, vol. 108, no. 2, pp. 212-260, 1994.
-
(1994)
Information and Computation
, vol.108
, Issue.2
, pp. 212-260
-
-
Littlestone, N.1
Warmuth, M.K.2
-
9
-
-
0031211090
-
A decision-theoretic generalization of on-line learning and an application to boosting
-
Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting," Journal of Computer and System Sciences, vol. 55, no. 1, pp. 119-139, 1997.
-
(1997)
Journal of Computer and System Sciences
, vol.55
, Issue.1
, pp. 119-139
-
-
Freund, Y.1
Schapire, R.E.2
-
10
-
-
70350251174
-
Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms
-
O. Besbes and A. Zeevi, "Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms," Operations Research, vol. 57, pp. 1407-1420, 2009.
-
(2009)
Operations Research
, vol.57
, pp. 1407-1420
-
-
Besbes, O.1
Zeevi, A.2
-
11
-
-
84863515685
-
Dynamic pricing with limited supply
-
M. Babaioff, S. Dughmi, R. Kleinberg, and A. Slivkins, "Dynamic pricing with limited supply," in 13th ACM EC, 2012.
-
(2012)
13th ACM EC
-
-
Babaioff, M.1
Dughmi, S.2
Kleinberg, R.3
Slivkins, A.4
-
12
-
-
0037740018
-
Online learning in online auctions
-
A. Blum, V. Kumar, A. Rudra, and F. Wu, "Online learning in online auctions," in 14th ACM-SIAM SODA, 2003, pp. 202-204.
-
(2003)
14th ACM-SIAM SODA
, pp. 202-204
-
-
Blum, A.1
Kumar, V.2
Rudra, A.3
Wu, F.4
-
13
-
-
0345412655
-
The value of knowing a demand curve: Bounds on regret for online posted-price auctions
-
R. Kleinberg and T. Leighton, "The value of knowing a demand curve: Bounds on regret for online posted-price auctions," in 44th IEEE FOCS, 2003, pp. 594-605.
-
(2003)
44th IEEE FOCS
, pp. 594-605
-
-
Kleinberg, R.1
Leighton, T.2
-
14
-
-
84893043989
-
Truthful incentives in crowdsourcing tasks using regret minimization mechanisms
-
A. Singla and A. Krause, "Truthful incentives in crowdsourcing tasks using regret minimization mechanisms," in 22nd WWW, 2013, pp. 1167-1178.
-
(2013)
22nd WWW
, pp. 1167-1178
-
-
Singla, A.1
Krause, A.2
-
15
-
-
84898072179
-
Stochastic linear optimization under bandit feedback
-
V. Dani, T. P. Hayes, and S. Kakade, "Stochastic Linear Optimization under Bandit Feedback," in 21st COLT, 2008, pp. 355-366.
-
(2008)
21st COLT
, pp. 355-366
-
-
Dani, V.1
Hayes, T.P.2
Kakade, S.3
-
16
-
-
57049185311
-
Multi-armed bandits in metric spaces
-
R. Kleinberg, A. Slivkins, and E. Upfal, "Multi-Armed Bandits in Metric Spaces," in 40th ACM STOC, 2008, pp. 681-690.
-
(2008)
40th ACM STOC
, pp. 681-690
-
-
Kleinberg, R.1
Slivkins, A.2
Upfal, E.3
-
17
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T. L. Lai and H. Robbins, "Asymptotically efficient adaptive allocation rules," Adv. in Appl. Math., vol. 6, pp. 4-22, 1985.
-
(1985)
Adv. in Appl. Math.
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
18
-
-
84874045238
-
Regret analysis of stochastic and nonstochastic multi-armed bandit problems
-
S. Bubeck and N. Cesa-Bianchi, "Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems," Foundations and Trends in Machine Learning, vol. 5, no. 1, pp. 1-122, 2012.
-
(2012)
Foundations and Trends in Machine Learning
, vol.5
, Issue.1
, pp. 1-122
-
-
Bubeck, S.1
Cesa-Bianchi, N.2
-
19
-
-
84858998073
-
Multi-armed bandits with metric switching costs
-
S. Guha and K. Munagala, "Multi-armed Bandits with Metric Switching Costs," in 36th ICALP, 2007, pp. 496-507.
-
(2007)
36th ICALP
, pp. 496-507
-
-
Guha, S.1
Munagala, K.2
-
20
-
-
84863332792
-
Approximation algorithms for correlated knapsacks and nonmartingale bandits
-
A. Gupta, R. Krishnaswamy, M. Molinaro, and R. Ravi, "Approximation algorithms for correlated knapsacks and nonmartingale bandits," in 52nd IEEE FOCS, 2011, pp. 827-836.
-
(2011)
52nd IEEE FOCS
, pp. 827-836
-
-
Gupta, A.1
Krishnaswamy, R.2
Molinaro, M.3
Ravi, R.4
-
22
-
-
77958583895
-
ε-first policies for budget-limited multiarmed bandits
-
L. Tran-Thanh, A. Chapman, E. M. de Cote, A. Rogers, and N. R. Jennings, "ε-first policies for budget-limited multiarmed bandits," in 24th AAAI, 2010, pp. 1211-1216.
-
(2010)
24th AAAI
, pp. 1211-1216
-
-
Tran-Thanh, L.1
Chapman, A.2
De Cote, E.M.3
Rogers, A.4
Jennings, N.R.5
-
23
-
-
84868281643
-
Knapsack based optimal policies for budget-limited multiarmed bandits
-
L. Tran-Thanh, A. Chapman, A. Rogers, and N. R. Jennings, "Knapsack based optimal policies for budget-limited multiarmed bandits," in 26th AAAI, 2012, pp. 1134-1140.
-
(2012)
26th AAAI
, pp. 1134-1140
-
-
Tran-Thanh, L.1
Chapman, A.2
Rogers, A.3
Jennings, N.R.4
-
24
-
-
84876049382
-
Regret minimization for reserve prices in second-price auctions
-
N. Cesa-Bianchi, C. Gentile, and Y. Mansour, "Regret minimization for reserve prices in second-price auctions," in ACM-SIAM SODA, 2013.
-
(2013)
ACM-SIAM SODA
-
-
Cesa-Bianchi, N.1
Gentile, C.2
Mansour, Y.3
-
25
-
-
76749148680
-
The AdWords problem: Online keyword matching with budgeted bidders under random permutations
-
N. R. Devanur and T. P. Hayes, "The AdWords problem: Online keyword matching with budgeted bidders under random permutations," in 10th ACM EC, 2009, pp. 71-78.
-
(2009)
10th ACM EC
, pp. 71-78
-
-
Devanur, N.R.1
Hayes, T.P.2
-
27
-
-
79959593176
-
Near optimal online algorithms and fast approximation algorithms for resource allocation problems
-
N. R. Devanur, K. Jain, B. Sivan, and C. A. Wilkens, "Near optimal online algorithms and fast approximation algorithms for resource allocation problems," in 12th ACM EC, 2011, pp. 29-38.
-
(2011)
12th ACM EC
, pp. 29-38
-
-
Devanur, N.R.1
Jain, K.2
Sivan, B.3
Wilkens, C.A.4
-
28
-
-
78249289102
-
Online stochastic packing applied to display ad allocation
-
J. Feldman, M. Henzinger, N. Korula, V. S. Mirrokni, and C. Stein, "Online stochastic packing applied to display ad allocation," in 18th ESA, 2010, pp. 182-194.
-
(2010)
18th ESA
, pp. 182-194
-
-
Feldman, J.1
Henzinger, M.2
Korula, N.3
Mirrokni, V.S.4
Stein, C.5
-
29
-
-
84883758510
-
Geometry of online packing linear programs
-
M. Molinaro and R. Ravi, "Geometry of online packing linear programs," in 39th ICALP, 2012, pp. 701-713.
-
(2012)
39th ICALP
, pp. 701-713
-
-
Molinaro, M.1
Ravi, R.2
-
30
-
-
84871023469
-
The multiplicative weights update method: A meta-algorithm and applications
-
S. Arora, E. Hazan, and S. Kale, "The multiplicative weights update method: A meta-algorithm and applications," Theory of Computing, vol. 8, no. 1, pp. 121-164, 2012.
-
(2012)
Theory of Computing
, vol.8
, Issue.1
, pp. 121-164
-
-
Arora, S.1
Hazan, E.2
Kale, S.3
-
31
-
-
42549097902
-
Faster and simpler algorithms for multicommodity flow and other fractional packing problems
-
N. Garg and J. Könemann, "Faster and simpler algorithms for multicommodity flow and other fractional packing problems," SIAM J. Computing, vol. 37, no. 2, pp. 630-652, 2007.
-
(2007)
SIAM J. Computing
, vol.37
, Issue.2
, pp. 630-652
-
-
Garg, N.1
Könemann, J.2
-
32
-
-
0000126694
-
Fast approximation algorithms for fractional packing and covering problems
-
S. A. Plotkin, D. B. Shmoys, and E. Tardos, "Fast approximation algorithms for fractional packing and covering problems," Mathematics of Operations Research, vol. 20, pp. 257-301, 1995.
-
(1995)
Mathematics of Operations Research
, vol.20
, pp. 257-301
-
-
Plotkin, S.A.1
Shmoys, D.B.2
Tardos, E.3
-
33
-
-
70049106076
-
Bandits for taxonomies: A model-based approach
-
S. Pandey, D. Agarwal, D. Chakrabarti, and V. Josifovski, "Bandits for Taxonomies: A Model-based Approach," in SDM, 2007.
-
(2007)
SDM
-
-
Pandey, S.1
Agarwal, D.2
Chakrabarti, D.3
Josifovski, V.4