2. Alekh Agarwal, Daniel Hsu, Satyen Kale, John Langford, Lihong Li, and Robert Schapire. Taming the monster: A fast and simple algorithm for contextual bandits. In 31st Intl. Conf. on Machine Learning (ICML), 2014.
3. Shipra Agrawal and Nikhil R. Devanur. Bandits with concave rewards and convex knapsacks. In 15th ACM Conf. on Economics and Computation (EC), 2014.
4. Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. J. of Machine Learning Research (JMLR), 3:397-422, 2002. Preliminary version in 41st IEEE FOCS, 2000.
5. Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002. Preliminary version in 36th IEEE FOCS, 1995.
10. Omar Besbes and Assaf Zeevi. Dynamic pricing without knowing the demand function: Risk bounds and near-optimal algorithms. Operations Research, 57:1407-1420, 2009.
11. Omar Besbes and Assaf J. Zeevi. Blind network revenue management. Operations Research, 60(6):1537-1550, 2012.
12. Alina Beygelzimer, John Langford, Lihong Li, Lev Reyzin, and Robert E. Schapire. Efficient optimal learning for contextual bandits. In 14th Intl. Conf. on Artificial Intelligence and Statistics (AISTATS), 2011.
13. Sébastien Bubeck and Nicolò Cesa-Bianchi. Regret analysis of stochastic and nonstochastic multiarmed bandit problems. Foundations and Trends in Machine Learning, 5(1):1-122, 2012.
15. Nikhil Devanur and Vijay Vazirani. The spending constraint model for market equilibrium: Algorithmic, existence and uniqueness results. In 36th ACM Symp. on Theory of Computing (STOC), 2004.
16. Nikhil R. Devanur and Thomas P. Hayes. The AdWords problem: Online keyword matching with budgeted bidders under random permutations. In 10th ACM Conf. on Electronic Commerce (EC), pages 71-78, 2009.
17. Nikhil R. Devanur, Kamal Jain, Balasubramanian Sivan, and Christopher A. Wilkens. Near optimal online algorithms and fast approximation algorithms for resource allocation problems. In 12th ACM Conf. on Electronic Commerce (EC), pages 29-38, 2011.
18. Miroslav Dudik, Daniel Hsu, Satyen Kale, Nikos Karampatziakis, John Langford, Lev Reyzin, and Tong Zhang. Efficient optimal learning for contextual bandits. In 27th Conf. on Uncertainty in Artificial Intelligence (UAI), 2011.
19. D. A. Freedman. On tail probabilities for martingales. The Annals of Probability, 3:100-118, 1975.
22. Sudipta Guha, Kamesh Munagala, and Peng Shi. Approximation algorithms for restless bandit problems, 2010. Combined final version of papers in IEEE FOCS 2007 and ACM-SIAM SODA 2009.
23. Anupam Gupta, Ravishankar Krishnaswamy, Marco Molinaro, and R. Ravi. Approximation algorithms for correlated knapsacks and non-martingale bandits. In 52nd IEEE Symp. on Foundations of Computer Science (FOCS), pages 827-836, 2011.
24. András György, Levente Kocsis, Ivett Szabó, and Csaba Szepesvári. Continuous time associative bandit problems. In 20th Intl. Joint Conf. on Artificial Intelligence (IJCAI), pages 830-835, 2007.
29. Adish Singla and Andreas Krause. Truthful incentives in crowdsourcing tasks using regret minimization mechanisms. In 22nd Intl. World Wide Web Conf. (WWW), pages 1167-1178, 2013.
30. Maurice Sion. On general minimax theorems. Pac. J. Math., 8:171-176, 1958.
31. Aleksandrs Slivkins. Contextual bandits with similarity information. In 24th Conf. on Learning Theory (COLT), 2011. To appear in J. of Machine Learning Research (JMLR), 2014.
33. Aleksandrs Slivkins and Jennifer Wortman Vaughan. Online decision making in crowdsourcing markets: Theoretical challenges. SIGecom Exchanges, 12(2), December 2013. Position paper and survey.
34. William R. Thompson. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika, 25(3-4):285-294, 1933.
35. Long Tran-Thanh, Archie Chapman, Enrique Munoz De Cote, Alex Rogers, and Nicholas R. Jennings. ε-first policies for budget-limited multi-armed bandits. In 24th AAAI Conference on Artificial Intelligence (AAAI), pages 1211-1216, 2010.
36. Long Tran-Thanh, Archie Chapman, Alex Rogers, and Nicholas R. Jennings. Knapsack based optimal policies for budget-limited multi-armed bandits. In 26th AAAI Conference on Artificial Intelligence (AAAI), pages 1134-1140, 2012.