SCOPUS 정보 검색 플랫폼

Journal of Machine Learning Research

Volumn 14, Issue 1, 2013, Pages 399-436

Ranked bandits in metric spaces: Learning diverse rankings over large document collections

(3) Slivkins, Aleksandrs a Radlinski, Filip b Gollapudi, Sreenivas a

a MICROSOFT RESEARCH (United States)

b MICROSOFT RESEARCH (United Kingdom)

Author keywords

Clickthrough data; Contextual bandits; Diversity; Metric spaces; Multi armed bandits; Online learning; Regret

Indexed keywords

CLICKTHROUGH DATA; CONTEXTUAL BANDITS; DIVERSITY; METRIC SPACES; MULTI ARMED BANDIT; ONLINE LEARNING; REGRET;

OPTIMIZATION; SET THEORY; TOPOLOGY;

ALGORITHMS;

EID: 84875138796 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (97)

References (52)

1
- 84898063697
- Competing in the dark: An efficient algorithm for bandit linear optimization
- Jacob Abernethy, Elad Hazan, and Alexander Rakhlin. Competing in the dark: An efficient algorithm for bandit linear optimization. In 21th Conf. on Learning Theory (COLT), pages 263-274, 2008.
- (2008) 21th Conf. on Learning Theory (COLT) , pp. 263-274
- Abernethy, J.¹ Hazan, E.² Rakhlin, A.³

2
- 0345224411
- The continuum-armed bandit problem
- Rajeev Agrawal. The continuum-armed bandit problem. SIAM J. Control and Optimization, 33(6):1926-1951, 1995.
- (1995) SIAM J. Control and Optimization , vol.33 , Issue.6 , pp. 1926-1951
- Agrawal, R.¹

3
- 0000363076
- Exchangeability and related topics
- David J. Aldous. Exchangeability and related topics. In Ecole d'Été de Probabilités de Saint-Flour XIII, pages 1-198, 1985.
- (1985) Ecole d'Été de Probabilités de Saint-Flour , vol.13 , pp. 1-198
- Aldous, D.J.¹

4
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- Preliminary version in 41st IEEE FOCS, 2000
- Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. J. of Machine Learning Research (JMLR), 3:397-422, 2002. Preliminary version in 41st IEEE FOCS, 2000.
- (2002) J. of Machine Learning Research (JMLR) , vol.3 , pp. 397-422
- Auer, P.¹

5
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- DOI 10.1023/A:1013689704352, Computational Learning Theory
- Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002a. Preliminary version in 15th ICML, 1998. (Pubitemid 34126111)
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

6
- 0037709910
- The nonstochastic multiarmed bandit problem
- Preliminary version in 36th IEEE FOCS, 1995
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002b. Preliminary version in 36th IEEE FOCS, 1995.
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

7
- 38049040954
- Improved rates for the stochastic continuumarmed bandit problem
- Peter Auer, Ronald Ortner, and Csaba Szepesvári. Improved rates for the stochastic continuumarmed bandit problem. In 20th Conf. on Learning Theory (COLT), pages 454-468, 2007.
- (2007) 20th Conf. on Learning Theory (COLT) , pp. 454-468
- Auer, P.¹ Ortner, R.² Szepesvári, C.³

8
- 35448960376
- Online linear optimization and adaptive routing
- DOI 10.1016/j.jcss.2007.04.016, PII S0022000007000621, Learning Theory 2004
- Baruch Awerbuch and Robert Kleinberg. Online linear optimization and adaptive routing. J. of Computer and System Sciences, 74(1):97-114, February 2008. Preliminary version in 36th ACM STOC, 2004. (Pubitemid 47625408)
- (2008) Journal of Computer and System Sciences , vol.74 , Issue.1 , pp. 97-114
- Awerbuch, B.¹ Kleinberg, R.²

9
- 84875207263
- Probabilistic approximations of metric spaces and its algorithmic applications
- Yair Bartal. Probabilistic approximations of metric spaces and its algorithmic applications. In IEEE Symp. on Foundations of Computer Science (FOCS), 1996.
- (1996) IEEE Symp. on Foundations of Computer Science (FOCS)
- Bartal, Y.¹

10
- 38049057924
- Bandit problems
- Steven Durlauf and Larry Blume, editors, 2nd ed. Macmillan Press
- Dirk Bergemann and Juuso Välimäki. Bandit problems. In Steven Durlauf and Larry Blume, editors, The New Palgrave Dictionary of Economics, 2nd ed. Macmillan Press, 2006.
- (2006) The New Palgrave Dictionary of Economics
- Bergemann, D.¹ Välimäki, J.²

11
- 84874045238
- Regret analysis of stochastic and nonstochastic multiarmed bandit problems
- Sébastien Bubeck and Nicolo Cesa-Bianchi. Regret analysis of stochastic and nonstochastic multiarmed bandit problems. Foundations and Trends in Machine Learning (Draft under submission), 2012. Available at www.princeton. edu/?sbubeck/pub.html.
- (2012) Foundations and Trends in Machine Learning (Draft Under submission)
- Bubeck, S.¹ Cesa-Bianchi, N.²

12
- 84888141227
- Open loop optimistic planning
- Sébastien Bubeck and Rémi Munos. Open loop optimistic planning. In 23rd Conf. on Learning Theory (COLT), pages 477-489, 2010.
- (2010) 23rd Conf. on Learning Theory (COLT) , pp. 477-489
- Bubeck, S.¹ Munos, R.²

13
- 84860634388
- Online optimization in xarmed bandits
- Preliminary version in NIPS 2008
- Sébastien Bubeck, Rémi Munos, Gilles Stoltz, and Csaba Szepesvari. Online optimization in xarmed bandits. J. of Machine Learning Research (JMLR), 12:1587-1627, 2011. Preliminary version in NIPS 2008.
- (2011) J. of Machine Learning Research (JMLR) , vol.12 , pp. 1587-1627
- Bubeck, S.¹ Munos, R.² Stoltz, G.³ Szepesvari, C.⁴

14
- 31844446958
- Learning to rank using gradient descent
- DOI 10.1145/1102351.1102363, ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning
- Christopher J. C. Burges, Tal Shaked, Erin Renshaw, Ari Lazier, Matt Deeds, Nicole Hamilton, and Gregory N. Hullender. Learning to rank using gradient descent. In Intl. Conf. on Machine Learning (ICML), pages 89-96, 2005. (Pubitemid 43183320)
- (2005) ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning , pp. 89-96
- Burges, C.¹ Shaked, T.² Renshaw, E.³ Lazier, A.⁴ Deeds, M.⁵ Hamilton, N.⁶ Hullender, G.⁷

15
- 0032270694
- The use of MMR, diversity-based reranking for reordering documents and producing summaries
- Jaime G. Carbonell and Jade Goldstein. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In ACM Intl. Conf. on Research and Development in Information Retrieval (SIGIR), pages 335-336, 1998.
- (1998) ACM Intl. Conf. on Research and Development in Information Retrieval (SIGIR) , pp. 335-336
- Carbonell, J.G.¹ Goldstein, J.²

16
- 84926078662
- Cambridge Univ. Press
- Nicolò Cesa-Bianchi and Gábor Lugosi. Prediction, Learning, and Games. Cambridge Univ. Press, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

17
- 21844453228
- Gaussian processes for ordinal regression
- Wei Chu and Zoubin Ghahramani. Gaussian processes for ordinal regression. J. of Machine Learning Research, 6:1019-1041, 2005.
- (2005) J. of Machine Learning Research , vol.6 , pp. 1019-1041
- Chu, W.¹ Ghahramani, Z.²

18
- 84888139304
- Contextual bandits with linear payoff functions
- Wei Chu, Lihong Li, Lev Reyzin, and Robert E. Schapire. Contextual bandits with linear payoff functions. In 14th Intl. Conf. on Artificial Intelligence and Statistics (AISTATS), 2011.
- (2011) 14th Intl. Conf. on Artificial Intelligence and Statistics (AISTATS)
- Chu, W.¹ Li, L.² Reyzin, L.³ Schapire, R.E.⁴

19
- 70349295143
- The price of bandit information for online optimization
- Varsha Dani, Thomas P. Hayes, and Sham Kakade. The price of bandit information for online optimization. In 20th Advances in Neural Information Processing Systems (NIPS), 2007.
- (2007) 20th Advances in Neural Information Processing Systems (NIPS)
- Dani, V.¹ Hayes, T.P.² Kakade, S.³

20
- 4544291996
- A tight bound on approximating arbitrary metrics by tree metrics
- Jittat Fakcharoenphol, Satish Rao, and Kunal Talwar. A tight bound on approximating arbitrary metrics by tree metrics. J. of Computer and System Sciences, 69(3):485-497, 2004.
- (2004) J. of Computer and System Sciences , vol.69 , Issue.3 , pp. 485-497
- Fakcharoenphol, J.¹ Rao, S.² Talwar, K.³

21
- 20744454447
- Online convex optimization in the bandit setting: Gradient descent without a gradient
- Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms
- Abraham Flaxman, Adam Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: Gradient descent without a gradient. In 16th ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 385-394, 2005. (Pubitemid 40851394)
- (2005) Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms , pp. 385-394
- Flaxman, A.D.¹ Kalai, A.T.² McMahan, H.B.³

22
- 79957966922
- Online learning of assignments
- Daniel Golovin, Andreas Krause, and Matthew Streeter. Online learning of assignments. In Advances in Neural Information Processing Systems (NIPS), 2009.
- (2009) Advances in Neural Information Processing Systems (NIPS)
- Golovin, D.¹ Krause, A.² Streeter, M.³

23
- 0344550482
- Bounded geometries, fractals, and low-distortion embeddings
- Anupam Gupta, Robert Krauthgamer, and James R. Lee. Bounded geometries, fractals, and low-distortion embeddings. In IEEE Symp. on Foundations of Computer Science (FOCS), 2003.
- (2003) IEEE Symp. on Foundations of Computer Science (FOCS)
- Gupta, A.¹ Krauthgamer, R.² Lee, J.R.³

24
- 70349128132
- Better algorithms for benign bandits
- Elad Hazan and Satyen Kale. Better algorithms for benign bandits. In 20th ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 38-47, 2009.
- (2009) 20th ACM-SIAM Symp. on Discrete Algorithms (SODA) , pp. 38-47
- Hazan, E.¹ Kale, S.²

25
- 38048999685
- Online learning with prior information
- Elad Hazan and Nimrod Megiddo. Online learning with prior information. In 20th Conf. on Learning Theory (COLT), pages 499-513, 2007.
- (2007) 20th Conf. on Learning Theory (COLT) , pp. 499-513
- Hazan, E.¹ Megiddo, N.²

26
- 0242456822
- Optimizing search engines using clickthrough data
- Thorsten Joachims. Optimizing search engines using clickthrough data. In 8th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (KDD), pages 133-142, 2002.
- (2002) 8th ACM SIGKDD Intl. Conf. on Knowledge Discovery and Data Mining (KDD) , pp. 133-142
- Joachims, T.¹

27
- 85162455616
- Non-stochastic bandit slate problems
- Satyen Kale, Lev Reyzin, and Robert E. Schapire. Non-stochastic bandit slate problems. In 24th Advances in Neural Information Processing Systems (NIPS), pages 1054-1062, 2010.
- (2010) 24th Advances in Neural Information Processing Systems (NIPS) , pp. 1054-1062
- Kale, S.¹ Reyzin, L.² Schapire, R.E.³

28
- 84898981061
- Nearly tight bounds for the continuum-armed bandit problem
- Robert Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In 18th Advances in Neural Information Processing Systems (NIPS), 2004.
- (2004) 18th Advances in Neural Information Processing Systems (NIPS)
- Kleinberg, R.¹

29
- 77951694424
- Sharp dichotomies for regret minimization in metric spaces
- Robert Kleinberg and Aleksandrs Slivkins. Sharp dichotomies for regret minimization in metric spaces. In 21st ACM-SIAM Symp. on Discrete Algorithms (SODA), 2010.
- (2010) 21st ACM-SIAM Symp. on Discrete Algorithms (SODA)
- Kleinberg, R.¹ Slivkins, A.²

30
- 84862291603
- Regret bounds for sleeping experts and bandits
- Robert Kleinberg, Alexandru Niculescu-Mizil, and Yogeshwer Sharma. Regret bounds for sleeping experts and bandits. In 21st Conf. on Learning Theory (COLT), pages 425-436, 2008a.
- (2008) 21st Conf. on Learning Theory (COLT) , pp. 425-436
- Kleinberg, R.¹ Niculescu-Mizil, A.² Sharma, Y.³

31
- 57049185311
- Multi-armed bandits in metric spaces
- Robert Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Multi-armed bandits in metric spaces. In 40th ACM Symp. on Theory of Computing (STOC), pages 681-690, 2008b.
- (2008) 40th ACM Symp. on Theory of Computing (STOC) , pp. 681-690
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

32
- 33750293964
- Bandit based Monte-Carlo planning
- Machine Learning: ECML 2006 - 17th European Conference on Machine Learning, Proceedings LNAI
- Levente Kocsis and Csaba Szepesvari. Bandit based Monte-Carlo planning. In 17th European Conf. on Machine Learning (ECML), pages 282-293, 2006. (Pubitemid 44618839)
- (2006) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) , vol.4212 , pp. 282-293
- Kocsis, L.¹ Szepesvari, C.²

33
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Tze Leung Lai and Herbert Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

34
- 77956144722
- The epoch-greedy algorithm for contextual multi-armed bandits
- John Langford and Tong Zhang. The epoch-greedy algorithm for contextual multi-armed bandits. In 21st Advances in Neural Information Processing Systems (NIPS), 2007.
- (2007) 21st Advances in Neural Information Processing Systems (NIPS)
- Langford, J.¹ Zhang, T.²

35
- 77954641643
- A contextual-bandit approach to personalized news article recommendation
- Lihong Li, Wei Chu, John Langford, and Robert E. Schapire. A contextual-bandit approach to personalized news article recommendation. In 19th Intl. World Wide Web Conf. (WWW), 2010.
- (2010) 19th Intl. World Wide Web Conf. (WWW)
- Li, L.¹ Chu, W.² Langford, J.³ Schapire, R.E.⁴

36
- 79952384747
- Unbiased offline evaluation of contextualbandit-based news article recommendation algorithms
- Lihong Li, Wei Chu, John Langford, and Xuanhui Wang. Unbiased offline evaluation of contextualbandit-based news article recommendation algorithms. In 4th ACM Intl. Conf. on Web Search and Data Mining (WSDM), 2011.
- (2011) 4th ACM Intl. Conf. on Web Search and Data Mining (WSDM)
- Li, L.¹ Chu, W.² Langford, J.³ Wang, X.⁴

37
- 84898452145
- Showing relevant ads via Lipschitz context multi-armed bandits
- Tyler Lu, DáSvid Pál, and Martin Pál. Showing relevant ads via Lipschitz context multi-armed bandits. In 14th Intl. Conf. on Artificial Intelligence and Statistics (AISTATS), 2010.
- (2010) 14th Intl. Conf. on Artificial Intelligence and Statistics (AISTATS)
- Lu, T.¹ Pál, D.² Pál, M.³

38
- 78049368895
- Online learning in adversarial lipschitz environments
- Odalric-Ambrym Maillard and Rémi Munos. Online learning in adversarial lipschitz environments. In European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), pages 305-320, 2010.
- (2010) European Conf. on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) , pp. 305-320
- Maillard, O.-A.¹ Munos, R.²

39
- 70349275222
- Bandit algorithms for tree search
- Rémi Munos and Pierre-Arnaud Coquelin. Bandit algorithms for tree search. In 23rd Conf. on Uncertainty in Artificial Intelligence (UAI), 2007.
- (2007) 23rd Conf. on Uncertainty in Artificial Intelligence (UAI)
- Munos, R.¹ Coquelin, P.²

40
- 70049106076
- Bandits for taxonomies: A model-based approach
- Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, and Vanja Josifovski. Bandits for taxonomies: A model-based approach. In SIAM Intl. Conf. on Data Mining (SDM), 2007.
- (2007) SIAM Intl. Conf. on Data Mining (SDM)
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

41
- 56449088596
- Learning diverse rankings with multiarmed bandits
- Filip Radlinski, Robert Kleinberg, and Thorsten Joachims. Learning diverse rankings with multiarmed bandits. In 25th Intl. Conf. on Machine Learning (ICML), pages 784-791, 2008.
- (2008) 25th Intl. Conf. on Machine Learning (ICML) , pp. 784-791
- Radlinski, F.¹ Kleinberg, R.² Joachims, T.³

42
- 80053440857
- Nonparametric bandits with covariates
- Philippe Rigollet and Assaf Zeevi. Nonparametric bandits with covariates. In 23rd Conf. on Learning Theory (COLT), pages 54-66, 2010.
- (2010) 23rd Conf. on Learning Theory (COLT) , pp. 54-66
- Rigollet, P.¹ Zeevi, A.²

43
- 84892931731
- Has been published in 24th COLT 2011
- Aleksandrs Slivkins. Contextual bandits with similarity information. http://arxiv.org/abs/0907.3986, 2009. Has been published in 24th COLT 2011.
- (2009) Contextual Bandits with Similarity Information
- Slivkins, A.¹

44
- 85162320142
- Multi-armed bandits on implicit metric spaces
- Aleksandrs Slivkins. Multi-armed bandits on implicit metric spaces. In 25th Advances in Neural Information Processing Systems (NIPS), 2011.
- (2011) 25th Advances in Neural Information Processing Systems (NIPS)
- Slivkins, A.¹

45
- 77956501313
- Gaussian process optimization in the bandit setting: No regret and experimental design
- Niranjan Srinivas, Andreas Krause, Sham Kakade, and Matthias Seeger. Gaussian process optimization in the bandit setting: No regret and experimental design. In 27th Intl. Conf. on Machine Learning (ICML), pages 1015-1022, 2010.
- (2010) 27th Intl. Conf. on Machine Learning (ICML) , pp. 1015-1022
- Srinivas, N.¹ Krause, A.² Kakade, S.³ Seeger, M.⁴

46
- 84858777403
- An online algorithm for maximizing submodular functions
- Matthew Streeter and Daniel Golovin. An online algorithm for maximizing submodular functions. In Advances in Neural Information Processing Systems (NIPS), pages 1577-1584, 2008.
- (2008) Advances in Neural Information Processing Systems (NIPS) , pp. 1577-1584
- Streeter, M.¹ Golovin, D.²

47
- 34548750873
- Generalized bandit problems
- David Austen-Smith and John Duggan, editors, Springer, First appeared as Working Paper, Stern School of Business
- Rangarajan K. Sundaram. Generalized bandit problems. In David Austen-Smith and John Duggan, editors, Social Choice and Strategic Decisions: Essays in Honor of Jeffrey S. Banks (Studies in Choice and Welfare), pages 131-162. Springer, 2005. First appeared as Working Paper, Stern School of Business, 2003.
- (2003) Social Choice and Strategic Decisions: Essays in Honor of Jeffrey S. Banks (Studies in Choice and Welfare) , pp. 131-162
- Sundaram, R.K.¹

48
- 42549161120
- Softrank: Optimizing nonsmooth rank metrics
- Michael J. Taylor, John Guiver, Stephen Robertson, and Tom Minka. Softrank: Optimizing nonsmooth rank metrics. In ACM Intl. Conf. on Web Search and Data Mining (WSDM), pages 77-86, 2008.
- (2008) ACM Intl. Conf. on Web Search and Data Mining (WSDM) , pp. 77-86
- Taylor, M.J.¹ Guiver, J.² Robertson, S.³ Minka, T.⁴

49
- 78249288447
- Algorithms for adversarial bandit problems with multiple plays
- Taishi Uchiya, Atsuyoshi Nakamura, and Mineichi Kudo. Algorithms for adversarial bandit problems with multiple plays. In 21st Intl. Conf. on Algorithmic Learning Theory (ALT), pages 375-389, 2010.
- (2010) 21st Intl. Conf. on Algorithmic Learning Theory (ALT) , pp. 375-389
- Uchiya, T.¹ Nakamura, A.² Kudo, M.³

50
- 15844389867
- Bandit problems with side observations
- Chih-Chun Wang, Sanjeev R. Kulkarni, and H. Vincent Poor. Bandit problems with side observations. IEEE Trans. on Automatic Control, 50(3):338355, 2005.
- (2005) IEEE Trans. on Automatic Control , vol.50 , Issue.3 , pp. 338355
- Wang, C.-C.¹ Kulkarni, S.R.² Vincent Poor, H.³

51
- 84863381440
- Algorithms for infinitely many-armed bandits
- Yizao Wang, Jean-Yves Audibert, and Rémi Munos. Algorithms for infinitely many-armed bandits. In Advances in Neural Information Processing Systems (NIPS), pages 1729-1736, 2008.
- (2008) Advances in Neural Information Processing Systems (NIPS) , pp. 1729-1736
- Wang, Y.¹ Audibert, J.² Munos, R.³

52
- 0001631327
- A one-armed bandit problem with a concomitant variable
- Michael Woodroofe. A one-armed bandit problem with a concomitant variable. J. Amer. Statist. Assoc., 74(368), 1979.
- (1979) J. Amer. Statist. Assoc. , vol.74 , Issue.368
- Woodroofe, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.