SCOPUS Information Retrieval Platform

Proceedings of the Annual ACM-SIAM Symposium on Discrete Algorithms

Volume , Issue , 2010, Pages 827-846

Sharp dichotomies for regret minimization in metric spaces

(2) Kleinberg, Robert (a); Slivkins, Aleksandrs (b)

a School of Operations Research and Information Engineering (United States)

b Microsoft Research (United States)

Author keywords

[No Author keywords available]

Indexed keywords

PROBABILITY; SET THEORY;

FINITE METRIC SPACES; MULTI ARMED BANDIT; MULTI-ARMED BANDIT PROBLEM; ONLINE LEARNING; REGRET MINIMIZATION; SIDE INFORMATION; TOPOLOGICAL NOTIONS; UPPER AND LOWER BOUNDS;

TOPOLOGY;

EID: 77951694424 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1137/1.9781611973075.68 Document Type: Conference Paper

Times cited : (32)

References (34)

1
- 0345224411
- The continuum-armed bandit problem
- Rajeev Agrawal. The continuum-armed bandit problem. SIAM J. Control and Optimization, 33(6):1926-1951, 1995.
- (1995) SIAM J. Control and Optimization , vol.33 , Issue.6 , pp. 1926-1951
- Agrawal, R.¹

2
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- Preliminary version in 41st IEEE FOCS, 2000
- Peter Auer. Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learning Research, 3:397-422, 2002. Preliminary version in 41st IEEE FOCS, 2000.
- (2002) J. Machine Learning Research , vol.3 , pp. 397-422
- Auer, P.¹

3
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Preliminary version in 15th ICML, 1998
- Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002. Preliminary version in 15th ICML, 1998.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

4
- 0037709910
- The nonstochastic multiarmed bandit problem
- Preliminary version in 36th IEEE FOCS, 1995
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002. Preliminary version in 36th IEEE FOCS, 1995.
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

5
- 38049040954
- Improved Rates for the Stochastic Continuum-Armed Bandit Problem
- Peter Auer, Ronald Ortner, and Csaba Szepesvári. Improved Rates for the Stochastic Continuum-Armed Bandit Problem. In 20th Conference on Learning Theory (COLT), pages 454-468, 2007.
- (2007) 20th Conference on Learning Theory (COLT) , pp. 454-468
- Auer, P.¹ Ortner, R.² Szepesvári, C.³

6
- 35448960376
- Online linear optimization and adaptive routing
- Preliminary version appeared in 36th ACM STOC, 2004
- Baruch Awerbuch and Robert Kleinberg. Online linear optimization and adaptive routing. Journal of Computer and System Sciences, 74(1):97-114, February 2008. Preliminary version appeared in 36th ACM STOC, 2004.
- (2008) Journal of Computer and System Sciences , vol.74 , Issue.1 , pp. 97-114
- Awerbuch, B.¹ Kleinberg, R.²

7
- 0000768035
- Denumerable-armed bandits
- Jeffrey Banks and Rangarajan Sundaram. Denumerable-armed bandits. Econometrica, 60(5):1071-1096, 1992.
- (1992) Econometrica , vol.60 , Issue.5 , pp. 1071-1096
- Banks, J.¹ Sundaram, R.²

8
- 1242275243
- Über unendliche, lineare Punktmannichfaltigkeiten, 4
- G. Cantor. Über unendliche, lineare Punktmannichfaltigkeiten, 4. Mathematische Annalen, 21:51-58, 1883.
- (1883) Mathematische Annalen , vol.21 , pp. 51-58
- Cantor, G.¹

9
- 0012205195
- Berlin: Teubner, reprinted in; reprint ed., Hildesheim: Olms, 1966
- In G. Cantor, Gesammelte Abhandlungen mathematischen und philosophischen Inhalts, Berlin: Teubner, 1932; reprinted in 1980; reprint ed., Hildesheim: Olms, 1966.
- (1932) Gesammelte Abhandlungen Mathematischen und Philosophischen Inhalts
- Cantor, G.¹

10
- 0031140246
- How to use expert advice
- Nicolò Cesa-Bianchi, Yoav Freund, David Haussler, David P. Helmbold, Robert E. Schapire, and Manfred K. Warmuth. How to use expert advice. J. ACM, 44(3):427-485, 1997.
- (1997) J. ACM , vol.44 , Issue.3 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Haussler, D.³ Helmbold, D.P.⁴ Schapire, R.E.⁵ Warmuth, M.K.⁶

11
- 84926078662
- Cambridge University Press
- Nicolò Cesa-Bianchi and Gábor Lugosi. Prediction, learning, and games. Cambridge University Press, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

12
- 33244464431
- Unpublished manuscript
- Eric Cope. Regret and convergence bounds for immediate-reward reinforcement learning with continuous action spaces, 2004. Unpublished manuscript.
- (2004) Regret and Convergence Bounds for Immediate-reward Reinforcement Learning with Continuous Action Spaces
- Cope, E.¹

13
- 84889281816
- John Wiley & Sons, New York
- Thomas M. Cover and Joy A. Thomas. Elements of Information Theory. John Wiley & Sons, New York, 1991.
- (1991) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

14
- 33244456637
- Robbing the bandit: Less regret in online geometric optimization against an adaptive adversary
- Varsha Dani and Thomas P. Hayes. Robbing the bandit: less regret in online geometric optimization against an adaptive adversary. In 17th ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 937-943, 2006.
- (2006) 17th ACM-SIAM Symp. on Discrete Algorithms (SODA) , pp. 937-943
- Dani, V.¹ Hayes, T.P.²

15
- 70349295143
- The Price of Bandit Information for Online Optimization
- Varsha Dani, Thomas P. Hayes, and Sham Kakade. The Price of Bandit Information for Online Optimization. In 20th Advances in Neural Information Processing Systems (NIPS), 2007.
- (2007) 20th Advances in Neural Information Processing Systems (NIPS)
- Dani, V.¹ Hayes, T.P.² Kakade, S.³

16
- 20744454447
- Online Convex Optimization in the Bandit Setting: Gradient Descent, without a Gradient
- Abraham Flaxman, Adam Kalai, and H. Brendan McMahan. Online Convex Optimization in the Bandit Setting: Gradient Descent, without a Gradient. In 16th ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 385-394, 2005.
- (2005) 16th ACM-SIAM Symp. on Discrete Algorithms (SODA) , pp. 385-394
- Flaxman, A.¹ Kalai, A.² McMahan, H.B.³

17
- 84891584370
- John Wiley & Sons
- J. C. Gittins. Multi-Armed Bandit Allocation Indices. John Wiley & Sons, 1989.
- (1989) Multi-Armed Bandit Allocation Indices
- Gittins, J.C.¹

18
- 0002955623
- A dynamic allocation index for the sequential design of experiments
- J. Gani et al., editor, North-Holland
- J. C. Gittins and D. M. Jones. A dynamic allocation index for the sequential design of experiments. In J. Gani et al., editor, Progress in Statistics, pages 241-266. North-Holland, 1974.
- (1974) Progress in Statistics , pp. 241-266
- Gittins, J.C.¹ Jones, D.M.²

19
- 46749146164
- Approximation algorithms for partial-information based stochastic control with Markovian rewards
- Sudipta Guha and Kamesh Munagala. Approximation algorithms for partial-information based stochastic control with Markovian rewards. In 48th Symp. on Foundations of Computer Science (FOCS), pages 483-493, 2007.
- (2007) 48th Symp. on Foundations of Computer Science (FOCS) , pp. 483-493
- Guha, S.¹ Munagala, K.²

20
- 69449097218
- Approximation algorithms for restless bandit problems
- Sudipta Guha, Kamesh Munagala, and Peng Shi. Approximation algorithms for restless bandit problems. In 20th ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 28-37, 2009.
- (2009) 20th ACM-SIAM Symp. on Discrete Algorithms (SODA) , pp. 28-37
- Guha, S.¹ Munagala, K.² Shi, P.³

21
- 77951694751
- Private communication
- Anupam Gupta, Mike Dinitz, and Kanat Tangwongsan. Private communication, 2007.
- (2007)
- Gupta, A.¹ Dinitz, M.² Tangwongsan, K.³

22
- 70349128132
- Better algorithms for benign bandits
- Elad Hazan and Satyen Kale. Better algorithms for benign bandits. In 20th ACM-SIAM Symp. on Discrete Algorithms (SODA), pages 38-47, 2009.
- (2009) 20th ACM-SIAM Symp. on Discrete Algorithms (SODA) , pp. 38-47
- Hazan, E.¹ Kale, S.²

23
- 38048999685
- Online Learning with Prior Information
- Elad Hazan and Nimrod Megiddo. Online Learning with Prior Information. In 20th Conference on Learning Theory (COLT), pages 499-513, 2007.
- (2007) 20th Conference on Learning Theory (COLT) , pp. 499-513
- Hazan, E.¹ Megiddo, N.²

24
- 35448983517
- Playing Games with Approximation Algorithms
- Sham M. Kakade, Adam T. Kalai, and Katrina Ligett. Playing Games with Approximation Algorithms. In 39th ACM Symp. on Theory of Computing (STOC), 2007.
- (2007) 39th ACM Symp. on Theory of Computing (STOC)
- Kakade, S.M.¹ Kalai, A.T.² Ligett, K.³

25
- 84898981061
- Nearly tight bounds for the continuum-armed bandit problem
- Full version appeared in the author's thesis (MIT, 2005)
- Robert Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In 18th Advances in Neural Information Processing Systems (NIPS), 2004. Full version appeared in the author's thesis (MIT, 2005).
- (2004) 18th Advances in Neural Information Processing Systems (NIPS)
- Kleinberg, R.¹

26
- 33748679987
- PhD thesis, MIT, Boston, MA
- Robert Kleinberg. Online Decision Problems with Large Strategy Sets. PhD thesis, MIT, Boston, MA, 2005.
- (2005) Online Decision Problems with Large Strategy Sets
- Kleinberg, R.¹

27
- 84862291603
- Regret bounds for sleeping experts and bandits
- Robert Kleinberg, Alexandru Niculescu-Mizil, and Yogeshwer Sharma. Regret bounds for sleeping experts and bandits. In 21st Conference on Learning Theory (COLT), pages 425-436, 2008.
- (2008) 21st Conference on Learning Theory (COLT) , pp. 425-436
- Kleinberg, R.¹ Niculescu-Mizil, A.² Sharma, Y.³

28
- 57049185311
- Multi-Armed Bandits in Metric Spaces
- Robert Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Multi-Armed Bandits in Metric Spaces. In 40th ACM Symp. on Theory of Computing (STOC), pages 681-690, 2008.
- (2008) 40th ACM Symp. on Theory of Computing (STOC) , pp. 681-690
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

29
- 0002899547
- Asymptotically efficient Adaptive Allocation Rules
- T.L. Lai and Herbert Robbins. Asymptotically efficient Adaptive Allocation Rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

30
- 0002365425
- Contribution à la topologie des ensembles dénombrables
- S. Mazurkiewicz and W. Sierpinski. Contribution à la topologie des ensembles dénombrables. Fund. Math., 1:17-27, 1920.
- (1920) Fund. Math. , vol.1 , pp. 17-27
- Mazurkiewicz, S.¹ Sierpinski, W.²

31
- 9444257628
- Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary
- H. Brendan McMahan and Avrim Blum. Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary. In 17th Conference on Learning Theory (COLT), pages 109-123, 2004.
- (2004) 17th Conference on Learning Theory (COLT) , pp. 109-123
- McMahan, H.B.¹ Blum, A.²

32
- 70049106076
- Bandits for Taxonomies: A Model-based Approach
- Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, and Vanja Josifovski. Bandits for Taxonomies: A Model-based Approach. In SIAM Intl. Conf. on Data Mining (SDM), 2007.
- (2007) SIAM Intl. Conf. on Data Mining (SDM)
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

33
- 0032047115
- A game of prediction with expert advice
- V. Vovk. A game of prediction with expert advice. J. Computer and System Sciences, 56(2):153-173, 1998.
- (1998) J. Computer and System Sciences , vol.56 , Issue.2 , pp. 153-173
- Vovk, V.¹

34
- 0001043843
- Restless bandits: Activity allocation in a changing world
- P. Whittle. Restless bandits: Activity allocation in a changing world. J. of Appl. Prob., 25A:287-298, 1988.
- (1988) J. of Appl. Prob. , vol.25A , pp. 287-298
- Whittle, P.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.