SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011

Volumn , Issue , 2011, Pages

Multi-armed bandits on implicit metric spaces

(1) Slivkins, Aleksandrs a

a MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ECONOMIC AND SOCIAL EFFECTS; SET THEORY; TREES (MATHEMATICS);

EXPLORATION AND EXPLOITATION; LEARNING TASKS; LIPSCHITZ; METRIC SPACES; MULTIARMED BANDIT PROBLEMS (MABP); MULTIARMED BANDITS (MABS); ON-LINE ALGORITHMS; ONLINE LEARNING; SIMILARITY METRICS; TRADE OFF;

CLASSIFICATION (OF INFORMATION);

EID: 85162320142 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (58)

References (29)

1
- 0345224411
- The continuum-armed bandit problem
- Rajeev Agrawal. The continuum-armed bandit problem. SIAM J. Control and Optimization, 33(6):1926-1951, 1995.
- (1995) SIAM J. Control and Optimization , vol.33 , Issue.6 , pp. 1926-1951
- Agrawal, R.¹

2
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Preliminary version in 15th ICML, 1998
- Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002. Preliminary version in 15th ICML, 1998.
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

3
- 0037709910
- The nonstochastic multiarmed bandit problem
- Preliminary version in 36th IEEE FOCS, 1995
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1):48-77, 2002. Preliminary version in 36th IEEE FOCS, 1995.
- (2002) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

4
- 38049040954
- Improved rates for the stochastic continuum-armed bandit problem
- Peter Auer, Ronald Ortner, and Csaba Szepesvári. Improved Rates for the Stochastic Continuum-Armed Bandit Problem. In 20th COLT, pages 454-468, 2007.
- (2007) 20th COLT , pp. 454-468
- Auer, P.¹ Ortner, R.² Szepesvári, C.³

5
- 35448960376
- Online linear optimization and adaptive routing
- February. Preliminary version in 36th ACM STOC, 2004
- Baruch Awerbuch and Robert Kleinberg. Online linear optimization and adaptive routing. J. of Computer and System Sciences, 74(1):97-114, February 2008. Preliminary version in 36th ACM STOC, 2004.
- (2008) J. of Computer and System Sciences , vol.74 , Issue.1 , pp. 97-114
- Awerbuch, B.¹ Kleinberg, R.²

6
- 36448945038
- A semantic approach to contextual advertising
- Andrei Broder, Marcus Fontoura, Vanja Josifovski, and Lance Riedel. A semantic approach to contextual advertising. In 30th SIGIR, pages 559-566, 2007.
- (2007) 30th SIGIR , pp. 559-566
- Broder, A.¹ Fontoura, M.² Josifovski, V.³ Riedel, L.⁴

7
- 84860634388
- Online optimization in X-armed bandits
- Preliminary version in NIPS 2008
- Sébastien Bubeck, Rémi Munos, Gilles Stoltz, and Csaba Szepesvari. Online Optimization in X-Armed Bandits. J. of Machine Learning Research (JMLR), 12:1587-1627, 2011. Preliminary version in NIPS 2008.
- (2011) J. of Machine Learning Research (JMLR) , vol.12 , pp. 1587-1627
- Bubeck, S.¹ Munos, R.² Stoltz, G.³ Szepesvari, C.⁴

8
- 84926078662
- Cambridge Univ. Press
- Nicolò Cesa-Bianchi and Gábor Lugosi. Prediction, learning, and games. Cambridge Univ. Press, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

9
- 67649577204
- Regret and convergence bounds for immediate-reward reinforcement learning with continuous action spaces
- A manuscript from 2004
- Eric Cope. Regret and convergence bounds for immediate-reward reinforcement learning with continuous action spaces. IEEE Trans. on Automatic Control, 54(6):1243-1253, 2009. A manuscript from 2004.
- (2009) IEEE Trans. on Automatic Control , vol.54 , Issue.6 , pp. 1243-1253
- Cope, E.¹

10
- 33244456637
- Robbing the bandit: Less regret in online geometric optimization against an adaptive adversary
- Varsha Dani and Thomas P. Hayes. Robbing the bandit: less regret in online geometric optimization against an adaptive adversary. In 17th ACM-SIAM SODA, pages 937-943, 2006.
- (2006) 17th ACM-SIAM SODA , pp. 937-943
- Dani, V.¹ Hayes, T.P.²

11
- 70349295143
- The price of bandit information for online optimization
- Varsha Dani, Thomas P. Hayes, and Sham Kakade. The Price of Bandit Information for Online Optimization. In 20th NIPS, 2007.
- (2007) 20th NIPS
- Dani, V.¹ Hayes, T.P.² Kakade, S.³

12
- 20744454447
- Online convex optimization in the bandit setting: Gradient descent without a gradient
- Abraham Flaxman, Adam Kalai, and H. Brendan McMahan. Online Convex Optimization in the Bandit Setting: Gradient Descent without a Gradient. In 16th ACM-SIAM SODA, pages 385-394, 2005.
- (2005) 16th ACM-SIAM SODA , pp. 385-394
- Flaxman, A.¹ Kalai, A.² McMahan, H.B.³

13
- 77958578450
- Combining online and offline knowledge in UCT
- Sylvain Gelly and David Silver. Combining online and offline knowledge in UCT. In 24th ICML, 2007.
- (2007) 24th ICML
- Gelly, S.¹ Silver, D.²

14
- 70349295261
- Achieving master level play in 9x9 computer go
- Sylvain Gelly and David Silver. Achieving master level play in 9x9 computer go. In 23rd AAAI, 2008.
- (2008) 23rd AAAI
- Gelly, S.¹ Silver, D.²

15
- 0344550482
- Bounded geometries, fractals, and low- distortion embeddings
- Anupam Gupta, Robert Krauthgamer, and James R. Lee. Bounded geometries, fractals, and low- distortion embeddings. In 44th IEEE FOCS, pages 534-543, 2003.
- (2003) 44th IEEE FOCS , pp. 534-543
- Gupta, A.¹ Krauthgamer, R.² Lee, J.R.³

16
- 35448983517
- Playing games with approximation algorithms
- Sham M. Kakade, Adam T. Kalai, and Katrina Ligett. Playing Games with Approximation Algorithms. In 39th ACM STOC, 2007.
- (2007) 39th ACM STOC
- Kakade, S.M.¹ Kalai, A.T.² Ligett, K.³

17
- 38049011420
- Nearly tight bounds for the continuum-armed bandit problem
- Robert Kleinberg. Nearly tight bounds for the continuum-armed bandit problem. In 18th NIPS, 2004.
- (2004) 18th NIPS
- Kleinberg, R.¹

18
- 33748679987
- PhD thesis, MIT
- Robert Kleinberg. Online Decision Problems with Large Strategy Sets. PhD thesis, MIT, 2005.
- (2005) Online Decision Problems with Large Strategy Sets
- Kleinberg, R.¹

19
- 77951694424
- Sharp dichotomies for regret minimization in metric spaces
- Robert Kleinberg and Aleksandrs Slivkins. Sharp Dichotomies for Regret Minimization in Metric Spaces. In 21st ACM-SIAM SODA, 2010.
- (2010) 21st ACM-SIAM SODA
- Kleinberg, R.¹ Slivkins, A.²

20
- 57049185311
- Multi-armed bandits in metric spaces
- Robert Kleinberg, Aleksandrs Slivkins, and Eli Upfal. Multi-Armed Bandits in Metric Spaces. In 40th ACM STOC, pages 681-690, 2008.
- (2008) 40th ACM STOC , pp. 681-690
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

21
- 33750293964
- Bandit based monte-carlo planning
- Levente Kocsis and Csaba Szepesvari. Bandit Based Monte-Carlo Planning. In 17th ECML, pages 282-293, 2006.
- (2006) 17th ECML , pp. 282-293
- Kocsis, L.¹ Szepesvari, C.²

22
- 0002899547
- Asymptotically efficient adaptive allocation rules
- T.L. Lai and Herbert Robbins. Asymptotically efficient Adaptive Allocation Rules. Advances in Applied Mathematics, 6:4-22, 1985.
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.L.¹ Robbins, H.²

23
- 9444257628
- Online geometric optimization in the bandit setting against an adaptive adversary
- H. Brendan McMahan and Avrim Blum. Online Geometric Optimization in the Bandit Setting Against an Adaptive Adversary. In 17th COLT, pages 109-123, 2004.
- (2004) 17th COLT , pp. 109-123
- McMahan, H.B.¹ Blum, A.²

24
- 84860618045
- Bandit algorithms for tree search
- Rémi Munos and Pierre-Arnaud Coquelin. Bandit algorithms for tree search. In 23rd UAI, 2007.
- (2007) 23rd UAI
- Munos, R.¹ Coquelin, P.-A.²

25
- 70049106076
- Bandits for taxonomies: A model-based approach
- Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, and Vanja Josifovski. Bandits for Taxonomies: A Model-based Approach. In SDM, 2007.
- (2007) SDM
- Pandey, S.¹ Agarwal, D.² Chakrabarti, D.³ Josifovski, V.⁴

26
- 70350700875
- Multi-armed bandit problems with dependent arms
- Sandeep Pandey, Deepayan Chakrabarti, and Deepak Agarwal. Multi-armed Bandit Problems with Dependent Arms. In 24th ICML, 2007.
- (2007) 24th ICML
- Pandey, S.¹ Chakrabarti, D.² Agarwal, D.³

27
- 77954582369
- Classification-enhanced ranking
- Susan T. Dumais Paul N. Bennett, Krysta Marie Svore. Classification- enhanced ranking. In 19th WWW,pages 111-120, 2010.
- (2010) 19th WWW , pp. 111-120
- Dumais, S.T.¹ Bennett, P.N.² Svore, K.M.³

28
- 56449088596
- Learning diverse rankings with multi-armed bandits
- Filip Radlinski, Robert Kleinberg, and Thorsten Joachims. Learning diverse rankings with multi-armed bandits. In 25th ICML, pages 784-791, 2008.
- (2008) 25th ICML , pp. 784-791
- Radlinski, F.¹ Kleinberg, R.² Joachims, T.³

29
- 77956542736
- Learning optimally diverse rankings over large document collections
- Aleksandrs Slivkins, Filip Radlinski, and Sreenivas Gollapudi. Learning optimally diverse rankings over large document collections. In 27th ICML, pages 983-990, 2010.
- (2010) 27th ICML , pp. 983-990
- Slivkins, A.¹ Radlinski, F.² Gollapudi, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.