SCOPUS Information Retrieval Platform

21st Annual Conference on Learning Theory, COLT 2008

2008, Pages 263-273

Competing in the dark: An efficient algorithm for bandit linear optimization

Authors (3): Abernethy, Jacob (a); Hazan, Elad (b); Rakhlin, Alexander (a)

(a) UNIVERSITY OF CALIFORNIA (United States)

(b) IBM ALMADEN RESEARCH CENTER (United States)

Author keywords

[No Author keywords available]

Indexed keywords

INTERIOR POINT METHODS; LINEAR OPTIMIZATION; MULTI-ARMED BANDIT PROBLEM; NATURAL GENERALIZATION; ONLINE LEARNING; ONLINE LINEAR OPTIMIZATIONS; OPTIMAL ALGORITHM; POTENTIAL FUNCTION;

ALGORITHMS;

OPTIMIZATION;

EID: 84898063697; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited: 322

References (21)

1
- 0037709910
- The nonstochastic multiarmed bandit problem
- Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32(1): 48-77, 2003.
- (2003) SIAM J. Comput. , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

2
- 4544345025
- Adaptive routing with end-to-end feedback: Distributed learning and geometric approaches
- New York, NY, USA ACM
- Baruch Awerbuch and Robert D. Kleinberg. Adaptive routing with end-to-end feedback: distributed learning and geometric approaches. In STOC '04: Proceedings of the thirty-sixth annual ACM symposium on Theory of computing, pages 45-53, New York, NY, USA, 2004. ACM.
- (2004) STOC '04: Proceedings of the Thirty-sixth Annual ACM Symposium on Theory of Computing , pp. 45-53
- Awerbuch, B.¹ Kleinberg, R.D.²

3
- 84898060591
- High-probability bounds for the regret of bandit online linear optimization
- P. Bartlett, V. Dani, T. Hayes, S. Kakade, A. Rakhlin, and A. Tewari. High-probability bounds for the regret of bandit online linear optimization, 2008. In submission to COLT 2008.
- (2008) In submission to COLT 2008
- Bartlett, P.¹ Dani, V.² Hayes, T.³ Kakade, S.⁴ Rakhlin, A.⁵ Tewari, A.⁶

4
- 0003878614
- SIAM, Philadelphia
- A. Ben-Tal and A. Nemirovski. Lectures on Modern Convex Optimization: Analysis, Algorithms, and Engineering Applications, volume 2 of MPS/SIAM Series on Optimization. SIAM, Philadelphia, 2001.
- (2001) Lectures on Modern Convex Optimization: Analysis, Algorithms, and Engineering Applications, Volume 2 of MPS/SIAM Series on Optimization
- Ben-Tal, A.¹ Nemirovski, A.²

5
- 84926078662
- Cambridge University Press
- Nicolò Cesa-Bianchi and Gábor Lugosi. Prediction, Learning, and Games. Cambridge University Press, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

6
- 85162050055
- The price of bandit information for online optimization
- J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors MIT Press, Cambridge, MA
- Varsha Dani, Thomas Hayes, and Sham Kakade. The price of bandit information for online optimization. In J.C. Platt, D. Koller, Y. Singer, and S. Roweis, editors, Advances in Neural Information Processing Systems 20. MIT Press, Cambridge, MA, 2008.
- (2008) Advances in Neural Information Processing Systems , vol.20
- Dani, V.¹ Hayes, T.² Kakade, S.³

7
- 33244456637
- Robbing the bandit: Less regret in online geometric optimization against an adaptive adversary
- New York, NY, USA ACM
- Varsha Dani and Thomas P. Hayes. Robbing the bandit: less regret in online geometric optimization against an adaptive adversary. In SODA '06: Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithms, pages 937-943, New York, NY, USA, 2006. ACM.
- (2006) SODA '06: Proceedings of the Seventeenth Annual ACM-SIAM Symposium on Discrete Algorithms , pp. 937-943
- Dani, V.¹ Hayes, T.P.²

8
- 84898069824
- Correction to 'universal prediction of individual sequences' (jul 92 1258-1270)
- Meir Feder, Neri Merhav, and Michael Gutman. Correction to 'universal prediction of individual sequences' (jul 92 1258-1270). IEEE Transactions on Information Theory, 40(1): 285, 1994.
- (1994) IEEE Transactions on Information Theory , vol.40 , Issue.1 , pp. 285
- Feder, M.¹ Merhav, N.² Gutman, M.³

9
- 20744454447
- Online convex optimization in the bandit setting: Gradient descent without a gradient
- Philadelphia, PA, USA Society for Industrial and Applied Mathematics
- Abraham D. Flaxman, Adam Tauman Kalai, and H. Brendan McMahan. Online convex optimization in the bandit setting: gradient descent without a gradient. In SODA '05: Proceedings of the sixteenth annual ACM-SIAM symposium on Discrete algorithms, pages 385-394, Philadelphia, PA, USA, 2005. Society for Industrial and Applied Mathematics.
- (2005) SODA '05: Proceedings of the Sixteenth Annual ACM-SIAM Symposium on Discrete Algorithms , pp. 385-394
- Flaxman, A.D.¹ Kalai, A.T.² McMahan, H.B.³

10
- 0033336069
- Relative loss bounds for single neurons
- November
- D. P. Helmbold, J. Kivinen, and M. K. Warmuth. Relative loss bounds for single neurons. IEEE Transactions on Neural Networks, 10(6): 1291-1304, November 1999.
- (1999) IEEE Transactions on Neural Networks , vol.10 , Issue.6 , pp. 1291-1304
- Helmbold, D.P.¹ Kivinen, J.² Warmuth, M.K.³

11
- 24644463787
- Efficient algorithms for online decision problems
- Adam Kalai and Santosh Vempala. Efficient algorithms for online decision problems. Journal of Computer and System Sciences, 71(3): 291-307, 2005.
- (2005) Journal of Computer and System Sciences , vol.71 , Issue.3 , pp. 291-307
- Kalai, A.¹ Vempala, S.²

12
- 0008815681
- Exponentiated gradient versus gradient descent for linear predictors
- Jyrki Kivinen and Manfred K. Warmuth. Exponentiated gradient versus gradient descent for linear predictors. Inf. Comput., 132(1): 1-63, 1997.
- (1997) Inf. Comput. , vol.132 , Issue.1 , pp. 1-63
- Kivinen, J.¹ Warmuth, M.K.²

13
- 35148838877
- The weighted majority algorithm
- Nick Littlestone and Manfred K. Warmuth. The weighted majority algorithm. Information and Computation, 108(2): 212-261, 1994.
- (1994) Information and Computation , vol.108 , Issue.2 , pp. 212-261
- Littlestone, N.¹ Warmuth, M.K.²

14
- 9444257628
- Online geometric optimization in the bandit setting against an adaptive adversary
- H. Brendan McMahan and Avrim Blum. Online geometric optimization in the bandit setting against an adaptive adversary. In COLT, pages 109-123, 2004.
- (2004) COLT , pp. 109-123
- McMahan, H.B.¹ Blum, A.²

15
- 62549145135
- Lecture Notes
- A.S. Nemirovskii. Interior point polynomial time methods in convex programming, 2004. Lecture Notes.
- (2004) Interior Point Polynomial Time Methods in Convex Programming
- Nemirovskii, A.S.¹

16
- 0003254250
- Interior point polynomial algorithms in convex programming
- Philadelphia
- Y. E. Nesterov and A. S. Nemirovskii. Interior Point Polynomial Algorithms in Convex Programming. SIAM, Philadelphia, 1994.
- (1994) SIAM
- Nesterov, Y.E.¹ Nemirovskii, A.S.²

17
- 84898065361
- graduate algorithms
- Satish Rao. Lecture notes: CS 270, graduate algorithms. 2006.
- (2006) Lecture Notes: CS 270
- Rao, S.¹

18
- 84966203785
- Some aspects of the sequential design of experiments
- Herbert Robbins. Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc., 58(5): 527-535, 1952.
- (1952) Bull. Amer. Math. Soc. , vol.58 , Issue.5 , pp. 527-535
- Robbins, H.¹

19
- 35348915372
- A primal-dual perspective of online learning algorithms
- Shai Shalev-Shwartz and Yoram Singer. A primal-dual perspective of online learning algorithms. Mach. Learn., 69(2-3): 115-142, 2007.
- (2007) Mach. Learn. , vol.69 , Issue.2-3 , pp. 115-142
- Shalev-Shwartz, S.¹ Singer, Y.²

20
- 3142657664
- Path kernels and multiplicative updates
- Eiji Takimoto and Manfred K. Warmuth. Path kernels and multiplicative updates. J. Mach. Learn. Res., 4: 773-818, 2003.
- (2003) J. Mach. Learn. Res. , vol.4 , pp. 773-818
- Takimoto, E.¹ Warmuth, M.K.²

21
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- Martin Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML, pages 928-936, 2003.
- (2003) ICML , pp. 928-936
- Zinkevich, M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.