SCOPUS 정보 검색 플랫폼

SIAM Journal on Optimization

Volumn 23, Issue 1, 2013, Pages 213-240

Stochastic convex optimization with bandit feedback

(5) Agarwal, Alekh a Foster, Dean P b Hsu, Daniel c Kakade, Sham M c Rakhlin, Alexander b

a MICROSOFT RESEARCH (United States)

b UNIVERSITY OF PENNSYLVANIA (United States)

c MICROSOFT RESEARCH (United Kingdom)

Author keywords

Bandit optimization; Derivative free optimization; Ellipsoid method

Indexed keywords

BANDIT FEEDBACKS; DERIVATIVE-FREE OPTIMIZATION; ELLIPSOID ALGORITHM; ELLIPSOID METHOD; FUNCTION VALUES; LIPSCHITZ FUNCTIONS; OPTIMAL FUNCTION; QUANTITY OF INTEREST;

CONVEX OPTIMIZATION; OPTIMIZATION; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

ALGORITHMS;

EID: 84877750537 PISSN: 10526234 EISSN: None Source Type: Journal
DOI: 10.1137/110850827 Document Type: Article

Times cited : (73)

References (20)

1
- 84860610530
- Optimal algorithms for online convex optimization with multi-point bandit feedback
- A. Agarwal, O. Dekel, and L. Xiao, Optimal algorithms for online convex optimization with multi-point bandit feedback, in Proceedings of COLT, 2010.
- (2010) Proceedings of COLT
- Agarwal, A.¹ Dekel, O.² Xiao, L.³

2
- 0345224411
- The continuum-armed bandit problem
- R. Agrawal, The continuum-armed bandit problem, SIAM J. Control Optim., 33 (1995), pp. 1926-1951.
- (1995) SIAM J. Control Optim. , vol.33 , pp. 1926-1951
- Agrawal, R.¹

3
- 38049040954
- Improved rates for the stochastic continuum-armed bandit problem
- P. Auer, R. Ortner, and C. Szepesvári, Improved rates for the stochastic continuum-armed bandit problem, in Proceedings of COLT, 2007, pp. 454-468.
- (2007) Proceedings of COLT , pp. 454-468
- Auer, P.¹ Ortner, R.² Szepesvári, C.³

4
- 4243173687
- Solving Convexprograms by random walks
- D. Bertsimas and S. Vempala, Solving convexprograms by random walks, J. ACM, 51 (2004), pp. 540-556.
- (2004) J. ACM , vol.51 , pp. 540-556
- Bertsimas, D.¹ Vempala, S.²

5
- 79960128338
- X-armed bandits
- S. Bubeck, R. Munos, G. Stolz, and C. Szepesvári, X-armed bandits, J. Mach. Learn. Res., 12 (2011), pp. 1655-1695.
- (2011) J. Mach. Learn. Res. , vol.12 , pp. 1655-1695
- Bubeck, S.¹ Munos, R.² Stolz, G.³ Szepesvári, C.⁴

6
- 0007298971
- Sub-Gaussian random variables
- V. V. Buldygin and Yu. V. Kozachenko, Sub-Gaussian random variables, Ukrainian Math. J., 32 (1980), pp. 483-489.
- (1980) Ukrainian Math. J. , vol.32 , pp. 483-489
- Buldygin, V.V.¹ Kozachenko, Y.V.²

7
- 67650355939
- SIAM, Philadelphia
- A. R. Conn, K. Scheinberg, and L. N. Vicente, Introduction to Derivative-Free Optimization, SIAM, Philadelphia, 2009.
- (2009) Introduction to Derivative-Free Optimization
- Conn, A.R.¹ Scheinberg, K.² Vicente, L.N.³

8
- 67649577204
- Regret and convergence bounds for a class of continuum-armed bandit problems
- E. W. Cope, Regret and convergence bounds for a class of continuum-armed bandit problems, IEEE Trans. Automat. Control, 54 (2009), pp. 1243-1253.
- (2009) IEEE Trans. Automat. Control , vol.54 , pp. 1243-1253
- Cope, E.W.¹

9
- 84898072179
- Stochastic linear optimization under bandit feedback
- V. Dani, T. P. Hayes, and S. M. Kakade, Stochastic linear optimization under bandit feedback, in Proceedings of the 21st Annual Conference on Learning Theory (COLT), 2008.
- (2008) Proceedings of the 21st Annual Conference on Learning Theory (COLT)
- Dani, V.¹ Hayes, T.P.² Kakade, S.M.³

10
- 20744454447
- Online convex optimization in the bandit setting: Gradient descent without a gradient
- A. D. Flaxman, A. T. Kalai, and B. H. Mcmahan, Online convex optimization in the bandit setting: Gradient descent without a gradient, in Proceedings of the 16th Annual ACM- SIAM Symposium on Discrete Algorithms, 2005, pp. 385-394.
- (2005) Proceedings of the 16th Annual ACM-SIAM Symposium on Discrete Algorithms , pp. 385-394
- Flaxman, A.D.¹ Kalai, A.T.² McMahan, B.H.³

11
- 0020132663
- Modifications and implementation of the ellipsoid algorithm for linear programming
- D. Goldfarb and M. J. Todd, Modifications and implementation of the ellipsoid algorithm for linear programming, Math. Program., 23 (1982), pp. 1-19.
- (1982) Math. Program. , vol.23 , pp. 1-19
- Goldfarb, D.¹ Todd, M.J.²

12
- 0001079593
- Stochastic estimation of the maximum of a regression function
- J. Kiefer and J. Wolfowitz, Stochastic estimation of the maximum of a regression function, Ann. Math. Statist., 23 (1952), pp. 462-466.
- (1952) Ann. Math. Statist. , vol.23 , pp. 462-466
- Kiefer, J.¹ Wolfowitz, J.²

13
- 84898981061
- Nearly Tight Bounds for the continuum-armed bandit problem
- R. Kleinberg, Nearly tight bounds for the continuum-armed bandit problem, Adv. Neural Inf. Process. Syst., 18 (2005).
- (2005) Adv. Neural Inf. Process. Syst. , vol.18
- Kleinberg, R.¹

14
- 57049185311
- Multi-armed bandits in metric spaces
- R. Kleinberg, A. Slivkins, and E. Upfal, Multi-armed bandits in metric spaces, in Proceedings of the 40th Annual ACM Symposium on Theory of Computing, 2008, pp. 681-690.
- (2008) Proceedings of the 40th Annual ACM Symposium on Theory of Computing , pp. 681-690
- Kleinberg, R.¹ Slivkins, A.² Upfal, E.³

15
- 24244479537
- Geometric algorithms and algorithmic geometry
- L. Lovász, Geometric algorithms and algorithmic geometry, in Proceedings of International Congress of Mathematicians, 1990, pp. 139-154.
- (1990) Proceedings of International Congress of Mathematicians , pp. 139-154
- Lovász, L.¹

16
- 0003692801
- Wiley, New York
- A. NEMiROVSKI AND D. YuDiN, Problem Complexity and Method Efficiency in Optimization, Wiley, New York, 1983.
- (1983) Problem Complexity and Method Efficiency in Optimization
- Nemirovski, A.¹ Yudin, D.²

17
- 84860610528
- Center for Operations Research and Econometrics, Universit́e catholique de Lou- vain
- Y. NESTEROV, Random Gradient-Free Minimization of Convex Functions, Technical report 2011/1, Center for Operations Research and Econometrics, Universit́e catholique de Lou- vain, 2011.
- (2011) Random Gradient-Free Minimization of Convex Functions, Technical Report 2011/1
- Nesterov, Y.¹

18
- 80053997013
- Information-based complexity, feedback and dynamics in convex programming
- M. RAGiNSKY AND A. RAKHLiN, Information-based complexity, feedback and dynamics in convex programming, IEEE Trans. Inform. Theory, 57 (2011), pp. 7036-7056.
- (2011) IEEE Trans. Inform. Theory , vol.57 , pp. 7036-7056
- Raginsky, M.¹ Rakhlin, A.²

19
- 77956522732
- arXiv: 0912.3995
- N. SRiNIVAS, A. KRÄuSE, S.M. KÄKADE, ÄND M. SEeGER, Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design, arXiv:0912.3995, 2009.
- (2009) Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design
- Srinivas, N.¹ Kräuse, A.² Käkade, S.M.³ Seeger, M.⁴

20
- 80053457608
- Unimodal bandits
- J. Y. Yu AND S. MANNOR, Unimodal bandits, in Proceedings of ICML, 2011.
- (2011) Proceedings of ICML
- Yu, J.Y.¹ Mannor, S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.