메뉴 건너뛰기




Volumn 23, Issue 1, 2013, Pages 213-240

Stochastic convex optimization with bandit feedback

Author keywords

Bandit optimization; Derivative free optimization; Ellipsoid method

Indexed keywords

BANDIT FEEDBACKS; DERIVATIVE-FREE OPTIMIZATION; ELLIPSOID ALGORITHM; ELLIPSOID METHOD; FUNCTION VALUES; LIPSCHITZ FUNCTIONS; OPTIMAL FUNCTION; QUANTITY OF INTEREST;

EID: 84877750537     PISSN: 10526234     EISSN: None     Source Type: Journal    
DOI: 10.1137/110850827     Document Type: Article
Times cited : (73)

References (20)
  • 1
    • 84860610530 scopus 로고    scopus 로고
    • Optimal algorithms for online convex optimization with multi-point bandit feedback
    • A. Agarwal, O. Dekel, and L. Xiao, Optimal algorithms for online convex optimization with multi-point bandit feedback, in Proceedings of COLT, 2010.
    • (2010) Proceedings of COLT
    • Agarwal, A.1    Dekel, O.2    Xiao, L.3
  • 2
    • 0345224411 scopus 로고
    • The continuum-armed bandit problem
    • R. Agrawal, The continuum-armed bandit problem, SIAM J. Control Optim., 33 (1995), pp. 1926-1951.
    • (1995) SIAM J. Control Optim. , vol.33 , pp. 1926-1951
    • Agrawal, R.1
  • 3
    • 38049040954 scopus 로고    scopus 로고
    • Improved rates for the stochastic continuum-armed bandit problem
    • P. Auer, R. Ortner, and C. Szepesvári, Improved rates for the stochastic continuum-armed bandit problem, in Proceedings of COLT, 2007, pp. 454-468.
    • (2007) Proceedings of COLT , pp. 454-468
    • Auer, P.1    Ortner, R.2    Szepesvári, C.3
  • 4
    • 4243173687 scopus 로고    scopus 로고
    • Solving Convexprograms by random walks
    • D. Bertsimas and S. Vempala, Solving convexprograms by random walks, J. ACM, 51 (2004), pp. 540-556.
    • (2004) J. ACM , vol.51 , pp. 540-556
    • Bertsimas, D.1    Vempala, S.2
  • 8
    • 67649577204 scopus 로고    scopus 로고
    • Regret and convergence bounds for a class of continuum-armed bandit problems
    • E. W. Cope, Regret and convergence bounds for a class of continuum-armed bandit problems, IEEE Trans. Automat. Control, 54 (2009), pp. 1243-1253.
    • (2009) IEEE Trans. Automat. Control , vol.54 , pp. 1243-1253
    • Cope, E.W.1
  • 11
    • 0020132663 scopus 로고
    • Modifications and implementation of the ellipsoid algorithm for linear programming
    • D. Goldfarb and M. J. Todd, Modifications and implementation of the ellipsoid algorithm for linear programming, Math. Program., 23 (1982), pp. 1-19.
    • (1982) Math. Program. , vol.23 , pp. 1-19
    • Goldfarb, D.1    Todd, M.J.2
  • 12
    • 0001079593 scopus 로고
    • Stochastic estimation of the maximum of a regression function
    • J. Kiefer and J. Wolfowitz, Stochastic estimation of the maximum of a regression function, Ann. Math. Statist., 23 (1952), pp. 462-466.
    • (1952) Ann. Math. Statist. , vol.23 , pp. 462-466
    • Kiefer, J.1    Wolfowitz, J.2
  • 13
    • 84898981061 scopus 로고    scopus 로고
    • Nearly Tight Bounds for the continuum-armed bandit problem
    • R. Kleinberg, Nearly tight bounds for the continuum-armed bandit problem, Adv. Neural Inf. Process. Syst., 18 (2005).
    • (2005) Adv. Neural Inf. Process. Syst. , vol.18
    • Kleinberg, R.1
  • 18
    • 80053997013 scopus 로고    scopus 로고
    • Information-based complexity, feedback and dynamics in convex programming
    • M. RAGiNSKY AND A. RAKHLiN, Information-based complexity, feedback and dynamics in convex programming, IEEE Trans. Inform. Theory, 57 (2011), pp. 7036-7056.
    • (2011) IEEE Trans. Inform. Theory , vol.57 , pp. 7036-7056
    • Raginsky, M.1    Rakhlin, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.