메뉴 건너뛰기




Volumn 6925 LNAI, Issue , 2011, Pages 144-158

Lipschitz bandits without the Lipschitz constant

Author keywords

[No Author keywords available]

Indexed keywords

BANDIT PROBLEMS; LIPSCHITZ; LIPSCHITZ CONSTANT; MINIMAX; ORDERS OF MAGNITUDE; PERFORMANCE GUARANTEES; TIME INSTANCES; TUNING PARAMETER;

EID: 80054092590     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-24412-4_14     Document Type: Conference Paper
Times cited : (82)

References (18)
  • 1
    • 78649420293 scopus 로고    scopus 로고
    • Regret bounds and minimax policies under partial monitoring
    • Audibert, J.-Y., Bubeck, S.: Regret bounds and minimax policies under partial monitoring. Journal of Machine Learning Research 11, 2635-2686 (2010)
    • (2010) Journal of Machine Learning Research , vol.11 , pp. 2635-2686
    • Audibert, J.-Y.1    Bubeck, S.2
  • 3
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning Journal 47(2-3), 235-256 (2002)
    • (2002) Machine Learning Journal , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 6
    • 38049040954 scopus 로고    scopus 로고
    • Improved rates for the stochastic continuum-armed bandit problem
    • Bshouty, N.H., Gentile, C. (eds.) COLT. Springer, Heidelberg
    • Auer, P., Ortner, R., Szepesvári, C.: Improved rates for the stochastic continuum-armed bandit problem. In: Bshouty, N.H., Gentile, C. (eds.) COLT. LNCS (LNAI), vol. 4539, pp. 454-468. Springer, Heidelberg (2007)
    • (2007) LNCS (LNAI) , vol.4539 , pp. 454-468
    • Auer, P.1    Ortner, R.2    Szepesvári, C.3
  • 10
    • 67649577204 scopus 로고    scopus 로고
    • Regret and convergence bounds for immediate-reward reinforcement learning with continuous action spaces
    • Cope, E.: Regret and convergence bounds for immediate-reward reinforcement learning with continuous action spaces. IEEE Transactions on Automatic Control 54(6), 1243-1253 (2009)
    • (2009) IEEE Transactions on Automatic Control , vol.54 , Issue.6 , pp. 1243-1253
    • Cope, E.1
  • 12
    • 30344439147 scopus 로고    scopus 로고
    • Optimal algorithms for global optimization in case of unknown Lipschitz constant
    • Horn, M.: Optimal algorithms for global optimization in case of unknown Lipschitz constant. Journal of Complexity 22(1) (2006)
    • (2006) Journal of Complexity , vol.22 , Issue.1
    • Horn, M.1
  • 16
    • 84966203785 scopus 로고
    • Some aspects of the sequential design of experiments
    • Robbins, H.: Some aspects of the sequential design of experiments. Bulletin of the American Mathematics Society 58, 527-535 (1952)
    • (1952) Bulletin of the American Mathematics Society , vol.58 , pp. 527-535
    • Robbins, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.