



Volume , Issue , 2011, Pages 41-48

Unimodal bandits

Author keywords

[No Author keywords available]

Indexed keywords

GRAPH STRUCTURES; MAXIMAL DEGREE; MULTI-ARMED BANDIT PROBLEM; UNIMODAL; UNIMODALITY

EID: 80053457608     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (76)

References (25)
  • 1
    • 0345224411
    • The continuum-armed bandit problem
    • Agrawal, R. The continuum-armed bandit problem. SIAM J. Control Optim., 33:1926-1951, 1995.
    • (1995) SIAM J. Control Optim. , vol.33 , pp. 1926-1951
    • Agrawal, R.1
  • 2
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • DOI 10.1023/A:1013689704352, Computational Learning Theory
    • Auer, P., Cesa-Bianchi, N., and Fischer, P. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, 2002a. (Pubitemid 34126111)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 4
    • 80053457129
    • Improved rates for the stochastic continuum-armed bandit problem
    • Auer, P., Ortner, R., and Szepesvári, C. Improved rates for the stochastic continuum-armed bandit problem. In Proc. COLT, 2007.
    • Proc. COLT, 2007
    • Auer, P.1    Ortner, R.2    Szepesvári, C.3
  • 11
    • 67649577204
    • Regret and convergence bounds for a class of continuum-armed bandit problems
    • Cope, E. W. Regret and convergence bounds for a class of continuum-armed bandit problems. IEEE Trans. Automat. Control, 54(6):1243-1253, 2009.
    • (2009) IEEE Trans. Automat. Control , vol.54 , Issue.6 , pp. 1243-1253
    • Cope, E.W.1
  • 12
    • 9444277556
    • PAC bounds for multi-armed bandit and Markov decision processes
    • Even-Dar, E., Mannor, S., and Mansour, Y. PAC bounds for multi-armed bandit and Markov decision processes. In Proc. COLT, 2002.
    • Proc. COLT, 2002
    • Even-Dar, E.1    Mannor, S.2    Mansour, Y.3
  • 13
    • 80053443906
    • On upper-confidence bound policies for non-stationary bandit problems
    • Garivier, A. and Moulines, E. On upper-confidence bound policies for non-stationary bandit problems. In Proc. EWRL, 2008.
    • Proc. EWRL, 2008
    • Garivier, A.1    Moulines, E.2
  • 15
    • 84968496713
    • Sequential minimax search for a maximum
    • Kiefer, J. Sequential minimax search for a maximum. Proc. Amer. Math. Soc., 4(3):502-506, 1953.
    • (1953) Proc. Amer. Math. Soc. , vol.4 , Issue.3 , pp. 502-506
    • Kiefer, J.1
  • 16
    • 84898981061
    • Nearly tight bounds for the continuum-armed bandit problem
    • Kleinberg, R. Nearly tight bounds for the continuum-armed bandit problem. In Proc. NIPS, 2004.
    • Proc. NIPS, 2004
    • Kleinberg, R.1
  • 18
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • Lai, T. L. and Robbins, H. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 23
    • 73849094380
    • Online ad auctions
    • Varian, H. R. Online ad auctions. American Economic Review, 99(2):430-434, 2009.
    • (2009) American Economic Review , vol.99 , Issue.2 , pp. 430-434
    • Varian, H.R.1
  • 25
    • 71149111013
    • Piecewise-stationary bandit problems with side observations
    • Yu, J. Y. and Mannor, S. Piecewise-stationary bandit problems with side observations. In Proc. ICML, 2009.
    • Proc. ICML, 2009
    • Yu, J.Y.1    Mannor, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.