Volume 80, Issue 2-3, 2010, Pages 245-272

Regret bounds for sleeping experts and bandits

Author keywords

Computational learning theory; Online algorithms; Regret

Indexed keywords

COMPUTATIONAL LEARNING THEORY; DECISION ALGORITHMS; MULTI ARMED BANDIT; ON-LINE ALGORITHMS; ONLINE DECISIONS; OPTIMAL REGRET; PRACTICAL PROBLEMS; REGRET;

EID: 77955660815     PISSN: 08856125     EISSN: 15730565     Source Type: Journal    
DOI: 10.1007/s10994-010-5178-7     Document Type: Article
Times cited: 179

References (23)
  • 2
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • DOI 10.1023/A:1013689704352, Computational Learning Theory
    • P. Auer N. Cesa-Bianchi P. Fischer 2002 Finite-time analysis of the multiarmed bandit problem Machine Learning 47 235 256 1012.68093 10.1023/A:1013689704352 (Pubitemid 34126111)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 3
    • 0037709910
    • The nonstochastic multiarmed bandit problem
    • 1029.68087 10.1137/S0097539701398375 1954855
    • P. Auer N. Cesa-Bianchi Y. Freund R. E. Schapire 2002 The nonstochastic multiarmed bandit problem SIAM Journal on Computing 32 48 77 1029.68087 10.1137/S0097539701398375 1954855
    • (2002) SIAM Journal on Computing , vol.32 , pp. 48-77
    • Auer, P.1    Cesa-Bianchi, N.2    Freund, Y.3    Schapire, R.E.4
  • 4
    • 84972574511
    • Weighted sums of certain dependent random variables
    • 0178.21103 10.2748/tmj/1178243286 221571
    • K. Azuma 1967 Weighted sums of certain dependent random variables Tohoku Mathematical Journal 19 357 367 0178.21103 10.2748/tmj/1178243286 221571
    • (1967) Tohoku Mathematical Journal , vol.19 , pp. 357-367
    • Azuma, K.1
  • 8
    • 0002267135
    • Adaptive game playing using multiplicative weights
    • 0964.91007 10.1006/game.1999.0738 1729311
    • Y. Freund R. E. Schapire 1999 Adaptive game playing using multiplicative weights Games and Economic Behavior 29 79 103 0964.91007 10.1006/game.1999.0738 1729311
    • (1999) Games and Economic Behavior , vol.29 , pp. 79-103
    • Freund, Y.1    Schapire, R.E.2
  • 12
    • 0018709825
    • A dynamic allocation index for the discounted multiarmed bandit problem
    • 10.1093/biomet/66.3.561
    • J. C. Gittins D. M. Jones 1979 A dynamic allocation index for the discounted multiarmed bandit problem Biometrika 66 561 565 10.1093/biomet/66.3.561
    • (1979) Biometrika , vol.66 , pp. 561-565
    • Gittins, J.C.1    Jones, D.M.2
  • 13
    • 0001976283
    • Approximation to Bayes risk in repeated plays
    • M. Dresher A. Tucker P. Wolfe (eds). Princeton University Press Princeton
    • Hannan, J. (1957). Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, & P. Wolfe (Eds.), Contributions to the theory of games (pp. 97-139). Princeton: Princeton University Press.
    • (1957) Contributions to the Theory of Games , pp. 97-139
    • Hannan, J.1
  • 14
    • 84947403595
    • Probability inequalities for sums of bounded random variables
    • 0127.10602 10.2307/2282952 144363
    • W. Hoeffding 1963 Probability inequalities for sums of bounded random variables Journal of the American Statistical Association 58 13 30 0127.10602 10.2307/2282952 144363
    • (1963) Journal of the American Statistical Association , vol.58 , pp. 13-30
    • Hoeffding, W.1
  • 15
    • 24644463787
    • Efficient algorithms for online decision problems
    • DOI 10.1016/j.jcss.2004.10.016, PII S0022000004001394
    • A. T. Kalai S. Vempala 2005 Efficient algorithms for online decision problems Journal of Computer and System Sciences 71 291 307 1094.68112 10.1016/j.jcss.2004.10.016 2168355 (Pubitemid 41278182)
    • (2005) Journal of Computer and System Sciences , vol.71 , Issue.3 , pp. 291-307
    • Kalai, A.1    Vempala, S.2
  • 17
    • 0001280583
    • Über dyadische Brüche
    • 10.1007/BF01192399 1544623
    • A. Khintchine 1923 Über dyadische Brüche Mathematische Zeitschrift 18 109 116 10.1007/BF01192399 1544623
    • (1923) Mathematische Zeitschrift , vol.18 , pp. 109-116
    • Khintchine, A.1
  • 18
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • 0568.62074 10.1016/0196-8858(85)90002-8 776826
    • T. L. Lai H. Robbins 1985 Asymptotically efficient adaptive allocation rules Advances in Applied Mathematics 6 4 22 0568.62074 10.1016/0196-8858(85)90002-8 776826
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.L.1    Robbins, H.2
  • 20
    • 35148838877
    • The weighted majority algorithm
    • 0804.68121 10.1006/inco.1994.1009 1265851 An extended abstract appeared in IEEE symposium on foundations of computer science, 1989 (pp. 256-261)
    • N. Littlestone M. K. Warmuth 1994 The weighted majority algorithm Information and Computation 108 212 261 0804.68121 10.1006/inco.1994.1009 1265851 An extended abstract appeared in IEEE symposium on foundations of computer science, 1989 (pp. 256-261)
    • (1994) Information and Computation , vol.108 , pp. 212-261
    • Littlestone, N.1    Warmuth, M.K.2
  • 21
    • 84966203785
    • Some aspects of the sequential design of experiments
    • 0049.37009 10.1090/S0002-9904-1952-09620-8 50246
    • H. Robbins 1952 Some aspects of the sequential design of experiments Bulletin of the American Mathematical Society 58 527 535 0049.37009 10.1090/S0002-9904-1952-09620-8 50246
    • (1952) Bulletin of the American Mathematical Society , vol.58 , pp. 527-535
    • Robbins, H.1
  • 23
    • 0032047115
    • A game of prediction with expert advice
    • 0945.68528 10.1006/jcss.1997.1556 1629690 An extended abstract appeared in COLT, 1995 (pp. 51-60)
    • V. G. Vovk 1998 A game of prediction with expert advice Journal of Computer and System Sciences 56 153 173 0945.68528 10.1006/jcss.1997.1556 1629690 An extended abstract appeared in COLT, 1995 (pp. 51-60)
    • (1998) Journal of Computer and System Sciences , vol.56 , pp. 153-173
    • Vovk, V.G.1


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.