-
2
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
P. Auer N. Cesa-Bianchi P. Fischer 2002 Finite-time analysis of the multiarmed bandit problem Machine Learning 47 235 256 1012.68093 10.1023/A:1013689704352 (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
3
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
1029.68087 10.1137/S0097539701398375 1954855
-
P. Auer N. Cesa-Bianchi Y. Freund R. E. Schapire 2002 The nonstochastic multiarmed bandit problem SIAM Journal on Computing 32 48 77 1029.68087 10.1137/S0097539701398375 1954855
-
(2002)
SIAM Journal on Computing
, vol.32
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
4
-
-
84972574511
-
Weighted sums of certain dependent random variables
-
0178.21103 10.2748/tmj/1178243286 221571
-
K. Azuma 1967 Weighted sums of certain dependent random variables Tohoku Mathematical Journal 19 357 367 0178.21103 10.2748/tmj/1178243286 221571
-
(1967)
Tohoku Mathematical Journal
, vol.19
, pp. 357-367
-
-
Azuma, K.1
-
8
-
-
0002267135
-
Adaptive game playing using multiplicative weights
-
0964.91007 10.1006/game.1999.0738 1729311
-
Y. Freund R. E. Schapire 1999 Adaptive game playing using multiplicative weights Games and Economic Behavior 29 79 103 0964.91007 10.1006/game.1999.0738 1729311
-
(1999)
Games and Economic Behavior
, vol.29
, pp. 79-103
-
-
Freund, Y.1
Schapire, R.E.2
-
9
-
-
0030643068
-
Using and combining predictors that specialize
-
Freund, Y., Schapire, R. E., Singer, Y., & Warmuth, M. K. (1997). Using and combining predictors that specialize. In Proceedings of the 29th ACM symp. on theory of computing (STOC) (pp. 334-343).
-
(1997)
Proceedings of the 29th ACM Symp. on Theory of Computing (STOC)
, pp. 334-343
-
-
Freund, Y.1
Schapire, R.E.2
Singer, Y.3
Warmuth, M.K.4
-
12
-
-
0018709825
-
A dynamic allocation index for the discounted multiarmed bandit problem
-
10.1093/biomet/66.3.561
-
J. C. Gittins D. M. Jones 1979 A dynamic allocation index for the discounted multiarmed bandit problem Biometrika 66 561 565 10.1093/biomet/66.3. 561
-
(1979)
Biometrika
, vol.66
, pp. 561-565
-
-
Gittins, J.C.1
Jones, D.M.2
-
13
-
-
0001976283
-
Approximation to Bayes risk in repeated plays
-
M. Dresher A. Tucker P. Wolfe (eds). Princeton University Press Princeton
-
Hannan, J. (1957). Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, & P. Wolfe (Eds.), Contributions to the theory of games (pp. 97-139). Princeton: Princeton University Press.
-
(1957)
Contributions to the Theory of Games
, pp. 97-139
-
-
Hannan, J.1
-
14
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
0127.10602 10.2307/2282952 144363
-
W. Hoeffding 1963 Probability inequalities for sums of bounded random variables Journal of the American Statistical Association 58 13 30 0127.10602 10.2307/2282952 144363
-
(1963)
Journal of the American Statistical Association
, vol.58
, pp. 13-30
-
-
Hoeffding, W.1
-
15
-
-
24644463787
-
Efficient algorithms for online decision problems
-
DOI 10.1016/j.jcss.2004.10.016, PII S0022000004001394
-
A. T. Kalai S. Vempala 2005 Efficient algorithms for on-line optimization Journal of Computer and System Sciences 71 291 307 1094.68112 10.1016/j.jcss.2004.10.016 2168355 (Pubitemid 41278182)
-
(2005)
Journal of Computer and System Sciences
, vol.71
, Issue.3
, pp. 291-307
-
-
Kalai, A.1
Vempala, S.2
-
17
-
-
0001280583
-
Über dyadische Brüche
-
10.1007/BF01192399 1544623
-
A. Khintchine 1923 Über dyadische Brüche Mathematische Zeitschsift 18 109 116 10.1007/BF01192399 1544623
-
(1923)
Mathematische Zeitschsift
, vol.18
, pp. 109-116
-
-
Khintchine, A.1
-
18
-
-
0002899547
-
Asymptotically efficient adaptive allocations rules
-
0568.62074 10.1016/0196-8858(85)90002-8 776826
-
T. L. Lai H. Robbins 1985 Asymptotically efficient adaptive allocations rules Advances in Applied Mathematics 6 4 22 0568.62074 10.1016/0196-8858(85) 90002-8 776826
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
20
-
-
35148838877
-
The weighted majority algorithm
-
0804.68121 10.1006/inco.1994.1009 1265851 An extended abstract appeared in IEEE symposium on foundations of computer science, 1989 (pp. 256-261)
-
N. Littlestone M. K. Warmuth 1994 The weighted majority algorithm Information and Computation 108 212 261 0804.68121 10.1006/inco.1994.1009 1265851 An extended abstract appeared in IEEE symposium on foundations of computer science, 1989 (pp. 256-261)
-
(1994)
Information and Computation
, vol.108
, pp. 212-261
-
-
Littlestone, N.1
Warmuth, M.K.2
-
21
-
-
84966203785
-
Some aspects of the sequential design of experiments
-
0049.37009 10.1090/S0002-9904-1952-09620-8 50246
-
H. Robbins 1952 Some aspects of the sequential design of experiments Bulletin of the American Mathematical Society 58 527 535 0049.37009 10.1090/S0002-9904-1952-09620-8 50246
-
(1952)
Bulletin of the American Mathematical Society
, vol.58
, pp. 527-535
-
-
Robbins, H.1
-
23
-
-
0032047115
-
A game of prediction with expert advice
-
0945.68528 10.1006/jcss.1997.1556 1629690 An extended abstract appeared in COLT, 1995 (pp. 51-60)
-
V. G. Vovk 1998 A game of prediction with expert advice Journal of Computer and System Sciences 56 153 173 0945.68528 10.1006/jcss.1997.1556 1629690 An extended abstract appeared in COLT, 1995 (pp. 51-60)
-
(1998)
Journal of Computer and System Sciences
, vol.56
, pp. 153-173
-
-
Vovk, V.G.1
|