-
1
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
electronic
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32:48-77 (electronic), 2002/03.
-
(2002)
SIAM J. Comput.
, vol.32
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
2
-
-
0037520770
-
A criterion for the affine equivalence of cell complexes in R^d and convex polyhedra in R^{d+1}
-
F. Aurenhammer. A criterion for the affine equivalence of cell complexes in R^d and convex polyhedra in R^{d+1}. Discrete Comput. Geom., 2:49-64, 1987.
-
(1987)
Discrete Comput. Geom.
, vol.2
, pp. 49-64
-
-
Aurenhammer, F.1
-
3
-
-
84972574511
-
Weighted sums of certain dependent random variables
-
K. Azuma. Weighted sums of certain dependent random variables. Tôhoku Math. J. (2), 19:357-367, 1967.
-
(1967)
Tôhoku Math. J.
, vol.19
, pp. 357-367
-
-
Azuma, K.1
-
5
-
-
84972545864
-
An analog of the minimax theorem for vector payoffs
-
D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific J. Math., 6:1-8, 1956a.
-
(1956)
Pacific J. Math.
, vol.6
, pp. 1-8
-
-
Blackwell, D.1
-
8
-
-
0039956166
-
Partition of space
-
R. C. Buck. Partition of space. Amer. Math. Monthly, 50:541-544, 1943.
-
(1943)
Amer. Math. Monthly
, vol.50
, pp. 541-544
-
-
Buck, R.C.1
-
10
-
-
20544462399
-
Minimizing regret with label efficient prediction
-
DOI 10.1109/TIT.2005.847729
-
N. Cesa-Bianchi, G. Lugosi, and G. Stoltz. Minimizing regret with label efficient prediction. IEEE Trans. Inform. Theory, 51:2152-2162, 2005. (Pubitemid 40843632)
-
(2005)
IEEE Transactions on Information Theory
, vol.51
, Issue.6
, pp. 2152-2162
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
Stoltz, G.3
-
11
-
-
0030544315
-
Laws of large numbers for Hilbert space-valued mixingales with applications
-
X. Chen and H. White. Laws of large numbers for Hilbert space-valued mixingales with applications. Econometric Theory, 12:284-304, 1996.
-
(1996)
Econometric Theory
, vol.12
, pp. 284-304
-
-
Chen, X.1
White, H.2
-
12
-
-
84950454029
-
The well-calibrated Bayesian
-
A. P. Dawid. The well-calibrated Bayesian. J. Amer. Statist. Assoc., 77:605-613, 1982.
-
(1982)
J. Amer. Statist. Assoc.
, vol.77
, pp. 605-613
-
-
Dawid, A.P.1
-
13
-
-
0031256578
-
Calibrated learning and correlated equilibrium
-
DOI 10.1006/game.1997.0595, PII S0899825697905959
-
D. P. Foster and R. V. Vohra. Calibrated learning and correlated equilibrium. Games Econom. Behav., 21:40-55, 1997. (Pubitemid 127175523)
-
(1997)
Games and Economic Behavior
, vol.21
, Issue.1-2
, pp. 40-55
-
-
Foster, D.P.1
Vohra, R.V.2
-
14
-
-
0037539108
-
Asymptotic calibration
-
D. P. Foster and R. V. Vohra. Asymptotic calibration. Biometrika, 85:379-390, 1998.
-
(1998)
Biometrika
, vol.85
, pp. 379-390
-
-
Foster, D.P.1
Vohra, R.V.2
-
15
-
-
0002384441
-
On tail probabilities for martingales
-
D. A. Freedman. On tail probabilities for martingales. Ann. Probability, 3:100-118, 1975.
-
(1975)
Ann. Probability
, vol.3
, pp. 100-118
-
-
Freedman, D.A.1
-
18
-
-
0000908510
-
A simple adaptive procedure leading to correlated equilibrium
-
S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
-
(2000)
Econometrica
, vol.68
, pp. 1127-1150
-
-
Hart, S.1
Mas-Colell, A.2
-
19
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
W. Hoeffding. Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc., 58:13-30, 1963.
-
(1963)
J. Amer. Statist. Assoc.
, vol.58
, pp. 13-30
-
-
Hoeffding, W.1
-
20
-
-
77951952841
-
Near-optimal regret bounds for reinforcement learning
-
T. Jaksch, R. Ortner, and P. Auer. Near-optimal regret bounds for reinforcement learning. J. Mach. Learn. Res., 11:1563-1600, 2010.
-
(2010)
J. Mach. Learn. Res.
, vol.11
, pp. 1563-1600
-
-
Jaksch, T.1
Ortner, R.2
Auer, P.3
-
23
-
-
61349116274
-
Strategies for prediction under imperfect monitoring
-
G. Lugosi, S. Mannor, and G. Stoltz. Strategies for prediction under imperfect monitoring. Math. Oper. Res., 33:513-528, 2008.
-
(2008)
Math. Oper. Res.
, vol.33
, pp. 513-528
-
-
Lugosi, G.1
Mannor, S.2
Stoltz, G.3
-
25
-
-
0030523539
-
Projections of polytopes and the generalized Baues conjecture
-
J. Rambau and G. M. Ziegler. Projections of polytopes and the generalized Baues conjecture. Discrete Comput. Geom., 16:215-237, 1996. (Pubitemid 126317943)
-
(1996)
Discrete and Computational Geometry
, vol.16
, Issue.3
, pp. 215-237
-
-
Rambau, J.1
Ziegler, G.M.2
-
27
-
-
0013327190
-
Minimizing regret: The general case
-
A. Rustichini. Minimizing regret: the general case. Games Econom. Behav., 29:224-243, 1999.
-
(1999)
Games Econom. Behav.
, vol.29
, pp. 224-243
-
-
Rustichini, A.1
-
28
-
-
0003570325
-
-
Springer Series in Statistics. Springer-Verlag, New York, second edition
-
E. Seneta. Nonnegative Matrices and Markov Chains. Springer Series in Statistics. Springer-Verlag, New York, second edition, 1981.
-
(1981)
Nonnegative Matrices and Markov Chains
-
-
Seneta, E.1
-
29
-
-
0040104631
-
Supergames
-
Econom. Theory Econometrics Math. Econom., Academic Press, San Diego, CA
-
S. Sorin. Supergames. In Game theory and applications (Columbus, OH, 1987), Econom. Theory Econometrics Math. Econom., pages 46-63. Academic Press, San Diego, CA, 1990.
-
(1990)
Game Theory and Applications (Columbus, OH 1987)
, pp. 46-63
-
-
Sorin, S.1
-
31
-
-
0000836223
-
Exponential inequalities for sums of random vectors
-
V. Yurinskii. Exponential inequalities for sums of random vectors. Journal of Multivariate Analysis, 6:473-499, 1976.
-
(1976)
Journal of Multivariate Analysis
, vol.6
, pp. 473-499
-
-
Yurinskii, V.1