-
1
-
-
0041966002
-
Using confidence bounds for exploitation-exploration trade-offs
-
Auer, P. 2002. Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. Res. 3 397-422.
-
(2002)
J. Machine Learn. Res.
, vol.3
, pp. 397-422
-
-
Auer, P.1
-
4
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Auer, P., N. Cesa-Bianchi, Y. Freund, R. E. Schapire. 2002. The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32 48-77.
-
(2002)
SIAM J. Comput.
, vol.32
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
5
-
-
84972574511
-
Weighted sums of certain dependent random variables
-
Azuma, K. 1967. Weighted sums of certain dependent random variables. Tohoku Math. J. 68 357-367.
-
(1967)
Tohoku Math. J.
, vol.68
, pp. 357-367
-
-
Azuma, K.1
-
6
-
-
0038623721
-
On pseudo-games
-
Baños, A. 1968. On pseudo-games. Ann. Math. Statist. 39 1932-1945.
-
(1968)
Ann. Math. Statist.
, vol.39
, pp. 1932-1945
-
-
Baños, A.1
-
7
-
-
84972545864
-
An analog of the minimax theorem for vector payoffs
-
Blackwell, D. 1956. An analog of the minimax theorem for vector payoffs. Pacific J. Math. 6 1-8.
-
(1956)
Pacific J. Math.
, vol.6
, pp. 1-8
-
-
Blackwell, D.1
-
10
-
-
4444253732
-
Online learning in online auctions
-
Blum, A., V. Kumar, A. Rudra, F. Wu. 2004. Online learning in online auctions. Theoret. Comput. Sci. 324 137-146.
-
(2004)
Theoret. Comput. Sci.
, vol.324
, pp. 137-146
-
-
Blum, A.1
Kumar, V.2
Rudra, A.3
Wu, F.4
-
11
-
-
0033234631
-
On prediction of individual sequences
-
Cesa-Bianchi, N., G. Lugosi. 1999. On prediction of individual sequences. Ann. Statist. 27 1865-1895.
-
(1999)
Ann. Statist.
, vol.27
, pp. 1865-1895
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
-
12
-
-
0037614825
-
Potential-based algorithms in on-line prediction and game theory
-
Cesa-Bianchi, N., G. Lugosi. 2003. Potential-based algorithms in on-line prediction and game theory. Machine Learn. 51 239-261.
-
(2003)
Machine Learn.
, vol.51
, pp. 239-261
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
-
13
-
-
84926078662
-
-
Cambridge University Press, Cambridge, UK
-
Cesa-Bianchi, N., G. Lugosi. 2006. Prediction, Learning, and Games. Cambridge University Press, Cambridge, UK.
-
(2006)
Prediction, Learning, and Games
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
-
15
-
-
0031140246
-
How to use expert advice
-
Cesa-Bianchi, N., Y. Freund, D. P. Helmbold, D. Haussler, R. Schapire, M. K. Warmuth. 1997. How to use expert advice. J. ACM 44 427-485.
-
(1997)
J. ACM
, vol.44
, pp. 427-485
-
-
Cesa-Bianchi, N.1
Freund, Y.2
Helmbold, D.P.3
Haussler, D.4
Schapire, R.5
Warmuth, M.K.6
-
18
-
-
0031256578
-
Calibrated learning and correlated equilibrium
-
Foster, D., R. Vohra. 1997. Calibrated learning and correlated equilibrium. Games Econom. Behav. 21 40-55.
-
(1997)
Games Econom. Behav.
, vol.21
, pp. 40-55
-
-
Foster, D.1
Vohra, R.2
-
19
-
-
0037539108
-
Asymptotic calibration
-
Foster, D., R. Vohra. 1998. Asymptotic calibration. Biometrika 85 379-390.
-
(1998)
Biometrika
, vol.85
, pp. 379-390
-
-
Foster, D.1
Vohra, R.2
-
20
-
-
0002476325
-
Regret in the on-line decision problem
-
Foster, D., R. Vohra. 1999. Regret in the on-line decision problem. Games Econom. Behav. 29 7-36.
-
(1999)
Games Econom. Behav.
, vol.29
, pp. 7-36
-
-
Foster, D.1
Vohra, R.2
-
21
-
-
0002384441
-
On tail probabilities for martingales
-
Freedman, D. A. 1975. On tail probabilities for martingales. Ann. Probab. 3 100-118.
-
(1975)
Ann. Probab.
, vol.3
, pp. 100-118
-
-
Freedman, D.A.1
-
22
-
-
0000668347
-
Universal consistency and cautious fictitious play
-
Fudenberg, D., D. K. Levine. 1995. Universal consistency and cautious fictitious play. J. Econom. Dynam. Control 19 1065-1089.
-
(1995)
J. Econom. Dynam. Control
, vol.19
, pp. 1065-1089
-
-
Fudenberg, D.1
Levine, D.K.2
-
24
-
-
0001976283
-
Approximation to Bayes risk in repeated play
-
M. Dresher, A. W. Tucker, P. Wolfe, eds. Princeton University Press, Princeton, NJ
-
Hannan, J. 1957. Approximation to Bayes risk in repeated play. M. Dresher, A. W. Tucker, P. Wolfe, eds. Contributions to the Theory of Games, Vol. 3. Princeton University Press, Princeton, NJ, 97-139.
-
(1957)
Contributions to the Theory of Games
, vol.3
, pp. 97-139
-
-
Hannan, J.1
-
25
-
-
0000908510
-
A simple adaptive procedure leading to correlated equilibrium
-
Hart, S., A. Mas-Colell. 2000. A simple adaptive procedure leading to correlated equilibrium. Econometrica 68 1127-1150.
-
(2000)
Econometrica
, vol.68
, pp. 1127-1150
-
-
Hart, S.1
Mas-Colell, A.2
-
26
-
-
0013327463
-
A general class of adaptive strategies
-
Hart, S., A. Mas-Colell. 2001. A general class of adaptive strategies. J. Econom. Theory 98 26-54.
-
(2001)
J. Econom. Theory
, vol.98
, pp. 26-54
-
-
Hart, S.1
Mas-Colell, A.2
-
27
-
-
0242684983
-
A reinforcement procedure leading to correlated equilibrium
-
G. Debreu, W. Neuefeind, W. Trockel, eds. Springer, New York
-
Hart, S., A. Mas-Colell. 2002. A reinforcement procedure leading to correlated equilibrium. G. Debreu, W. Neuefeind, W. Trockel, eds. Economic Essays: A Festschrift for Werner Hildenbrand. Springer, New York, 181-200.
-
(2002)
Economic Essays: A Festschrift for Werner Hildenbrand
, pp. 181-200
-
-
Hart, S.1
Mas-Colell, A.2
-
30
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
Hoeffding, W. 1963. Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc. 58 13-30.
-
(1963)
J. Amer. Statist. Assoc.
, vol.58
, pp. 13-30
-
-
Hoeffding, W.1
-
31
-
-
0345412655
-
The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions
-
IEEE Press, Piscataway, NJ
-
Kleinberg, R., T. Leighton. 2003. The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions. Proc. 44th Annual IEEE Sympos. Foundations Comput. Sci. IEEE Press, Piscataway, NJ, 594-605.
-
(2003)
Proc. 44th Annual IEEE Sympos. Foundations Comput. Sci.
, pp. 594-605
-
-
Kleinberg, R.1
Leighton, T.2
-
35
-
-
0038675791
-
On repeated games with incomplete information played by non-Bayesian players
-
Megiddo, N. 1980. On repeated games with incomplete information played by non-Bayesian players. Internat. J. Game Theory 9 157-167.
-
(1980)
Internat. J. Game Theory
, vol.9
, pp. 157-167
-
-
Megiddo, N.1
-
37
-
-
0003351019
-
Repeated games
-
9421, CORE, Louvain-la-Neuve, Belgium
-
Mertens, J.-F., S. Sorin, S. Zamir. 1994. Repeated games. Discussion Paper 9420, 9421, 9422, CORE, Louvain-la-Neuve, Belgium.
-
(1994)
Discussion Paper
, vol.9420-9422
-
-
Mertens, J.-F.1
Sorin, S.2
Zamir, S.3
-
39
-
-
0013327190
-
Minimizing regret: The general case
-
Rustichini, A. 1999. Minimizing regret: The general case. Games Econom. Behav. 29 224-243.
-
(1999)
Games Econom. Behav.
, vol.29
, pp. 224-243
-
-
Rustichini, A.1
-
40
-
-
21244487467
-
Internal regret in on-line portfolio selection
-
Stoltz, G., G. Lugosi. 2005. Internal regret in on-line portfolio selection. Machine Learn. 59 125-159.
-
(2005)
Machine Learn.
, vol.59
, pp. 125-159
-
-
Stoltz, G.1
Lugosi, G.2
-
42
-
-
0035413537
-
Competitive on-line statistics
-
Vovk, V. G. 2001. Competitive on-line statistics. Internat. Statist. Rev. 69 213-248.
-
(2001)
Internat. Statist. Rev.
, vol.69
, pp. 213-248
-
-
Vovk, V.G.1
-
43
-
-
0035443342
-
Universal prediction of binary individual sequences in the presence of noise
-
Weissman, T., N. Merhav. 2001. Universal prediction of binary individual sequences in the presence of noise. IEEE Trans. Inform. Theory 47 2151-2173.
-
(2001)
IEEE Trans. Inform. Theory
, vol.47
, pp. 2151-2173
-
-
Weissman, T.1
Merhav, N.2
-
44
-
-
0035397523
-
Twofold universal prediction schemes for achieving the finite state predictability of a noisy individual binary sequence
-
Weissman, T., N. Merhav, A. Somekh-Baruch. 2001. Twofold universal prediction schemes for achieving the finite state predictability of a noisy individual binary sequence. IEEE Trans. Inform. Theory 47 1849-1866.
-
(2001)
IEEE Trans. Inform. Theory
, vol.47
, pp. 1849-1866
-
-
Weissman, T.1
Merhav, N.2
Somekh-Baruch, A.3
|