-
1
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R.E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002a.
-
(2002)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
2
-
-
0036477185
-
Adaptive and self-confident on-line learning algorithms
-
P. Auer, N. Cesa-Bianchi, and C. Gentile. Adaptive and self-confident on-line learning algorithms. Journal of Computing and System Sciences, 64(1):48-75, 2002b.
-
(2002)
Journal of Computing and System Sciences
, vol.64
, Issue.1
, pp. 48-75
-
-
Auer, P.1
Cesa-Bianchi, N.2
Gentile, C.3
-
3
-
-
0002430114
-
Subjectivity and correlation in randomized strategies
-
R. J. Aumann. Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1:67-96, 1974.
-
(1974)
Journal of Mathematical Economics
, vol.1
, pp. 67-96
-
-
Aumann, R.J.1
-
4
-
-
84972545864
-
An analog of the mimimax theorem for vector payoffs
-
D. Blackwell. An analog of the mimimax theorem for vector payoffs. Pacific Journal of Mathematics, 6:1-8, 1956.
-
(1956)
Pacific Journal of Mathematics
, vol.6
, pp. 1-8
-
-
Blackwell, D.1
-
5
-
-
0030819669
-
Empirical support for Winnow and Weighted-Majority based algorithms: Results on a calendar scheduling domain
-
A. Blum. Empirical support for Winnow and Weighted-Majority based algorithms: Results on a calendar scheduling domain. Machine Learning, 26:5-23, 1997.
-
(1997)
Machine Learning
, vol.26
, pp. 5-23
-
-
Blum, A.1
-
6
-
-
0031140246
-
How to use expert advice
-
N. Cesa-Bianchi, Y. Freund, D.P. Helmbold, D. Haussler, R.E. Schapire, and M.K. Warmuth. How to use expert advice. Journal of the Association for Computing Machinery (JACM), 44(3):427-485, 1997.
-
(1997)
Journal of the Association for Computing Machinery (JACM)
, vol.44
, Issue.3
, pp. 427-485
-
-
Cesa-Bianchi, N.1
Freund, Y.2
Helmbold, D.P.3
Haussler, D.4
Schapire, R.E.5
Warmuth, M.K.6
-
7
-
-
0037614825
-
Potential-based algorithms in on-line prediction and game theory
-
N. Cesa-Bianchi and G. Lugosi. Potential-based algorithms in on-line prediction and game theory. Machine Learning, 51(3):239-261, 2003.
-
(2003)
Machine Learning
, vol.51
, Issue.3
, pp. 239-261
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
-
11
-
-
0001345686
-
Context-sensitive learning methods for text categorization
-
W. Cohen and Y. Singer. Context-sensitive learning methods for text categorization. ACM Transactions on Information Systems, 17(2):141-173, 1999.
-
(1999)
ACM Transactions on Information Systems
, vol.17
, Issue.2
, pp. 141-173
-
-
Cohen, W.1
Singer, Y.2
-
14
-
-
0002095886
-
A randomization rule for selecting forecasts
-
July-August
-
D. Foster and R. Vohra. A randomization rule for selecting forecasts. Operations Research, 41(4): 704-709, July-August 1993.
-
(1993)
Operations Research
, vol.41
, Issue.4
, pp. 704-709
-
-
Foster, D.1
Vohra, R.2
-
15
-
-
0031256578
-
Calibrated learning and correlated equilibrium
-
D. Foster and R. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior, 21:40-55, 1997.
-
(1997)
Games and Economic Behavior
, vol.21
, pp. 40-55
-
-
Foster, D.1
Vohra, R.2
-
16
-
-
0037539108
-
Asymptotic calibration
-
D. Foster and R. Vohra. Asymptotic calibration. Biometrika, 85:379-390, 1998.
-
(1998)
Biometrika
, vol.85
, pp. 379-390
-
-
Foster, D.1
Vohra, R.2
-
18
-
-
0031211090
-
A decision-theoretic generalization of on-line learning and an application to boosting
-
Y. Freund and R.E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139, 1997.
-
(1997)
Journal of Computer and System Sciences
, vol.55
, Issue.1
, pp. 119-139
-
-
Freund, Y.1
Schapire, R.E.2
-
19
-
-
0002267135
-
Adaptive game playing using multiplicative weights
-
Y. Freund and R.E. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79-103, 1999.
-
(1999)
Games and Economic Behavior
, vol.29
, pp. 79-103
-
-
Freund, Y.1
Schapire, R.E.2
-
20
-
-
0030643068
-
Using and combining predictors that specialize
-
Y. Freund, R.E. Schapire, Y. Singer, and M.K. Warmuth. Using and combining predictors that specialize. In Proceedings of the 29th Annual Symposium on Theory of Computing, pages 334-343, 1997.
-
(1997)
Proceedings of the 29th Annual Symposium on Theory of Computing
, pp. 334-343
-
-
Freund, Y.1
Schapire, R.E.2
Singer, Y.3
Warmuth, M.K.4
-
21
-
-
0001976283
-
Approximation to Bayes risk in repeated plays
-
M. Dresher, A. Tucker, and P. Wolfe, editors, Princeton University Press
-
J. Hannan. Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games, volume 3, pages 97-139. Princeton University Press, 1957.
-
(1957)
Contributions to the Theory of Games
, vol.3
, pp. 97-139
-
-
Hannan, J.1
-
22
-
-
0000908510
-
A simple adaptive procedure leading to correlated equilibrium
-
S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
-
(2000)
Econometrica
, vol.68
, pp. 1127-1150
-
-
Hart, S.1
Mas-Colell, A.2
-
23
-
-
0242684983
-
A reinforcement procedure leading to correlated equilibrium
-
Wilhelm Neuefeind Gerard Debreu and Walter Trockel, editors, Springer
-
S. Hart and A. Mas-Colell. A reinforcement procedure leading to correlated equilibrium. In Wilhelm Neuefeind Gerard Debreu and Walter Trockel, editors, Economic Essays, pages 181-200. Springer, 2001.
-
(2001)
Economic Essays
, pp. 181-200
-
-
Hart, S.1
Mas-Colell, A.2
-
24
-
-
0038404996
-
A wide range no-regret theorem
-
E. Lehrer. A wide range no-regret theorem. Games and Economic Behavior, 42:101-115, 2003.
-
(2003)
Games and Economic Behavior
, vol.42
, pp. 101-115
-
-
Lehrer, E.1
-
25
-
-
34250091945
-
Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
-
N. Littlestone. Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2:285-318, 1988.
-
(1988)
Machine Learning
, vol.2
, pp. 285-318
-
-
Littlestone, N.1
-
28
-
-
21244487467
-
Internal regret in on-line portfolio selection
-
G. Stoltz and G. Lugosi. Internal regret in on-line portfolio selection. Machine Learning, 59(1-2): 125-159, 2005.
-
(2005)
Machine Learning
, vol.59
, Issue.1-2
, pp. 125-159
-
-
Stoltz, G.1
Lugosi, G.2
-
29
-
-
33947600544
-
Learning correlated equilibria in games with compact sets of strategies
-
G. Stoltz and G. Lugosi. Learning correlated equilibria in games with compact sets of strategies. Games and Economic Behavior, 59:187-209, 2007.
-
(2007)
Games and Economic Behavior
, vol.59
, pp. 187-209
-
-
Stoltz, G.1
Lugosi, G.2
|