-
1
-
-
84898073179
-
Beating the adaptive bandit with high probability
-
J. Abernethy and A. Rakhlin. Beating the adaptive bandit with high probability. In COLT, 2009.
-
(2009)
COLT
-
-
Abernethy, J.1
Rakhlin, A.2
-
2
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R.E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2003.
-
(2003)
SIAM Journal on Computing
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
3
-
-
78249286512
-
Toward a classification of finite partial-monitoring games
-
Springer
-
G. Bartok, D. Pal, and C. Szepesvari. Toward a classification of finite partial-monitoring games. In Algorithmic Learning Theory, pages 224-238. Springer, 2010.
-
(2010)
Algorithmic Learning Theory
, pp. 224-238
-
-
Bartok, G.1
Pal, D.2
Szepesvari, C.3
-
4
-
-
84867849262
-
Mini-max regret of finite partial-monitoring games in stochastic environments
-
G. Bartok, D. Pal, and C. Szepesvari. Mini-max regret of finite partial-monitoring games in stochastic environments. In Conference on Learning Theory, 2011.
-
(2011)
Conference on Learning Theory
-
-
Bartok, G.1
Pal, D.2
Szepesvari, C.3
-
6
-
-
33748442333
-
Regret minimization under partial monitoring
-
N. Cesa-Bianchi, G. Lugosi, and G. Stoltz. Regret minimization under partial monitoring. Mathematics of Operations Research, 31(3):562-580, 2006.
-
(2006)
Mathematics of Operations Research
, vol.31
, Issue.3
, pp. 562-580
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
Stoltz, G.3
-
7
-
-
0031256578
-
Calibrated learning and correlated equilibrium
-
D.P. Foster and R.V. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior, 21(1-2):40-55, 1997.
-
(1997)
Games and Economic Behavior
, vol.21
, Issue.1-2
, pp. 40-55
-
-
Foster, D.P.1
Vohra, R.V.2
-
8
-
-
61349116274
-
Strategies for prediction under imperfect monitoring
-
G. Lugosi, S. Mannor, and G. Stoltz. Strategies for prediction under imperfect monitoring. Math. Oper. Res., 33:513-528, 2008.
-
(2008)
Math. Oper. Res.
, vol.33
, pp. 513-528
-
-
Lugosi, G.1
Mannor, S.2
Stoltz, G.3
-
9
-
-
79960129843
-
Internal regret with partial monitoring: Calibration-based optimal algorithms
-
V. Perchet. Internal regret with partial monitoring: Calibration-based optimal algorithms. Journal of Machine Learning Research, 12:1893-1921, 2011.
-
(2011)
Journal of Machine Learning Research
, vol.12
, pp. 1893-1921
-
-
Perchet, V.1
-
10
-
-
84898041886
-
Discrete prediction games with arbitrary feedback and loss
-
Springer
-
A. Piccolboni and C. Schindelhauer. Discrete prediction games with arbitrary feedback and loss. In Computational Learning Theory, pages 208-223. Springer, 2001.
-
(2001)
Computational Learning Theory
, pp. 208-223
-
-
Piccolboni, A.1
Schindelhauer, C.2
-
11
-
-
0013327190
-
Minimizing regret: The general case
-
A. Rustichini. Minimizing regret: The general case. Games and Economic Behavior, 29(1-2):224-243, 1999.
-
(1999)
Games and Economic Behavior
, vol.29
, Issue.1-2
, pp. 224-243
-
-
Rustichini, A.1