-
2
-
-
84898063697
-
Competing in the dark: An efficient algorithm for bandit linear optimization
-
Abernethy J, Hazan E, Rakhlin A (2008) Competing in the dark: An efficient algorithm for bandit linear optimization. Proc. 21st Conf. Learn. Theory COLT '08 (Omnipress, Madison, WI), 263-273.
-
(2008)
Proc. 21st Conf. Learn. Theory COLT '08 (Omnipress, Madison, WI)
, pp. 263-273
-
-
Abernethy, J.1
Hazan, E.2
Rakhlin, A.3
-
3
-
-
84862282885
-
Optimal allocation strategies for the dark pool problem
-
Agarwal A, Bartlett P, Dama M (2010) Optimal allocation strategies for the dark pool problem. Proc. 13th Internat. Conf. Artificial Intelligence Statist. AISTATS '10, Sardinia, Italy, 9-16.
-
(2010)
Proc. 13th Internat. Conf. Artificial Intelligence Statist. AISTATS '10, Sardinia, Italy
, pp. 9-16
-
-
Agarwal, A.1
Bartlett, P.2
Dama, M.3
-
6
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Auer P, Cesa-Bianchi N, Freund Y, Schapire RE (2002) The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32(1):48-77.
-
(2002)
SIAM J. Comput.
, vol.32
, Issue.1
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
7
-
-
84898032961
-
A near-optimal algorithm for finite partial-monitoring games against adversarial opponents
-
Bartók G (2013) A near-optimal algorithm for finite partial-monitoring games against adversarial opponents. Proc. 26th Conf. Learn. Theory, COLT '13, Princeton, New Jersey, 696-710.
-
(2013)
Proc. 26th Conf. Learn. Theory, COLT '13, Princeton, New Jersey
, pp. 696-710
-
-
Bartók, G.1
-
8
-
-
84867874515
-
Partial monitoring with side information
-
Bshouty NH, Stoltz G, Vayatis N, Zeugmann T, eds.
-
Bartók G, Szepesvári Cs (2012) Partial monitoring with side information. Bshouty NH, Stoltz G, Vayatis N, Zeugmann T, eds. Proc. 23rd Internat. Conf. Algorithmic Learn. Theory, ALT '12 (Springer, Berlin, Heidelberg), 305-319.
-
(2012)
Proc. 23rd Internat. Conf. Algorithmic Learn. Theory, ALT '12 (Springer, Berlin, Heidelberg)
, pp. 305-319
-
-
Bartók, G.1
Szepesvári, Cs.2
-
9
-
-
78249286512
-
Toward a classification of finite partial-monitoring games
-
Hutter M, Stephan F, Vovk V, Zeugmann T, eds.
-
Bartók G, Pál D, Szepesvári C (2010) Toward a classification of finite partial-monitoring games. Hutter M, Stephan F, Vovk V, Zeugmann T, eds. Proc. 21st Internat. Conf. Algorithmic Learn. Theory, ALT '10 (Springer, Berlin, Heidelberg), 224-238.
-
(2010)
Proc. 21st Internat. Conf. Algorithmic Learn. Theory, ALT '10 (Springer, Berlin, Heidelberg)
, pp. 224-238
-
-
Bartók, G.1
Pál, D.2
Szepesvári, C.3
-
10
-
-
84867849262
-
Minimax regret of finite partial-monitoring games in stochastic environments
-
Kakade SM, Von Luxburg U, eds.
-
Bartók G, Pál D, Szepesvári C (2011) Minimax regret of finite partial-monitoring games in stochastic environments. Kakade SM, Von Luxburg U, eds. Proc. 24th Conf. Learn. Theory, COLT '11, Budapest, Hungary, 133-154.
-
(2011)
Proc. 24th Conf. Learn. Theory, COLT '11, Budapest, Hungary
, pp. 133-154
-
-
Bartók, G.1
Pál, D.2
Szepesvári, C.3
-
11
-
-
84867133798
-
An adaptive algorithm for finite stochastic partial monitoring
-
Bartók G, Zolghadr N, Szepesvári Cs (2012) An adaptive algorithm for finite stochastic partial monitoring. Proc. 29th Internat. Conf. Machine Learn., ICML '12 (Omnipress, Madison, WI).
-
(2012)
Proc. 29th Internat. Conf. Machine Learn., ICML '12 (Omnipress, Madison, WI)
-
-
Bartók, G.1
Zolghadr, N.2
Szepesvári, Cs.3
-
12
-
-
34547254640
-
From external to internal regret
-
Blum A, Mansour Y (2007) From external to internal regret. J. Machine Learn. Res. 8:1307-1324.
-
(2007)
J. Machine Learn. Res.
, vol.8
, pp. 1307-1324
-
-
Blum, A.1
Mansour, Y.2
-
14
-
-
33748442333
-
Regret minimization under partial monitoring
-
Cesa-Bianchi N, Lugosi G, Stoltz G (2006) Regret minimization under partial monitoring. Math. Oper. Res. 31(3):562-580.
-
(2006)
Math. Oper. Res.
, vol.31
, Issue.3
, pp. 562-580
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
Stoltz, G.3
-
16
-
-
20744454447
-
Online convex optimization in the bandit setting: Gradient descent without a gradient
-
Flaxman AD, Kalai AT, McMahan HB (2005) Online convex optimization in the bandit setting: Gradient descent without a gradient. Proc. 16th ACM-SIAM Sympos. Discrete Algorithms, SODA '05 (SIAM, Philadelphia), 385-394.
-
(2005)
Proc. 16th ACM-SIAM Sympos. Discrete Algorithms, SODA '05 (SIAM, Philadelphia)
, pp. 385-394
-
-
Flaxman, A.D.1
Kalai, A.T.2
McMahan, H.B.3
-
18
-
-
0031256578
-
Calibrated learning and correlated equilibrium
-
Foster DP, Vohra RV (1997) Calibrated learning and correlated equilibrium. Games Econom. Behav. 21(1-2):40-55.
-
(1997)
Games Econom. Behav.
, vol.21
, Issue.1-2
, pp. 40-55
-
-
Foster, D.P.1
Vohra, R.V.2
-
19
-
-
0345412655
-
The value of knowing a demand curve: Bounds on regret for online posted-price auctions
-
Kleinberg R, Leighton T (2003) The value of knowing a demand curve: Bounds on regret for online posted-price auctions. Proc. 44th IEEE Sympos. Foundations Comput. Sci. 2003, FOCS '03 (IEEE Computer Society, Washington, DC), 594-605.
-
(2003)
Proc. 44th IEEE Sympos. Foundations Comput. Sci. 2003, FOCS '03 (IEEE Computer Society, Washington, DC)
, pp. 594-605
-
-
Kleinberg, R.1
Leighton, T.2
-
21
-
-
84926078662
-
-
Cambridge University Press, Cambridge, UK
-
Lugosi G, Cesa-Bianchi N (2006) Prediction, Learning, and Games (Cambridge University Press, Cambridge, UK).
-
(2006)
Prediction, Learning, and Games
-
-
Lugosi, G.1
Cesa-Bianchi, N.2
-
22
-
-
61349116274
-
Strategies for prediction under imperfect monitoring
-
Lugosi G, Mannor S, Stoltz G (2008) Strategies for prediction under imperfect monitoring. Math. Oper. Res. 33(3):513-528.
-
(2008)
Math. Oper. Res.
, vol.33
, Issue.3
, pp. 513-528
-
-
Lugosi, G.1
Mannor, S.2
Stoltz, G.3
-
23
-
-
79960129843
-
Internal regret with partial monitoring: Calibration-based optimal algorithms
-
Perchet V (2011) Internal regret with partial monitoring: Calibration-based optimal algorithms. J. Machine Learn. Res. 12:1893-1921.
-
(2011)
J. Machine Learn. Res.
, vol.12
, pp. 1893-1921
-
-
Perchet, V.1
-
24
-
-
84898041886
-
Discrete prediction games with arbitrary feedback and loss
-
Springer, Berlin, Heidelberg
-
Piccolboni A, Schindelhauer C (2001) Discrete prediction games with arbitrary feedback and loss. Computational Learning Theory, Lecture Notes in Computer Science, Vol. 2111 (Springer, Berlin, Heidelberg), 208-223.
-
(2001)
Computational Learning Theory, Lecture Notes in Computer Science
, vol.2111
, pp. 208-223
-
-
Piccolboni, A.1
Schindelhauer, C.2
-
25
-
-
0013327190
-
Minimizing regret: The general case
-
Rustichini A (1999) Minimizing regret: The general case. Games Econom. Behav. 29(1-2):224-243.
-
(1999)
Games Econom. Behav.
, vol.29
, Issue.1-2
, pp. 224-243
-
-
Rustichini, A.1