1. Allenberg, C., Auer, P., Györfi, L., Ottucsák, G.: Hannan consistency in on-line learning in case of unbounded losses under partial monitoring. In: Balcázar, J.L., Long, P.M., Stephan, F. (eds.) ALT 2006. LNCS (LNAI), vol. 4264, pp. 229-243. Springer, Heidelberg (2006)
2. Audibert, J.-Y., Bubeck, S.: Regret bounds and minimax policies under partial monitoring. Journal of Machine Learning Research 11, 2635-2686 (2010)
4. Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32(1), 48-77 (2002)
6. Bubeck, S., Cesa-Bianchi, N., Kakade, S.M.: Towards minimax policies for online linear optimization with bandit feedback. In: Proceedings of the 25th Annual Conference on Learning Theory (COLT), pp. 1-14 (2012)
7. Cesa-Bianchi, N., Lugosi, G.: Prediction, Learning, and Games. Cambridge University Press, New York (2006)
9. Dani, V., Hayes, T., Kakade, S.: The price of bandit information for online optimization. In: Advances in Neural Information Processing Systems (NIPS), vol. 20, pp. 345-352 (2008)
10. György, A., Linder, T., Lugosi, G., Ottucsák, G.: The on-line shortest path problem under partial monitoring. Journal of Machine Learning Research 8, 2369-2403 (2007)
11. Hannan, J.: Approximation to Bayes risk in repeated play. Contributions to the Theory of Games 3, 97-139 (1957)
13. Koolen, W., Warmuth, M., Kivinen, J.: Hedging structured concepts. In: Proceedings of the 23rd Annual Conference on Learning Theory (COLT), pp. 93-105 (2010)
14. McMahan, H.B., Blum, A.: Online geometric optimization in the bandit setting against an adaptive adversary. In: Shawe-Taylor, J., Singer, Y. (eds.) COLT 2004. LNCS (LNAI), vol. 3120, pp. 109-123. Springer, Heidelberg (2004)
15. Poland, J.: FPL analysis for adaptive bandits. In: Lupanov, O.B., Kasim-Zade, O.M., Chaskin, A.V., Steinhöfel, K. (eds.) SAGA 2005. LNCS, vol. 3777, pp. 58-69. Springer, Heidelberg (2005)
16. Suehiro, D., Hatano, K., Kijima, S., Takimoto, E., Nagano, K.: Online prediction under submodular constraints. In: Bshouty, N.H., Stoltz, G., Vayatis, N., Zeugmann, T. (eds.) ALT 2012. LNCS, vol. 7568, pp. 260-274. Springer, Heidelberg (2012)