-
1
-
-
84919945245
-
Toward a classification of finite partial- monitoring games
-
Antos, Andras, Bartok, Gabor, Pal, David, and Szepesvari, Csaba. Toward a classification of finite partial- monitoring games. Theoretical Computer Science, 2012.
-
(2012)
Theoretical Computer Science
-
-
Antos, A.1
Bartok, G.2
Pal, D.3
Szepesvari, C.4
-
2
-
-
84898079018
-
Minimax policies for adversarial and stochastic bandits
-
Audibert, Jean-Yves and Bubeck, Sebastien. Minimax policies for adversarial and stochastic bandits. In COLT, 2009.
-
(2009)
COLT
-
-
Audibert, J.-Y.1
Bubeck, S.2
-
3
-
-
0036568025
-
Finite-time analysis of the multi armed bandit problem
-
Auer, Peter, Cesa-Bianchi, Nicolo, and Fischer, Paul. Finite-time analysis of the multi armed bandit problem. Machine learning, 47(2-3):235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
4
-
-
84919930512
-
An adaptive algorithm for finite stochastic partial monitoring (extended version)
-
June
-
Bartok, G., Zolghadr, N., and Szepesvari, Cs. An adaptive algorithm for finite stochastic partial monitoring (extended version). In ICML, pp. 1-20, June 2012.
-
(2012)
ICML
, pp. 1-20
-
-
Bartok, G.1
Zolghadr, N.2
Szepesvari, C.3
-
5
-
-
84867849262
-
Minimax regret of finite partial-monitoring games in stochastic environments
-
Bartok, Gabor, Pal, David, and Szepesvari, Csaba. Minimax regret of finite partial-monitoring games in stochastic environments. Journal of Machine Learning Research-Proceedings Track, 19:133-154, 2011.
-
(2011)
Journal of Machine Learning Research-Proceedings Track
, vol.19
, pp. 133-154
-
-
Bartok, G.1
Pal, D.2
Szepesvari, C.3
-
6
-
-
84874045238
-
Regret analysis of stochastic and non stochastic multi-armed bandit problems
-
Bubeck, Sebastien and Cesa-Bianchi, Nicolo. Regret analysis of stochastic and non stochastic multi-armed bandit problems. Foundations and Trends in Machine Learning, 5(1):1-122, 2012.
-
(2012)
Foundations and Trends in Machine Learning
, vol.5
, Issue.1
, pp. 1-122
-
-
Bubeck, S.1
Cesa-Bianchi, N.2
-
9
-
-
33748442333
-
Regret minimization under partial monitoring
-
Cesa-Bianchi, Nicolo, Lugosi, Gabor, and Stoltz, Gilles. Regret minimization under partial monitoring. Mathematics of Operations Research, 31(3):562-580, 2006.
-
(2006)
Mathematics of Operations Research
, vol.31
, Issue.3
, pp. 562-580
-
-
Cesa-Bianchi, N.1
Lugosi, G.2
Stoltz, G.3
-
10
-
-
84897515317
-
Combinatorial multi-armed bandit: General framework and applications
-
Chen, Wei, Wang, Yajun, and Yuan, Yang. Combinatorial multi-armed bandit: General framework and applications. In Proceedings of the 30th International Conference on Machine Learning (ICML-13), pp. 151-159, 2013.
-
(2013)
Proceedings of the 30th International Conference on Machine Learning (ICML-13)
, pp. 151-159
-
-
Chen, W.1
Wang, Y.2
Yuan, Y.3
-
11
-
-
84867858040
-
Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations
-
October
-
Gai, Yi, Krishnamachari, Bhaskar, and Jain, Rahul. Combinatorial network optimization with unknown variables: Multi-armed bandits with linear rewards and individual observations. IEEE/ACM Trans. Netw., 20(5):1466- 1478, October 2012. ISSN 1063-6692.
-
(2012)
IEEE/ACM Trans. Netw.
, vol.20
, Issue.5
, pp. 1466-1478
-
-
Gai, Y.1
Krishnamachari, B.2
Jain, R.3
-
13
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai, Tze Leung and Robbins, Herbert. Asymptotically efficient adaptive allocation rules. Advances in applied mathematics, 6(1):4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, Issue.1
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
14
-
-
0024766543
-
The weighted majority algorithm
-
IEEE
-
Littlestone, Nick and Warmuth, Manfred K. The weighted majority algorithm. In Foundations of Computer Science, 1989., 30th Annual Symposium on, pp. 256-261. IEEE, 1989.
-
(1989)
Foundations of Computer Science, 1989., 30th Annual Symposium on
, pp. 256-261
-
-
Littlestone, N.1
Warmuth, M.K.2
-
15
-
-
84898041886
-
Discrete prediction games with arbitrary feedback and loss
-
Springer
-
Piccolboni, Antonio and Schindelhauer, Christian. Discrete prediction games with arbitrary feedback and loss. In Computational Learning Theory, pp. 208-223. Springer, 2001.
-
(2001)
Computational Learning Theory
, pp. 208-223
-
-
Piccolboni, A.1
Schindelhauer, C.2
-
16
-
-
84893549814
-
Some aspects of the sequential design of experiments
-
Springer
-
Robbins, Herbert. Some aspects of the sequential design of experiments. In Herbert Robbins Selected Papers, pp. 169-177. Springer, 1985.
-
(1985)
Herbert Robbins Selected Papers
, pp. 169-177
-
-
Robbins, H.1
|