-
1
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, and Paul Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47, 2002a.
-
(2002)
Machine Learning
, vol.47
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
2
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal of Computing, 32(1), 2002b.
-
(2002)
SIAM Journal of Computing
, vol.32
, Issue.1
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
4
-
-
80053144086
-
Contextual bandit algorithms with supervised learning guarantees
-
Alina Beygelzimer, John Langford, Lihong Li, Lev Reyzin, and Robert Schapire. Contextual bandit algorithms with supervised learning guarantees. In Proceedings on the International Conference on Artificial Intelligence and Statistics (AISTATS), 2011.
-
(2011)
Proceedings on the International Conference on Artificial Intelligence and Statistics (AISTATS)
-
-
Beygelzimer, A.1
Langford, J.2
Li, L.3
Reyzin, L.4
Schapire, R.5
-
6
-
-
0028442413
-
Associative reinforcement learning: Functions in k-DNF
-
Leslie Pack Kaelbling. Associative reinforcement learning: Functions in k-DNF. Machine Learning, 15, 1994.
-
(1994)
Machine Learning
, vol.15
-
-
Kaelbling, L.P.1
-
9
-
-
79551505096
-
PAC-Bayesian generalization error bounds for Gaussian process classification
-
Matthias Seeger. PAC-Bayesian generalization error bounds for Gaussian process classification. Journal of Machine Learning Research, 2002.
-
(2002)
Journal of Machine Learning Research
-
-
Seeger, M.1
-
14
-
-
0032166068
-
Structural risk minimization over data-dependent hierarchies
-
John Shawe-Taylor, Peter L. Bartlett, Robert C. Williamson, and Martin Anthony. Structural risk minimization over data-dependent hierarchies. IEEE Transactions on Information Theory, 44(5), 1998.
-
(1998)
IEEE Transactions on Information Theory
, vol.44
, Issue.5
-
-
Shawe-Taylor, J.1
Bartlett, P.L.2
Williamson, R.C.3
Anthony, M.4
-
17
-
-
79957591834
-
Information theory of decisions and actions
-
Vassilis Cutsuridis, Amir Hussain, John G. Taylor, and Daniel Polani, editors. Springer
-
Naftali Tishby and Daniel Polani. Information theory of decisions and actions. In Vassilis Cutsuridis, Amir Hussain, John G. Taylor, and Daniel Polani, editors, Perception-Reason-Action Cycle: Models, Algorithms and Systems. Springer, 2010.
-
(2010)
Perception-Reason-Action Cycle: Models, Algorithms and Systems
-
-
Tishby, N.1
Polani, D.2
|