-
3
-
-
0033234630
-
Smooth discrimination analysis
-
Enno Mammen and Alexander B. Tsybakov. Smooth discrimination analysis. The Annals of Statistics, 27(6):1808-1829, 1999.
-
(1999)
The Annals of Statistics
, vol.27
, Issue.6
, pp. 1808-1829
-
-
Mammen, E.1
Tsybakov, A.B.2
-
4
-
-
3142725508
-
Optimal aggregation of classifiers in statistical learning
-
Alexander B. Tsybakov. Optimal aggregation of classifiers in statistical learning. The Annals of Statistics, 32 (1):135-166, 2004.
-
(2004)
The Annals of Statistics
, vol.32
, Issue.1
, pp. 135-166
-
-
Tsybakov, A.B.1
-
5
-
-
34547706430
-
Fast learning rates for plug-in classifiers
-
Jean-Yves Audibert and Alexander B. Tsybakov. Fast learning rates for plug-in classifiers. The Annals of Statistics, 35(2):608-633, 2007.
-
(2007)
The Annals of Statistics
, vol.35
, Issue.2
, pp. 608-633
-
-
Audibert, J.-Y.1
Tsybakov, A.B.2
-
6
-
-
77957604813
-
Generalized density clustering
-
Alessandro Rinaldo and Larry Wasserman. Generalized density clustering. The Annals of Statistics, 38(5):2678-2722, 2010.
-
(2010)
The Annals of Statistics
, vol.38
, Issue.5
, pp. 2678-2722
-
-
Rinaldo, A.1
Wasserman, L.2
-
9
-
-
85162059109
-
A reduction from apprenticeship learning to classification
-
J. Lafferty, C. K. I.Williams, J. Shawe-Taylor, R.S. Zemel, and A. Culotta, editors
-
Omar Syed and Robert E. Schapire. A reduction from apprenticeship learning to classification. In J. Lafferty, C. K. I.Williams, J. Shawe-Taylor, R.S. Zemel, and A. Culotta, editors, Advances in Neural Information Processing Systems (NIPS - 23), pages 2253-2261, 2010.
-
(2010)
Advances in Neural Information Processing Systems (NIPS - 23)
, pp. 2253-2261
-
-
Syed, O.1
Schapire, R.E.2
-
10
-
-
40849145988
-
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
-
András Antos, Csaba Szepesvári, and Rémi Munos. Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path. Machine Learning, 71:89-129, 2008.
-
(2008)
Machine Learning
, vol.71
, pp. 89-129
-
-
Antos, A.1
Szepesvári, C.2
Munos, R.3
-
12
-
-
70449644892
-
Regularized fitted Q-iteration for planning in continuous-space markovian decision problems
-
June
-
Amir-massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, and Shie Mannor. Regularized fitted Q-iteration for planning in continuous-space Markovian Decision Problems. In Proceedings of American Control Conference (ACC), pages 725-730, June 2009.
-
(2009)
Proceedings of American Control Conference (ACC(
, pp. 725-730
-
-
Farahmand, A.-M.1
Ghavamzadeh, M.2
Szepesvári, C.3
Mannor, S.4
-
13
-
-
70049096468
-
Regularized policy iteration
-
D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors. MIT Press
-
Amir-massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, and Shie Mannor. Regularized policy iteration. In D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, editors, Advances in Neural Information Processing Systems (NIPS - 21), pages 441-448. MIT Press, 2009.
-
(2009)
Advances in Neural Information Processing Systems (NIPS - 21)
, pp. 441-448
-
-
Farahmand, A.-M.1
Ghavamzadeh, M.2
Szepesvári, C.3
Mannor, S.4
-
17
-
-
77956541799
-
Toward off-policy learning control with function approximation
-
Johannes Fürnkranz and Thorsten Joachims, editors, Haifa, Israel, June. Omnipress
-
Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, and Richard S. Sutton. Toward off-policy learning control with function approximation. In Johannes Fürnkranz and Thorsten Joachims, editors, Proceedings of the 27th International Conference on Machine Learning (ICML-10), pages 719-726, Haifa, Israel, June 2010. Omnipress.
-
(2010)
Proceedings of the 27th International Conference on Machine Learning (ICML-10)
, pp. 719-726
-
-
Maei, H.R.1
Szepesvári, C.2
Bhatnagar, S.3
Sutton, R.S.4
-
19
-
-
33646398129
-
Neural fitted Q iteration - First experiences with a data efficient neural reinforcement learning method
-
Martin Riedmiller. Neural fitted Q iteration - first experiences with a data efficient neural reinforcement learning method. In 16th European Conference on Machine Learning, pages 317-328, 2005.
-
(2005)
16th European Conference on Machine Learning
, pp. 317-328
-
-
Riedmiller, M.1
-
21
-
-
0348090400
-
The linear programming approach to approximate dynamic programming
-
Daniela Pucci de Farias and Benjamin Van Roy. The linear programming approach to approximate dynamic programming. Operations Research, 51(6):850-865, 2003.
-
(2003)
Operations Research
, vol.51
, Issue.6
, pp. 850-865
-
-
De Farias, D.P.1
Van Roy, B.2
-
22
-
-
71149105671
-
Constraint relaxation in approximate linear programs
-
New York, NY, USA. ACM
-
Marek Petrik and Shlomo Zilberstein. Constraint relaxation in approximate linear programs. In Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, pages 809-816, New York, NY, USA, 2009. ACM.
-
(2009)
Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09
, pp. 809-816
-
-
Petrik, M.1
Zilberstein, S.2
-
25
-
-
85162063395
-
Error propagation for approximate policy and value iteration
-
J. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R.S. Zemel, and A. Culotta, editors
-
Amir-massoud Farahmand, Rémi Munos, and Csaba Szepesvári. Error propagation for approximate policy and value iteration. In J. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R.S. Zemel, and A. Culotta, editors, Advances in Neural Information Processing Systems (NIPS - 23), pages 568-576. 2010.
-
(2010)
Advances in Neural Information Processing Systems (NIPS - 23)
, pp. 568-576
-
-
Farahmand, A.-M.1
Munos, R.2
Szepesvári, C.3
|