-
1
-
-
80052395875
-
Preference-based policy learning
-
Akrour, R., Schoenauer, M., and Sebag, M. Preference-based policy learning. In Proceedings ECMLPKDD 2011, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, pp. 12-27, 2011.
-
(2011)
Proceedings ECMLPKDD 2011, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
, pp. 12-27
-
-
Akrour, R.1
Schoenauer, M.2
Sebag, M.3
-
3
-
-
0346398224
-
General bounds on statistical query learning and PAC learning with noise via hypothesis boosting
-
Aslam, J.A. and Decatur, S.E. General bounds on statistical query learning and PAC learning with noise via hypothesis boosting. Inf. Comput., 141(2):85-118, 1998.
-
(1998)
Inf. Comput.
, vol.141
, Issue.2
, pp. 85-118
-
-
Aslam, J.A.1
Decatur, S.E.2
-
4
-
-
38149013086
-
Tuning bandit algorithms in stochastic environments
-
Audibert, J.Y., Munos, R., and Szepesvari, C. Tuning bandit algorithms in stochastic environments. In Proceedings of the Algorithmic Learning Theory, pp. 150-165, 2007.
-
(2007)
Proceedings of the Algorithmic Learning Theory
, pp. 150-165
-
-
Audibert, J.Y.1
Munos, R.2
Szepesvari, C.3
-
5
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Auer, P., Cesa-Bianchi, N., and Fischer, P. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
8
-
-
0038589165
-
The anatomy of a large-scale hypertextual web search engine
-
Brin, S. and Page, L. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1-7):107-117, 1998.
-
(1998)
Computer Networks
, vol.30
, Issue.1-7
, pp. 107-117
-
-
Brin, S.1
Page, L.2
-
9
-
-
85126505753
-
Preference-based policy iteration: Leveraging preference learning for reinforcement learning
-
Cheng, W., Fürnkranz, J., Hüllermeier, E., and Park, S.H. Preference-based policy iteration: Leveraging preference learning for reinforcement learning. In Proceedings ECMLPKDD 2011, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, pp. 414-429, 2011.
-
(2011)
Proceedings ECMLPKDD 2011, European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
, pp. 414-429
-
-
Cheng, W.1
Fürnkranz, J.2
Hüllermeier, E.3
Park, S.H.4
-
10
-
-
84937398609
-
PAC bounds for multi-armed bandit and Markov decision processes
-
Even-Dar, E., Mannor, S., and Mansour, Y. PAC bounds for multi-armed bandit and Markov decision processes. In Proceedings of the 15th Annual Conference on Computational Learning Theory, pp. 255-270, 2002.
-
(2002)
Proceedings of the 15th Annual Conference on Computational Learning Theory
, pp. 255-270
-
-
Even-Dar, E.1
Mannor, S.2
Mansour, Y.3
-
11
-
-
38249043311
-
Sensitivity of the stationary distribution vector for an ergodic Markov chain
-
Funderlic, R.E. and Meyer, C.D. Sensitivity of the stationary distribution vector for an ergodic Markov chain. Linear Algebra and its Applications, 76:1-17, 1986.
-
(1986)
Linear Algebra and Its Applications
, vol.76
, pp. 1-17
-
-
Funderlic, R.E.1
Meyer, C.D.2
-
14
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
Hoeffding, W. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58:13-30, 1963.
-
(1963)
Journal of the American Statistical Association
, vol.58
, pp. 13-30
-
-
Hoeffding, W.1
-
15
-
-
84867131498
-
PAC subset selection in stochastic multi-armed bandits
-
Kalyanakrishnan, S., Tewari, A., Auer, P., and Stone, P. PAC subset selection in stochastic multi-armed bandits. In Proceedings of the Twenty-ninth International Conference on Machine Learning (ICML 2012), pp. 655-662, 2012.
-
(2012)
Proceedings of the Twenty-ninth International Conference on Machine Learning (ICML 2012)
, pp. 655-662
-
-
Kalyanakrishnan, S.1
Tewari, A.2
Auer, P.3
Stone, P.4
-
16
-
-
45749111346
-
Protein classification based on propagation on unrooted binary trees
-
Kocsor, A., Busa-Fekete, R., and Pongor, S. Protein classification based on propagation on unrooted binary trees. Protein and Peptide Letters, 15(5):428-434, 2008.
-
(2008)
Protein and Peptide Letters
, vol.15
, Issue.5
, pp. 428-434
-
-
Kocsor, A.1
Busa-Fekete, R.2
Pongor, S.3
-
18
-
-
0001923944
-
Hoeffding races: Accelerating model selection search for classification and function approximation
-
Maron, O. and Moore, A.W. Hoeffding races: Accelerating model selection search for classification and function approximation. In Advances in Neural Information Processing Systems, pp. 59-66, 1994.
-
(1994)
Advances in Neural Information Processing Systems
, pp. 59-66
-
-
Maron, O.1
Moore, A.W.2
-
19
-
-
0031069121
-
The racing algorithm: Model selection for lazy learners
-
Maron, O. and Moore, A.W. The racing algorithm: Model selection for lazy learners. Artificial Intelligence Review, 5(1):193-225, 1997.
-
(1997)
Artificial Intelligence Review
, vol.5
, Issue.1
, pp. 193-225
-
-
Maron, O.1
Moore, A.W.2
-
20
-
-
56449108844
-
Empirical Bernstein stopping
-
Mnih, V., Szepesvári, C., and Audibert, J.Y. Empirical Bernstein stopping. In Proceedings of the 25th International Conference on Machine Learning, pp. 672-679, 2008.
-
(2008)
Proceedings of the 25th International Conference on Machine Learning
, pp. 672-679
-
-
Mnih, V.1
Szepesvári, C.2
Audibert, J.Y.3
-
22
-
-
84877777099
-
Iterative ranking from pairwise comparisons
-
Negahban, S., Oh, S., and Shah, D. Iterative ranking from pairwise comparisons. In Advances in Neural Information Processing Systems, pp. 2483-2491, 2012.
-
(2012)
Advances in Neural Information Processing Systems
, pp. 2483-2491
-
-
Negahban, S.1
Oh, S.2
Shah, D.3
-
23
-
-
38249001667
-
Sensitivity of finite Markov chains under perturbation
-
Seneta, E. Sensitivity of finite Markov chains under perturbation. Statistics & Probability Letters, 17(2):163-168, 1992.
-
(1992)
Statistics & Probability Letters
, vol.17
, Issue.2
, pp. 163-168
-
-
Seneta, E.1
-
25
-
-
84861586270
-
The k-armed dueling bandits problem
-
Yue, Y., Broder, J., Kleinberg, R., and Joachims, T. The k-armed dueling bandits problem. Journal of Computer and System Sciences, 78(5):1538-1556, 2012.
-
(2012)
Journal of Computer and System Sciences
, vol.78
, Issue.5
, pp. 1538-1556
-
-
Yue, Y.1
Broder, J.2
Kleinberg, R.3
Joachims, T.4