-
1
-
-
34548752490
-
-
Antes, A., Szepesvári, C., Munos, R.: Value-iteration based fitted policy iteration: learning with a single trajectory. In: IEEE ADPRL, pp. 330-337 (2007)
-
Antes, A., Szepesvári, C., Munos, R.: Value-iteration based fitted policy iteration: learning with a single trajectory. In: IEEE ADPRL, pp. 330-337 (2007)
-
-
-
-
2
-
-
58449095085
-
-
Antos, A., Munos, R., Szepesvári, C.: Fitted Q-iteration in continuous action-space MDPs. In: Advances in Neural Information Processing Systems 20, NIPS 2007 (in print, 2008)
-
Antos, A., Munos, R., Szepesvári, C.: Fitted Q-iteration in continuous action-space MDPs. In: Advances in Neural Information Processing Systems 20, NIPS 2007 (in print, 2008)
-
-
-
-
3
-
-
40849145988
-
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
-
Antos, A., Szepesvári, C., Munos, R.: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path. Machine Learning 71, 89-129 (2008)
-
(2008)
Machine Learning
, vol.71
, pp. 89-129
-
-
Antos, A.1
Szepesvári, C.2
Munos, R.3
-
5
-
-
50849114939
-
Sparsity oracle inequalities for the lasso
-
Bunea, F., Tsybakov, A., Wegkamp, M.: Sparsity oracle inequalities for the lasso. Electronic Journal of Statistics 1, 169-194 (2007)
-
(2007)
Electronic Journal of Statistics
, vol.1
, pp. 169-194
-
-
Bunea, F.1
Tsybakov, A.2
Wegkamp, M.3
-
6
-
-
3543096272
-
The kernel recursive least squares algorithm
-
Engel, Y., Mannor, S., Meir, R.: The kernel recursive least squares algorithm. IEEE Transaction on Signal Processing 52(8), 2275-2285 (2004)
-
(2004)
IEEE Transaction on Signal Processing
, vol.52
, Issue.8
, pp. 2275-2285
-
-
Engel, Y.1
Mannor, S.2
Meir, R.3
-
7
-
-
31844451013
-
Reinforcement learning with Gaussian processes
-
ACM, New York
-
Engel, Y., Mannor, S., Meir, R.: Reinforcement learning with Gaussian processes. In: ICML 2005: Proceedings of the 22nd international conference on Machine learning, pp. 201-208. ACM, New York (2005)
-
(2005)
ICML 2005: Proceedings of the 22nd international conference on Machine learning
, pp. 201-208
-
-
Engel, Y.1
Mannor, S.2
Meir, R.3
-
8
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
Ernst, D., Geurts, P., Wehenkel, L.: Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6, 503-556 (2005)
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
9
-
-
70049096468
-
Regularized policy iteration
-
to appear
-
Farahmand, A.M., Ghavamzadeh, M., Szepesvari, C., Mannor, S.: Regularized policy iteration. In: Advances in Neural Information Processing Systems 21, NIPS 2008 (to appear, 2008)
-
(2008)
Advances in Neural Information Processing Systems 21, NIPS 2008
-
-
Farahmand, A.M.1
Ghavamzadeh, M.2
Szepesvari, C.3
Mannor, S.4
-
10
-
-
0003624357
-
-
Springer, New York
-
Györfi, L., Kohler, M., Krzyzak, A., Walk, H.: A distribution-free theory of non-parametric regression. Springer, New York (2002)
-
(2002)
A distribution-free theory of non-parametric regression
-
-
Györfi, L.1
Kohler, M.2
Krzyzak, A.3
Walk, H.4
-
11
-
-
84885993384
-
-
Jung, T., Polani, D.: Least squares SVM for least squares TD learning. In: ECAI, pp. 499-503 (2006)
-
Jung, T., Polani, D.: Least squares SVM for least squares TD learning. In: ECAI, pp. 499-503 (2006)
-
-
-
-
12
-
-
84880649215
-
A sparse sampling algorithm for near-optimal planning in large Markovian decision processes
-
Kearns, M., Mansour, Y., Ng, A.Y.: A sparse sampling algorithm for near-optimal planning in large Markovian decision processes. In: Proceedings of IJCAI 1999, pp. 1324-1331 (1999)
-
(1999)
Proceedings of IJCAI
, pp. 1324-1331
-
-
Kearns, M.1
Mansour, Y.2
Ng, A.Y.3
-
13
-
-
1942420814
-
Reinforcement learning as classification: Leveraging modern classifiers
-
Lagoudakis, M.G., Parr, R.: Reinforcement learning as classification: Leveraging modern classifiers. In: ICML 2003, pp. 424-431 (2003)
-
(2003)
ICML 2003
, pp. 424-431
-
-
Lagoudakis, M.G.1
Parr, R.2
-
15
-
-
17444414191
-
Basis function adaptation in temporal difference reinforcement learning
-
Mannor, S., Menache, I., Shimkin, N.: Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research 134, 215-238 (2005)
-
(2005)
Annals of Operations Research
, vol.134
, pp. 215-238
-
-
Mannor, S.1
Menache, I.2
Shimkin, N.3
-
18
-
-
0036832956
-
Kernel-based reinforcement learning
-
Ormoneit, D., Sen, S.: Kernel-based reinforcement learning. Machine Learning 49, 161-178 (2002)
-
(2002)
Machine Learning
, vol.49
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
19
-
-
34547982545
-
Analyzing feature generation for value-function approximation
-
Parr, R., Painter-Wakefield, C., Li, L., Littman, M.L.: Analyzing feature generation for value-function approximation. In: ICML, pp. 737-744 (2007)
-
(2007)
ICML
, pp. 737-744
-
-
Parr, R.1
Painter-Wakefield, C.2
Li, L.3
Littman, M.L.4
-
21
-
-
33746031418
-
-
Srebro, N., Ben-David, S.: Learning bounds for support vector machines with learned kernels. In: Lugosi, G., Simon, H.U. (eds.) COLT 2006. LNCS, 4005, pp. 169-183. Springer, Heidelberg (2006)
-
Srebro, N., Ben-David, S.: Learning bounds for support vector machines with learned kernels. In: Lugosi, G., Simon, H.U. (eds.) COLT 2006. LNCS, vol. 4005, pp. 169-183. Springer, Heidelberg (2006)
-
-
-
-
22
-
-
34547098844
-
Kernel-based least squares policy iteration for reinforcement learning
-
Xu, X., Hu, D., Lu, X.: Kernel-based least squares policy iteration for reinforcement learning. IEEE Trans, on Neural Networks 18, 973-992 (2007)
-
(2007)
IEEE Trans, on Neural Networks
, vol.18
, pp. 973-992
-
-
Xu, X.1
Hu, D.2
Lu, X.3
-
23
-
-
0038105204
-
Capacity of reproducing kernel spaces in learning theory
-
Zhou, D.-X.: Capacity of reproducing kernel spaces in learning theory. IEEE Transactions on Information Theory 49, 1743-1752 (2003)
-
(2003)
IEEE Transactions on Information Theory
, vol.49
, pp. 1743-1752
-
-
Zhou, D.-X.1
|