-
2
-
-
1942421151
-
Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning
-
Y. Engel, S. Mannor, and R. Meir, "Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning," in Proceedings of the International Conference on Machine Learning (ICML 03), 2003, pp. 154-161.
-
Proceedings of the International Conference on Machine Learning (ICML 03), 2003
, pp. 154-161
-
-
Engel, Y.1
Mannor, S.2
Meir, R.3
-
3
-
-
67650458797
-
Kalman Temporal Differences: The deterministic case
-
M. Geist, O. Pietquin, and G. Fricout, "Kalman Temporal Differences: the deterministic case," in IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009), Nashville, TN, USA, April 2009.
-
IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009), Nashville, TN, USA, April 2009
-
-
Geist, M.1
Pietquin, O.2
Fricout, G.3
-
6
-
-
85024429815
-
A new approach to linear filtering and prediction problems
-
R. E. Kalman, "A new approach to linear filtering and prediction problems," Transactions of the ASME-Journal of Basic Engineering, vol. 82, no. Series D, pp. 35-45, 1960.
-
(1960)
Transactions of the ASME-Journal of Basic Engineering
, vol.82
, Issue.SERIES D
, pp. 35-45
-
-
Kalman, R.E.1
-
7
-
-
21244437999
-
Unscented filtering and nonlinear estimation
-
S. J. Julier and J. K. Uhlmann, "Unscented filtering and nonlinear estimation," Proceedings of the IEEE, vol. 92, no. 3, pp. 401-422, 2004.
-
(2004)
Proceedings of the IEEE
, vol.92
, Issue.3
, pp. 401-422
-
-
Julier, S.J.1
Uhlmann, J.K.2
-
9
-
-
40849145988
-
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
-
April
-
A. Antos, C. Szepesvári, and R. Munos, "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path," Machine Learning, vol. 71, no. 1, pp. 89-129, April 2008.
-
(2008)
Machine Learning
, vol.71
, Issue.1
, pp. 89-129
-
-
Antos, A.1
Szepesvári, C.2
Munos, R.3
-
11
-
-
0038595396
-
Least-squares temporal difference learning
-
Morgan Kaufmann, San Francisco, CA
-
J. A. Boyan, "Least-squares temporal difference learning," in Proceedings of the 16th International Conference on Machine Learning (ICML 99). Morgan Kaufmann, San Francisco, CA, 1999, pp. 49-56.
-
(1999)
Proceedings of the 16th International Conference on Machine Learning (ICML 99)
, pp. 49-56
-
-
Boyan, J.A.1
-
13
-
-
8344287766
-
-
Ph.D. dissertation, OGI School of Science & Engineering, Oregon Health & Science University, Portland, OR, USA, April
-
R. van der Merwe, "Sigma-point kalman filters for probabilistic inference in dynamic state-space models," Ph.D. dissertation, OGI School of Science & Engineering, Oregon Health & Science University, Portland, OR, USA, April 2004.
-
(2004)
Sigma-point Kalman Filters for Probabilistic Inference in Dynamic State-space Models
-
-
Van Der Merwe, R.1
-
14
-
-
84966204836
-
Methods for Modifying Matrix Factorization
-
April
-
P. E. Gill, G. H. Golub, W. Murray, and M. A. Saunders, "Methods for Modifying Matrix Factorization," Mathematics of Computation, vol. 28, no. 126, pp. 505-535, April 1974.
-
(1974)
Mathematics of Computation
, vol.28
, Issue.126
, pp. 505-535
-
-
Gill, P.E.1
Golub, G.H.2
Murray, W.3
Saunders, M.A.4
-
16
-
-
79951474303
-
Managing Uncertainty within Value Function Approximation in Reinforcement Learning
-
M. Geist and O. Pietquin, "Managing Uncertainty within Value Function Approximation in Reinforcement Learning," in Active Learning and Experimental Design workshop (collocated with AISTATS 2010), Sardinia, Italy, May 2010.
-
Active Learning and Experimental Design Workshop (Collocated with AISTATS 2010), Sardinia, Italy, May 2010
-
-
Geist, M.1
Pietquin, O.2
-
17
-
-
0031143730
-
An analysis of temporal-difference learning with function approximation
-
J. N. Tsitsiklis and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Transactions on Automatic Control, vol. 42, pp. 674-690, 1997.
-
(1997)
IEEE Transactions on Automatic Control
, vol.42
, pp. 674-690
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
|