-
2
-
-
0031143730
-
An analysis of temporal-difference learning with function approximation
-
J. N. Tsitsiklisc and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Transactions on Automatic Control, vol. 42, pp. 674-690, 1997.
-
(1997)
IEEE Transactions on Automatic Control
, vol.42
, pp. 674-690
-
-
Tsitsiklisc, J.N.1
Van Roy, B.2
-
3
-
-
0001771345
-
Linear Least-Squares algorithms for temporal difference learning
-
S. J. Bradtke and A. G. Barto, "Linear Least-Squares algorithms for temporal difference learning," Machine Learning, vol. 22, no. 1-3, pp. 33-57, 1996.
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 33-57
-
-
Bradtke, S.J.1
Barto, A.G.2
-
4
-
-
79951481923
-
Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
-
H. Maei, C. Szepesvari, S. Bhatnagar, D. Precup, D. Silver, and R. Sutton, "Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation," in Advances in Neural Information Processing Systems 22, 2009, pp. 1204-1212.
-
(2009)
Advances in Neural Information Processing Systems
, vol.22
, pp. 1204-1212
-
-
Maei, H.1
Szepesvari, C.2
Bhatnagar, S.3
Precup, D.4
Silver, D.5
Sutton, R.6
-
5
-
-
33646435300
-
A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning
-
D. Choi and B. Van Roy, "A Generalized Kalman Filter for Fixed Point Approximation and Efficient Temporal-Difference Learning," Discrete Event Dynamic Systems, vol. 16, pp. 207-239, 2006.
-
(2006)
Discrete Event Dynamic Systems
, vol.16
, pp. 207-239
-
-
Choi, D.1
Van Roy, B.2
-
7
-
-
1942421151
-
Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning
-
Y. Engel, S. Mannor, and R. Meir, "Bayes Meets Bellman: The Gaussian Process Approach to Temporal Difference Learning," in Proceedings of the International Conference on Machine Learning (ICML 03), 2003, pp. 154-161.
-
Proceedings of the International Conference on Machine Learning (ICML 03), 2003
, pp. 154-161
-
-
Engel, Y.1
Mannor, S.2
Meir, R.3
-
8
-
-
67650458797
-
Kalman Temporal Differences: The deterministic case
-
M. Geist, O. Pietquin, and G. Fricout, "Kalman Temporal Differences: the deterministic case," in IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009), Nashville, TN, USA, April 2009.
-
IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2009), Nashville, TN, USA, April 2009
-
-
Geist, M.1
Pietquin, O.2
Fricout, G.3
-
10
-
-
79951485912
-
Eligibility Traces through Colored Noises
-
M. Geist and O. Pietquin, "Eligibility Traces through Colored Noises," in International Conference on Ultra Modern Control systems (ICUMT 2010 (Control Systems)), Moscow, Russia, October 2010.
-
International Conference on Ultra Modern Control Systems (ICUMT 2010 (Control Systems)), Moscow, Russia, October 2010
-
-
Geist, M.1
Pietquin, O.2
-
15
-
-
21244437999
-
Unscented filtering and nonlinear estimation
-
S. J. Julier and J. K. Uhlmann, "Unscented filtering and nonlinear estimation," Proceedings of the IEEE, vol. 92, no. 3, pp. 401-422, 2004.
-
(2004)
Proceedings of the IEEE
, vol.92
, Issue.3
, pp. 401-422
-
-
Julier, S.J.1
Uhlmann, J.K.2
-
16
-
-
0034326226
-
New developments in state estimation for nonlinear systems
-
P. Nørg̊ard, N. Poulsen, and O. Ravn, "New developments in state estimation for nonlinear systems," Automatica, vol. 36, no. 11, pp. 1627-1638, 2000.
-
(2000)
Automatica
, vol.36
, Issue.11
, pp. 1627-1638
-
-
Nørg̊ard, P.1
Poulsen, N.2
Ravn, O.3
-
17
-
-
78449267579
-
Statistically Linearized Recursive Least Squares
-
to appear
-
M. Geist and O. Pietquin, "Statistically Linearized Recursive Least Squares," in Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2010), Kittilä (Finland), August-September 2010, 5 pages, to appear.
-
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2010), Kittilä (Finland), August-September 2010
, pp. 5
-
-
Geist, M.1
Pietquin, O.2
-
18
-
-
84966204836
-
Methods for Modifying Matrix Factorization
-
April
-
P. E. Gill, G. H. Golub, W. Murray, and M. A. Saunders, "Methods for Modifying Matrix Factorization," Mathematics of Computation, vol. 28, no. 126, pp. 505-535, April 1974.
-
(1974)
Mathematics of Computation
, vol.28
, Issue.126
, pp. 505-535
-
-
Gill, P.E.1
Golub, G.H.2
Murray, W.3
Saunders, M.A.4
-
19
-
-
40849145988
-
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
-
A. Antos, C. Szepesvári, and R. Munos, "Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path," Machine Learning, vol. 71, no. 1, pp. 89-129, 2008.
-
(2008)
Machine Learning
, vol.71
, Issue.1
, pp. 89-129
-
-
Antos, A.1
Szepesvári, C.2
Munos, R.3
-
20
-
-
33646398129
-
Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method
-
M. Riedmiller, "Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method," in European Conference on Machine Learning, 2005, pp. 317-328.
-
European Conference on Machine Learning, 2005
, pp. 317-328
-
-
Riedmiller, M.1
|