Bertsekas, D. and Ioffe, S. Temporal differences-based policy iteration and applications in neuro-dynamic programming. Technical report, MIT, 1996.
Boyan, J. A. Technical update: Least-squares temporal difference learning. Machine Learning, 49:233-246, 2002.
Bradtke, S. J. and Barto, A. G. Linear least-squares algorithms for temporal difference learning. Machine Learning, 22:33-57, 1996.
Lagoudakis, M. G., Parr, R., and Littman, M. L. Least-squares methods in reinforcement learning for control. In SETN'02: Proceedings of the Second Hellenic Conference on AI, pp. 249-260. Springer-Verlag, 2002.
Nedić, A. and Bertsekas, D. P. Least squares policy evaluation algorithms with linear function approximation. Discrete Event Dynamic Systems, 13(1-2):79-110, 2003.
Schoknecht, R. Optimality of reinforcement learning algorithms with linear function approximation. In NIPS, pp. 1555-1562, 2002.
Sutton, R. S., Maei, H. R., Precup, D., Bhatnagar, S., Silver, D., Szepesvári, C., and Wiewiora, E. Fast gradient-descent methods for temporal-difference learning with linear function approximation. In ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 993-1000, 2009.
Yu, H. and Bertsekas, D. P. Convergence results for some temporal difference methods based on least squares. IEEE Trans. Automatic Control, 54:1515-1531, 2009.