-
1
-
-
0024680419
-
Adaptive aggregation methods for infinite horizon dynamic programming
-
D. Bertsekas and D. Castañon. Adaptive aggregation methods for infinite horizon dynamic programming. IEEE Transactions on Automatic Control, 34:589-598, 1989.
-
(1989)
IEEE Transactions on Automatic Control
, vol.34
, pp. 589-598
-
-
Bertsekas, D.1
Castañon, D.2
-
4
-
-
85162034137
-
An investigation of basis construction from power series expansions of value functions
-
Amherst
-
B. Liu and S. Mahadevan. An investigation of basis construction from power series expansions of value functions. Technical report, University of Massachusetts, Amherst, 2010.
-
(2010)
Technical Report, University of Massachusetts
-
-
Liu, B.1
Mahadevan, S.2
-
6
-
-
70349322784
-
Learning representation and control in Markov Decision Processes: New frontiers
-
S. Mahadevan. Learning representation and control in Markov Decision Processes: New frontiers. Foundations and Trends in Machine Learning, 1(4):403-565, 2009.
-
(2009)
Foundations and Trends in Machine Learning
, vol.1
, Issue.4
, pp. 403-565
-
-
Mahadevan, S.1
-
7
-
-
35748957806
-
Proto-value functions: A Laplacian framework for learning representation and control in Markov Decision Processes
-
S. Mahadevan and M. Maggioni. Proto-value functions: A Laplacian framework for learning representation and control in Markov Decision Processes. Journal of Machine Learning Research, 8:2169-2231, 2007.
-
(2007)
Journal of Machine Learning Research
, vol.8
, pp. 2169-2231
-
-
Mahadevan, S.1
Maggioni, M.2
-
8
-
-
56449092660
-
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning
-
R. Parr, L. Li, G. Taylor, C. Painter-Wakefield, and M. Littman. An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning. In Proceedings of the International Conference on Machine Learning (ICML), 2008.
-
(2008)
Proceedings of the International Conference on Machine Learning (ICML)
-
-
Parr, R.1
Li, L.2
Taylor, G.3
Painter-Wakefield, C.4
Littman, M.5
-
9
-
-
34547982545
-
Analyzing feature generation for value function approximation
-
R. Parr, C. Painter-Wakefield, L. Li, and M. Littman. Analyzing feature generation for value function approximation. In Proceedings of the International Conference on Machine Learning (ICML), pages 737-744, 2007.
-
(2007)
Proceedings of the International Conference on Machine Learning (ICML)
, pp. 737-744
-
-
Parr, R.1
Painter-Wakefield, C.2
Li, L.3
Littman, M.4
-
13
-
-
85152626183
-
A reinforcement learning method for maximizing undiscounted rewards
-
Morgan Kaufmann, San Francisco, CA
-
A. Schwartz. A reinforcement learning method for maximizing undiscounted rewards. In Proc. 10th International Conf. on Machine Learning. Morgan Kaufmann, San Francisco, CA, 1993.
-
(1993)
Proc. 10th International Conf. on Machine Learning
-
-
Schwartz, A.1
-
14
-
-
0346922977
-
Numerical methods for computing stationary distributions of finite irreducible Markov chains
-
Kluwer Academic Publishers
-
W. J. Stewart. Numerical methods for computing stationary distributions of finite irreducible Markov chains. In Advances in Computational Probability. Kluwer Academic Publishers, 1997.
-
(1997)
Advances in Computational Probability
-
-
Stewart, W.J.1
-
15
-
-
0012841228
-
Successive matrix squaring algorithm for computing the Drazin inverse
-
Y. Wei. Successive matrix squaring algorithm for computing the Drazin inverse. Applied Mathematics and Computation, 108:67-75, 2000.
-
(2000)
Applied Mathematics and Computation
, vol.108
, pp. 67-75
-
-
Wei, Y.1