-
1
-
-
0031074521
-
Locally weighted learning
-
Atkeson, C. G., Moore, A. W., & Schaal, S. (1997). Locally weighted learning. Artificial Intelligence Review, 11, 11-73.
-
(1997)
Artificial Intelligence Review
, vol.11
, pp. 11-73
-
-
Atkeson, C.G.1
Moore, A.W.2
Schaal, S.3
-
2
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
Cambridge, MA: The MIT Press
-
Boyan, J. A., & Moore, A. W. (1995). Generalization in reinforcement learning: Safely approximating the value function. Advances in Neural Information Processing Systems 7 (pp. 369-376). Cambridge, MA: The MIT Press.
-
(1995)
Advances in Neural Information Processing Systems 7
, pp. 369-376
-
-
Boyan, J.A.1
Moore, A.W.2
-
3
-
-
0001771345
-
Linear least-squares algorithms for temporal difference learning
-
Bradtke, S., & Barto, A. (1996). Linear least-squares algorithms for temporal difference learning. Machine Learning, 2, 33-58.
-
(1996)
Machine Learning
, vol.2
, pp. 33-58
-
-
Bradtke, S.1
Barto, A.2
-
4
-
-
84983110889
-
A decision-theoretic generalization of on-line learning and an application to boosting
-
Proc. of the Second European Conference on Computational Learning Theory
-
Freund, Y., & Schapire, R. (1995). A decision-theoretic generalization of on-line learning and an application to boosting. Proc. of the Second European Conference on Computational Learning Theory. LNCS.
-
(1995)
LNCS
-
-
Freund, Y.1
Schapire, R.2
-
10
-
-
17444414191
-
Basis function adaptation in temporal difference reinforcement learning
-
Menache, I., Mannor, S., & Shimkin, N. (2005). Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134.
-
(2005)
Annals of Operations Research
, pp. 134
-
-
Menache, I.1
Mannor, S.2
Shimkin, N.3
-
12
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
14
-
-
84887252594
-
Support vector method for function approximation, regression estimation, and signal processing
-
Cambridge, MA: MIT Press
-
Vapnik, V., Golowich, S., & Smola, A. (1997). Support vector method for function approximation, regression estimation, and signal processing. Advances in Neural Information Processing Systems 9 (pp. 281-287). Cambridge, MA: MIT Press.
-
(1997)
Advances in Neural Information Processing Systems 9
, pp. 281-287
-
-
Vapnik, V.1
Golowich, S.2
Smola, A.3
-
15
-
-
34547991475
-
Convergence results for some temporal difference methods based on least squares
-
LIDS-2697, Laboratory for Information and Decision Systems, Massachusetts Institute of Technology
-
Yu, H., & Bertsekas, D. (2006). Convergence results for some temporal difference methods based on least squares (Technical Report LIDS-2697). Laboratory for Information and Decision Systems, Massachusetts Institute of Technology.
-
(2006)
Technical Report
-
-
Yu, H.1
Bertsekas, D.2
|