[3] D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, vol. 6, pp. 503-556, 2005.
[5] M. Riedmiller, "Neural fitted Q-iteration - first experiences with a data efficient neural reinforcement learning method," in Proc. of the 16th European Conf. on Machine Learning, 2005, pp. 317-328.
[6] T. Dietterich, "Ensemble methods in machine learning," Multiple Classifier Systems, pp. 1-15, 2000.
[7] M. Wiering and H. van Hasselt, "Ensemble algorithms in reinforcement learning," IEEE Transactions on Systems, Man, and Cybernetics, vol. 38, no. 4, 2008.
[8] L. Breiman, "Random forests," Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.
[10] G. J. Gordon, "Reinforcement learning with function approximation converges to a region," Advances in Neural Information Processing Systems, pp. 1040-1046, 2001.
[13] C. Gaskett, "Q-learning for robot control," Ph.D. dissertation, The Australian National University, 2002.
[14] L. Breiman, "Bagging predictors," Machine Learning, vol. 24, no. 2, pp. 123-140, 1996.
[15] Y. Freund, R. Schapire, and N. Abe, "A short introduction to boosting," Journal of the Japanese Society for Artificial Intelligence, vol. 14, pp. 771-780, 1999.
[16] H.-G. Zimmermann, R. Grothmann, A. M. Schaefer, and C. Tietz, "Modeling large dynamical systems with dynamical consistent neural networks," in New Directions in Statistical Signal Processing: From Systems to Brain, S. Haykin, J. Principe, T. Sejnowski, and J. McWhirter, Eds. MIT Press, 2006, pp. 203-242.
[17] A. M. Schaefer, S. Udluft, and H.-G. Zimmermann, "A recurrent control neural network for data efficient reinforcement learning," in Proc. of the IEEE Int. Symposium on Approximate Dynamic Programming and Reinforcement Learning, Honolulu, HI, 2007.
[18] H. van Hasselt, personal communication, 2010.
[19] R. Neuneier and H.-G. Zimmermann, "How to train neural networks," in Neural Networks: Tricks of the Trade, G. B. Orr and K.-R. Müller, Eds., 1996, pp. 373-423.