-
1
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
D. Ernst, P. Geurts, L. Wehenkel, and L. Littman, "Tree-based batch mode reinforcement learning", Journal of Machine Learning Research, vol. 6, pp. 503-556, 2005.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
Littman, L.4
-
2
-
-
33646398129
-
Neural Fitted Q Iteration - First experiences with a data efficient neural reinforcement learning method
-
Porto, Portugal, October
-
M. Riedmiller, "Neural Fitted Q Iteration - first experiences with a data efficient neural reinforcement learning method", in Lecture Notes in Computer Science: Proc. of the European Conference on Machine Learning, ECML 2005, Porto, Portugal, October 2005, pp. 317-328.
-
(2005)
Lecture Notes in Computer Science: Proc. of the European Conference on Machine Learning, ECML 2005
, pp. 317-328
-
-
Riedmiller, M.1
-
3
-
-
58449110583
-
Regularized fitted Q-iteration: Application to planning
-
ser. Lecture Notes in Computer Science, S. Girgin, M. Loth, R. Munos, P. Preux, and D. Ryabko, Eds. Springer Berlin Heidelberg
-
A. Farahmand, M. Ghavamzadeh, C. Szepesvári, and S. Mannor, "Regularized fitted Q-iteration: Application to planning", in Recent Advances in Reinforcement Learning, ser. Lecture Notes in Computer Science, S. Girgin, M. Loth, R. Munos, P. Preux, and D. Ryabko, Eds. Springer Berlin Heidelberg, 2008, vol. 5323, pp. 55-68.
-
(2008)
Recent Advances in Reinforcement Learning
, vol.5323
, pp. 55-68
-
-
Farahmand, A.1
Ghavamzadeh, M.2
Szepesvári, C.3
Mannor, S.4
-
4
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
Morgan Kaufmann
-
R. S. Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming", in Proceedings of the Seventh International Conference on Machine Learning (ICML 1990). Morgan Kaufmann, 1990, pp. 216-224.
-
(1990)
Proceedings of the Seventh International Conference on Machine Learning (ICML 1990)
, pp. 216-224
-
-
Sutton, R.S.1
-
7
-
-
0001307907
-
RPROP: A fast and robust backpropagation learning strategy
-
M. Jabri, Ed., Melbourne
-
M. Riedmiller and H. Braun, "RPROP: A fast and robust backpropagation learning strategy", in Fourth Australian Conference on Neural Networks, M. Jabri, Ed., Melbourne, 1993, pp. 169-172.
-
(1993)
Fourth Australian Conference on Neural Networks
, pp. 169-172
-
-
Riedmiller, M.1
Braun, H.2
-
8
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
A. G. Barto, R. S. Sutton, and C. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems", IEEE Transactions on Systems, Man, and Cybernetics, vol. SMC-13, no. 5, 1983.
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.SMC-13
, Issue.5
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.3
-
9
-
-
84872531075
-
10 steps and some tricks to set up neural reinforcement controllers
-
2nd ed.
-
M. Riedmiller, "10 steps and some tricks to set up neural reinforcement controllers." in Neural Networks: Tricks of the Trade (2nd ed.), 2012, pp. 735-757.
-
(2012)
Neural Networks: Tricks of the Trade
, pp. 735-757
-
-
Riedmiller, M.1
-
10
-
-
34547223380
-
Decentralized reinforcement learning control of a robotic manipulator
-
Singapore
-
L. Busoniu, B. De Schutter, and R. Babuska, "Decentralized reinforcement learning control of a robotic manipulator", in Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision (ICARCV 2006), Singapore, 2006, pp. 1347-1352.
-
(2006)
Proceedings of the 9th International Conference on Control, Automation, Robotics and Vision (ICARCV 2006)
, pp. 1347-1352
-
-
Busoniu, L.1
De Schutter, B.2
Babuska, R.3
|