-
2
-
-
0004049893
-
-
Ph.D. dissertation, Cambridge University, Cambridge, United Kingdom
-
C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Cambridge University, Cambridge, United Kingdom, 1989.
-
(1989)
Learning from delayed rewards
-
-
Watkins, C.J.C.H.1
-
3
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, Vol. 6, pp. 503-556, 2005. (Pubitemid 40958851)
-
(2005)
Journal of Machine Learning Research
, vol.6
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
4
-
-
0036832956
-
Kernel-based reinforcement learning
-
DOI 10.1023/A:1017928328829
-
D. Ormoneit and Ś. Sen, "Kernel-based reinforcement learning," Machine Learning, Vol. 49, no. 2-3, pp. 161-178, 2002. (Pubitemid 34325684)
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 161-178
-
-
Ormoneit, D.1
Sen, A.2
-
7
-
-
84957629024
-
Q-learning in continuous state and action spaces
-
Springer-Verlag
-
C. Gaskett, D. Wettergreen, and E. Zelinsky, "Q-learning in continuous state and action spaces," in Proceedings of the 12th Australian Joint Conference on Artificial Intelligence. Springer-Verlag, 1999, pp. 417-428.
-
(1999)
Proceedings of the 12th Australian Joint Conference on Artificial Intelligence
, pp. 417-428
-
-
Gaskett, C.1
Wettergreen, D.2
Zelinsky, E.3
-
8
-
-
0031645424
-
A neural field approach to topological reinforcement learning in continuous action spaces
-
H. M. Gross, V. Stephan, and M. Krabbes, "A neural field approach to topological reinforcement learning in continuous action spaces," in Proceedings of the International Joint Conference on Neural Networks, 1998, pp. 1992-1997.
-
(1998)
Proceedings of the International Joint Conference on Neural Networks
, pp. 1992-1997
-
-
Gross, H.M.1
Stephan, V.2
Krabbes, M.3
-
10
-
-
0031341345
-
Neural reinforcement learning for behaviour synthesis
-
PII S0921889097000420
-
C. Touzet, "Neural reinforcement learning for behaviour synthesis," Robotics and Autonomous Systems, Vol. 22, pp. 251-281, 1997. (Pubitemid 127398213)
-
(1997)
Robotics and Autonomous Systems
, vol.22
, Issue.3-4
, pp. 251-281
-
-
Touzet, C.F.1
-
11
-
-
85161968592
-
Reinforcement learning in continuous action spaces through sequential monte carlo methods
-
Cambridge, MA: MIT Press
-
A. Lazaric, M. Restelli, and A. Bonarini, "Reinforcement learning in continuous action spaces through sequential monte carlo methods," in Advances in Neural Information Processing Systems 20. Cambridge, MA: MIT Press, 2008, pp. 833-840.
-
(2008)
Advances in Neural Information Processing Systems
, vol.20
, pp. 833-840
-
-
Lazaric, A.1
Restelli, M.2
Bonarini, A.3
-
12
-
-
0347625319
-
A learning algorithm for the control of continuous action set-point regulator systems
-
A. O. Esogbue and W. E. Hearnes, "A learning algorithm for the control of continuous action set-point regulator systems," Journal of Computational Analysis and Applications, Vol. 1, no. 2, pp. 121-234, 1999.
-
(1999)
Journal of Computational Analysis and Applications
, vol.1
, Issue.2
, pp. 121-234
-
-
Esogbue, A.O.1
Hearnes, W.E.2
-
13
-
-
32844474095
-
Reinforcement learning with factored states and actions
-
B. Sallans and G. E. Hinton, "Reinforcement learning with factored states and actions," Journal of Machine Learning Research, Vol. 5, pp. 1063-1088, 2004.
-
(2004)
Journal of Machine Learning Research
, vol.5
, pp. 1063-1088
-
-
Sallans, B.1
Hinton, G.E.2
-
14
-
-
0031231885
-
Experiments with reinforcement learning in problems with continuous state and action spaces
-
J. C. Santamaría, R. S. Sutton, and A. Ram, "Experiments with reinforcement learning in problems with continuous state and action spaces," Adaptive Behavior, Vol. 6, pp. 163-218, 1998.
-
(1998)
Adaptive Behavior
, vol.6
, pp. 163-218
-
-
Santamaria, J.C.1
Sutton, R.S.2
Ram, A.3
-
15
-
-
67650370700
-
Application of a self-learning controller with continuous control signals based on the DOE-approach
-
M. Riedmiller, "Application of a self-learning controller with continuous control signals based on the DOE-approach," in Proceedings of the European Symposium on Neural Networks, 1997.
-
(1997)
Proceedings of the European Symposium on Neural Networks
-
-
Riedmiller, M.1
-
16
-
-
0030082891
-
An approach to fuzzy control of nonlinear systems: Stability and design issues
-
PII S106367069600639X
-
H. O. Wang, K. Tanaka, and M. F. Griffin, "An approach to fuzzy control of nonlinear systems: Stability and design issues," IEEE Transactions on Fuzzy Systems, Vol. 4, no. 1, pp. 14-23, 1996. (Pubitemid 126782417)
-
(1996)
IEEE Transactions on Fuzzy Systems
, vol.4
, Issue.1
, pp. 14-23
-
-
Wang, H.O.1
Tanaka, K.2
Griffin, M.F.3
|