-
1
-
-
14844340822
-
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
-
M. Abu-Khalaf, and F.L. Lewis Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach Automatica 41 2005 779 791
-
(2005)
Automatica
, vol.41
, pp. 779-791
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
2
-
-
84857501996
-
Experience replay for real-time reinforcement learning control
-
S. Adam, L. Busoniu, and R. Babuska Experience replay for real-time reinforcement learning control IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 42 2012 201 212
-
(2012)
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
, vol.42
, pp. 201-212
-
-
Adam, S.1
Busoniu, L.2
Babuska, R.3
-
3
-
-
0003785722
-
-
Ph.D. dissertation, Elec. Eng. Dep., Rensselaer Polytech. Ins., Troy, NY
-
Beard, R.W. (1995). Improving the closed-loop performance of nonlinear systems. Ph.D. dissertation, Elec. Eng. Dep., Rensselaer Polytech. Ins., Troy, NY.
-
(1995)
Improving the Closed-loop Performance of Nonlinear Systems
-
-
Beard, R.W.1
-
5
-
-
84871319455
-
A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
-
S. Bhasin, R. Kamalapurkar, M. Johnson, K.G. Vamvoudakis, F.L. Lewis, and W.E. Dixon A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems Automatica 49 2012 82 92
-
(2012)
Automatica
, vol.49
, pp. 82-92
-
-
Bhasin, S.1
Kamalapurkar, R.2
Johnson, M.3
Vamvoudakis, K.G.4
Lewis, F.L.5
Dixon, W.E.6
-
7
-
-
84883670357
-
Concurrent learning for convergence in adaptive control without
-
Atlanta GA
-
Chowdhary, G.V., & Johnson, E. (2010). Concurrent learning for convergence in adaptive control without. In IEEE CDC. Atlanta GA (pp. 3675-3679).
-
(2010)
IEEE CDC
, pp. 3675-3679
-
-
Chowdhary, G.V.1
Johnson, E.2
-
8
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
K. Doya Reinforcement learning in continuous time and space Neural Computation 12 2000 219 245
-
(2000)
Neural Computation
, vol.12
, pp. 219-245
-
-
Doya, K.1
-
9
-
-
56749173285
-
Efficient experience reuse in non-Markovian environments
-
Tokyo, Japan
-
Dung, L.T., Komeda, T., & Takagi, M. (2008). Efficient experience reuse in non-Markovian environments. In Proc. int. conf. instrum. control inf. technol. Tokyo, Japan (pp. 3327-3332).
-
(2008)
Proc. Int. Conf. Instrum. Control Inf. Technol
, pp. 3327-3332
-
-
Dung, L.T.1
Komeda, T.2
Takagi, M.3
-
19
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L.J. Lin Self-improving reactive agents based on reinforcement learning, planning and teaching Machine Learning 8 1992 293 321
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.J.1
-
20
-
-
84881324637
-
Optimal control of nonlinear continuous-time systems: Design of bounded controllers via generalized nonquadratic functionals
-
Lyshevski, S.E. (1998). Optimal control of nonlinear continuous-time systems: design of bounded controllers via generalized nonquadratic functionals. In Proceedings of American control conference (pp. 205-209).
-
(1998)
Proceedings of American Control Conference
, pp. 205-209
-
-
Lyshevski, S.E.1
-
22
-
-
0036588686
-
Adaptive dynamic programming
-
J.J. Murray, C.J. Cox, G.G. Lendaris, and R. Saeks Adaptive dynamic programming IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 32 2002 140 153
-
(2002)
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
, vol.32
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
26
-
-
77950630017
-
Online actor-critic algorithm to solve the continuous infinite-time horizon optimal control problem
-
K. Vamvoudakis, and F.L. Lewis Online actor-critic algorithm to solve the continuous infinite-time horizon optimal control problem Automatica 46 2010 878 888
-
(2010)
Automatica
, vol.46
, pp. 878-888
-
-
Vamvoudakis, K.1
Lewis, F.L.2
-
28
-
-
58349110975
-
Adaptive optimal control for continuous-time linear systems based on policy iteration
-
D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F.L. Lewis Adaptive optimal control for continuous-time linear systems based on policy iteration Automatica 45 2009 477 484
-
(2009)
Automatica
, vol.45
, pp. 477-484
-
-
Vrabie, D.1
Pastravanu, O.2
Abu-Khalaf, M.3
Lewis, F.L.4
-
29
-
-
67349145396
-
Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
-
D. Vrabie, and F.L. Lewis Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems Neural Networks 22 2009 237 246
-
(2009)
Neural Networks
, vol.22
, pp. 237-246
-
-
Vrabie, D.1
Lewis, F.L.2
-
30
-
-
71749106087
-
Real-time reinforcement learning by sequential actor-critics and experience replay
-
P. Wawrzynski Real-time reinforcement learning by sequential actor-critics and experience replay Neural Networks 22 2009 1484 1497
-
(2009)
Neural Networks
, vol.22
, pp. 1484-1497
-
-
Wawrzynski, P.1
-
31
-
-
0002031779
-
Approximate dynamic programming for real time control and neural modeling
-
D.A. White, D.A. Sofge, Multiscience Press
-
P.J. Werbos Approximate dynamic programming for real time control and neural modeling D.A. White, D.A. Sofge, Handbook of intelligent control 1992 Multiscience Press
-
(1992)
Handbook of Intelligent Control
-
-
Werbos, P.J.1
-
32
-
-
84862815087
-
Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
-
H. Xu, S. Jagannathan, and F.L. Lewis Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses Automatica 48 2012 1017 1030
-
(2012)
Automatica
, vol.48
, pp. 1017-1030
-
-
Xu, H.1
Jagannathan, S.2
Lewis, F.L.3