-
5
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
Doya K,. Reinforcement learning in continuous time and space. Neural Computation. 2000 12 1: 219-245.
-
(2000)
Neural Computation
, vol.12
, Issue.1
, pp. 219-245
-
-
Doya, K.1
-
8
-
-
84921399937
-
-
John Wiley: Hoboken, NJ
-
Si J, Barto A, Powel W, Wunch D,. 2004. Handbook of Learning and Approximate Dynamic Programming, John Wiley: Hoboken, NJ.
-
(2004)
Handbook of Learning and Approximate Dynamic Programming
-
-
Si, J.1
Barto, A.2
Powel, W.3
Wunch, D.4
-
10
-
-
77950629367
-
Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework
-
Thessaloniki, Greece, June
-
Vrabie D, Vamvoudakis K, Lewis F,. Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework, Proceedings of the IEEE Mediterranean Conference on Control and Automation. Thessaloniki, Greece, June 2009; 1402-1409.
-
(2009)
Proceedings of the IEEE Mediterranean Conference on Control and Automation
, pp. 1402-1409
-
-
Vrabie, D.1
Vamvoudakis, K.2
Lewis, F.3
-
13
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
White D.A. Sofge D.A. (eds), Van Nostrand Reinhold, New York
-
Werbos PJ,. 1992. Approximate dynamic programming for real-time control and neural modeling, In Handbook of Intelligent Control, White DA, Sofge DA, (eds), Van Nostrand Reinhold, New York.
-
(1992)
Handbook of Intelligent Control
-
-
Werbos, P.J.1
-
14
-
-
77953770221
-
-
Ph.D. Thesis, Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX
-
Vrabie D,. 2009. Online adaptive optimal control for continuous time systems, Ph.D. Thesis, Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX.
-
(2009)
Online Adaptive Optimal Control for Continuous Time Systems
-
-
Vrabie, D.1
-
15
-
-
77950630017
-
Online actor-critic algorithm to solve the continuous-time inifinite horizon optimal control problem
-
Vamvoudakis KG, Lewis FL,. Online actor-critic algorithm to solve the continuous-time inifinite horizon optimal control problem. Automatica. 2010 46 5: 878-888.
-
(2010)
Automatica
, vol.46
, Issue.5
, pp. 878-888
-
-
Vamvoudakis, K.G.1
Lewis, F.L.2
-
20
-
-
48949116222
-
Neurodynamic programming and zero-sum games for constrained control systems
-
Abu-Khalaf M, Lewis FL,. Neurodynamic programming and zero-sum games for constrained control systems. IEEE Transactions on Neural Networks. 2008 19 7: 1243-1252.
-
(2008)
IEEE Transactions on Neural Networks
, vol.19
, Issue.7
, pp. 1243-1252
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
22
-
-
84914965022
-
On an iterative technique for Riccati equation computations
-
Kleinman D,. On an iterative technique for Riccati equation computations. IEEE Transactions on Automatic Control. 1968 13 1: 114-115.
-
(1968)
IEEE Transactions on Automatic Control
, vol.13
, Issue.1
, pp. 114-115
-
-
Kleinman, D.1
-
24
-
-
14844340822
-
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
-
DOI 10.1016/j.automatica.2004.11.034, PII S0005109805000105
-
Abu-Khalaf M, Lewis FL,. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica. 2005 41 5: 779-791. (Pubitemid 40352391)
-
(2005)
Automatica
, vol.41
, Issue.5
, pp. 779-791
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
26
-
-
0025627940
-
Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks
-
Hornik K, Stinchcombe M, White H,. Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural Networks. 1990 3 5: 551-560.
-
(1990)
Neural Networks
, vol.3
, Issue.5
, pp. 551-560
-
-
Hornik, K.1
Stinchcombe, M.2
White, H.3
-
31
-
-
62949149213
-
Constrained nonlinear optimal control: A converse HJB approach
-
Pasadena, CA
-
Nevistic V, Primbs JA,. 1996. Constrained nonlinear optimal control: a converse HJB approach, Technical Report Technical Report 96-021, California Institute of Technology, Pasadena, CA.
-
(1996)
Technical Report Technical Report 96-021, California Institute of Technology
-
-
Nevistic, V.1
Primbs, J.A.2
-
33
-
-
0004178386
-
-
Prentice-Hall: Upper Saddle River, NJ
-
Khalil HK,. 1996. Nonlinear Systems, Prentice-Hall: Upper Saddle River, NJ.
-
(1996)
Nonlinear Systems
-
-
Khalil, H.K.1
-
34
-
-
79953151751
-
A model free robust policy iteration algorithm for optimal control of nonlinear systems
-
Atlanta, GA, 15-17 December
-
Bhasin S, Johnson M, Dixon WE,. A model free robust policy iteration algorithm for optimal control of nonlinear systems, Proceedings of the 49th IEEE Conference on Decision and Control, Atlanta, GA, 15-17 December 2010; 3060-3065.
-
(2010)
Proceedings of the 49th IEEE Conference on Decision and Control
, pp. 3060-3065
-
-
Bhasin, S.1
Johnson, M.2
Dixon, W.E.3
|