-
2
-
-
0003787146
-
-
New Jersey: Princeton University Press
-
Bellman RE (1957) Dynamic programming. Princeton University Press, New Jersey.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
4
-
-
0015680499
-
Some new algorithms for recursive estimation in constant linear systems
-
Kailath T (1973) Some new algorithms for recursive estimation in constant linear systems. IEEE Trans Inf Theory 19(6): 750-760.
-
(1973)
IEEE Trans Inf Theory
, vol.19
, Issue.6
, pp. 750-760
-
-
Kailath, T.1
-
5
-
-
0018681625
-
A Schur method for solving algebraic Riccati equations
-
Laub AJ (1979) A Schur method for solving algebraic Riccati equations. IEEE Trans Autom Control 24(6): 913-921.
-
(1979)
IEEE Trans Autom Control
, vol.24
, Issue.6
, pp. 913-921
-
-
Laub, A.J.1
-
7
-
-
0018441647
-
An approximation theory of optimal control for trainable manipulators
-
Saridis GN, Lee CS (1979) An approximation theory of optimal control for trainable manipulators. IEEE Trans Syst Man Cybern 9(3): 152-159.
-
(1979)
IEEE Trans Syst Man Cybern
, vol.9
, Issue.3
, pp. 152-159
-
-
Saridis, G.N.1
Lee, C.S.2
-
8
-
-
0031332446
-
Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
-
Beard R, Saridis G, Wen J (1997) Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation. Automatica 33(12): 2159-2177.
-
(1997)
Automatica
, vol.33
, Issue.12
, pp. 2159-2177
-
-
Beard, R.1
Saridis, G.2
Wen, J.3
-
9
-
-
14844340822
-
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
-
Abu-Khalaf M, Lewis FL (2005) Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 41(5): 779-791.
-
(2005)
Automatica
, vol.41
, Issue.5
, pp. 779-791
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
10
-
-
0036588686
-
Adaptive dynamic programming
-
Murray JJ, Cox CJ, Lendaris GG, Saeks R (2002) Adaptive dynamic programming. IEEE Trans Syst Man Cybern C Appl Rev 32(2): 140-153.
-
(2002)
IEEE Trans Syst Man Cybern C Appl Rev
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
11
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
D. A. White and D. A. Sofge (Eds.), New York: Van Nostrand Reinhold
-
Werbos PJ (1992) Approximate dynamic programming for real-time control and neural modeling. In: White DA, Sofge DA (eds) Handbook of intelligent control: neural, fuzzy, and adaptive approaches. Van Nostrand Reinhold, New York, pp 493-525.
-
(1992)
Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
, pp. 493-525
-
-
Werbos, P.J.1
-
12
-
-
66449130966
-
Adaptive dynamic programming: an introduction
-
Wang FY, Zhang H, Liu D (2009) Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag 4(2): 39-47.
-
(2009)
IEEE Comput Intell Mag
, vol.4
, Issue.2
, pp. 39-47
-
-
Wang, F.Y.1
Zhang, H.2
Liu, D.3
-
13
-
-
70349116541
-
Reinforcement learning and adaptive dynamic programming for feedback control
-
Lewis FL, Vrabie D (2009) Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits Syst Mag 9(3): 32-50.
-
(2009)
IEEE Circuits Syst Mag
, vol.9
, Issue.3
, pp. 32-50
-
-
Lewis, F.L.1
Vrabie, D.2
-
14
-
-
49049089962
-
Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof
-
Al-Tamimi A, Lewis FL, Abu-Khalaf M (2008) Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Trans Syst Man Cybern B Cybern 38(4): 943-949.
-
(2008)
IEEE Trans Syst Man Cybern B Cybern
, vol.38
, Issue.4
, pp. 943-949
-
-
Al-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
15
-
-
80054767702
-
Neural-network-based optimal control for a class of nonlinear discrete-time systems with control constraints using the iterative GDHP algorithm
-
San Jose, CA
-
Liu D, Wang D, Zhao D (2011) Neural-network-based optimal control for a class of nonlinear discrete-time systems with control constraints using the iterative GDHP algorithm. In: Proceedings of international joint conference on neural networks, San Jose, CA, pp 53-60.
-
(2011)
Proceedings of International Joint Conference on Neural Networks
, pp. 53-60
-
-
Liu, D.1
Wang, D.2
Zhao, D.3
-
17
-
-
67349145396
-
Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
-
Vrabie D, Lewis FL (2009) Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems. Neural Netw 22(3): 237-246.
-
(2009)
Neural Netw
, vol.22
, Issue.3
, pp. 237-246
-
-
Vrabie, D.1
Lewis, F.L.2
-
18
-
-
77950630017
-
Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
-
Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5): 878-888.
-
(2010)
Automatica
, vol.46
, Issue.5
, pp. 878-888
-
-
Vamvoudakis, K.G.1
Lewis, F.L.2
-
19
-
-
79953145872
-
A novel generalized value iteration scheme for uncertain continuous-time linear systems
-
Atlanta, GA
-
Lee JY, Park JB, Choi YH (2010) A novel generalized value iteration scheme for uncertain continuous-time linear systems. In: Proceedings of the 49th IEEE conference on decision and control, Atlanta, GA, pp 4637-4642.
-
(2010)
Proceedings of the 49th IEEE Conference on Decision and Control
, pp. 4637-4642
-
-
Lee, J.Y.1
Park, J.B.2
Choi, Y.H.3
-
22
-
-
0025627940
-
Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks
-
Hornik K, Stinchcombe M, White H (1990) Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural Netw 3(5): 551-560.
-
(1990)
Neural Netw
, vol.3
, Issue.5
, pp. 551-560
-
-
Hornik, K.1
Stinchcombe, M.2
White, H.3
-
24
-
-
79551685808
-
Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data
-
Lewis FL, Vamvoudakis KG (2011) Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data. IEEE Trans Syst Man Cybern B Cybern 41(1): 14-25.
-
(2011)
IEEE Trans Syst Man Cybern B Cybern
, vol.41
, Issue.1
, pp. 14-25
-
-
Lewis, F.L.1
Vamvoudakis, K.G.2