-
1
-
-
48949116222
-
Neurodynamic programming and zero-sum games for constrained control systems
-
Jul
-
M. Abu-Khalaf and F. L. Lewis, "Neurodynamic programming and zero-sum games for constrained control systems," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1243-1252, Jul. 2008.
-
(2008)
IEEE Trans. Neural Netw
, vol.19
, Issue.7
, pp. 1243-1252
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
2
-
-
33846781129
-
Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
-
DOI 10.1016/j.automatica.2006.09.019, PII S0005109806004249
-
A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control," Automatica, vol. 43, no. 3, pp. 473-481, Mar. 2007. (Pubitemid 46209050)
-
(2007)
Automatica
, vol.43
, Issue.3
, pp. 473-481
-
-
Al-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
4
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Oct
-
A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems," IEEE Trans. Syst., Man, Cybern., vol. 13, no. 5, pp. 835-846, Oct. 1983.
-
(1983)
IEEE Trans. Syst., Man, Cybern
, vol.13
, Issue.5
, pp. 835-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
5
-
-
0031332446
-
Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
-
Dec
-
R. Beard, G. Saridis, and J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation," Automatica, vol. 33, no. 12, pp. 2159-2177, Dec. 1997.
-
(1997)
Automatica
, vol.33
, Issue.12
, pp. 2159-2177
-
-
Beard, R.1
Saridis, G.2
Wen, J.3
-
6
-
-
85012688561
-
-
Princeton, NJ, USA: Princeton Univ. Press
-
R. E. Bellman, Dynamic Programming. Princeton, NJ, USA: Princeton Univ. Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
8
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
Jan
-
K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, Jan. 2000.
-
(2000)
Neural Comput
, vol.12
, Issue.1
, pp. 219-245
-
-
Doya, K.1
-
10
-
-
0003690086
-
-
New York, NY, USA: Springer-Verlag
-
A. Isidori, Nonlinear Control Systems, vol. 2. New York, NY, USA: Springer-Verlag, 1999.
-
(1999)
Nonlinear Control Systems
, vol.2
-
-
Isidori, A.1
-
11
-
-
84865467087
-
Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
-
Oct
-
Y. Jiang and Z. P. Jiang, "Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics," Automatica, vol. 48, no. 10, pp. 2699-2704, Oct. 2012.
-
(2012)
Automatica
, vol.48
, Issue.10
, pp. 2699-2704
-
-
Jiang, Y.1
Jiang, Z.P.2
-
12
-
-
84877914583
-
Robust adaptive dynamic programming with an application to power systems
-
Jul
-
Y. Jiang and Z. P. Jiang, "Robust adaptive dynamic programming with an application to power systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 7, pp. 1150-1156, Jul. 2013.
-
(2013)
IEEE Trans. Neural Netw. Learn. Syst
, vol.24
, Issue.7
, pp. 1150-1156
-
-
Jiang, Y.1
Jiang, Z.P.2
-
13
-
-
84860701570
-
Robust approximate dynamic programming and global stabilization with nonlinear dynamic uncertainties
-
Y. Jiang and Z. P. Jiang, "Robust approximate dynamic programming and global stabilization with nonlinear dynamic uncertainties," in Proc. 50th IEEE CDC-ECC, Orlando, FL, USA, Dec. 2011, pp. 115-120.
-
(2011)
Proc. 50th IEEE CDC-ECC, Orlando, FL, USA
, pp. 115-120
-
-
Jiang, Y.1
Jiang, Z.P.2
-
14
-
-
84873534424
-
Robust adaptive dynamic programming
-
F. L. Lewis and D. Liu, Eds. New York, NY, USA: Wiley
-
Y. Jiang and Z. P. Jiang, "Robust adaptive dynamic programming," in Reinforcement Learning and Approximate Dynamic Programming for Feedback Control, F. L. Lewis and D. Liu, Eds. New York, NY, USA: Wiley, 2012.
-
(2012)
Reinforcement Learning and Approximate Dynamic Programming for Feedback Control
-
-
Jiang, Y.1
Jiang, Z.P.2
-
15
-
-
84884901270
-
Robust adaptive dynamic programming for linear and nonlinear systems: An overview
-
Sep
-
Z. P. Jiang and Y. Jiang, "Robust adaptive dynamic programming for linear and nonlinear systems: An overview," Eur. J. Control, vol. 19, no. 5, pp. 417-425, Sep. 2013.
-
(2013)
Eur. J. Control
, vol.19
, Issue.5
, pp. 417-425
-
-
Jiang, Z.P.1
Jiang, Y.2
-
16
-
-
0030218302
-
A Lyapunov formulation of the nonlinear small-gain theorem for interconnected ISS systems
-
DOI 10.1016/0005-1098(96)00051-9, PII S0005109896000519
-
Z. P. Jiang, I. Mareels, and Y. Wang, "A Lyapunov formulation of the nonlinear small-gain theorem for interconnected ISS systems," Automatica, vol. 32, no. 8, pp. 1211-1215, Aug. 1996. (Pubitemid 126363671)
-
(1996)
Automatica
, vol.32
, Issue.8
, pp. 1211-1215
-
-
Jiang, Z.-P.1
Mareels, I.M.Y.2
Wang, Y.3
-
17
-
-
0031102352
-
A small-gain control method for nonlinear cascaded systems with dynamic uncertainties
-
PII S0018928697020321
-
Z. P. Jiang and I. M. Y. Mareels, "A small-gain control method for nonlinear cascaded systems with dynamic uncertainties," IEEE Trans. Autom. Control, vol. 42, no. 3, pp. 292-308, Mar. 1997. (Pubitemid 127760593)
-
(1997)
IEEE Transactions on Automatic Control
, vol.42
, Issue.3
, pp. 292-308
-
-
Jiang, Z.-P.1
Mareels, I.M.Y.2
-
18
-
-
33846166511
-
Small-gain theorem for ISS systems and applications
-
Z. P. Jiang, A. R. Teel, and L. Praly, "Small-gain theorem for ISS systems and applications," Math. Control, Signals, Syst., vol. 7, no. 2, pp. 95-120, 1994.
-
(1994)
Math. Control, Signals, Syst
, vol.7
, Issue.2
, pp. 95-120
-
-
Jiang, Z.P.1
Teel, A.R.2
Praly, L.3
-
20
-
-
0004178386
-
-
3rd ed. Englewood Cliffs, NJ, USA: Prentice-Hall
-
H. K. Khalil, Nonlinear Systems, 3rd ed. Englewood Cliffs, NJ, USA: Prentice-Hall, 2002.
-
(2002)
Nonlinear Systems
-
-
Khalil, H.K.1
-
21
-
-
0003427482
-
-
New York, NY, USA: Wiley
-
M. Krstic, I. Kanellakopoulos, and P. V. Kokotovic, Nonlinear and Adaptive Control Design. New York, NY, USA: Wiley, 1995.
-
(1995)
Nonlinear and Adaptive Control Design
-
-
Krstic, M.1
Kanellakopoulos, I.2
Kokotovic, P.V.3
-
22
-
-
0003393654
-
-
New York, NY, USA: McGraw-Hill
-
P. Kundur, N. J. Balu, and M. G. Lauby, Power System Stability and Control. New York, NY, USA: McGraw-Hill, 1994.
-
(1994)
Power System Stability and Control
-
-
Kundur, P.1
Balu, N.J.2
Lauby, M.G.3
-
23
-
-
0004163205
-
-
3rd ed. New York, NY, USA: Wiley
-
F. L. Lewis, D. Vrabie, and V. L. Syrmos, Optimal Control, 3rd ed. New York, NY, USA: Wiley, 2012.
-
(2012)
Optimal Control
-
-
Lewis, F.L.1
Vrabie, D.2
Syrmos, V.L.3
-
24
-
-
70349116541
-
Reinforcement learning and adaptive dynamic programming for feedback control
-
Apr./Jun
-
F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, Apr./Jun. 2009.
-
(2009)
IEEE Circuits Syst. Mag
, vol.9
, Issue.3
, pp. 32-50
-
-
Lewis, F.L.1
Vrabie, D.2
-
25
-
-
79551685808
-
Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
-
Feb
-
F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 41, no. 1, pp. 14-23, Feb. 2011.
-
(2011)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.41
, Issue.1
, pp. 14-23
-
-
Lewis, F.L.1
Vamvoudakis, K.G.2
-
27
-
-
0003427480
-
-
New York, NY, USA: Wiley
-
R. Marino and P. Tomei, Nonlinear Control Design: Geometric, Adaptive, Robust. New York, NY, USA: Wiley, 1995.
-
(1995)
Nonlinear Control Design: Geometric, Adaptive, Robust
-
-
Marino, R.1
Tomei, P.2
-
28
-
-
77956759998
-
Reinforcement learning control and pattern recognition systems
-
J. M. Mendel and K. S. Fu, Eds. New York, NY, USA: Academic
-
J. M. Mendel and R. W. McLaren, "Reinforcement learning control and pattern recognition systems," in Adaptive, Learning and Pattern Recognition Systems: Theory and Applications, J. M. Mendel and K. S. Fu, Eds. New York, NY, USA: Academic, 1970, pp. 287-318.
-
(1970)
Adaptive, Learning and Pattern Recognition Systems: Theory and Applications
, pp. 287-318
-
-
Mendel, J.M.1
McLaren, R.W.2
-
29
-
-
84937350040
-
Steps toward artificial intelligence
-
Jan
-
M. Minsky, "Steps toward artificial intelligence," Proc. IRE, vol. 49, no. 1, pp. 8-30, Jan. 1961.
-
(1961)
Proc. IRE
, vol.49
, Issue.1
, pp. 8-30
-
-
Minsky, M.1
-
30
-
-
0022440306
-
A theory of post-stall transients in axial compression systems: Part I - Development of equations
-
F. K. Moore and E. M. Greitzer, "A theory of post-stall transients in axial compression systems: Part I - Development of equations," J. Eng. Gas Turbines Power, vol. 108, no. 1, pp. 68-76, Jan. 1986. (Pubitemid 16475797)
-
(1986)
Journal of Engineering for Gas Turbines and Power
, vol.108
, Issue.1
, pp. 68-76
-
-
Moore, F.K.1
Greitzer, E.M.2
-
31
-
-
0036588686
-
Adaptive dynamic programming
-
DOI 10.1109/TSMCC.2002.801727
-
J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002. (Pubitemid 35289398)
-
(2002)
IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
33
-
-
0029726983
-
Stabilization in spite of matched unmodeled dynamics and an equivalent definition of input-to-state stability
-
L. Praly and Y. Wang, "Stabilization in spite of matched unmodeled dynamics and an equivalent definition of input-to-state stability," Math. Control, Signals, Syst., vol. 9, no. 1, pp. 1-33, 1996. (Pubitemid 126597701)
-
(1996)
Mathematics of Control, Signals, and Systems
, vol.9
, Issue.1
, pp. 1-33
-
-
Praly, L.1
Wang, Y.2
-
34
-
-
0018441647
-
An approximation theory of optimal control for trainable manipulators
-
Mar
-
G. N. Saridis and C.-S. G. Lee, "An approximation theory of optimal control for trainable manipulators," IEEE Trans. Syst., Man, Cybern., vol. 9, no. 3, pp. 152-159, Mar. 1979.
-
(1979)
IEEE Trans. Syst., Man, Cybern
, vol.9
, Issue.3
, pp. 152-159
-
-
Saridis, G.N.1
Lee, C.-S.G.2
-
35
-
-
0024647058
-
Smooth stabilization implies coprime factorization
-
Apr
-
E. D. Sontag, "Smooth stabilization implies coprime factorization," IEEE Trans. Autom. Control, vol. 34, no. 4, pp. 435-443, Apr. 1989.
-
(1989)
IEEE Trans. Autom. Control
, vol.34
, Issue.4
, pp. 435-443
-
-
Sontag, E.D.1
-
36
-
-
0025419843
-
Further facts about input to state stabilization
-
Apr
-
E. D. Sontag, "Further facts about input to state stabilization," IEEE Trans. Autom. Control, vol. 35, no. 4, pp. 473-476, Apr. 1990.
-
(1990)
IEEE Trans. Autom. Control
, vol.35
, Issue.4
, pp. 473-476
-
-
Sontag, E.D.1
-
37
-
-
0029288045
-
On characterizations of the input-to-state stability property
-
Apr
-
E. D. Sontag and Y. Wang, "On characterizations of the input-to-state stability property," Syst. Control Lett., vol. 24, no. 5, pp. 351-359, Apr. 1995.
-
(1995)
Syst. Control Lett
, vol.24
, Issue.5
, pp. 351-359
-
-
Sontag, E.D.1
Wang, Y.2
-
39
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Aug
-
R. S. Sutton, "Learning to predict by the methods of temporal differences," Mach. Learn., vol. 3, no. 1, pp. 9-44, Aug. 1988.
-
(1988)
Mach. Learn
, vol.3
, Issue.1
, pp. 9-44
-
-
Sutton, R.S.1
-
40
-
-
0029377703
-
Tools for semiglobal stabilization by partial state and output feedback
-
Sep
-
A. Teel and L. Praly, "Tools for semiglobal stabilization by partial state and output feedback," SIAM J. Control Optim., vol. 33, no. 5, pp. 1443-1488, Sep. 1995.
-
(1995)
SIAM J. Control Optim
, vol.33
, Issue.5
, pp. 1443-1488
-
-
Teel, A.1
Praly, L.2
-
41
-
-
0029219894
-
Partial-state global stabilization for general triangular systems
-
Jan
-
J. Tsinias, "Partial-state global stabilization for general triangular systems," Syst. Control Lett., vol. 24, no. 2, pp. 139-145, Jan. 1995.
-
(1995)
Syst. Control Lett
, vol.24
, Issue.2
, pp. 139-145
-
-
Tsinias, J.1
-
42
-
-
79960897012
-
Multi-player non zero sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
-
Aug
-
K. G. Vamvoudakis and F. L. Lewis, "Multi-player non zero sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, Aug. 2011.
-
(2011)
Automatica
, vol.47
, Issue.8
, pp. 1556-1569
-
-
Vamvoudakis, K.G.1
Lewis, F.L.2
-
43
-
-
67349145396
-
Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
-
Apr
-
D. Vrabie and F. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, Apr. 2009.
-
(2009)
Neural Netw
, vol.22
, Issue.3
, pp. 237-246
-
-
Vrabie, D.1
Lewis, F.2
-
44
-
-
58349110975
-
Adaptive optimal control for continuous-time linear systems based on policy iteration
-
Feb
-
D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, Feb. 2009.
-
(2009)
Automatica
, vol.45
, Issue.2
, pp. 477-484
-
-
Vrabie, D.1
Pastravanu, O.2
Abu-Khalaf, M.3
Lewis, F.L.4
-
45
-
-
0000562031
-
A heuristic approach to reinforcement learning control systems
-
Oct
-
M. D. Waltz and K. S. Fu, "A heuristic approach to reinforcement learning control systems," IEEE Trans. Autom. Control, vol. 10, no. 4, pp. 390-398, Oct. 1965.
-
(1965)
IEEE Trans. Autom. Control
, vol.10
, Issue.4
, pp. 390-398
-
-
Waltz, M.D.1
Fu, K.S.2
-
46
-
-
66449130966
-
Adaptive dynamic programming: An introduction
-
May
-
F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intell. Mag., vol. 4, no. 2, pp. 39-47, May 2009.
-
(2009)
IEEE Comput. Intell. Mag
, vol.4
, Issue.2
, pp. 39-47
-
-
Wang, F.Y.1
Zhang, H.2
Liu, D.3
-
47
-
-
0004049893
-
-
Ph.D. thesis, King's College, Cambridge Univ., Cambridge, U.K., May
-
C. Watkins, "Learning from delayed rewards," Ph.D. thesis, King's College, Cambridge Univ., Cambridge, U.K., May 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.1
-
49
-
-
0003529238
-
-
Ph.D. thesis, Committee Appl. Math., Harvard Univ., Cambridge, MA, USA
-
P. J. Werbos, "Beyond regression: New tools for prediction and analysis in the behavioral sciences," Ph.D. thesis, Committee Appl. Math., Harvard Univ., Cambridge, MA, USA, 1974.
-
(1974)
Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences
-
-
Werbos, P.J.1
-
50
-
-
0024888479
-
Neural networks for control and system identification
-
Dec
-
P. J. Werbos, "Neural networks for control and system identification," in Proc. 28th IEEE Conf. Decision Control, vol. 1, Dec. 1989, pp. 260-265.
-
(1989)
Proc. 28th IEEE Conf. Decision Control
, vol.1
, pp. 260-265
-
-
Werbos, P.J.1
-
51
-
-
0002011091
-
A menu of designs for reinforcement learning over time
-
W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press
-
P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press, 1991, pp. 67-95.
-
(1991)
Neural Networks for Control
, pp. 67-95
-
-
Werbos, P.J.1
-
52
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
D. A. White and D. A. Sofge, Eds. New York, NY, USA: Van Nostrand
-
P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. New York, NY, USA: Van Nostrand, 1992.
-
(1992)
Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
-
-
Werbos, P.J.1
-
53
-
-
67349247013
-
Intelligence in the brain: A theory of how it works and how to build it
-
Apr
-
P. J. Werbos, "Intelligence in the brain: A theory of how it works and how to build it," Neural Netw., vol. 22, no. 3, pp. 200-212, Apr. 2009.
-
(2009)
Neural Netw
, vol.22
, Issue.3
, pp. 200-212
-
-
Werbos, P.J.1
-
54
-
-
78650805234
-
An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
-
Jan
-
H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, Jan. 2011.
-
(2011)
Automatica
, vol.47
, Issue.1
, pp. 207-214
-
-
Zhang, H.1
Wei, Q.2
Liu, D.3
-
55
-
-
0000922214
-
Stable neural controller design for unknown nonlinear systems using backstepping
-
Nov
-
Y. Zhang, P. Y. Peng, and Z. P. Jiang, "Stable neural controller design for unknown nonlinear systems using backstepping," IEEE Trans. Neural Netw., vol. 11, no. 6, pp. 1347-1360, Nov. 2000.
-
(2000)
IEEE Trans. Neural Netw
, vol.11
, Issue.6
, pp. 1347-1360
-
-
Zhang, Y.1
Peng, P.Y.2
Jiang, Z.P.3