-
4
-
-
0031213212
-
Optimal design of adaptive tracking controllers for non-linear systems
-
PII S0005109897000721
-
Z.-H. Li and M. Krstic, "Optimal design of adaptive tracking controllers for nonlinear systems," Automatica, vol. 33, no. 8, pp. 1459-1473, 1997. (Pubitemid 127392279)
-
(1997)
Automatica
, vol.33
, Issue.8
, pp. 1459-1473
-
-
Li, Z.-H.1
Krstic, M.2
-
7
-
-
0002011091
-
A menu of designs for reinforcement learning over time
-
W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press
-
P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1991, pp. 67-95.
-
(1991)
Neural Networks for Control
, pp. 67-95
-
-
Werbos, P.J.1
-
8
-
-
49049110053
-
Special issue on adaptive dynamic programming and reinforcement learning for feedback control
-
Aug.
-
F. L. Lewis, G. Lendaris, and D. Liu, "Special issue on adaptive dynamic programming and reinforcement learning for feedback control," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 896-897, Aug. 2008.
-
(2008)
IEEE Trans. Syst., Man, Cybern. B
, vol.38
, Issue.4
, pp. 896-897
-
-
Lewis, F.L.1
Lendaris, G.2
Liu, D.3
-
10
-
-
77956759998
-
Reinforcement learning control and pattern recognition systems
-
J. M. Mendel and K. S. Fu, Eds. New York: Academic
-
J. M. Mendel and R. W. MacLaren, "Reinforcement learning control and pattern recognition systems," in Adaptive, Learning, and Pattern Recognition Systems: Theory and Applications, J. M. Mendel and K. S. Fu, Eds. New York: Academic, 1970, pp. 287-318.
-
(1970)
Adaptive, Learning, and Pattern Recognition Systems: Theory and Applications
, pp. 287-318
-
-
Mendel, J.M.1
MacLaren, R.W.2
-
11
-
-
77955814101
-
-
Boca Raton, FL: CRC Press
-
L. Busoniu, R. Babuska, B. De Schutter, and D. Ernst, Reinforcement Learning and Dynamic Programming Using Function Approximators. Boca Raton, FL: CRC Press, 2009.
-
(2009)
Reinforcement Learning and Dynamic Programming Using Function Approximators
-
-
Busoniu, L.1
Babuska, R.2
De Schutter, B.3
Ernst, D.4
-
12
-
-
1842684992
-
Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioural ecology
-
DOI 10.1016/j.conb.2004.03.017, PII S0959438804000492
-
W. Schultz, "Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioral ecology," Current Opinion Neurobiol., vol. 14, no. 2, pp. 139-147, 2004. (Pubitemid 38479929)
-
(2004)
Current Opinion in Neurobiology
, vol.14
, Issue.2
, pp. 139-147
-
-
Schultz, W.1
-
13
-
-
0035422340
-
Neural mechanisms for learning and control
-
Aug.
-
K. Doya, H. Kimura, and M. Kawato, "Neural mechanisms for learning and control," IEEE Control Syst. Mag., vol. 21, no. 4, pp. 42-54, Aug. 2000.
-
(2000)
IEEE Control Syst. Mag.
, vol.21
, Issue.4
, pp. 42-54
-
-
Doya, K.1
Kimura, H.2
Kawato, M.3
-
14
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold
-
P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, 1992.
-
(1992)
Handbook of Intelligent Control
-
-
Werbos, P.J.1
-
15
-
-
67349145396
-
Neural network approach to continuoustime direct adaptive optimal control for partially-unknown nonlinear systems
-
Apr.
-
D. Vrabie and F. L. Lewis, "Neural network approach to continuoustime direct adaptive optimal control for partially-unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, Apr. 2009.
-
(2009)
Neural Netw.
, vol.22
, Issue.3
, pp. 237-246
-
-
Vrabie, D.1
Lewis, F.L.2
-
16
-
-
0020970738
-
Neuron-like adaptive elements that can solve difficult learning control problems
-
Sep./Oct.
-
A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuron-like adaptive elements that can solve difficult learning control problems," IEEE Trans. Syst., Man, Cybern., vol. SMC-13, no. 5, pp. 834-846, Sep./Oct. 1983.
-
(1983)
IEEE Trans. Syst., Man, Cybern.
, vol.SMC-13
, Issue.5
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
18
-
-
0003787146
-
-
Princeton, NJ: Princeton Univ. Press
-
R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
19
-
-
0024888479
-
Neural networks for control and system identification
-
P. J. Werbos, "Neural networks for control and system identification," in Proc. IEEE Conf. Decision Control, Tampa, FL, 1989, pp. 260-265.
-
Proc. IEEE Conf. Decision Control, Tampa, FL, 1989
, pp. 260-265
-
-
Werbos, P.J.1
-
20
-
-
0031236002
-
Adaptive critic designs
-
Sep.
-
D. Prokhorov and D. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
-
(1997)
IEEE Trans. Neural Netw.
, vol.8
, Issue.5
, pp. 997-1007
-
-
Prokhorov, D.1
Wunsch, D.2
-
21
-
-
84921399937
-
-
Piscataway, NJ: IEEE Press
-
J. Si, A. Barto, W. Powell, and D. Wunsch, Handbook of Learning and Approximate Dynamic Programming. Piscataway, NJ: IEEE Press, 2004.
-
(2004)
Handbook of Learning and Approximate Dynamic Programming
-
-
Si, J.1
Barto, A.2
Powell, W.3
Wunsch, D.4
-
22
-
-
49049111594
-
Issues on stability of ADP feedback controllers for dynamical systems
-
Aug.
-
S. N. Balakrishnan, J. Ding, and F. L. Lewis, "Issues on stability of ADP feedback controllers for dynamical systems," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 913-917, Aug. 2008.
-
(2008)
IEEE Trans. Syst., Man, Cybern. B
, vol.38
, Issue.4
, pp. 913-917
-
-
Balakrishnan, S.N.1
Ding, J.2
Lewis, F.L.3
-
23
-
-
66449130966
-
Adaptive dynamic programming: An introduction
-
May
-
F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput, Intell, Mag., vol. 4, no. 2, pp. 39-47, May 2009.
-
(2009)
IEEE Comput, Intell, Mag.
, vol.4
, Issue.2
, pp. 39-47
-
-
Wang, F.Y.1
Zhang, H.2
Liu, D.3
-
24
-
-
70349116541
-
Reinforcement learning and adaptive dynamic programming for feedback control
-
F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, 2009.
-
(2009)
IEEE Circuits Syst. Mag.
, vol.9
, Issue.3
, pp. 32-50
-
-
Lewis, F.L.1
Vrabie, D.2
-
25
-
-
0036641793
-
State-constrained agile missile control with adaptive-critic-based neural networks
-
DOI 10.1109/TCST.2002.1014669, PII S1063653602053605
-
D. Han and S. N. Balakrishnan, "State-constrained agile missile control with adaptive-critic-based neural networks," IEEE Trans. Control Syst. Technol., vol. 10, no. 4, pp. 481-489, Jul. 2002. (Pubitemid 34798672)
-
(2002)
IEEE Transactions on Control Systems Technology
, vol.10
, Issue.4
, pp. 481-489
-
-
Han, D.1
Balakrishnan, S.N.2
-
28
-
-
0029592634
-
Adaptive critic designs: A case study for neurocontrol
-
DOI 10.1016/0893-6080(95)00042-9
-
D. Prokhorov, R. A. Santiago, and D. C. Wunsch, II, "Adaptive critic designs: A case study for neurocontrol," Neural Netw., vol. 8, no. 9, pp. 1367-1372, 1995. (Pubitemid 26072896)
-
(1995)
Neural Networks
, vol.8
, Issue.9
, pp. 1367-1372
-
-
Prokhorov, D.V.1
Santiago, R.A.2
Wunsch II, D.C.3
-
29
-
-
0036588686
-
Adaptive dynamic programming
-
J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man Cybern. C, vol. 32, no. 2, pp. 140-153, 2002.
-
(2002)
IEEE Trans. Syst., Man Cybern. C
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
30
-
-
0042767744
-
Helicopter flight control reconfiguration for main rotor actuator failures
-
R. Enns and J. Si, "Helicopter flight control reconfiguration for main rotor actuator failures," AIAA J. Guidance, Control, Dynamics, vol. 26, no. 4, pp. 572-584, 2003.
-
(2003)
AIAA J. Guidance, Control, Dynamics
, vol.26
, Issue.4
, pp. 572-584
-
-
Enns, R.1
Si, J.2
-
31
-
-
49049106959
-
Direct heuristic dynamic programming method for power system stability enhancement
-
C. Lu, J. Si, and X. Xie, "Direct heuristic dynamic programming method for power system stability enhancement," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 1008-1013, 2008.
-
(2008)
IEEE Trans. Syst., Man, Cybern. B
, vol.38
, Issue.4
, pp. 1008-1013
-
-
Lu, C.1
Si, J.2
Xie, X.3
-
32
-
-
0033685661
-
Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle
-
G. G. Lendaris, L. Schultz, and T. Shannon, "Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle," in Proc. Int. Conf. Neural Networks, 2000, pp. 73-78.
-
Proc. Int. Conf. Neural Networks, 2000
, pp. 73-78
-
-
Lendaris, G.G.1
Schultz, L.2
Shannon, T.3
-
34
-
-
49049089962
-
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
-
Aug.
-
A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 943-949, Aug. 2008.
-
(2008)
IEEE Trans. Syst., Man, Cybern. B
, vol.38
, Issue.4
, pp. 943-949
-
-
Al-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
37
-
-
58349110975
-
Adaptive optimal control for continuous-time linear systems based on policy iteration
-
D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
-
(2009)
Automatica
, vol.45
, Issue.2
, pp. 477-484
-
-
Vrabie, D.1
Pastravanu, O.2
Abu-Khalaf, M.3
Lewis, F.L.4
-
39
-
-
0022738693
-
Decentralized learning in finite Markov chains
-
June
-
R. M. Wheeler and K. S. Narendra, "Decentralized learning in finite Markov chains," IEEE Trans. Autom. Control, vol. 31, no. 6, pp. 519-526, June 1986.
-
(1986)
IEEE Trans. Autom. Control
, vol.31
, Issue.6
, pp. 519-526
-
-
Wheeler, R.M.1
Narendra, K.S.2
-
41
-
-
67650505616
-
Algorithm and stability of ATC receding horizon control
-
H. Zhang, J. Huang, and F. L. Lewis, "Algorithm and stability of ATC receding horizon control," in Proc. IEEE Symp. Adaptive Dynamic Programming Reinforcement, Nashville, TN, Mar. 2009, pp. 28-35.
-
Proc. IEEE Symp. Adaptive Dynamic Programming Reinforcement, Nashville, TN, Mar. 2009
, pp. 28-35
-
-
Zhang, H.1
Huang, J.2
Lewis, F.L.3
-
42
-
-
0004049893
-
-
Ph.D. dissertation, Cambridge University, Cambridge, U.K.
-
C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Cambridge University, Cambridge, U.K., 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.1
-
43
-
-
34249833101
-
Q-learning
-
C. J. C. H. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3-4, pp. 279-292, 1992.
-
(1992)
Mach. Learn.
, vol.8
, Issue.3-4
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
-
44
-
-
0015109409
-
An iterative technique for the computation of the steady state gains for the discrete optimal regulator
-
Aug.
-
G. A. Hewer, "An iterative technique for the computation of the steady state gains for the discrete optimal regulator," IEEE Trans Autom. Control, vol. 16, no. 4, pp. 382-384, Aug. 1971.
-
(1971)
IEEE Trans Autom. Control
, vol.16
, Issue.4
, pp. 382-384
-
-
Hewer, G.A.1
-
47
-
-
0026883666
-
L2-gain analysis of nonlinear systems and nonlinear state feedback H? Control
-
A. J. Van, "L2-gain analysis of nonlinear systems and nonlinear state feedback H? control," IEEE Trans. Autom. Control, vol. 37, no. 6, pp. 770-784, 1992.
-
(1992)
IEEE Trans. Autom. Control
, vol.37
, Issue.6
, pp. 770-784
-
-
Van, A.J.1
-
49
-
-
0028584964
-
Adaptive linear quadratic control using policy iteration
-
S. Bradtke, B. Ydstie, and A. Barto, "Adaptive linear quadratic control using policy iteration," in Proc. Amer. Control Conf., Baltimore, MD, 1994, pp. 3475-3479.
-
Proc. Amer. Control Conf., Baltimore, MD, 1994
, pp. 3475-3479
-
-
Bradtke, S.1
Ydstie, B.2
Barto, A.3
-
50
-
-
79551685808
-
Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
-
Feb.
-
F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern. B, vol. 41, no. 1, pp. 14-25, Feb. 2011.
-
(2011)
IEEE Trans. Syst., Man, Cybern. B
, vol.41
, Issue.1
, pp. 14-25
-
-
Lewis, F.L.1
Vamvoudakis, K.G.2
-
51
-
-
33845759425
-
Policy iterations on the Hamilton-Jacobi-Isaacs equation for H state feedback control with input saturation
-
DOI 10.1109/TAC.2006.884959
-
M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Policy iterations on the Hamilton-Jacobi-Isaacs equation for state feedback control with input saturation," IEEE Trans. Autom. Control, vol. 51, no. 12, pp. 1989-1995, Dec. 2006. (Pubitemid 46002295)
-
(2006)
IEEE Transactions on Automatic Control
, vol.51
, Issue.12
, pp. 1989-1995
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
Huang, J.3
-
52
-
-
0028733775
-
Reinforcement learning in continuous time: Advantage updating
-
L. C. Baird, "Reinforcement learning in continuous time: Advantage updating," in Proc. Int. Conf. Neural Networks, Orlando, FL, June1994, pp. 2448-2453.
-
Proc. Int. Conf. Neural Networks, Orlando, FL, June1994
, pp. 2448-2453
-
-
Baird, L.C.1
-
53
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
-
(2000)
Neural Comput.
, vol.12
, Issue.1
, pp. 219-245
-
-
Doya, K.1
-
54
-
-
34249047468
-
Continuous-time adaptive critics
-
May
-
T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics," IEEE Trans. Neural Netw., vol. 18, no. 3, pp. 631-647, May 2007.
-
(2007)
IEEE Trans. Neural Netw.
, vol.18
, Issue.3
, pp. 631-647
-
-
Hanselmann, T.1
Noakes, L.2
Zaknich, A.3
-
55
-
-
84914965022
-
On an iterative technique for Riccati equation computations
-
Feb.
-
D. L. Kleinman, "On an iterative technique for Riccati equation computations," IEEE Trans. Autom. Control, vol. AC-13, no. 1, pp. 114-115, Feb. 1968.
-
(1968)
IEEE Trans. Autom. Control
, vol.AC-13
, Issue.1
, pp. 114-115
-
-
Kleinman, D.L.1
-
56
-
-
62949149213
-
-
Dept. Control Dynamical Systems, California Institute of Technology, Pasadena, CA, Tech. Rep. 96-021
-
V. Nevistic and J. Primbs, "Constrained nonlinear optimal control: A converse HJB approach," Dept. Control Dynamical Systems, California Institute of Technology, Pasadena, CA, Tech. Rep. 96-021, 1996.
-
(1996)
Constrained Nonlinear Optimal Control: A Converse HJB Approach
-
-
Nevistic, V.1
Primbs, J.2
-
57
-
-
77950630017
-
Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
-
K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
-
(2010)
Automatica
, vol.46
, Issue.5
, pp. 878-888
-
-
Vamvoudakis, K.G.1
Lewis, F.L.2
-
58
-
-
79960443754
-
Adaptive dynamic programming for online solution of a zero-sum differential game
-
D. Vrabie and F. L. Lewis, "Adaptive dynamic programming for online solution of a zero-sum differential game," J. Control Theory: Its Appl., vol. 9, no. 3, pp. 353-360, 2011.
-
(2011)
J. Control Theory: Its Appl.
, vol.9
, Issue.3
, pp. 353-360
-
-
Vrabie, D.1
Lewis, F.L.2
-
59
-
-
79960897012
-
Multi-player non-zero sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
-
K. G. Vamvoudakis and F. Lewis, "Multi-player non-zero sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 556-569, 2011.
-
(2011)
Automatica
, vol.47
, Issue.8
, pp. 556-569
-
-
Vamvoudakis, K.G.1
Lewis, F.2
-
60
-
-
77955423822
-
Model-free H-infinity control design for unknown linear discrete-time systems via Q-learning with LMI
-
Aug.
-
J. H. Kim and F. L. Lewis, "Model-free H-infinity control design for unknown linear discrete-time systems via Q-learning with LMI," Automatica, vol. 46, no. 8, pp. 1320-1326, Aug. 2010.
-
(2010)
Automatica
, vol.46
, Issue.8
, pp. 1320-1326
-
-
Kim, J.H.1
Lewis, F.L.2
|