-
1
-
-
0004102479
-
Reinforcement Learning: An Introduction
-
MIT Press
-
R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction, MIT Press, 1998.
-
(1998)
-
-
Sutton, R.S.1
Barto, A.G.2
-
2
-
-
0003529238
-
Beyond regression: new tools for prediction and analysis in the behavioural sciences
-
Ph.D. Thesis, Harvard University
-
P.J. Werbos. Beyond regression: new tools for prediction and analysis in the behavioural sciences, Ph.D. Thesis, Harvard University, 1972.
-
(1972)
-
-
Werbos, P.J.1
-
3
-
-
0024888479
-
Neural networks for control and system identification
-
Proceedings of IEEE Conference on Decision and Control
-
P.J. Werbos. Neural networks for control and system identification, Proceedings of IEEE Conference on Decision and Control, 260-265, 1989.
-
(1989)
, pp. 260-265
-
-
Werbos, P.J.1
-
4
-
-
0002011091
-
A menu of designs for reinforcement learning over time
-
ed. W. T. Miller, R. S. Sutton, P. J. Werbos, Cambridge: MIT Press
-
P.J. Werbos. A menu of designs for reinforcement learning over time, Neural Networks for Control, ed. W. T. Miller, R. S. Sutton, P. J. Werbos, Cambridge: MIT Press, 1991, pp. 67-95.
-
(1991)
Neural Networks for Control
, pp. 67-95
-
-
Werbos, P.J.1
-
5
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold
-
P. J. Werbos. Approximate dynamic programming for real-time control and neural modeling, Handbook of Intelligent Control, ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold, 1992.
-
(1992)
Handbook of Intelligent Control
-
-
Werbos, P.J.1
-
6
-
-
0004049893
-
Learning from delayed rewards
-
Ph.D. thesis, King's College of Cambridge, UK
-
C. Watkins. Learning from delayed rewards, Ph.D. thesis, King's College of Cambridge, UK, 1989.
-
(1989)
-
-
Watkins, C.1
-
7
-
-
33846781129
-
Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
-
A. Al-Tamimi, F.L. Lewis, and M. Abu-Khalaf. Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control, Automatica, 43(3):473-481, 2007.
-
(2007)
Automatica
, vol.43
, Issue.3
, pp. 473-481
-
-
Al-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
8
-
-
0028584964
-
Adaptive linear quadratic control using policy iteration
-
Proceedings of American Control Conference
-
S.J. Bradtke, B.E. Ydstie, and A.G. Barto. Adaptive linear quadratic control using policy iteration, Proceedings of American Control Conference, 3:3475-3479, 1994.
-
(1994)
, vol.3
, pp. 3475-3479
-
-
Bradtke, S.J.1
Ydstie, B.E.2
Barto, A.G.3
-
10
-
-
83655167263
-
Approximate dynamic programming for optimal stationary control with control-dependent noise
-
Y. Jiang and Z. P. Jiang. Approximate dynamic programming for optimal stationary control with control-dependent noise, IEEE Transactions on Neural Networks, 22(12):2392-2398, 2011.
-
(2011)
IEEE Transactions on Neural Networks
, vol.22
, Issue.12
, pp. 2392-2398
-
-
Jiang, Y.1
Jiang, Z.P.2
-
11
-
-
79551685808
-
Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data
-
SMCB-41
-
F. L. Lewis and K. G. Vamvoudakis. Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data, IEEE Transactions on Systems, Man, and Cybernetics, Part B, SMCB-41(1):14-23, 2011.
-
(2011)
IEEE Transactions on Systems, Man, and Cybernetics, Part B
, vol.SMCB-41
, Issue.1
, pp. 14-23
-
-
Lewis, F.L.1
Vamvoudakis, K.G.2
-
12
-
-
58349110975
-
Adaptive optimal control for continuous-time linear systems based on policy iteration
-
D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis. Adaptive optimal control for continuous-time linear systems based on policy iteration, Automatica, 45(2):477-484, 2009.
-
(2009)
Automatica
, vol.45
, Issue.2
, pp. 477-484
-
-
Vrabie, D.1
Pastravanu, O.2
Abu-Khalaf, M.3
Lewis, F.L.4
-
13
-
-
78650805234
-
An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
-
H. Zhang, Q. Wei, and D. Liu. An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games, Automatica, 47(1):207-214, 2011.
-
(2011)
Automatica
, vol.47
, Issue.1
, pp. 207-214
-
-
Zhang, H.1
Wei, Q.2
Liu, D.3
-
14
-
-
0003427482
-
Nonlinear and Adaptive Control Design
-
John Wiley & Sons
-
M. Krstic, I. Kanellakopoulos, and P. V. Kokotovic. Nonlinear and Adaptive Control Design, John Wiley & Sons, 1995.
-
(1995)
-
-
Krstic, M.1
Kanellakopoulos, I.2
Kokotovic, P.V.3
-
15
-
-
0025419843
-
Further facts about input to state stabilization
-
E. D. Sontag. Further facts about input to state stabilization, IEEE Transactions on Automatic Control, AC-35(4):473-476, 1990.
-
(1990)
IEEE Transactions on Automatic Control
, vol.AC-35
, Issue.4
, pp. 473-476
-
-
Sontag, E.D.1
-
16
-
-
0029288045
-
On characterizations of the input-to-state stability property
-
E. D. Sontag and Y. Wang. On characterizations of the input-to-state stability property, Systems & Control Letters, 24:351-359, 1995.
-
(1995)
Systems & Control Letters
, vol.24
, pp. 351-359
-
-
Sontag, E.D.1
Wang, Y.2
-
17
-
-
33846166511
-
Small-gain theorem for ISS systems and applications
-
Z. P. Jiang, A. R. Teel, and L. Praly. Small-gain theorem for ISS systems and applications, Mathematics of Control, Signals, and Systems, 7(2):95-120, 1994.
-
(1994)
Mathematics of Control, Signals, and Systems
, vol.7
, Issue.2
, pp. 95-120
-
-
Jiang, Z.P.1
Teel, A.R.2
Praly, L.3
-
18
-
-
0030218302
-
A Lyapunov formulation of the nonlinear small gain theorem for interconnected ISS systems
-
Z. P. Jiang, I. Mareels and Y. Wang. A Lyapunov formulation of the nonlinear small gain theorem for interconnected ISS systems, Automatica, 32(8):1211-1215, 1996.
-
(1996)
Automatica
, vol.32
, Issue.8
, pp. 1211-1215
-
-
Jiang, Z.P.1
Mareels, I.2
Wang, Y.3
-
19
-
-
0025522760
-
Global stabilization of partially linear composite systems
-
A. Saberi, P. V. Kokotovic, and H. J. Sussmann. Global stabilization of partially linear composite systems, SIAM Journal on Control and Optimization, 28(6):1491-1503, 1990.
-
(1990)
SIAM Journal on Control and Optimization
, vol.28
, Issue.6
, pp. 1491-1503
-
-
Saberi, A.1
Kokotovic, P.V.2
Sussmann, H.J.3
-
20
-
-
0004163205
-
Optimal Control
-
John Wiley & Sons
-
F.L. Lewis and V.L. Syrmos. Optimal Control, John Wiley & Sons, 1995.
-
(1995)
-
-
Lewis, F.L.1
Syrmos, V.L.2
-
21
-
-
79959444159
-
Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game
-
D. Vrabie and F. Lewis. Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game, The 2010 International Joint Conference on Neural Networks, 1-8, 2010.
-
(2010)
The 2010 International Joint Conference on Neural Networks
, pp. 1-8
-
-
Vrabie, D.1
Lewis, F.2
-
22
-
-
0004178386
-
Nonlinear Systems
-
3rd edition, Prentice-Hall
-
H. K. Khalil. Nonlinear Systems, 3rd edition, Prentice-Hall, 2002.
-
(2002)
-
-
Khalil, H.K.1
-
23
-
-
84914965022
-
On an iterative technique for Riccati equation computations
-
D. Kleinman. On an iterative technique for Riccati equation computations, IEEE Transactions on Automatic Control, AC-13(1):114-115, 1969.
-
(1969)
IEEE Transactions on Automatic Control
, vol.AC-13
, Issue.1
, pp. 114-115
-
-
Kleinman, D.1
-
24
-
-
0004151494
-
Matrix Analysis
-
Cambridge University Press, NY
-
R. A. Horn and C. R. Johnson. Matrix Analysis, Cambridge University Press, NY, 1985.
-
(1985)
-
-
Horn, R.A.1
Johnson, C.R.2
-
25
-
-
84860701570
-
Robust approximate dynamic programming and global stabilization with nonlinear dynamic uncertainties
-
Proceedings of the Joint 2011 IEEE Conference on Decision and Control and European Control Conference, Orlando, FL
-
Y. Jiang and Z. P. Jiang. Robust approximate dynamic programming and global stabilization with nonlinear dynamic uncertainties, Proceedings of the Joint 2011 IEEE Conference on Decision and Control and European Control Conference, Orlando, FL, 115-120, 2011.
-
(2011)
, pp. 115-120
-
-
Jiang, Y.1
Jiang, Z.P.2
-
26
-
-
67349247013
-
Intelligence in the brain: a theory of how it works and how to build it
-
P.J. Werbos. Intelligence in the brain: a theory of how it works and how to build it, Neural Networks, 22(3):200-212, 2009.
-
(2009)
Neural Networks
, vol.22
, Issue.3
, pp. 200-212
-
-
Werbos, P.J.1
-
27
-
-
0003690086
-
Nonlinear Control Systems
-
Springer-Verlag
-
A. Isidori. Nonlinear Control Systems. Vol. II, Springer-Verlag, 1999.
-
(1999)
, vol.II
-
-
Isidori, A.1
-
28
-
-
0032115473
-
Design of robust adaptive controllers for nonlinear systems with dynamic uncertainties
-
Z. P. Jiang and L. Praly. Design of robust adaptive controllers for nonlinear systems with dynamic uncertainties, Automatica, 34(7):825-840, 1998.
-
(1998)
Automatica
, vol.34
, Issue.7
, pp. 825-840
-
-
Jiang, Z.P.1
Praly, L.2
-
29
-
-
0027148081
-
Robust load-frequency controller design for power systems
-
Y. Wang, R. Zhou, and C. Wen. Robust load-frequency controller design for power systems, IEE Proceedings-Generation, Transmission and Distribution, 140(1):11-16, 1993.
-
(1993)
IEE Proceedings-Generation, Transmission and Distribution
, vol.140
, Issue.1
, pp. 11-16
-
-
Wang, Y.1
Zhou, R.2
Wen, C.3
-
30
-
-
0003564550
-
Linear Control Systems Engineering
-
The McGraw-Hill Companies, Inc.
-
M. Driels. Linear Control Systems Engineering. The McGraw-Hill Companies, Inc., 1996.
-
(1996)
-
-
Driels, M.1