-
2
-
-
84921399937
-
-
Wiley-IEEE Press
-
J. Si, A. G. Barto, W. B. Powell, and D. Wunsch, Handbook of Learning and Approximate Dynamic Programming, Wiley-IEEE Press, 2004.
-
(2004)
Handbook of Learning and Approximate Dynamic Programming
-
-
Si, J.1
Barto, A.G.2
Powell, W.B.3
Wunsch, D.4
-
3
-
-
66449130966
-
Adaptive dynamic programming: An introduction
-
F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: an introduction," IEEE Computational Magazine, vol. 4, no. 2, pp. 39-47, 2009.
-
(2009)
IEEE Computational Magazine
, vol.4
, Issue.2
, pp. 39-47
-
-
Wang, F.Y.1
Zhang, H.2
Liu, D.3
-
4
-
-
70349116541
-
Reinforcement learning and adaptive dynamic programming for feedback control
-
F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits and Systems Magazine, vol. 9, no. 3, pp. 32-50, 2009.
-
(2009)
IEEE Circuits and Systems Magazine
, vol.9
, Issue.3
, pp. 32-50
-
-
Lewis, F.L.1
Vrabie, D.2
-
5
-
-
0031236002
-
Adaptive critic designs
-
D. V. Prokhorov and D. C. Wunsch II, "Adaptive critic designs," IEEE Trans. Neural Networks, vol. 8, no. 5, pp. 997-1007, 1997.
-
(1997)
IEEE Trans. Neural Networks
, vol.8
, Issue.5
, pp. 997-1007
-
-
Prokhorov, D.V.1
Wunsch II, D.C.2
-
6
-
-
0028584964
-
Adaptive linear quadratic control using policy iteration
-
Baltimore, Maryland
-
S. J. Bradtke and B. E. Ydstie, "Adaptive linear quadratic control using policy iteration," Proc. American Control Conference, Baltimore, Maryland, pp. 3475-3479, 1994.
-
(1994)
Proc. American Control Conference
, pp. 3475-3479
-
-
Bradtke, S.J.1
Ydstie, B.E.2
-
7
-
-
33847648898
-
Adaptive critic designs for discrete-time zero-sum games with application to H1 control
-
A. Al-Tamimi, M. Abu-Khalaf, and F. L. Lewis, "Adaptive critic designs for discrete-time zero-sum games with application to H1 control," IEEE Trans. Syst., Man, Cybern.-Part B, vol. 37, no. 1, pp. 240-247, 2007.
-
(2007)
IEEE Trans. Syst., Man, Cybern.-Part B
, vol.37
, Issue.1
, pp. 240-247
-
-
Al-Tamimi, A.1
Abu-Khalaf, M.2
Lewis, F.L.3
-
8
-
-
33846781129
-
Model-free Q-learning designs for discrete-time zero-sum games with application to H1 control
-
A. Al-Tamimi, M. Abu-Khalaf, and F. L. Lewis, "Model-free Q-learning designs for discrete-time zero-sum games with application to H1 control," Automatica, vol. 43, no. 3, 473-481, 2007.
-
(2007)
Automatica
, vol.43
, Issue.3
, pp. 473-481
-
-
Al-Tamimi, A.1
Abu-Khalaf, M.2
Lewis, F.L.3
-
9
-
-
49049089962
-
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
-
A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof," IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics, vol. 38, no. 4, 2008.
-
(2008)
IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics
, vol.38
, Issue.4
-
-
Al-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
10
-
-
0033629916
-
Reinforcement learning in continuous-time and space
-
K. Doya, "Reinforcement learning in continuous-time and space," Neural Computation 12, pp. 219-245, 2000.
-
(2000)
Neural Computation
, vol.12
, pp. 219-245
-
-
Doya, K.1
-
11
-
-
34249047468
-
Continuous-time adaptive critics
-
T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics," IEEE Trans. Neural Network, vol. 18, no. 3, pp. 631-647, 2007.
-
(2007)
IEEE Trans. Neural Network
, vol.18
, Issue.3
, pp. 631-647
-
-
Hanselmann, T.1
Noakes, L.2
Zaknich, A.3
-
12
-
-
0036588686
-
Adaptive dynamic programming
-
J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive Dynamic Programming," IEEE Trans. Systems, Mans and Cybernetics- PART B: Cybernetics, vol. 32, no. 2, pp. 140-153, 2002.
-
(2002)
IEEE Trans. Systems, Mans and Cybernetics- PART B: Cybernetics
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
13
-
-
58349110975
-
Adaptive optimal control for continuous-time linear systems based on policy iteration
-
D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
-
(2009)
Automatica
, vol.45
, Issue.2
, pp. 477-484
-
-
Vrabie, D.1
Pastravanu, O.2
Abu-Khalaf, M.3
Lewis, F.L.4
-
14
-
-
14844340822
-
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
-
M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, 2005.
-
(2005)
Automatica
, vol.41
, Issue.5
, pp. 779-791
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
15
-
-
67349145396
-
Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
-
D. Vrable and F. L. Lewis "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Networks, vol. 22, no. 3, pp. 237-246, 2009.
-
(2009)
Neural Networks
, vol.22
, Issue.3
, pp. 237-246
-
-
Vrable, D.1
Lewis, F.L.2
-
16
-
-
77950630017
-
Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
-
878-888
-
K. G. Vamvoudakis, F. L. Lewis "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, pp. 878-888 vol. 46, no. 5, pp. 878-888, 2010.
-
(2010)
Automatica
, vol.46
, Issue.5
, pp. 878-888
-
-
Vamvoudakis, K.G.1
Lewis, F.L.2
-
17
-
-
79953151751
-
A model-free robust policy iteration algorithm for optimal control of nonlinear systems
-
Atlanta, GA
-
S. Bhasin, M. Johnson, W. E. Dixon, "A model-free robust policy iteration algorithm for optimal control of nonlinear systems," 49th IEEE Conf. Decision and Control, Atlanta, GA, pp. 3060-3065, 2010.
-
(2010)
49th IEEE Conf. Decision and Control
, pp. 3060-3065
-
-
Bhasin, S.1
Johnson, M.2
Dixon, W.E.3
-
18
-
-
78751528766
-
Policy-iteration-based adaptive optimal control for uncertain continuous-time linear systems with excitation signals
-
Ilsan, Kyonggi-Do, South Korea, Oct.
-
J. Y. Lee, J. B. Park, and Y. H. Choi, "Policy-iteration-based adaptive optimal control for uncertain continuous-time linear systems with excitation signals," in Proc. Int'l Conf. on Control, Automation, and Systems (ICCAS), Ilsan, Kyonggi-Do, South Korea, pp. 646-651, Oct. 2010.
-
(2010)
Proc. Int'l Conf. on Control, Automation, and Systems (ICCAS)
, pp. 646-651
-
-
Lee, J.Y.1
Park, J.B.2
Choi, Y.H.3
-
19
-
-
84867400046
-
Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
-
accepted for publication
-
J. Y. Lee, J. B. Park, and Y. H. Choi, "Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems," Automatica, accepted for publication, 2012.
-
(2012)
Automatica
-
-
Lee, J.Y.1
Park, J.B.2
Choi, Y.H.3
-
20
-
-
0018441647
-
An approximation theory of optimal control for trainable manipulators
-
G. N. Saridis and C. G. Lee, "An approximation theory of optimal control for trainable manipulators," IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics, vol. 9, no. 3, 1979.
-
(1979)
IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics
, vol.9
, Issue.3
-
-
Saridis, G.N.1
Lee, C.G.2
-
21
-
-
14544289894
-
A note on persistency of excitation
-
J. C. Willems, P. Rapisarda, I. Markovsky, and Bart L. M. Moor, "A note on persistency of excitation," Systems & Control Letters, vol. 54, no. 4, pp. 325-329, 2005.
-
(2005)
Systems & Control Letters
, vol.54
, Issue.4
, pp. 325-329
-
-
Willems, J.C.1
Rapisarda, P.2
Markovsky, I.3
Moor, B.L.M.4
-
22
-
-
62949149213
-
Constrained nonlinear optimal control: A converse HJB approach
-
Pasadena, CA 91125
-
V. Nevistic and J. A. Primbs, "Constrained nonlinear optimal control: a converse HJB approach," Technical report CIT-CDS 96-021, California Institute of Technology, Pasadena, CA 91125, 1996
-
(1996)
Technical Report CIT-CDS 96-021, California Institute of Technology
-
-
Nevistic, V.1
Primbs, J.A.2
|