-
1
-
-
14844340822
-
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
-
M Abu-Khalaf, F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach", Automatica, vol. 41, no. 5, pp. 779-791, 2005.
-
(2005)
Automatica
, vol.41
, Issue.5
, pp. 779-791
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
2
-
-
33845759425
-
Policy Iterations and the Hamilton-Jacobi-Isaacs equation for H-infmity state-feedback control with input saturation
-
December
-
M. Abu-Khalaf, F. L. Lewis, Huang, J., "Policy Iterations and the Hamilton-Jacobi-Isaacs equation for H-infmity state-feedback control with input saturation, " IEEE Transactions on Automatic Control, pp. 1989-1995, December, 2006.
-
(2006)
IEEE Transactions on Automatic Control
, pp. 1989-1995
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
Huang, J.3
-
3
-
-
33846781129
-
Model-Free Q-Learning Designs for Discrete-Time Zero-Sum Games with Application to H-Infinity Control
-
A. Al-Tamimi, F. L. Lewis, M. Abu-Khalaf, "Model-Free Q-Learning Designs for Discrete-Time Zero-Sum Games with Application to H-Infinity Control", Automatica, Vol. 43, pp. 473-481, 2007.
-
(2007)
Automatica
, vol.43
, pp. 473-481
-
-
Al-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
4
-
-
33847648898
-
-
A. Al-Tamimi, M. Abu-Khalaf, F. L. Lewis, Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to H-infinity Control, IEEE Trans. on Sys., Man, and Cyb -B, 37, No. l, February, 2007.
-
A. Al-Tamimi, M. Abu-Khalaf, F. L. Lewis, "Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to H-infinity Control", IEEE Trans. on Sys., Man, and Cyb -B, Vol. 37, No. l, February, 2007.
-
-
-
-
5
-
-
0028584964
-
Adaptive linear quadratic control using policy iteration,
-
Baltmore, Maryland, June
-
S. J. Bradtke, B. E. Ydestie, A. G. Barto, "Adaptive linear quadratic control using policy iteration, " Proceedings of the American Control Conference, pp. 3475-3476, Baltmore, Maryland, June, 1994
-
(1994)
Proceedings of the American Control Conference
, pp. 3475-3476
-
-
Bradtke, S.J.1
Ydestie, B.E.2
Barto, A.G.3
-
6
-
-
0031332446
-
Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
-
R. Beard, G. Saridis, J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation", Automatica, vol. 33, no. 12, pp. 2159-2177, 1997.
-
(1997)
Automatica
, vol.33
, Issue.12
, pp. 2159-2177
-
-
Beard, R.1
Saridis, G.2
Wen, J.3
-
8
-
-
0033629916
-
Reinforcement Learning In Continuous Time and Space
-
K. Doya, "Reinforcement Learning In Continuous Time and Space", Neural Computation, 12 (1), pp. 219-245, 2000.
-
(2000)
Neural Computation
, vol.12
, Issue.1
, pp. 219-245
-
-
Doya, K.1
-
9
-
-
34249047468
-
Continuous-time adaptive critics
-
T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics", IEEE Transactions on Neural Networks, 18 (3), 631-647, 2007.
-
(2007)
IEEE Transactions on Neural Networks
, vol.18
, Issue.3
, pp. 631-647
-
-
Hanselmann, T.1
Noakes, L.2
Zaknich, A.3
-
10
-
-
0025627940
-
Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks
-
K. Hornik, M. Stinchcombe, H. White, "Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks", Neural Networks, 3, pp. 551-560, 1990.
-
(1990)
Neural Networks
, vol.3
, pp. 551-560
-
-
Hornik, K.1
Stinchcombe, M.2
White, H.3
-
11
-
-
0002526302
-
Construction of Suboptimal Control Sequences
-
R. J. Leake, Ruey-Wen Liu, "Construction of Suboptimal Control Sequences", J. SIAM Control, 5 (1), 1967.
-
(1967)
J. SIAM Control
, vol.5
, Issue.1
-
-
Leake, R.J.1
Wen Liu, R.2
-
13
-
-
84914965022
-
On an iterative technique for Riccati equation computations
-
February
-
D. Kleinman, "On an iterative technique for Riccati equation computations", IEEE Trans. on Automatic Control, vol. 13, pp. 114-115, February, 1968.
-
(1968)
IEEE Trans. on Automatic Control
, vol.13
, pp. 114-115
-
-
Kleinman, D.1
-
15
-
-
0036588686
-
Adaptive dynamic programming
-
J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming", IEEE Trans. on Systems, Man and Cybernetics, vol. 32, no. 2, pp 140-153, 2002.
-
(2002)
IEEE Trans. on Systems, Man and Cybernetics
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
16
-
-
0031236002
-
Adaptive critic designs
-
D. Prokhorov, D. Wunsch, "Adaptive critic designs, " IEEE Trans. on Neural Networks, vol. 8, no 5, pp. 997-1007, 1997.
-
(1997)
IEEE Trans. on Neural Networks
, vol.8
, Issue.5
, pp. 997-1007
-
-
Prokhorov, D.1
Wunsch, D.2
-
17
-
-
84921399937
-
-
John Wiley, New Jersey
-
J. Si, A. Barto, W. Powell, D. Wunsch, Handbook of Learning and Approximate Dynamic Programming, John Wiley, New Jersey, 2004.
-
(2004)
Handbook of Learning and Approximate Dynamic Programming
-
-
Si, J.1
Barto, A.2
Powell, W.3
Wunsch, D.4
-
19
-
-
0004102479
-
-
MIT Press, Cambridge, Massachusetts
-
R. S. Sutton, A. G. Barto, Reinforcement Learning mdash; An Introduction, MIT Press, Cambridge, Massachusetts, 1998.
-
(1998)
Reinforcement Learning mdash; An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
20
-
-
0042466434
-
On the convergence of optimistic policy iteration
-
J. N. Tsitsiklis, "On the convergence of optimistic policy iteration", Journal of Machine Learning Research, 3, pp. 59-72, 2002.
-
(2002)
Journal of Machine Learning Research
, vol.3
, pp. 59-72
-
-
Tsitsiklis, J.N.1
-
21
-
-
63049136575
-
Adaptive optimal control algorithm for continuous-time nonlinear systems based on policy iteration
-
IEEE
-
D. Vrabie, F. Lewis, "Adaptive optimal control algorithm for continuous-time nonlinear systems based on policy iteration", IEEE Proc. CDC'08, IEEE, 2008.
-
(2008)
IEEE Proc. CDC'08
-
-
Vrabie, D.1
Lewis, F.2
-
22
-
-
58349110975
-
Adaptive optimal control for continuous-time linear systems based on policy iteration
-
to be published, doi:10.1016/j.automatica.2008.08.017
-
D. Vrabie, O. Pastravanu, F. Lewis, M. Abu-Khalaf, "Adaptive optimal control for continuous-time linear systems based on policy iteration", Automatica (to be published), doi:10.1016/j.automatica.2008.08.017.
-
Automatica
-
-
Vrabie, D.1
Pastravanu, O.2
Lewis, F.3
Abu-Khalaf, M.4
-
24
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling,
-
ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold
-
P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling, " Handbook of Intelligent Control, ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold, 1992.
-
(1992)
Handbook of Intelligent Control
-
-
Werbos, P.J.1
-
25
-
-
0024888479
-
Neural networks for control and system identification
-
IEEE
-
P. Werbos, "Neural networks for control and system identification", IEEE Proc. CDC'89, IEEE, 1989.
-
(1989)
IEEE Proc. CDC'89
-
-
Werbos, P.1
|