-
1
-
-
84871319455
-
A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
-
S. Bhasin, R. Kamalapurkar, M. Johnson, K. Vamvoudakis, F. L. Lewis, and W. Dixon, "A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems, " Automatica, vol. 49, no. 1, pp. 89-92, 2013.
-
(2013)
Automatica
, vol.49
, Issue.1
, pp. 89-92
-
-
Bhasin, S.1
Kamalapurkar, R.2
Johnson, M.3
Vamvoudakis, K.4
Lewis, F.L.5
Dixon, W.6
-
3
-
-
77950630017
-
Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
-
K. Vamvoudakis and F. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem, " Automatica, vol. 46, pp. 878-888, 2010.
-
(2010)
Automatica
, vol.46
, pp. 878-888
-
-
Vamvoudakis, K.1
Lewis, F.2
-
4
-
-
34548721141
-
Continuous-time adp for linear systems with partially unknown dynamics
-
D. Vrabie, M. Abu-Khalaf, F. Lewis, and Y. Wang, "Continuous-time ADP for linear systems with partially unknown dynamics, " in Proc. IEEE Int. Symp. Approx. Dyn. Program. Reinf. Learn., 2007, pp. 247- 253.
-
(2007)
Proc. IEEE Int. Symp. Approx. Dyn. Program. Reinf. Learn
, pp. 247-253
-
-
Vrabie, D.1
Abu-Khalaf, M.2
Lewis, F.3
Wang, Y.4
-
5
-
-
67349145396
-
Neural network approach to continuoustime direct adaptive optimal control for partially unknown nonlinear systems
-
D. Vrabie and F. Lewis, "Neural network approach to continuoustime direct adaptive optimal control for partially unknown nonlinear systems, " Neural Netw., vol. 22, no. 3, pp. 237 - 246, 2009.
-
(2009)
Neural Netw.
, vol.22
, Issue.3
, pp. 237-246
-
-
Vrabie, D.1
Lewis, F.2
-
6
-
-
68149180889
-
Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
-
T. Dierks, B. Thumati, and S. Jagannathan, "Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence, " Neural Netw., vol. 22, no. 5-6, pp. 851-860, 2009.
-
(2009)
Neural Netw.
, vol.22
, Issue.5-6
, pp. 851-860
-
-
Dierks, T.1
Thumati, B.2
Jagannathan, S.3
-
7
-
-
77950853735
-
Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics
-
T. Dierks and S. Jagannathan, "Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics, " in Proc. IEEE Conf. Decis. Control, 2009, pp. 6750-6755.
-
(2009)
Proc. IEEE Conf. Decis. Control
, pp. 6750-6755
-
-
Dierks, T.1
Jagannathan, S.2
-
8
-
-
83655163786
-
Data-driven robust approx-imate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
-
H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approx-imate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method, " IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2226-2236, 2011.
-
(2011)
IEEE Trans. Neural Netw
, vol.22
, Issue.12
, pp. 2226-2236
-
-
Zhang, H.1
Cui, L.2
Zhang, X.3
Luo, Y.4
-
9
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
K. Doya, "Reinforcement learning in continuous time and space, " Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
-
(2000)
Neural Comput.
, vol.12
, Issue.1
, pp. 219-245
-
-
Doya, K.1
-
10
-
-
33846781129
-
Model-free gleaming designs for linear discrete-time zero-sum games with application to H(X) control
-
A. AI-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Model-free gleaming designs for linear discrete-time zero-sum games with application to H(X) control, " Automatica, vol. 43, pp. 473-481, 2007.
-
(2007)
Automatica
, vol.43
, pp. 473-481
-
-
Ai-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
11
-
-
49049089962
-
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
-
A. AI-Tamimi, F. L. Lewis, and M. Abu-Khalaf "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof, " IEEE Trans. Syst. Man Cybern. Part B Cybern., vol. 38, pp. 943-949, 2008.
-
(2008)
IEEE Trans. Syst. Man Cybern. Part B Cybern
, vol.38
, pp. 943-949
-
-
Ai-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
12
-
-
33751238181
-
A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
-
R. Padhi, N. Unnikrishnan, X. Wang, and S. Balakrishnan, "A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems, " Neural Netw., vol. 19, no. 10, pp. 1648-1660, 2006.
-
(2006)
Neural Netw.
, vol.19
, Issue.10
, pp. 1648-1660
-
-
Padhi, R.1
Unnikrishnan, N.2
Wang, X.3
Balakrishnan, S.4
-
13
-
-
77950806766
-
Q-learning and pontryagin's minimum principle
-
Dec
-
P. Mehta and S. Meyn, "Q-learning and pontryagin's minimum principle, " in Proc. IEEE Conf Decis. Control, Dec. 2009, pp. 3598 -3605.
-
(2009)
Proc. IEEE Conf Decis. Control
, pp. 3598-3605
-
-
Mehta, P.1
Meyn, S.2
-
16
-
-
79952472584
-
Theory and flight-test validation of a concurrent-learning adaptive controller
-
March
-
G. Y. Chowdhary and E. N. Johnson, "Theory and flight-test validation of a concurrent-learning adaptive controller, " J. Guid. Contr. Dynam., vol. 34, no. 2, pp. 592-607, March 2011.
-
(2011)
J. Guid. Contr. Dynam.
, vol.34
, Issue.2
, pp. 592-607
-
-
Chowdhary, G.Y.1
Johnson, E.N.2
-
17
-
-
84880587824
-
Concurrent learning adaptive control of linear systems with exponentially convergent bounds
-
G. Chowdhary, T. Yucelen, M. Mlihlegg, and E. N. Johnson, "Concurrent learning adaptive control of linear systems with exponentially convergent bounds, " Int. J. Adapt Control Signal Process., 2012.
-
(2012)
Int. J. Adapt Control Signal Process
-
-
Chowdhary, G.1
Yucelen, T.2
Mlihlegg, M.3
Johnson, E.N.4
-
19
-
-
0027804823
-
Neural net robot controller with guaranteed tracking performance
-
Chicago, Illinois
-
F. Lewis, K. Liu, and A. Yesildirek, "Neural net robot controller with guaranteed tracking performance, " in Proc. IEEE Int. Symp. Intell. Control, Chicago, Illinois, 1993, pp. 225-231.
-
(1993)
Proc. IEEE Int. Symp. Intell. Control
, pp. 225-231
-
-
Lewis, F.1
Liu, K.2
Yesildirek, A.3
-
20
-
-
0025399567
-
Identification and control of dynamical systems using neural networks
-
K. Narendra and K. Parthasarathy, "Identification and control of dynamical systems using neural networks, " IEEE Trans. Neural Networks, vol. 1, no. 1, pp. 4-27, 1990.
-
(1990)
IEEE Trans. Neural Networks
, vol.1
, Issue.1
, pp. 4-27
-
-
Narendra, K.1
Parthasarathy, K.2
-
21
-
-
4043069840
-
On actor-critic algorithms
-
Y. Konda and J. TsitsikIis, "On actor-critic algorithms, " SIAM J. Contr. Optim., vol. 42, no. 4, pp. 1143-1166, 2004.
-
(2004)
SIAM J. Contr. Optim.
, vol.42
, Issue.4
, pp. 1143-1166
-
-
Konda, Y.1
Tsitsikiis, J.2
-
22
-
-
77957777969
-
Optimal control of affine nonlinear continuous-time systems
-
T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems, " in Proc. Am. Control Conf, 2010, pp. 1568- 1573.
-
(2010)
Proc. Am. Control Conf
, pp. 1568-1573
-
-
Dierks, T.1
Jagannathan, S.2
|