-
1
-
-
0015667648
-
Punish/reward: learning with a critic in adaptive threshold systems
-
Widrow B, Gupta N K, Maitra S. Punish/reward: learning with a critic in adaptive threshold systems. IEEE Transactions on Systems, Man, and Cybernetics, 1973, 3(5): 455-465
-
(1973)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.3
, Issue.5
, pp. 455-465
-
-
Widrow, B.1
Gupta, N.K.2
Maitra, S.3
-
2
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Barto A G, Sutton R S, Anderson C W. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, 1983, 13(5): 835-846
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.13
, Issue.5
, pp. 835-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
6
-
-
0035273403
-
Online learning control by association and reinforcement
-
Si J, Wang Y T. Online learning control by association and reinforcement. IEEE Transactions on Neural Networks, 2001, 12(2): 264-276
-
(2001)
IEEE Transactions on Neural Networks
, vol.12
, Issue.2
, pp. 264-276
-
-
Si, J.1
Wang, Y.T.2
-
8
-
-
0036588686
-
Adaptive dynamic programming
-
Murray J J, Cox C J, Lendaris G G, Saeks R. Adaptive dynamic programming. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 2002, 32(2): 140-153
-
(2002)
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
9
-
-
14844340822
-
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
-
Abu-Khalaf M, Lewis F L. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica, 2005, 41(5): 779-791
-
(2005)
Automatica
, vol.41
, Issue.5
, pp. 779-791
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
10
-
-
15044349684
-
Approximate dynamic programming for self-learning control
-
Liu De-Rong. Approximate dynamic programming for self-learning control. Acta Automatica Sinica, 2005, 31(1): 13-18
-
(2005)
Acta Automatica Sinica
, vol.31
, Issue.1
, pp. 13-18
-
-
Liu, D.-R.1
-
11
-
-
34249712124
-
A neural dynamic programming approach for learning control of failure avoidance problems
-
Liu D R, Zhang H G. A neural dynamic programming approach for learning control of failure avoidance problems. International Journal of Intelligent Control and Systems, 2005, 10(1): 21-32
-
(2005)
International Journal of Intelligent Control and Systems
, vol.10
, Issue.1
, pp. 21-32
-
-
Liu, D.R.1
Zhang, H.G.2
-
12
-
-
33751238181
-
A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonliner systems
-
Padhi R, Unnikrishnan N, Wang X H, Balakrishnan S N. A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonliner systems. Neural Networks, 2006, 19(10): 1648-1660
-
(2006)
Neural Networks
, vol.19
, Issue.10
, pp. 1648-1660
-
-
Padhi, R.1
Unnikrishnan, N.2
Wang, X.H.3
Balakrishnan, S.N.4
-
15
-
-
36348986773
-
Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach
-
Cheng T, Lewis F L, Abu-Khalaf M. Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach. IEEE Transactions on Neural Networks, 2007, 18(6): 1725-1736
-
(2007)
IEEE Transactions on Neural Networks
, vol.18
, Issue.6
, pp. 1725-1736
-
-
Cheng, T.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
16
-
-
49049119493
-
A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm
-
Zhang H G, Wei Q L, Luo Y H. A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2008, 38(4): 937-942
-
(2008)
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
, vol.38
, Issue.4
, pp. 937-942
-
-
Zhang, H.G.1
Wei, Q.L.2
Luo, Y.H.3
-
17
-
-
49049087720
-
Reinforcement learning in continuous time and space: interference and not ill conditioning is the main problem when using distributed function approximators
-
Baddeley B. Reinforcement learning in continuous time and space: interference and not ill conditioning is the main problem when using distributed function approximators. IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 2008, 38(4): 950-956
-
(2008)
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
, vol.38
, Issue.4
, pp. 950-956
-
-
Baddeley, B.1
-
18
-
-
66449130966
-
Adaptive dynamic programming: an introduction
-
Wang F Y, Zhang H G, Liu D R. Adaptive dynamic programming: an introduction. IEEE Computational Intelligence Magazine, 2009, 4(2): 39-47
-
(2009)
IEEE Computational Intelligence Magazine
, vol.4
, Issue.2
, pp. 39-47
-
-
Wang, F.Y.1
Zhang, H.G.2
Liu, D.R.3
-
19
-
-
39549085591
-
Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete-time systems
-
Chen Z, Jagannathan S. Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete-time systems. IEEE Transactions on Neural Networks, 2008, 19(1): 90-106
-
(2008)
IEEE Transactions on Neural Networks
, vol.19
, Issue.1
, pp. 90-106
-
-
Chen, Z.1
Jagannathan, S.2
-
20
-
-
0242627940
-
Nonlinear discrete-time systems: constrained optimization and application of nonquadratic costs
-
Philadelphia, USA: IEEE
-
Lyshevski S E. Nonlinear discrete-time systems: constrained optimization and application of nonquadratic costs. In: Proceedings of the American Control Conference. Philadelphia, USA: IEEE, 1998. 3699-3703
-
(1998)
Proceedings of the American Control Conference
, pp. 3699-3703
-
-
Lyshevski, S.E.1
|