-
1
-
-
14844340822
-
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
-
May
-
M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005.
-
(2005)
Automatica
, vol.41
, Issue.5
, pp. 779-791
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
2
-
-
33847648898
-
∞ control
-
Feb
-
∞ control," IEEE Trans. Syst. Man Cybern. B, Cybern., vol. 37, no. 1, pp. 240-247, Feb. 2007.
-
(2007)
IEEE Trans. Syst. Man Cybern. B, Cybern
, vol.37
, Issue.1
, pp. 240-247
-
-
Al-Tamimi, A.1
Abu-Khalaf, M.2
Lewis, F.L.3
-
3
-
-
34548709862
-
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
-
Honolulu, HI, Apr
-
A. Al-Tamimi and F. L. Lewis, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 38-43.
-
(2007)
Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn
, pp. 38-43
-
-
Al-Tamimi, A.1
Lewis, F.L.2
-
4
-
-
49049089962
-
Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
-
Aug
-
A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst. Man Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
-
(2008)
IEEE Trans. Syst. Man Cybern. B, Cybern
, vol.38
, Issue.4
, pp. 943-949
-
-
Al-Tamimi, A.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
5
-
-
84898962948
-
Policy search by dynamic programming
-
Vancouver, BC, Canada, Dec
-
J. Bagnell, S. Kakade, A. Ng, and J. Schneider, "Policy search by dynamic programming," in Proc. 17th Annu. Conf. Neural Inf. Process. Syst., Vancouver, BC, Canada, Dec. 2003, vol. 16, pp. 831-838.
-
(2003)
Proc. 17th Annu. Conf. Neural Inf. Process. Syst
, vol.16
, pp. 831-838
-
-
Bagnell, J.1
Kakade, S.2
Ng, A.3
Schneider, J.4
-
7
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Sep./Oct
-
A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems," IEEE Trans. Syst. Man Cybern., vol. SMC-13, no. 5, pp. 835-846, Sep./Oct. 1983.
-
(1983)
IEEE Trans. Syst. Man Cybern
, vol.SMC-13
, Issue.5
, pp. 835-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
8
-
-
0003785722
-
-
Ph.D. dissertation, Electr. Eng. Dept, Rensselaer Polytech. Inst, Troy, NY
-
R. Beard, "Improving the closed-loop performance of nonlinear systems," Ph.D. dissertation, Electr. Eng. Dept., Rensselaer Polytech. Inst., Troy, NY, 1995.
-
(1995)
Improving the closed-loop performance of nonlinear systems
-
-
Beard, R.1
-
9
-
-
0003787146
-
-
Princeton, NJ: Princeton Univ. Press
-
R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
10
-
-
0029403342
-
Optimal nonlinear, but continuous, feedback control of systems with saturating actuators
-
D. S. Bernstein, "Optimal nonlinear, but continuous, feedback control of systems with saturating actuators," Int. J. Control, vol. 62, no. 5, pp. 1209-1216, 1995.
-
(1995)
Int. J. Control
, vol.62
, Issue.5
, pp. 1209-1216
-
-
Bernstein, D.S.1
-
12
-
-
39549085591
-
Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete-time systems
-
Jan
-
Z. Chen and S. Jagannathan, "Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete-time systems," IEEE Trans. Neural Netw., vol. 19, no. 1, pp. 90-106, Jan. 2008.
-
(2008)
IEEE Trans. Neural Netw
, vol.19
, Issue.1
, pp. 90-106
-
-
Chen, Z.1
Jagannathan, S.2
-
13
-
-
36348986773
-
Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach
-
Nov
-
T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach," IEEE Trans. Neural Netw., vol. 18, no. 6, pp. 1725-1736, Nov. 2007.
-
(2007)
IEEE Trans. Neural Netw
, vol.18
, Issue.6
, pp. 1725-1736
-
-
Cheng, T.1
Lewis, F.L.2
Abu-Khalaf, M.3
-
14
-
-
61849156138
-
A performance gradient perspective on approximate dynamic programming and its application to partially observable Markov decision processes
-
Munich, Germany, Oct
-
J. Dankert, Y. Lei, and J. Si, "A performance gradient perspective on approximate dynamic programming and its application to partially observable Markov decision processes," in Proc. Int. Symp. Intell. Control, Munich, Germany, Oct. 2006, pp. 458-463.
-
(2006)
Proc. Int. Symp. Intell. Control
, pp. 458-463
-
-
Dankert, J.1
Lei, Y.2
Si, J.3
-
15
-
-
0043026775
-
Helicopter trimming and tracking control using direct neural dynamic programming
-
Jul
-
R. Enns and J. Si, "Helicopter trimming and tracking control using direct neural dynamic programming," IEEE Trans. Neural Netw., vol. 14, no. 4, pp. 929-939, Jul. 2003.
-
(2003)
IEEE Trans. Neural Netw
, vol.14
, Issue.4
, pp. 929-939
-
-
Enns, R.1
Si, J.2
-
16
-
-
4944255184
-
Online adaptive critic flight control
-
S. Ferrari and R. F. Stengel, "Online adaptive critic flight control," J. Guid. Control Dyn., vol. 27, no. 5, pp. 777-786, 2004.
-
(2004)
J. Guid. Control Dyn
, vol.27
, Issue.5
, pp. 777-786
-
-
Ferrari, S.1
Stengel, R.F.2
-
17
-
-
34249047468
-
Continuous-time adaptive critics
-
May
-
T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics," IEEE Trans. Neural Netw., vol. 18, no. 3, pp. 631-647, May 2007.
-
(2007)
IEEE Trans. Neural Netw
, vol.18
, Issue.3
, pp. 631-647
-
-
Hanselmann, T.1
Noakes, L.2
Zaknich, A.3
-
18
-
-
0030702730
-
Training strategies for critic and action neural networks in dual heuristic programming method
-
Houston, TX, Jun
-
G. G. Lendaris and C. Paintz, "Training strategies for critic and action neural networks in dual heuristic programming method," in Proc. Int. Joint Conf. Neural Netw., Houston, TX, Jun. 1997, vol. 2, pp. 712-717.
-
(1997)
Proc. Int. Joint Conf. Neural Netw
, vol.2
, pp. 712-717
-
-
Lendaris, G.G.1
Paintz, C.2
-
19
-
-
34548752493
-
Discrete-time adaptive dynamic programming using wavelet basis function neural networks
-
Honolulu, HI, Apr
-
N. Jin, D. Liu, T. Huang, and Z. Pang, "Discrete-time adaptive dynamic programming using wavelet basis function neural networks," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 135-142.
-
(2007)
Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn
, pp. 135-142
-
-
Jin, N.1
Liu, D.2
Huang, T.3
Pang, Z.4
-
20
-
-
34548772562
-
Robust dynamic programming for discounted infinite-horizon Markov decision processes with uncertain stationary transition matrices
-
Honolulu, HI, Apr
-
B. Li and J. Si, "Robust dynamic programming for discounted infinite-horizon Markov decision processes with uncertain stationary transition matrices," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 96-102.
-
(2007)
Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn
, pp. 96-102
-
-
Li, B.1
Si, J.2
-
21
-
-
0034548295
-
Convergence analysis of adaptive critic based optimal control
-
Chicago, IL, Jun
-
X. Liu and S. N. Balakrishnan, "Convergence analysis of adaptive critic based optimal control," in Proc. Amer. Control Conf., Chicago, IL, Jun. 2000, pp. 1929-1933.
-
(2000)
Proc. Amer. Control Conf
, pp. 1929-1933
-
-
Liu, X.1
Balakrishnan, S.N.2
-
22
-
-
49049108697
-
Adaptive critic learning techniques for engine torque and air-fuel ratio control
-
Aug
-
D. Liu, H. Javaherian, O. Kovalenko, and T. Huang, "Adaptive critic learning techniques for engine torque and air-fuel ratio control," IEEE Trans. Syst. Man Cybern. B, Cybern., vol. 38, no. 4, pp. 988-993, Aug. 2008.
-
(2008)
IEEE Trans. Syst. Man Cybern. B, Cybern
, vol.38
, Issue.4
, pp. 988-993
-
-
Liu, D.1
Javaherian, H.2
Kovalenko, O.3
Huang, T.4
-
23
-
-
0034863083
-
Action-dependent adaptive critic designs
-
Washington, DC, Jul
-
D. Liu, X. Xiong, and Y. Zhang, "Action-dependent adaptive critic designs," in Proc. Int. Joint Conf. Neural Netw., Washington, DC, Jul. 2001, vol. 2, pp. 990-995.
-
(2001)
Proc. Int. Joint Conf. Neural Netw
, vol.2
, pp. 990-995
-
-
Liu, D.1
Xiong, X.2
Zhang, Y.3
-
24
-
-
34249712124
-
A neural dynamic programming approach for learning control of failure avoidance problems
-
D. Liu and H. Zhang, "A neural dynamic programming approach for learning control of failure avoidance problems," Int. J. Intell. Control Syst., vol. 10, no. 1, pp. 21-32, 2005.
-
(2005)
Int. J. Intell. Control Syst
, vol.10
, Issue.1
, pp. 21-32
-
-
Liu, D.1
Zhang, H.2
-
25
-
-
26844483839
-
A self-learning call admission control scheme for CDMA cellular networks
-
Sep
-
D. Liu, Y. Zhang, and H. Zhang, "A self-learning call admission control scheme for CDMA cellular networks," IEEE Trans. Neural Netw., vol. 16, no. 5, pp. 1219-1228, Sep. 2005.
-
(2005)
IEEE Trans. Neural Netw
, vol.16
, Issue.5
, pp. 1219-1228
-
-
Liu, D.1
Zhang, Y.2
Zhang, H.3
-
26
-
-
0036996219
-
Optimization of dynamic systems using novel performance functionals
-
Las Vegas, NV, Dec
-
S. E. Lyshevski, "Optimization of dynamic systems using novel performance functionals," in Proc. 41st Conf. Decision Control, Las Vegas, NV, Dec. 2002, pp. 753-758.
-
(2002)
Proc. 41st Conf. Decision Control
, pp. 753-758
-
-
Lyshevski, S.E.1
-
27
-
-
84881324637
-
Optimal control of nonlinear continuous-time systems: Design of bounded controllers via generalized nonquadratic functionals
-
Philadelphia, PA, Jun
-
S. E. Lyshevski, "Optimal control of nonlinear continuous-time systems: Design of bounded controllers via generalized nonquadratic functionals," in Proc. Amer. Control Conf., Philadelphia, PA, Jun. 1998, pp. 205-209.
-
(1998)
Proc. Amer. Control Conf
, pp. 205-209
-
-
Lyshevski, S.E.1
-
28
-
-
0242627940
-
Nonlinear discrete-time systems: Constrained optimization and application of nonquadratic costs
-
Philadelphia, PA, Jun
-
S. E. Lyshevski, "Nonlinear discrete-time systems: Constrained optimization and application of nonquadratic costs," in Proc. Amer. Control Conf., Philadelphia, PA, Jun. 1998, pp. 3699-3703.
-
(1998)
Proc. Amer. Control Conf
, pp. 3699-3703
-
-
Lyshevski, S.E.1
-
29
-
-
0036588686
-
Adaptive dynamic programming
-
May
-
J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst. Man Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002.
-
(2002)
IEEE Trans. Syst. Man Cybern. C, Appl. Rev
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
30
-
-
33751238181
-
A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
-
Dec
-
R. Padhi, N. Unnikrishnan, X. Wang, and S. N. Balakrishnan, "A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems," Neural Netw., vol. 19, no. 10, pp. 1648-1660, Dec. 2006.
-
(2006)
Neural Netw
, vol.19
, Issue.10
, pp. 1648-1660
-
-
Padhi, R.1
Unnikrishnan, N.2
Wang, X.3
Balakrishnan, S.N.4
-
31
-
-
0242337541
-
Adaptive-critic-based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks
-
Sep./Oct
-
J.-W. Park, R. G. Harley, and G. K. Venayagamoorthy, "Adaptive-critic-based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks," IEEE Trans. Ind. Appl., vol. 39, no. 5, pp. 1529-1540, Sep./Oct. 2003.
-
(2003)
IEEE Trans. Ind. Appl
, vol.39
, Issue.5
, pp. 1529-1540
-
-
Park, J.-W.1
Harley, R.G.2
Venayagamoorthy, G.K.3
-
32
-
-
0031236002
-
Adaptive critic designs
-
Sep
-
D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
-
(1997)
IEEE Trans. Neural Netw
, vol.8
, Issue.5
, pp. 997-1007
-
-
Prokhorov, D.V.1
Wunsch, D.C.2
-
33
-
-
0030104564
-
Control of linear systems with saturating actuators
-
Mar
-
A. Saberi, Z. Lin, and A. Teel, "Control of linear systems with saturating actuators," IEEE Trans. Autom. Control, vol. 41, no. 3, pp. 368-378, Mar. 1996.
-
(1996)
IEEE Trans. Autom. Control
, vol.41
, Issue.3
, pp. 368-378
-
-
Saberi, A.1
Lin, Z.2
Teel, A.3
-
34
-
-
0018441647
-
An approximation theory of optimal control for trainable manipulators
-
Mar
-
G. Saridis and C. S. Lee, "An approximation theory of optimal control for trainable manipulators," IEEE Trans. Syst. Man Cybern., vol. SMC-9, no. 2, pp. 152-159, Mar. 1979.
-
(1979)
IEEE Trans. Syst. Man Cybern
, vol.SMC-9
, Issue.2
, pp. 152-159
-
-
Saridis, G.1
Lee, C.S.2
-
35
-
-
0039434283
-
Suboptimal control of nonlinear stochastic systems
-
G. N. Saridis and F. Y. Wang, "Suboptimal control of nonlinear stochastic systems," Control-Theory Adv. Technol., vol. 10, no. 4, pp. 847-871, 1994.
-
(1994)
Control-Theory Adv. Technol
, vol.10
, Issue.4
, pp. 847-871
-
-
Saridis, G.N.1
Wang, F.Y.2
-
36
-
-
0041376883
-
Intelligent supply chain management using adaptive critic learning
-
Mar
-
S. Shervais, T. T. Shannon, and G. G. Lendaris, "Intelligent supply chain management using adaptive critic learning," IEEE Trans. Syst. Man Cybern. A, Syst. Humans, vol. 33, no. 2, pp. 235-244, Mar. 2003.
-
(2003)
IEEE Trans. Syst. Man Cybern. A, Syst. Humans
, vol.33
, Issue.2
, pp. 235-244
-
-
Shervais, S.1
Shannon, T.T.2
Lendaris, G.G.3
-
37
-
-
0035273403
-
On-line learning control by association and reinforcement
-
Mar
-
J. Si and Y.-T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
-
(2001)
IEEE Trans. Neural Netw
, vol.12
, Issue.2
, pp. 264-276
-
-
Si, J.1
Wang, Y.-T.2
-
38
-
-
0028712602
-
A general result on the stabilization of linear systems using bounded controls
-
Dec
-
H. Sussmann, E. D. Sontag, and Y. Yang, "A general result on the stabilization of linear systems using bounded controls," IEEE Trans. Autom. Control, vol. 39, no. 12, pp. 2411-2425, Dec. 1994.
-
(1994)
IEEE Trans. Autom. Control
, vol.39
, Issue.12
, pp. 2411-2425
-
-
Sussmann, H.1
Sontag, E.D.2
Yang, Y.3
-
39
-
-
34047218055
-
Suboptimal control for nonlinear stochastic systems
-
Tucson, AZ, Dec
-
F.-Y. Wang and G. N. Saridis, "Suboptimal control for nonlinear stochastic systems," in Proc. 31st IEEE Conf. Decision Control, Tucson, AZ, Dec. 1992, pp. 1856-1861.
-
(1992)
Proc. 31st IEEE Conf. Decision Control
, pp. 1856-1861
-
-
Wang, F.-Y.1
Saridis, G.N.2
-
40
-
-
0004049893
-
Learning from delayed rewards
-
Ph.D. dissertation, Dept. Psychol, Cambridge University, Cambridge, U.K
-
C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Dept. Psychol., Cambridge University, Cambridge, U.K., 1989.
-
(1989)
-
-
Watkins, C.1
-
41
-
-
0002011091
-
A menu of designs for reinforcement learning over time
-
W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press
-
P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1991, pp. 67-95.
-
(1991)
Neural Networks for Control
, pp. 67-95
-
-
Werbos, P.J.1
-
42
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
D. A. White and D. A. Sofge, Eds. New York: Van Nostrand, ch. 13
-
P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand, 1992, ch. 13.
-
(1992)
Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches
-
-
Werbos, P.J.1
-
43
-
-
34548766755
-
Using ADP to understand and replicate brain intelligence: The next level design
-
Honolulu, HI, Apr
-
P. J. Werbos, "Using ADP to understand and replicate brain intelligence: The next level design," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 209-216.
-
(2007)
Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn
, pp. 209-216
-
-
Werbos, P.J.1
-
44
-
-
0015667648
-
Punish/reward: Learning with a critic in adaptive threshold systems
-
Sep
-
B. Widrow, N. Gupta, and S. Maitra, "Punish/reward: Learning with a critic in adaptive threshold systems," IEEE Trans. Syst. Man Cybern., vol. SMC-3, no. 5, pp. 455-465, Sep. 1973.
-
(1973)
IEEE Trans. Syst. Man Cybern
, vol.SMC-3
, Issue.5
, pp. 455-465
-
-
Widrow, B.1
Gupta, N.2
Maitra, S.3
-
45
-
-
34547133970
-
Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks
-
Jul
-
V. Yadav, R. Padhi, and S. N. Balakrishnan, "Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 1115-1128, Jul. 2007.
-
(2007)
IEEE Trans. Neural Netw
, vol.18
, Issue.4
, pp. 1115-1128
-
-
Yadav, V.1
Padhi, R.2
Balakrishnan, S.N.3
-
46
-
-
34548730950
-
Online reinforcement learning neural network controller design for nanomanipulation
-
Honolulu, HI, Apr
-
Q. Yang and S. Jagannathan, "Online reinforcement learning neural network controller design for nanomanipulation," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 225-232.
-
(2007)
Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn
, pp. 225-232
-
-
Yang, Q.1
Jagannathan, S.2
-
47
-
-
49049119493
-
A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm
-
Aug
-
H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm," IEEE Trans. Syst. Man Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
-
(2008)
IEEE Trans. Syst. Man Cybern. B, Cybern
, vol.38
, Issue.4
, pp. 937-942
-
-
Zhang, H.1
Wei, Q.2
Luo, Y.3