Volume 20, Issue 9, 2009, Pages 1490-1503

Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints

Author keywords

Adaptive dynamic programming; Approximate dynamic programming; Control constraints; Convergence analysis; Near optimal control; Neural networks

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATE DYNAMIC PROGRAMMING; CONTROL CONSTRAINTS; CONVERGENCE ANALYSIS; NEAR-OPTIMAL CONTROL;

EID: 70349253929     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/TNN.2009.2027233     Document Type: Article
Times cited: 594

References (47)
  • 1
    • 14844340822
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • May
    • M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 3
    • 34548709862
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Honolulu, HI, Apr
    • A. Al-Tamimi and F. L. Lewis, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 38-43.
    • (2007) Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn , pp. 38-43
    • Al-Tamimi, A.1    Lewis, F.L.2
  • 4
    • 49049089962
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Aug
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst. Man Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
    • (2008) IEEE Trans. Syst. Man Cybern. B, Cybern , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 7
    • 0020970738
    • Neuronlike adaptive elements that can solve difficult learning control problems
    • Sep./Oct
    • A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems," IEEE Trans. Syst. Man Cybern., vol. SMC-13, no. 5, pp. 835-846, Sep./Oct. 1983.
    • (1983) IEEE Trans. Syst. Man Cybern , vol.SMC-13 , Issue.5 , pp. 835-846
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 9
    • 0003787146
    • Princeton, NJ: Princeton Univ. Press
    • R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 10
    • 0029403342
    • Optimal nonlinear, but continuous, feedback control of systems with saturating actuators
    • D. S. Bernstein, "Optimal nonlinear, but continuous, feedback control of systems with saturating actuators," Int. J. Control, vol. 62, no. 5, pp. 1209-1216, 1995.
    • (1995) Int. J. Control , vol.62 , Issue.5 , pp. 1209-1216
    • Bernstein, D.S.1
  • 12
    • 39549085591
    • Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete-time systems
    • Jan
    • Z. Chen and S. Jagannathan, "Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete-time systems," IEEE Trans. Neural Netw., vol. 19, no. 1, pp. 90-106, Jan. 2008.
    • (2008) IEEE Trans. Neural Netw , vol.19 , Issue.1 , pp. 90-106
    • Chen, Z.1    Jagannathan, S.2
  • 13
    • 36348986773
    • Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach
    • Nov
    • T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach," IEEE Trans. Neural Netw., vol. 18, no. 6, pp. 1725-1736, Nov. 2007.
    • (2007) IEEE Trans. Neural Netw , vol.18 , Issue.6 , pp. 1725-1736
    • Cheng, T.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 14
    • 61849156138
    • A performance gradient perspective on approximate dynamic programming and its application to partially observable Markov decision processes
    • Munich, Germany, Oct
    • J. Dankert, Y. Lei, and J. Si, "A performance gradient perspective on approximate dynamic programming and its application to partially observable Markov decision processes," in Proc. Int. Symp. Intell. Control, Munich, Germany, Oct. 2006, pp. 458-463.
    • (2006) Proc. Int. Symp. Intell. Control , pp. 458-463
    • Dankert, J.1    Lei, Y.2    Si, J.3
  • 15
    • 0043026775
    • Helicopter trimming and tracking control using direct neural dynamic programming
    • Jul
    • R. Enns and J. Si, "Helicopter trimming and tracking control using direct neural dynamic programming," IEEE Trans. Neural Netw., vol. 14, no. 4, pp. 929-939, Jul. 2003.
    • (2003) IEEE Trans. Neural Netw , vol.14 , Issue.4 , pp. 929-939
    • Enns, R.1    Si, J.2
  • 16
    • 4944255184
    • Online adaptive critic flight control
    • S. Ferrari and R. F. Stengel, "Online adaptive critic flight control," J. Guid. Control Dyn., vol. 27, no. 5, pp. 777-786, 2004.
    • (2004) J. Guid. Control Dyn , vol.27 , Issue.5 , pp. 777-786
    • Ferrari, S.1    Stengel, R.F.2
  • 17
  • 18
    • 0030702730
    • Training strategies for critic and action neural networks in dual heuristic programming method
    • Houston, TX, Jun
    • G. G. Lendaris and C. Paintz, "Training strategies for critic and action neural networks in dual heuristic programming method," in Proc. Int. Joint Conf. Neural Netw., Houston, TX, Jun. 1997, vol. 2, pp. 712-717.
    • (1997) Proc. Int. Joint Conf. Neural Netw , vol.2 , pp. 712-717
    • Lendaris, G.G.1    Paintz, C.2
  • 19
  • 20
    • 34548772562
    • Robust dynamic programming for discounted infinite-horizon Markov decision processes with uncertain stationary transition matrices
    • Honolulu, HI, Apr
    • B. Li and J. Si, "Robust dynamic programming for discounted infinite-horizon Markov decision processes with uncertain stationary transition matrices," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 96-102.
    • (2007) Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn , pp. 96-102
    • Li, B.1    Si, J.2
  • 21
    • 0034548295
    • Convergence analysis of adaptive critic based optimal control
    • Chicago, IL, Jun
    • X. Liu and S. N. Balakrishnan, "Convergence analysis of adaptive critic based optimal control," in Proc. Amer. Control Conf., Chicago, IL, Jun. 2000, pp. 1929-1933.
    • (2000) Proc. Amer. Control Conf , pp. 1929-1933
    • Liu, X.1    Balakrishnan, S.N.2
  • 22
    • 49049108697
    • Adaptive critic learning techniques for engine torque and air-fuel ratio control
    • Aug
    • D. Liu, H. Javaherian, O. Kovalenko, and T. Huang, "Adaptive critic learning techniques for engine torque and air-fuel ratio control," IEEE Trans. Syst. Man Cybern. B, Cybern., vol. 38, no. 4, pp. 988-993, Aug. 2008.
    • (2008) IEEE Trans. Syst. Man Cybern. B, Cybern , vol.38 , Issue.4 , pp. 988-993
    • Liu, D.1    Javaherian, H.2    Kovalenko, O.3    Huang, T.4
  • 23
    • 0034863083
    • Action-dependent adaptive critic designs
    • Washington, DC, Jul
    • D. Liu, X. Xiong, and Y. Zhang, "Action-dependent adaptive critic designs," in Proc. Int. Joint Conf. Neural Netw., Washington, DC, Jul. 2001, vol. 2, pp. 990-995.
    • (2001) Proc. Int. Joint Conf. Neural Netw , vol.2 , pp. 990-995
    • Liu, D.1    Xiong, X.2    Zhang, Y.3
  • 24
    • 34249712124
    • A neural dynamic programming approach for learning control of failure avoidance problems
    • D. Liu and H. Zhang, "A neural dynamic programming approach for learning control of failure avoidance problems," Int. J. Intell. Control Syst., vol. 10, no. 1, pp. 21-32, 2005.
    • (2005) Int. J. Intell. Control Syst , vol.10 , Issue.1 , pp. 21-32
    • Liu, D.1    Zhang, H.2
  • 25
    • 26844483839
    • A self-learning call admission control scheme for CDMA cellular networks
    • Sep
    • D. Liu, Y. Zhang, and H. Zhang, "A self-learning call admission control scheme for CDMA cellular networks," IEEE Trans. Neural Netw., vol. 16, no. 5, pp. 1219-1228, Sep. 2005.
    • (2005) IEEE Trans. Neural Netw , vol.16 , Issue.5 , pp. 1219-1228
    • Liu, D.1    Zhang, Y.2    Zhang, H.3
  • 26
    • 0036996219
    • Optimization of dynamic systems using novel performance functionals
    • Las Vegas, NV, Dec
    • S. E. Lyshevski, "Optimization of dynamic systems using novel performance functionals," in Proc. 41st Conf. Decision Control, Las Vegas, NV, Dec. 2002, pp. 753-758.
    • (2002) Proc. 41st Conf. Decision Control , pp. 753-758
    • Lyshevski, S.E.1
  • 27
    • 84881324637
    • Optimal control of nonlinear continuous-time systems: Design of bounded controllers via generalized nonquadratic functionals
    • Philadelphia, PA, Jun
    • S. E. Lyshevski, "Optimal control of nonlinear continuous-time systems: Design of bounded controllers via generalized nonquadratic functionals," in Proc. Amer. Control Conf., Philadelphia, PA, Jun. 1998, pp. 205-209.
    • (1998) Proc. Amer. Control Conf , pp. 205-209
    • Lyshevski, S.E.1
  • 28
    • 0242627940
    • Nonlinear discrete-time systems: Constrained optimization and application of nonquadratic costs
    • Philadelphia, PA, Jun
    • S. E. Lyshevski, "Nonlinear discrete-time systems: Constrained optimization and application of nonquadratic costs," in Proc. Amer. Control Conf., Philadelphia, PA, Jun. 1998, pp. 3699-3703.
    • (1998) Proc. Amer. Control Conf , pp. 3699-3703
    • Lyshevski, S.E.1
  • 30
    • 33751238181
    • A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
    • Dec
    • R. Padhi, N. Unnikrishnan, X. Wang, and S. N. Balakrishnan, "A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems," Neural Netw., vol. 19, no. 10, pp. 1648-1660, Dec. 2006.
    • (2006) Neural Netw , vol.19 , Issue.10 , pp. 1648-1660
    • Padhi, R.1    Unnikrishnan, N.2    Wang, X.3    Balakrishnan, S.N.4
  • 31
    • 0242337541
    • Adaptive-critic-based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks
    • Sep./Oct
    • J.-W. Park, R. G. Harley, and G. K. Venayagamoorthy, "Adaptive-critic-based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks," IEEE Trans. Ind. Appl., vol. 39, no. 5, pp. 1529-1540, Sep./Oct. 2003.
    • (2003) IEEE Trans. Ind. Appl , vol.39 , Issue.5 , pp. 1529-1540
    • Park, J.-W.1    Harley, R.G.2    Venayagamoorthy, G.K.3
  • 32
    • 0031236002
    • Adaptive critic designs
    • Sep
    • D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
    • (1997) IEEE Trans. Neural Netw , vol.8 , Issue.5 , pp. 997-1007
    • Prokhorov, D.V.1    Wunsch, D.C.2
  • 33
    • 0030104564
    • Control of linear systems with saturating actuators
    • Mar
    • A. Saberi, Z. Lin, and A. Teel, "Control of linear systems with saturating actuators," IEEE Trans. Autom. Control, vol. 41, no. 3, pp. 368-378, Mar. 1996.
    • (1996) IEEE Trans. Autom. Control , vol.41 , Issue.3 , pp. 368-378
    • Saberi, A.1    Lin, Z.2    Teel, A.3
  • 34
    • 0018441647
    • An approximation theory of optimal control for trainable manipulators
    • Mar
    • G. Saridis and C. S. Lee, "An approximation theory of optimal control for trainable manipulators," IEEE Trans. Syst. Man Cybern., vol. SMC-9, no. 2, pp. 152-159, Mar. 1979.
    • (1979) IEEE Trans. Syst. Man Cybern , vol.SMC-9 , Issue.2 , pp. 152-159
    • Saridis, G.1    Lee, C.S.2
  • 35
    • 0039434283
    • Suboptimal control of nonlinear stochastic systems
    • G. N. Saridis and F. Y. Wang, "Suboptimal control of nonlinear stochastic systems," Control-Theory Adv. Technol., vol. 10, no. 4, pp. 847-871, 1994.
    • (1994) Control-Theory Adv. Technol , vol.10 , Issue.4 , pp. 847-871
    • Saridis, G.N.1    Wang, F.Y.2
  • 37
    • 0035273403
    • On-line learning control by association and reinforcement
    • Mar
    • J. Si and Y.-T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
    • (2001) IEEE Trans. Neural Netw , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2
  • 38
    • 0028712602
    • A general result on the stabilization of linear systems using bounded controls
    • Dec
    • H. Sussmann, E. D. Sontag, and Y. Yang, "A general result on the stabilization of linear systems using bounded controls," IEEE Trans. Autom. Control, vol. 39, no. 12, pp. 2411-2425, Dec. 1994.
    • (1994) IEEE Trans. Autom. Control , vol.39 , Issue.12 , pp. 2411-2425
    • Sussmann, H.1    Sontag, E.D.2    Yang, Y.3
  • 39
    • 34047218055
    • Suboptimal control for nonlinear stochastic systems
    • Tucson, AZ, Dec
    • F.-Y. Wang and G. N. Saridis, "Suboptimal control for nonlinear stochastic systems," in Proc. 31st IEEE Conf. Decision Control, Tucson, AZ, Dec. 1992, pp. 1856-1861.
    • (1992) Proc. 31st IEEE Conf. Decision Control , pp. 1856-1861
    • Wang, F.-Y.1    Saridis, G.N.2
  • 40
    • 0004049893
    • Learning from delayed rewards
    • Ph.D. dissertation, Dept. Psychol., Cambridge University, Cambridge, U.K.
    • C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Dept. Psychol., Cambridge University, Cambridge, U.K., 1989.
    • (1989)
    • Watkins, C.1
  • 41
    • 0002011091
    • A menu of designs for reinforcement learning over time
    • W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1991, pp. 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 42
    • 0002031779
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White and D. A. Sofge, Eds. New York: Van Nostrand, ch. 13
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand, 1992, ch. 13.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches
    • Werbos, P.J.1
  • 43
    • 34548766755
    • Using ADP to understand and replicate brain intelligence: The next level design
    • Honolulu, HI, Apr
    • P. J. Werbos, "Using ADP to understand and replicate brain intelligence: The next level design," in Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 209-216.
    • (2007) Proc. IEEE Int. Symp. Approx. Dyn. Programm. Reinforcement Learn , pp. 209-216
    • Werbos, P.J.1
  • 44
    • 0015667648
    • Punish/reward: Learning with a critic in adaptive threshold systems
    • Sep
    • B. Widrow, N. Gupta, and S. Maitra, "Punish/reward: Learning with a critic in adaptive threshold systems," IEEE Trans. Syst. Man Cybern., vol. SMC-3, no. 5, pp. 455-465, Sep. 1973.
    • (1973) IEEE Trans. Syst. Man Cybern , vol.SMC-3 , Issue.5 , pp. 455-465
    • Widrow, B.1    Gupta, N.2    Maitra, S.3
  • 45
    • 34547133970
    • Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks
    • Jul
    • V. Yadav, R. Padhi, and S. N. Balakrishnan, "Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 1115-1128, Jul. 2007.
    • (2007) IEEE Trans. Neural Netw , vol.18 , Issue.4 , pp. 1115-1128
    • Yadav, V.1    Padhi, R.2    Balakrishnan, S.N.3
  • 46
    • 34548730950
    • Online reinforcement learning neural network controller design for nanomanipulation
    • Honolulu, HI, Apr
    • Q. Yang and S. Jagannathan, "Online reinforcement learning neural network controller design for nanomanipulation," in Proc. IEEE Symp. Approx. Dyn. Programm. Reinforcement Learn., Honolulu, HI, Apr. 2007, pp. 225-232.
    • (2007) Proc. IEEE Symp. Approx. Dyn. Programm. Reinforcement Learn , pp. 225-232
    • Yang, Q.1    Jagannathan, S.2
  • 47
    • 49049119493
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm
    • Aug
    • H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm," IEEE Trans. Syst. Man Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
    • (2008) IEEE Trans. Syst. Man Cybern. B, Cybern , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.