메뉴 건너뛰기




Volumn 22, Issue 2, 2013, Pages 219-227

A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints

Author keywords

Adaptive critic designs; Adaptive dynamic programming; Approximate dynamic programming; Neural dynamic programming; Neural networks; Optimal control; Reinforcement learning

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; CELLULAR RADIO SYSTEMS; CONSTRAINED OPTIMIZATION; COST FUNCTIONS; DIGITAL CONTROL SYSTEMS; DISCRETE TIME CONTROL SYSTEMS; FUNCTIONAL PROGRAMMING; HEURISTIC PROGRAMMING; ITERATIVE METHODS; NEURAL NETWORKS; OPTIMAL CONTROL SYSTEMS; REINFORCEMENT LEARNING;

EID: 84872617336     PISSN: 09410643     EISSN: None     Source Type: Journal    
DOI: 10.1007/s00521-011-0707-2     Document Type: Article
Times cited : (32)

References (38)
  • 1
    • 46249099270 scopus 로고    scopus 로고
    • On near optimal neural control of multiple-input nonlinear systems
    • Chen D, Yang J, Mohler RR (2008) On near optimal neural control of multiple-input nonlinear systems. Neural Comput Appl 17(4): 327-337.
    • (2008) Neural Comput Appl , vol.17 , Issue.4 , pp. 327-337
    • Chen, D.1    Yang, J.2    Mohler, R.R.3
  • 2
    • 0030392685 scopus 로고    scopus 로고
    • Constrained optimization and control of nonlinear systems: new results in optimal control
    • Kobe, Japan
    • Lyshevski SE (1996) Constrained optimization and control of nonlinear systems: new results in optimal control. In: Proceedings of the 35th IEEE conference on decision and control, Kobe, Japan, pp 541-546.
    • (1996) Proceedings of the 35th IEEE conference on decision and control , pp. 541-546
    • Lyshevski, S.E.1
  • 3
    • 0242627940 scopus 로고    scopus 로고
    • Nonlinear discrete-time systems: constrained optimization and application of nonquadratic costs
    • Philadelphia
    • Lyshevski SE (1998) Nonlinear discrete-time systems: constrained optimization and application of nonquadratic costs. In: Proceedings of the American control conference, Philadelphia, pp 3699-3703.
    • (1998) Proceedings of the American control conference , pp. 3699-3703
    • Lyshevski, S.E.1
  • 4
    • 0003787146 scopus 로고
    • Princeton: Princeton University Press
    • Bellman RE (1957) Dynamic programming. Princeton University Press, Princeton.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 7
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White and D. A. Sofge (Eds.), New York: Van Nostrand Reinhold
    • Werbos PJ (1992) Approximate dynamic programming for real-time control and neural modeling. In: White DA, Sofge DA (eds) Handbook of intelligent control: neural, fuzzy, and adaptive approaches. van Nostrand Reinhold, New York, pp 493-525.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches , pp. 493-525
    • Werbos, P.J.1
  • 8
    • 49049091767 scopus 로고    scopus 로고
    • ADP: The key direction for future research in intelligent control and understanding brain intelligence
    • Werbos PJ (2008) ADP: The key direction for future research in intelligent control and understanding brain intelligence. IEEE Trans Syst Man Cybern B Cybern 38(4): 898-900.
    • (2008) IEEE Trans Syst Man Cybern B Cybern , vol.38 , Issue.4 , pp. 898-900
    • Werbos, P.J.1
  • 9
    • 67349247013 scopus 로고    scopus 로고
    • Intelligence in the brain: a theory of how it works and how to build it
    • Werbos PJ (2009) Intelligence in the brain: a theory of how it works and how to build it. Neural Netw 22(3): 200-212.
    • (2009) Neural Netw , vol.22 , Issue.3 , pp. 200-212
    • Werbos, P.J.1
  • 11
    • 66449130966 scopus 로고    scopus 로고
    • Adaptive dynamic programming: an introduction
    • Wang FY, Zhang H, Liu D (2009) Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag 4(2): 39-47.
    • (2009) IEEE Comput Intell Mag , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.Y.1    Zhang, H.2    Liu, D.3
  • 12
    • 70349116541 scopus 로고    scopus 로고
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • Lewis FL, Vrabie D (2009) Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits Syst Mag 9(3): 32-50.
    • (2009) IEEE Circuits Syst Mag , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 15
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • Si J, Wang YT (2001) On-line learning control by association and reinforcement. IEEE Trans Neural Netw 12(2): 264-276.
    • (2001) IEEE Trans Neural Netw , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 16
    • 34249712124 scopus 로고    scopus 로고
    • A neural dynamic programming approach for learning control of failure avoidance problems
    • Liu D, Zhang H (2005) A neural dynamic programming approach for learning control of failure avoidance problems. Int J Intell Control Syst 10(1): 21-32.
    • (2005) Int J Intell Control Syst , vol.10 , Issue.1 , pp. 21-32
    • Liu, D.1    Zhang, H.2
  • 21
    • 0036565019 scopus 로고    scopus 로고
    • Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
    • Venayagamoorthy GK, Harley RG, Wunsch DC (2002) Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator. IEEE Trans Neural Netw 13(3): 764-773.
    • (2002) IEEE Trans Neural Netw , vol.13 , Issue.3 , pp. 764-773
    • Venayagamoorthy, G.K.1    Harley, R.G.2    Wunsch, D.C.3
  • 22
    • 0242443337 scopus 로고    scopus 로고
    • Implementation of adaptive critic-based neurocontrollers for turbogenerators in a multimachine power system
    • Venayagamoorthy GK, Harley RG, Wunsch DC (2003) Implementation of adaptive critic-based neurocontrollers for turbogenerators in a multimachine power system. IEEE Trans Neural Netw 14(5): 1047-1064.
    • (2003) IEEE Trans Neural Netw , vol.14 , Issue.5 , pp. 1047-1064
    • Venayagamoorthy, G.K.1    Harley, R.G.2    Wunsch, D.C.3
  • 23
    • 17644391408 scopus 로고    scopus 로고
    • Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor
    • Yen GG, Delima PG (2005) Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor. IEEE Trans Autom Sci Eng 2(2): 121-131.
    • (2005) IEEE Trans Autom Sci Eng , vol.2 , Issue.2 , pp. 121-131
    • Yen, G.G.1    Delima, P.G.2
  • 24
    • 57749111482 scopus 로고    scopus 로고
    • Neural-network-based state feedback control of a nonlinear discrete-time system in nonstrict feedback form
    • Jagannathan S, He P (2008) Neural-network-based state feedback control of a nonlinear discrete-time system in nonstrict feedback form. IEEE Trans Neural Netw 19(12): 2073-2087.
    • (2008) IEEE Trans Neural Netw , vol.19 , Issue.12 , pp. 2073-2087
    • Jagannathan, S.1    He, P.2
  • 25
    • 33846781133 scopus 로고    scopus 로고
    • A neural network solution for fixed-final time optimal control of nonlinear systems
    • Cheng T, Lewis FL, Abu-Khalaf M (2007) A neural network solution for fixed-final time optimal control of nonlinear systems. Automatica 43(3): 482-490.
    • (2007) Automatica , vol.43 , Issue.3 , pp. 482-490
    • Cheng, T.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 26
    • 0030196717 scopus 로고    scopus 로고
    • Adaptive-critic based neural networks for aircraft optimal control
    • Balakrishnan SN, Biega V (1996) Adaptive-critic based neural networks for aircraft optimal control. J Guid Control Dyn 19(4): 893-898.
    • (1996) J Guid Control Dyn , vol.19 , Issue.4 , pp. 893-898
    • Balakrishnan, S.N.1    Biega, V.2
  • 28
    • 0036641793 scopus 로고    scopus 로고
    • State-constrained agile missile control with adaptive critic-based neural networks
    • Han D, Balakrishnan SN (2002) State-constrained agile missile control with adaptive critic-based neural networks. IEEE Trans Control Syst Technol 10(4): 481-489.
    • (2002) IEEE Trans Control Syst Technol , vol.10 , Issue.4 , pp. 481-489
    • Han, D.1    Balakrishnan, S.N.2
  • 29
    • 49049089962 scopus 로고    scopus 로고
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof
    • Al-Tamimi A, Lewis FL, Abu-Khalaf M (2008) Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Trans Syst Man Cybern B Cybern 38(4): 943-949.
    • (2008) IEEE Trans Syst Man Cybern B Cybern , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 30
    • 49049119493 scopus 로고    scopus 로고
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • Zhang H, Wei Q, Luo Y (2008) A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Trans Syst Man Cybern B Cybern 38(4): 937-942.
    • (2008) IEEE Trans Syst Man Cybern B Cybern , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 31
    • 58349110975 scopus 로고    scopus 로고
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • Vrabie D, Pastravanu O, Abu-Khalaf M, Lewis FL (2009) Adaptive optimal control for continuous-time linear systems based on policy iteration. Automatica 45(2): 477-484.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.L.4
  • 33
    • 14844340822 scopus 로고    scopus 로고
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • Abu-Khalaf M, Lewis FL (2005) Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 41(5): 779-791.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 34
    • 70349253929 scopus 로고    scopus 로고
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • Zhang H, Luo Y, Liu D (2009) Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Trans Neural Netw 20(9): 1490-1503.
    • (2009) IEEE Trans Neural Netw , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3
  • 35
    • 77950630017 scopus 로고    scopus 로고
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5): 878-888.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 36
    • 78650805234 scopus 로고    scopus 로고
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Zhang H, Wei Q, Liu D (2011) An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 47(1): 207-214.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 37
    • 78649933699 scopus 로고    scopus 로고
    • Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming
    • Song R, Zhang H, Luo Y, Wei Q (2010) Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming. Neurocomputing 73(16-18): 3020-3027.
    • (2010) Neurocomputing , vol.73 , Issue.16-18 , pp. 3020-3027
    • Song, R.1    Zhang, H.2    Luo, Y.3    Wei, Q.4
  • 38
    • 46249095687 scopus 로고    scopus 로고
    • Neurodynamic programming: a case study of the traveling salesman problem
    • Ma J, Yang T, Hou ZG, Tan M, Liu D (2008) Neurodynamic programming: a case study of the traveling salesman problem. Neural Comput Appl 17(4): 347-355.
    • (2008) Neural Comput Appl , vol.17 , Issue.4 , pp. 347-355
    • Ma, J.1    Yang, T.2    Hou, Z.G.3    Tan, M.4    Liu, D.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.