메뉴 건너뛰기




Volumn 28, Issue 3-5, 2014, Pages 205-231

Multiperson zero-sum differential games for a class of uncertain nonlinear systems

Author keywords

Adaptive dynamic programming; Multiperson zero sum differential games; Neural networks; Uncertain nonlinear systems

Indexed keywords

CONTROL; DYNAMIC PROGRAMMING; GAME THEORY; NEURAL NETWORKS; NONLINEAR SYSTEMS; OPTIMAL SYSTEMS; STABILITY;

EID: 84899122972     PISSN: 08906327     EISSN: 10991115     Source Type: Journal    
DOI: 10.1002/acs.2349     Document Type: Article
Times cited : (25)

References (41)
  • 1
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • White DA, Sofge DA (eds) Van Nostrand Reinhold: New York, ch. 13
    • Werbos PJ. Approximate dynamic programming for real-time control and neural modeling. In Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, White DA, Sofge DA (eds). Van Nostrand Reinhold: New York, 1992. ch. 13.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Werbos, P.J.1
  • 5
    • 0043026775 scopus 로고    scopus 로고
    • Helicopter trimming and tracking control using direct neural dynamic programming
    • Enns R, Si J. Helicopter trimming and tracking control using direct neural dynamic programming. IEEE Transactions on Neural Networks 2003; 14 (4):929-939.
    • (2003) IEEE Transactions on Neural Networks , vol.14 , Issue.4 , pp. 929-939
    • Enns, R.1    Si, J.2
  • 7
    • 14844340822 scopus 로고    scopus 로고
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • Abu-Khalaf M, Lewis FL. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 2005; 41 (5):779-791.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 8
    • 77950630017 scopus 로고    scopus 로고
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • Vamvoudakis KG, Lewis FL. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 2010; 46 (5):878-888.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 9
    • 82755160758 scopus 로고    scopus 로고
    • Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
    • Wang D, Liu D, Wei Q. Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach. Neurocomputing 2012; 78 (1):14-22.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 14-22
    • Wang, D.1    Liu, D.2    Wei, Q.3
  • 10
    • 78650805234 scopus 로고    scopus 로고
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Zhang H, Wei Q, Liu D. An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 2011; 47 (1):207-214.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 11
    • 49049119493 scopus 로고    scopus 로고
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • Zhang H, Wei Q, Luo Y. A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics 2008; 38 (4):937-942.
    • (2008) IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 12
    • 70349116541 scopus 로고    scopus 로고
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • Lewis FL, Vrabie D. Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits and Systems Magazine 2009; 9 (3):32-50.
    • (2009) IEEE Circuits and Systems Magazine , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 13
    • 78651311269 scopus 로고    scopus 로고
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with-error bound
    • Wang FY, Jin N, Liu D, Wei Q. Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with-error bound. IEEE Transactions on Neural Networks 2011; 22 (1):24-36.
    • (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.1 , pp. 24-36
    • Wang, F.Y.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 16
    • 82655173881 scopus 로고    scopus 로고
    • A three-network architecture for on-line learning and optimization based on adaptive dynamic programming
    • He H, Ni Z, Fu J. A three-network architecture for on-line learning and optimization based on adaptive dynamic programming. Neurocomputing 2012; 78 (1):3-13.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 3-13
    • He, H.1    Ni, Z.2    Fu, J.3
  • 17
    • 79551685808 scopus 로고    scopus 로고
    • Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
    • Lewis FL, Vamvoudakis KG. Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data. IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics 2011; 41 (1):14-25.
    • (2011) IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics , vol.41 , Issue.1 , pp. 14-25
    • Lewis, F.L.1    Vamvoudakis, K.G.2
  • 18
    • 26844483839 scopus 로고    scopus 로고
    • A self-learning call admission control scheme for CDMA cellular networks
    • Liu D, Zhang Y, Zhang H. A self-learning call admission control scheme for CDMA cellular networks. IEEE Transactions on Neural Networks 2005; 16 (5):1219-1228.
    • (2005) IEEE Transactions on Neural Networks , vol.16 , Issue.5 , pp. 1219-1228
    • Liu, D.1    Zhang, Y.2    Zhang, H.3
  • 19
    • 79960897012 scopus 로고    scopus 로고
    • Multi-Player non zero sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
    • Vamvoudakis KG, Lewis FL. Multi-Player non zero sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations. Automatica 2011; 47 (8):1556-1569.
    • (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 20
    • 61849184281 scopus 로고    scopus 로고
    • Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
    • Wei Q, Zhang H, Dai J. Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions. Neurocomputing 2009; 72 (7-9):1839-1848.
    • (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1839-1848
    • Wei, Q.1    Zhang, H.2    Dai, J.3
  • 21
    • 84861202999 scopus 로고    scopus 로고
    • Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence
    • Zhang X, Zhang H, Sun Q, Luo Y. Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence. Neurocomputing 2012; 91 (15):48-55.
    • (2012) Neurocomputing , vol.91 , Issue.15 , pp. 48-55
    • Zhang, X.1    Zhang, H.2    Sun, Q.3    Luo, Y.4
  • 27
    • 33845759425 scopus 로고    scopus 로고
    • Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation
    • Abu-Khalaf M, Lewis FL, Huang J. Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation. IEEE Transactions on Automatic Control 2006; 51 (12):1989-1995.
    • (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.12 , pp. 1989-1995
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 28
    • 48949116222 scopus 로고    scopus 로고
    • Neurodynamic programming and zero-sum games for constrained control systems
    • Abu-Khalaf M, Lewis FL, Huang J. Neurodynamic programming and zero-sum games for constrained control systems. IEEE Transactions on Neural Networks 2008; 19 (7):1243-1252.
    • (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.7 , pp. 1243-1252
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 29
    • 33846781129 scopus 로고    scopus 로고
    • Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
    • Al-Tamimi A, Lewis FL, Abu-Khalaf M. Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control. Automatica 2007; 43 (3):473-481.
    • (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 30
    • 80053055740 scopus 로고    scopus 로고
    • Nonlinear multi-person zero-sum differential games using iterative adaptive dynamic programming
    • Yantai, China
    • Wei Q, Liu D, Nonlinear multi-person zero-sum differential games using iterative adaptive dynamic programming. The 30th Chinese Control Conference, Yantai, China, 2011; 2456-2461.
    • (2011) The 30th Chinese Control Conference , pp. 2456-2461
    • Wei, Q.1    Liu, D.2
  • 31
    • 0024125008 scopus 로고
    • High gain observers applied to problems in the stabilization of uncertain linear systems, disturbance attenuation and N∞ optimization
    • Petersen IR, Hollot CV. High gain observers applied to problems in the stabilization of uncertain linear systems, disturbance attenuation and N∞ optimization. International Journal of Adaptive Control and Signal Processing 1988; 2 (4):347-369.
    • (1988) International Journal of Adaptive Control and Signal Processing , vol.2 , Issue.4 , pp. 347-369
    • Petersen, I.R.1    Hollot, C.V.2
  • 35
    • 33751238181 scopus 로고    scopus 로고
    • A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
    • Padhi R, Unnikrishnan N, Wang X, Balakrishman SN. A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems. Neural Networks 2006; 19 (10):1648-1660.
    • (2006) Neural Networks , vol.19 , Issue.10 , pp. 1648-1660
    • Padhi, R.1    Unnikrishnan, N.2    Wang, X.3    Balakrishman, S.N.4
  • 40
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • Si J, Wang YT. On-line learning control by association and reinforcement. IEEE Transactions on Neural Networks 2001; 12 (2):264-275.
    • (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-275
    • Si, J.1    Wang, Y.T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.