Volume 28, Issue 3-5, 2014, Pages 232-254

Online solution of nonquadratic two-player zero-sum games arising in the H∞ control of constrained input systems

Author keywords

Control; H∞; Input constraints; Neural networks; Policy iteration; Two player zero sum games

Indexed keywords

ALGORITHMS; CONTROL; GAME THEORY; ITERATIVE METHODS; NEURAL NETWORKS; NONLINEAR CONTROL SYSTEMS;

EID: 84899093084     PISSN: 08906327     EISSN: 10991115     Source Type: Journal    
DOI: 10.1002/acs.2348     Document Type: Article
Times cited: 118

References (42)
  • 1
    • 0019559036
    • Feedback and optimal sensitivity: Model reference transformations, multiplicative seminorms, and approximate inverses
    • Zames G. Feedback and optimal sensitivity: model reference transformations, multiplicative seminorms, and approximate inverses. IEEE Transactions on Automatic Control 1981; 26 (2):301-320.
    • (1981) IEEE Transactions on Automatic Control , vol.26 , Issue.2 , pp. 301-320
    • Zames, G.1
  • 3
    • 0026883666
    • L2-gain analysis of nonlinear systems and nonlinear state feedback H∞ control
    • Van der Schaft AJ. L2-gain analysis of nonlinear systems and nonlinear state feedback H∞ control. IEEE Transactions on Automatic Control 1992; 37 (6):770-784.
    • (1992) IEEE Transactions on Automatic Control , vol.37 , Issue.6 , pp. 770-784
    • Van Der Schaft, A.J.1
  • 4
    • 0002145750
    • Viscosity solutions of Hamilton-Jacobi equations arising in nonlinear H∞ control
    • Ball J, Helton W. Viscosity solutions of Hamilton-Jacobi equations arising in nonlinear H∞ control. Journal of Mathematical Systems, Estimation, and Control 1996; 6 (1):1-22.
    • (1996) Journal of Mathematical Systems, Estimation, and Control , vol.6 , Issue.1 , pp. 1-22
    • Ball, J.1    Helton, W.2
  • 6
    • 0346207421
    • Global L2-gain design for a class of nonlinear systems
    • Isidori A, Lin W. Global L2-gain design for a class of nonlinear systems. Systems and Control Letters 1998; 34 (5):245-252.
    • (1998) Systems and Control Letters , vol.34 , Issue.5 , pp. 245-252
    • Isidori, A.1    Lin, W.2
  • 9
    • 56549098855
    • Computing the positive stabilizing solution to algebraic Riccati equations with an indefinite quadratic term via a recursive method
    • Lanzon A, Feng Y, Anderson BDO, Rotkowitz M. Computing the positive stabilizing solution to algebraic Riccati equations with an indefinite quadratic term via a recursive method. IEEE Transactions on Automatic Control 2008; 53 (10):2280-2291.
    • (2008) IEEE Transactions on Automatic Control , vol.53 , Issue.10 , pp. 2280-2291
    • Lanzon, A.1    Feng, Y.2    Anderson, B.D.O.3    Rotkowitz, M.4
  • 12
    • 0029371239
    • Numerical approach to computing nonlinear H∞ control laws
    • Huang J, Lin CF. Numerical approach to computing nonlinear H∞ control laws. Journal of Guidance, Control, and Dynamics 1995; 18 (5):989-994.
    • (1995) Journal of Guidance, Control, and Dynamics , vol.18 , Issue.5 , pp. 989-994
    • Huang, J.1    Lin, C.F.2
  • 19
    • 0002031779
    • Approximate dynamic programming for real-time control and neural modeling
    • White DA, Sofge DA (eds). Multiscience Press: New York
    • Werbos PJ. Approximate dynamic programming for real-time control and neural modeling. In Handbook of Intelligent Control, White DA, Sofge DA (eds). Multiscience Press: New York, 1992.
    • (1992) Handbook of Intelligent Control
    • Werbos, P.J.1
  • 21
    • 77950629367
    • Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework
    • Makedonia Palace, Thessaloniki, Greece
    • Vrabie D, Vamvoudakis K, Lewis FL. Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework. In Proceedings of the IEEE Mediterranean Conference on Control and Automation, Makedonia Palace, Thessaloniki, Greece, 2009; 1402-1409.
    • (2009) Proceedings of the IEEE Mediterranean Conference on Control and Automation , pp. 1402-1409
    • Vrabie, D.1    Vamvoudakis, K.2    Lewis, F.L.3
  • 22
    • 0032202335
    • Successive Galerkin approximation algorithms for nonlinear optimal and robust control
    • Beard R, McLain T. Successive Galerkin approximation algorithms for nonlinear optimal and robust control. International Journal of Control 1998; 71 (5):717-743.
    • (1998) International Journal of Control , vol.71 , Issue.5 , pp. 717-743
    • Beard, R.1    McLain, T.2
  • 23
    • 48949116222
    • Neurodynamic programming and zero-sum games for constrained control systems
    • Abu-Khalaf M, Lewis FL, Huang J. Neurodynamic programming and zero-sum games for constrained control systems. IEEE Transactions on Neural Networks 2008; 19 (7):1243-1252.
    • (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.7 , pp. 1243-1252
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 24
    • 78650805234
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Zhang H, Wei Q, Liu D. An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 2011; 47 (1):207-214.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 25
    • 13244279592
    • Robust reinforcement learning
    • Morimoto J, Doya K. Robust reinforcement learning. Neural Computation 2005; 17 (2):335-359.
    • (2005) Neural Computation , vol.17 , Issue.2 , pp. 335-359
    • Morimoto, J.1    Doya, K.2
  • 26
    • 79960443754
    • Adaptive dynamic programming for online solution of a zero-sum differential game
    • Vrabie D, Lewis FL. Adaptive dynamic programming for online solution of a zero-sum differential game. Journal of Control Theory and Applications 2011; 9 (3):353-360.
    • (2011) Journal of Control Theory and Applications , vol.9 , Issue.3 , pp. 353-360
    • Vrabie, D.1    Lewis, F.L.2
  • 29
    • 79960897012
    • Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
    • Vamvoudakis K, Lewis FL. Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations. Automatica 2011; 47 (8):1556-1569.
    • (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
    • Vamvoudakis, K.1    Lewis, F.L.2
  • 30
    • 0030392685
    • Constrained optimization and control of nonlinear systems: New results in optimal control
    • Kobe, Japan
    • Lyshevski SE. Constrained optimization and control of nonlinear systems: new results in optimal control. In Proceedings of IEEE Conference on Decision and Control, Kobe, Japan, 1996; 541-546.
    • (1996) Proceedings of IEEE Conference on Decision and Control , pp. 541-546
    • Lyshevski, S.E.1
  • 31
    • 14844340822
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • Abu-Khalaf M, Lewis FL. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 2005; 41:779-791.
    • (2005) Automatica , vol.41 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 32
    • 79251641699
    • Bounded robust control of nonlinear systems using neural network-based HJB solution
    • Adhyaru D, Kar IN, Gopal M. Bounded robust control of nonlinear systems using neural network-based HJB solution. Neural Computing and Applications 2011; 20 (1):91-103.
    • (2011) Neural Computing and Applications , vol.20 , Issue.1 , pp. 91-103
    • Adhyaru, D.1    Kar, I.N.2    Gopal, M.3
  • 33
    • 33845759425
    • Policy iterations on the Hamilton-Jacobi-Isaacs equations for H∞ state feedback control with input saturation
    • Abu-Khalaf M, Lewis FL, Huang J. Policy iterations on the Hamilton-Jacobi-Isaacs equations for H∞ state feedback control with input saturation. IEEE Transactions on Automatic Control 2006; 51 (12):1989-1995.
    • (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.12 , pp. 1989-1995
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 35
    • 0024866495
    • On the approximate realization of continuous mappings by neural networks
    • Funahashi K. On the approximate realization of continuous mappings by neural networks. Neural Networks 1989; 2:183-192.
    • (1989) Neural Networks , vol.2 , pp. 183-192
    • Funahashi, K.1
  • 36
    • 70349253929
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • Zhang H, Luo Y, Liu D. Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Transactions on Neural Networks 2009; 20 (9):1490-1503.
    • (2009) IEEE Transactions on Neural Networks , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3
  • 37
    • 61849156874
    • A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control
    • Feng Y, Anderson BD, Rotkowitz M. A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control. Automatica 2009; 45 (4):881-888.
    • (2009) Automatica , vol.45 , Issue.4 , pp. 881-888
    • Feng, Y.1    Anderson, B.D.2    Rotkowitz, M.3
  • 38
    • 77950630017
    • Online actor-critic algorithm to solve the continuous infinite time horizon optimal control problem
    • Vamvoudakis K, Lewis FL. Online actor-critic algorithm to solve the continuous infinite time horizon optimal control problem. Automatica 2010; 46:878-888.
    • (2010) Automatica , vol.46 , pp. 878-888
    • Vamvoudakis, K.1    Lewis, F.L.2
  • 41
    • 27844531900
    • Computer-aided design of nonlinear H∞ control law: The benchmark problem
    • Dalian, China
    • Deng F, Huang J. Computer-aided design of nonlinear H∞ control law: the benchmark problem. In Proceedings of Chinese Control Conference, Dalian, China, 2001; 840-845.
    • (2001) Proceedings of Chinese Control Conference , pp. 840-845
    • Deng, F.1    Huang, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.