



Volume , Issue , 2010, Pages 3040-3047

Online solution of nonlinear two-player zero-sum games using synchronous policy iteration

Author keywords

Approximate dynamic programming; H infinity; Hamilton Jacobi Isaacs equation; Nash equilibrium; Persistence of excitation; Policy iteration; Synchronous zero sum game policy iteration

Indexed keywords

ADAPTIVE ALGORITHMS; ADAPTIVE CONTROL SYSTEMS; CONTINUOUS TIME SYSTEMS; DYNAMIC PROGRAMMING; GAME THEORY; ONLINE SYSTEMS; OPTIMAL SYSTEMS; SYSTEM STABILITY;

EID: 79953155097     PISSN: 07431546     EISSN: 25762370     Source Type: Conference Proceeding    
DOI: 10.1109/CDC.2010.5717607     Document Type: Conference Paper
Times cited: 35

References (31)
  • 1
    • 14844340822
    • Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach
    • M. Abu-Khalaf, F. L. Lewis, "Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach", Automatica, vol. 41, no. 5, pp. 779-791, 2005.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 2
    • 48949116222
    • Neurodynamic Programming and Zero-Sum Games for Constrained Control Systems
    • M. Abu-Khalaf, F. L. Lewis, "Neurodynamic Programming and Zero-Sum Games for Constrained Control Systems," IEEE Transactions on Neural Networks, vol. 19, no. 7, pp. 1243-1252, 2008.
    • (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.7 , pp. 1243-1252
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 10
    • 61849156874
    • A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control
    • Y. Feng, B. D. Anderson, M. Rotkowitz, "A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control," Automatica, vol. 45, no. 4, pp. 881-888, 2009.
    • (2009) Automatica , vol.45 , Issue.4 , pp. 881-888
    • Feng, Y.1    Anderson, B.D.2    Rotkowitz, M.3
  • 12
    • 0025627940
    • Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks
    • K. Hornik, M. Stinchcombe, H. White, "Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks," Neural Networks, vol. 3, pp. 551-560, 1990.
    • (1990) Neural Networks , vol.3 , pp. 551-560
    • Hornik, K.1    Stinchcombe, M.2    White, H.3
  • 15
    • 84914965022
    • On an Iterative Technique for Riccati Equation Computations
    • February
    • D. Kleinman, "On an Iterative Technique for Riccati Equation Computations," IEEE Transactions on Automatic Control, vol. 13, pp. 114-115, February 1968.
    • (1968) IEEE Transactions on Automatic Control , vol.13 , pp. 114-115
    • Kleinman, D.1
  • 22
    • 0029533197
    • Nonsmooth control Lyapunov functions
    • E. D. Sontag, H. J. Sussman, "Nonsmooth control Lyapunov functions," IEEE Proc. CDC95, pp. 2799-2805, 1995.
    • (1995) IEEE Proc. CDC95 , pp. 2799-2805
    • Sontag, E.D.1    Sussman, H.J.2
  • 26
    • 77950630017
    • Online Actor-Critic Algorithm to Solve the Continuous-Time Infinite Horizon Optimal Control Problem
    • K. G. Vamvoudakis, F. L. Lewis, "Online Actor-Critic Algorithm to Solve the Continuous-Time Infinite Horizon Optimal Control Problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 27
    • 70449382072
    • Online Actor Critic Algorithm to solve the Continuous-Time Infinite Horizon Optimal Control Problem
    • Atlanta, June
    • K. G. Vamvoudakis and F. L. Lewis, "Online Actor Critic Algorithm to solve the Continuous-Time Infinite Horizon Optimal Control Problem," Proc. Int. Joint Conf. on Neural Networks, pp. 3180-3187, Atlanta, June 2009.
    • (2009) Proc. Int. Joint Conf. on Neural Networks , pp. 3180-3187
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 29
    • 77953770221
    • Ph.D. Thesis, Dept. of Electrical Engineering, Univ. Texas at Arlington, Arlington, TX, USA
    • D. Vrabie, Online Adaptive Optimal Control for Continuous Time Systems, Ph.D. Thesis, Dept. of Electrical Engineering, Univ. Texas at Arlington, Arlington, TX, USA, 2009.
    • (2009) Online Adaptive Optimal Control for Continuous Time Systems
    • Vrabie, D.1
  • 31
    • 0002031779
    • Approximate dynamic programming for real-time control and neural modeling
    • ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," Handbook of Intelligent Control, ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold, 1992.
    • (1992) Handbook of Intelligent Control
    • Werbos, P.J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.