Volume 44, Issue 8, 2014, Pages 1015-1027

Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics

Author keywords

Adaptive dynamic programming (ADP); approximate dynamic programming; multiplayer nonzero sum games; neural networks; neuro dynamic programming; policy iteration

Indexed keywords

BANACH SPACES; CLOSED LOOP SYSTEMS; CONTINUOUS TIME SYSTEMS; DYNAMIC PROGRAMMING; DYNAMICAL SYSTEMS; E-LEARNING; ITERATIVE METHODS; LEARNING SYSTEMS; NEURAL NETWORKS; ONLINE SYSTEMS;

EID: 84904706555     PISSN: 21682216     EISSN: 21682232     Source Type: Journal    
DOI: 10.1109/TSMC.2013.2295351     Document Type: Article
Times cited : (222)

References (57)
  • 1
    • 0002031779
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White and D. A. Sofge, Eds. New York, NY, USA: Van Nostrand Reinhold, ch. 13
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. New York, NY, USA: Van Nostrand Reinhold, 1992, ch. 13.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Werbos, P.J.1
  • 4
    • 66449130966
    • Adaptive dynamic programming: An introduction
    • May
    • F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intell. Mag., vol. 4, no. 2, pp. 39-47, May 2009.
    • (2009) IEEE Comput. Intell. Mag. , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.Y.1    Zhang, H.2    Liu, D.3
  • 5
    • 70349116541
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • Jul.
    • F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, Jul. 2009.
    • (2009) IEEE Circuits Syst. Mag. , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 6
    • 34248512639
    • Optimal wide area controller and state predictor for a power system
    • DOI 10.1109/TPWRS.2007.895158
    • S. Mohagheghi, G. K. Venayagamoorthy, and R. G. Harley, "Optimal wide area controller and state predictor for a power system," IEEE Trans. Power Syst., vol. 22, no. 2, pp. 693-705, May 2007. (Pubitemid 46746232)
    • (2007) IEEE Transactions on Power Systems , vol.22 , Issue.2 , pp. 693-705
    • Mohagheghi, S.1    Venayagamoorthy, G.K.2    Harley, R.G.3
  • 7
    • 49049108697
    • Adaptive critic learning techniques for engine torque and air-fuel ratio control
    • Aug.
    • D. Liu, H. Javaherian, O. Kovalenko, and T. Huang, "Adaptive critic learning techniques for engine torque and air-fuel ratio control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 988-993, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 988-993
    • Liu, D.1    Javaherian, H.2    Kovalenko, O.3    Huang, T.4
  • 8
    • 26844483839
    • A self-learning call admission control scheme for CDMA cellular networks
    • DOI 10.1109/TNN.2005.853408
    • D. Liu, Y. Zhang, and H. Zhang, "A self-learning call admission control scheme for CDMA cellular networks," IEEE Trans. Neural Netw. (Special Issue on Adaptive Learning Systems in Communication Networks), vol. 16, no. 5, pp. 1219-1228, Sep. 2005. (Pubitemid 41444623)
    • (2005) IEEE Transactions on Neural Networks , vol.16 , Issue.5 , pp. 1219-1228
    • Liu, D.1    Zhang, Y.2    Zhang, H.3
  • 11
    • 0035273403
    • On-line learning control by association and reinforcement
    • DOI 10.1109/72.914523, PII S1045922701014047
    • J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001. (Pubitemid 32371483)
    • (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2
  • 12
    • 84876158475
    • Simple and fast calculation of the second-order gradients for globalized dual heuristic dynamic programming in neural networks
    • Oct.
    • M. Fairbank, E. Alonso, and D. Prokhorov, "Simple and fast calculation of the second-order gradients for globalized dual heuristic dynamic programming in neural networks," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp. 1671-1676, Oct. 2012.
    • (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.7 , pp. 1671-1676
    • Fairbank, M.1    Alonso, E.2    Prokhorov, D.3
  • 13
    • 84875270081
    • Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update
    • Jul.
    • T. Dierks and S. Jagannathan, "Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp. 1118-1129, Jul. 2012.
    • (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.7 , pp. 1118-1129
    • Dierks, T.1    Jagannathan, S.2
  • 14
    • 70349253929
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • Sep.
    • H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
    • (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3
  • 15
    • 78651311269
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • Dec.
    • F. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 1854-1862, Dec. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.12 , pp. 1854-1862
    • Wang, F.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 16
    • 84878421441
    • Optimal control for discrete-time affine nonlinear systems using general value iteration
    • Dec.
    • H. Li and D. Liu, "Optimal control for discrete-time affine nonlinear systems using general value iteration," IET Control Theory Applicat., vol. 6, no. 18, pp. 2725-2736, Dec. 2012.
    • (2012) IET Control Theory Applicat , vol.6 , Issue.18 , pp. 2725-2736
    • Li, H.1    Liu, D.2
  • 17
    • 84862811062
    • An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
    • Aug.
    • Q. Wei and D. Liu, "An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state," Neural Netw., vol. 32, no. 6, pp. 236-244, Aug. 2012.
    • (2012) Neural Netw. , vol.32 , Issue.6 , pp. 236-244
    • Wei, Q.1    Liu, D.2
  • 18
    • 84864489666
    • Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    • Aug.
    • D. Wang, D. Liu, Q. Wei, D. Zhao, and N. Jin, "Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming," Automatica, vol. 48, no. 8, pp. 1825-1832, Aug. 2012.
    • (2012) Automatica , vol.48 , Issue.8 , pp. 1825-1832
    • Wang, D.1    Liu, D.2    Wei, Q.3    Zhao, D.4    Jin, N.5
  • 19
    • 84863467146
    • Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
    • Jul.
    • D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng. , vol.9 , Issue.3 , pp. 628-634
    • Liu, D.1    Wang, D.2    Zhao, D.3    Wei, Q.4    Jin, N.5
  • 20
    • 82755160758
    • Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
    • Feb.
    • D. Wang, D. Liu, and Q. Wei, "Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach," Neurocomputing, vol. 78, no. 1, pp. 14-22, Feb. 2012.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 14-22
    • Wang, D.1    Liu, D.2    Wei, Q.3
  • 21
    • 84868467610
    • An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
    • Jan.
    • D. Liu, D. Wang, and X. Yang, "An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs," Inf. Sci., vol. 220, pp. 331-342, Jan. 2013.
    • (2013) Inf. Sci. , vol.220 , pp. 331-342
    • Liu, D.1    Wang, D.2    Yang, X.3
  • 22
    • 84872617336
    • A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints
    • Feb.
    • D. Wang, D. Liu, D. Zhao, Y. Huang, and D. Zhang, "A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints," Neural Comput. Applicat., vol. 22, no. 2, pp. 219-227, Feb. 2013.
    • (2013) Neural Comput. Applicat. , vol.22 , Issue.2 , pp. 219-227
    • Wang, D.1    Liu, D.2    Zhao, D.3    Huang, Y.4    Zhang, D.5
  • 23
    • 84881555023
    • Finite-approximation-error based optimal control approach for discrete-time nonlinear systems
    • Apr.
    • D. Liu and Q. Wei, "Finite-approximation-error based optimal control approach for discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 43, no. 2, pp. 779-789, Apr. 2013.
    • (2013) IEEE Trans. Cybern. , vol.43 , Issue.2 , pp. 779-789
    • Liu, D.1    Wei, Q.2
  • 25
    • 79551685808
    • Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
    • Feb.
    • F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 41, no. 1, pp. 14-25, Feb. 2011.
    • (2011) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.41 , Issue.1 , pp. 14-25
    • Lewis, F.L.1    Vamvoudakis, K.G.2
  • 26
    • 84859001250
    • Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators
    • Apr.
    • Q. Yang and S. Jagannathan, "Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 2, pp. 377-390, Apr. 2012.
    • (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.42 , Issue.2 , pp. 377-390
    • Yang, Q.1    Jagannathan, S.2
  • 27
    • 84857501996
    • Experience replay for real-time reinforcement learning control
    • Mar.
    • S. Adam, L. Busoniu, and R. Babuska, "Experience replay for real-time reinforcement learning control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 2, pp. 201-212, Mar. 2012.
    • (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.42 , Issue.2 , pp. 201-212
    • Adam, S.1    Busoniu, L.2    Babuska, R.3
  • 31
    • 14844340822
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • DOI 10.1016/j.automatica.2004.11.034, PII S0005109805000105
    • M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005. (Pubitemid 40352391)
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 32
    • 33846781133
    • A neural network solution for fixed-final time optimal control of nonlinear systems
    • DOI 10.1016/j.automatica.2006.09.021, PII S0005109806004250
    • T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "A neural network solution for fixed-final time optimal control of nonlinear systems," Automatica, vol. 43, no. 3, pp. 482-490, Mar. 2007. (Pubitemid 46209051)
    • (2007) Automatica , vol.43 , Issue.3 , pp. 482-490
    • Cheng, T.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 33
    • 58349110975
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • Feb.
    • D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, Feb. 2009.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.L.4
  • 34
    • 67349145396
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • Apr.
    • D. Vrabie and F. L. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, Apr. 2009.
    • (2009) Neural Netw. , vol.22 , Issue.3 , pp. 237-246
    • Vrabie, D.1    Lewis, F.L.2
  • 35
    • 77950630017
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • May
    • K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 36
    • 83655163786
    • Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
    • Dec.
    • H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2226-2236, Dec. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.12 , pp. 2226-2236
    • Zhang, H.1    Cui, L.2    Zhang, X.3    Luo, Y.4
  • 37
    • 79953151751
    • A model-free robust policy iteration algorithm for optimal control of nonlinear systems
    • Dec.
    • S. Bhasin, M. Johnson, and W. E. Dixon, "A model-free robust policy iteration algorithm for optimal control of nonlinear systems," in Proc. 49th IEEE Conf. Decision Control, Dec. 2010, pp. 3060-3065.
    • (2010) Proc. 49th IEEE Conf. Decision Control , pp. 3060-3065
    • Bhasin, S.1    Johnson, M.2    Dixon, W.E.3
  • 42
    • 33846781129
    • Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
    • DOI 10.1016/j.automatica.2006.09.019, PII S0005109806004249
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Model-free Q-learning designs for linear discrete-time zero-sum games with application to H∞ control," Automatica, vol. 43, no. 3, pp. 473-481, Mar. 2007. (Pubitemid 46209050)
    • (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 43
    • 79959413200
    • Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks
    • Jul.
    • S. Mehraeen, T. Dierks, S. Jagannathan, and M. L. Crow, "Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks," in Proc. Int. Joint Conf. Neural Netw., Jul. 2010, pp. 1-8.
    • (2010) Proc. Int. Joint Conf. Neural Netw. , pp. 1-8
    • Mehraeen, S.1    Dierks, T.2    Jagannathan, S.3    Crow, M.L.4
  • 44
    • 84865090343
    • H∞ control of unknown discrete-time nonlinear systems with control constraints using adaptive dynamic programming
    • Jun.
    • D. Liu, H. Li, and D. Wang, "H∞ control of unknown discrete-time nonlinear systems with control constraints using adaptive dynamic programming," in Proc. Int. Joint Conf. Neural Netw., Jun. 2012, pp. 3056-3061.
    • (2012) Proc. Int. Joint Conf. Neural Netw. , pp. 3056-3061
    • Liu, D.1    Li, H.2    Wang, D.3
  • 45
    • 79959444159
    • Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game
    • Jul.
    • D. Vrabie and F. L. Lewis, "Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game," in Proc. Int. Joint Conf. Neural Netw., Jul. 2010, pp. 1-8.
    • (2010) Proc. Int. Joint Conf. Neural Netw. , pp. 1-8
    • Vrabie, D.1    Lewis, F.L.2
  • 46
    • 33845759425
    • Policy iterations and the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation
    • DOI 10.1109/TAC.2006.884959
    • M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Policy iterations and the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation," IEEE Trans. Autom. Control, vol. 51, no. 12, pp. 1989-1995, Dec. 2006. (Pubitemid 46002295)
    • (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.12 , pp. 1989-1995
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 47
    • 48949116222
    • Neurodynamic programming and zero-sum games for constrained control systems
    • Jul.
    • M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Neurodynamic programming and zero-sum games for constrained control systems," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1243-1252, Jul. 2008.
    • (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.7 , pp. 1243-1252
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 48
    • 78650805234
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Jan.
    • H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, Jan. 2011.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 49
    • 84864463039
    • Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
    • K. G. Vamvoudakis and F. L. Lewis, "Online solution of nonlinear two-player zero-sum games using synchronous policy iteration," Int. J. Robust Nonlinear Control, vol. 22, no. 13, pp. 1460-1483, 2012.
    • (2012) Int. J. Robust Nonlinear Control , vol.22 , Issue.13 , pp. 1460-1483
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 50
    • 79953143055
    • Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation
    • Dec.
    • T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation," in Proc. IEEE Conf. Decision Control, Dec. 2010, pp. 3048-3053.
    • (2010) Proc. IEEE Conf. Decision Control , pp. 3048-3053
    • Dierks, T.1    Jagannathan, S.2
  • 51
    • 84876909440
    • Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control
    • Dec.
    • H. Wu and B. Luo, "Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 12, pp. 1884-1895, Dec. 2012.
    • (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.12 , pp. 1884-1895
    • Wu, H.1    Luo, B.2
  • 52
    • 84860670757
    • Nonlinear two-player zero-sum game approximate solution using a policy iteration algorithm
    • Dec.
    • M. Johnson, S. Bhasin, and W. E. Dixon, "Nonlinear two-player zero-sum game approximate solution using a policy iteration algorithm," in Proc. Conf. Decision Control Eur. Control Conf., Dec. 2011, pp. 142-147.
    • (2011) Proc. Conf. Decision Control Eur. Control Conf. , pp. 142-147
    • Johnson, M.1    Bhasin, S.2    Dixon, W.E.3
  • 53
    • 0030086666
    • On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games
    • PII S001892869600983X
    • G. Freiling, G. Jank, and H. Abou-Kandil, "On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games," IEEE Trans. Autom. Control, vol. 41, no. 2, pp. 264-269, Feb. 1996. (Pubitemid 126768300)
    • (1996) IEEE Transactions on Automatic Control , vol.41 , Issue.2 , pp. 264-269
    • Freiling, G.1    Jank, G.2    Abou-Kandil, H.3
  • 54
    • 79953133535
    • Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games
    • Dec.
    • D. Vrabie and F. L. Lewis, "Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games," in Proc. 49th IEEE Conf. Decision Control, Dec. 2010, pp. 3066-3071.
    • (2010) Proc. 49th IEEE Conf. Decision Control , pp. 3066-3071
    • Vrabie, D.1    Lewis, F.L.2
  • 55
    • 79960897012
    • Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
    • Aug.
    • K. G. Vamvoudakis and F. L. Lewis, "Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, Aug. 2011.
    • (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
    • Vamvoudakis, K.G.1    Lewis, F.L.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.