SCOPUS Information Retrieval Platform

IEEE Transactions on Systems, Man, and Cybernetics: Systems

Volume 44, Issue 8, 2014, Pages 1015-1027

Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics

(3) Liu, Derong; Li, Hongliang; Wang, Ding

Author keywords

Adaptive dynamic programming (ADP); approximate dynamic programming; multiplayer nonzero sum games; neural networks; neuro dynamic programming; policy iteration

Indexed keywords

BANACH SPACES; CLOSED LOOP SYSTEMS; CONTINUOUS TIME SYSTEMS; DYNAMIC PROGRAMMING; DYNAMICAL SYSTEMS; E-LEARNING; ITERATIVE METHODS; LEARNING SYSTEMS; NEURAL NETWORKS; ONLINE SYSTEMS;

ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATE DYNAMIC PROGRAMMING; MULTIPLAYERS; NEURO DYNAMIC PROGRAMMING; POLICY ITERATION;

LEARNING ALGORITHMS;

EID: 84904706555 PISSN: 21682216 EISSN: 21682232 Source Type: Journal
DOI: 10.1109/TSMC.2013.2295351 Document Type: Article

Times cited : (222)

References (57)

1
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- D. A. White and D. A. Sofge, Eds. New York, NY, USA: Van Nostrand Reinhold, ch. 13
- P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. New York, NY, USA: Van Nostrand Reinhold, 1992, ch. 13.
- (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
- Werbos, P.J.¹

2
- 0003487482
- Belmont, MA, USA: Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA, USA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

3
- 84921399937
- New York, NY, USA: IEEE Press/Wiley
- J. Si, A. G. Barto, W. B. Powell, and D. C. Wunsch, Eds., Handbook of Learning and Approximate Dynamic Programming. New York, NY, USA: IEEE Press/Wiley, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.C.⁴

4
- 66449130966
- Adaptive dynamic programming: An introduction
- May
- F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intell. Mag., vol. 4, no. 2, pp. 39-47, May 2009.
- (2009) IEEE Comput. Intell. Mag. , vol.4 , Issue.2 , pp. 39-47
- Wang, F.Y.¹ Zhang, H.² Liu, D.³

5
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- Jul.
- F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, Jul. 2009.
- (2009) IEEE Circuits Syst. Mag. , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

6
- 34248512639
- Optimal wide area controller and state predictor for a power system
- DOI 10.1109/TPWRS.2007.895158
- S. Mohagheghi, G. K. Venayagamoorthy, and R. G. Harley, "Optimal wide area controller and state predictor for a power system," IEEE Trans. Power Syst., vol. 22, no. 2, pp. 693-705, May 2007. (Pubitemid 46746232)
- (2007) IEEE Transactions on Power Systems , vol.22 , Issue.2 , pp. 693-705
- Mohagheghi, S.¹ Venayagamoorthy, G.K.² Harley, R.G.³

7
- 49049108697
- Adaptive critic learning techniques for engine torque and air-fuel ratio control
- Aug.
- D. Liu, H. Javaherian, O. Kovalenko, and T. Huang, "Adaptive critic learning techniques for engine torque and air-fuel ratio control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 988-993, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 988-993
- Liu, D.¹ Javaherian, H.² Kovalenko, O.³ Huang, T.⁴

8
- 26844483839
- A self-learning call admission control scheme for CDMA cellular networks
- DOI 10.1109/TNN.2005.853408
- D. Liu, Y. Zhang, and H. Zhang, "A self-learning call admission control scheme for CDMA cellular networks," IEEE Trans. Neural Netw. (Special Issue on Adaptive Learning Systems in Communication Networks), vol. 16, no. 5, pp. 1219-1228, Sep. 2005. (Pubitemid 41444623)
- (2005) IEEE Transactions on Neural Networks , vol.16 , Issue.5 , pp. 1219-1228
- Liu, D.¹ Zhang, Y.² Zhang, H.³

9
- 0004163205
- New York, NY, USA: Wiley
- F. L. Lewis and V. L. Syrmos, Optimal Control. New York, NY, USA: Wiley, 1995.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

10
- 0031236002
- Adaptive critic designs
- PII S1045922797052430
- D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997. (Pubitemid 127763331)
- (1997) IEEE Transactions on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch II, D.C.²

11
- 0035273403
- On-line learning control by association and reinforcement
- DOI 10.1109/72.914523, PII S1045922701014047
- J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001. (Pubitemid 32371483)
- (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.-T.²

12
- 84876158475
- Simple and fast calculation of the second-order gradients for globalized dual heuristic dynamic programming in neural networks
- Oct.
- M. Fairbank, E. Alonso, and D. Prokhorov, "Simple and fast calculation of the second-order gradients for globalized dual heuristic dynamic programming in neural networks," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp. 1671-1676, Oct. 2012.
- (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.7 , pp. 1671-1676
- Fairbank, M.¹ Alonso, E.² Prokhorov, D.³

13
- 84875270081
- Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update
- Jul.
- T. Dierks and S. Jagannathan, "Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp. 1118-1129, Jul. 2012.
- (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.7 , pp. 1118-1129
- Dierks, T.¹ Jagannathan, S.²

14
- 70349253929
- Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
- Sep.
- H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
- (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.¹ Luo, Y.² Liu, D.³

15
- 78651311269
- Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
- Dec.
- F. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 1854-1862, Dec. 2011.
- (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.12 , pp. 1854-1862
- Wang, F.¹ Jin, N.² Liu, D.³ Wei, Q.⁴

16
- 84878421441
- Optimal control for discrete-time affine nonlinear systems using general value iteration
- Dec.
- H. Li and D. Liu, "Optimal control for discrete-time affine nonlinear systems using general value iteration," IET Control Theory Applicat., vol. 6, no. 18, pp. 2725-2736, Dec. 2012.
- (2012) IET Control Theory Applicat , vol.6 , Issue.18 , pp. 2725-2736
- Li, H.¹ Liu, D.²

17
- 84862811062
- An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
- Aug.
- Q. Wei and D. Liu, "An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state," Neural Netw., vol. 32, no. 6, pp. 236-244, Aug. 2012.
- (2012) Neural Netw. , vol.32 , Issue.6 , pp. 236-244
- Wei, Q.¹ Liu, D.²

18
- 84864489666
- Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
- Aug.
- D. Wang, D. Liu, Q. Wei, D. Zhao, and N. Jin, "Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming," Automatica, vol. 48, no. 8, pp. 1825-1832, Aug. 2012.
- (2012) Automatica , vol.48 , Issue.8 , pp. 1825-1832
- Wang, D.¹ Liu, D.² Wei, Q.³ Zhao, D.⁴ Jin, N.⁵

19
- 84863467146
- Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
- Jul.
- D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
- (2012) IEEE Trans. Autom. Sci. Eng. , vol.9 , Issue.3 , pp. 628-634
- Liu, D.¹ Wang, D.² Zhao, D.³ Wei, Q.⁴ Jin, N.⁵

20
- 82755160758
- Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
- Feb.
- D. Wang, D. Liu, and Q. Wei, "Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach," Neurocomputing, vol. 78, no. 1, pp. 14-22, Feb. 2012.
- (2012) Neurocomputing , vol.78 , Issue.1 , pp. 14-22
- Wang, D.¹ Liu, D.² Wei, Q.³

21
- 84868467610
- An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
- Jan.
- D. Liu, D. Wang, and X. Yang, "An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs," Inf. Sci., vol. 220, pp. 331-342, Jan. 2013.
- (2013) Inf. Sci. , vol.220 , pp. 331-342
- Liu, D.¹ Wang, D.² Yang, X.³

22
- 84872617336
- A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints
- Feb.
- D. Wang, D. Liu, D. Zhao, Y. Huang, and D. Zhang, "A neural-network-based iterative GDHP approach for solving a class of nonlinear optimal control problems with control constraints," Neural Comput. Applicat., vol. 22, no. 2, pp. 219-227, Feb. 2013.
- (2013) Neural Comput. Applicat. , vol.22 , Issue.2 , pp. 219-227
- Wang, D.¹ Liu, D.² Zhao, D.³ Huang, Y.⁴ Zhang, D.⁵

23
- 84881555023
- Finite-approximation-error based optimal control approach for discrete-time nonlinear systems
- Apr.
- D. Liu and Q. Wei, "Finite-approximation-error based optimal control approach for discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 43, no. 2, pp. 779-789, Apr. 2013.
- (2013) IEEE Trans. Cybern. , vol.43 , Issue.2 , pp. 779-789
- Liu, D.¹ Wei, Q.²

24
- 0004102479
- Cambridge, MA, USA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA, USA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

25
- 79551685808
- Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
- Feb.
- F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 41, no. 1, pp. 14-25, Feb. 2011.
- (2011) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.41 , Issue.1 , pp. 14-25
- Lewis, F.L.¹ Vamvoudakis, K.G.²

26
- 84859001250
- Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators
- Apr.
- Q. Yang and S. Jagannathan, "Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 2, pp. 377-390, Apr. 2012.
- (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.42 , Issue.2 , pp. 377-390
- Yang, Q.¹ Jagannathan, S.²

27
- 84857501996
- Experience replay for real-time reinforcement learning control
- Mar.
- S. Adam, L. Busoniu, and R. Babuska, "Experience replay for real-time reinforcement learning control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 2, pp. 201-212, Mar. 2012.
- (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.42 , Issue.2 , pp. 201-212
- Adam, S.¹ Busoniu, L.² Babuska, R.³

28
- 84861185789
- Efficient model learning methods for actor-critic control
- Jun.
- I. Grondman, M. Vaandrager, L. Busoniu, R. Babuska, and E. Schuitema, "Efficient model learning methods for actor-critic control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 3, pp. 591-602, Jun. 2012.
- (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.42 , Issue.3 , pp. 591-602
- Grondman, I.¹ Vaandrager, M.² Busoniu, L.³ Babuska, R.⁴ Schuitema, E.⁵

29
- 0003644124
- Cambridge, MA, USA: MIT Press
- R. A. Howard, Dynamic Programming and Markov Processes. Cambridge, MA, USA: MIT Press, 1960.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

30
- 0036588686
- Adaptive dynamic programming
- DOI 10.1109/TSMCC.2002.801727
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002. (Pubitemid 35289398)
- (2002) IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

31
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- DOI 10.1016/j.automatica.2004.11.034, PII S0005109805000105
- M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005. (Pubitemid 40352391)
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

32
- 33846781133
- A neural network solution for fixed-final time optimal control of nonlinear systems
- DOI 10.1016/j.automatica.2006.09.021, PII S0005109806004250
- T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "A neural network solution for fixed-final time optimal control of nonlinear systems," Automatica, vol. 43, no. 3, pp. 482-490, Mar. 2007. (Pubitemid 46209051)
- (2007) Automatica , vol.43 , Issue.3 , pp. 482-490
- Cheng, T.¹ Lewis, F.L.² Abu-Khalaf, M.³

33
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- Feb.
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, Feb. 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

34
- 67349145396
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- Apr.
- D. Vrabie and F. L. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, Apr. 2009.
- (2009) Neural Netw. , vol.22 , Issue.3 , pp. 237-246
- Vrabie, D.¹ Lewis, F.L.²

35
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- May
- K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

36
- 83655163786
- Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
- Dec.
- H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2226-2236, Dec. 2011.
- (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.12 , pp. 2226-2236
- Zhang, H.¹ Cui, L.² Zhang, X.³ Luo, Y.⁴

37
- 79953151751
- A model-free robust policy iteration algorithm for optimal control of nonlinear systems
- Dec.
- S. Bhasin, M. Johnson, and W. E. Dixon, "A model-free robust policy iteration algorithm for optimal control of nonlinear systems," in Proc. 49th IEEE Conf. Decision Control, Dec. 2010, pp. 3060-3065.
- (2010) Proc. 49th IEEE Conf. Decision Control , pp. 3060-3065
- Bhasin, S.¹ Johnson, M.² Dixon, W.E.³

38
- 3142784521
- Delhi, India: Hindustan Book Agency
- S. Tijs, Introduction to Game Theory. Delhi, India: Hindustan Book Agency, 2003.
- (2003) Introduction to Game Theory
- Tijs, S.¹

39
- 0004071782
- 2nd ed. Philadelphia, PA: SIAM
- T. Basar and G. J. Olsder, Dynamic Noncooperative Game Theory, 2nd ed. Philadelphia, PA: SIAM, 1999.
- (1999) Dynamic Noncooperative Game Theory
- Basar, T.¹ Olsder, G.J.²

40
- 0003404761
- 2nd ed. Boston, MA: Birkhäuser
- T. Basar and P. Bernhard, H∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach, 2nd ed. Boston, MA: Birkhäuser, 1995.
- (1995) H∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach
- Basar, T.¹ Bernhard, P.²

41
- 33847648898
- Adaptive critic designs for discrete-time zero-sum games with application to H∞ control
- DOI 10.1109/TSMCB.2006.880135, Special Issue on Memetic Algorithms
- A. Al-Tamimi, M. Abu-Khalaf, and F. L. Lewis, "Adaptive critic designs for discrete-time zero-sum games with application to H∞ control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 37, no. 1, pp. 240-247, Feb. 2007. (Pubitemid 46358495)
- (2007) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.37 , Issue.1 , pp. 240-247
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

42
- 33846781129
- Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
- DOI 10.1016/j.automatica.2006.09.019, PII S0005109806004249
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Model-free Q-learning designs for linear discrete-time zero-sum games with application to H∞ control," Automatica, vol. 43, no. 3, pp. 473-481, Mar. 2007. (Pubitemid 46209050)
- (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

43
- 79959413200
- Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks
- Jul.
- S. Mehraeen, T. Dierks, S. Jagannathan, and M. L. Crow, "Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks," in Proc. Int. Joint Conf. Neural Netw., Jul. 2010, pp. 1-8.
- (2010) Proc. Int. Joint Conf. Neural Netw. , pp. 1-8
- Mehraeen, S.¹ Dierks, T.² Jagannathan, S.³ Crow, M.L.⁴

44
- 84865090343
- H∞ control of unknown discrete-time nonlinear systems with control constraints using adaptive dynamic programming
- Jun.
- D. Liu, H. Li, and D. Wang, "H∞ control of unknown discrete-time nonlinear systems with control constraints using adaptive dynamic programming," in Proc. Int. Joint Conf. Neural Netw., Jun. 2012, pp. 3056-3061.
- (2012) Proc. Int. Joint Conf. Neural Netw. , pp. 3056-3061
- Liu, D.¹ Li, H.² Wang, D.³

45
- 79959444159
- Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game
- Jul.
- D. Vrabie and F. L. Lewis, "Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game," in Proc. Int. Joint Conf. Neural Netw., Jul. 2010, pp. 1-8.
- (2010) Proc. Int. Joint Conf. Neural Netw. , pp. 1-8
- Vrabie, D.¹ Lewis, F.L.²

46
- 33845759425
- Policy iterations and the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation
- DOI 10.1109/TAC.2006.884959
- M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Policy iterations and the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation," IEEE Trans. Autom. Control, vol. 51, no. 12, pp. 1989-1995, Dec. 2006. (Pubitemid 46002295)
- (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.12 , pp. 1989-1995
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

47
- 48949116222
- Neurodynamic programming and zero-sum games for constrained control systems
- Jul.
- M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Neurodynamic programming and zero-sum games for constrained control systems," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1243-1252, Jul. 2008.
- (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.7 , pp. 1243-1252
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

48
- 78650805234
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- Jan.
- H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, Jan. 2011.
- (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
- Zhang, H.¹ Wei, Q.² Liu, D.³

49
- 84864463039
- Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
- K. G. Vamvoudakis and F. L. Lewis, "Online solution of nonlinear two-player zero-sum games using synchronous policy iteration," Int. J. Robust Nonlinear Control, vol. 22, no. 13, pp. 1460-1483, 2012.
- (2012) Int. J. Robust Nonlinear Control , vol.22 , Issue.13 , pp. 1460-1483
- Vamvoudakis, K.G.¹ Lewis, F.L.²

50
- 79953143055
- Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation
- Dec.
- T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation," in Proc. IEEE Conf. Decision Control, Dec. 2010, pp. 3048-3053.
- (2010) Proc. IEEE Conf. Decision Control , pp. 3048-3053
- Dierks, T.¹ Jagannathan, S.²

51
- 84876909440
- Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control
- Dec.
- H. Wu and B. Luo, "Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 12, pp. 1884-1895, Dec. 2012.
- (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.12 , pp. 1884-1895
- Wu, H.¹ Luo, B.²

52
- 84860670757
- Nonlinear two-player zero-sum game approximate solution using a policy iteration algorithm
- Dec.
- M. Johnson, S. Bhasin, and W. E. Dixon, "Nonlinear two-player zero-sum game approximate solution using a policy iteration algorithm," in Proc. Conf. Decision Control Eur. Control Conf., Dec. 2011, pp. 142-147.
- (2011) Proc. Conf. Decision Control Eur. Control Conf. , pp. 142-147
- Johnson, M.¹ Bhasin, S.² Dixon, W.E.³

53
- 0030086666
- On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games
- PII S001892869600983X
- G. Freiling, G. Jank, and H. Abou-Kandil, "On global existence of solutions to coupled matrix Riccati equations in closed-loop Nash games," IEEE Trans. Autom. Control, vol. 41, no. 2, pp. 264-269, Feb. 1996. (Pubitemid 126768300)
- (1996) IEEE Transactions on Automatic Control , vol.41 , Issue.2 , pp. 264-269
- Freiling, G.¹ Jank, G.² Abou-Kandil, H.³

54
- 79953133535
- Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games
- Dec.
- D. Vrabie and F. L. Lewis, "Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games," in Proc. 49th IEEE Conf. Decision Control, Dec. 2010, pp. 3066-3071.
- (2010) Proc. 49th IEEE Conf. Decision Control , pp. 3066-3071
- Vrabie, D.¹ Lewis, F.L.²

55
- 79960897012
- Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
- Aug.
- K. G. Vamvoudakis and F. L. Lewis, "Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, Aug. 2011.
- (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
- Vamvoudakis, K.G.¹ Lewis, F.L.²

56
- 0003678750
- New York, NY, USA: Springer-Verlag
- E. Zeidler, Nonlinear Functional Analysis vol. 1: Fixed Point Theorems. New York, NY, USA: Springer-Verlag, 1985.
- (1985) Nonlinear Functional Analysis Vol. 1: Fixed Point Theorems
- Zeidler, E.¹

57
- 62949149213
- California Institute of Technology, Pasadena, CA, USA, Tech. Rep. TR96-021
- V. Nevistić and J. A. Primbs, "Constrained nonlinear optimal control: A converse HJB approach," California Institute of Technology, Pasadena, CA, USA, Tech. Rep. TR96-021, 1996.
- (1996) Constrained Nonlinear Optimal Control: A Converse HJB Approach
- Nevistić, V.¹ Primbs, J.A.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.