SCOPUS 정보 검색 플랫폼

IEEE Transactions on Cybernetics

Volumn 43, Issue 1, 2013, Pages 206-216

Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP

(3) Zhang, Huaguang a Cui, Lili b Luo, Yanhong c

a STATE KEY LABORATORY OF SYNTHETICAL AUTOMATION FOR PROCESS INDUSTRIES (China)

b SHENYANG NORMAL UNIVERSITY (China)

c NORTHEASTERN UNIVERSITY (China)

Author keywords

Adaptive dynamic programming (ADP); Continuous time nonlinear systems; Neural networks (NNs); Nonzero sum differential games; Optimal control

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; CONTINUOUS TIME NONLINEAR SYSTEMS; NEURAL NETWORKS (NNS); NONZERO-SUM DIFFERENTIAL GAME; OPTIMAL CONTROLS;

CONTINUOUS TIME SYSTEMS; CONTROL; GAME THEORY; NEURAL NETWORKS; NONLINEAR SYSTEMS;

DYNAMIC PROGRAMMING;

EID: 84885835001 PISSN: 21682267 EISSN: None Source Type: Journal
DOI: 10.1109/TSMCB.2012.2203336 Document Type: Article

Times cited : (419)

References (30)

1
- 34247618255
- Newton's method for solving cross-coupled sign-indefinite algebraic Riccati equations for weakly coupled large-scale systems
- May
- H. Mukaidani, "Newton's method for solving cross-coupled sign-indefinite algebraic Riccati equations for weakly coupled large-scale systems," Appl. Math. Comput., vol. 188, no. 1, pp. 103-115, May 2007.
- (2007) Appl. Math. Comput. , vol.188 , Issue.1 , pp. 103-115
- Mukaidani, H.¹

2
- 4243352182
- M.S. thesis, Rutgers Univ., Piscataway, NJ
- V. Shah, "Power Control for Wireless Data Services Based on Utility and Pricing," M.S. thesis, Rutgers Univ., Piscataway, NJ, 1998.
- (1998) Power Control for Wireless Data Services Based on Utility and Pricing
- Shah, V.¹

3
- 34250487269
- Nonzero-sum differential games
- A. W. Starr and Y. C. Ho, "Nonzero-sum differential games," J. Optim. Theory Appl., vol. 3, no. 3, pp. 184-206, 1969.
- (1969) J. Optim. Theory Appl. , vol.3 , Issue.3 , pp. 184-206
- Starr, A.W.¹ Ho, Y.C.²

4
- 0003981511
- Philadelphia, PA: SIAM
- T. Baser and G. J. Olsder, Dynamic Noncooperative Game Theory. Philadelphia, PA: SIAM, 1998.
- (1998) Dynamic Noncooperative Game Theory
- Baser, T.¹ Olsder, G.J.²

5
- 85012688561
- Princeton, NJ: Princeton Univ. Press
- R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
- (1957) Dynamic Programming
- Bellman, R.E.¹

6
- 79953127250
- Solving coupled riccati equations for closed-loop Nash strategy, by lack of trust approach
- M. Jungers, E. De Pieri, and H. Abou-Kandil, "Solving coupled riccati equations for closed-loop Nash strategy, by lack of trust approach," Int. J. Tomogr. Stat., vol. 7, no. F07, pp. 49-54, 2007.
- (2007) Int. J. Tomogr. Stat. , vol.7 , Issue.F07 , pp. 49-54
- Jungers, M.¹ De Pieri, E.² Abou-Kandil, H.³

7
- 66449130966
- Adaptive dynamic programming: An introduction
- May
- F.-Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intell. Mag., vol. 4, no. 2, pp. 39-47, May 2009.
- (2009) IEEE Comput. Intell. Mag. , vol.4 , Issue.2 , pp. 39-47
- Wang, F.-Y.¹ Zhang, H.² Liu, D.³

8
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

9
- 84921399937
- New York: Wiley
- J. Si, A. G. Barto, W. B. Powell, and D. W. II, Handbook of Learning and Approximate Dynamic Programming. New York: Wiley, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ W II, D.⁴

10
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- May
- M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

11
- 0036588686
- Adaptive dynamic programming
- May
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002.
- (2002) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev. , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

12
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- Feb.
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, Feb. 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

13
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- May
- K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

14
- 34249047468
- Continuous-time adaptive critics
- May
- T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics," IEEE Trans. Neural Netw., vol. 18, no. 3, pp. 631-647, May 2007.
- (2007) IEEE Trans. Neural Netw. , vol.18 , Issue.3 , pp. 631-647
- Hanselmann, T.¹ Noakes, L.² Zaknich, A.³

15
- 77957777969
- Optimal control of affine nonlinear continuous-time systems
- T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems," in Proc. IEEE Amer. Control Conf., Baltimore, MD, Jun. 2010, pp. 1568-1573.
- Proc. IEEE Amer. Control Conf., Baltimore, MD, Jun. 2010 , pp. 1568-1573
- Dierks, T.¹ Jagannathan, S.²

16
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- Aug.
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

17
- 79551685808
- Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
- Feb.
- F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 41, no. 1, pp. 14-25, Feb. 2011.
- (2011) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.41 , Issue.1 , pp. 14-25
- Lewis, F.L.¹ Vamvoudakis, K.G.²

18
- 70349253929
- Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
- Sep.
- H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
- (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.¹ Luo, Y.² Liu, D.³

19
- 49049119493
- A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm
- Aug.
- H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear system based on greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 937-942
- Zhang, H.¹ Wei, Q.² Luo, Y.³

20
- 77950853735
- Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics
- T. Dierks and S. Jagannathan, "Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics," in Proc. Joint 48th IEEE Conf. Decision Control/28th Chin. Control Conf., Shanghai, China, Dec. 2009, pp. 6750-6755.
- Proc. Joint 48th IEEE Conf. Decision Control/28th Chin. Control Conf., Shanghai, China, Dec. 2009 , pp. 6750-6755
- Dierks, T.¹ Jagannathan, S.²

21
- 68149180889
- Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
- Aug.
- T. Dierks, T. Balaje, and S. Jagannathan, "Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence," Neural Netw., vol. 22, no. 5/6, pp. 851-860, Aug. 2009.
- (2009) Neural Netw. , vol.22 , Issue.5-6 , pp. 851-860
- Dierks, T.¹ Balaje, T.² Jagannathan, S.³

22
- 33751238181
- A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
- Dec.
- R. Padhi, N. Unnikrishnan, X. Wang, and S. N. Balakrishnan, "A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems," Neural Netw., vol. 19, no. 10, pp. 1648-1660, Dec. 2006.
- (2006) Neural Netw. , vol.19 , Issue.10 , pp. 1648-1660
- Padhi, R.¹ Unnikrishnan, N.² Wang, X.³ Balakrishnan, S.N.⁴

23
- 48949116222
- Neurodynamic programming and zero-sum games for constrained control systems
- Jul.
- M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Neurodynamic programming and zero-sum games for constrained control systems," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1243-1252, Jul. 2008.
- (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.7 , pp. 1243-1252
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

24
- 67650567581
- Data-based optimal control for discretetime zero-sum games of 2-D systems using adaptive critic designs
- Jun.
- Q. Wei, H. Zhang, and L. Cui, "Data-based optimal control for discretetime zero-sum games of 2-D systems using adaptive critic designs," ACTA Autom. Sin., vol. 35, no. 6, pp. 682-692, Jun. 2009.
- (2009) ACTA Autom. Sin. , vol.35 , Issue.6 , pp. 682-692
- Wei, Q.¹ Zhang, H.² Cui, L.³

25
- 33847648898
- Adaptive critic designs for discrete-time zero-sum games with application to H? Control
- Feb.
- A. Al-Tamimi, M. Abu-Khalaf, and F. L. Lewis, "Adaptive critic designs for discrete-time zero-sum games with application to H? control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 37, no. 1, pp. 240-247, Feb. 2007.
- (2007) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.37 , Issue.1 , pp. 240-247
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

26
- 78650805234
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- Jan.
- H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, Jan. 2011.
- (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
- Zhang, H.¹ Wei, Q.² Liu, D.³

27
- 79953143055
- Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation
- T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems using an online Hamilton-Jacobi-Isaacs formulation," in Proc. 49th IEEE Conf. Decision Control, Atlanta, GA, Dec. 2010, pp. 3048-3053.
- Proc. 49th IEEE Conf. Decision Control, Atlanta, GA, Dec. 2010 , pp. 3048-3053
- Dierks, T.¹ Jagannathan, S.²

28
- 79953133535
- Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games
- D. Vrabie and F. L. Lewis, "Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games," in Proc. 49th IEEE Conf. Decision Control, Atlanta, GA, Dec. 2010, pp. 3066-3071.
- Proc. 49th IEEE Conf. Decision Control, Atlanta, GA, Dec. 2010 , pp. 3066-3071
- Vrabie, D.¹ Lewis, F.L.²

29
- 79960897012
- Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Hacobi equations
- Aug. doi:DOI:10.1016/j.automatica.2011.03.005
- K. G. Vamvoudakisand and F. L. Lewis, "Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Hacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, Aug. 2011. doi:DOI:10.1016/j.automatica.2011.03.005.
- (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
- Vamvoudakisand, K.G.¹ Lewis, F.L.²

30
- 0004178386
- Englewood Cliffs, NJ: Prentice-Hall
- H. K. Khalil, Nonlinear System. Englewood Cliffs, NJ: Prentice-Hall, 1996.
- (1996) Nonlinear System
- Khalil, H.K.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.