SCOPUS Information Retrieval Platform

IEEE Transactions on Cybernetics

Volume 46, Issue 3, 2016, Pages 854-865

Experience Replay for Optimal Control of Nonzero-Sum Game Systems with Unknown Dynamics

(4) Zhao, Dongbin (a); Zhang, Qichao (a); Wang, Ding (a); Zhu, Yuanheng (a)

(a) INSTITUTE OF AUTOMATION (China)

Author keywords

Adaptive dynamic programming (ADP); experience replay; nonzero sum (NZS) games; optimal control; unknown dynamics

Indexed keywords

CLOSED LOOP SYSTEMS; DYNAMICS; NETWORK LAYERS; NONLINEAR EQUATIONS; ONLINE SYSTEMS; SYSTEM STABILITY;

ADAPTIVE DYNAMIC PROGRAMMING; EXPERIENCE REPLAY; LYAPUNOV-BASED STABILITY ANALYSIS; NONZERO-SUM (NZS) GAMES; OPTIMAL CONTROLS; PERSISTENCE OF EXCITATION; THREE-LAYER NEURAL NETWORKS; UNIFORM ULTIMATE BOUNDEDNESS;

DYNAMIC PROGRAMMING;

EID: 84945951645 PISSN: 21682267 EISSN: None Source Type: Journal
DOI: 10.1109/TCYB.2015.2488680 Document Type: Article

Times cited : (199)

References (38)

1
- 84891584860
- Hoboken, NJ, USA: Wiley
- F. L. Lewis and D. Liu, Reinforcement Learning and Approximate Dynamic Programming for Feedback Control. Hoboken, NJ, USA: Wiley, 2013.
- (2013) Reinforcement Learning and Approximate Dynamic Programming for Feedback Control
- Lewis, F.L.¹ Liu, D.²

2
- 84862811062
- An iterative optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
- Aug.
- Q. Wei and D. Liu, "An iterative optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state," Neural Netw., vol. 32, no. 6, pp. 236-244, Aug. 2012.
- (2012) Neural Netw. , vol.32 , Issue.6 , pp. 236-244
- Wei, Q.¹ Liu, D.²

3
- 84888019460
- Full-range adaptive cruise control based on supervised adaptive dynamic programming
- Feb.
- D. Zhao et al., "Full-range adaptive cruise control based on supervised adaptive dynamic programming," Neurocomputing, vol. 125, pp. 57-67, Feb. 2014.
- (2014) Neurocomputing , vol.125 , pp. 57-67
- Zhao, D.¹

4
- 49049108697
- Adaptive critic learning techniques for engine torque and air-fuel ratio control
- Aug.
- D. Liu, H. Javaherian, O. Kovalenko, and T. Huang, "Adaptive critic learning techniques for engine torque and air-fuel ratio control," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 988-993, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 988-993
- Liu, D.¹ Javaherian, H.² Kovalenko, O.³ Huang, T.⁴

5
- 0036588686
- Adaptive dynamic programming
- May
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002.
- (2002) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev. , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

6
- 82455175244
- DHP method for ramp metering of freeway traffic
- Dec.
- D. Zhao, X. Bai, F.-Y. Wang, J. Xu, and W. Yu, "DHP method for ramp metering of freeway traffic," IEEE Intell. Transp. Syst. Mag., vol. 12, no. 4, pp. 990-999, Dec. 2011.
- (2011) IEEE Intell. Transp. Syst. Mag. , vol.12 , Issue.4 , pp. 990-999
- Zhao, D.¹ Bai, X.² Wang, F.-Y.³ Xu, J.⁴ Yu, W.⁵

7
- 84961288449
- Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems
- Feb.
- Y. Zhu, D. Zhao, and D. Liu, "Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems," Neurocomputing, vol. 149, pp. 124-131, Feb. 2015.
- (2015) Neurocomputing , vol.149 , pp. 124-131
- Zhu, Y.¹ Zhao, D.² Liu, D.³

8
- 84897594646
- Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
- Mar.
- D. Liu and Q. Wei, "Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 621-634, Mar. 2014.
- (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.3 , pp. 621-634
- Liu, D.¹ Wei, Q.²

9
- 84881026082
- Distributed cooperative secondary control of microgrids using feedback linearization
- Aug.
- A. Bidram, A. Davoudi, F. L. Lewis, and J. M. Guerrero, "Distributed cooperative secondary control of microgrids using feedback linearization," IEEE Trans. Power Syst., vol. 28, no. 3, pp. 3462-3470, Aug. 2013.
- (2013) IEEE Trans. Power Syst. , vol.28 , Issue.3 , pp. 3462-3470
- Bidram, A.¹ Davoudi, A.² Lewis, F.L.³ Guerrero, J.M.⁴

10
- 0036208288
- Fuzzy speed and steering control of an AGV
- Jan.
- K. R. S. Kodagoda, W. S. Wijesoma, and E. K. Teoh, "Fuzzy speed and steering control of an AGV," IEEE Trans. Control Syst. Technol., vol. 10, no. 1, pp. 112-120, Jan. 2002.
- (2002) IEEE Trans. Control Syst. Technol. , vol.10 , Issue.1 , pp. 112-120
- Kodagoda, K.R.S.¹ Wijesoma, W.S.² Teoh, E.K.³

11
- 0004313202
- New York, NY, USA: Springer
- P. Morris, Introduction to Game Theory. New York, NY, USA: Springer, 2012.
- (2012) Introduction to Game Theory
- Morris, P.¹

12
- 84887283655
- Insight into the so-called spatial reciprocity
- Oct., Art. ID
- Z. Wang, S. Kokubo, J. Tanimoto, E. Fukuda, and K. Shigaki, "Insight into the so-called spatial reciprocity," Phys. Rev. E, vol. 88, no. 4, Oct. 2013, Art. ID 042145.
- (2013) Phys. Rev. E , vol.88 , Issue.4
- Wang, Z.¹ Kokubo, S.² Tanimoto, J.³ Fukuda, E.⁴ Shigaki, K.⁵

13
- 84891330472
- Impact of social punishment on cooperative behavior in complex networks
- Oct. Art. ID
- Z. Wang, C.-Y. Xia, S. Meloni, C.-S. Zhou, and Y. Moreno, "Impact of social punishment on cooperative behavior in complex networks," Sci. Rep., vol. 3, Oct. 2013, Art. ID 3055.
- (2013) Sci. Rep. , vol.3
- Wang, Z.¹ Xia, C.-Y.² Meloni, S.³ Zhou, C.-S.⁴ Moreno, Y.⁵

14
- 84883180371
- Optimal interdependence between networks for the evolution of cooperation
- Aug., Art. ID
- Z. Wang, A. Szolnoki, and M. Perc, "Optimal interdependence between networks for the evolution of cooperation," Sci. Rep., vol. 3, Aug. 2013, Art. ID 2470.
- (2013) Sci. Rep. , vol.3
- Wang, Z.¹ Szolnoki, A.² Perc, M.³

15
- 84897997302
- Self-organization towards optimally interdependent networks by means of coevolution
- Art. ID
- Z. Wang, A. Szolnoki, and M. Perc, "Self-organization towards optimally interdependent networks by means of coevolution," New J. Phys., vol. 16, no. 3, 2014, Art. ID 033041.
- (2014) New J. Phys. , vol.16 , Issue.3
- Wang, Z.¹ Szolnoki, A.² Perc, M.³

16
- 34250487269
- Nonzero-sum differential games
- A. W. Starr and Y.-C. Ho, "Nonzero-sum differential games," J. Optim. Theory Appl., vol. 3, no. 3, pp. 184-206, 1969.
- (1969) J. Optim. Theory Appl. , vol.3 , Issue.3 , pp. 184-206
- Starr, A.W.¹ Ho, Y.-C.²

17
- 84961987054
- Center Econ. Res., Tilburg Univ., Tilburg, The Netherlands
- J. C. Engwerda and A. J. M. Weeren, The Open-Loop Nash Equilibrium in LQ-Games Revisited, Center Econ. Res., Tilburg Univ., Tilburg, The Netherlands, 1995.
- (1995) The Open-Loop Nash Equilibrium in LQ-Games Revisited
- Engwerda, J.C.¹ Weeren, A.J.M.²

18
- 84937390462
- Approximate N-player nonzero-sum game solution for an uncertain continuous nonlinear system
- Aug.
- M. Johnson, R. Kamalapurkar, S. Bhasin, and W. E. Dixon, "Approximate N-player nonzero-sum game solution for an uncertain continuous nonlinear system," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 8, pp. 1645-1658, Aug. 2015.
- (2015) IEEE Trans. Neural Netw. Learn. Syst. , vol.26 , Issue.8 , pp. 1645-1658
- Johnson, M.¹ Kamalapurkar, R.² Bhasin, S.³ Dixon, W.E.⁴

19
- 0014509068
- Toward a theory of many player differential games
- J. H. Case, "Toward a theory of many player differential games," SIAM J. Control, vol. 7, no. 2, pp. 179-197, 1969.
- (1969) SIAM J. Control , vol.7 , Issue.2 , pp. 179-197
- Case, J.H.¹

20
- 79551575772
- Mineola, NY, USA: Courier Corporation
- A. Friedman, Differential Games. Mineola, NY, USA: Courier Corporation, 2013.
- (2013) Differential Games
- Friedman, A.¹

21
- 79960897012
- Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
- K. G. Vamvoudakis and F. L. Lewis, "Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, 2011.
- (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
- Vamvoudakis, K.G.¹ Lewis, F.L.²

22
- 84885835001
- Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP
- Feb.
- H. Zhang, L. Cui, and Y. Luo, "Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP," IEEE Trans. Cybern., vol. 43, no. 1, pp. 206-216, Feb. 2013.
- (2013) IEEE Trans. Cybern. , vol.43 , Issue.1 , pp. 206-216
- Zhang, H.¹ Cui, L.² Luo, Y.³

23
- 79953133535
- Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games
- Atlanta, GA, USA
- D. Vrabie and F. Lewis, "Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games," in Proc. IEEE Conf. Decis. Control (CDC), Atlanta, GA, USA, 2010, pp. 3066-3071.
- (2010) Proc. IEEE Conf. Decis. Control (CDC) , pp. 3066-3071
- Vrabie, D.¹ Lewis, F.²

24
- 85027928575
- Integral reinforcement learning for continuous-time input-affine nonlinear systems with simultaneous invariant explorations
- May
- J. Y. Lee, J. B. Park, and Y. H. Choi, "Integral reinforcement learning for continuous-time input-affine nonlinear systems with simultaneous invariant explorations," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 15, pp. 916-932, May 2015.
- (2015) IEEE Trans. Neural Netw. Learn. Syst. , vol.26 , Issue.15 , pp. 916-932
- Lee, J.Y.¹ Park, J.B.² Choi, Y.H.³

25
- 84904398037
- Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics
- Jul.
- H. Li, D. Liu, and D. Wang, "Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 3, pp. 706-714, Jul. 2014.
- (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.3 , pp. 706-714
- Li, H.¹ Liu, D.² Wang, D.³

26
- 84960449514
- Model-free optimal control for affine nonlinear systems with convergence analysis
- Oct.
- D. Zhao, Z. Xia, and D. Wang, "Model-free optimal control for affine nonlinear systems with convergence analysis," IEEE Trans. Autom. Sci. Eng., vol. 12, no. 4, pp. 1461-1468, Oct. 2014.
- (2014) IEEE Trans. Autom. Sci. Eng. , vol.12 , Issue.4 , pp. 1461-1468
- Zhao, D.¹ Xia, Z.² Wang, D.³

27
- 84863467146
- Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
- Jul.
- D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
- (2012) IEEE Trans. Autom. Sci. Eng. , vol.9 , Issue.3 , pp. 628-634
- Liu, D.¹ Wang, D.² Zhao, D.³ Wei, Q.⁴ Jin, N.⁵

28
- 84904706555
- Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics
- Aug.
- D. Liu, H. Li, and D. Wang, "Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics," IEEE Trans. Syst., Man, Cybern., Syst., vol. 44, no. 8, pp. 1015-1027, Aug. 2014.
- (2014) IEEE Trans. Syst., Man, Cybern., Syst. , vol.44 , Issue.8 , pp. 1015-1027
- Liu, D.¹ Li, H.² Wang, D.³

29
- 84862815087
- Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
- H. Xu, S. Jagannathan, and F. L. Lewis, "Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses," Automatica, vol. 48, no. 6, pp. 1017-1030, 2012.
- (2012) Automatica , vol.48 , Issue.6 , pp. 1017-1030
- Xu, H.¹ Jagannathan, S.² Lewis, F.L.³

30
- 84857501996
- Experience replay for real-time reinforcement learning control
- Mar.
- S. Adam, L. Busoniu, and R. Babuska, "Experience replay for real-time reinforcement learning control," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 42, no. 2, pp. 201-212, Mar. 2012.
- (2012) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev. , vol.42 , Issue.2 , pp. 201-212
- Adam, S.¹ Busoniu, L.² Babuska, R.³

31
- 79953141961
- Concurrent learning for convergence in adaptive control without persistency of excitation
- Atlanta, GA, USA
- G. Chowdhary and E. Johnson, "Concurrent learning for convergence in adaptive control without persistency of excitation," in Proc. IEEE. Conf. Decis. Control (CDC), Atlanta, GA, USA, 2010, pp. 3674-3679.
- (2010) Proc. IEEE. Conf. Decis. Control (CDC) , pp. 3674-3679
- Chowdhary, G.¹ Johnson, E.²

32
- 84885176157
- Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks
- Oct.
- H. Modares, F. L. Lewis, and M.-B. Naghibi-Sistani, "Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 10, pp. 1513-1525, Oct. 2013.
- (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.10 , pp. 1513-1525
- Modares, H.¹ Lewis, F.L.² Naghibi-Sistani, M.-B.³

33
- 84893708995
- Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
- H. Modares, F. L. Lewis, and M.-B. Naghibi-Sistani, "Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems," Automatica, vol. 50, no. 1, pp. 193-202, 2014.
- (2014) Automatica , vol.50 , Issue.1 , pp. 193-202
- Modares, H.¹ Lewis, F.L.² Naghibi-Sistani, M.-B.³

34
- 84961977508
- Reinforcement learning and neural networks for multi-agent nonzero-sum games of nonlinear constrained-input systems
- Oct.
- S. Yasini, M. B. N. Sitani, and A. Kirampor, "Reinforcement learning and neural networks for multi-agent nonzero-sum games of nonlinear constrained-input systems," Int. J. Mach. Learn. Cybern., pp. 1-14, Oct. 2014.
- (2014) Int. J. Mach. Learn. Cybern. , pp. 1-14
- Yasini, S.¹ Sitani, M.B.N.² Kirampor, A.³

35
- 84921346879
- Concurrent learning-based approximate feedback-Nash equilibrium solution of N-player nonzero-sum differential games
- Jul.
- R. Kamalapurkar, J. R. Klotz, and W. E. Dixon, "Concurrent learning-based approximate feedback-Nash equilibrium solution of N-player nonzero-sum differential games," IEEE/CAA J. Autom. Sinica, vol. 1, no. 3, pp. 239-247, Jul. 2014.
- (2014) IEEE/CAA J. Autom. Sinica , vol.1 , Issue.3 , pp. 239-247
- Kamalapurkar, R.¹ Klotz, J.R.² Dixon, W.E.³

36
- 0004071782
- Philadelphia, PA, USA: SIAM
- T. Basar et al., Dynamic Noncooperative Game Theory, vol. 200. Philadelphia, PA, USA: SIAM, 1995.
- (1995) Dynamic Noncooperative Game Theory , vol.200
- Basar, T.¹

37
- 0003917259
- Philadelphia, PA, USA: SIAM
- B. A. Finlayson, The Method of Weighted Residuals and Variational Principles. vol. 73. Philadelphia, PA, USA: SIAM, 2013.
- (2013) The Method of Weighted Residuals and Variational Principles , vol.73
- Finlayson, B.A.¹

38
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.