SCOPUS 정보 검색 플랫폼

IEEE Transactions on Cybernetics

Volumn 47, Issue 5, 2017, Pages 1224-1237

Discrete-time deterministic Q-learning: A novel convergence analysis

(5) Wei, Qinglai a Lewis, Frank L b,c Sun, Qiuye d Yan, Pengfei a Song, Ruizhuo e

a INSTITUTE OF AUTOMATION (China)

b UNIVERSITY OF TEXAS AT ARLINGTON (United States)

c NORTHEASTERN UNIVERSITY (China)

d NORTHEASTERN UNIVERSITY (China)

e UNIVERSITY OF SCIENCE AND TECHNOLOGY BEIJING (China)

Author keywords

Adaptive critic designs; adaptive dynamic programming (ADP); approximate dynamic programming; neural networks (NNs); neuro dynamic programming; optimal control; Q learning

Indexed keywords

ALGORITHMS; ITERATIVE METHODS;

CONTROL SPACES; CONVERGENCE ANALYSIS; CONVERGENCE CRITERION; CONVERGENCE PROPERTIES; ITERATIVE CONTROL; LEARNING RATES; Q-LEARNING ALGORITHMS; UPPER AND LOWER BOUNDS;

LEARNING ALGORITHMS;

EID: 84963604827 PISSN: 21682267 EISSN: None Source Type: Journal
DOI: 10.1109/TCYB.2016.2542923 Document Type: Article

Times cited : (170)

References (62)

1
- 0002557583
- Advanced forecasting methods for global crisis warning, and models of intelligence
- P. J. Werbos, "Advanced forecasting methods for global crisis warning, and models of intelligence," Gen. Syst. Yearbook, vol. 22, pp. 25-38, 1977
- (1977) Gen. Syst. Yearbook , vol.22 , pp. 25-38
- Werbos, P.J.¹

2
- 0002011091
- A menu of designs for reinforcement learning over time
- W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA MIT Press
- P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press, 1991, pp. 67-95
- (1991) Neural Networks for Control , pp. 67-95
- Werbos, P.J.¹

3
- 84959450562
- An event-triggered ADP control approach for continuous-time system with unknown internal states
- to be published
- X. Zhong, and H. He, "An event-triggered ADP control approach for continuous-time system with unknown internal states," IEEE Trans. Cybern., to be published, doi: 10.1109/TCYB.2016.2523878
- IEEE Trans. Cybern
- Zhong, X.¹ He, H.²

4
- 84906781179
- Adaptive dynamic programming for a class of complex-valued nonlinear systems
- Sep
- R. Song, W. Xiao, H. Zhang, and C. Sun, "Adaptive dynamic programming for a class of complex-valued nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 9, pp. 1733-1739, Sep. 2014
- (2014) IEEE Trans. Neural Netw. Learn. Syst , vol.25 , Issue.9 , pp. 1733-1739
- Song, R.¹ Xiao, W.² Zhang, H.³ Sun, C.⁴

5
- 84939617304
- Data-driven zero-sum neuro-optimal control for a class of continuous-time unknown nonlinear systems with disturbance using ADP
- Feb
- Q. Wei, R. Song, and P. Yan, "Data-driven zero-sum neuro-optimal control for a class of continuous-time unknown nonlinear systems with disturbance using ADP," IEEE Trans. Neural Netw. Learn. Syst., vol. 27, no. 2, pp. 444-458, Feb. 2016
- (2016) IEEE Trans. Neural Netw. Learn. Syst , vol.27 , Issue.2 , pp. 444-458
- Wei, Q.¹ Song, R.² Yan, P.³

6
- 85027955915
- GrDHP: A general utility function representation for dual heuristic dynamic programming
- Mar
- Z. Ni, H. He, D. Zhao, X. Xu, and D. V. Prokhorov, "GrDHP: A general utility function representation for dual heuristic dynamic programming," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 3, pp. 614-627, Mar. 2015
- (2015) IEEE Trans. Neural Netw. Learn. Syst , vol.26 , Issue.3 , pp. 614-627
- Ni, Z.¹ He, H.² Zhao, D.³ Xu, X.⁴ Prokhorov, D.V.⁵

7
- 84906778934
- Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification
- Oct
- Q. Wei, and D. Liu, "Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1020-1036, Oct. 2014
- (2014) IEEE Trans. Autom. Sci. Eng , vol.11 , Issue.4 , pp. 1020-1036
- Wei, Q.¹ Liu, D.²

8
- 84887990637
- Goal representation heuristic dynamic programming on maze navigation
- Dec
- Z. Ni, H. He, J. Wen, and X. Xu, "Goal representation heuristic dynamic programming on maze navigation," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 12, pp. 2038-2050, Dec. 2013
- (2013) IEEE Trans. Neural Netw. Learn. Syst , vol.24 , Issue.12 , pp. 2038-2050
- Ni, Z.¹ He, H.² Wen, J.³ Xu, X.⁴

9
- 85027700528
- Value, and policy iterations in optimal control, and adaptive dynamic programming
- to be published
- D. P. Bertsekas, "Value, and policy iterations in optimal control, and adaptive dynamic programming," IEEE Trans. Neural Netw. Learn. Syst., to be published, doi: 10.1109/TNNLS.2015.2503980
- IEEE Trans. Neural Netw. Learn. Syst
- Bertsekas, D.P.¹

10
- 84883537695
- Reinforcement learning, and feedback control: Using natural decision methods to design optimal adaptive controllers
- Dec
- F. L. Lewis, D. Vrabie, and K. G. Vamvoudakis, "Reinforcement learning, and feedback control: Using natural decision methods to design optimal adaptive controllers," IEEE Control Syst., vol. 32, no. 6, pp. 76-105, Dec. 2012
- (2012) IEEE Control Syst , vol.32 , Issue.6 , pp. 76-105
- Lewis, F.L.¹ Vrabie, D.² Vamvoudakis, K.G.³

11
- 85013129810
- London, U.K.: Inst. Eng. Technol
- D. Vrabie, K. G. Vamvoudakis, and F. L. Lewis, Optimal Adaptive Control, and Differential Games by Reinforcement Learning Principles. London, U.K.: Inst. Eng. Technol., 2013
- (2013) Optimal Adaptive Control, and Differential Games by Reinforcement Learning Principles
- Vrabie, D.¹ Vamvoudakis, K.G.² Lewis, F.L.³

12
- 84912026937
- Revisiting approximate dynamic programming, and its convergence
- Dec
- A. Heydari, "Revisiting approximate dynamic programming, and its convergence," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2733-2743, Dec. 2014
- (2014) IEEE Trans. Cybern , vol.44 , Issue.12 , pp. 2733-2743
- Heydari, A.¹

13
- 84875270081
- Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using timebased policy update
- Jul
- T. Dierks, and S. Jagannathan, "Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using timebased policy update," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp. 1118-1129, Jul. 2012
- (2012) IEEE Trans. Neural Netw. Learn. Syst , vol.23 , Issue.7 , pp. 1118-1129
- Dierks, T.¹ Jagannathan, S.²

14
- 84893708995
- Integral reinforcement learning, and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
- Jan
- H. Modares, F. L. Lewis, and M.-B. Naghibi-Sistani, "Integral reinforcement learning, and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems," Automatica, vol. 50, no. 1, pp. 193-202, Jan. 2014
- (2014) Automatica , vol.50 , Issue.1 , pp. 193-202
- Modares, H.¹ Lewis, F.L.² Naghibi-Sistani, M.-B.³

15
- 84908432682
- Linear quadratic tracking control of partially-unknown continuous-time systems using reinforcement learning
- Nov
- H. Modares, and F. L. Lewis, "Linear quadratic tracking control of partially-unknown continuous-time systems using reinforcement learning," IEEE Trans. Autom. Control, vol. 59, no. 11, pp. 3051-3056, Nov. 2014
- (2014) IEEE Trans. Autom. Control , vol.59 , Issue.11 , pp. 3051-3056
- Modares, H.¹ Lewis, F.L.²

16
- 84912122528
- Finite-approximation-errorbased discrete-time iterative adaptive dynamic programming
- Dec
- Q. Wei, F. Y. Wang, D. Liu, and X. Yang, "Finite-approximation-errorbased discrete-time iterative adaptive dynamic programming," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2820-2833, Dec. 2014
- (2014) IEEE Trans. Cybern , vol.44 , Issue.12 , pp. 2820-2833
- Wei, Q.¹ Wang, F.Y.² Liu, D.³ Yang, X.⁴

17
- 85017730584
- Asymptotically stable adaptive-optimal control algorithm with saturating actuators, and relaxed persistence of excitation
- to be published
- K. G. Vamvoudakis, M. F. Miranda, and J. P. Hespanha, "Asymptotically stable adaptive-optimal control algorithm with saturating actuators, and relaxed persistence of excitation," IEEE Trans. Neural Netw. Learn. Syst., to be published, doi: 10.1109/TNNLS.2015.2487972
- IEEE Trans. Neural Netw. Learn. Syst
- Vamvoudakis, K.G.¹ Miranda, M.F.² Hespanha, J.P.³

18
- 79960897012
- Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
- Aug
- K. G. Vamvoudakis, and F. L. Lewis, "Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, Aug. 2011
- (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
- Vamvoudakis, K.G.¹ Lewis, F.L.²

19
- 84885835001
- Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using singlenetwork ADP
- Feb
- H. Zhang, L. Cui, and Y. Luo, "Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using singlenetwork ADP," IEEE Trans. Cybern., vol. 43, no. 1, pp. 206-216, Feb. 2013
- (2013) IEEE Trans. Cybern , vol.43 , Issue.1 , pp. 206-216
- Zhang, H.¹ Cui, L.² Luo, Y.³

20
- 85027929469
- Multiple actor-critic structures for continuous-time optimal control using input-output data
- Apr
- R. Song, et al., "Multiple actor-critic structures for continuous-time optimal control using input-output data," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 4, pp. 851-865, Apr. 2015
- (2015) IEEE Trans. Neural Netw. Learn. Syst , vol.26 , Issue.4 , pp. 851-865
- Song, R.¹

21
- 84904739156
- Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning
- Jul
- H. Modares, and F. L. Lewis, "Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning," Automatica, vol. 50, no. 7, pp. 1780-1792, Jul. 2014
- (2014) Automatica , vol.50 , Issue.7 , pp. 1780-1792
- Modares, H.¹ Lewis, F.L.²

22
- 84897594646
- Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
- Mar
- D. Liu, and Q. Wei, "Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 621-634, Mar. 2014
- (2014) IEEE Trans. Neural Netw. Learn. Syst , vol.25 , Issue.3 , pp. 621-634
- Liu, D.¹ Wei, Q.²

23
- 33747862706
- Relaxing dynamic programming
- Aug
- B. Lincoln, and A. Rantzer, "Relaxing dynamic programming," IEEE Trans. Autom. Control, vol. 51, no. 8, pp. 1249-1260, Aug. 2006
- (2006) IEEE Trans. Autom. Control , vol.51 , Issue.8 , pp. 1249-1260
- Lincoln, B.¹ Rantzer, A.²

24
- 84930506123
- Multibattery optimal coordination control for home energy management systems via distributed iterative adaptive dynamic programming
- Jul
- Q. Wei, D. Liu, G. Shi, and Y. Liu, "Multibattery optimal coordination control for home energy management systems via distributed iterative adaptive dynamic programming," IEEE Trans. Ind. Electron., vol. 62, no. 7, pp. 4203-4214, Jul. 2015
- (2015) IEEE Trans. Ind. Electron , vol.62 , Issue.7 , pp. 4203-4214
- Wei, Q.¹ Liu, D.² Shi, G.³ Liu, Y.⁴

25
- 84924872284
- A novel dual iterative Q-learning method for optimal battery management in smart residential environments
- Apr
- Q. Wei, D. Liu, and G. Shi, "A novel dual iterative Q-learning method for optimal battery management in smart residential environments," IEEE Trans. Ind. Electron., vol. 62, no. 4, pp. 2509-2518, Apr. 2015
- (2015) IEEE Trans. Ind. Electron , vol.62 , Issue.4 , pp. 2509-2518
- Wei, Q.¹ Liu, D.² Shi, G.³

26
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- Aug
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

27
- 49049119493
- A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
- Aug
- H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.38 , Issue.4 , pp. 937-942
- Zhang, H.¹ Wei, Q.² Luo, Y.³

28
- 18444379381
- Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes
- Jul
- J. M. Lee, and J. H. Lee, "Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes," Automatica, vol. 41, no. 7, pp. 1281-1288, Jul. 2005
- (2005) Automatica , vol.41 , Issue.7 , pp. 1281-1288
- Lee, J.M.¹ Lee, J.H.²

29
- 85046476577
- Boca Raton FL USA CRC Press
- L. Busoniu, R. Babuska, B. D. Schutter, and D. Ernst, Reinforcement Learning, and Dynamic Programming Using Function Approximators. Boca Raton, FL, USA: CRC Press, 2010
- (2010) Reinforcement Learning Dynamic Programming Using Function Approximators
- Busoniu, L.¹ Babuska, R.² Schutter, B.D.³ Ernst, D.⁴

30
- 84885176157
- Adaptive optimal control of unknown constrained-input systems using policy iteration, and neural networks
- Oct
- H. Modares, F. L. Lewis, and M. B. Naghibi-Sistani, "Adaptive optimal control of unknown constrained-input systems using policy iteration, and neural networks," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 10, pp. 1513-1525, Oct. 2013
- (2013) IEEE Trans. Neural Netw. Learn. Syst , vol.24 , Issue.10 , pp. 1513-1525
- Modares, H.¹ Lewis, F.L.² Naghibi-Sistani, M.B.³

31
- 84908658175
- A novel iterative-adaptive dynamic programming for discrete-time nonlinear systems
- Oct
- Q. Wei, and D. Liu, "A novel iterative -adaptive dynamic programming for discrete-time nonlinear systems," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1176-1190, Oct. 2014
- (2014) IEEE Trans. Autom. Sci. Eng , vol.11 , Issue.4 , pp. 1176-1190
- Wei, Q.¹ Liu, D.²

32
- 84902352795
- Data-driven neuro-optimal temperature control of water-gas shift reaction using stable iterative adaptive dynamic programming
- Nov
- Q. Wei, and D. Liu, "Data-driven neuro-optimal temperature control of water-gas shift reaction using stable iterative adaptive dynamic programming," IEEE Trans. Ind. Electron., vol. 61, no. 11, pp. 6399-6408, Nov. 2014
- (2014) IEEE Trans. Ind. Electron , vol.61 , Issue.11 , pp. 6399-6408
- Wei, Q.¹ Liu, D.²

33
- 84928747516
- Off-policy actor-critic structure for optimal control of unknown systems with disturbances
- to be published
- R. Song, F. L. Lewis, Q. Wei, and H. Zhang, "Off-policy actor-critic structure for optimal control of unknown systems with disturbances," IEEE Trans. Cybern., to be published, doi: 10.1109/TCYB.2015.2421338
- IEEE Trans. Cybern
- Song, R.¹ Lewis, F.L.² Wei, Q.³ Zhang, H.⁴

34
- 0004049893
- Ph.D. dissertation Cambridge Univ., Cambridge, U.K
- C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 1989
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

35
- 34249833101
- Q-learning
- May
- C. Watkins, and P. Dayan, "Q-learning," Mach. Learn., vol. 8, nos. 3-4, pp. 279-292, May 1992
- (1992) Mach. Learn , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

36
- 33846781129
- Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
- Mar
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control," Automatica, vol. 43, no. 3, pp. 473-481, Mar. 2007
- (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

37
- 77955423822
- Model-free H control design for unknown linear discrete-time systems via Q-learning with LMI
- Aug
- J.-H. Kim, and F. L. Lewis, "Model-free H control design for unknown linear discrete-time systems via Q-learning with LMI," Automatica, vol. 46, no. 8, pp. 1320-1326, Aug. 2010
- (2010) Automatica , vol.46 , Issue.8 , pp. 1320-1326
- Kim, J.-H.¹ Lewis, F.L.²

38
- 84887200130
- A deterministic improved Q-learning for path planning of a mobile robot
- Sep
- A. Konar, I. G. Chakraborty, S. J. Singh, L. C. Jain, and A. K. Nagar, "A deterministic improved Q-learning for path planning of a mobile robot," IEEE Trans. Syst., Man, Cybern., Syst., vol. 43, no. 5, pp. 1141-1153, Sep. 2013
- (2013) IEEE Trans. Syst., Man, Cybern., Syst , vol.43 , Issue.5 , pp. 1141-1153
- Konar, A.¹ Chakraborty, I.G.² Singh, S.J.³ Jain, L.C.⁴ Nagar, A.K.⁵

39
- 0031236002
- Adaptive critic designs
- Sep
- D. V. Prokhorov, and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997
- (1997) IEEE Trans. Neural Netw , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, D.C.²

40
- 77955828918
- An adaptive Q-learning algorithm developed for agent-based computational modeling of electricity market
- Sep
- M. Rahimiyan, and H. R. Mashhadi, "An adaptive Q-learning algorithm developed for agent-based computational modeling of electricity market," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 40, no. 5, pp. 547-556, Sep. 2010
- (2010) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev , vol.40 , Issue.5 , pp. 547-556
- Rahimiyan, M.¹ Mashhadi, H.R.²

41
- 79958173163
- Reinforcement learning with function approximation for traffic signal control
- Jun
- L. A. Prashanth, and S. Bhatnagar, "Reinforcement learning with function approximation for traffic signal control," IEEE Trans. Intell. Transp. Syst., vol. 12, no. 2, pp. 412-421, Jun. 2011
- (2011) IEEE Trans. Intell. Transp. Syst , vol.12 , Issue.2 , pp. 412-421
- Prashanth, L.A.¹ Bhatnagar, S.²

42
- 84897585055
- QD-learning: A collaborative distributed strategy for multi-agent reinforcement learning through consensus + innovations
- Jul
- S. Kar, J. M. F. Moura, and H. V. Poor, "QD-learning: A collaborative distributed strategy for multi-agent reinforcement learning through consensus + innovations," IEEE Trans. Signal Process., vol. 61, no. 7, pp. 1848-1862, Jul. 2013
- (2013) IEEE Trans. Signal Process , vol.61 , Issue.7 , pp. 1848-1862
- Kar, S.¹ Moura, J.M.F.² Poor, H.V.³

43
- 84872594962
- A self-learning scheme for residential energy system control, and management
- Feb
- T. Huang, and D. Liu, "A self-learning scheme for residential energy system control, and management," Neural Comput. Appl., vol. 22, no. 2, pp. 259-269, Feb. 2013
- (2013) Neural Comput. Appl , vol.22 , Issue.2 , pp. 259-269
- Huang, T.¹ Liu, D.²

44
- 84930645788
- Hybrid threephase/single-phase microgrid architecture with power management capabilities
- Oct
- Q. Sun, J. Zhou, J. M. Guerrero, and H. Zhang, "Hybrid threephase/single-phase microgrid architecture with power management capabilities," IEEE Trans. Power Electron., vol. 30, no. 10, pp. 5964-5977, Oct. 2015
- (2015) IEEE Trans. Power Electron , vol.30 , Issue.10 , pp. 5964-5977
- Sun, Q.¹ Zhou, J.² Guerrero, J.M.³ Zhang, H.⁴

45
- 84959484631
- A multiagent-based consensus algorithm for distributed coordinated control of distributed generators in the energy Internet
- Nov
- Q. Sun, R. Han, H. Zhang, J. Zhou, and J. M. Guerrero, "A multiagent-based consensus algorithm for distributed coordinated control of distributed generators in the energy Internet," IEEE Trans. Smart Grid, vol. 6, no. 6, pp. 3006-3019, Nov. 2015, doi: 10.1109/TSG.2015.2412779
- (2015) IEEE Trans. Smart Grid , vol.6 , Issue.6 , pp. 3006-3019
- Sun, Q.¹ Han, R.² Zhang, H.³ Zhou, J.⁴ Guerrero, J.M.⁵

46
- 85025171615
- A novel energy function-based stability evaluation, and nonlinear control for energy Internet
- to be published
- Q. Sun, Y. Zhang, H. He, D. Ma, and H. Zhang, "A novel energy function-based stability evaluation, and nonlinear control for energy Internet," IEEE Trans. Smart Grid, to be published, doi: 10.1109/TSG.2015.2497691
- IEEE Trans. Smart Grid
- Sun, Q.¹ Zhang, Y.² He, H.³ Ma, D.⁴ Zhang, H.⁵

47
- 84892442931
- A multiagent Q-learningbased optimal allocation approach for urban water resource management system
- Jan
- J. Ni, M. Liu, L. Ren, and S. X. Yang, "A multiagent Q-learningbased optimal allocation approach for urban water resource management system," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 1, pp. 204-214, Jan. 2014
- (2014) IEEE Trans. Autom. Sci. Eng , vol.11 , Issue.1 , pp. 204-214
- Ni, J.¹ Liu, M.² Ren, L.³ Yang, S.X.⁴

48
- 0003787146
- Princeton NJ USA: Princeton Univ. Press
- R. E. Bellman, Dynamic Programming. Princeton, NJ, USA: Princeton Univ. Press, 1957
- (1957) Dynamic Programming
- Bellman, R.E.¹

49
- 85028172044
- Adaptive neural network tracking control of uncertain nonlinear discrete-time systems with nonaffine dead-zone input
- Mar
- Y.-J. Liu, and S. C. Tong, "Adaptive neural network tracking control of uncertain nonlinear discrete-time systems with nonaffine dead-zone input," IEEE Trans. Cybern., vol. 45, no. 3, pp. 497-505, Mar. 2015
- (2015) IEEE Trans. Cybern , vol.45 , Issue.3 , pp. 497-505
- Liu, Y.-J.¹ Tong, S.C.²

50
- 84941079390
- A unified approach to adaptive neural control for nonlinear discrete-time systems with nonlinear dead-zone input
- Jan
- Y. J. Liu, Y. Gao, S. C. Tong, and C. L. P. Chen, "A unified approach to adaptive neural control for nonlinear discrete-time systems with nonlinear dead-zone input," IEEE Trans. Neural Netw. Learn. Syst., vol. 27, no. 1, pp. 139-150, Jan. 2016
- (2016) IEEE Trans. Neural Netw. Learn. Syst , vol.27 , Issue.1 , pp. 139-150
- Liu, Y.J.¹ Gao, Y.² Tong, S.C.³ Chen, C.L.P.⁴

51
- 84919600707
- Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems
- Jan
- Y. J. Liu, L. Tang, S. C. Tong, C. L. P. Chen, and D. J. Li, "Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 1, pp. 165-176, Jan. 2015
- (2015) IEEE Trans. Neural Netw. Learn. Syst , vol.26 , Issue.1 , pp. 165-176
- Liu, Y.J.¹ Tang, L.² Tong, S.C.³ Chen, C.L.P.⁴ Li, D.J.⁵

52
- 84897630989
- A survey on CPG-inspired control models, and system implementation
- Mar
- J. Yu, M. Tan, J. Chen, and J. Zhang, "A survey on CPG-inspired control models, and system implementation," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 441-456, Mar. 2014
- (2014) IEEE Trans. Neural Netw. Learn. Syst , vol.25 , Issue.3 , pp. 441-456
- Yu, J.¹ Tan, M.² Chen, J.³ Zhang, J.⁴

53
- 18844387150
- New York NY USA Prentice Hall
- R. C. Dorf, and R. H. Bishop, Modern Control System. New York, NY, USA: Prentice Hall, 2011
- (2011) Modern Control System
- Dorf, R.C.¹ Bishop, R.H.²

54
- 0003785722
- Ph.D. dissertation Dept. Electr. Eng., Rensselaer Polytech. Inst., Troy, NY, USA
- R. W. Beard, "Improving the closed-loop performance of nonlinear systems," Ph.D. dissertation, Dept. Electr. Eng., Rensselaer Polytech. Inst., Troy, NY, USA, 1995
- (1995) Improving the Closed-loop Performance of Nonlinear Systems
- Beard, R.W.¹

55
- 84955579295
- Finite-horizon near optimal adaptive control of uncertain linear discrete-time systems
- Q. Zhao, H. Xu, and S. Jagannathan, "Finite-horizon near optimal adaptive control of uncertain linear discrete-time systems," Optimal Control Appl. Methods, vol. 36, no. 6, pp. 853-872, 2015, doi: 10.1002/oca.2143
- (2015) Optimal Control Appl. Methods , vol.36 , Issue.6 , pp. 853-872
- Zhao, Q.¹ Xu, H.² Jagannathan, S.³

56
- 84911399192
- Stochastic optimal output feedback design for unknown linear discrete-time system zero-sum games under communication constraints
- Sep
- H. Xu, S. Jagannathan, and F. L. Lewis, "Stochastic optimal output feedback design for unknown linear discrete-time system zero-sum games under communication constraints," Asian J. Control, vol. 16, no. 5, pp. 1263-1276, Sep. 2014
- (2014) Asian J. Control , vol.16 , Issue.5 , pp. 1263-1276
- Xu, H.¹ Jagannathan, S.² Lewis, F.L.³

57
- 84862815087
- Stochastic optimal control of unknown linear networked control systems in the presence of random delays, and packet losses
- Jun
- H. Xu, S. Jagannathan, and F. L. Lewis, "Stochastic optimal control of unknown linear networked control systems in the presence of random delays, and packet losses," Automatica, vol. 48, no. 6, pp. 1017-1030, Jun. 2012
- (2012) Automatica , vol.48 , Issue.6 , pp. 1017-1030
- Xu, H.¹ Jagannathan, S.² Lewis, F.L.³

58
- 84946780761
- Global adaptive dynamic programming for continuous-time nonlinear systems
- Nov
- Y. Jiang, and Z. P. Jiang, "Global adaptive dynamic programming for continuous-time nonlinear systems," IEEE Trans. Autom. Control, vol. 60, no. 11, pp. 2917-2929, Nov. 2015
- (2015) IEEE Trans. Autom. Control , vol.60 , Issue.11 , pp. 2917-2929
- Jiang, Y.¹ Jiang, Z.P.²

59
- 84908120758
- Adaptive dynamic programming, and optimal control of nonlinear nonaffine systems
- Oct
- T. Bian, Y. Jiang, and Z.-P. Jiang, "Adaptive dynamic programming, and optimal control of nonlinear nonaffine systems," Automatica, vol. 50, no. 10, pp. 2624-2632, Oct. 2014
- (2014) Automatica , vol.50 , Issue.10 , pp. 2624-2632
- Bian, T.¹ Jiang, Y.² Jiang, Z.-P.³

60
- 85028229548
- Distributed cooperative optimal control for multiagent systems on directed graphs: An inverse optimal approach
- Jul
- H. Zhang, T. Feng, G. H. Yang, and H. Liang, "Distributed cooperative optimal control for multiagent systems on directed graphs: An inverse optimal approach," IEEE Trans. Cybern., vol. 45, no. 7, pp. 1315-1326, Jul. 2015
- (2015) IEEE Trans. Cybern , vol.45 , Issue.7 , pp. 1315-1326
- Zhang, H.¹ Feng, T.² Yang, G.H.³ Liang, H.⁴

61
- 84946811900
- Value iteration adaptive dynamic programming for optimal control of discrete-time nonlinear systems
- Mar
- Q. Wei, D. Liu, and H. Lin, "Value iteration adaptive dynamic programming for optimal control of discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 46, no. 3, pp. 840-853, Mar. 2016
- (2016) IEEE Trans. Cybern , vol.46 , Issue.3 , pp. 840-853
- Wei, Q.¹ Liu, D.² Lin, H.³

62
- 85027953921
- Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems
- Apr
- Q. Wei, D. Liu, and X. Yang, "Infinite horizon self-learning optimal control of nonaffine discrete-time nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 4, pp. 866-879, Apr. 2015
- (2015) IEEE Trans. Neural Netw. Learn. Syst , vol.26 , Issue.4 , pp. 866-879
- Wei, Q.¹ Liu, D.² Yang, X.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.