SCOPUS 정보 검색 플랫폼

Volumn 20, Issue 2, 2016, Pages 697-706

Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach

(3) Wei, Qinglai a Liu, Derong a Xu, Yancai a

a INSTITUTE OF GEOLOGY AND GEOPHYSICS (China)

Author keywords

Adaptive critic designs; Adaptive dynamic programming; Approximate dynamic programming; Neural networks; Nonlinear systems; Optimal control; Reinforcement learning

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; ALGORITHMS; DISCRETE TIME CONTROL SYSTEMS; ITERATIVE METHODS; NAVIGATION; NEURAL NETWORKS; NONLINEAR SYSTEMS; REINFORCEMENT LEARNING;

ADAPTIVE CRITIC DESIGNS; ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATE DYNAMIC PROGRAMMING; CONVERGENCE PROPERTIES; DISCRETE-TIME NONLINEAR SYSTEMS; OPTIMAL CONTROLS; OPTIMAL TRACKING CONTROL; PERFORMANCE INDICES;

DYNAMIC PROGRAMMING;

EID: 84955703536 PISSN: 14327643 EISSN: 14337479 Source Type: Journal
DOI: 10.1007/s00500-014-1533-0 Document Type: Article

Times cited : (32)

References (53)

1
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- Abu-Khalaf M, Lewis FL (2005) Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 41(5):779–791
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

2
- 33847648898
- Adaptive critic designs for discrete-time zero-sum games with application to (Formula presented.) control
- Al-Tamimi A, Abu-Khalaf M, Lewis FL (2007) Adaptive critic designs for discrete-time zero-sum games with application to $$H_{\infty }$$H∞ control. IEEE Trans Syst Cybern Part B: Cybern 37(1):240–247
- (2007) IEEE Trans Syst Cybern Part B: Cybern , vol.37 , Issue.1 , pp. 240-247
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

3
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof
- Al-Tamimi A, Lewis FL, Abu-Khalaf M (2008) Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Trans Syst Man Cybern Part B: Cybern 38(4):943–949
- (2008) IEEE Trans Syst Man Cybern Part B: Cybern , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

4
- 84871319455
- A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
- Bhasin S, Kamalapurkar R, Johnson M, Vamvoudakis KG, Lewis FL, Dixon WE (2013) A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems. Automatica 49(1):82–92
- (2013) Automatica , vol.49 , Issue.1 , pp. 82-92
- Bhasin, S.¹ Kamalapurkar, R.² Johnson, M.³ Vamvoudakis, K.G.⁴ Lewis, F.L.⁵ Dixon, W.E.⁶

5
- 85012688561
- Princeton University Press, Princeton
- Bellman RE (1957) Dynamic programming. Princeton University Press, Princeton
- (1957) Dynamic programming
- Bellman, R.E.¹

6
- 0003487482
- Athena Scientific, Belmont
- Bertsekas DP, Tsitsiklis JN (1996) Neuro-dynamic programming. Athena Scientific, Belmont
- (1996) Neuro-dynamic programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

7
- 0003565783
- Athena Scientific, Belmont
- Bertsekas DP (2007) Dynamic programming and optimal control, 3rd edn. Athena Scientific, Belmont
- (2007) Dynamic programming and optimal control
- Bertsekas, D.P.¹

8
- 84901054552
- Utilizing time-linkage property in DOPs: an information sharing based artificial bee colony algorithm for tracking multiple optima in uncertain environments
- Biswas S, Das S, Kundu S, Patra GR (2014) Utilizing time-linkage property in DOPs: an information sharing based artificial bee colony algorithm for tracking multiple optima in uncertain environments. Soft Comput 18(6):1199–1212
- (2014) Soft Comput , vol.18 , Issue.6 , pp. 1199-1212
- Biswas, S.¹ Das, S.² Kundu, S.³ Patra, G.R.⁴

9
- 84871294033
- On functional equations for (Formula presented.)th best policies in Markov decision processes
- Chang HS (2013) On functional equations for $$K$$Kth best policies in Markov decision processes. Automatica 49(1):297–300
- (2013) Automatica , vol.49 , Issue.1 , pp. 297-300
- Chang, H.S.¹

10
- 0043026775
- Helicopter trimming and tracking control using direct neural dynamic programming
- Enns R, Si J (2003) Helicopter trimming and tracking control using direct neural dynamic programming. IEEE Trans Neural Netw 14(8):929–939
- (2003) IEEE Trans Neural Netw , vol.14 , Issue.8 , pp. 929-939
- Enns, R.¹ Si, J.²

11
- 84925291138
- Fortier N, Sheppard J, Strasser S (2014) Abductive inference in Bayesian networks using distributed overlapping swarm intelligence. Soft Comput (in press)
- Fortier N, Sheppard J, Strasser S (2014) Abductive inference in Bayesian networks using distributed overlapping swarm intelligence. Soft Comput (in press). doi:10.1007/s00500-014-1310-0

12
- 84880065287
- Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics
- Heydari A, Balakrishnan SN (2013) Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics. IEEE Trans Neural Netw Learn Syst 24(1):145–157
- (2013) IEEE Trans Neural Netw Learn Syst , vol.24 , Issue.1 , pp. 145-157
- Heydari, A.¹ Balakrishnan, S.N.²

13
- 84872032832
- An algorithm for robust explicit/multi-parametric model predictive control
- Kouramas KI, Panos C, Faisca NP, Pistikopoulos EN (2013) An algorithm for robust explicit/multi-parametric model predictive control. Automatica 49(2):381–389
- (2013) Automatica , vol.49 , Issue.2 , pp. 381-389
- Kouramas, K.I.¹ Panos, C.² Faisca, N.P.³ Pistikopoulos, E.N.⁴

14
- 84922835782
- Kundu S, Das S, Vasilakos AV, Biswas S (2014) A modified differential evolution-based combined routing and sleep scheduling scheme for lifetime maximization of wireless sensor networks. Soft Comput (in press)
- Kundu S, Das S, Vasilakos AV, Biswas S (2014) A modified differential evolution-based combined routing and sleep scheduling scheme for lifetime maximization of wireless sensor networks. Soft Comput (in press). doi:10.1007/s00500-014-1286-9

15
- 84883537695
- Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers
- Lewis FL, Vrabie D, Vamvoudakis KG (2012) Reinforcement learning and feedback control: using natural decision methods to design optimal adaptive controllers. IEEE Control Syst 32(6):76–105
- (2012) IEEE Control Syst , vol.32 , Issue.6 , pp. 76-105
- Lewis, F.L.¹ Vrabie, D.² Vamvoudakis, K.G.³

16
- 33747862706
- Relaxing dynamic programming
- Lincoln B, Rantzer A (2006) Relaxing dynamic programming. IEEE Trans Autom Control 51(8):1249–1260
- (2006) IEEE Trans Autom Control , vol.51 , Issue.8 , pp. 1249-1260
- Lincoln, B.¹ Rantzer, A.²

17
- 49049108697
- Adaptive critic learning techniques for engine torque and air-fuel ratio control
- Liu D, Javaherian H, Kovalenko O, Huang T (2008) Adaptive critic learning techniques for engine torque and air-fuel ratio control. IEEE Trans Syst Man Cybern Part B Cybern 38(4):988–993
- (2008) IEEE Trans Syst Man Cybern Part B Cybern , vol.38 , Issue.4 , pp. 988-993
- Liu, D.¹ Javaherian, H.² Kovalenko, O.³ Huang, T.⁴

18
- 84881555023
- Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
- Liu D, Wei Q (2013) Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems. IEEE Trans Cybern 43(2):779–789
- (2013) IEEE Trans Cybern , vol.43 , Issue.2 , pp. 779-789
- Liu, D.¹ Wei, Q.²

19
- 84899122972
- Multi-person zero-sum differential games for a class of uncertain nonlinear systems
- Liu D, Wei Q (2014a) Multi-person zero-sum differential games for a class of uncertain nonlinear systems. Int J Adaptive Control Signal Process 28(3–5):205–231
- (2014) Int J Adaptive Control Signal Process , vol.28 , Issue.3-5 , pp. 205-231
- Liu, D.¹ Wei, Q.²

20
- 84897594646
- Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
- Liu D, Wei Q (2014b) Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems. IEEE Trans Neural Netw Learn Syst 25(3):621–634
- (2014) IEEE Trans Neural Netw Learn Syst , vol.25 , Issue.3 , pp. 621-634
- Liu, D.¹ Wei, Q.²

21
- 26844483839
- A self-learning call admission control scheme for CDMA cellular networks
- Liu D, Zhang Y, Zhang H (2005) A self-learning call admission control scheme for CDMA cellular networks. IEEE Trans Neural Netw 16(5):1219–1228
- (2005) IEEE Trans Neural Netw , vol.16 , Issue.5 , pp. 1219-1228
- Liu, D.¹ Zhang, Y.² Zhang, H.³

22
- 0019625194
- Optimal control of a class of nonlinear stochastic systems
- Mohler RR, Kolodziej WJ (1981) Optimal control of a class of nonlinear stochastic systems. IEEE Trans Autom Control 26(5):1048–1054
- (1981) IEEE Trans Autom Control , vol.26 , Issue.5 , pp. 1048-1054
- Mohler, R.R.¹ Kolodziej, W.J.²

23
- 0036588686
- Adaptive dynamic programming
- Murray JJ, Cox CJ, Lendaris GG, Saeks R (2002) Adaptive dynamic programming. IEEE Trans Syst Man Cybern Part C Appl Rev 32(2):140–153
- (2002) IEEE Trans Syst Man Cybern Part C Appl Rev , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

24
- 84885936244
- Heuristic dynamic programming with internal goal representation
- Ni Z, He H (2013) Heuristic dynamic programming with internal goal representation. Soft Comput 17(11):2101–2108
- (2013) Soft Comput , vol.17 , Issue.11 , pp. 2101-2108
- Ni, Z.¹ He, H.²

25
- 47349092417
- Wiley, Hoboken
- Powell WB (2007) Approximate dynamic programming. Wiley, Hoboken
- (2007) Approximate dynamic programming
- Powell, W.B.¹

26
- 0031236002
- Adaptive critic designs
- Prokhorov DV, Wunsch DC (1997) Adaptive critic designs. IEEE Trans Neural Netw 8(5):997–1007
- (1997) IEEE Trans Neural Netw , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, D.C.²

27
- 84947093016
- Rubio JDJ (2014) Adaptive least square control in discrete time of robotic arms. Soft Comput (in press)
- Rubio JDJ (2014) Adaptive least square control in discrete time of robotic arms. Soft Comput (in press). doi:10.1007/s00500-014-1300-2

28
- 0015039815
- System equivalence in a class of nonlinear optimal control problems
- Rugh WJ (1971) System equivalence in a class of nonlinear optimal control problems. IEEE Trans Autom Control 16(2):189–194
- (1971) IEEE Trans Autom Control , vol.16 , Issue.2 , pp. 189-194
- Rugh, W.J.¹

29
- 0035273403
- On-line learning control by association and reinforcement
- Si J, Wang YT (2001) On-line learning control by association and reinforcement. IEEE Trans Neural Netw 12(2):264–276
- (2001) IEEE Trans Neural Netw , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.T.²

30
- 84885923700
- Multi-objective optimal control for a class of nonlinear time-delay systems via adaptive dynamic programming
- Song R, Xiao W, Wei Q (2013) Multi-objective optimal control for a class of nonlinear time-delay systems via adaptive dynamic programming. Soft Comput 17(11):2109–2115
- (2013) Soft Comput , vol.17 , Issue.11 , pp. 2109-2115
- Song, R.¹ Xiao, W.² Wei, Q.³

31
- 84905102927
- Neural-network-based approach to finite-time optimal control for a class of unknown nonlinear systems
- Song R, Xiao W, Wei Q, Sun C (2014) Neural-network-based approach to finite-time optimal control for a class of unknown nonlinear systems. Soft Comput 18(8):1645–1653
- (2014) Soft Comput , vol.18 , Issue.8 , pp. 1645-1653
- Song, R.¹ Xiao, W.² Wei, Q.³ Sun, C.⁴

32
- 0004102479
- MIT Press, Cambridge
- Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. MIT Press, Cambridge
- (1998) Reinforcement learning: an introduction
- Sutton, R.S.¹ Barto, A.G.²

33
- 66449130966
- Adaptive dynamic programming: an introduction
- Wang F, Zhang H, Liu D (2009) Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag 4(2):39–47
- (2009) IEEE Comput Intell Mag , vol.4 , Issue.2 , pp. 39-47
- Wang, F.¹ Zhang, H.² Liu, D.³

34
- 78651311269
- Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with (Formula presented.)-error bound
- Wang F, Jin N, Liu D, Wei Q (2011) Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with $$\epsilon $$ϵ-error bound. IEEE Trans Neural Netw 22(1):24–36
- (2011) IEEE Trans Neural Netw , vol.22 , Issue.1 , pp. 24-36
- Wang, F.¹ Jin, N.² Liu, D.³ Wei, Q.⁴

35
- 84862811062
- An iterative (Formula presented.)-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
- Wei Q, Liu D (2012) An iterative $$\epsilon $$ϵ-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state. Neural Netw 32:236–244
- (2012) Neural Netw , vol.32 , pp. 236-244
- Wei, Q.¹ Liu, D.²

36
- 84883327795
- Numerical adaptive learning control scheme for discrete-time nonlinear systems
- Wei Q, Liu D (2013) Numerical adaptive learning control scheme for discrete-time nonlinear systems. IET Control Theory Appl 7(11):1472–1486
- (2013) IET Control Theory Appl , vol.7 , Issue.11 , pp. 1472-1486
- Wei, Q.¹ Liu, D.²

37
- 84887490966
- Dual iterative adaptive dynamic programming for a class of discrete-time nonlinear systems with time-delays
- Wei Q, Wang D, Zhang D (2013) Dual iterative adaptive dynamic programming for a class of discrete-time nonlinear systems with time-delays. Neural Comput Appl 23(7–8):1851–1863
- (2013) Neural Comput Appl , vol.23 , Issue.7-8 , pp. 1851-1863
- Wei, Q.¹ Wang, D.² Zhang, D.³

38
- 84906778934
- Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification
- Wei Q, Liu D (2014a) Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification. IEEE Trans Autom Sci Eng 11(4):1020–1036
- (2014) IEEE Trans Autom Sci Eng , vol.11 , Issue.4 , pp. 1020-1036
- Wei, Q.¹ Liu, D.²

39
- 84908658175
- A novel iterative (Formula presented.)-adaptive dynamic programming for discrete-time nonlinear systems
- Wei Q, Liu D (2014b) A novel iterative $$\theta $$θ-adaptive dynamic programming for discrete-time nonlinear systems. IEEE Trans Autom Sci Eng 11(4):1176–1190
- (2014) IEEE Trans Autom Sci Eng , vol.11 , Issue.4 , pp. 1176-1190
- Wei, Q.¹ Liu, D.²

40
- 84902352795
- Data-driven neuro-optimal temperature control of water gas shift reaction using stable iterative adaptive dynamic programming
- Wei Q, Liu D (2014c) Data-driven neuro-optimal temperature control of water gas shift reaction using stable iterative adaptive dynamic programming. IEEE Trans Ind Electron 61(11):6399–6408
- (2014) IEEE Trans Ind Electron , vol.61 , Issue.11 , pp. 6399-6408
- Wei, Q.¹ Liu, D.²

41
- 84898013913
- Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems
- Wei Q, Liu D (2014d) Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems. Neural Comput Appl 24(6):1355–1367
- (2014) Neural Comput Appl , vol.24 , Issue.6 , pp. 1355-1367
- Wei, Q.¹ Liu, D.²

42
- 84924872284
- Wei Q, Liu D, Shi G (2014) A novel dual iterative Q-learning method for optimal battery management in smart residential environments. IEEE Trans Ind Electron (in press)
- Wei Q, Liu D, Shi G (2014) A novel dual iterative Q-learning method for optimal battery management in smart residential environments. IEEE Trans Ind Electron (in press). doi:10.1109/TIE.2014.2361485

43
- 84912122528
- Wei Q, Wang F, Liu D, Yang X (2014) Finite-approximation-error based discrete-time iterative adaptive dynamic programming. IEEE Trans Cybern (in press)
- Wei Q, Wang F, Liu D, Yang X (2014) Finite-approximation-error based discrete-time iterative adaptive dynamic programming. IEEE Trans Cybern (in press). doi:10.1109/TCYB.2014.2354377

44
- 61849184281
- Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
- Wei Q, Zhang H, Dai J (2009) Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions. Neurocomputing 72(7–9):1839–1848
- (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1839-1848
- Wei, Q.¹ Zhang, H.² Dai, J.³

45
- 0002557583
- Advanced forecasting methods for global crisis warning and models of intelligence
- Werbos PJ (1977) Advanced forecasting methods for global crisis warning and models of intelligence. General Syst Yearb 22:25–38
- (1977) General Syst Yearb , vol.22 , pp. 25-38
- Werbos, P.J.¹

46
- 0002011091
- A menu of designs for reinforcement learning over time
- Miller WT, Sutton RS, Werbos PJ, (eds), MIT Press, Cambridge
- Werbos PJ (1991) A menu of designs for reinforcement learning over time. In: Miller WT, Sutton RS, Werbos PJ (eds) Neural Netw Control. MIT Press, Cambridge
- (1991) Neural Netw Control
- Werbos, P.J.¹

47
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- White DA, Sofge DA, (eds), Van Nostrand Reinhold, New York
- Werbos PJ (1992) Approximate dynamic programming for real-time control and neural modeling. In: White DA, Sofge DA (eds) Handbook of intelligent control: neural, fuzzy, and adaptive approaches. Van Nostrand Reinhold, New York
- (1992) Handbook of intelligent control: neural, fuzzy, and adaptive approaches
- Werbos, P.J.¹

48
- 84884958993
- Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming
- Xu H, Jagannathan S (2013) Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming. IEEE Trans Neural Netw Learn Syst 24(3):471–484
- (2013) IEEE Trans Neural Netw Learn Syst , vol.24 , Issue.3 , pp. 471-484
- Xu, H.¹ Jagannathan, S.²

49
- 84885835001
- Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP
- Zhang H, Cui L, Luo Y (2013) Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using single-network ADP. IEEE Trans Cybern 43(1):206–216
- (2013) IEEE Trans Cybern , vol.43 , Issue.1 , pp. 206-216
- Zhang, H.¹ Cui, L.² Luo, Y.³

50
- 84892670912
- Approximate optimal solution of the DTHJB equation for a class of nonlinear affine systems with unknown dead-zone constraints
- Zhang D, Liu D, Wang D (2014) Approximate optimal solution of the DTHJB equation for a class of nonlinear affine systems with unknown dead-zone constraints. Soft Comput 18(2):349–357
- (2014) Soft Comput , vol.18 , Issue.2 , pp. 349-357
- Zhang, D.¹ Liu, D.² Wang, D.³

51
- 70349253929
- The RBF neural network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraint
- Zhang H, Luo Y, Liu D (2009) The RBF neural network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraint. IEEE Trans Neural Netw 20(9):1490–1503
- (2009) IEEE Trans Neural Netw , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.¹ Luo, Y.² Liu, D.³

52
- 49049119493
- A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
- Zhang H, Wei Q, Luo Y (2008) A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Trans Syst Man Cybern Part B Cybern 38(4):937–942
- (2008) IEEE Trans Syst Man Cybern Part B Cybern , vol.38 , Issue.4 , pp. 937-942
- Zhang, H.¹ Wei, Q.² Luo, Y.³

53
- 78650805234
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- Zhang H, Wei Q, Liu D (2011) An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 47(1):207–214
- (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
- Zhang, H.¹ Wei, Q.² Liu, D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.