메뉴 건너뛰기




Volumn 23, Issue 7-8, 2013, Pages 1851-1863

Dual iterative adaptive dynamic programming for a class of discrete-time nonlinear systems with time-delays

Author keywords

Adaptive critic designs; Adaptive dynamic programming; Approximate dynamic programming; Nonlinear systems; Optimal control; Time delay

Indexed keywords

ADAPTIVE CRITIC DESIGNS; ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATE DYNAMIC PROGRAMMING; DISCRETE-TIME NONLINEAR SYSTEMS; OPTIMAL CONTROL POLICY; OPTIMAL CONTROL PROBLEM; OPTIMAL CONTROL SCHEME; OPTIMAL CONTROLS;

EID: 84887490966     PISSN: 09410643     EISSN: None     Source Type: Journal    
DOI: 10.1007/s00521-012-1188-7     Document Type: Article
Times cited : (35)

References (48)
  • 1
    • 14844340822 scopus 로고    scopus 로고
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • Abu-Khalaf M, Lewis FL (2005) Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 41(5): 779-791.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 2
    • 33847648898 scopus 로고    scopus 로고
    • Adaptive critic designs for discrete-time zero-sum games with application to control
    • Al-Tamimi A, Abu-Khalaf M, Lewis FL (2007) Adaptive critic designs for discrete-time zero-sum games with application to control. IEEE Trans Syst Cybern Part B Cybern 37(1): 240-247.
    • (2007) IEEE Trans Syst Cybern Part B Cybern , vol.37 , Issue.1 , pp. 240-247
    • Al-Tamimi, A.1    Abu-Khalaf, M.2    Lewis, F.L.3
  • 3
    • 49049089962 scopus 로고    scopus 로고
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof
    • Al-Tamimi A, Lewis FL, Abu-Khalaf M (2008) Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Trans Syst Man Cybern Part B Cybern 38(4): 943-949.
    • (2008) IEEE Trans Syst Man Cybern Part B Cybern , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 4
    • 31344440501 scopus 로고    scopus 로고
    • Optimal control for linear systems with multiple time delays in control input
    • Basin M, Rodriguez-Gonzalez J (2006) Optimal control for linear systems with multiple time delays in control input. IEEE Trans Autom Control 51(1): 91-97.
    • (2006) IEEE Trans Autom Control , vol.51 , Issue.1 , pp. 91-97
    • Basin, M.1    Rodriguez-Gonzalez, J.2
  • 5
    • 34547546153 scopus 로고    scopus 로고
    • Optimal and robust control for linear state-delay systems
    • Basin M, Rodriguez-Gonzaleza J, Fridman L (2007) Optimal and robust control for linear state-delay systems. J Franklin Inst 344(7): 830-845.
    • (2007) J Franklin Inst , vol.344 , Issue.7 , pp. 830-845
    • Basin, M.1    Rodriguez-Gonzaleza, J.2    Fridman, L.3
  • 6
    • 85012688561 scopus 로고
    • Princeton, NJ: Princeton University Press
    • Bellman RE (1957) Dynamic programming. Princeton University Press, Princeton, NJ.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 7
    • 77950867376 scopus 로고    scopus 로고
    • Approximate dynamic programming with a fuzzy parameterization
    • Busoniu L, Ernst D, Schutter BD, Babuska R (2010) Approximate dynamic programming with a fuzzy parameterization. Automatica 46(5): 804-814.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 804-814
    • Busoniu, L.1    Ernst, D.2    Schutter, B.D.3    Babuska, R.4
  • 9
    • 39549085591 scopus 로고    scopus 로고
    • Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discretetime systems
    • Chen Z, Jagannathan S (2008) Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discretetime systems. IEEE Trans Neural Netw 19(1): 90-106.
    • (2008) IEEE Trans Neural Netw , vol.19 , Issue.1 , pp. 90-106
    • Chen, Z.1    Jagannathan, S.2
  • 11
    • 49249114606 scopus 로고    scopus 로고
    • Slope-permissive under-voltage load shed relay for delayed voltage recovery mitigation
    • Halpin SM, Harley KA, Jones RA, Taylor LY (2008) Slope-permissive under-voltage load shed relay for delayed voltage recovery mitigation. IEEE Trans Power Syst 23(3): 1211-1216.
    • (2008) IEEE Trans Power Syst , vol.23 , Issue.3 , pp. 1211-1216
    • Halpin, S.M.1    Harley, K.A.2    Jones, R.A.3    Taylor, L.Y.4
  • 12
    • 33750376234 scopus 로고    scopus 로고
    • Universal learning network and its application for nonlinear system with long time delay
    • Han M, Han B, Xi J, Hirasawa K (2006) Universal learning network and its application for nonlinear system with long time delay. Comput Chem Eng 31(1): 13-20.
    • (2006) Comput Chem Eng , vol.31 , Issue.1 , pp. 13-20
    • Han, M.1    Han, B.2    Xi, J.3    Hirasawa, K.4
  • 14
    • 19344376039 scopus 로고    scopus 로고
    • Adaptive neural control for a class of nonlinearly parametric time-delay systems
    • Ho DWC, Li J, Niu Y (2005) Adaptive neural control for a class of nonlinearly parametric time-delay systems. IEEE Trans Neural Netw 16(3): 625-635.
    • (2005) IEEE Trans Neural Netw , vol.16 , Issue.3 , pp. 625-635
    • Ho, D.W.C.1    Li, J.2    Niu, Y.3
  • 15
    • 50649098812 scopus 로고    scopus 로고
    • Optimal scheduling for minimum delay in passive star coupled WDM optical networks
    • Huang X, Ma M (2008) Optimal scheduling for minimum delay in passive star coupled WDM optical networks. IEEE Trans Commun 56(8): 1324-1330.
    • (2008) IEEE Trans Commun , vol.56 , Issue.8 , pp. 1324-1330
    • Huang, X.1    Ma, M.2
  • 16
    • 70349116541 scopus 로고    scopus 로고
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • Lewis FL, Vrabie D (2009) Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits Syst Mag 9(3): 32-50.
    • (2009) IEEE Circuits Syst Mag , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 17
    • 76849116795 scopus 로고    scopus 로고
    • A novel robust adaptive-fuzzy-tracking control for a class of nonlinear multi-input/multi-output systems
    • Li T, Tong SC, Feng G (2010) A novel robust adaptive-fuzzy-tracking control for a class of nonlinear multi-input/multi-output systems. IEEE Trans Fuzzy Syst 18(1): 150-160.
    • (2010) IEEE Trans Fuzzy Syst , vol.18 , Issue.1 , pp. 150-160
    • Li, T.1    Tong, S.C.2    Feng, G.3
  • 18
    • 77952580785 scopus 로고    scopus 로고
    • A DSC approach to robust adaptive NN tracking control for strict-feedback nonlinear systems
    • Li T, Wang D, Feng G, Tong SC (2010) A DSC approach to robust adaptive NN tracking control for strict-feedback nonlinear systems. IEEE Trans Syst Man Cybern Part B Cybern 40(3): 915-927.
    • (2010) IEEE Trans Syst Man Cybern Part B Cybern , vol.40 , Issue.3 , pp. 915-927
    • Li, T.1    Wang, D.2    Feng, G.3    Tong, S.C.4
  • 19
    • 77956634872 scopus 로고    scopus 로고
    • Neural-network-based simple adaptive control of uncertain multi-input multi-output non-linear systems
    • Li T, Feng, Wang D, Tong S (2010) Neural-network-based simple adaptive control of uncertain multi-input multi-output non-linear systems. IET Control Theory Appl 4(9): 1543-1557.
    • (2010) IET Control Theory Appl , vol.4 , Issue.9 , pp. 1543-1557
    • Li, T.1    Feng, G.2    Wang, D.3    Tong, S.4
  • 20
    • 26844483839 scopus 로고    scopus 로고
    • A self-learning call admission control scheme for CDMA cellular networks
    • Liu D, Zhang Y, Zhang H (2005) A self-learning call admission control scheme for CDMA cellular networks. IEEE Trans Neural Netw 16(5): 1219-1228.
    • (2005) IEEE Trans Neural Netw , vol.16 , Issue.5 , pp. 1219-1228
    • Liu, D.1    Zhang, Y.2    Zhang, H.3
  • 22
    • 27844601849 scopus 로고
    • The discrete-time tracking problem with a time delay in the control
    • Pindyck RS (1992) The distrete-time tracking problem with a time delay in the control. IEEE Trans Autom Control 17(6): 397-398.
    • (1992) IEEE Trans Autom Control , vol.17 , Issue.6 , pp. 397-398
    • Pindyck, R.S.1
  • 25
    • 0041511967 scopus 로고    scopus 로고
    • Time-delay systems: an overview of some recent advances and open problems
    • Richard JP (2003) Time-delay systems: an overview of some recent advances and open problems. Automatica 39(10): 1667-1694.
    • (2003) Automatica , vol.39 , Issue.10 , pp. 1667-1694
    • Richard, J.P.1
  • 26
    • 51749115119 scopus 로고    scopus 로고
    • Optimal estimation in networked control systems subject to random delay and packet drop
    • Schenato L (2008) Optimal estimation in networked control systems subject to random delay and packet drop. IEEE Trans Autom Control 53(5): 1311-1317.
    • (2008) IEEE Trans Autom Control , vol.53 , Issue.5 , pp. 1311-1317
    • Schenato, L.1
  • 27
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • Si J, Wang YT (2001) On-line learning control by association and reinforcement. IEEE Trans Neural Netw 12(2): 264-276.
    • (2001) IEEE Trans Neural Netw , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 29
    • 78649933699 scopus 로고    scopus 로고
    • Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming
    • Song R, Zhang H, Luo Y, Wei Q (2010) Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming. Neurocomputing 73(16-18): 3020-3027.
    • (2010) Neurocomputing , vol.73 , Issue.16-18 , pp. 3020-3027
    • Song, R.1    Zhang, H.2    Luo, Y.3    Wei, Q.4
  • 30
    • 77956494871 scopus 로고    scopus 로고
    • Load distribution model and voltage static profile of Smart Grid
    • Sun Q, Li Z, Yang J, Luo Y (2010) Load distribution model and voltage static profile of Smart Grid. J Central S Univ Technol 17(4): 824-829.
    • (2010) J Central S Univ Technol , vol.17 , Issue.4 , pp. 824-829
    • Sun, Q.1    Li, Z.2    Yang, J.3    Luo, Y.4
  • 31
    • 77950630017 scopus 로고    scopus 로고
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5): 878-888.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 32
    • 82755160758 scopus 로고    scopus 로고
    • Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
    • Wang D, Liu D, Wei Q (2012) Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach. Neurocomputing 78(1): 14-22.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 14-22
    • Wang, D.1    Liu, D.2    Wei, Q.3
  • 33
    • 78651311269 scopus 로고    scopus 로고
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • Wang FY, Jin N, Liu D, Wei Q (2011) Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound. IEEE Trans Neural Netw 22(1): 24-36.
    • (2011) IEEE Trans Neural Netw , vol.22 , Issue.1 , pp. 24-36
    • Wang, F.Y.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 34
    • 66449130966 scopus 로고    scopus 로고
    • Adaptive dynamic programming: an introduction
    • Wang FY, Zhang H, Liu D (2009) Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag 4(2): 39-47.
    • (2009) IEEE Comput Intell Mag , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.Y.1    Zhang, H.2    Liu, D.3
  • 36
    • 61849184281 scopus 로고    scopus 로고
    • Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
    • Wei Q, Zhang H, Dai J (2009) Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions. Neurocomputing 72(7-9): 1839-1848.
    • (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1839-1848
    • Wei, Q.1    Zhang, H.2    Dai, J.3
  • 37
    • 0002011091 scopus 로고
    • A menu of designs for reinforcement learning over time
    • W. T. Miller, R. S. Sutton, and P. J. Werbos (Eds.), Cambridge: MIT Press
    • Werbos PJ (1991) A menu of designs for reinforcement learning over time. In: Miller WT, Sutton RS, Werbos PJ (eds) Neural networks for control. MIT Press, Cambridge, pp 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 38
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White and D. A. Sofge (Eds.), New York: Van Nostrand Reinhold
    • Werbos PJ (1992) Approximate dynamic programming for real-time control and neural modeling. In: White DA, Sofge DA (eds) Handbook of intelligent control: neural, fuzzy, and adaptive approaches ch. 13. van Nostrand Reinhold, New York.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches Ch. 13.
    • Werbos, P.J.1
  • 39
    • 0015667648 scopus 로고
    • Punish/reward: learning with a critic in adaptive threshold systems
    • Widrow B, Gupta N, Maitra S (1973) Punish/reward: learning with a critic in adaptive threshold systems. IEEE Trans Syst Man Cybern 3: 455-465.
    • (1973) IEEE Trans Syst Man Cybern , vol.3 , pp. 455-465
    • Widrow, B.1    Gupta, N.2    Maitra, S.3
  • 40
    • 34547133970 scopus 로고    scopus 로고
    • Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks
    • Yadav V, Padhi R, Balakrishnan SN (2007) Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks. IEEE Trans Neural Netw 18(4): 1115-1128.
    • (2007) IEEE Trans Neural Netw , vol.18 , Issue.4 , pp. 1115-1128
    • Yadav, V.1    Padhi, R.2    Balakrishnan, S.N.3
  • 41
    • 2442482637 scopus 로고    scopus 로고
    • A combined backstepping and small-gain approach to robust adaptive fuzzy control for strict-feedback nonlinear systems
    • Yang Y, Feng G, Ren J (2004) A combined backstepping and small-gain approach to robust adaptive fuzzy control for strict-feedback nonlinear systems. IEEE Trans Syst Man Cybern Part A Syst Humans 34(3): 406-420.
    • (2004) IEEE Trans Syst Man Cybern Part A Syst Humans , vol.34 , Issue.3 , pp. 406-420
    • Yang, Y.1    Feng, G.2    Ren, J.3
  • 42
    • 34548689324 scopus 로고    scopus 로고
    • Ito-Volterra optimal state estimation with continuous, multirate, randomly sampled, and delayed measurements
    • Zhang H, Basin MV, Skliar M (2007) Ito-Volterra optimal state estimation with continuous, multirate, randomly sampled, and delayed measurements. IEEE Trans Autom Control 52(3): 401-416.
    • (2007) IEEE Trans Autom Control , vol.52 , Issue.3 , pp. 401-416
    • Zhang, H.1    Basin, M.V.2    Skliar, M.3
  • 43
    • 0035303251 scopus 로고    scopus 로고
    • Modeling, identification and control of a class of nonlinear system
    • Zhang H, Quan Y (2001) Modeling, identification and control of a class of nonlinear system. IEEE Trans Fuzzy Syst 9(2): 349-354.
    • (2001) IEEE Trans Fuzzy Syst , vol.9 , Issue.2 , pp. 349-354
    • Zhang, H.1    Quan, Y.2
  • 44
    • 39649121043 scopus 로고    scopus 로고
    • Delay-dependent guaranteed cost control for uncertain stochastic fuzzy systems with multiple time delays
    • Zhang H, Wang Y, Liu D (2008) Delay-dependent guaranteed cost control for uncertain stochastic fuzzy systems with multiple time delays. IEEE Trans Syst Man Cybern Part B Cybern 38(1): 125-140.
    • (2008) IEEE Trans Syst Man Cybern Part B Cybern , vol.38 , Issue.1 , pp. 125-140
    • Zhang, H.1    Wang, Y.2    Liu, D.3
  • 45
    • 49049119493 scopus 로고    scopus 로고
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • Zhang H, Wei Q, Luo Y (2008) A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm. IEEE Trans Syst Man Cybern Part B Cybern 38(4): 937-942.
    • (2008) IEEE Trans Syst Man Cybern Part B Cybern , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 46
    • 78650805234 scopus 로고    scopus 로고
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Zhang H, Wei Q, Liu D (2011) An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games. Automatica 47(1): 207-214.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 47
    • 83855165164 scopus 로고    scopus 로고
    • Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming
    • Zhang H, Song R, Wei Q, Zhang T (2011) Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming. IEEE Trans Neural Netw 22(12): 1851-1862.
    • (2011) IEEE Trans Neural Netw , vol.22 , Issue.12 , pp. 1851-1862
    • Zhang, H.1    Song, R.2    Wei, Q.3    Zhang, T.4
  • 48
    • 33947594205 scopus 로고    scopus 로고
    • Guaranteed cost networked control for T-S fuzzy systems with time delay
    • Zhang H, Yang D, Chai T (2007) Guaranteed cost networked control for T-S fuzzy systems with time delay. IEEE Trans Syst Man Cybern Part C Appl Rev 37(2): 160-172.
    • (2007) IEEE Trans Syst Man Cybern Part C Appl Rev , vol.37 , Issue.2 , pp. 160-172
    • Zhang, H.1    Yang, D.2    Chai, T.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.