Volume 220, 2013, Pages 331-342

An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs

Author keywords

Adaptive dynamic programming; Approximate dynamic programming; Control constraints; Globalized dual heuristic programming; Neural networks; Optimal control

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATE DYNAMIC PROGRAMMING; CONTROL CONSTRAINT; DUAL HEURISTIC PROGRAMMING; OPTIMAL CONTROLS;

EID: 84868467610     PISSN: 00200255     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ins.2012.07.006     Document Type: Conference Paper
Times cited: 125

References (37)
  • 1
    • 14844340822
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • M. Abu-Khalaf, and F.L. Lewis Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach Automatica 41 2005 779 791
    • (2005) Automatica , vol.41 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 4
    • 85012688561
    • Princeton University Press Princeton, NJ
    • R.E. Bellman Dynamic Programming 1957 Princeton University Press Princeton, NJ
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 6
    • 84860501567
    • Temporal difference methods for general projected equations
    • D.P. Bertsekas Temporal difference methods for general projected equations IEEE Transactions on Automatic Control 56 2011 2128 2139
    • (2011) IEEE Transactions on Automatic Control , vol.56 , pp. 2128-2139
    • Bertsekas, D.P.1
  • 7
    • 31444450515
    • Direct nonlinear control design: The virtual reference feedback tuning (VRFT) approach
    • M.C. Campi, and S.M. Savaresi Direct nonlinear control design: the virtual reference feedback tuning (VRFT) approach IEEE Transactions on Automatic Control 51 2006 14 27
    • (2006) IEEE Transactions on Automatic Control , vol.51 , pp. 14-27
    • Campi, M.C.1    Savaresi, S.M.2
  • 8
    • 20144369710
    • A new approach for neural control of nonlinear discrete dynamic systems
    • J.I. Canelon, L.S. Shieh, and N.B. Karayiannis A new approach for neural control of nonlinear discrete dynamic systems Information Sciences 174 2005 177 196
    • (2005) Information Sciences , vol.174 , pp. 177-196
    • Canelon, J.I.1    Shieh, L.S.2    Karayiannis, N.B.3
  • 9
    • 68149180889
    • Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
    • T. Dierks, B.T. Thumati, and S. Jagannathan Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence Neural Networks 22 2009 851 860
    • (2009) Neural Networks , vol.22 , pp. 851-860
    • Dierks, T.1    Thumati, B.T.2    Jagannathan, S.3
  • 10
    • 79960467711
    • Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems
    • J. Ding, and S.N. Balakrishnan Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems Journal of Control Theory and Applications 9 2011 370 380
    • (2011) Journal of Control Theory and Applications , vol.9 , pp. 370-380
    • Ding, J.1    Balakrishnan, S.N.2
  • 11
    • 80055064646
    • Example-based learning particle swarm optimization for continuous optimization
    • H. Huang, H. Qin, Z. Hao, and A. Lim Example-based learning particle swarm optimization for continuous optimization Information Sciences 182 2012 125 138
    • (2012) Information Sciences , vol.182 , pp. 125-138
    • Huang, H.1    Qin, H.2    Hao, Z.3    Lim, A.4
  • 12
    • 79953906172
    • Self-organizing state aggregation for architecture design of Q-learning
    • K.S. Hwang, H.Y. Lin, Y.P. Hsu, and H.H. Yu Self-organizing state aggregation for architecture design of Q-learning Information Sciences 181 2011 2813 2822
    • (2011) Information Sciences , vol.181 , pp. 2813-2822
    • Hwang, K.S.1    Lin, H.Y.2    Hsu, Y.P.3    Yu, H.H.4
  • 14
    • 77955423822
    • H∞ control design for unknown linear discrete-time systems via Q-learning with LMI
    • J.H. Kim, and F.L. Lewis H∞ control design for unknown linear discrete-time systems via Q-learning with LMI Automatica 46 2010 1320 1326
    • (2010) Automatica , vol.46 , pp. 1320-1326
    • Kim, J.H.1    Lewis, F.L.2
  • 15
    • 0027556823
    • Control of nonlinear dynamical systems using neural networks: Controllability and stabilization
    • A.U. Levin, and K.S. Narendra Control of nonlinear dynamical systems using neural networks: controllability and stabilization IEEE Transactions on Neural Networks 4 1993 192 206
    • (1993) IEEE Transactions on Neural Networks , vol.4 , pp. 192-206
    • Levin, A.U.1    Narendra, K.S.2
  • 17
    • 70349116541
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • F.L. Lewis, and D. Vrabie Reinforcement learning and adaptive dynamic programming for feedback control IEEE Circuits and Systems Magazine 9 2009 32 50
    • (2009) IEEE Circuits and Systems Magazine , vol.9 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 18
    • 34548268505
    • Robust adaptive critic control of nonlinear systems using fuzzy basis function networks: An LMI approach
    • C.K. Lin Robust adaptive critic control of nonlinear systems using fuzzy basis function networks: an LMI approach Information Sciences 177 2007 4934 4946
    • (2007) Information Sciences , vol.177 , pp. 4934-4946
    • Lin, C.K.1
  • 19
    • 80051579328
    • A new robust training algorithm for a class of single-hidden layer feedforward neural networks
    • Z. Man, K. Lee, D. Wang, Z. Cao, and C. Miao A new robust training algorithm for a class of single-hidden layer feedforward neural networks Neurocomputing 74 2011 2491 2501
    • (2011) Neurocomputing , vol.74 , pp. 2491-2501
    • Man, Z.1    Lee, K.2    Wang, D.3    Cao, Z.4    Miao, C.5
  • 20
    • 49749098745
    • Iterative feedback tuning in fuzzy control systems: Theory and applications
    • S. Preitl, R.E. Precup, J. Fodor, and B. Bede Iterative feedback tuning in fuzzy control systems: theory and applications Acta Polytechnica Hungarica 3 2006 81 96
    • (2006) Acta Polytechnica Hungarica , vol.3 , pp. 81-96
    • Preitl, S.1    Precup, R.E.2    Fodor, J.3    Bede, B.4
  • 23
    • 0035273403
    • On-line learning control by association and reinforcement
    • J. Si, and Y.T. Wang On-line learning control by association and reinforcement IEEE Transactions on Neural Networks 12 2001 264 276
    • (2001) IEEE Transactions on Neural Networks , vol.12 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 25
    • 77950630017
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • K.G. Vamvoudakis, and F.L. Lewis Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem Automatica 46 2010 878 888
    • (2010) Automatica , vol.46 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 26
    • 0036565019
    • Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
    • G.K. Venayagamoorthy, R.G. Harley, and D.C. Wunsch Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator IEEE Transactions on Neural Networks 13 2002 764 773
    • (2002) IEEE Transactions on Neural Networks , vol.13 , pp. 764-773
    • Venayagamoorthy, G.K.1    Harley, R.G.2    Wunsch, D.C.3
  • 27
    • 67349145396
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • D. Vrabie, and F.L. Lewis Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems Neural Networks 22 2009 237 246
    • (2009) Neural Networks , vol.22 , pp. 237-246
    • Vrabie, D.1    Lewis, F.L.2
  • 28
    • 79957815678
    • Optimal control for a class of unknown nonlinear systems via the iterative GDHP algorithm
    • Guilin, China
    • D. Wang, D. Liu, Optimal control for a class of unknown nonlinear systems via the iterative GDHP algorithm, in: Proceedings of 8th International Symposium on Neural Networks, Guilin, China, 2011, pp. 630-639.
    • (2011) Proceedings of 8th International Symposium on Neural Networks , pp. 630-639
    • Wang, D.1    Liu, D.2
  • 29
    • 80053069436
    • Adaptive dynamic programming for finite-horizon optimal tracking control of a class of nonlinear systems
    • Yantai, China
    • D. Wang, D. Liu, Q. Wei, Adaptive dynamic programming for finite-horizon optimal tracking control of a class of nonlinear systems, in: Proceedings of the 30th Chinese Control Conference, Yantai, China, 2011, pp. 2450-2455.
    • (2011) Proceedings of the 30th Chinese Control Conference , pp. 2450-2455
    • Wang, D.1    Liu, D.2    Wei, Q.3
  • 31
    • 78651311269
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • F.Y. Wang, N. Jin, D. Liu, and Q. Wei Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound IEEE Transactions on Neural Networks 22 2011 24 36
    • (2011) IEEE Transactions on Neural Networks , vol.22 , pp. 24-36
    • Wang, F.Y.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 32
    • 34250731840
    • A fuzzy actor-critic reinforcement learning network
    • X.S. Wang, Y.H. Cheng, and J.Q. Yi A fuzzy actor-critic reinforcement learning network Information Sciences 177 2007 3764 3781
    • (2007) Information Sciences , vol.177 , pp. 3764-3781
    • Wang, X.S.1    Cheng, Y.H.2    Yi, J.Q.3
  • 33
    • 0002557583
    • Advanced forecasting methods for global crisis warning and models of intelligence
    • P.J. Werbos Advanced forecasting methods for global crisis warning and models of intelligence General Systems Yearbook 22 1977 25 38
    • (1977) General Systems Yearbook , vol.22 , pp. 25-38
    • Werbos, P.J.1
  • 34
    • 0002031779
    • Approximate dynamic programming for real-time control and neural modeling
    • D.A. White, D.A. Sofge, Van Nostrand Reinhold New York
    • P.J. Werbos Approximate dynamic programming for real-time control and neural modeling D.A. White, D.A. Sofge, Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches 1992 Van Nostrand Reinhold New York
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Werbos, P.J.1
  • 35
    • 76349113332
    • A modified gradient-based neuro-fuzzy learning algorithm and its convergence
    • W. Wu, L. Li, J. Yang, and Y. Liu A modified gradient-based neuro-fuzzy learning algorithm and its convergence Information Sciences 180 2010 1630 1642
    • (2010) Information Sciences , vol.180 , pp. 1630-1642
    • Wu, W.1    Li, L.2    Yang, J.3    Liu, Y.4
  • 36
    • 70349253929
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • H. Zhang, Y. Luo, and D. Liu Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints IEEE Transactions on Neural Networks 20 2009 1490 1503
    • (2009) IEEE Transactions on Neural Networks , vol.20 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.