Volume 26, Issue 4, 2015, Pages 866-879

Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems

Author keywords

Adaptive critic designs; adaptive dynamic programming (ADP); approximate dynamic programming; generalized policy iteration; neural networks (NNs); neurodynamic programming; nonlinear systems; optimal control; reinforcement learning

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; ALGORITHMS; DISCRETE TIME CONTROL SYSTEMS; ITERATIVE METHODS; NONLINEAR SYSTEMS; REINFORCEMENT LEARNING

EID: 85027953921     PISSN: 2162237X     EISSN: 21622388     Source Type: Journal    
DOI: 10.1109/TNNLS.2015.2401334     Document Type: Article
Times cited: 140

References (72)
  • 1
    • 0002557583
    • Advanced forecasting methods for global crisis warning and models of intelligence
    • P. J. Werbos, "Advanced forecasting methods for global crisis warning and models of intelligence," General Syst. Yearbook, vol. 22, pp. 25-38, 1977.
    • (1977) General Syst. Yearbook , vol.22 , pp. 25-38
    • Werbos, P.J.1
  • 2
    • 0002011091
    • A menu of designs for reinforcement learning over time
    • W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press, 1991, pp. 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 3
    • 84887996993
    • An equivalence between adaptive dynamic programming with a critic and backpropagation through time
    • Dec.
    • M. Fairbank, E. Alonso, and D. Prokhorov, "An equivalence between adaptive dynamic programming with a critic and backpropagation through time," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 12, pp. 2088-2100, Dec. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.12 , pp. 2088-2100
    • Fairbank, M.1    Alonso, E.2    Prokhorov, D.3
  • 4
    • 84880065287
    • Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics
    • Jan.
    • A. Heydari and S. N. Balakrishnan, "Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 1, pp. 145-157, Jan. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.1 , pp. 145-157
    • Heydari, A.1    Balakrishnan, S.N.2
  • 5
    • 84862798631
    • Fusion of multiple behaviors using layered reinforcement learning
    • Jul.
    • K.-S. Hwang, Y.-J. Chen, and C.-J. Wu, "Fusion of multiple behaviors using layered reinforcement learning," IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 42, no. 4, pp. 999-1004, Jul. 2012.
    • (2012) IEEE Trans. Syst., Man, Cybern. A, Syst., Humans , vol.42 , Issue.4 , pp. 999-1004
    • Hwang, K.-S.1    Chen, Y.-J.2    Wu, C.-J.3
  • 6
    • 84881555023
    • Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
    • Apr.
    • D. Liu and Q. Wei, "Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 43, no. 2, pp. 779-789, Apr. 2013.
    • (2013) IEEE Trans. Cybern. , vol.43 , Issue.2 , pp. 779-789
    • Liu, D.1    Wei, Q.2
  • 7
    • 26844483839
    • A self-learning call admission control scheme for CDMA cellular networks
    • Sep.
    • D. Liu, Y. Zhang, and H. Zhang, "A self-learning call admission control scheme for CDMA cellular networks," IEEE Trans. Neural Netw., vol. 16, no. 5, pp. 1219-1228, Sep. 2005.
    • (2005) IEEE Trans. Neural Netw. , vol.16 , Issue.5 , pp. 1219-1228
    • Liu, D.1    Zhang, Y.2    Zhang, H.3
  • 9
    • 84884922436
    • Online learning control using adaptive critic designs with sparse kernel machines
    • May
    • X. Xu, Z. Hou, C. Lian, and H. He, "Online learning control using adaptive critic designs with sparse kernel machines," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 5, pp. 762-775, May 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.5 , pp. 762-775
    • Xu, X.1    Hou, Z.2    Lian, C.3    He, H.4
  • 10
    • 0031236002
    • Adaptive critic designs
    • Sep.
    • D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
    • (1997) IEEE Trans. Neural Netw. , vol.8 , Issue.5 , pp. 997-1007
    • Prokhorov, D.V.1    Wunsch, D.C.2
  • 11
    • 84881029414
    • Two-level dynamic stochastic optimal power flow control for power systems with intermittent renewable generation
    • Aug.
    • J. Liang, D. D. Molina, G. K. Venayagamoorthy, and R. G. Harley, "Two-level dynamic stochastic optimal power flow control for power systems with intermittent renewable generation," IEEE Trans. Power Syst., vol. 28, no. 3, pp. 2670-2678, Aug. 2013.
    • (2013) IEEE Trans. Power Syst. , vol.28 , Issue.3 , pp. 2670-2678
    • Liang, J.1    Molina, D.D.2    Venayagamoorthy, G.K.3    Harley, R.G.4
  • 12
    • 84887990637
    • Goal representation heuristic dynamic programming on maze navigation
    • Dec.
    • Z. Ni, H. He, J. Wen, and X. Xu, "Goal representation heuristic dynamic programming on maze navigation," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 12, pp. 2038-2050, Dec. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.12 , pp. 2038-2050
    • Ni, Z.1    He, H.2    Wen, J.3    Xu, X.4
  • 13
    • 84876149222
    • Adaptive learning in tracking control based on the dual critic network design
    • Jun.
    • Z. Ni, H. He, and J. Wen, "Adaptive learning in tracking control based on the dual critic network design," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 6, pp. 913-928, Jun. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.6 , pp. 913-928
    • Ni, Z.1    He, H.2    Wen, J.3
  • 14
    • 84906778934
    • Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification
    • Oct.
    • Q. Wei and D. Liu, "Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1020-1036, Oct. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.4 , pp. 1020-1036
    • Wei, Q.1    Liu, D.2
  • 15
    • 0141862255
    • Automated web navigation using multiagent adaptive dynamic programming
    • May
    • J. Varghese and S. Mukhopadhyay, "Automated web navigation using multiagent adaptive dynamic programming," IEEE Trans. Syst., Man, Cybern. A, Syst., Humans, vol. 33, no. 3, pp. 412-417, May 2003.
    • (2003) IEEE Trans. Syst., Man, Cybern. A, Syst., Humans , vol.33 , Issue.3 , pp. 412-417
    • Varghese, J.1    Mukhopadhyay, S.2
  • 16
    • 84898013913
    • Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems
    • May
    • Q. Wei and D. Liu, "Stable iterative adaptive dynamic programming algorithm with approximation errors for discrete-time nonlinear systems," Neural Comput. Appl., vol. 24, no. 6, pp. 1355-1367, May 2014.
    • (2014) Neural Comput. Appl. , vol.24 , Issue.6 , pp. 1355-1367
    • Wei, Q.1    Liu, D.2
  • 17
    • 84912122528
    • Finite-approximation-error-based discrete-time iterative adaptive dynamic programming
    • Dec.
    • Q. Wei, F.-Y. Wang, D. Liu, and X. Yang, "Finite-approximation-error-based discrete-time iterative adaptive dynamic programming," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2820-2833, Dec. 2014.
    • (2014) IEEE Trans. Cybern. , vol.44 , Issue.12 , pp. 2820-2833
    • Wei, Q.1    Wang, F.-Y.2    Liu, D.3    Yang, X.4
  • 18
    • 84865467087
    • Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
    • Oct.
    • Y. Jiang and Z.-P. Jiang, "Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics," Automatica, vol. 48, no. 10, pp. 2699-2704, Oct. 2012.
    • (2012) Automatica , vol.48 , Issue.10 , pp. 2699-2704
    • Jiang, Y.1    Jiang, Z.-P.2
  • 19
    • 84877914583
    • Robust adaptive dynamic programming with an application to power systems
    • Jul.
    • Y. Jiang and Z.-P. Jiang, "Robust adaptive dynamic programming with an application to power systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 7, pp. 1150-1156, Jul. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.7 , pp. 1150-1156
    • Jiang, Y.1    Jiang, Z.-P.2
  • 20
    • 82755160758
    • Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
    • Feb.
    • D. Wang, D. Liu, and Q. Wei, "Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach," Neurocomputing, vol. 78, no. 1, pp. 14-22, Feb. 2012.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 14-22
    • Wang, D.1    Liu, D.2    Wei, Q.3
  • 21
    • 84876066909
    • Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm
    • Jun.
    • D. Liu, H. Li, and D. Wang, "Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm," Neurocomputing, vol. 110, pp. 92-100, Jun. 2013.
    • (2013) Neurocomputing , vol.110 , pp. 92-100
    • Liu, D.1    Li, H.2    Wang, D.3
  • 22
    • 84868467610
    • An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
    • Jan.
    • D. Liu, D. Wang, and X. Yang, "An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs," Inf. Sci., vol. 220, pp. 331-342, Jan. 2013.
    • (2013) Inf. Sci. , vol.220 , pp. 331-342
    • Liu, D.1    Wang, D.2    Yang, X.3
  • 23
    • 84864489666
    • Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    • Aug.
    • D. Wang, D. Liu, Q. Wei, D. Zhao, and N. Jin, "Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming," Automatica, vol. 48, no. 8, pp. 1825-1832, Aug. 2012.
    • (2012) Automatica , vol.48 , Issue.8 , pp. 1825-1832
    • Wang, D.1    Liu, D.2    Wei, Q.3    Zhao, D.4    Jin, N.5
  • 24
    • 84891557251
    • Kernel-based approximate dynamic programming for real-time online learning control: An experimental study
    • Jan.
    • X. Xu, C. Lian, L. Zuo, and H. He, "Kernel-based approximate dynamic programming for real-time online learning control: An experimental study," IEEE Trans. Control Syst. Technol., vol. 22, no. 1, pp. 146-156, Jan. 2014.
    • (2014) IEEE Trans. Control Syst. Technol. , vol.22 , Issue.1 , pp. 146-156
    • Xu, X.1    Lian, C.2    Zuo, L.3    He, H.4
  • 25
    • 84875054856
    • Intelligent local area signals based damping of power system oscillations using virtual generators and approximate dynamic programming
    • Jan.
    • D. Molina, G. K. Venayagamoorthy, J. Liang, and R. G. Harley, "Intelligent local area signals based damping of power system oscillations using virtual generators and approximate dynamic programming," IEEE Trans. Smart Grid, vol. 4, no. 1, pp. 498-508, Jan. 2013.
    • (2013) IEEE Trans. Smart Grid , vol.4 , Issue.1 , pp. 498-508
    • Molina, D.1    Venayagamoorthy, G.K.2    Liang, J.3    Harley, R.G.4
  • 27
    • 84884958993
    • Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming
    • Mar.
    • H. Xu and S. Jagannathan, "Stochastic optimal controller design for uncertain nonlinear networked control system via neuro dynamic programming," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 3, pp. 471-484, Mar. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.3 , pp. 471-484
    • Xu, H.1    Jagannathan, S.2
  • 28
    • 0043026775
    • Helicopter trimming and tracking control using direct neural dynamic programming
    • Jul.
    • R. Enns and J. Si, "Helicopter trimming and tracking control using direct neural dynamic programming," IEEE Trans. Neural Netw., vol. 14, no. 4, pp. 929-939, Jul. 2003.
    • (2003) IEEE Trans. Neural Netw. , vol.14 , Issue.4 , pp. 929-939
    • Enns, R.1    Si, J.2
  • 29
    • 84898853127
    • Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
    • Apr.
    • B. Kiumarsi, F. L. Lewis, H. Modares, A. Karimpour, and M.-B. Naghibi-Sistani, "Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics," Automatica, vol. 50, no. 4, pp. 1167-1175, Apr. 2014.
    • (2014) Automatica , vol.50 , Issue.4 , pp. 1167-1175
    • Kiumarsi, B.1    Lewis, F.L.2    Modares, H.3    Karimpour, A.4    Naghibi-Sistani, M.-B.5
  • 30
    • 84897663275
    • Reinforcement learning output feedback NN control using deterministic learning technique
    • Mar.
    • B. Xu, C. Yang, and Z. Shi, "Reinforcement learning output feedback NN control using deterministic learning technique," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 635-641, Mar. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.3 , pp. 635-641
    • Xu, B.1    Yang, C.2    Shi, Z.3
  • 31
    • 84904398037
    • Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics
    • Jul.
    • H. Li, D. Liu, and D. Wang, "Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 3, pp. 706-714, Jul. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.3 , pp. 706-714
    • Li, H.1    Liu, D.2    Wang, D.3
  • 32
    • 84875270081
    • Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update
    • Jul.
    • T. Dierks and S. Jagannathan, "Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp. 1118-1129, Jul. 2012.
    • (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.7 , pp. 1118-1129
    • Dierks, T.1    Jagannathan, S.2
  • 33
    • 84904706555
    • Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics
    • Aug.
    • D. Liu, H. Li, and D. Wang, "Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics," IEEE Trans. Syst., Man, Cybern. A, Syst., vol. 44, no. 8, pp. 1015-1027, Aug. 2014.
    • (2014) IEEE Trans. Syst., Man, Cybern. A, Syst. , vol.44 , Issue.8 , pp. 1015-1027
    • Liu, D.1    Li, H.2    Wang, D.3
  • 34
    • 84893640946
    • Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach
    • Feb.
    • D. Liu, D. Wang, and H. Li, "Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 2, pp. 418-428, Feb. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.2 , pp. 418-428
    • Liu, D.1    Wang, D.2    Li, H.3
  • 35
    • 84893708995
    • Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
    • Jan.
    • H. Modares, F. L. Lewis, and M.-B. Naghibi-Sistani, "Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems," Automatica, vol. 50, no. 1, pp. 193-202, Jan. 2014.
    • (2014) Automatica , vol.50 , Issue.1 , pp. 193-202
    • Modares, H.1    Lewis, F.L.2    Naghibi-Sistani, M.-B.3
  • 36
    • 84862811062
    • An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
    • Aug.
    • Q. Wei and D. Liu, "An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state," Neural Netw., vol. 32, pp. 236-244, Aug. 2012.
    • (2012) Neural Netw. , vol.32 , pp. 236-244
    • Wei, Q.1    Liu, D.2
  • 37
    • 61849184281
    • Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
    • Mar.
    • Q. Wei, H. Zhang, and J. Dai, "Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions," Neurocomputing, vol. 72, nos. 7-9, pp. 1839-1848, Mar. 2009.
    • (2009) Neurocomputing , vol.72 , Issue.7-9 , pp. 1839-1848
    • Wei, Q.1    Zhang, H.2    Dai, J.3
  • 38
    • 84908658175
    • A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems
    • Oct.
    • Q. Wei and D. Liu, "A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1176-1190, Oct. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.4 , pp. 1176-1190
    • Wei, Q.1    Liu, D.2
  • 39
    • 78650805234
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Jan.
    • H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, Jan. 2011.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 40
    • 84902352795
    • Data-driven neuro-optimal temperature control of water-gas shift reaction using stable iterative adaptive dynamic programming
    • Nov.
    • Q. Wei and D. Liu, "Data-driven neuro-optimal temperature control of water-gas shift reaction using stable iterative adaptive dynamic programming," IEEE Trans. Ind. Electron., vol. 61, no. 11, pp. 6399-6408, Nov. 2014.
    • (2014) IEEE Trans. Ind. Electron. , vol.61 , Issue.11 , pp. 6399-6408
    • Wei, Q.1    Liu, D.2
  • 41
    • 84961289199
    • Neural-network-based adaptive optimal tracking control scheme for discrete-time nonlinear systems with approximation errors
    • Feb.
    • Q. Wei and D. Liu, "Neural-network-based adaptive optimal tracking control scheme for discrete-time nonlinear systems with approximation errors," Neurocomputing, vol. 149, pp. 106-115, Feb. 2015.
    • (2015) Neurocomputing , vol.149 , pp. 106-115
    • Wei, Q.1    Liu, D.2
  • 42
    • 84912073419
    • Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems
    • Dec.
    • D. Liu, D. Wang, F.-Y. Wang, H. Li, and X. Yang, "Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2834-2847, Dec. 2014.
    • (2014) IEEE Trans. Cybern. , vol.44 , Issue.12 , pp. 2834-2847
    • Liu, D.1    Wang, D.2    Wang, F.-Y.3    Li, H.4    Yang, X.5
  • 43
    • 84887035183
    • Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints
    • Nov.
    • X. Yang, D. Liu, and Y. Huang, "Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints," IET Control Theory Appl., vol. 7, no. 17, pp. 2037-2047, Nov. 2013.
    • (2013) IET Control Theory Appl. , vol.7 , Issue.17 , pp. 2037-2047
    • Yang, X.1    Liu, D.2    Huang, Y.3
  • 44
    • 84883537695
    • Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers
    • Dec.
    • F. L. Lewis, D. Vrabie, and K. G. Vamvoudakis, "Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers," IEEE Control Syst., vol. 32, no. 6, pp. 76-105, Dec. 2012.
    • (2012) IEEE Control Syst. , vol.32 , Issue.6 , pp. 76-105
    • Lewis, F.L.1    Vrabie, D.2    Vamvoudakis, K.G.3
  • 47
    • 14844340822
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • May
    • M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 48
    • 84862846667
    • Adaptive cooperative tracking control of higher-order nonlinear systems with unknown dynamics
    • Jul.
    • H. Zhang and F. L. Lewis, "Adaptive cooperative tracking control of higher-order nonlinear systems with unknown dynamics," Automatica, vol. 48, no. 7, pp. 1432-1439, Jul. 2012.
    • (2012) Automatica , vol.48 , Issue.7 , pp. 1432-1439
    • Zhang, H.1    Lewis, F.L.2
  • 49
    • 79960897012
    • Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
    • Aug.
    • K. G. Vamvoudakis and F. L. Lewis, "Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, Aug. 2011.
    • (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 50
    • 84864491417
    • Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality
    • Aug.
    • K. G. Vamvoudakis, F. L. Lewis, and G. R. Hudas, "Multi-agent differential graphical games: Online adaptive learning solution for synchronization with optimality," Automatica, vol. 48, no. 8, pp. 1598-1611, Aug. 2012.
    • (2012) Automatica , vol.48 , Issue.8 , pp. 1598-1611
    • Vamvoudakis, K.G.1    Lewis, F.L.2    Hudas, G.R.3
  • 51
    • 84897594646
    • Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
    • Mar.
    • D. Liu and Q. Wei, "Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 621-634, Mar. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.3 , pp. 621-634
    • Liu, D.1    Wei, Q.2
  • 52
    • 84906781179
    • Adaptive dynamic programming for a class of complex-valued nonlinear systems
    • Sep.
    • R. Song, W. Xiao, H. Zhang, and C. Sun, "Adaptive dynamic programming for a class of complex-valued nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 9, pp. 1733-1739, Sep. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.9 , pp. 1733-1739
    • Song, R.1    Xiao, W.2    Zhang, H.3    Sun, C.4
  • 53
    • 49049089962
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Aug.
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 54
    • 33747862706
    • Relaxing dynamic programming
    • Aug.
    • B. Lincoln and A. Rantzer, "Relaxing dynamic programming," IEEE Trans. Autom. Control, vol. 51, no. 8, pp. 1249-1260, Aug. 2006.
    • (2006) IEEE Trans. Autom. Control , vol.51 , Issue.8 , pp. 1249-1260
    • Lincoln, B.1    Rantzer, A.2
  • 55
    • 49049119493
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • Aug.
    • H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 56
    • 84863467146
    • Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
    • Jul.
    • D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng. , vol.9 , Issue.3 , pp. 628-634
    • Liu, D.1    Wang, D.2    Zhao, D.3    Wei, Q.4    Jin, N.5
  • 57
    • 84873876532
    • Optimal stopping under partial observation: Near-value iteration
    • Feb.
    • E. Zhou, "Optimal stopping under partial observation: Near-value iteration," IEEE Trans. Autom. Control, vol. 58, no. 2, pp. 500-506, Feb. 2013.
    • (2013) IEEE Trans. Autom. Control , vol.58 , Issue.2 , pp. 500-506
    • Zhou, E.1
  • 58
    • 84871319455
    • A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    • Jan.
    • S. Bhasin, R. Kamalapurkar, M. Johnson, K. G. Vamvoudakis, F. L. Lewis, and W. E. Dixon, "A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems," Automatica, vol. 49, no. 1, pp. 82-92, Jan. 2013.
    • (2013) Automatica , vol.49 , Issue.1 , pp. 82-92
    • Bhasin, S.1    Kamalapurkar, R.2    Johnson, M.3    Vamvoudakis, K.G.4    Lewis, F.L.5    Dixon, W.E.6
  • 59
    • 84885176157
    • Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks
    • Oct.
    • H. Modares, F. L. Lewis, and M.-B. Naghibi-Sistani, "Adaptive optimal control of unknown constrained-input systems using policy iteration and neural networks," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 10, pp. 1513-1525, Oct. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.10 , pp. 1513-1525
    • Modares, H.1    Lewis, F.L.2    Naghibi-Sistani, M.-B.3
  • 60
    • 84883327795
    • Numerical adaptive learning control scheme for discrete-time non-linear systems
    • Jul.
    • Q. Wei and D. Liu, "Numerical adaptive learning control scheme for discrete-time non-linear systems," IET Control Theory Appl., vol. 7, no. 11, pp. 1472-1486, Jul. 2013.
    • (2013) IET Control Theory Appl. , vol.7 , Issue.11 , pp. 1472-1486
    • Wei, Q.1    Liu, D.2
  • 61
    • 77950630017
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • May
    • K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 62
    • 85028161223
    • Multi-battery optimal coordination control for home energy management systems via distributed iterative adaptive dynamic programming
    • to be published
    • Q. Wei, D. Liu, G. Shi, and Y. Liu, "Multi-battery optimal coordination control for home energy management systems via distributed iterative adaptive dynamic programming," IEEE Trans. Ind. Electron., to be published.
    • IEEE Trans. Ind. Electron.
    • Wei, Q.1    Liu, D.2    Shi, G.3    Liu, Y.4
  • 63
    • 85028167137
    • A novel dual iterative Q-learning method for optimal battery management in smart residential environments
    • to be published
    • Q. Wei, D. Liu, and G. Shi, "A novel dual iterative Q-learning method for optimal battery management in smart residential environments," IEEE Trans. Ind. Electron., to be published.
    • IEEE Trans. Ind. Electron.
    • Wei, Q.1    Liu, D.2    Shi, G.3
  • 64
    • 84919687575
    • Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems
    • Jan.
    • B. Kiumarsi and F. L. Lewis, "Actor-critic-based optimal tracking for partially unknown nonlinear discrete-time systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 26, no. 1, pp. 140-151, Jan. 2015.
    • (2015) IEEE Trans. Neural Netw. Learn. Syst. , vol.26 , Issue.1 , pp. 140-151
    • Kiumarsi, B.1    Lewis, F.L.2
  • 65
    • 84884157580
    • Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique
    • Dec.
    • D. Wang and D. Liu, "Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique," Neurocomputing, vol. 121, pp. 218-225, Dec. 2013.
    • (2013) Neurocomputing , vol.121 , pp. 218-225
    • Wang, D.1    Liu, D.2
  • 66
    • 84878421441
    • Optimal control for discrete-time affine non-linear systems using general value iteration
    • Dec.
    • H. Li and D. Liu, "Optimal control for discrete-time affine non-linear systems using general value iteration," IET Control Theory Appl., vol. 6, no. 18, pp. 2725-2736, Dec. 2012.
    • (2012) IET Control Theory Appl. , vol.6 , Issue.18 , pp. 2725-2736
    • Li, H.1    Liu, D.2
  • 68
    • 84867400046
    • Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
    • Nov.
    • J. Y. Lee, J. B. Park, and Y. H. Choi, "Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems," Automatica, vol. 48, no. 11, pp. 2850-2859, Nov. 2012.
    • (2012) Automatica , vol.48 , Issue.11 , pp. 2850-2859
    • Lee, J.Y.1    Park, J.B.2    Choi, Y.H.3
  • 70
    • 0004147916
    • Boston, MA, USA: Addison-Wesley
    • nd ed. Boston, MA, USA: Addison-Wesley, 1974.
    • (1974) nd Ed.
    • Apostol, T.M.1
  • 71
    • 0035273403
    • Online learning control by association and reinforcement
    • Mar.
    • J. Si and Y.-T. Wang, "Online learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
    • (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.