메뉴 건너뛰기




Volumn 26, Issue 4, 2015, Pages 851-865

Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data

Author keywords

Actor critic; approximate dynamic programming (ADP); category; optimal control; shunting inhibitory artificial neural network (SIANN)

Indexed keywords

CONTINUOUS TIME SYSTEMS; NEURAL NETWORKS; NONLINEAR SYSTEMS; RECURRENT NEURAL NETWORKS;

EID: 85027929469     PISSN: 2162237X     EISSN: 21622388     Source Type: Journal    
DOI: 10.1109/TNNLS.2015.2399020     Document Type: Article
Times cited : (138)

References (68)
  • 1
    • 84897663275 scopus 로고    scopus 로고
    • Reinforcement learning output feedback NN control using deterministic learning technique
    • Mar.
    • B. Xu, C. Yang, and Z. Shi, "Reinforcement learning output feedback NN control using deterministic learning technique," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 635-641, Mar. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.3 , pp. 635-641
    • Xu, B.1    Yang, C.2    Shi, Z.3
  • 3
    • 0002011091 scopus 로고
    • A Menu of Designs for Reinforcement Learning over time
    • W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press, 1991, pp. 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 4
    • 0036618011 scopus 로고    scopus 로고
    • Multiple model-based reinforcement learning
    • K. Doya, K. Samejima, K.-I. Katagiri, and M. Kawato, "Multiple model-based reinforcement learning," Neural Comput., vol. 14, no. 6, pp. 1347-1369, 2002.
    • (2002) Neural Comput , vol.14 , Issue.6 , pp. 1347-1369
    • Doya, K.1    Samejima, K.2    Katagiri, K.-I.3    Kawato, M.4
  • 5
    • 0035422340 scopus 로고    scopus 로고
    • Neural mechanisms of learning and control
    • Aug
    • K. Doya, H. Kimura, and M. Kawato, "Neural mechanisms of learning and control," IEEE Control Syst., vol. 21, no. 4, pp. 42-54, Aug. 2001.
    • (2001) IEEE Control Syst , vol.21 , Issue.4 , pp. 42-54
    • Doya, K.1    Kimura, H.2    Kawato, M.3
  • 6
    • 84897594646 scopus 로고    scopus 로고
    • Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
    • Mar
    • D. Liu and Q. Wei, "Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 621-634, Mar. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.3 , pp. 621-634
    • Liu, D.1    Wei, Q.2
  • 7
    • 84883327795 scopus 로고    scopus 로고
    • Numerical adaptive learning control scheme for discrete-time non-linear systems
    • Jul
    • Q. Wei and D. Liu, "Numerical adaptive learning control scheme for discrete-time non-linear systems," IET Control Theory Appl., vol. 7, no. 11, pp. 1472-1486, Jul. 2013.
    • (2013) IET Control Theory Appl , vol.7 , Issue.11 , pp. 1472-1486
    • Wei, Q.1    Liu, D.2
  • 8
    • 84862811062 scopus 로고    scopus 로고
    • An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
    • Aug
    • Q. Wei and D. Liu, "An iterative ε-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state," Neural Netw., vol. 32, pp. 236-244, Aug. 2012.
    • (2012) Neural Netw , vol.32 , pp. 236-244
    • Wei, Q.1    Liu, D.2
  • 9
    • 84881555023 scopus 로고    scopus 로고
    • Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
    • Apr
    • D. Liu and Q. Wei, "Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 43, no. 2, pp. 779-789, Apr. 2013.
    • (2013) IEEE Trans. Cybern. , vol.43 , Issue.2 , pp. 779-789
    • Liu, D.1    Wei, Q.2
  • 10
    • 84899471403 scopus 로고    scopus 로고
    • Robust adaptive dynamic programming and feedback stabilization of nonlinear systems
    • Jan
    • Y. Jiang and Z.-P. Jiang, "Robust adaptive dynamic programming and feedback stabilization of nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 5, pp. 882-893, Jan. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.5 , pp. 882-893
    • Jiang, Y.1    Jiang, Z.-P.2
  • 11
    • 84904389431 scopus 로고    scopus 로고
    • Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming
    • Jul
    • H. Zhang, C. Qing, and Y. Luo, "Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 3, pp. 839-849, Jul. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.3 , pp. 839-849
    • Zhang, H.1    Qing, C.2    Luo, Y.3
  • 12
    • 84912048350 scopus 로고    scopus 로고
    • ∞ state feedback control of unknown affine nonlinear discrete-time systems
    • Dec
    • ∞ state feedback control of unknown affine nonlinear discrete-time systems," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2706-2718, Dec. 2014.
    • (2014) IEEE Trans. Cybern. , vol.44 , Issue.12 , pp. 2706-2718
    • Zhang, H.1    Qing, C.2    Jiang, B.3    Luo, Y.4
  • 13
    • 77950630017 scopus 로고    scopus 로고
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • May
    • K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem,"Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 14
    • 84908120758 scopus 로고    scopus 로고
    • Adaptive dynamic programming and optimal control of nonlinear nonaffine systems
    • T. Bian, Y. Jiang, and Z.-P. Jiang, "Adaptive dynamic programming and optimal control of nonlinear nonaffine systems," Automatica, vol. 50, no. 10, pp. 2624-2632, 2014.
    • (2014) Automatica , vol.50 , Issue.10 , pp. 2624-2632
    • Bian, T.1    Jiang, Y.2    Jiang, Z.-P.3
  • 15
    • 83655163786 scopus 로고    scopus 로고
    • Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
    • Dec
    • H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2226-2236, Dec. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.12 , pp. 2226-2236
    • Zhang, H.1    Cui, L.2    Zhang, X.3    Luo, Y.4
  • 16
    • 84883537695 scopus 로고    scopus 로고
    • Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers
    • Dec
    • F. L. Lewis, D. Vrabie, and K. G. Vamvoudakis, "Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers," IEEE Control Syst., vol. 32, no. 6, pp. 76-105, Dec. 2012.
    • (2012) IEEE Control Syst , vol.32 , Issue.6 , pp. 76-105
    • Lewis, F.L.1    Vrabie, D.2    Vamvoudakis, K.G.3
  • 17
    • 84908658175 scopus 로고    scopus 로고
    • A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems
    • Oct
    • Q. Wei and D. Liu, "A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1176-1190, Oct. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.4 , pp. 1176-1190
    • Wei, Q.1    Liu, D.2
  • 18
    • 84884901270 scopus 로고    scopus 로고
    • Robust adaptive dynamic programming for linear and nonlinear systems: An overview
    • Z.-P. Jiang and Y. Jiang, "Robust adaptive dynamic programming for linear and nonlinear systems: An overview," Eur. J. Control, vol. 19, no. 5, pp. 417-425, 2013.
    • (2013) Eur. J. Control , vol.19 , Issue.5 , pp. 417-425
    • Jiang, Z.-P.1    Jiang, Y.2
  • 19
    • 84961378056 scopus 로고    scopus 로고
    • Leader-based optimal coordination control for the consensus problem of multiagent differential games via fuzzy adaptive dynamic programming
    • Feb
    • H. Zhang, J. Zhang, G.-H. Yang, and Y. Luo, "Leader-based optimal coordination control for the consensus problem of multiagent differential games via fuzzy adaptive dynamic programming," IEEE Trans. Fuzzy Syst., vol. 23, no. 1, pp. 152-163, Feb. 2015.
    • (2015) IEEE Trans. Fuzzy Syst. , vol.23 , Issue.1 , pp. 152-163
    • Zhang, H.1    Zhang, J.2    Yang, G.-H.3    Luo, Y.4
  • 20
    • 83855165164 scopus 로고    scopus 로고
    • Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming
    • Dec
    • H. Zhang, R. Song, Q. Wei, and T. Zhang, "Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 1851-1862, Dec. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.12 , pp. 1851-1862
    • Zhang, H.1    Song, R.2    Wei, Q.3    Zhang, T.4
  • 21
    • 82755160758 scopus 로고    scopus 로고
    • Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach
    • Feb
    • D. Wang, D. Liu, and Q. Wei, "Finite-horizon neuro-optimal tracking control for a class of discrete-time nonlinear systems using adaptive dynamic programming approach," Neurocomputing, vol. 78, no. 1, pp. 14-22, Feb. 2012.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 14-22
    • Wang, D.1    Liu, D.2    Wei, Q.3
  • 22
    • 14844340822 scopus 로고    scopus 로고
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, 2005.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 23
    • 78649933699 scopus 로고    scopus 로고
    • Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming
    • Oct
    • R. Song, H. Zhang, Y. Luo, and Q. Wei, "Optimal control laws for time-delay systems with saturating actuators based on heuristic dynamic programming," Neurocomputing, vol. 73, nos. 16-18, pp. 3020-3027, Oct. 2010.
    • (2010) Neurocomputing , vol.73 , Issue.16-18 , pp. 3020-3027
    • Song, R.1    Zhang, H.2    Luo, Y.3    Wei, Q.4
  • 24
    • 84868467610 scopus 로고    scopus 로고
    • An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
    • Jan
    • D. Liu, D. Wang, and X. Yang, "An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs," Inf. Sci., vol. 220, pp. 331-342, Jan. 2013.
    • (2013) Inf. Sci. , vol.220 , pp. 331-342
    • Liu, D.1    Wang, D.2    Yang, X.3
  • 25
    • 84885923700 scopus 로고    scopus 로고
    • Multi-objective optimal control for a class of nonlinear time-delay systems via adaptive dynamic programming
    • R. Song, W. Xiao, and Q. Wei, "Multi-objective optimal control for a class of nonlinear time-delay systems via adaptive dynamic programming,"Soft Comput., vol. 17, no. 11, pp. 2109-2115, 2013.
    • (2013) Soft Comput , vol.17 , Issue.11 , pp. 2109-2115
    • Song, R.1    Xiao, W.2    Wei, Q.3
  • 26
    • 84881548237 scopus 로고    scopus 로고
    • Multi-objective optimal control for a class of unknown nonlinear systems based on finite-approximation-error ADP algorithm
    • Nov
    • R. Song, W. Xiao, and H. Zhang, "Multi-objective optimal control for a class of unknown nonlinear systems based on finite-approximation-error ADP algorithm," Neurocomputing, vol. 119, no. 7, pp. 212-221, Nov. 2013.
    • (2013) Neurocomputing , vol.119 , Issue.7 , pp. 212-221
    • Song, R.1    Xiao, W.2    Zhang, H.3
  • 27
    • 84906781179 scopus 로고    scopus 로고
    • Adaptive dynamic programming for a class of complex-valued nonlinear systems
    • Sep
    • R. Song, W. Xiao, H. Zhang, and C. Sun, "Adaptive dynamic programming for a class of complex-valued nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 9, pp. 1733-1739, Sep. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.9 , pp. 1733-1739
    • Song, R.1    Xiao, W.2    Zhang, H.3    Sun, C.4
  • 28
    • 84924872284 scopus 로고    scopus 로고
    • A novel dual iterative Q-learning method for optimal battery management in smart residential environments
    • Q. Wei, D. Liu, and G. Shi, "A novel dual iterative Q-learning method for optimal battery management in smart residential environments," IEEE Trans. Ind. Electron., doi: 10.1109/TIE. 2014.2361485, 2015.
    • (2015) IEEE Trans. Ind. Electron.
    • Wei, Q.1    Liu, D.2    Shi, G.3
  • 29
    • 84924872940 scopus 로고    scopus 로고
    • Decentralized adaptive optimal control of large-scale systems with application to power systems
    • T. Bian, Y. Jiang, and Z. P. Jiang, "Decentralized adaptive optimal control of large-scale systems with application to power systems," IEEE Trans. Ind. Electron., doi: 10.1109/TIE. 2014.2345343, 2014.
    • (2014) IEEE Trans. Ind. Electron.
    • Bian, T.1    Jiang, Y.2    Jiang, Z.P.3
  • 30
    • 84877914583 scopus 로고    scopus 로고
    • Robust adaptive dynamic programming with an application to power systems
    • Jul
    • Y. Jiang and Z.-P. Jiang, "Robust adaptive dynamic programming with an application to power systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 24, no. 7, pp. 1150-1156, Jul. 2013.
    • (2013) IEEE Trans. Neural Netw. Learn. Syst. , vol.24 , Issue.7 , pp. 1150-1156
    • Jiang, Y.1    Jiang, Z.-P.2
  • 31
    • 84912073419 scopus 로고    scopus 로고
    • Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems
    • Dec
    • D. Liu, D. Wang, F.-Y. Wang, H. Li, and X. Yang, "Neural-network-based online HJB solution for optimal robust guaranteed cost control of continuous-time uncertain nonlinear systems," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2834-2847, Dec. 2014.
    • (2014) IEEE Trans. Cybern. , vol.44 , Issue.12 , pp. 2834-2847
    • Liu, D.1    Wang, D.2    Wang, F.-Y.3    Li, H.4    Yang, X.5
  • 32
    • 84884157580 scopus 로고    scopus 로고
    • Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique
    • Dec
    • D. Wang and D. Liu, "Neuro-optimal control for a class of unknown nonlinear dynamic systems using SN-DHP technique," Neurocomputing, vol. 121, pp. 218-225, Dec. 2013.
    • (2013) Neurocomputing , vol.121 , pp. 218-225
    • Wang, D.1    Liu, D.2
  • 33
    • 84912122528 scopus 로고    scopus 로고
    • Finite-approximation-error-based discrete-time iterative adaptive dynamic programming
    • Dec
    • Q. Wei, F.-Y. Wang, D. Liu, and X. Yang, "Finite-approximation-error-based discrete-time iterative adaptive dynamic programming," IEEE Trans. Cybern., vol. 44, no. 12, pp. 2820-2833, Dec. 2014.
    • (2014) IEEE Trans. Cybern. , vol.44 , Issue.12 , pp. 2820-2833
    • Wei, Q.1    Wang, F.-Y.2    Liu, D.3    Yang, X.4
  • 34
    • 84902352795 scopus 로고    scopus 로고
    • Data-driven neuro-optimal temperature control of water-gas shift reaction using stable iterative adaptive dynamic programming
    • Nov
    • Q. Wei and D. Liu, "Data-driven neuro-optimal temperature control of water-gas shift reaction using stable iterative adaptive dynamic programming," IEEE Trans. Ind. Electron., vol. 61, no. 11, pp. 6399-6408, Nov. 2014.
    • (2014) IEEE Trans. Ind. Electron. , vol.61 , Issue.11 , pp. 6399-6408
    • Wei, Q.1    Liu, D.2
  • 35
    • 84876066909 scopus 로고    scopus 로고
    • Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm
    • Jun
    • D. Liu, H. Li, and D. Wang, "Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm," Neurocomputing, vol. 110, pp. 92-100, Jun. 2013.
    • (2013) Neurocomputing , vol.110 , pp. 92-100
    • Liu, D.1    Li, H.2    Wang, D.3
  • 36
    • 84878421441 scopus 로고    scopus 로고
    • Optimal control for discrete-time affine non-linear systems using general value iteration
    • Dec
    • H. Li and D. Liu, "Optimal control for discrete-time affine non-linear systems using general value iteration," IET Control Theory Appl., vol. 6, no. 18, pp. 2725-2736, Dec. 2012.
    • (2012) IET Control Theory Appl , vol.6 , Issue.18 , pp. 2725-2736
    • Li, H.1    Liu, D.2
  • 37
    • 84864489666 scopus 로고    scopus 로고
    • Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    • Aug
    • D. Wang, D. Liu, Q. Wei, D. Zhao, and N. Jin, "Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming," Automatica, vol. 48, no. 8, pp. 1825-1832, Aug. 2012.
    • (2012) Automatica , vol.48 , Issue.8 , pp. 1825-1832
    • Wang, D.1    Liu, D.2    Wei, Q.3    Zhao, D.4    Jin, N.5
  • 38
    • 84904706555 scopus 로고    scopus 로고
    • Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics
    • Aug
    • D. Liu, H. Li, and D. Wang, "Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics," IEEE Trans. Syst., Man, Cybern., Syst., vol. 44, no. 8, pp. 1015-1027, Aug. 2014.
    • (2014) IEEE Trans. Syst., Man, Cybern., Syst. , vol.44 , Issue.8 , pp. 1015-1027
    • Liu, D.1    Li, H.2    Wang, D.3
  • 39
    • 84893640946 scopus 로고    scopus 로고
    • Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach
    • Feb
    • D. Liu, D. Wang, and H. Li, "Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 2, pp. 418-428, Feb. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.2 , pp. 418-428
    • Liu, D.1    Wang, D.2    Li, H.3
  • 40
    • 84863467146 scopus 로고    scopus 로고
    • Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
    • Jul
    • D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng. , vol.9 , Issue.3 , pp. 628-634
    • Liu, D.1    Wang, D.2    Zhao, D.3    Wei, Q.4    Jin, N.5
  • 43
    • 67349247013 scopus 로고    scopus 로고
    • Intelligence in the brain: A theory of how it works and how to build it
    • P. J. Werbos, "Intelligence in the brain: A theory of how it works and how to build it," Neural Netw., vol. 22, no. 3, pp. 200-212, 2009.
    • (2009) Neural Netw , vol.22 , Issue.3 , pp. 200-212
    • Werbos, P.J.1
  • 44
    • 0032687566 scopus 로고    scopus 로고
    • Stable adaptive control using new critic designs
    • P. J. Werbos, "Stable adaptive control using new critic designs," in Proc. Adaptation, Noise, Self-Organizing Syst., vol. 3728, pp. 510-579, 1999.
    • (1999) Proc. Adaptation, Noise, Self-Organizing Syst. , vol.3728 , pp. 510-579
    • Werbos, P.J.1
  • 45
    • 0031075855 scopus 로고    scopus 로고
    • Adaptive control using multiple models
    • Feb
    • K. S. Narendra and J. Balakrishnan, "Adaptive control using multiple models," IEEE Trans. Autom. Control, vol. 42, no. 2, pp. 171-187, Feb. 1997.
    • (1997) IEEE Trans. Autom. Control , vol.42 , Issue.2 , pp. 171-187
    • Narendra, K.S.1    Balakrishnan, J.2
  • 46
    • 84860327685 scopus 로고    scopus 로고
    • The eMOSAIC model for humanoid robot control
    • May
    • N. Sugimoto, J. Morimoto, S.-H. Hyon, and M. Kawato, "The eMOSAIC model for humanoid robot control," Neural Netw., vols. 29-30, pp. 8-19, May 2012.
    • (2012) Neural Netw , vol.29-30 , pp. 8-19
    • Sugimoto, N.1    Morimoto, J.2    Hyon, S.-H.3    Kawato, M.4
  • 47
    • 0033213819 scopus 로고    scopus 로고
    • What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
    • Oct./Nov.
    • K. Doya, "What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?" Neural Netw., vol. 12, nos. 7-8, pp. 961-974, Oct./Nov. 1999.
    • (1999) Neural Netw , vol.12 , Issue.7-8 , pp. 961-974
    • Doya, K.1
  • 48
    • 0033214899 scopus 로고    scopus 로고
    • Parallel neural networks for learning sequential procedures
    • O. Hikosaka et al., "Parallel neural networks for learning sequential procedures," Trends Neurosci., vol. 22, no. 10, pp. 464-471, 1999.
    • (1999) Trends Neurosci , vol.22 , Issue.10 , pp. 464-471
    • Hikosaka, O.1
  • 49
    • 18444379381 scopus 로고    scopus 로고
    • Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes
    • Jul
    • J. M. Lee and J. H. Lee, "Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes,"Automatica, vol. 41, no. 7, pp. 1281-1288, Jul. 2005.
    • (2005) Automatica , vol.41 , Issue.7 , pp. 1281-1288
    • Lee, J.M.1    Lee, J.H.2
  • 50
    • 84904398037 scopus 로고    scopus 로고
    • Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics
    • Jul
    • H. Li, D. Liu, and D. Wang, "Integral reinforcement learning for linear continuous-time zero-sum games with completely unknown dynamics,"IEEE Trans. Autom. Sci. Eng., vol. 11, no. 3, pp. 706-714, Jul. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.3 , pp. 706-714
    • Li, H.1    Liu, D.2    Wang, D.3
  • 51
    • 84887035183 scopus 로고    scopus 로고
    • Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints
    • Nov
    • X. Yang, D. Liu, and Y. Huang, "Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints," IET Control Theory Appl., vol. 7, no. 17, pp. 2037-2047, Nov. 2013.
    • (2013) IET Control Theory Appl , vol.7 , Issue.17 , pp. 2037-2047
    • Yang, X.1    Liu, D.2    Huang, Y.3
  • 52
    • 79551685808 scopus 로고    scopus 로고
    • Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
    • Feb
    • F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 41, no. 1, pp. 14-25, Feb. 2011.
    • (2011) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.41 , Issue.1 , pp. 14-25
    • Lewis, F.L.1    Vamvoudakis, K.G.2
  • 53
    • 84895922559 scopus 로고    scopus 로고
    • Distributed robust consensus control of multi-agent systems with heterogeneous matching uncertainties
    • Mar
    • Z. Li, Z. Duan, and F. L. Lewis, "Distributed robust consensus control of multi-agent systems with heterogeneous matching uncertainties,"Automatica, vol. 50, no. 3, pp. 883-889, Mar. 2014.
    • (2014) Automatica , vol.50 , Issue.3 , pp. 883-889
    • Li, Z.1    Duan, Z.2    Lewis, F.L.3
  • 54
    • 84893708995 scopus 로고    scopus 로고
    • Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
    • Jan
    • H. Modares, F. L. Lewis, and M.-B. Naghibi-Sistani, "Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems,"Automatica, vol. 50, no. 1, pp. 193-202, Jan. 2014.
    • (2014) Automatica , vol.50 , Issue.1 , pp. 193-202
    • Modares, H.1    Lewis, F.L.2    Naghibi-Sistani, M.-B.3
  • 55
    • 84862846667 scopus 로고    scopus 로고
    • Adaptive cooperative tracking control of higher-order nonlinear systems with unknown dynamics
    • Jul
    • H. Zhang and F. L. Lewis, "Adaptive cooperative tracking control of higher-order nonlinear systems with unknown dynamics," Automatica, vol. 48, no. 7, pp. 1432-1439, Jul. 2012.
    • (2012) Automatica , vol.48 , Issue.7 , pp. 1432-1439
    • Zhang, H.1    Lewis, F.L.2
  • 56
    • 84906778934 scopus 로고    scopus 로고
    • Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification
    • Oct
    • Q. Wei and D. Liu, "Adaptive dynamic programming for optimal tracking control of unknown nonlinear systems with application to coal gasification," IEEE Trans. Autom. Sci. Eng., vol. 11, no. 4, pp. 1020-1036, Oct. 2014.
    • (2014) IEEE Trans. Autom. Sci. Eng. , vol.11 , Issue.4 , pp. 1020-1036
    • Wei, Q.1    Liu, D.2
  • 57
    • 84856629003 scopus 로고    scopus 로고
    • Neural dynamics of affect, gist, probability, and choice
    • May/Jun
    • D. S. Levine, "Neural dynamics of affect, gist, probability, and choice,"Cognit. Syst. Res., vols. 15-16, pp. 57-72, May/Jun. 2012.
    • (2012) Cognit. Syst. Res. , vol.15-16 , pp. 57-72
    • Levine, D.S.1
  • 59
    • 0038721679 scopus 로고    scopus 로고
    • A generalized feedforward neural network architecture for classification and regression
    • G. Arulampalam and A. Bouzerdoum, "A generalized feedforward neural network architecture for classification and regression," Neural Netw., vol. 16, nos. 5-6, pp. 561-568, 2003.
    • (2003) Neural Netw , vol.16 , Issue.5-6 , pp. 561-568
    • Arulampalam, G.1    Bouzerdoum, A.2
  • 60
    • 0033714974 scopus 로고    scopus 로고
    • Classification and function approximation using feed-forward shunting inhibitory artificial neural networks
    • A. Bouzerdoum, "Classification and function approximation using feed-forward shunting inhibitory artificial neural networks," in Proc. IEEE-INNS-ENNS Int. Joint Conf. Neural Netw., vol. 6. 2000, pp. 613-618.
    • (2000) Proc. IEEE-INNS-ENNS Int. Joint Conf. Neural Netw , vol.6 , pp. 613-618
    • Bouzerdoum, A.1
  • 61
    • 19344373705 scopus 로고    scopus 로고
    • Efficient training algorithms for a class of shunting inhibitory convolutional neural networks
    • May
    • F. H. C. Tivive and A. Bouzerdoum, "Efficient training algorithms for a class of shunting inhibitory convolutional neural networks," IEEE Trans. Neural Netw., vol. 16, no. 3, pp. 541-556, May 2005.
    • (2005) IEEE Trans. Neural Netw. , vol.16 , Issue.3 , pp. 541-556
    • Tivive, F.H.C.1    Bouzerdoum, A.2
  • 62
    • 0032631308 scopus 로고    scopus 로고
    • Neural network output feedback control of robot manipulators
    • Apr
    • Y. H. Kim and F. L. Lewis, "Neural network output feedback control of robot manipulators," IEEE Trans. Robot. Autom., vol. 15, no. 2, pp. 301-309, Apr. 1999.
    • (1999) IEEE Trans. Robot. Autom. , vol.15 , Issue.2 , pp. 301-309
    • Kim, Y.H.1    Lewis, F.L.2
  • 63
    • 0004178386 scopus 로고    scopus 로고
    • Englewood Cliffs, NJ, USA: Prentice-Hall
    • H. K. Khalil, Nonlinear Systems. Englewood Cliffs, NJ, USA: Prentice-Hall, 2002.
    • (2002) Nonlinear Systems
    • Khalil, H.K.1
  • 65
    • 0001307541 scopus 로고
    • Degree of approximation results for feedforward networks approximating unknown mappings and their derivatives
    • Nov
    • K. Hornik, M. Stinchcombe, H. White, and P. Auer, "Degree of approximation results for feedforward networks approximating unknown mappings and their derivatives," Neural Comput., vol. 6, no. 6, pp. 1262-1275, Nov. 1994.
    • (1994) Neural Comput , vol.6 , Issue.6 , pp. 1262-1275
    • Hornik, K.1    Stinchcombe, M.2    White, H.3    Auer, P.4
  • 66
    • 84879521499 scopus 로고    scopus 로고
    • Trajectory planning and optimized adaptive control for a class of wheeled inverted pendulum vehicle models
    • Feb
    • C. Yang, Z. Li, and J. Li, "Trajectory planning and optimized adaptive control for a class of wheeled inverted pendulum vehicle models," IEEE Trans. Cybern., vol. 43, no. 1, pp. 24-36, Feb. 2013.
    • (2013) IEEE Trans. Cybern. , vol.43 , Issue.1 , pp. 24-36
    • Yang, C.1    Li, Z.2    Li, J.3
  • 67
    • 84908108987 scopus 로고    scopus 로고
    • Neural network-based motion control of an underactuated wheeled inverted pendulum model
    • Nov
    • C. Yang, Z. Li, R. Cui, and B. Xu, "Neural network-based motion control of an underactuated wheeled inverted pendulum model," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 1, pp. 2004-2016, Nov. 2014.
    • (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.1 , pp. 2004-2016
    • Yang, C.1    Li, Z.2    Cui, R.3    Xu, B.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.