메뉴 건너뛰기




Volumn 25, Issue 3, 2014, Pages 635-641

Reinforcement learning output feedback NN control using deterministic learning technique

Author keywords

Approximate dynamic programming; discrete time system; output feedback control; pure feedback system; radial basis function neural network (RBF NN)

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; DIGITAL CONTROL SYSTEMS; DISCRETE TIME CONTROL SYSTEMS; DYNAMIC PROGRAMMING; FEEDBACK CONTROL; LEARNING ALGORITHMS; RADIAL BASIS FUNCTION NETWORKS; REINFORCEMENT LEARNING;

EID: 84897663275     PISSN: 2162237X     EISSN: 21622388     Source Type: Journal    
DOI: 10.1109/TNNLS.2013.2292704     Document Type: Article
Times cited : (243)

References (26)
  • 1
    • 39649124691 scopus 로고    scopus 로고
    • Decentralized output-feedback neural control for systems with unknown interconnections
    • DOI 10.1109/TSMCB.2007.904544
    • W. Chen and J. Li, "Decentralized output-feedback neural control for systems with unknown interconnections," IEEE Trans. Syst., Man, Cybern., B, Cybern., vol. 38, no. 1, pp. 258-266, Feb. 2008. (Pubitemid 351285029)
    • (2008) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.38 , Issue.1 , pp. 258-266
    • Chen, W.1    Li, J.2
  • 2
    • 76749143718 scopus 로고    scopus 로고
    • Adaptive tracking for periodically time-varying and nonlinearly parameterized systems using multilayer neural networks
    • Feb.
    • W. Chen and L. Jiao, "Adaptive tracking for periodically time-varying and nonlinearly parameterized systems using multilayer neural networks," IEEE Trans. Neural Netw., vol. 21, no. 2, pp. 345-351, Feb. 2010.
    • (2010) IEEE Trans. Neural Netw. , vol.21 , Issue.2 , pp. 345-351
    • Chen, W.1    Jiao, L.2
  • 3
    • 77951892250 scopus 로고    scopus 로고
    • Robust adaptive neural network control for a class of uncertain MIMO nonlinear systems with input nonlinearities
    • May
    • M. Chen, S. Ge, and B. How, "Robust adaptive neural network control for a class of uncertain MIMO nonlinear systems with input nonlinearities," IEEE Trans. Neural Netw., vol. 21, no. 5, pp. 796-812, May 2010.
    • (2010) IEEE Trans. Neural Netw. , vol.21 , Issue.5 , pp. 796-812
    • Chen, M.1    Ge, S.2    How, B.3
  • 4
    • 0031206774 scopus 로고    scopus 로고
    • Stable inversion for nonlinear systems
    • PII S0005109897000642
    • L. Hunt and G. Meyer, "Stable inversion for nonlinear systems," Automatica, vol. 33, no. 8, pp. 1549-1554, Aug. 1997. (Pubitemid 127392286)
    • (1997) Automatica , vol.33 , Issue.8 , pp. 1549-1554
    • Hunt, L.R.1    Meyer, G.2
  • 6
    • 33645139787 scopus 로고    scopus 로고
    • An ISS-modular approach for adaptive neural control of pure-feedback systems
    • May
    • C. Wang, D. Hill, S. Ge, and G. Chen, "An ISS-modular approach for adaptive neural control of pure-feedback systems," Automatica, vol. 42, no. 5, pp. 723-731, May 2006.
    • (2006) Automatica , vol.42 , Issue.5 , pp. 723-731
    • Wang, C.1    Hill, D.2    Ge, S.3    Chen, G.4
  • 8
    • 52149102489 scopus 로고    scopus 로고
    • Adaptive predictive control using neural network for a class of pure-feedback systems in discrete time
    • Sep.
    • S. Ge, C. Yang, and T. Lee, "Adaptive predictive control using neural network for a class of pure-feedback systems in discrete time," IEEE Trans. Neural Netw., vol. 19, no. 9, pp. 1599-1614, Sep. 2008.
    • (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.9 , pp. 1599-1614
    • Ge, S.1    Yang, C.2    Lee, T.3
  • 9
    • 56449111811 scopus 로고    scopus 로고
    • Output feedback NN control for two classes of discrete-time systems with unknown control directions in a unified approach
    • Nov.
    • C. Yang, S. Ge, C. Xiang, T. Chai, and T. Lee, "Output feedback NN control for two classes of discrete-time systems with unknown control directions in a unified approach," IEEE Trans. Neural Netw., vol. 19, no. 11, pp. 1873-1886, Nov. 2008.
    • (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.11 , pp. 1873-1886
    • Yang, C.1    Ge, S.2    Xiang, C.3    Chai, T.4    Lee, T.5
  • 10
    • 77955324515 scopus 로고    scopus 로고
    • Direct adaptive NN control for a class of discrete-time nonlinear strict-feedback systems
    • Y. Liu, G. Wen, and S. Tong, "Direct adaptive NN control for a class of discrete-time nonlinear strict-feedback systems," Neurocomputing, vol. 73, nos. 13-15, pp. 2498-2505, 2010.
    • (2010) Neurocomputing , vol.73 , Issue.13-15 , pp. 2498-2505
    • Liu, Y.1    Wen, G.2    Tong, S.3
  • 11
    • 79960150696 scopus 로고    scopus 로고
    • Adaptive neural output feedback tracking control for a class of uncertain discrete-time nonlinear systems
    • Jul.
    • Y. Liu, C. Chen, G. Wen, and S. Tong, "Adaptive neural output feedback tracking control for a class of uncertain discrete-time nonlinear systems," IEEE Trans. Neural Netw., vol. 22, no. 7, pp. 1162-1167, Jul. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.7 , pp. 1162-1167
    • Liu, Y.1    Chen, C.2    Wen, G.3    Tong, S.4
  • 12
    • 26844483839 scopus 로고    scopus 로고
    • A self-learning call admission control scheme for CDMA cellular networks
    • DOI 10.1109/TNN.2005.853408
    • D. Liu, Y. Zhang, and H. Zhang, "A self-learning call admission control scheme for cdma cellular networks," IEEE Trans. Neural Netw., vol. 16, no. 5, pp. 1219-1228, Sep. 2005. (Pubitemid 41444623)
    • (2005) IEEE Transactions on Neural Networks , vol.16 , Issue.5 , pp. 1219-1228
    • Liu, D.1    Zhang, Y.2    Zhang, H.3
  • 13
    • 84881555023 scopus 로고    scopus 로고
    • Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
    • Apr.
    • D. Liu and Q. Wei, "Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 43, no. 2, pp. 779-789, Apr. 2013.
    • (2013) IEEE Trans. Cybern. , vol.43 , Issue.2 , pp. 779-789
    • Liu, D.1    Wei, Q.2
  • 14
    • 58349110975 scopus 로고    scopus 로고
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.4
  • 15
    • 67349145396 scopus 로고    scopus 로고
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • D. Vrabie and F. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, 2009.
    • (2009) Neural Netw. , vol.22 , Issue.3 , pp. 237-246
    • Vrabie, D.1    Lewis, F.2
  • 16
    • 84863467146 scopus 로고    scopus 로고
    • Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
    • Jul.
    • D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng. , vol.9 , Issue.3 , pp. 628-634
    • Liu, D.1    Wang, D.2    Zhao, D.3    Wei, Q.4    Jin, N.5
  • 17
    • 84859001250 scopus 로고    scopus 로고
    • Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators
    • Apr.
    • Q. Yang and S. Jagannathan, "Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators," IEEE Trans. Syst., Man, Cybern., B: Cybern., vol. 42, no. 2, pp. 377-390, Apr. 2012.
    • (2012) IEEE Trans. Syst., Man, Cybern., B: Cybern. , vol.42 , Issue.2 , pp. 377-390
    • Yang, Q.1    Jagannathan, S.2
  • 18
    • 34047138362 scopus 로고    scopus 로고
    • Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
    • DOI 10.1109/TSMCB.2006.883869, Special Issue on Robot Learning by Observation, Demonstration and Imitation
    • P. He and S. Jagannathan, "Reinforcement learning neural-networkbased controller for nonlinear discrete-time systems with input constraints," IEEE Trans. Syst., Man, Cybern., B: Cybern., vol. 37, no. 2, pp. 425-436, Apr. 2007. (Pubitemid 46523230)
    • (2007) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.37 , Issue.2 , pp. 425-436
    • He, P.1    Jagannathan, S.2
  • 19
    • 49049091364 scopus 로고    scopus 로고
    • Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks
    • Aug.
    • Q. Yang, J. Vance, and S. Jagannathan, "Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks," IEEE Trans. Syst., Man, Cybern., B, Cybern., vol. 38, no. 4, pp. 994-1001, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern., B, Cybern. , vol.38 , Issue.4 , pp. 994-1001
    • Yang, Q.1    Vance, J.2    Jagannathan, S.3
  • 21
    • 0027698750 scopus 로고
    • A perceptron network for functional identification and control of nonlinear systems
    • Nov.
    • N. Sadegh, "A perceptron network for functional identification and control of nonlinear systems," IEEE Trans. Neural Netw., vol. 4, no. 6, pp. 982-988, Nov. 1993.
    • (1993) IEEE Trans. Neural Netw. , vol.4 , Issue.6 , pp. 982-988
    • Sadegh, N.1
  • 22
    • 33144454972 scopus 로고    scopus 로고
    • Learning from neural control
    • DOI 10.1109/TNN.2005.860843
    • C. Wang and D. Hill, "Learning from neural control," IEEE Trans. Neural Netw., vol. 17, no. 1, pp. 130-146, Jan. 2006. (Pubitemid 43263946)
    • (2006) IEEE Transactions on Neural Networks , vol.17 , Issue.1 , pp. 130-146
    • Wang, C.1    Hill, D.J.2
  • 23
    • 34248644355 scopus 로고    scopus 로고
    • Deterministic learning and rapid dynamical pattern recognition
    • DOI 10.1109/TNN.2006.889496
    • C. Wang and D. Hill, "Deterministic learning and rapid dynamical pattern recognition," IEEE Trans. Neural Netw., vol. 18, no. 3, pp. 617-630, May 2007. (Pubitemid 46773732)
    • (2007) IEEE Transactions on Neural Networks , vol.18 , Issue.3 , pp. 617-630
    • Wang, C.1    Hill, D.J.2
  • 24
    • 84867972978 scopus 로고    scopus 로고
    • Identification and learning control of ocean surface ship using neural networks
    • Nov.
    • S. Dai, C. Wang, and F. Luo, "Identification and learning control of ocean surface ship using neural networks," IEEE Trans. Ind. Informat., vol. 8, no. 4, pp. 801-810, Nov. 2012.
    • (2012) IEEE Trans. Ind. Informat. , vol.8 , Issue.4 , pp. 801-810
    • Dai, S.1    Wang, C.2    Luo, F.3
  • 25
    • 77950822350 scopus 로고    scopus 로고
    • Learning from neural control for a class of discrete-time nonlinear systems
    • Dec.
    • T. Chen and C. Wang, "Learning from neural control for a class of discrete-time nonlinear systems," in Proc. 48th IEEE CDC/CCC, Dec. 2009, pp. 6732-6737.
    • (2009) Proc. 48th IEEE CDC/CCC , pp. 6732-6737
    • Chen, T.1    Wang, C.2
  • 26
    • 70349615619 scopus 로고    scopus 로고
    • Direct heuristic dynamic programming for nonlinear tracking control with filtered tracking error
    • Dec.
    • L. Yang, J. Si, K. S. Tsakalis, and A. A. Rodriguez, "Direct heuristic dynamic programming for nonlinear tracking control with filtered tracking error," IEEE Trans. Syst., Man, Cybern., B, Cybern., vol. 39, no. 6, pp. 1617-1622, Dec. 2009.
    • (2009) IEEE Trans. Syst., Man, Cybern., B, Cybern. , vol.39 , Issue.6 , pp. 1617-1622
    • Yang, L.1    Si, J.2    Tsakalis, K.S.3    Rodriguez, A.A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.