Volume 37, Issue 2, 2007, Pages 425-436

Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints

Author keywords

Approximate dynamic programming; Neural network control; Optimal control; Reinforcement learning

Indexed keywords

ACTUATORS; COMPUTER SIMULATION; CONSTRAINT THEORY; CONTROL EQUIPMENT; DISCRETE TIME CONTROL SYSTEMS; ERRORS; NONLINEAR CONTROL SYSTEMS; OPTIMAL CONTROL SYSTEMS; REINFORCEMENT LEARNING; STATE FEEDBACK;

EID: 34047138362     PISSN: 10834419     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCB.2006.883869     Document Type: Article
Times cited: 214

References (25)
  • 2
    • 0002283578
    • "Reinforcement learning and adaptive critic methods"
    • A. G. Barto, "Reinforcement learning and adaptive critic methods," in Handbook of Intelligent Control, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand, 1992, pp. 65-90.
    • (1992) Handbook of Intelligent Control , pp. 65-90
    • Barto, A.G.1
  • 4
    • 0030192117
    • "Neural networks in nonlinear aircraft flight control"
    • A. J. Calise, "Neural networks in nonlinear aircraft flight control," IEEE Aerosp. Electron. Syst. Mag., vol. 11, no. 7, pp. 5-10, Jul. 1996.
    • (1996) IEEE Aerosp. Electron. Syst. Mag. , vol.11 , Issue.7 , pp. 5-10
    • Calise, A.J.1
  • 5
    • 0029308580
    • "Adaptive control of a class of nonlinear discrete-time systems using neural networks"
    • F. C. Chen and H. K. Khalil, "Adaptive control of a class of nonlinear discrete-time systems using neural networks," IEEE Trans. Autom. Control, vol. 40, no. 5, pp. 791-801, May 1995.
    • (1995) IEEE Trans. Autom. Control , vol.40 , Issue.5 , pp. 791-801
    • Chen, F.C.1    Khalil, H.K.2
  • 6
    • 0029403793
    • "Stochastic choice of basis functions in adaptive function approximation and the functional-link net"
    • B. Igelnik and Y. H. Pao, "Stochastic choice of basis functions in adaptive function approximation and the functional-link net," IEEE Trans. Neural Netw., vol. 6, no. 6, pp. 1320-1329, Nov. 1995.
    • (1995) IEEE Trans. Neural Netw. , vol.6 , Issue.6 , pp. 1320-1329
    • Igelnik, B.1    Pao, Y.H.2
  • 7
    • 0028542585
    • "A discrete-time adaptive nonlinear system"
    • I. Kanellakopoulos, "A discrete-time adaptive nonlinear system," IEEE Trans. Autom. Control, vol. 39, no. 11, pp. 2362-2365, Nov. 1994.
    • (1994) IEEE Trans. Autom. Control , vol.39 , Issue.11 , pp. 2362-2365
    • Kanellakopoulos, I.1
  • 8
    • 0028544296
    • "Adaptive control in the presence of input constraints"
    • S. P. Karason and A. M. Annaswamy, "Adaptive control in the presence of input constraints," IEEE Trans. Autom. Control, vol. 39, no. 11, pp. 2325-2330, Nov. 1994.
    • (1994) IEEE Trans. Autom. Control , vol.39 , Issue.11 , pp. 2325-2330
    • Karason, S.P.1    Annaswamy, A.M.2
  • 11
    • 0034548295
    • "Convergence analysis of adaptive critic based optimal control"
    • X. Lin and S. N. Balakrishnan, "Convergence analysis of adaptive critic based optimal control," in Proc. Amer. Control Conf., 2000, pp. 1929-1933.
    • (2000) Proc. Amer. Control Conf. , pp. 1929-1933
    • Lin, X.1    Balakrishnan, S.N.2
  • 14
    • 0025399567
    • "Identification and control of dynamical systems using neural networks"
    • K. S. Narendra and K. S. Parthasarathy, "Identification and control of dynamical systems using neural networks," IEEE Trans. Neural Netw., vol. 1, no. 1, pp. 4-27, Mar. 1990.
    • (1990) IEEE Trans. Neural Netw. , vol.1 , Issue.1 , pp. 4-27
    • Narendra, K.S.1    Parthasarathy, K.S.2
  • 15
    • 0032310053
    • "Analyzing for Lyapunov stability with adaptive critics"
    • D. V. Prokhorov and L. A. Feldkamp, "Analyzing for Lyapunov stability with adaptive critics," in Proc. IEEE Conf. Syst. Man and Cybern., San Diego, CA, 1998, pp. 1658-1661.
    • (1998) Proc. IEEE Conf. Syst. Man and Cybern. , pp. 1658-1661
    • Prokhorov, D.V.1    Feldkamp, L.A.2
  • 16
    • 0031236002
    • "Adaptive critic designs"
    • D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
    • (1997) IEEE Trans. Neural Netw. , vol.8 , Issue.5 , pp. 997-1007
    • Prokhorov, D.V.1    Wunsch, D.C.2
  • 17
    • 0024765392
    • "Adaptive control of linearizable systems"
    • S. S. Sastry and A. Isidori, "Adaptive control of linearizable systems," IEEE Trans. Autom. Control, vol. 34, no. 11, pp. 1123-1131, Nov. 1989.
    • (1989) IEEE Trans. Autom. Control , vol.34 , Issue.11 , pp. 1123-1131
    • Sastry, S.S.1    Isidori, A.2
  • 18
    • 0041376883
    • "Intelligent supply chain management using adaptive critic learning"
    • S. Shervais, T. T. Shannon, and G. G. Lendaris, "Intelligent supply chain management using adaptive critic learning," IEEE Trans. Syst., Man, Cybern., vol. 33, no. 2, pp. 235-244, Mar. 2003.
    • (2003) IEEE Trans. Syst., Man, Cybern. , vol.33 , Issue.2 , pp. 235-244
    • Shervais, S.1    Shannon, T.T.2    Lendaris, G.G.3
  • 20
    • 0035273403
    • "On-line learning control by association and reinforcement"
    • J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
    • (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 21
    • 0002437599
    • "Neurocontrol and supervised learning: An overview and evaluation"
    • P. J. Werbos, "Neurocontrol and supervised learning: An overview and evaluation," in Handbook of Intelligent Control, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand, 1992, pp. 65-90.
    • (1992) Handbook of Intelligent Control , pp. 65-90
    • Werbos, P.J.1
  • 22
    • 0002011091
    • "A menu of designs for reinforcement learning over time"
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1991, pp. 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 23
    • 84979715630
    • "Supervised actor-critic reinforcement learning"
    • M. T. Rosenstein and A. G. Barto, "Supervised actor-critic reinforcement learning," in Handbook of Learning and Approximate Dynamic Programming, J. Si et al. Eds. Piscataway, NJ: IEEE Press, 2004, pp. 359-380.
    • (2004) Handbook of Learning and Approximate Dynamic Programming , pp. 359-380
    • Rosenstein, M.T.1    Barto, A.G.2
  • 24
    • 13644265156
    • "Reinforcement-based neuro-output feedback control of discrete-time systems with input constraints"
    • P. He and S. Jagannathan, "Reinforcement-based neuro-output feedback control of discrete-time systems with input constraints," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 1, pp. 150-154, Feb. 2005.
    • (2005) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.35 , Issue.1 , pp. 150-154
    • He, P.1    Jagannathan, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.