메뉴 건너뛰기




Volumn , Issue , 2013, Pages 3845-3850

Optimal tracking control for linear discrete-time systems using reinforcement learning

Author keywords

Algebraic riccati equation; Linear quadratic tracker; Policy iteration; Reinforcement learning

Indexed keywords

ALGEBRA; DIGITAL CONTROL SYSTEMS; DISCRETE TIME CONTROL SYSTEMS; ITERATIVE METHODS; NAVIGATION; NUMBER THEORY; REINFORCEMENT LEARNING; RICCATI EQUATIONS;

EID: 84902308118     PISSN: 07431546     EISSN: 25762370     Source Type: Conference Proceeding    
DOI: 10.1109/CDC.2013.6760476     Document Type: Conference Paper
Times cited : (30)

References (18)
  • 6
    • 33846781129 scopus 로고    scopus 로고
    • Model-free Qlearning designs for linear discrete-time zero-sum games with application to Hifiit control
    • Mar
    • A. AI-Tamimi, F. L. Lewis, and M. Abu-If, "Model-free Qlearning designs for linear discrete-time zero-sum games with application to Hifiit control, " Automatica, vol. 43, no. 3, pp. 473-481, Mar. 2007.
    • (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
    • Ai-Tamimi, A.1    Lewis, F.L.2    Abu-If, M.3
  • 7
    • 34047138362 scopus 로고    scopus 로고
    • Ei force et Ierieur Inetwork based controller for nonlinear discrete-time systems with input co trit
    • Apr
    • P. E, d . J, "Ei force et Ierieur Inetwork based controller for nonlinear discrete-time systems with input co trit, " IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics, vol. 37, no. 2, pp. 425-436, Apr. 2007.
    • (2007) IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics , vol.37 , Issue.2 , pp. 425-436
    • Eu, P.1    Ju, D.2
  • 8
    • 84902324584 scopus 로고    scopus 로고
    • Eer lized ilto-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete tiete
    • Jan
    • Z. C Eu, d . J tu " Eer lized ilto-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete tiete, " IEEE Trans. Neural Networks, vol. 19, no. 1, pp.90-106, Jan. 2008.
    • (2008) IEEE Trans. Neural Networks , vol.19 , Issue.1 , pp. 90-106
    • Eu, Z.C.1    Tu, D.J.2
  • 10
    • 0002011091 scopus 로고
    • A menu of designs for reinforcement learning over time
    • W. T. Miller, R.S. Sutton, & P. J. Werbos. (Eds.). Cambridge, MA: MIT Press
    • P. J. Werbos, A menu of designs for reinforcement learning over time. In Neural Networks for Control. W. T. Miller, R.S. Sutton, & P. J. Werbos. (Eds.). Cambridge, MA: MIT Press, pp. 67-95, 1991.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 11
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White, & D. A. Sofge (Eds.), New York: Van Nostrand Reinhold
    • P. J. Werbos, Approximate dynamic programming for real-time control and neural modeling. In D. A. White, & D. A. Sofge (Eds.), Handbook of intelligent control. New York: Van Nostrand Reinhold, 1992.
    • (1992) Handbook of Intelligent Control
    • Werbos, P.J.1
  • 12
    • 70349116541 scopus 로고    scopus 로고
    • Ei force etIerid dtive dicrori for feedbc control
    • Sep
    • F. L. Lewid, D. Vrbie, " Ei force etIerid dtive dicrori for feedbc control, " IEEE Circuit Syst. Mag., vol. 9, no. 3, pp. 32-50, Sep, 2009.
    • (2009) IEEE Circuit Syst. Mag. , vol.9 , Issue.3 , pp. 32-50
    • Lewid, F.L.1    Vrbie, D.2
  • 13
    • 70349253929 scopus 로고    scopus 로고
    • NeurIetwor-based nearoptimal control for a class of discrete-time affine nonlinear systems wit control cotrit
    • Sep
    • H. G. Zhang, Y. H. Luo, and D. Liu, "NeurIetwor-based nearoptimal control for a class of discrete-time affine nonlinear systems wit control cotrit, " IEEE Trans. Neural Networks, vol. 20, no. 9, pp. 1490-1503, Sep, 2009.
    • (2009) IEEE Trans. Neural Networks , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.G.1    Luo, Y.H.2    Liu, D.3
  • 14
    • 79551685808 scopus 로고    scopus 로고
    • Reinforcement learning or partially observable dynamic processes: Adaptive dynamic programming using measured output data
    • Feb
    • F. L. Lewis, and K. Vamvoudakis, "Reinforcement learning or partially observable dynamic processes: Adaptive dynamic programming using measured output data, " IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics, vol. 41, no. 1, pp. 14-23, Feb, 2011.
    • (2011) IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics , vol.41 , Issue.1 , pp. 14-23
    • Lewis, F.L.1    Vamvoudakis, K.2
  • 15
    • 0015109409 scopus 로고
    • A itertive tecique for tecouttio of steady ttei for tedicretetiIe ultor
    • Aug
    • A. Ewer, "A itertive tecique for tecouttio of steady ttei for tedicretetiIe ultor, " IEEE Trans. Automatic Control, vol. 16, no. 4, pp. 382-384, Aug, 1971.
    • (1971) IEEE Trans. Automatic Control , vol.16 , Issue.4 , pp. 382-384
    • Ewer, A.1
  • 16
    • 84883537695 scopus 로고    scopus 로고
    • Ei force et learning and feedback control using natural decision methods to design tiIdtive controller
    • Nov
    • F. L. Lewi, D. Vrbied Vvoudi, " Ei force et learning and feedback control using natural decision methods to design tiIdtive controller, " IEEE Syst. Mag., vol. 32, no. 6, pp.76-105, Nov, 2012.
    • (2012) IEEE Syst. Mag. , vol.32 , Issue.6 , pp. 76-105
    • Lewi, F.L.1    Vvoudi, D.V.2
  • 17
    • 84902345227 scopus 로고
    • Uiver Iroitio of unknown mapping and its derivatives using multilayer feedforward etwor
    • Moridite, "Uiver Iroitio of unknown mapping and its derivatives using multilayer feedforward etwor, " Neur I Networ, vol., . 551-560, 1990.
    • (1990) Neur I Networ , pp. 551-560
    • Moridite1
  • 18
    • 49049089962 scopus 로고    scopus 로고
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Aug
    • A. AI-Tamimi, F. L. Lewis, and M. Abu-If, "Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof, " IEEE Trans. Systems, Man, and Cybernetics Part B: Cybernetics, vol. 38, no. 4, pp. 943-949, Aug, 2008.
    • (2008) IEEE Trans. Systems, Man, and Cybernetics Part B: Cybernetics , vol.38 , Issue.4 , pp. 943-949
    • Ai-Tamimi, A.1    Lewis, F.L.2    If M.A.-3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.