Volume 59, Issue 11, 2014, Pages 3051-3056

Linear quadratic tracking control of partially-unknown continuous-time systems using reinforcement learning

Author keywords

Causal solution; integral reinforcement learning; linear quadratic tracking; policy iteration; reinforcement learning.

Indexed keywords

CAUSAL SOLUTIONS; LINEAR QUADRATIC TRACKING; POLICY ITERATION;

EID: 84908432682     PISSN: 00189286     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAC.2014.2317301     Document Type: Article
Times cited: 381

References (28)
  • 6
    • 0002011091
    • A menu of designs for reinforcement learning over time
    • W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1991, pp. 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 8
    • 66449130966
    • Adaptive dynamic programming: An introduction
    • F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Computational Intell. Mag., vol. 4, no. 2, pp. 39-47, May 2009.
    • (2009) IEEE Computational Intell. Mag. , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.Y.1    Zhang, H.2    Liu, D.3
  • 9
    • 0033629916
    • Reinforcement learning in continuous-time and space
    • K. Doya, "Reinforcement learning in continuous-time and space," Neural Computation, vol. 12, pp. 219-245, 2000.
    • (2000) Neural Computation , vol.12 , pp. 219-245
    • Doya, K.1
  • 10
    • 33845759425
    • Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation
    • M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation," IEEE Trans. Autom. Control, vol. 51, no. 12, pp. 1986-1995, Dec. 2006.
    • (2006) IEEE Trans. Autom. Control , vol.51 , Issue.12 , pp. 1986-1995
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 13
    • 79551685808
    • Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
    • F. L. Lewis and K. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man Cybern. B, vol. 41, no. 1, pp. 14-23, Feb. 2011.
    • (2011) IEEE Trans. Syst., Man Cybern. B , vol.41 , Issue.1 , pp. 14-23
    • Lewis, F.L.1    Vamvoudakis, K.2
  • 14
    • 58349110975
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, February 2009.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.L.4
  • 15
    • 84867400046
    • Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
    • J. Y. Lee, J. B. Park, and Y. H. Choi, "Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems," Automatica, vol. 48, no. 11, pp. 2850-2859, Nov. 2012.
    • (2012) Automatica , vol.48 , Issue.11 , pp. 2850-2859
    • Lee, J.Y.1    Park, J.B.2    Choi, Y.H.3
  • 16
    • 84865467087
    • Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
    • Y. Jiang and Z. P. Jiang, "Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics," Automatica, vol. 48, no. 10, pp. 2699-2704, October 2012.
    • (2012) Automatica , vol.48 , Issue.10 , pp. 2699-2704
    • Jiang, Y.1    Jiang, Z.P.2
  • 17
    • 49049119493
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man Cybern. B, vol. 38, no. 4, pp. 937-942, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man Cybern. B , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 19
    • 84880734826
    • Optimal tracking control scheme for discrete-time nonlinear systems with approximation errors
    • Q. Wei and D. Liu, "Optimal tracking control scheme for discrete-time nonlinear systems with approximation errors," Adv. Neural Networks-Lecture Notes Comp. Sci., vol. 7952, pp. 1-10, 2013.
    • (2013) Adv. Neural Networks-Lecture Notes Comp. Sci. , vol.7952 , pp. 1-10
    • Wei, Q.1    Liu, D.2
  • 20
    • 84880304409
    • Optimal tracking control for a class of nonlinear time-delay systems with actuator saturation
    • R. Song, W. Xiao, and Q. Wei, "Optimal tracking control for a class of nonlinear time-delay systems with actuator saturation," Adv. Brain Inspired Cognitive Syst.-Lecture Notes Comp. Sci., vol. 7888, pp. 208-215, 2013.
    • (2013) Adv. Brain Inspired Cognitive Syst.-Lecture Notes Comp. Sci. , vol.7888 , pp. 208-215
    • Song, R.1    Xiao, W.2    Wei, Q.3
  • 21
    • 84888030007
    • Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm
    • Y. Huang and D. Liu, "Neural-network-based optimal tracking control scheme for a class of unknown discrete-time nonlinear systems using iterative ADP algorithm," Neurocomputing, vol. 125, pp. 46-56, 2014.
    • (2014) Neurocomputing , vol.125 , pp. 46-56
    • Huang, Y.1    Liu, D.2
  • 22
    • 83655163786
    • Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
    • H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method," IEEE Trans. Neural Networks, vol. 22, no. 12, pp. 2226-2236, Dec. 2011.
    • (2011) IEEE Trans. Neural Networks , vol.22 , Issue.12 , pp. 2226-2236
    • Zhang, H.1    Cui, L.2    Zhang, X.3    Luo, Y.4
  • 23
    • 67349145396
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • D. Vrabie and F. L. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Networks, vol. 22, pp. 237-246, April 2009.
    • (2009) Neural Networks , vol.22 , pp. 237-246
    • Vrabie, D.1    Lewis, F.L.2
  • 24
    • 0347997537
    • On the infinite-horizon LQ tracker
    • E. Barbieri and R. Alba-Flores, "On the infinite-horizon LQ tracker," Syst. Control Lett., vol. 40, no. 2, pp. 77-82, Jun. 2000.
    • (2000) Syst. Control Lett. , vol.40 , Issue.2 , pp. 77-82
    • Barbieri, E.1    Alba-Flores, R.2
  • 25
    • 34548119564
    • Real-time infinite horizon linear-quadratic tracking controller for vibration quenching in flexible beams
    • E. Barbieri and R. Alba-Flores, "Real-time infinite horizon linear-quadratic tracking controller for vibration quenching in flexible beams," in Proc. IEEE Conf. Syst., Man, Cybern., Taipei, Taiwan, Oct. 8-11, 2006, pp. 38-43.
    • (2006) Proc. IEEE Conf. Syst., Man, Cybern., Taipei, Taiwan , pp. 38-43
    • Barbieri, E.1    Alba-Flores, R.2
  • 27
    • 84914965022
    • On an iterative technique for Riccati equation computations
    • D. L. Kleinman, "On an iterative technique for Riccati equation computations," IEEE Trans. Autom. Control, vol. 13, no. 1, pp. 114-115, Feb. 1968.
    • (1968) IEEE Trans. Autom. Control , vol.13 , Issue.1 , pp. 114-115
    • Kleinman, D.L.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.