SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Conference on Decision and Control

Volumn , Issue , 2013, Pages 3845-3850

Optimal tracking control for linear discrete-time systems using reinforcement learning

(4) Kiumarsi Khomartash, Bahare a Lewis, Frank L b Naghibi Sistani, Mohammad Bagher a Karimpour, Ali a

a FERDOWSI UNIVERSITY OF MASHHAD (Iran)

b UNIVERSITY OF TEXAS AT ARLINGTON (United States)

Author keywords

Algebraic riccati equation; Linear quadratic tracker; Policy iteration; Reinforcement learning

Indexed keywords

ALGEBRA; DIGITAL CONTROL SYSTEMS; DISCRETE TIME CONTROL SYSTEMS; ITERATIVE METHODS; NAVIGATION; NUMBER THEORY; REINFORCEMENT LEARNING; RICCATI EQUATIONS;

ALGEBRAIC RICCATI EQUATIONS; LINEAR DISCRETE-TIME SYSTEMS; LINEAR QUADRATIC TRACKERS; OPTIMAL CONTROL SOLUTION; OPTIMAL TRACKING CONTROL; POLICY ITERATION; REFERENCE TRAJECTORIES; SIMULATION EXAMPLE;

LEARNING ALGORITHMS;

EID: 84902308118 PISSN: 07431546 EISSN: 25762370 Source Type: Conference Proceeding
DOI: 10.1109/CDC.2013.6760476 Document Type: Conference Paper

Times cited : (30)

References (18)

1
- 0004163205
- New York: John Wiley
- F. L. Lewis, D. Vrabie, and V. Syrmos, Optimal Control. New York: John Wiley, 2012.
- (2012) Optimal Control
- Lewis, F.L.¹ Vrabie, D.² Syrmos, V.³

2
- 47349092417
- John Wiley
- W. B. Powell, Approximate Dynamic Programming: Solving the Curses of Dimensionality. John Wiley, 2009.
- (2009) Approximate Dynamic Programming: Solving the Curses of Dimensionality
- Powell, W.B.¹

3
- 0004102479
- Cambridge, MT: MIT Press
- R. S. Sutton, and A. G. Barto, Reinforcement learning-an introduction. Cambridge, MT: MIT Press, 1998.
- (1998) Reinforcement Learning-An Introduction
- Sutton, R.S.¹ Barto, A.G.²

4
- 84921399937
- Wiley
- J. Si. A. Barto, W. Powell, and D. Wunch, Handbook of Learning and Approximate Dynamic Programming. Wiley, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Barto, J.Si.A.¹ Powell, W.² Wunch, D.³

5
- 0003487482
- MA: Athena Scientific
- D. P. Bertsekas, and J. N. Tsitsiklis, Neuro-dynamic programming. MA: Athena Scientific, 1996.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

6
- 33846781129
- Model-free Qlearning designs for linear discrete-time zero-sum games with application to Hifiit control
- Mar
- A. AI-Tamimi, F. L. Lewis, and M. Abu-If, "Model-free Qlearning designs for linear discrete-time zero-sum games with application to Hifiit control, " Automatica, vol. 43, no. 3, pp. 473-481, Mar. 2007.
- (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
- Ai-Tamimi, A.¹ Lewis, F.L.² Abu-If, M.³

7
- 34047138362
- Ei force et Ierieur Inetwork based controller for nonlinear discrete-time systems with input co trit
- Apr
- P. E, d . J, "Ei force et Ierieur Inetwork based controller for nonlinear discrete-time systems with input co trit, " IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics, vol. 37, no. 2, pp. 425-436, Apr. 2007.
- (2007) IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics , vol.37 , Issue.2 , pp. 425-436
- Eu, P.¹ Ju, D.²

8
- 84902324584
- Eer lized ilto-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete tiete
- Jan
- Z. C Eu, d . J tu " Eer lized ilto-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete tiete, " IEEE Trans. Neural Networks, vol. 19, no. 1, pp.90-106, Jan. 2008.
- (2008) IEEE Trans. Neural Networks , vol.19 , Issue.1 , pp. 90-106
- Eu, Z.C.¹ Tu, D.J.²

9
- 84902338593
- NeurIetwor for control dte identification
- Dec
- P. J. Erbo, "NeurIetwor for control dte identification, " Proc. 28th IEEE Conf. Decision and Control, pp. 260-265, Dec, 1989.
- (1989) Proc. 28th IEEE Conf. Decision and Control , pp. 260-265
- Erbo, P.J.¹

10
- 0002011091
- A menu of designs for reinforcement learning over time
- W. T. Miller, R.S. Sutton, & P. J. Werbos. (Eds.). Cambridge, MA: MIT Press
- P. J. Werbos, A menu of designs for reinforcement learning over time. In Neural Networks for Control. W. T. Miller, R.S. Sutton, & P. J. Werbos. (Eds.). Cambridge, MA: MIT Press, pp. 67-95, 1991.
- (1991) Neural Networks for Control , pp. 67-95
- Werbos, P.J.¹

11
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- D. A. White, & D. A. Sofge (Eds.), New York: Van Nostrand Reinhold
- P. J. Werbos, Approximate dynamic programming for real-time control and neural modeling. In D. A. White, & D. A. Sofge (Eds.), Handbook of intelligent control. New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

12
- 70349116541
- Ei force etIerid dtive dicrori for feedbc control
- Sep
- F. L. Lewid, D. Vrbie, " Ei force etIerid dtive dicrori for feedbc control, " IEEE Circuit Syst. Mag., vol. 9, no. 3, pp. 32-50, Sep, 2009.
- (2009) IEEE Circuit Syst. Mag. , vol.9 , Issue.3 , pp. 32-50
- Lewid, F.L.¹ Vrbie, D.²

13
- 70349253929
- NeurIetwor-based nearoptimal control for a class of discrete-time affine nonlinear systems wit control cotrit
- Sep
- H. G. Zhang, Y. H. Luo, and D. Liu, "NeurIetwor-based nearoptimal control for a class of discrete-time affine nonlinear systems wit control cotrit, " IEEE Trans. Neural Networks, vol. 20, no. 9, pp. 1490-1503, Sep, 2009.
- (2009) IEEE Trans. Neural Networks , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.G.¹ Luo, Y.H.² Liu, D.³

14
- 79551685808
- Reinforcement learning or partially observable dynamic processes: Adaptive dynamic programming using measured output data
- Feb
- F. L. Lewis, and K. Vamvoudakis, "Reinforcement learning or partially observable dynamic processes: Adaptive dynamic programming using measured output data, " IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics, vol. 41, no. 1, pp. 14-23, Feb, 2011.
- (2011) IEEE Trans. Systems, Man, and Cybernetics-Part B: Cybernetics , vol.41 , Issue.1 , pp. 14-23
- Lewis, F.L.¹ Vamvoudakis, K.²

15
- 0015109409
- A itertive tecique for tecouttio of steady ttei for tedicretetiIe ultor
- Aug
- A. Ewer, "A itertive tecique for tecouttio of steady ttei for tedicretetiIe ultor, " IEEE Trans. Automatic Control, vol. 16, no. 4, pp. 382-384, Aug, 1971.
- (1971) IEEE Trans. Automatic Control , vol.16 , Issue.4 , pp. 382-384
- Ewer, A.¹

16
- 84883537695
- Ei force et learning and feedback control using natural decision methods to design tiIdtive controller
- Nov
- F. L. Lewi, D. Vrbied Vvoudi, " Ei force et learning and feedback control using natural decision methods to design tiIdtive controller, " IEEE Syst. Mag., vol. 32, no. 6, pp.76-105, Nov, 2012.
- (2012) IEEE Syst. Mag. , vol.32 , Issue.6 , pp. 76-105
- Lewi, F.L.¹ Vvoudi, D.V.²

17
- 84902345227
- Uiver Iroitio of unknown mapping and its derivatives using multilayer feedforward etwor
- Moridite, "Uiver Iroitio of unknown mapping and its derivatives using multilayer feedforward etwor, " Neur I Networ, vol., . 551-560, 1990.
- (1990) Neur I Networ , pp. 551-560
- Moridite¹

18
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- Aug
- A. AI-Tamimi, F. L. Lewis, and M. Abu-If, "Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof, " IEEE Trans. Systems, Man, and Cybernetics Part B: Cybernetics, vol. 38, no. 4, pp. 943-949, Aug, 2008.
- (2008) IEEE Trans. Systems, Man, and Cybernetics Part B: Cybernetics , vol.38 , Issue.4 , pp. 943-949
- Ai-Tamimi, A.¹ Lewis, F.L.² If M.A.-³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.