SCOPUS Information Retrieval Platform

Volume 50, Issue 1, 2014, Pages 193-202

Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems

Authors (3): Modares, Hamidreza (a); Lewis, Frank L. (b); Naghibi Sistani, Mohammad Bagher (a)

a FERDOWSI UNIVERSITY OF MASHHAD (Iran)

b UNIVERSITY OF TEXAS AT ARLINGTON (United States)

Author keywords

Experience replay; Input constraints; Integral reinforcement learning; Neural networks; Optimal control

Indexed keywords

ADAPTIVE OPTIMAL CONTROL; EXPERIENCE REPLAY; FEEDBACK CONTROL LAW; HAMILTON JACOBI BELLMAN EQUATION; INPUT CONSTRAINTS; NEAR-OPTIMAL CONTROL; OPTIMAL CONTROLS; PERSISTENCE OF EXCITATION;

CONTROL; CONTROL THEORY; DYNAMIC PROGRAMMING; NEURAL NETWORKS; ONLINE SYSTEMS; REINFORCEMENT LEARNING;

OPTIMAL CONTROL SYSTEMS;

EID: 84893708995 PISSN: 00051098 EISSN: None Source Type: Journal
DOI: 10.1016/j.automatica.2013.09.043 Document Type: Article

Times cited : (474)

References (32)

1
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- M. Abu-Khalaf, and F.L. Lewis Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach Automatica 41 2005 779 791
- (2005) Automatica , vol.41 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

2
- 84857501996
- Experience replay for real-time reinforcement learning control
- S. Adam, L. Busoniu, and R. Babuska Experience replay for real-time reinforcement learning control IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 42 2012 201 212
- (2012) IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews , vol.42 , pp. 201-212
- Adam, S.¹ Busoniu, L.² Babuska, R.³

3
- 0003785722
- Ph.D. dissertation, Elec. Eng. Dep., Rensselaer Polytech. Ins., Troy, NY
- Beard, R.W. (1995). Improving the closed-loop performance of nonlinear systems. Ph.D. dissertation, Elec. Eng. Dep., Rensselaer Polytech. Ins., Troy, NY.
- (1995) Improving the Closed-loop Performance of Nonlinear Systems
- Beard, R.W.¹

4
- 0003487482
- Athena Scientific MA
- D.P. Bertsekas, and J.N. Tsitsiklis Neuro-dynamic programming 1996 Athena Scientific MA
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 84871319455
- A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
- S. Bhasin, R. Kamalapurkar, M. Johnson, K.G. Vamvoudakis, F.L. Lewis, and W.E. Dixon A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems Automatica 49 2012 82 92
- (2012) Automatica , vol.49 , pp. 82-92
- Bhasin, S.¹ Kamalapurkar, R.² Johnson, M.³ Vamvoudakis, K.G.⁴ Lewis, F.L.⁵ Dixon, W.E.⁶

6
- 79953141961
- PhD Thesis, Georgia Institute of Technology
- Chowdhary, G.V. (2010). Concurrent learning for convergence in adaptive control without persistency of excitation, PhD Thesis, Georgia Institute of Technology.
- (2010) Concurrent Learning for Convergence in Adaptive Control Without Persistency of Excitation
- Chowdhary, G.V.¹

7
- 84883670357
- Concurrent learning for convergence in adaptive control without
- Atlanta GA
- Chowdhary, G.V., & Johnson, E. (2010). Concurrent learning for convergence in adaptive control without. In IEEE CDC. Atlanta GA (pp. 3675-3679).
- (2010) IEEE CDC , pp. 3675-3679
- Chowdhary, G.V.¹ Johnson, E.²

8
- 0033629916
- Reinforcement learning in continuous time and space
- K. Doya Reinforcement learning in continuous time and space Neural Computation 12 2000 219 245
- (2000) Neural Computation , vol.12 , pp. 219-245
- Doya, K.¹

9
- 56749173285
- Efficient experience reuse in non-Markovian environments
- Tokyo, Japan
- Dung, L.T., Komeda, T., & Takagi, M. (2008). Efficient experience reuse in non-Markovian environments. In Proc. int. conf. instrum. control inf. technol. Tokyo, Japan (pp. 3327-3332).
- (2008) Proc. Int. Conf. Instrum. Control Inf. Technol , pp. 3327-3332
- Dung, L.T.¹ Komeda, T.² Takagi, M.³

10
- 0003917259
- Academic Press New York
- B.A. Finlayson The method of weighted residuals and variational principles 1990 Academic Press New York
- (1990) The Method of Weighted Residuals and Variational Principles
- Finlayson, B.A.¹

11
- 0003543733
- 2nd ed. Cambridge Univ. Press Cambridge, U.K.
- G. Hardy, J. Littlewood, and G. Polya Inequalities 2nd ed. 1989 Cambridge Univ. Press Cambridge, U.K.
- (1989) Inequalities
- Hardy, G.¹ Littlewood, J.² Polya, G.³

12
- 0004106918
- Prentice Hall New Jersey
- P. Ioannou, and J. Sun Robust adaptive control 1996 Prentice Hall New Jersey
- (1996) Robust Adaptive Control
- Ioannou, P.¹ Sun, J.²

13
- 60349130974
- Batch reinforcement learning in a complex domain
- Honolulu, HI
- Kalyanakrishnan, S., & Stone, P. (2007). Batch reinforcement learning in a complex domain. In Proc. 6th Int. Conf. Auton. Agents Multi-Agent Syst. Honolulu, HI (pp. 650-657).
- (2007) Proc. 6th Int. Conf. Auton. Agents Multi-Agent Syst. , pp. 650-657
- Kalyanakrishnan, S.¹ Stone, P.²

14
- 0004178386
- 3rd ed. Prentice Hall
- H.K. Khalil Nonlinear systems 3rd ed. 2002 Prentice Hall
- (2002) Nonlinear Systems
- Khalil, H.K.¹

15
- 0004025786
- Taylor & Francis
- F.L. Lewis, S. Jagannathan, and A. Yesildirek Neural network control of robot manipulators and nonlinear systems 1999 Taylor & Francis
- (1999) Neural Network Control of Robot Manipulators and Nonlinear Systems
- Lewis, F.L.¹ Jagannathan, S.² Yesildirek, A.³

16
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- F.L. Lewis, and D. Vrabie Reinforcement learning and adaptive dynamic programming for feedback control IEEE Circuits & Systems Magazine, Invited Feature Article 9 2009 32 50
- (2009) IEEE Circuits & Systems Magazine, Invited Feature Article , vol.9 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

17
- 84883537695
- Reinforcement learning and feedback control
- F.L. Lewis, D. Vrabie, and K.G. Vamvoudakis Reinforcement learning and feedback control IEEE Control Systems Magazine 2012
- (2012) IEEE Control Systems Magazine
- Lewis, F.L.¹ Vrabie, D.² Vamvoudakis, K.G.³

18
- 0004163205
- 3rd ed. Wiley
- F.L. Lewis, D. Vrabie, and V. Syrmos Optimal control 3rd ed. 2012 Wiley
- (2012) Optimal Control
- Lewis, F.L.¹ Vrabie, D.² Syrmos, V.³

19
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L.J. Lin Self-improving reactive agents based on reinforcement learning, planning and teaching Machine Learning 8 1992 293 321
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.J.¹

20
- 84881324637
- Optimal control of nonlinear continuous-time systems: Design of bounded controllers via generalized nonquadratic functionals
- Lyshevski, S.E. (1998). Optimal control of nonlinear continuous-time systems: design of bounded controllers via generalized nonquadratic functionals. In Proceedings of American control conference (pp. 205-209).
- (1998) Proceedings of American Control Conference , pp. 205-209
- Lyshevski, S.E.¹

21
- 84899093084
- Online solution of nonquadratic two-player zero-sum games arising in the H∞ control of constrained input systems
- 10.1002/acs.2348
- H. Modares, F.L. Lewis, and M.B. Naghibi-Sistani Online solution of nonquadratic two-player zero-sum games arising in the H∞ control of constrained input systems International Journal of Adaptive Control and Signal Processing 2012 10.1002/acs.2348
- (2012) International Journal of Adaptive Control and Signal Processing
- Modares, H.¹ Lewis, F.L.² Naghibi-Sistani, M.B.³

22
- 0036588686
- Adaptive dynamic programming
- J.J. Murray, C.J. Cox, G.G. Lendaris, and R. Saeks Adaptive dynamic programming IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews 32 2002 140 153
- (2002) IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews , vol.32 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

23
- 0027811338
- A robust adaptive nonlinear control design
- Polycarpou, M., & Ioannou, P. (1993). A robust adaptive nonlinear control design. In Proceedings of American control conference (pp. 1365-1369).
- (1993) Proceedings of American Control Conference , pp. 1365-1369
- Polycarpou, M.¹ Ioannou, P.²

24
- 47349092417
- Wiley-Interscience
- W.B. Powell Approximate dynamic programming: solving the curses of dimensionality 2007 Wiley-Interscience
- (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality
- Powell, W.B.¹

25
- 0004102479
- MIT Press Cambridge, MA
- R.S. Sutton, and A.G. Barto Reinforcement learning - an introduction 1998 MIT Press Cambridge, MA
- (1998) Reinforcement Learning - An Introduction
- Sutton, R.S.¹ Barto, A.G.²

26
- 77950630017
- Online actor-critic algorithm to solve the continuous infinite-time horizon optimal control problem
- K. Vamvoudakis, and F.L. Lewis Online actor-critic algorithm to solve the continuous infinite-time horizon optimal control problem Automatica 46 2010 878 888
- (2010) Automatica , vol.46 , pp. 878-888
- Vamvoudakis, K.¹ Lewis, F.L.²

27
- 84939468993
- Online adaptive algorithm for optimal control with integral reinforcement learning
- 10.1002/rnc.3018
- K.G. Vamvoudakis, D. Vrabie, and F.L. Lewis Online adaptive algorithm for optimal control with integral reinforcement learning International Journal of Robust and Nonlinear Control 2013 10.1002/rnc.3018
- (2013) International Journal of Robust and Nonlinear Control
- Vamvoudakis, K.G.¹ Vrabie, D.² Lewis, F.L.³

28
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F.L. Lewis Adaptive optimal control for continuous-time linear systems based on policy iteration Automatica 45 2009 477 484
- (2009) Automatica , vol.45 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

29
- 67349145396
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- D. Vrabie, and F.L. Lewis Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems Neural Networks 22 2009 237 246
- (2009) Neural Networks , vol.22 , pp. 237-246
- Vrabie, D.¹ Lewis, F.L.²

30
- 71749106087
- Real-time reinforcement learning by sequential actor-critics and experience replay
- P. Wawrzynski Real-time reinforcement learning by sequential actor-critics and experience replay Neural Networks 22 2009 1484 1497
- (2009) Neural Networks , vol.22 , pp. 1484-1497
- Wawrzynski, P.¹

31
- 0002031779
- Approximate dynamic programming for real time control and neural modeling
- D.A. White, D.A. Sofge, Multiscience Press
- P.J. Werbos Approximate dynamic programming for real time control and neural modeling D.A. White, D.A. Sofge, Handbook of intelligent control 1992 Multiscience Press
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

32
- 84862815087
- Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
- H. Xu, S. Jagannathan, and F.L. Lewis Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses Automatica 48 2012 1017 1030
- (2012) Automatica , vol.48 , pp. 1017-1030
- Xu, H.¹ Jagannathan, S.² Lewis, F.L.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.