SCOPUS 정보 검색 플랫폼

Automatica

Volumn 48, Issue 11, 2012, Pages 2850-2859

Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems

(3) Lee, Jae Young a Park, Jin Bae a Choi, Yoon Ho b

a Yonsei University (South Korea)

b Kyonggi University (South Korea)

Author keywords

Adaptive control; LQR; Optimization under uncertainties; Policy iteration; Q learning

Indexed keywords

ADAPTIVE CONTROL; LQR; OPTIMIZATION UNDER UNCERTAINTY; POLICY ITERATION; Q-LEARNING;

CONTINUOUS TIME SYSTEMS; INTEGRAL EQUATIONS; ITERATIVE METHODS; LINEAR CONTROL SYSTEMS; OPTIMIZATION; REINFORCEMENT LEARNING;

LEARNING ALGORITHMS;

EID: 84867400046 PISSN: 00051098 EISSN: None Source Type: Journal
DOI: 10.1016/j.automatica.2012.06.008 Document Type: Article

Times cited : (153)

References (32)

1
- 33846781129
- ∞ control
- ∞ control Automatica 43 3 2007 473 481
- (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

2
- 0028733775
- Reinforcement learning in continuous-time: Advantage updating
- Baird, L. C. III (1994). Reinforcement learning in continuous-time: advantage updating. In Proc. of ICNN. vol. 4 (pp. 2448-2453).
- (1994) Proc. of ICNN , vol.4 , pp. 2448-2453
- Baird Iii, L.C.¹

3
- 49049111594
- Issues on stability of ADP feedback controllers for dynamical systems
- S.N. Balakrishnan, D. Ding, and F.L. Lewis Issues on stability of ADP feedback controllers for dynamical systems IEEE Transactions on Systems, Man and Cybernetics, Part B 38 4 2008 913 917
- (2008) IEEE Transactions on Systems, Man and Cybernetics, Part B , vol.38 , Issue.4 , pp. 913-917
- Balakrishnan, S.N.¹ Ding, D.² Lewis, F.L.³

4
- 0003487482
- Athena Scientific Belmont, MA
- D.P. Bertsekas, and J.N. Tsitsiklis Neuro-dynamic programming 1996 Athena Scientific Belmont, MA
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 33645410501
- Dynamic programming and suboptimal control: A survey from ADP to MPC
- D.P. Bertsekas, and J.N. Tsitsiklis Dynamic programming and suboptimal control: a survey from ADP to MPC European Journal of Control 11 2005 310 334
- (2005) European Journal of Control , vol.11 , pp. 310-334
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

6
- 0028584964
- Adaptive linear quadratic control using policy iteration
- Bradtke, S. J.; & Ydstie, B. E. (1994). Adaptive linear quadratic control using policy iteration. In Proc. ACC (pp. 3475-3479).
- (1994) Proc. ACC , pp. 3475-3479
- Bradtke, S.J.¹ Ydstie, B.E.²

7
- 77950838809
- Adaptive approximately optimal control of unknown nonlinear systems based on locally weighted learning
- Dong, W.; & Farrell, J. A. (2009). Adaptive approximately optimal control of unknown nonlinear systems based on locally weighted learning. In Proc. CDC (pp. 345-350).
- (2009) Proc. CDC , pp. 345-350
- Dong, W.¹ Farrell, J.A.²

8
- 0033629916
- Reinforcement learning in continuous-time and space
- K. Doya Reinforcement learning in continuous-time and space Neural Computation 12 2000 219 245
- (2000) Neural Computation , vol.12 , pp. 219-245
- Doya, K.¹

9
- 0029679044
- Reinforcement learning: A survey
- L.P. Kaelbling, and A.W. Moore Reinforcement learning: a survey Journal of Artificial Intelligence Research 4 1996 237 285
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Moore, A.W.²

10
- 0004178386
- Prentice Hall
- H.K. Khalil Nonlinear systems 2002 Prentice Hall
- (2002) Nonlinear Systems
- Khalil, H.K.¹

11
- 84914965022
- On the iterative technique for Riccati equation computations
- D. Kleinman On the iterative technique for Riccati equation computations IEEE Transactions on Automatic Control 13 1 1968 114 115
- (1968) IEEE Transactions on Automatic Control , vol.13 , Issue.1 , pp. 114-115
- Kleinman, D.¹

12
- 0003429026
- Academic Press, Inc
- P. Kokotovic, H.H. Khalil, and J. O'Reilly Singular perturbation methods in control: analysis and design 1986 Academic Press, Inc.
- (1986) Singular Perturbation Methods in Control: Analysis and Design
- Kokotovic, P.¹ Khalil, H.H.² O'Reilly, J.³

13
- 0003754075
- Ph.D. dissertation. Sweden: Linkoping University
- Landelius, T. (1997). Reinforcement learning and distributed local model synthesis. Ph.D. dissertation. Sweden: Linkoping University.
- (1997) Reinforcement Learning and Distributed Local Model Synthesis
- Landelius, T.¹

14
- 4544319442
- Approximate dynamic programming strategies and their applicability for process control: A review and future directions
- J.M. Lee, and J.H. Lee Approximate dynamic programming strategies and their applicability for process control: a review and future directions International Journal of Control, Automation, and Systems (IJCAS) 2 3 2004 263 278
- (2004) International Journal of Control, Automation, and Systems (IJCAS) , vol.2 , Issue.3 , pp. 263-278
- Lee, J.M.¹ Lee, J.H.²

15
- 77950824225
- Model-free approximate dynamic programming for continuous-time linear systems
- Lee, J. Y.; Park, J. B.; & Choi, Y. H. (2009). Model-free approximate dynamic programming for continuous-time linear systems. In Proc. CDC (pp. 5009-5014).
- (2009) Proc. CDC , pp. 5009-5014
- Lee, J.Y.¹ Park, J.B.² Choi, Y.H.³

16
- 0004163205
- 2nd ed. Wiley New York
- F.L. Lewis, and V. Syrmos Optimal control 2nd ed. 1995 Wiley New York
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.²

17
- 79551685808
- Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
- F.L. Lewis, and K.G. Vamvoudakis Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data IEEE Transactions on Systems, Man and Cybernetics, Part B 41 1 2010 14 25
- (2010) IEEE Transactions on Systems, Man and Cybernetics, Part B , vol.41 , Issue.1 , pp. 14-25
- Lewis, F.L.¹ Vamvoudakis, K.G.²

18
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- F.L. Lewis, and D. Vrabie Reinforcement learning and adaptive dynamic programming for feedback control IEEE Circuits and Systems Magazine 9 3 2009 32 50
- (2009) IEEE Circuits and Systems Magazine , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

19
- 77950806766
- Q-learning and pontryagins minimum principle
- Mehta, P.; & Meyn, S. (2009). Q-learning and pontryagins minimum principle. In Proc. CDC (pp. 3598-3605).
- (2009) Proc. CDC , pp. 3598-3605
- Mehta, P.¹ Meyn, S.²

20
- 0036588686
- Adaptive dynamic programming
- J.J. Murray, C.J. Cox, G.G. Lendaris, and R. Saeks Adaptive dynamic programming IEEE Transactions on Systems, Man and Cybernetics, Part C 32 2 2002 140 153
- (2002) IEEE Transactions on Systems, Man and Cybernetics, Part C , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

21
- 47349092417
- Wiley-Interscience
- W.B. Powell Approximate dynamic programming: solving the curses of dimensionality 2007 Wiley-Interscience
- (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality
- Powell, W.B.¹

22
- 0031236002
- Adaptive critic designs
- D.V. Prokhorov, and D.C. Wunsch II Adaptive critic designs IEEE Transactions on Neural Networks 8 5 1997 997 1007
- (1997) IEEE Transactions on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, I.I.D.C.²

23
- 84921399937
- Wiley-IEEE Press
- J. Si, A.G. Barto, W.B. Powell, and D. Wunsch Handbook of learning and approximate dynamic programming 2004 Wiley-IEEE Press
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.⁴

24
- 0004102479
- MIT Press Cambridge, Massachusetts
- R.S. Sutton, and A.G. Barto Reinforcement learning - an introduction 1998 MIT Press Cambridge, Massachusetts
- (1998) Reinforcement Learning - An Introduction
- Sutton, R.S.¹ Barto, A.G.²

25
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- K.G. Vamvoudakis, and F.L. Lewis Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem Automatica 46 5 2010 878 888
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

26
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F.L. Lewis Adaptive optimal control for continuous-time linear systems based on policy iteration Automatica 45 2 2009 477 484
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

27
- 66449130966
- Adaptive dynamic programming: An introduction
- F.Y. Wang, H. Zhang, and D. Liu Adaptive dynamic programming: an introduction IEEE Computational Intelligence Magazine 4 3 2009 39 47
- (2009) IEEE Computational Intelligence Magazine , vol.4 , Issue.3 , pp. 39-47
- Wang, F.Y.¹ Zhang, H.² Liu, D.³

28
- 0004049893
- Ph.D. dissertation. Cambridge Univ.; Cambridge, UK
- Watkins, C. J. C. H.; & Dayan, P. (1989). Learning from delayed rewards. Ph.D. dissertation. Cambridge Univ.; Cambridge, UK.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹ Dayan, P.²

29
- 34249833101
- Q-learning
- C.J.C.H. Watkins, and P. Dayan Q-learning Machine Learning 8 1992 279 292
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

30
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- D.A. White, D.A. Sofge, Van Nostrand Reinhold New York
- P.J. Webos Approximate dynamic programming for real-time control and neural modeling D.A. White, D.A. Sofge, Handbook of intelligent control 1992 Van Nostrand Reinhold New York
- (1992) Handbook of Intelligent Control
- Webos, P.J.¹

31
- 14544289894
- A note on persistency of excitation
- J.C. Willems, P. Rapisarda, I. Markovsky, and B.L.M. Moor A note on persistency of excitation Systems & Control Letters 54 4 2005 325 329
- (2005) Systems & Control Letters , vol.54 , Issue.4 , pp. 325-329
- Willems, J.C.¹ Rapisarda, P.² Markovsky, I.³ Moor, B.L.M.⁴

32
- 67650505616
- Algorithm and stability of ATC receding horizon control
- Zhang, H.; Huang, J.; & Lewis, F. L. (2009). Algorithm and stability of ATC receding horizon control. In IEEE Symp. ADPRL (pp. 28-35).
- (2009) IEEE Symp. ADPRL , pp. 28-35
- Zhang, H.¹ Huang, J.² Lewis, F.L.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.