SCOPUS 정보 검색 플랫폼

Volumn 222, Issue , 2013, Pages 472-485

Simultaneous policy update algorithms for learning the solution of linear continuous-time H∞ state feedback control

Author keywords

Algebra Riccati equation; H state feedback control; Lyapunov equation; Offline; Online; Simultaneous policy update algorithm

Indexed keywords

ACTION POLICIES; COMPARATIVE SIMULATION; FIXED POINT EQUATION; INTERNAL SYSTEMS; LINEAR CONTINUOUS-TIME; LYAPUNOV EQUATION; MODEL BASED APPROACH; MODEL FREE; OFFLINE; ONLINE; ONLINE VERSIONS; ZERO-SUM GAME;

ALGEBRA; FEEDBACK CONTROL; LYAPUNOV FUNCTIONS; REINFORCEMENT LEARNING; RICCATI EQUATIONS; STATE FEEDBACK;

ALGORITHMS;

EID: 84870062175 PISSN: 00200255 EISSN: None Source Type: Journal
DOI: 10.1016/j.ins.2012.08.012 Document Type: Article

Times cited : (81)

References (33)

1
- 48949116222
- Neurodynamic programming and zero-sum games for constrained control systems
- M. Abu-Khalaf, F.L. Lewis, and J. Huang Neurodynamic programming and zero-sum games for constrained control systems IEEE Transactions on Neural Networks 19 7 2008 1243 1252
- (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.7 , pp. 1243-1252
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

2
- 0032202335
- Successive Galerkin approximation algorithms for nonlinear optimal and robust control
- R.W. Beard, and T.W. Mclain Successive Galerkin approximation algorithms for nonlinear optimal and robust control International Journal of Control 71 5 1998 717 743
- (1998) International Journal of Control , vol.71 , Issue.5 , pp. 717-743
- Beard, R.W.¹ McLain, T.W.²

3
- 0031332446
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- R.W. Beard, G.N. Saridis, and J. Wen Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation Automatica 33 12 1997 2159 2177
- (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
- Beard, R.W.¹ Saridis, G.N.² Wen, J.³

4
- 0032387028
- Approximate solutions to the time-invariant Hamilton-Jacobi-Bellman equation
- R.W. Beard, G.N. Saridis, and J. Wen Approximate solutions to the time-invariant Hamilton-Jacobi-Bellman equation Journal of Optimization Theory and Applications 96 3 1998 589 626
- (1998) Journal of Optimization Theory and Applications , vol.96 , Issue.3 , pp. 589-626
- Beard, R.W.¹ Saridis, G.N.² Wen, J.³

5
- 79960439729
- Approximate policy iteration a survey and some new methods
- D.P. Bertsekas Approximate policy iteration a survey and some new methods Journal of Control Theory and Applications 9 3 2011 310 335
- (2011) Journal of Control Theory and Applications , vol.9 , Issue.3 , pp. 310-335
- Bertsekas, D.P.¹

6
- 0344430554
- Springer-Verlag New York
- W. Cheney Analysis for Applied Mathematics 2001 Springer-Verlag New York
- (2001) Analysis for Applied Mathematics
- Cheney, W.¹

7
- 79958095882
- ∞ control problems
- ∞ control problems Numerical Algorithms 57 3 2011 357 375
- (2011) Numerical Algorithms , vol.57 , Issue.3 , pp. 357-375
- Dragan, V.¹ Ivanov, I.G.²

8
- 79958784455
- A numerical procedure to compute the stabilising solution of game theoretic Riccati equations of stochastic control
- V. Dragan, and I. Ivanov A numerical procedure to compute the stabilising solution of game theoretic Riccati equations of stochastic control International Journal of Control 84 4 2011 783 800
- (2011) International Journal of Control , vol.84 , Issue.4 , pp. 783-800
- Dragan, V.¹ Ivanov, I.²

9
- 79953793452
- A new iterative algorithm to solve periodic Riccati differential equations with sign indefinite quadratic terms
- Y.T. Feng, A. Varga, B.D.O. Anderson, and M. Lovera A new iterative algorithm to solve periodic Riccati differential equations with sign indefinite quadratic terms IEEE Transactions on Automatic Control 56 4 2011 929 934
- (2011) IEEE Transactions on Automatic Control , vol.56 , Issue.4 , pp. 929-934
- Feng, Y.T.¹ Varga, A.² Anderson, B.D.O.³ Lovera, M.⁴

10
- 74449083177
- An iterative algorithm to solve state-perturbed stochastic algebraic Riccati equations in LQ zero-sum games
- Y.T. Feng, and B.D.O. Anderson An iterative algorithm to solve state-perturbed stochastic algebraic Riccati equations in LQ zero-sum games Systems & Control Letters 59 1 2010 50 56
- (2010) Systems & Control Letters , vol.59 , Issue.1 , pp. 50-56
- Feng, Y.T.¹ Anderson, B.D.O.²

11
- 0004141725
- Prentice-Hall Englewood Cliffs, NJ
- M. Green, and D.J.N. Limebeer Linear Robust Control 1995 Prentice-Hall Englewood Cliffs, NJ
- (1995) Linear Robust Control
- Green, M.¹ Limebeer, D.J.N.²

12
- 79953906172
- Self-organizing state aggregation for architecture design of Q-learning
- K.-S. Hwang, H.-Y. Lin, Y. -P Hsu, and H.-H. Yu Self-organizing state aggregation for architecture design of Q-learning Information Sciences 181 13 2011 2813 2822
- (2011) Information Sciences , vol.181 , Issue.13 , pp. 2813-2822
- Hwang, K.-S.¹ Lin, H.-Y.² Hsu, Y.-P.³ Yu, H.-H.⁴

13
- 84863723501
- Induced states in a decision tree constructed by Q-learning
- K.-S. Hwang, Y.-J. Chen, W.-C. Jiang, and T.-W. Yang Induced states in a decision tree constructed by Q-learning Information Sciences 213 5 2012 39 49
- (2012) Information Sciences , vol.213 , Issue.5 , pp. 39-49
- Hwang, K.-S.¹ Chen, Y.-J.² Jiang, W.-C.³ Yang, T.-W.⁴

14
- 51249194918
- The method of successive approximation for functional equations
- L. Kantorovitch The method of successive approximation for functional equations Acta Mathematica 71 1 1939 63 97
- (1939) Acta Mathematica , vol.71 , Issue.1 , pp. 63-97
- Kantorovitch, L.¹

15
- 84914965022
- On an iterative technique for Riccati equation computations
- D.L. Kleinman On an iterative technique for Riccati equation computations IEEE Transactions on Automatic Control 13 1 1968 114 115
- (1968) IEEE Transactions on Automatic Control , vol.13 , Issue.1 , pp. 114-115
- Kleinman, D.L.¹

16
- 56549098855
- Computing the positive stabilizing solution to algebraic Riccati equations with an indefinite quadratic term via a recursive method
- A. Lanzon, Y. Feng, B.D.O. Anderson, and M. Rotkowitz Computing the positive stabilizing solution to algebraic Riccati equations with an indefinite quadratic term via a recursive method IEEE Transactions on Automatic Control 53 10 2008 2280 2291
- (2008) IEEE Transactions on Automatic Control , vol.53 , Issue.10 , pp. 2280-2291
- Lanzon, A.¹ Feng, Y.² Anderson, B.D.O.³ Rotkowitz, M.⁴

17
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- F.L. Lewis, and D. Vrabie Reinforcement learning and adaptive dynamic programming for feedback control IEEE Circuits and Systems Magazine 9 3 2009 32 50
- (2009) IEEE Circuits and Systems Magazine , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

18
- 47349092417
- Wiley-Interscience Hoboken, NJ
- W.B. Powell Approximate Dynamic Programming: Solving the Curses of Dimensionality 2007 Wiley-Interscience Hoboken, NJ
- (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality
- Powell, W.B.¹

19
- 0002521058
- A note on the convergence of Newton's method
- L.B. Rall A note on the convergence of Newton's method SIAM Journal on Numerical Analysis 11 1 1974 34 36
- (1974) SIAM Journal on Numerical Analysis , vol.11 , Issue.1 , pp. 34-36
- Rall, L.B.¹

20
- 0018441647
- An approximation theory of optimal control for trainable manipulators
- G.N. Saridis, and C.G. Lee An approximation theory of optimal control for trainable manipulators IEEE Transactions on Systems, Man and Cybernetics Part B-Cybernetics 9 3 1979 152 159
- (1979) IEEE Transactions on Systems, Man and Cybernetics Part B-Cybernetics , vol.9 , Issue.3 , pp. 152-159
- Saridis, G.N.¹ Lee, C.G.²

21
- 0004044108
- second ed. John Wiley New Jersey
- B. Stevens, and F.L. Lewis Aircraft Control and Simulation second ed. 2003 John Wiley New Jersey
- (2003) Aircraft Control and Simulation
- Stevens, B.¹ Lewis, F.L.²

22
- 0004102479
- MIT Press Cambridge, Mass
- R.S. Sutton, and A.G. Barto Reinforcement Learning: An Introduction 1998 MIT Press Cambridge, Mass
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

23
- 0000816132
- The Kantorovich theorem for Newton's method
- R.A. Tapia The Kantorovich theorem for Newton's method The American Mathematical Monthly 78 4 1971 389 392
- (1971) The American Mathematical Monthly , vol.78 , Issue.4 , pp. 389-392
- Tapia, R.A.¹

24
- 79953155097
- Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
- K.G. Vamvoudakis, F.L. Lewis, Online solution of nonlinear two-player zero-sum games using synchronous policy iteration, in: Proceedings of the 49th IEEE Conference on Decision and Control (CDC), 2010, pp. 3040-3047.
- (2010) Proceedings of the 49th IEEE Conference on Decision and Control (CDC) , pp. 3040-3047
- Vamvoudakis, K.G.¹ Lewis, F.L.²

25
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- K.G. Vamvoudakis, and F.L. Lewis Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem Automatica 46 5 2010 878 888
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

26
- 84870060536
- Ph.D dissertation, Faculty of the Graduate School, University of Texas at Arlington
- K.G. Vamvoudakis, Online Learning Algorithms for Differential Dynamic Games and Optimal Control, Ph.D dissertation, Faculty of the Graduate School, University of Texas at Arlington, 2011.
- (2011) Online Learning Algorithms for Differential Dynamic Games and Optimal Control
- Vamvoudakis, K.G.¹

27
- 0026883666
- ∞ control
- ∞ control IEEE Transactions on Automatic Control 37 6 1992 770 784
- (1992) IEEE Transactions on Automatic Control , vol.37 , Issue.6 , pp. 770-784
- Van Der Schaft, A.J.¹

28
- 79952312120
- Hessian matrix distribution for Bayesian policy gradient reinforcement learning
- N.A. Vien, H. Yu, and T.C. Chung Hessian matrix distribution for Bayesian policy gradient reinforcement learning Information Sciences 181 9 2011 1671 1685
- (2011) Information Sciences , vol.181 , Issue.9 , pp. 1671-1685
- Vien, N.A.¹ Yu, H.² Chung, T.C.³

29
- 79960443754
- Adaptive dynamic programming for online solution of a zero-sum differential game
- D. Vrabie, and F.L. Lewis Adaptive dynamic programming for online solution of a zero-sum differential game Journal of Control Theory and Applications 9 3 2011 353 360
- (2011) Journal of Control Theory and Applications , vol.9 , Issue.3 , pp. 353-360
- Vrabie, D.¹ Lewis, F.L.²

30
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F.L. Lewis Adaptive optimal control for continuous-time linear systems based on policy iteration Automatica 45 2 2009 477 484
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

31
- 66449130966
- Adaptive dynamic programming: An introduction
- F. Wang, H. Zhang, and D. Liu Adaptive dynamic programming: an introduction IEEE Computational Intelligence Magazine 4 2 2009 39 47
- (2009) IEEE Computational Intelligence Magazine , vol.4 , Issue.2 , pp. 39-47
- Wang, F.¹ Zhang, H.² Liu, D.³

32
- 34250731840
- A fuzzy actor-critic reinforcement learning network
- X. Wang, Y. Cheng, and J. Yi A fuzzy actor-critic reinforcement learning network Information Sciences 177 18 2007 3764 3781
- (2007) Information Sciences , vol.177 , Issue.18 , pp. 3764-3781
- Wang, X.¹ Cheng, Y.² Yi, J.³

33
- 78650805234
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- H. Zhang, Q. Wei, and D. Liu An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games Automatica 47 1 2011 207 214
- (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
- Zhang, H.¹ Wei, Q.² Liu, D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.