SCOPUS 정보 검색 플랫폼

IEEE Transactions on Neural Networks and Learning Systems

Volumn 23, Issue 12, 2012, Pages 1884-1895

Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H\infty control

(2) Wu, Huai Ning a Luo, Biao a

a BEIHANG UNIVERSITY (China)

Author keywords

H infty state feedback control; Hamilton Jacobi Isaacs equation; neural network; online; simultaneous policy update algorithm

Indexed keywords

HAMILTON-JACOBI-ISAACS; HAMILTON-JACOBI-ISAACS EQUATIONS; LEAST SQUARE METHODS; NONLINEAR PARTIAL DIFFERENTIAL EQUATIONS; ONLINE; REINFORCEMENT LEARNING TECHNIQUES; UNKNOWN ENVIRONMENTS; UPDATE ALGORITHMS;

ALGORITHMS; BANACH SPACES; LEAST SQUARES APPROXIMATIONS; NEURAL NETWORKS; NEWTON-RAPHSON METHOD; PARTIAL DIFFERENTIAL EQUATIONS; REINFORCEMENT LEARNING; STATE FEEDBACK;

NONLINEAR EQUATIONS;

ALGORITHM; ARTIFICIAL NEURAL NETWORK; AUTOMATED PATTERN RECOGNITION; HUMAN; NONLINEAR SYSTEM; PROCEDURES;

ALGORITHMS; HUMANS; NEURAL NETWORKS (COMPUTER); NONLINEAR DYNAMICS; PATTERN RECOGNITION, AUTOMATED;

EID: 84876909440 PISSN: 2162237X EISSN: 21622388 Source Type: Journal
DOI: 10.1109/TNNLS.2012.2217349 Document Type: Article

Times cited : (191)

References (49)

1
- 0003404761
- 2nd ed Boston, MA: Birkhäuser
- T. Ba̧sar and P. Bernhard, H∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach, 2nd ed. Boston, MA: Birkhäuser, 1995.
- (1995) H∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach
- Ba̧sar, T.¹ Bernhard, P.²

2
- 0003446469
- Berlin Germany: Springer-Verlag
- A. J. van der Schaft, L2-Gain and Passivity Techniques in Nonlinear Control. Berlin, Germany: Springer-Verlag, 1996.
- (1996) L2-Gain and Passivity Techniques in Nonlinear Control
- Schaft Der Van, A.J.¹

3
- 0003585352
- Upper Saddle River NJ: Prentice-Hall
- K. Zhou, J. C. Doyle, and K. Glover, Robust and Optimal Control. Upper Saddle River, NJ: Prentice-Hall, 1996.
- (1996) Robust and Optimal Control
- Zhou, K.¹ Doyle, J.C.² Glover, K.³

4
- 0029264110
- H∞ control via measurement feedback for general nonlinear systems
- Mar
- A. Isidori and W. Kang, "H∞ control via measurement feedback for general nonlinear systems," IEEE Trans. Autom. Control, vol. 40, no. 3, pp. 466-472, Mar. 1995.
- (1995) IEEE Trans. Autom. Control , vol.40 , Issue.3 , pp. 466-472
- Isidori, A.¹ Kang, W.²

5
- 1442313356
- Global H∞ controllers for a class of nonlinear systems
- Feb
- G. Bianchini, R. Genesio, A. Parenti, and A. Tesi, "Global H∞ controllers for a class of nonlinear systems," IEEE Trans. Autom. Control, vol. 49, no. 2, pp. 244-249, Feb. 2004.
- (2004) IEEE Trans. Autom. Control , vol.49 , Issue.2 , pp. 244-249
- Bianchini, G.¹ Genesio, R.² Parenti, A.³ Tesi, A.⁴

6
- 0026883666
- L2-gain analysis of nonlinear systems and nonlinear state-feedback H∞ control
- Jun
- A. J. van der Schaft, "L2-gain analysis of nonlinear systems and nonlinear state-feedback H∞ control," IEEE Trans. Autom. Control, vol. 37, no. 6, pp. 770-784, Jun. 1992.
- (1992) IEEE Trans. Autom. Control , vol.37 , Issue.6 , pp. 770-784
- Schaft Der Van, A.J.¹

7
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- Aug
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern., B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern., B, Cybern , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

8
- 49049119493
- A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
- Aug
- H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern., B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern., B, Cybern , vol.38 , Issue.4 , pp. 937-942
- Zhang, H.¹ Wei, Q.² Luo, Y.³

9
- 49049091364
- Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks
- Aug
- Q. M. Yang, J. B. Vance, and S. Jagannathan, "Control of nonaffine nonlinear discrete-time systems using reinforcement-learning-based linearly parameterized neural networks," IEEE Trans. Syst., Man, Cybern., B, Cybern., vol. 38, no. 4, pp. 994-1001, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern., B, Cybern , vol.38 , Issue.4 , pp. 994-1001
- Yang, Q.M.¹ Vance, J.B.² Jagannathan, S.³

10
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

11
- 67349145396
- Neural network approach to continuoustime direct adaptive optimal control for partially unknown nonlinear systems
- D. Vrabie and F. L. Lewis, "Neural network approach to continuoustime direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, 2009.
- (2009) Neural Netw , vol.22 , Issue.3 , pp. 237-246
- Vrabie, D.¹ Lewis, F.L.²

12
- 70349253929
- Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
- Sep
- H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
- (2009) IEEE Trans. Neural Netw , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.¹ Luo, Y.² Liu, D.³

13
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

14
- 78651311269
- Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
- Jan.
- F. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound," IEEE Trans. Neural Netw., vol. 22, no. 1, pp. 24-36, Jan. 2011.
- (2011) IEEE Trans. Neural Netw , vol.22 , Issue.1 , pp. 24-36
- Wang, F.¹ Jin, N.² Liu, D.³ Wei, Q.⁴

15
- 84875270081
- Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using timebased policy update
- Jul.
- T. Dierks and S. Jagannathan, "Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using timebased policy update," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 7, pp. 1118-1129, Jul. 2012.
- (2012) IEEE Trans. Neural Netw. Learn. Syst , vol.23 , Issue.7 , pp. 1118-1129
- Dierks, T.¹ Jagannathan, S.²

16
- 83655163786
- Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
- Dec.
- H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2226-2236, Dec. 2011.
- (2011) IEEE Trans. Neural Netw , vol.22 , Issue.12 , pp. 2226-2236
- Zhang, H.¹ Cui, L.² Zhang, X.³ Luo, Y.⁴

17
- 83655167263
- Approximate dynamic programming for optimal stationary control with control-dependent noise
- Dec.
- Y. Jiang and Z. P. Jiang, "Approximate dynamic programming for optimal stationary control with control-dependent noise," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2392-2398, Dec. 2011.
- (2011) IEEE Trans. Neural Netw , vol.22 , Issue.12 , pp. 2392-2398
- Jiang, Y.¹ Jiang, Z.P.²

18
- 83855165164
- Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming
- Dec.
- H. Zhang, R. Song, Q. Wei, and T. Zhang, "Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 1851-1862, Dec. 2011.
- (2011) IEEE Trans. Neural Netw , vol.22 , Issue.12 , pp. 1851-1862
- Zhang, H.¹ Song, R.² Wei, Q.³ Zhang, T.⁴

19
- 0004102479
- Cambridge MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

20
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- Sep
- F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, Sep. 2009.
- (2009) IEEE Circuits Syst. Mag , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

21
- 66449130966
- Adaptive dynamic programming: An introduction
- May
- F. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intell. Mag., vol. 4, no. 2, pp. 39-47, May 2009.
- (2009) IEEE Comput. Intell. Mag , vol.4 , Issue.2 , pp. 39-47
- Wang, F.¹ Zhang, H.² Liu, D.³

22
- 47349092417
- Hoboken NJ: Wiley
- W. B. Powell, Approximate Dynamic Programming: Solving the Curses of Dimensionality. Hoboken, NJ: Wiley, 2007.
- (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality
- Powell, W.B.¹

23
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

24
- 83855164075
- Hierarchical approximate policy iteration with binary-tree state space decomposition
- Dec.
- X. Xu, C. Liu, S. X. Yang, and D. Hu, "Hierarchical approximate policy iteration with binary-tree state space decomposition," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 1863-1877, Dec. 2011.
- (2011) IEEE Trans. Neural Netw , vol.22 , Issue.12 , pp. 1863-1877
- Xu, X.¹ Liu, C.² Yang, S.X.³ Hu, D.⁴

25
- 84876158475
- Simple and fast calculation of the second-order gradients for globalized dual heuristic dynamic programming in neural networks
- Oct.
- M. Fairbank, E. Alonso, and D. Prokhorov, "Simple and fast calculation of the second-order gradients for globalized dual heuristic dynamic programming in neural networks," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 10, pp. 1671-1676, Oct. 2012.
- (2012) IEEE Trans. Neural Netw. Learn. Syst , vol.23 , Issue.10 , pp. 1671-1676
- Fairbank, M.¹ Alonso, E.² Prokhorov, D.³

26
- 61849156874
- A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control
- Y. Feng, B. Anderson, and M. Rotkowitz, "A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control," Automatica, vol. 45, no. 4, pp. 881-888, 2009.
- (2009) Automatica , vol.45 , Issue.4 , pp. 881-888
- Feng, Y.¹ Anderson, B.² Rotkowitz, M.³

27
- 56549098855
- Computing the positive stabilizing solution to algebraic Riccati equations with an indefinite quadratic term via a recursive method
- Nov
- A. Lanzon, Y. Feng, B. D. O. Anderson, and M. Rotkowitz, "Computing the positive stabilizing solution to algebraic Riccati equations with an indefinite quadratic term via a recursive method," IEEE Trans. Autom. Control, vol. 53, no. 10, pp. 2280-2291, Nov. 2008.
- (2008) IEEE Trans. Autom. Control , vol.53 , Issue.10 , pp. 2280-2291
- Lanzon, A.¹ Feng, Y.² Anderson, B.D.O.³ Rotkowitz, M.⁴

28
- 0029371239
- Numerical approach to computing nonlinear H∞ control laws
- J. Huang and C. Lin, "Numerical approach to computing nonlinear H∞ control laws," AIAA J. Guidance, Control, Dynamics, vol. 18, no. 5, pp. 989-994, 1995.
- (1995) AIAA J. Guidance, Control, Dynamics , vol.18 , Issue.5 , pp. 989-994
- Huang, J.¹ Lin, C.²

29
- 0018441647
- An approximation theory of optimal control for trainable manipulators
- Mar
- G. N. Saridis and C. G. Lee, "An approximation theory of optimal control for trainable manipulators," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 9, no. 3, pp. 152-159, Mar. 1979.
- (1979) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.9 , Issue.3 , pp. 152-159
- Saridis, G.N.¹ Lee, C.G.²

30
- 0031332446
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- R. Beard, G. N. Saridis, and J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation," Automatica, vol. 33, no. 12, pp. 2159-2177, 1997.
- (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
- Beard, R.¹ Saridis, G.N.² Wen, J.³

31
- 0032387028
- Approximate solutions to the timeinvariant Hamilton-Jacobi-Bellman equation
- R. Beard, G. N. Saridis, and J. Wen, "Approximate solutions to the timeinvariant Hamilton-Jacobi-Bellman equation," J. Optim. Theory Appl., vol. 96, no. 3, pp. 589-626, 1998.
- (1998) J. Optim. Theory Appl , vol.96 , Issue.3 , pp. 589-626
- Beard, R.¹ Saridis, G.N.² Wen, J.³

32
- 0032202335
- Successive Galerkin approximation algorithms for nonlinear optimal and robust control
- R. W. Beard and T. W. Mclain, "Successive Galerkin approximation algorithms for nonlinear optimal and robust control," Int. J. Control, vol. 71, no. 5, pp. 717-743, 1998.
- (1998) Int. J. Control , vol.71 , Issue.5 , pp. 717-743
- Beard, R.W.¹ McLain, T.W.²

33
- 84864463039
- Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
- K. G. Vamvoudakis and F. L. Lewis, "Online solution of nonlinear two-player zero-sum games using synchronous policy iteration," Int. J. Robust Nonlinear Control, vol. 22, no. 13, pp. 1460-1483, 2011.
- (2011) Int. J. Robust Nonlinear Control , vol.22 , Issue.13 , pp. 1460-1483
- Vamvoudakis, K.G.¹ Lewis, F.L.²

34
- 33845759425
- Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation
- Dec
- M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Policy iterations on the Hamilton-Jacobi-Isaacs equation for H∞ state feedback control with input saturation," IEEE Trans. Autom. Control, vol. 51, no. 12, pp. 1989-1995, Dec. 2006.
- (2006) IEEE Trans. Autom. Control , vol.51 , Issue.12 , pp. 1989-1995
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

35
- 48949116222
- Neurodynamic programming and zero-sum games for constrained control systems
- Jul
- M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Neurodynamic programming and zero-sum games for constrained control systems," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1243-1252, Jul. 2008.
- (2008) IEEE Trans. Neural Netw , vol.19 , Issue.7 , pp. 1243-1252
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

36
- 34547119809
- New York: Springer-Verlag
- M. Abu-Khalaf, J. Huang, and F. L. Lewis, Nonlinear H2/H-Infinity Constrained Feedback Control: A Practical Design Approach Using Neural Networks. New York: Springer-Verlag, 2006.
- (2006) Nonlinear H2/H-Infinity Constrained Feedback Control: A Practical Design Approach Using Neural Networks
- Abu-Khalaf, M.¹ Huang, J.² Lewis, F.L.³

37
- 79960443754
- Adaptive dynamic programming for online solution of a zero-sum differential game
- D. Vrabie and F. L. Lewis, "Adaptive dynamic programming for online solution of a zero-sum differential game," J. Control Theory Appl., vol. 9, no. 3, pp. 353-360, 2011.
- (2011) J. Control Theory Appl , vol.9 , Issue.3 , pp. 353-360
- Vrabie, D.¹ Lewis, F.L.²

38
- 0003678750
- New York: Springer-Verlag
- E. Zeidler, Nonlinear Functional Analysis: Fixed Point Theorems, vol. 1. New York: Springer-Verlag, 1985.
- (1985) Nonlinear Functional Analysis: Fixed Point Theorems , vol.1
- Zeidler, E.¹

39
- 51249194918
- The method of successive approximation for functional equations
- L. Kantorovitch, "The method of successive approximation for functional equations," Acta Math., vol. 71, no. 1, pp. 63-97, 1939.
- (1939) Acta Math , vol.71 , Issue.1 , pp. 63-97
- Kantorovitch, L.¹

40
- 0000816132
- The Kantorovich theorem for Newton's method
- R. A. Tapia, "The Kantorovich theorem for Newton's method," Amer. Math. Monthly, vol. 78, no. 4, pp. 389-392, 1971.
- (1971) Amer. Math. Monthly , vol.78 , Issue.4 , pp. 389-392
- Tapia, R.A.¹

41
- 0002521058
- A note on the convergence of Newton's method
- L. B. Rall, "A note on the convergence of Newton's method," SIAM J. Numer. Anal., vol. 11, no. 1, pp. 34-36, 1974.
- (1974) SIAM J. Numer. Anal , vol.11 , Issue.1 , pp. 34-36
- Rall, L.B.¹

42
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- May
- A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems," IEEE Trans. Syst., Man, Cybern., B, Cybern., vol. 13, no. 5, pp. 834-846, May 1983.
- (1983) IEEE Trans. Syst., Man, Cybern., B, Cybern , vol.13 , Issue.5 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

43
- 0042758707
- Ph.D dissertation, Dept. Electr. Eng. & Comput. Sci., Massachusetts Inst. Technology, Cambridge
- V. Konda, "On actor-critic algorithms," Ph.D dissertation, Dept. Electr. Eng. & Comput. Sci., Massachusetts Inst. Technology, Cambridge, 2002.
- (2002) On Actor-critic Algorithms
- Konda, V.¹

44
- 4043069840
- On actor-critic algorithms
- V. Konda and J. N. Tsitsiklis, "On actor-critic algorithms," SIAM J. Control Optim., vol. 42, no. 4, pp. 1143-1166, 2003.
- (2003) SIAM J. Control Optim , vol.42 , Issue.4 , pp. 1143-1166
- Konda, V.¹ Tsitsiklis, J.N.²

45
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

46
- 0003917259
- New York: Academic
- B. A. Finlayson, The Method of Weighted Residuals and Variational Principles. New York: Academic, 1972.
- (1972) The Method of Weighted Residuals and Variational Principles
- Finlayson, B.A.¹

47
- 0004044108
- 2nd ed New York: Wiley
- B. Stevens and F. L. Lewis, Aircraft Control and Simulation, 2nd ed. New York: Wiley, 2003.
- (2003) Aircraft Control and Simulation
- Stevens, B.¹ Lewis, F.L.²

48
- 84870060536
- Ph.D. dissertation Faculty of Graduate School, Univ. Texas at Arlington, Arlington
- K. G. Vamvoudakis, "Online learning algorithms for differential dynamic games and optimal control," Ph.D. dissertation, Faculty of Graduate School, Univ. Texas at Arlington, Arlington, 2011.
- (2011) Online Learning Algorithms for Differential Dynamic Games and Optimal Control
- Vamvoudakis, K.G.¹

49
- 62949149213
- Dept. Control & Dynamical Syst., California Inst. Technology, Pasadena, Tech. Rep. TR96-021
- V. Nevisti'C and J. A. Primbs, "Constrained nonlinear optimal control: A converse HJB approach," Dept. Control & Dynamical Syst., California Inst. Technology, Pasadena, Tech. Rep. TR96-021, 1996.
- (1996) Constrained Nonlinear Optimal Control: A Converse HJB Approach
- Nevisti'C, V.¹ Primbs, J.A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.