SCOPUS 정보 검색 플랫폼

IEEE Transactions on Cybernetics

Volumn 45, Issue 1, 2015, Pages 65-76

Off-policy reinforcement learning for H∞ control design

(3) Luo, Biao a,b Wu, Huai Ning a Huang, Tingwen c

a BEIHANG UNIVERSITY (China)

b INSTITUTE OF AUTOMATION (China)

c TEXAS A AND M UNIVERSITY AT QATAR (Qatar)

Author keywords

H control design; Hamilton Jacobi Isaacs equation; Neural network; Off policy learning; Reinforcement learning

Indexed keywords

AIRCRAFT CONTROL; DESIGN; LEAST SQUARES APPROXIMATIONS; NEURAL NETWORKS; NONLINEAR EQUATIONS; PARTIAL DIFFERENTIAL EQUATIONS;

CONTROL DESIGN; HAMILTON-JACOBI-ISAACS; HAMILTON-JACOBI-ISAACS EQUATIONS; MATHEMATICAL SYSTEM MODEL; METHOD OF WEIGHTED RESIDUAL; NONLINEAR PARTIAL DIFFERENTIAL EQUATIONS; POLICY LEARNING; ROTATIONAL/TRANSLATIONAL ACTUATOR;

REINFORCEMENT LEARNING;

EID: 84919730591 PISSN: 21682267 EISSN: None Source Type: Journal
DOI: 10.1109/TCYB.2014.2319577 Document Type: Article

Times cited : (338)

References (64)

1
- 0029679044
- Reinforcement learning: A survey
- Jan.
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, no. 1, pp. 237-285, Jan. 1996.
- (1996) J. Artif. Intell. Res. , vol.4 , Issue.1 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

2
- 0004102479
- Cambridge, U.K.: Cambridge Univ. Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, U.K.: Cambridge Univ. Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

3
- 0003565783
- Nashua, NH, USA: Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control. Nashua, NH, USA: Athena Scientific, 2005.
- (2005) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

4
- 47349092417
- Hoboken, NJ, USA: Wiley
- W. B. Powell, Approximate Dynamic Programming: Solving the Curses of Dimensionality. Hoboken, NJ, USA: Wiley, 2007.
- (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality
- Powell, W.B.¹

5
- 0036588686
- Adaptive dynamic programming
- May
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002.
- (2002) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev. , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

6
- 34547133970
- Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks
- Jul.
- V. Yadav, R. Padhi, and S. Balakrishnan, "Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 1115-1128, Jul. 2007.
- (2007) IEEE Trans. Neural Netw. , vol.18 , Issue.4 , pp. 1115-1128
- Yadav, V.¹ Padhi, R.² Balakrishnan, S.³

7
- 49049119493
- A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
- Aug.
- H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 937-942
- Zhang, H.¹ Wei, Q.² Luo, Y.³

8
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

9
- 67349145396
- Neural network approach to continuoustime direct adaptive optimal control for partially unknown nonlinear systems
- D. Vrabie and F. L. Lewis, "Neural network approach to continuoustime direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, 2009.
- (2009) Neural Netw. , vol.22 , Issue.3 , pp. 237-246
- Vrabie, D.¹ Lewis, F.L.²

10
- 70349253929
- Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
- Sep.
- H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
- (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.¹ Luo, Y.² Liu, D.³

11
- 83655163786
- Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
- Dec.
- H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2226-2236, Dec. 2011.
- (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.12 , pp. 2226-2236
- Zhang, H.¹ Cui, L.² Zhang, X.³ Luo, Y.⁴

12
- 84864324494
- Online policy iteration algorithm for optimal control of linear hyperbolic PDE systems
- B. Luo and H.-N. Wu, "Online policy iteration algorithm for optimal control of linear hyperbolic PDE systems," J. Process Control, vol. 22, no. 7, pp. 1161-1170, 2012.
- (2012) J. Process Control , vol.22 , Issue.7 , pp. 1161-1170
- Luo, B.¹ Wu, H.-N.²

13
- 85013129810
- Stevenage, England: IET Press
- D. Vrabie, F. L. Lewis, and K. G. Vamvoudakis, Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles. Stevenage, England: IET Press, 2012.
- (2012) Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles
- Vrabie, D.¹ Lewis, F.L.² Vamvoudakis, K.G.³

14
- 84863467146
- Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
- Jul.
- D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
- (2012) IEEE Trans. Autom. Sci. Eng. , vol.9 , Issue.3 , pp. 628-634
- Liu, D.¹ Wang, D.² Zhao, D.³ Wei, Q.⁴ Jin, N.⁵

15
- 84869489097
- Approximate optimal control design for nonlinear one-dimensional parabolic PDE systems using empirical eigenfunctions and neural network
- Dec.
- B. Luo and H.-N. Wu, "Approximate optimal control design for nonlinear one-dimensional parabolic PDE systems using empirical eigenfunctions and neural network," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 6, pp. 1538-1549, Dec. 2012.
- (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.42 , Issue.6 , pp. 1538-1549
- Luo, B.¹ Wu, H.-N.²

16
- 84863856475
- Heuristic dynamic programming algorithm for optimal control design of linear continuous-time hyperbolic PDE systems
- H.-N. Wu and B. Luo, "Heuristic dynamic programming algorithm for optimal control design of linear continuous-time hyperbolic PDE systems," Ind. Eng. Chem. Res., vol. 51, no. 27, pp. 9310-9319, 2012.
- (2012) Ind. Eng. Chem. Res. , vol.51 , Issue.27 , pp. 9310-9319
- Wu, H.-N.¹ Luo, B.²

17
- 84881555023
- Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
- Apr.
- D. Liu and Q. Wei, "Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems," IEEE Trans. Cybern., vol. 43, no. 2, pp. 779-789, Apr. 2013.
- (2013) IEEE Trans. Cybern. , vol.43 , Issue.2 , pp. 779-789
- Liu, D.¹ Wei, Q.²

18
- 84961643244
- A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems
- to be published
- Q. Wei and D. Liu, "A novel iterative θ-adaptive dynamic programming for discrete-time nonlinear systems," IEEE Trans. Autom. Sci. Eng., 2013, to be published.
- (2013) IEEE Trans. Autom. Sci. Eng.
- Wei, Q.¹ Liu, D.²

19
- 84893640946
- Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach
- Feb.
- D. Liu, D. Wang, and H. Li, "Decentralized stabilization for a class of continuous-time nonlinear interconnected systems using online learning optimal control approach," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 2, pp. 418-428, Feb. 2014.
- (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.2 , pp. 418-428
- Liu, D.¹ Wang, D.² Li, H.³

20
- 84907947448
- arXiv preprint arXiv:1311.0396, [Online]. Available
- B. Luo, H.-N. Wu, T. Huang, and D. Liu, "Data-based approximate policy iteration for nonlinear continuous-time optimal control design," arXiv preprint arXiv:1311.0396, 2013 [Online]. Available: http://arxiv.org/abs/1311.0396
- (2013) Data-based Approximate Policy Iteration for Nonlinear Continuous-time Optimal Control Design
- Luo, B.¹ Wu, H.-N.² Huang, T.³ Liu, D.⁴

21
- 84893708995
- Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
- H. Modares, F. L. Lewis, and M.-B. Naghibi-Sistani, "Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems," Automatica, vol. 50, no. 1, pp. 193-202, 2014.
- (2014) Automatica , vol.50 , Issue.1 , pp. 193-202
- Modares, H.¹ Lewis, F.L.² Naghibi-Sistani, M.-B.³

22
- 84897594646
- Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems
- Mar.
- D. Liu and Q. Wei, "Policy iteration adaptive dynamic programming algorithm for discrete-time nonlinear systems," IEEE Trans. Neural Netw. Learn. Syst., vol. 25, no. 3, pp. 621-634, Mar. 2014.
- (2014) IEEE Trans. Neural Netw. Learn. Syst. , vol.25 , Issue.3 , pp. 621-634
- Liu, D.¹ Wei, Q.²

23
- 84988290534
- Data-based suboptimal neuro-control design with reinforcement learning for dissipative spatially distributed processes
- to be published
- B. Luo, H.-N. Wu, and H.-X. Li, "Data-based suboptimal neuro-control design with reinforcement learning for dissipative spatially distributed processes," Ind. Eng. Chem. Res., 2014, to be published.
- (2014) Ind. Eng. Chem. Res.
- Luo, B.¹ Wu, H.-N.² Li, H.-X.³

24
- 0003585352
- Upper Saddle River, NJ, USA: Prentice Hall
- K. Zhou, J. C. Doyle, and K. Glover, Robust and Optimal Control. Upper Saddle River, NJ, USA: Prentice Hall, 1996.
- (1996) Robust and Optimal Control
- Zhou, K.¹ Doyle, J.C.² Glover, K.³

25
- 84861383861
- New York, NY, USA: Springer-Verlag
- 2-Gain and Passivity in Nonlinear Control. New York, NY, USA: Springer-Verlag, 1996.
- (1996) 2-Gain and Passivity in Nonlinear Control
- Schaft, A.V.D.¹

26
- 0003404761
- Berlin, Germany: Springer
- ∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach. Berlin, Germany: Springer, 2008.
- (2008) ∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach
- Başar, T.¹ Bernhard, P.²

27
- 0026883666
- ∞ control
- Jun.
- ∞ control," IEEE Trans. Autom. Control, vol. 37, no. 6, pp. 770-784, Jun. 1992.
- (1992) IEEE Trans. Autom. Control , vol.37 , Issue.6 , pp. 770-784
- Schaft, A.V.D.¹

28
- 0029264110
- ∞ control via measurement feedback for general nonlinear systems
- Mar.
- ∞ control via measurement feedback for general nonlinear systems," IEEE Trans. Autom. Control, vol. 40, no. 3, pp. 466-472, Mar. 1995.
- (1995) IEEE Trans. Autom. Control , vol.40 , Issue.3 , pp. 466-472
- Isidori, A.¹ Kang, W.²

29
- 0026927363
- ∞-control via measurement feedback in nonlinear systems
- Sep.
- ∞-control via measurement feedback in nonlinear systems," IEEE Trans. Autom. Control, vol. 37, no. 9, pp. 1283-1293, Sep. 1992.
- (1992) IEEE Trans. Autom. Control , vol.37 , Issue.9 , pp. 1283-1293
- Isidori, A.¹ Astolfi, A.²

30
- 0032202335
- Successive Galerkin approximation algorithms for nonlinear optimal and robust control
- R. W. Beard, "Successive Galerkin approximation algorithms for nonlinear optimal and robust control," Int. J. Control, vol. 71, no. 5, pp. 717-743, 1998.
- (1998) Int. J. Control , vol.71 , Issue.5 , pp. 717-743
- Beard, R.W.¹

31
- 33845759425
- ∞ state feedback control with input saturation
- Dec.
- ∞ state feedback control with input saturation," IEEE Trans. Autom. Control, vol. 51, no. 12, pp. 1989-1995, Dec. 2006.
- (2006) IEEE Trans. Autom. Control , vol.51 , Issue.12 , pp. 1989-1995
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

32
- 84864463039
- Online solution of nonlinear two-player zero-sum games using synchronous policy iteration
- K. G. Vamvoudakis and F. L. Lewis, "Online solution of nonlinear two-player zero-sum games using synchronous policy iteration," Int. J. Robust Nonlinear Control, vol. 22, no. 13, pp. 1460-1483, 2012.
- (2012) Int. J. Robust Nonlinear Control , vol.22 , Issue.13 , pp. 1460-1483
- Vamvoudakis, K.G.¹ Lewis, F.L.²

33
- 61849156874
- ∞ control
- ∞ control," Automatica, vol. 45, no. 4, pp. 881-888, 2009.
- (2009) Automatica , vol.45 , Issue.4 , pp. 881-888
- Feng, Y.¹ Anderson, B.² Rotkowitz, M.³

34
- 84876066909
- Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm
- D. Liu, H. Li, and D. Wang, "Neural-network-based zero-sum game for discrete-time nonlinear systems via iterative adaptive dynamic programming algorithm," Neurocomputing, vol. 110, no. 13, pp. 92-100, 2013.
- (2013) Neurocomputing , vol.110 , Issue.13 , pp. 92-100
- Liu, D.¹ Li, H.² Wang, D.³

35
- 56549083444
- Analytical approximation methods for the stabilizing solution of the Hamilton-Jacobi equation
- Nov.
- N. Sakamoto and A. V. D. Schaft, "Analytical approximation methods for the stabilizing solution of the Hamilton-Jacobi equation," IEEE Trans. Autom. Control, vol. 53, no. 10, pp. 2335-2350, Nov. 2008.
- (2008) IEEE Trans. Autom. Control , vol.53 , Issue.10 , pp. 2335-2350
- Sakamoto, N.¹ Schaft, A.V.D.²

36
- 0018441647
- An approximation theory of optimal control for trainable manipulators
- Mar.
- G. N. Saridis and C.-S. G. Lee, "An approximation theory of optimal control for trainable manipulators," IEEE Trans. Syst., Man, Cybern., vol. 9, no. 3, pp. 152-159, Mar. 1979.
- (1979) IEEE Trans. Syst., Man, Cybern. , vol.9 , Issue.3 , pp. 152-159
- Saridis, G.N.¹ Lee, C.-S.G.²

37
- 0031332446
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- R. W. Beard, G. N. Saridis, and J. T. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation," Automatica, vol. 33, no. 12, pp. 2159-2177, 1997.
- (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
- Beard, R.W.¹ Saridis, G.N.² Wen, J.T.³

38
- 0032387028
- Approximate solutions to the time-invariant Hamilton-Jacobi-Bellman equation
- R. W. Beard, G. Saridis, and J. Wen, "Approximate solutions to the time-invariant Hamilton-Jacobi-Bellman equation," J. Optim. Theory Appl., vol. 96, no. 3, pp. 589-626, 1998.
- (1998) J. Optim. Theory Appl. , vol.96 , Issue.3 , pp. 589-626
- Beard, R.W.¹ Saridis, G.² Wen, J.³

39
- 84890058601
- Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks
- Dec.
- S. Mehraeen, T. Dierks, S. Jagannathan, and M. L. Crow, "Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks," IEEE Trans. Cybern., vol. 43, no. 6, pp. 1641-1655, Dec. 2013.
- (2013) IEEE Trans. Cybern. , vol.43 , Issue.6 , pp. 1641-1655
- Mehraeen, S.¹ Dierks, T.² Jagannathan, S.³ Crow, M.L.⁴

40
- 48949116222
- Neurodynamic programming and zero-sum games for constrained control systems
- Jul.
- M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Neurodynamic programming and zero-sum games for constrained control systems," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1243-1252, Jul. 2008.
- (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.7 , pp. 1243-1252
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

41
- 84899093084
- ∞ control of constrained input systems
- Mar.-May
- ∞ control of constrained input systems," Int. J. Adapt. Control, vol. 28, nos. 3-5, pp. 232-254, Mar.-May 2014.
- (2014) Int. J. Adapt. Control , vol.28 , Issue.3-5 , pp. 232-254
- Modares, H.¹ Lewis, F.L.² Naghibi-Sistani, M.-B.³

42
- 78650805234
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, 2011.
- (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
- Zhang, H.¹ Wei, Q.² Liu, D.³

43
- 77950630017
- Online actor - Critic algorithm to solve the continuous-time infinite horizon optimal control problem
- K. G. Vamvoudakis and F. L. Lewis, "Online actor - Critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

44
- 84876816423
- ∞ state feedback control with Galerkin's method
- ∞ state feedback control with Galerkin's method," Int. J. Robust Nonlinear Control, vol. 23, no. 9, pp. 991-1012, 2013.
- (2013) Int. J. Robust Nonlinear Control , vol.23 , Issue.9 , pp. 991-1012
- Luo, B.¹ Wu, H.-N.²

45
- 0029371239
- ∞ control laws
- ∞ control laws," J. Guid. Control Dynam., vol. 18, no. 5, pp. 989-994, 1995.
- (1995) J. Guid. Control Dynam. , vol.18 , Issue.5 , pp. 989-994
- Huang, J.¹ Lin, C.-F.²

46
- 84870062175
- ∞ state feedback control
- Feb.
- ∞ state feedback control," Inf. Sci., vol. 222, pp. 472-485, Feb. 2013.
- (2013) Inf. Sci. , vol.222 , pp. 472-485
- Wu, H.-N.¹ Luo, B.²

47
- 79960443754
- Adaptive dynamic programming for online solution of a zero-sum differential game
- D. Vrabie and F. Lewis, "Adaptive dynamic programming for online solution of a zero-sum differential game," J. Control Theory Appl., vol. 9, no. 3, pp. 353-360, 2011.
- (2011) J. Control Theory Appl. , vol.9 , Issue.3 , pp. 353-360
- Vrabie, D.¹ Lewis, F.²

48
- 84876909440
- ∞ control
- Dec.
- ∞ control," IEEE Trans. Neural Netw. Learn. Syst., vol. 23, no. 12, pp. 1884-1895, Dec. 2012.
- (2012) IEEE Trans. Neural Netw. Learn. Syst. , vol.23 , Issue.12 , pp. 1884-1895
- Wu, H.-N.¹ Luo, B.²

49
- 84885835001
- Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using singlenetwork ADP
- Feb.
- H. Zhang, L. Cui, and Y. Luo, "Near-optimal control for nonzero-sum differential games of continuous-time nonlinear systems using singlenetwork ADP," IEEE Trans. Cybern., vol. 43, no. 1, pp. 206-216, Feb. 2013.
- (2013) IEEE Trans. Cybern. , vol.43 , Issue.1 , pp. 206-216
- Zhang, H.¹ Cui, L.² Luo, Y.³

50
- 0003411271
- Carnegie Mellon University, Pittsburgh, PA, USA, Tech. Rep. CMU-CS-92-102
- S. B. Thrun, "Efficient exploration in reinforcement learning," Carnegie Mellon University, Pittsburgh, PA, USA, Tech. Rep. CMU-CS-92-102, 1992.
- (1992) Efficient Exploration in Reinforcement Learning
- Thrun, S.B.¹

51
- 84950296671
- New York, NY, USA: Wiley
- R. Courant and D. Hilbert, Methods of Mathematical Physics. New York, NY, USA: Wiley, 2004.
- (2004) Methods of Mathematical Physics
- Courant, R.¹ Hilbert, D.²

52
- 0003917259
- New York, NY, USA: Academic Press, Inc.
- B. A. Finlayson, The Method of Weighted Residuals and Variational Principles: With Applications in Fluid Mechanics, Heat and Mass Transfer. New York, NY, USA: Academic Press, Inc., 1972.
- (1972) The Method of Weighted Residuals and Variational Principles: With Applications in Fluid Mechanics, Heat and Mass Transfer
- Finlayson, B.A.¹

53
- 0011636441
- A new algorithm for adaptive multidimensional integration
- G. Peter Lepage, "A new algorithm for adaptive multidimensional integration," J. Comput. Phys., vol. 27, no. 2, pp. 192-203, 1978.
- (1978) J. Comput. Phys. , vol.27 , Issue.2 , pp. 192-203
- Lepage, G.P.¹

54
- 84889478560
- New York, NY, USA: Wiley
- J. A. Farrell and M. M. Polycarpou, Adaptive Approximation Based Control: Unifying Neural, Fuzzy and Traditional Adaptive Approximation Approaches. New York, NY, USA: Wiley, 2006.
- (2006) Adaptive Approximation Based Control: Unifying Neural, Fuzzy and Traditional Adaptive Approximation Approaches
- Farrell, J.A.¹ Polycarpou, M.M.²

55
- 0004099251
- Englewood Cliffs, NJ, USA: Prentice-Hall
- J.-J. E. Slotine et al., Applied Nonlinear Control. Englewood Cliffs, NJ, USA: Prentice-Hall 1991.
- (1991) Applied Nonlinear Control
- Slotine, J.-J.E.¹

56
- 84914965022
- On an iterative technique for Riccati equation computations
- Feb.
- D. Kleinman, "On an iterative technique for Riccati equation computations," IEEE Trans. Autom. Control, vol. 13, no. 1, pp. 114-115, Feb. 1968.
- (1968) IEEE Trans. Autom. Control , vol.13 , Issue.1 , pp. 114-115
- Kleinman, D.¹

57
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

58
- 39549085591
- Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discretetime systems
- Jan.
- Z. Chen and S. Jagannathan, "Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discretetime systems," IEEE Trans. Neural Netw., vol. 19, no. 1, pp. 90-106, Jan. 2008.
- (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.1 , pp. 90-106
- Chen, Z.¹ Jagannathan, S.²

59
- 79960897012
- Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
- K. G. Vamvoudakis and F. L. Lewis, "Multi-player non-zero-sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 1556-1569, 2011.
- (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
- Vamvoudakis, K.G.¹ Lewis, F.L.²

60
- 4644328593
- Off-policy temporal-difference learning with function approximation
- D. Precup, R. S. Sutton, and S. Dasgupta, "Off-policy temporal-difference learning with function approximation," in Proc. 18th ICML, 2001, pp. 417-424.
- (2001) Proc. 18th ICML , pp. 417-424
- Precup, D.¹ Sutton, R.S.² Dasgupta, S.³

61
- 77956541799
- Toward off-policy learning control with function approximation
- H. R. Maei, C. Szepesvári, S. Bhatnagar, and R. S. Sutton, "Toward off-policy learning control with function approximation," in Proc. 27th ICML, 2010, pp. 719-726.
- (2010) Proc. 27th ICML , pp. 719-726
- Maei, H.R.¹ Szepesvári, C.² Bhatnagar, S.³ Sutton, R.S.⁴

62
- 0004141725
- Englewood Cliffs, NJ, USA: Prentice-Hall
- M. Green and D. J. Limebeer, Linear Robust Control. Englewood Cliffs, NJ, USA: Prentice-Hall, 1995.
- (1995) Linear Robust Control
- Green, M.¹ Limebeer, D.J.²

63
- 0004044108
- Hoboken, NJ, USA: Wiley-Interscience
- B. L. Stevens and F. L. Lewis, Aircraft Control and Simulation. Hoboken, NJ, USA: Wiley-Interscience, 2003.
- (2003) Aircraft Control and Simulation
- Stevens, B.L.¹ Lewis, F.L.²

64
- 0032478442
- 2 disturbance attenuation solution to the nonlinear benchmark problem
- 2 disturbance attenuation solution to the nonlinear benchmark problem," Int. J. Robust Nonlinear Control, vol. 8, nos. 4-5, pp. 311-330, 1999.
- (1999) Int. J. Robust Nonlinear Control , vol.8 , Issue.4-5 , pp. 311-330
- Escobar, G.¹ Ortega, R.² Sira-Ramirez, H.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.