SCOPUS 정보 검색 플랫폼

Volumn 23, Issue 7-8, 2013, Pages 1843-1850

Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics

(3) Liu, Derong a Yang, Xiong a Li, Hongliang a

Author keywords

Adaptive dynamic programming; Adaptive optimal control; Neural network; Nonlinear system; Online control; Policy iteration; Reinforcement learning

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; ADAPTIVE OPTIMAL CONTROL; AFFINE NONLINEAR SYSTEMS; CONTINUOUS TIME NONLINEAR SYSTEMS; HAMILTON JACOBI BELLMAN EQUATION; ON-LINE CONTROLS; OPTIMAL CONTROL PROBLEM; POLICY ITERATION;

APPROXIMATION ALGORITHMS; CONTINUOUS TIME SYSTEMS; CONTROL; DYNAMIC PROGRAMMING; ITERATIVE METHODS; NEURAL NETWORKS; NONLINEAR SYSTEMS; REINFORCEMENT LEARNING;

OPTIMAL CONTROL SYSTEMS;

EID: 84887472008 PISSN: 09410643 EISSN: None Source Type: Journal
DOI: 10.1007/s00521-012-1249-y Document Type: Article

Times cited : (89)

References (24)

1
- 34248377157
- Optimal control processes
- Pontryagin LS (1959) Optimal control processes. Uspehi Mat Nauk (in Russian) 14: 3-20.
- (1959) Uspehi Mat Nauk (in Russian) , vol.14 , pp. 3-20
- Pontryagin, L.S.¹

2
- 0003787146
- New Jersey: Princeton University Press
- Bellman RE (1957) Dynamic programming. Princeton University Press, New Jersey.
- (1957) Dynamic Programming
- Bellman, R.E.¹

3
- 0004163205
- New York: John Wiley
- Lewis FL, Syrmos VL (1995) Optimal control. John Wiley, New York.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

4
- 0015680499
- Some new algorithms for recursive estimation in constant linear systems
- Kailath T (1973) Some new algorithms for recursive estimation in constant linear systems. IEEE Trans Inf Theory 19(6): 750-760.
- (1973) IEEE Trans Inf Theory , vol.19 , Issue.6 , pp. 750-760
- Kailath, T.¹

5
- 0018681625
- A Schur method for solving algebraic Riccati equations
- Laub AJ (1979) A Schur method for solving algebraic Riccati equations. IEEE Trans Autom Control 24(6): 913-921.
- (1979) IEEE Trans Autom Control , vol.24 , Issue.6 , pp. 913-921
- Laub, A.J.¹

6
- 39649103403
- Iterative solution of algebraic Riccati equations for damped systems
- San Diego, CA
- Moris K, Navasca C (2006) Iterative solution of algebraic Riccati equations for damped systems. In: Proceedings of 45th IEEE conference on decision and control, San Diego, CA, pp 2436-2440.
- (2006) Proceedings of 45th IEEE Conference On Decision and Control , pp. 2436-2440
- Moris, K.¹ Navasca, C.²

7
- 0018441647
- An approximation theory of optimal control for trainable manipulators
- Saridis GN, Lee CS (1979) An approximation theory of optimal control for trainable manipulators. IEEE Trans Syst Man Cybern 9(3): 152-159.
- (1979) IEEE Trans Syst Man Cybern , vol.9 , Issue.3 , pp. 152-159
- Saridis, G.N.¹ Lee, C.S.²

8
- 0031332446
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- Beard R, Saridis G, Wen J (1997) Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation. Automatica 33(12): 2159-2177.
- (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
- Beard, R.¹ Saridis, G.² Wen, J.³

9
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- Abu-Khalaf M, Lewis FL (2005) Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica 41(5): 779-791.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

10
- 0036588686
- Adaptive dynamic programming
- Murray JJ, Cox CJ, Lendaris GG, Saeks R (2002) Adaptive dynamic programming. IEEE Transa Syst Man Cybern C Appl Rev 32(2): 140-153.
- (2002) IEEE Transa Syst Man Cybern C Appl Rev , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

11
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- D. A. White and D. A. Sofge (Eds.), New York: Van Nostrand Reinhold
- Werbos PJ (1992) Approximate dynamic programming for real-time control and neural modeling. In: White DA, Sofge DA (eds) Handbook of intelligent control: neural, fuzzy, and adaptive approaches. van Nostrand Reinhold, New York, pp 493-525.
- (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches. , pp. 493-525
- Werbos, P.J.¹

12
- 66449130966
- Adaptive dynamic programming: an introduction
- Wang FY, Zhang H, Liu D (2009) Adaptive dynamic programming: an introduction. IEEE Comput Intell Mag 4(2): 39-47.
- (2009) IEEE Comput Intell Mag , vol.4 , Issue.2 , pp. 39-47
- Wang, F.Y.¹ Zhang, H.² Liu, D.³

13
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- Lewis FL, Vrabie D (2009) Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits Syst Mag 9(3): 32-50.
- (2009) IEEE Circuits Syst Mag , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

14
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof
- Al-Tamimi A, Lewis FL, Abu-Khalaf M (2008) Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Trans Syst Man Cybern B Cybern 38(4): 943-949.
- (2008) IEEE Trans Syst Man Cybern B Cybern , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

15
- 80054767702
- Neural-network-based optimal control for a class of nonlinear discrete-time systems with control constraints using the iterative GDHP algorithm
- San Jose, CA
- Liu D, Wang D, Zhao D (2011) Neural-network-based optimal control for a class of nonlinear discrete-time systems with control constraints using the iterative GDHP algorithm. In: Proceedings of international joint conference on neural networks, San Jose, CA, pp 53-60.
- (2011) Proceedings of International Joint Conference On Neural Networks , pp. 53-60
- Liu, D.¹ Wang, D.² Zhao, D.³

16
- 0003644124
- Cambridge: MIT Press
- Howard RA (1960) Dynamic programming and Markov processes. MIT Press, Cambridge.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

17
- 67349145396
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- Vrabie D, Lewis FL (2009) Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems. Neural Netw 22(3): 237-246.
- (2009) Neural Netw , vol.22 , Issue.3 , pp. 237-246
- Vrabie, D.¹ Lewis, F.L.²

18
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- Vamvoudakis KG, Lewis FL (2010) Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica 46(5): 878-888.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

19
- 79953145872
- A novel generalized value iteration scheme for uncertain continuous-time linear systems
- Atlanta, GA
- Lee JY, Park JB, Choi YH (2010) A novel generalized value iteration scheme for uncertain continuous-time linear systems. In: Proceedings of the 49th IEEE conference on decision and control, Atlanta, GA, pp 4637-4642.
- (2010) Proceedings of the 49th IEEE Conference On Decision and Control , pp. 4637-4642
- Lee, J.Y.¹ Park, J.B.² Choi, Y.H.³

20
- 77951988311
- Science Press (in Chinese), Beijing
- Guo L, Cheng DZ, Feng DX (2005) Introduction to control theory: from basic concepts to research frontiers. Science Press (in Chinese), Beijing.
- (2005) Introduction to Control Theory: From Basic Concepts to Research Frontiers
- Guo, L.¹ Cheng, D.Z.² Feng, D.X.³

21
- 0004226864
- New York: McGraw-Hill
- Rudin W (1976) Principles of mathematical analysis, 3rd edn. McGraw-Hill, New York.
- (1976) Principles of Mathematical Analysis (3rd Edn)
- Rudin, W.¹

22
- 0025627940
- Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks
- Hornik K, Stinchcombe M, White H (1990) Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural Netw 3(5): 551-560.
- (1990) Neural Netw , vol.3 , Issue.5 , pp. 551-560
- Hornik, K.¹ Stinchcombe, M.² White, H.³

23
- 0003917259
- New York: Academic Press
- Finlayson BA (1972) The method of weighted residuals and variational principles. Academic Press, New York.
- (1972) The Method of Weighted Residuals and Variational Principles
- Finlayson, B.A.¹

24
- 79551685808
- Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data
- Lewis FL, Vamvoudakis KG (2011) Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data. IEEE Trans Syst Man Cybern B Cybern 41(1): 14-25.
- (2011) IEEE Trans Syst Man Cybern B Cybern , vol.41 , Issue.1 , pp. 14-25
- Lewis, F.L.¹ Vamvoudakis, K.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.