SCOPUS 정보 검색 플랫폼

Reinforcement Learning and Approximate Dynamic Programming for Feedback Control

Volumn , Issue , 2013, Pages 281-302

Robust Adaptive Dynamic Programming

a POLYTECHNIC UNIVERSITY (United States)

Author keywords

Asymptotic, stabilizing control laws uncertainties; Optimality, robust ADP for partial state; Robust ADP, robust ADP; Robust ADP for disturbance attenuation; Robust ADP, via off line on line learning

Indexed keywords

EID: 84877912324 PISSN: None EISSN: None Source Type: Book
DOI: 10.1002/9781118453988.ch13 Document Type: Chapter

Times cited : (24)

References (30)

1
- 0004102479
- Reinforcement Learning: An Introduction
- MIT Press
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction, MIT Press, 1998.
- (1998)
- Sutton, R.S.¹ Barto, A.G.²

2
- 0003529238
- Beyond regression: new tools for prediction and analysis in the behavioural sciences
- Ph.D. Thesis, Harvard University
- P.J. Werbos. Beyond regression: new tools for prediction and analysis in the behavioural sciences, Ph.D. Thesis, Harvard University, 1972.
- (1972)
- Werbos, P.J.¹

3
- 0024888479
- Neural networks for control and system identification
- Proceedings of IEEE Conference on Decision and Control
- P.J. Werbos. Neural networks for control and system identification, Proceedings of IEEE Conference on Decision and Control, 260-265, 1989.
- (1989) , pp. 260-265
- Werbos, P.J.¹

4
- 0002011091
- A menu of designs for reinforcement learning over time
- ed. W. T. Miller, R. S. Sutton, P. J. Werbos, Cambridge: MIT Press
- P.J. Werbos. A menu of designs for reinforcement learning over time, Neural Networks for Control, ed. W. T. Miller, R. S. Sutton, P. J. Werbos, Cambridge: MIT Press, 1991, pp 67-95.
- (1991) Neural Networks for Control , pp. 67-95
- Werbos, P.J.¹

5
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold
- P. J. Werbos. Approximate dynamic programming for real-time control and neural modeling, Handbook of Intelligent Control, ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

6
- 0004049893
- Learning from delayed rewards
- Ph.D. thesis, King's College of Cambridge, UK
- C. Watkins. Learning from delayed rewards, Ph.D. thesis, King's College of Cambridge, UK, 1989.
- (1989)
- Watkins, C.¹

7
- 33846781129
- Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control, Automatica
- A. Al-Tamimi, F.L. Lewis, and M. Abu-Khalaf. Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control, Automatica, 43(3):473-481, 2007.
- (2007) , vol.43 , Issue.3 , pp. 473-481
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

8
- 0028584964
- Adaptive linear quadratic control using policy iteration
- Proceedings of American Control Conference
- S.J. Bradtke, B.E. Ydstie, and A.G. Barto. Adaptive linear quadratic control using policy iteration, Proceedings of American Control Conference, 3:3475-3479, 1994.
- (1994) , vol.3 , pp. 3475-3479
- Bradtke, S.J.¹ Ydstie, B.E.² Barto, A.G.³

9
- 78650246160
- Approximate dynamic programming for output feedback control
- Y. Jiang and Z. P. Jiang. Approximate dynamic programming for output feedback control, In Proceedings of the 29th Chinese Control Conference, 5815-5820, 2010.
- (2010) Proceedings of the 29th Chinese Control Conference , pp. 5815-5820
- Jiang, Y.¹ Jiang, Z.P.²

10
- 83655167263
- Approximate dynamic programming for optimal stationary control with control-dependent noise
- Y. Jiang and Z. P. Jiang. Approximate dynamic programming for optimal stationary control with control-dependent noise, IEEE Transactions on Neural Networks, 22(12):2392-2398, 2011.
- (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.12 , pp. 2392-2398
- Jiang, Y.¹ Jiang, Z.P.²

11
- 79551685808
- Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data
- SMCB-41
- F. L. Lewis, K. G. Vamvoudakis. Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data, IEEE Transactions Systems, Man, and Cybernetics, Part B, SMCB-41(1):14-23, 2011.
- (2011) IEEE Transactions Systems, Man, and Cybernetics, Part B , vol.41 SMCB , Issue.1 , pp. 14-23
- Lewis, F.L.¹ Vamvoudakis, K.G.²

12
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis. Adaptive optimal control for continuous-time linear systems based on policy iteration, Automatica, 45(2):477-484, 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

13
- 78650805234
- An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
- H. Zhang, Q. Wei, and D. Liu. An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games,Automatica, 47(1):207-214, 2011.
- (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
- Zhang, H.¹ Wei, Q.² Liu, D.³

14
- 0003427482
- Nonlinear and Adaptive Control Design
- John Wiley & Sons
- M. Krstic, I. Kanellakopoulos, and P. V. Kokotovic. Nonlinear and Adaptive Control Design, John Wiley & Sons, 1995.
- (1995)
- Krstic, M.¹ Kanellakopoulos, I.² Kokotovic, P.V.³

15
- 0025419843
- Further facts about input to state stabilization
- E. D. Sontag. Further facts about input to state stabilization, IEEE Transactions on Automatic Control, AC-35(4):473-476, 1990.
- (1990) IEEE Transactions on Automatic Control , vol.35 AC , Issue.4 , pp. 473-476
- Sontag, E.D.¹

16
- 0029288045
- On characterizations of the input-to-state stability property
- E. D. Sontag and Y. Wang. On characterizations of the input-to-state stability property, Systems & Control Letters, 24:351-359, 1995.
- (1995) Systems & Control Letters , vol.24 , pp. 351-359
- Sontag, E.D.¹ Wang, Y.²

17
- 33846166511
- Small-gain theorem for ISS systems and applications
- Z. P. Jiang, A. R. Teel, and L. Praly. Small-gain theorem for ISS systems and applications, Mathematics of Control, Signals, and Systems, 7(2):95-120, 1994.
- (1994) Mathematics of Control, Signals, and Systems , vol.7 , Issue.2 , pp. 95-120
- Jiang, Z.P.¹ Teel, A.R.² Praly, L.³

18
- 0030218302
- A Lyapunov formulation of the nonlinear small gain theorem for interconnected ISS systems
- Z. P. Jiang, I. Mareels and Y. Wang. A Lyapunov formulation of the nonlinear small gain theorem for interconnected ISS systems, Automatica, 32(8):1211-1215, 1996.
- (1996) Automatica , vol.32 , Issue.8 , pp. 1211-1215
- Jiang, Z.P.¹ Mareels, I.² Wang, Y.³

19
- 0025522760
- Global stabilization of partially linear composite systems
- A. Saberi, P. V. Kokotovic, and H. J. Sussmann. Global stabilization of partially linear composite systems, SIAM Journal on Control and Optimization, 2(6):1491-1503, 1990.
- (1990) SIAM Journal on Control and Optimization , vol.2 , Issue.6 , pp. 1491-1503
- Saberi, A.¹ Kokotovic, P.V.² Sussmann, H.J.³

20
- 0004163205
- Optimal Control
- John Wiley & Sons
- F.L. Lewis and V.L. Syrmos. Optimal Control, John Wiley & Sons, 1995.
- (1995)
- Lewis, F.L.¹ Syrmos, V.L.²

21
- 79959444159
- Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game
- D. Vrabie, F. Lewis. Adaptive dynamic programming algorithm for finding online the equilibrium solution of the two-player zero-sum differential game, The 2010 International Joint Conference on Neural Networks, 1-8, 2010.
- (2010) The 2010 International Joint Conference on Neural Networks , pp. 1-8
- Vrabie, D.¹ Lewis, F.²

22
- 0004178386
- Nonlinear Systems
- 3rd edition, Prentice-Hall
- H. K. Khalil. Nonlinear Systems, 3rd edition, Prentice-Hall, 2002.
- (2002)
- Khalil, H.K.¹

23
- 84914965022
- On an iterative technique for Riccati equation computations
- D. Kleinman. On an iterative technique for Riccati equation computations, IEEE Transactions on Automatic Control, AC-13(1):114-115, 1969.
- (1969) IEEE Transactions on Automatic Control , vol.13 AC , Issue.1 , pp. 114-115
- Kleinman, D.¹

24
- 0004151494
- Matrix Analysis
- Cambridge University Press, NY
- R. A. Horn and C. R. Johnson. Matrix Analysis, Cambridge University Press, NY, 1985.
- (1985)
- Horn, R.A.¹ Johnson, C.R.²

25
- 84860701570
- Robust approximate dynamic programming and global stabilization with nonlinear dynamic uncertainties
- Proceedings of the Joint 2011 IEEE Conference on Decision and Control and European Control Conference, Orlando, FL
- Y. Jiang and Z. P. Jiang. Robust approximate dynamic programming and global stabilization with nonlinear dynamic uncertainties, Proceedings of the Joint 2011 IEEE Conference on Decision and Control and European Control Conference, Orlando, FL, 115-120, 2011.
- (2011) , pp. 115-120
- Jiang, Y.¹ Jiang, Z.P.²

26
- 67349247013
- Intelligence in the brain: a theory of how it works and how to build it
- P.J.Werbos. Intelligence in the brain: a theory of how it works and how to build it, Neural Networks, 22(3):200-212, 2009.
- (2009) Neural Networks , vol.22 , Issue.3 , pp. 200-212
- Werbos, P.J.¹

27
- 0003690086
- Nonlinear Control Systems
- Springer-Verlag
- A. Isidori. Nonlinear Control Systems. Vol. II, Springer-Verlag, 1999.
- (1999) , vol.II
- Isidori, A.¹

28
- 0032115473
- Design of robust adaptive controllers for nonlinear systems with dynamic uncertainties
- Z. P. Jiang and L. Praly. Design of robust adaptive controllers for nonlinear systems with dynamic uncertainties, Automatica, 34(7):825-840, 1998.
- (1998) Automatica , vol.34 , Issue.7 , pp. 825-840
- Jiang, Z.P.¹ Praly, L.²

29
- 0027148081
- Robust load-frequency controller design for power systems
- Y.Wang, R. Zhou, and C.Wen. Robust load-frequency controller design for power systems, IEE Proceedings-Generation, Transmission and Distribution, 104(1):11-16, 1993.
- (1993) IEE Proceedings-Generation, Transmission and Distribution , vol.104 , Issue.1 , pp. 11-16
- Wang, Y.¹ Zhou, R.² Wen, C.³

30
- 0003564550
- Linear Control Systems Engineering
- The McGraw-Hill Companies, Inc.
- M. Driels. Linear Control Systems Engineering. The McGraw-Hill Companies, Inc., 1996.
- (1996)
- Driels, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.