SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Conference on Decision and Control

Volumn , Issue , 2010, Pages 3040-3047

Online solution of nonlinear two-player zero-sum games using synchronous policy iteration

(2) Vamvoudakis, Kyriakos G a Lewis, F L a

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

Author keywords

Approximate dynamic programming; H infinity; Hamilton Jacobi Isaacs equation; Nash equilibrium; Persistence of excitation; Policy iteration; Synchronous zero sum game policy iteration

Indexed keywords

ADAPTIVE ALGORITHMS; ADAPTIVE CONTROL SYSTEMS; CONTINUOUS TIME SYSTEMS; DYNAMIC PROGRAMMING; GAME THEORY; ONLINE SYSTEMS; OPTIMAL SYSTEMS; SYSTEM STABILITY;

APPROXIMATE DYNAMIC PROGRAMMING; H-INFINITY; HAMILTON-JACOBI-ISAACS EQUATIONS; NASH EQUILIBRIA; PERSISTENCE OF EXCITATION; POLICY ITERATION; ZERO-SUM GAME;

ITERATIVE METHODS;

EID: 79953155097 PISSN: 07431546 EISSN: 25762370 Source Type: Conference Proceeding
DOI: 10.1109/CDC.2010.5717607 Document Type: Conference Paper

Times cited : (35)

References (31)

1
- 14844340822
- Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach
- M. Abu-Khalaf, F. L. Lewis, "Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach", Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

2
- 48949116222
- Neurodynamic Programming and Zero- Sum Games for Constrained Control Systems
- M. Abu-Khalaf, F. L. Lewis, "Neurodynamic Programming and Zero- Sum Games for Constrained Control Systems," IEEE Transactions on Neural Networks, vol. 19, no. 7, pp. 1243-1252, 2008.
- (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.7 , pp. 1243-1252
- Abu-Khalaf, M.¹ Lewis, F.L.²

3
- 33845759425
- ∞ State Feedback Control with Input Saturation
- ∞ State Feedback Control With Input Saturation" , IEEE Transactions on Automatic Control, vol. 51, no. 12, pp. 1989-1995, 2006.
- (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.12 , pp. 1989-1995
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

4
- 0003796630
- New York: Academic Press
- R. Adams, J. Fournier, Sobolev spaces, New York: Academic Press, 2003.
- (2003) Sobolev Spaces
- Adams, R.¹ Fournier, J.²

5
- 0002145750
- ∞ -control
- ∞ -control," J. Math Syst., Estimat., Control, vol. 6, no.1, pp. 1-22, 1996.
- (1996) J. Math Syst., Estimat., Control , vol.6 , Issue.1 , pp. 1-22
- Ball, J.¹ Helton, W.²

6
- 0003809134
- Boston, MA: Birkhäuser
- M. Bardi and I. Capuzzo-Dolcetta, Optimal Control and Viscosity Solutions of Hamilto-Jacobi-Bellman Equations. Boston, MA: Birkhäuser, 1997.
- (1997) Optimal Control and Viscosity Solutions of Hamilto-Jacobi-Bellman Equations
- Bardi, M.¹ Capuzzo-Dolcetta, I.²

7
- 0003981511
- SIAM's Classic in Applied Mathematics
- nd ed. Philadelphia, PA: SIAM
- nd ed. Philadelphia, PA: SIAM, 1999, vol. 23, SIAM's Classic in Applied Mathematics.
- (1999) Dynamic Noncooperative Game Theory , vol.23
- Basar, T.¹ Olsder, G.J.²

8
- 0003404761
- Boston, MA: Birkhäuser
- ∞ Optimal Control and Related Minimax Design Problems. Boston, MA: Birkhäuser, 1995.
- (1995) ∞ Optimal Control and Related Minimax Design Problems
- Basar, T.¹ Bernard, P.²

9
- 0003487482
- Athena Scientific, MA
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

10
- 61849156874
- A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control
- Y. Feng, B. D. Anderson, M. Rotkowitz, "A game theoretic algorithm to compute local stabilizing solutions to HJBI equations in nonlinear H∞ control," Automatica, vol. 45, no. 4, pp. 881-888, 2009.
- (2009) Automatica , vol.45 , Issue.4 , pp. 881-888
- Feng, Y.¹ Anderson, B.D.² Rotkowitz, M.³

11
- 0003917259
- New York: Academic Press
- B. A. Finlayson, The method of weighted residuals and variational principles. New York: Academic Press, 1990.
- (1990) The Method of Weighted Residuals and Variational Principles
- Finlayson, B.A.¹

12
- 0025627940
- Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks
- K. Hornik, M. Stinchcombe, H. White, Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks, /eural /etworks, vol. 3, pp. 551-560, 1990.
- (1990) Neural Networks , vol.3 , pp. 551-560
- Hornik, K.¹ Stinchcombe, M.² White, H.³

13
- 0003644124
- MIT Press, Cambridge, Massachusetts
- R. A. Howard, Dynamic Programming and Markov Processes, MIT Press, Cambridge, Massachusetts, 1960.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

14
- 41849112337
- Adaptive Control Tutorial
- PA
- P. Ioannou and B. Fidan, Adaptive Control Tutorial, SIAM, Advances in Design and Control, PA, 2006.
- (2006) SIAM, Advances in Design and Control
- Ioannou, P.¹ Fidan, B.²

15
- 84914965022
- On an Iterative Technique for Riccati Equation Computations
- February
- D. Kleinman, "On an Iterative Technique for Riccati Equation Computations," IEEE Transactions on Automatic Control, vol. 13, pp. 114- 115, February, 1968.
- (1968) IEEE Transactions on Automatic Control , vol.13 , pp. 114-115
- Kleinman, D.¹

16
- 0004025786
- Taylor & Francis
- F.L. Lewis, S. Jagannathan, A. Yesildirek, /eural /etwork Control of Robot Manipulators and /onlinear Systems, Taylor & Francis 1999.
- (1999) Neural Network Control of Robot Manipulators and Nonlinear Systems
- Lewis, F.L.¹ Jagannathan, S.² Yesildirek, A.³

17
- 0004163205
- John Wiley
- F. L. Lewis, V. L. Syrmos, Optimal Control, John Wiley, 1995.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

18
- 62949149213
- Technical Report 96-021, California Institute of Technology
- V. Nevistic, J. A. Primbs, "Constrained nonlinear optimal control: a converse HJB approach," Technical Report 96-021, California Institute of Technology, 1996.
- (1996) Constrained Nonlinear Optimal Control: A Converse HJB Approach
- Nevistic, V.¹ Primbs, J.A.²

19
- 0026883666
- ∞ control
- Jun.
- ∞ control," IEEE Transactions on Automatic Control, vol. 37, no. 6, pp. 770-784, Jun. 1992.
- (1992) IEEE Transactions on Automatic Control , vol.37 , Issue.6 , pp. 770-784
- Van Der Shaft, A.J.¹

20
- 0032138828
- Notes on uniform approximation of time varying systems on finite time intervals
- E. W. Sandberg, "Notes on uniform approximation of time varying systems on finite time intervals," IEEE Transactions on Circuits and Systems-1: Fundamental Theory and Apprications, vol. 45, no. 8, pp. 863-865.
- IEEE Transactions on Circuits and Systems-1: Fundamental Theory and Apprications , vol.45 , Issue.8 , pp. 863-865
- Sandberg, E.W.¹

21
- 84921399937
- John Wiley, New Jersey
- J. Si, A. Barto, W. Powel, D. Wunch, Handbook of Learning and Approximate Dynamic Programming, John Wiley, New Jersey, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.² Powel, W.³ Wunch, D.⁴

22
- 0029533197
- Nonsmooth control Lyapunov functions
- E. D. Sontag, H. J. Sussman, "Nonsmooth control Lyapunov functions," IEEE Proc. CDC95, pp. 2799-2805. 1995.
- (1995) IEEE Proc. CDC95 , pp. 2799-2805
- Sontag, E.D.¹ Sussman, H.J.²

23
- 0004102479
- MIT Press, Cambridge, Massachusetts
- R. S. Sutton, A. G. Barto, Reinforcement Learning - An Introduction, MIT Press, Cambridge, Massachusetts, 1998.
- (1998) Reinforcement Learning - An Introduction
- Sutton, R.S.¹ Barto, A.G.²

24
- 0142165241
- Adaptive Control Design and Analysis
- Hoboken, NJ: Wiley-Interscience
- G. Tao, Adaptive Control Design and Analysis, Adaptive and Learning Systems for Signal Processing, Communications and Control Series, Hoboken, NJ: Wiley-Interscience, 2003.
- (2003) Adaptive and Learning Systems for Signal Processing, Communications and Control Series
- Tao, G.¹

25
- 3142784521
- Hindustan Book Agency, India
- S. Tijs, Introduction to Game Theory, Hindustan Book Agency, India, 2003.
- (2003) Introduction to Game Theory
- Tijs, S.¹

26
- 77950630017
- Online Actor-Critic Algorithm to Solve the Continuous-Time Infinite Horizon Optimal Control Problem
- K. G. Vamvoudakis, F. L. Lewis, "Online Actor-Critic Algorithm to Solve the Continuous-Time Infinite Horizon Optimal Control Problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

27
- 70449382072
- Online Actor Critic Algorithm to solve the Continuous-Time Infinite Horizon Optimal Control Problem
- Atlanta, June
- K. G. Vamvoudakis, and F. L. Lewis, "Online Actor Critic Algorithm to solve the Continuous-Time Infinite Horizon Optimal Control Problem," Proc. Int. Joint Conf. on /eural /etworks, pp.3180-3187, Atlanta, June 2009.
- (2009) Proc. Int. Joint Conf. on Neural Networks , pp. 3180-3187
- Vamvoudakis, K.G.¹ Lewis, F.L.²

28
- 77950629367
- Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework
- June
- D. Vrabie, K. Vamvoudakis, and F. Lewis, "Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework," Proc. of the IEEE Mediterranean Conf. on Control and Automation, pp. 1402-1409, June 2009.
- (2009) Proc. of the IEEE Mediterranean Conf. on Control and Automation , pp. 1402-1409
- Vrabie, D.¹ Vamvoudakis, K.² Lewis, F.³

29
- 77953770221
- Ph.D. Thesis, Dept. of Electrical Engineering, Univ. Texas at Arlington, Arlington, TX, USA
- D. Vrabie, Online Adaptive Optimal Control for Continuous Time Systems, Ph.D. Thesis, Dept. of Electrical Engineering, Univ. Texas at Arlington, Arlington, TX, USA, 2009.
- (2009) Online Adaptive Optimal Control for Continuous Time Systems
- Vrabie, D.¹

30
- 0003529238
- Ph.D. Thesis
- P.J. Werbos, Beyond Regression: New Tools for Prediction and Analysis in the Behavior Sciences, Ph.D. Thesis, 1974.
- (1974) Beyond Regression: New Tools for Prediction and Analysis in the Behavior Sciences
- Werbos, P.J.¹

31
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold
- P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," Handbook of Intelligent Control, ed. D.A. White and D.A. Sofge, New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.