SCOPUS Information Retrieval Platform

International Journal of Robust and Nonlinear Control

Volume 22, Issue 13, 2012, Pages 1460-1483

Online solution of nonlinear two-player zero-sum games using synchronous policy iteration

(2) Vamvoudakis, Kyriakos G. (a); Lewis, F. L. (a)

(a) University of Texas at Arlington (United States)

Author keywords

approximate dynamic programming; Hamilton Jacobi Isaacs equation; Nash equilibrium; synchronous zero sum game policy iteration

Indexed keywords

ADAPTIVE LEARNING ALGORITHM; APPROXIMATE DYNAMIC PROGRAMMING; CLOSED LOOP STABILITY; COMPLEX NONLINEAR SYSTEM; CONTINUOUS TIME; GAME POLICIES; GAME PROBLEM; HAMILTON-JACOBI-ISAACS; HAMILTON-JACOBI-ISAACS EQUATIONS; INFINITE HORIZONS; NASH EQUILIBRIA; ON-LINE GAMING; OPTIMAL VALUE FUNCTIONS; OPTIMAL VALUES; PERSISTENCE OF EXCITATION; POLICY ITERATION; REAL TIME; SADDLE POINT; SIMULATION EXAMPLE; TUNING ALGORITHM; ZERO-SUM GAME;

ADAPTIVE ALGORITHMS; CONTINUOUS TIME SYSTEMS; LEARNING ALGORITHMS; LINEAR SYSTEMS; NONLINEAR SYSTEMS; OPTIMAL SYSTEMS; ROBUST CONTROL;

GAME THEORY;

EID: 84864463039 PISSN: 10498923 EISSN: 10991239 Source Type: Journal
DOI: 10.1002/rnc.1760 Document Type: Article

Times cited : (192)

References (34)

1
- 3142784521
- Hindustan Book Agency: India
- Tijs S,. 2003. Introduction to Game Theory, Hindustan Book Agency: India.
- (2003) Introduction to Game Theory
- Tijs, S.¹

2
- 84855541800
- (2nd edn), SIAM: Philadelphia, PA
- Başar T, Olsder GJ,. 1999. Dynamic Noncooperative Game Theory (Classics in Applied Mathematics 23) (2nd edn), SIAM: Philadelphia, PA.
- (1999) Dynamic Noncooperative Game Theory (Classics in Applied Mathematics 23)
- Başar, T.¹ Olsder, G.J.²

3
- 84855850851
- Birkhäuser: Boston, MA
- H∞-Optimal Control and Related Minimax Design Problems, Birkhäuser: Boston, MA.
- (1995) H∞-Optimal Control and Related Minimax Design Problems
- Başar, T.¹ Bernhard, P.²

4
- 0026883666
- H∞ control
- H∞ control. IEEE Transactions on Automatic Control. 1992 37 6: 770-784.
- (1992) IEEE Transactions on Automatic Control , vol.37 , Issue.6 , pp. 770-784
- Van Der Schaft, A.J.¹

5
- 0033629916
- Reinforcement learning in continuous time and space
- Doya K,. Reinforcement learning in continuous time and space. Neural Computation. 2000 12 1: 219-245.
- (2000) Neural Computation , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

6
- 0035422340
- Neural mechanisms of learning and control
- Doya K, Kimura H, Kawato M,. Neural mechanisms of learning and control. IEEE Control Systems Magazine. 2001 21 4: 42-54.
- (2001) IEEE Control Systems Magazine , vol.21 , Issue.4 , pp. 42-54
- Doya, K.¹ Kimura, H.² Kawato, M.³

7
- 0003644124
- MIT Press: Cambridge, MA
- Howard RA,. 1960. Dynamic Programming and Markov Processes, MIT Press: Cambridge, MA.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

8
- 84921399937
- John Wiley: Hoboken, NJ
- Si J, Barto A, Powell W, Wunsch D,. 2004. Handbook of Learning and Approximate Dynamic Programming, John Wiley: Hoboken, NJ.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.² Powell, W.³ Wunsch, D.⁴

9
- 0004102479
- MIT Press: Cambridge, MA
- Sutton RS, Barto A G,. 1998. Reinforcement Learning: An Introduction, MIT Press: Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

10
- 77950629367
- Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework
- Thessaloniki, Greece, June
- Vrabie D, Vamvoudakis K, Lewis F,. Adaptive optimal controllers based on generalized policy iteration in a continuous-time framework, Proceedings of the IEEE Mediterranean Conference on Control and Automation. Thessaloniki, Greece, June 2009; 1402-1409.
- (2009) Proceedings of the IEEE Mediterranean Conference on Control and Automation , pp. 1402-1409
- Vrabie, D.¹ Vamvoudakis, K.² Lewis, F.³

11
- 0003487482
- Athena Scientific: Belmont, MA
- Bertsekas DP, Tsitsiklis JN,. 1996. Neuro-Dynamic Programming, Athena Scientific: Belmont, MA.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

12
- 0003529238
- Ph.D. Thesis, Harvard University
- Werbos PJ,. 1974. Beyond regression: new tools for prediction and analysis in the behavioral sciences, Ph.D. Thesis, Harvard University.
- (1974) Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences
- Werbos, P.J.¹

13
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- White D.A. Sofge D.A. (eds), Van Nostrand Reinhold, New York
- Werbos PJ,. 1992. Approximate dynamic programming for real-time control and neural modeling, In Handbook of Intelligent Control, White DA, Sofge DA, (eds), Van Nostrand Reinhold, New York.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

14
- 77953770221
- Ph.D. Thesis, Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX
- Vrabie D,. 2009. Online adaptive optimal control for continuous time systems, Ph.D. Thesis, Department of Electrical Engineering, University of Texas at Arlington, Arlington, TX.
- (2009) Online Adaptive Optimal Control for Continuous Time Systems
- Vrabie, D.¹

15
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- Vamvoudakis KG, Lewis FL,. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem. Automatica. 2010 46 5: 878-888.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

16
- 0004163205
- John Wiley: New York
- Lewis FL, Syrmos VL,. 1995. Optimal Control, John Wiley: New York.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

17
- 0002145750
- H∞-control
- H∞-control. Journal of Mathematical Systems, Estimation, and Control. 1996 6 1: 1-22.
- (1996) Journal of Mathematical Systems, Estimation, and Control , vol.6 , Issue.1 , pp. 1-22
- Ball, J.¹ Helton, W.²

18
- 0003809134
- Birkhäuser: Boston, MA
- Bardi M, Capuzzo-Dolcetta I,. 1997. Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations, Birkhäuser: Boston, MA.
- (1997) Optimal Control and Viscosity Solutions of Hamilton-Jacobi-Bellman Equations
- Bardi, M.¹ Capuzzo-Dolcetta, I.²

19
- 61849156874
- H∞ control
- H∞ control. Automatica. 2009 45 4: 881-888.
- (2009) Automatica , vol.45 , Issue.4 , pp. 881-888
- Feng, Y.¹ Anderson, B.D.² Rotkowitz, M.³

20
- 48949116222
- Neurodynamic programming and zero-sum games for constrained control systems
- Abu-Khalaf M, Lewis FL,. Neurodynamic programming and zero-sum games for constrained control systems. IEEE Transactions on Neural Networks. 2008 19 7: 1243-1252.
- (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.7 , pp. 1243-1252
- Abu-Khalaf, M.¹ Lewis, F.L.²

21
- 33845759425
- H∞ state feedback control with input saturation
- DOI 10.1109/TAC.2006.884959
- H∞ state feedback control with input saturation. IEEE Transactions on Automatic Control. 2006 51 12: 1989-1995. (Pubitemid 46002295)
- (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.12 , pp. 1989-1995
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

22
- 84914965022
- On an iterative technique for Riccati equation computations
- Kleinman D,. On an iterative technique for Riccati equation computations. IEEE Transactions on Automatic Control. 1968 13 1: 114-115.
- (1968) IEEE Transactions on Automatic Control , vol.13 , Issue.1 , pp. 114-115
- Kleinman, D.¹

23
- 0003796630
- Academic Press: New York
- Adams R, Fournier J,. 2003. Sobolev Spaces, Academic Press: New York.
- (2003) Sobolev Spaces
- Adams, R.¹ Fournier, J.²

24
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- DOI 10.1016/j.automatica.2004.11.034, PII S0005109805000105
- Abu-Khalaf M, Lewis FL,. Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach. Automatica. 2005 41 5: 779-791. (Pubitemid 40352391)
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

25
- 0003917259
- Academic Press: New York
- Finlayson BA,. 1990. The method of weighted residuals and variational principles, Academic Press: New York.
- (1990) The Method of Weighted Residuals and Variational Principles
- Finlayson, B.A.¹

26
- 0025627940
- Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks
- Hornik K, Stinchcombe M, White H,. Universal Approximation of an unknown mapping and its derivatives using multilayer feedforward networks. Neural Networks. 1990 3 5: 551-560.
- (1990) Neural Networks , vol.3 , Issue.5 , pp. 551-560
- Hornik, K.¹ Stinchcombe, M.² White, H.³

27
- 0032138828
- Notes on uniform approximation of time varying systems on finite time intervals
- Sandberg EW,. Notes on uniform approximation of time varying systems on finite time intervals. IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications. 1998 45 8: 863-865.
- (1998) IEEE Transactions on Circuits and Systems I: Fundamental Theory and Applications , vol.45 , Issue.8 , pp. 863-865
- Sandberg, E.W.¹

28
- 41849112337
- SIAM: Philadelphia, PA
- Ioannou P, Fidan B,. 2006. Adaptive Control Tutorial (Advances in Design and Control), SIAM: Philadelphia, PA.
- (2006) Adaptive Control Tutorial (Advances in Design and Control)
- Ioannou, P.¹ Fidan, B.²

29
- 0142165241
- Wiley-Interscience: Hoboken, NJ
- Tao G,. 2003. Adaptive Control Design and Analysis (Adaptive and Learning Systems for Signal Processing, Communications and Control Series), Wiley-Interscience: Hoboken, NJ.
- (2003) Adaptive Control Design and Analysis (Adaptive and Learning Systems for Signal Processing, Communications and Control Series)
- Tao, G.¹

30
- 0004044108
- (2nd edn), John Wiley: Hoboken, NJ
- Stevens B, Lewis FL,. 2003. Aircraft Control and Simulation (2nd edn), John Wiley: Hoboken, NJ.
- (2003) Aircraft Control and Simulation
- Stevens, B.¹ Lewis, F.L.²

31
- 62949149213
- Constrained nonlinear optimal control: A converse HJB approach
- Pasadena, CA
- Nevistic V, Primbs JA,. 1996. Constrained nonlinear optimal control: a converse HJB approach, Technical Report 96-021, California Institute of Technology, Pasadena, CA.
- (1996) Technical Report 96-021, California Institute of Technology
- Nevistic, V.¹ Primbs, J.A.²

32
- 0004025786
- Taylor & Francis: Bristol, PA
- Lewis FL, Jagannathan S, Yesildirek A,. 1999. Neural Network Control of Robot Manipulators and Nonlinear Systems, Taylor & Francis: Bristol, PA.
- (1999) Neural Network Control of Robot Manipulators and Nonlinear Systems
- Lewis, F.L.¹ Jagannathan, S.² Yesildirek, A.³

33
- 0004178386
- Prentice-Hall: Upper Saddle River, NJ
- Khalil HK,. 1996. Nonlinear Systems, Prentice-Hall: Upper Saddle River, NJ.
- (1996) Nonlinear Systems
- Khalil, H.K.¹

34
- 79953151751
- A model free robust policy iteration algorithm for optimal control of nonlinear systems
- Atlanta, GA, 15-17 December
- Bhasin S, Johnson M, Dixon WE,. A model free robust policy iteration algorithm for optimal control of nonlinear systems, Proceedings of the 49th IEEE Conference on Decision and Control, Atlanta, GA, 15-17 December 2010; 3060-3065.
- (2010) Proceedings of the 49th IEEE Conference on Decision and Control , pp. 3060-3065
- Bhasin, S.¹ Johnson, M.² Dixon, W.E.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.