SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Conference on Decision and Control

Volumn , Issue , 2011, Pages 142-147

Nonlinear two-player zero-sum game approximate solution using a Policy Iteration algorithm

(3) Johnson, M a Bhasin, S a Dixon, W E a

a UNIVERSITY OF FLORIDA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS TIME SYSTEMS; GAME THEORY; GRADIENT METHODS; ROBUST CONTROL;

APPROXIMATE SOLUTION; DYNAMIC NEURAL NETWORKS; GRADIENT DESCENT METHOD; OPTIMAL VALUE FUNCTIONS; POLICY ITERATION ALGORITHMS; TEMPORAL DIFFERENCE ERRORS; UNCERTAIN DYNAMICS; UNIFORMLY ULTIMATELY BOUNDED;

CLOSED LOOP SYSTEMS;

EID: 84860670757 PISSN: 07431546 EISSN: 25762370 Source Type: Conference Proceeding
DOI: 10.1109/CDC.2011.6160778 Document Type: Conference Paper

Times cited : (37)

References (40)

1
- 0003911224
- Dover Pubns
- R. Isaacs, Differential games: a mathematical theory with applications to warfare and pursuit, control and optimization. Dover Pubns, 1999.
- (1999) Differential Games: A Mathematical Theory with Applications to Warfare and Pursuit, Control and Optimization
- Isaacs, R.¹

2
- 3142784521
- Hindustand Book Agency
- S. Tijs, Introduction to Game Theory. Hindustand Book Agency, 2003.
- (2003) Introduction to Game Theory
- Tijs, S.¹

3
- 0004071782
- SIAM, PA
- T. Basar and G. Olsder, Dynamic Noncooperative Game Theory. SIAM, PA, 1999.
- (1999) Dynamic Noncooperative Game Theory
- Basar, T.¹ Olsder, G.²

4
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- M. Abu-Khalaf and F. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.²

5
- 0003404761
- Boston: Birkhäuser
- T. Basar and P. Bernhard, H-infinity Optimal Control and Related Minimax Design Problems. Boston: Birkhäuser, 2008.
- (2008) H-infinity Optimal Control and Related Minimax Design Problems
- Basar, T.¹ Bernhard, P.²

6
- 0020970738
- Neuron-like adaptive elements that can solve difficult learning control problems
- A. Barto, R. Sutton, and C. Anderson, "Neuron-like adaptive elements that can solve difficult learning control problems," IEEE Trans. Syst. Man Cybern., vol. 13, no. 5, pp. 834-846, 1983.
- (1983) IEEE Trans. Syst. Man Cybern. , vol.13 , Issue.5 , pp. 834-846
- Barto, A.¹ Sutton, R.² Anderson, C.³

7
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

8
- 0026852362
- Reinforcement learning is direct adaptive optimal control
- R. Sutton, A. Barto, and R. Williams, "Reinforcement learning is direct adaptive optimal control," IEEE Contr. Syst. Mag., vol. 12, no. 2, pp. 19-22, 1992.
- (1992) IEEE Contr. Syst. Mag. , vol.12 , Issue.2 , pp. 19-22
- Sutton, R.¹ Barto, A.² Williams, R.³

9
- 0033285710
- Adaptive critic neural network for feedforward compensation
- J. Campos and F. Lewis, "Adaptive critic neural network for feedforward compensation," in Proc. Am. Control Conf., vol. 4, 1999.
- (1999) Proc. Am. Control Conf. , vol.4
- Campos, J.¹ Lewis, F.²

10
- 33847648898
- Adaptive critic designs for discrete-time zero-sum games with application to h-[infinity] control
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Adaptive critic designs for discrete-time zero-sum games with application to h-[infinity] control," IEEE Trans. Syst. Man Cybern. Part B Cybern., vol. 37, pp. 240-247, 2007.
- (2007) IEEE Trans. Syst. Man Cybern. Part B Cybern. , vol.37 , pp. 240-247
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

11
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold
- P. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
- Werbos, P.¹

12
- 0003487482
- Athena Scientific
- D. Bertsekas and J. Tsitsiklis, Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

13
- 0031236002
- Adaptive critic designs
- D. V. Prokhorov and I. Wunsch, D. C., "Adaptive critic designs," IEEE Trans. Neural Networks, vol. 8, pp. 997-1007, 1997.
- (1997) IEEE Trans. Neural Networks , vol.8 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, D.C.I.²

14
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst. Man Cybern. Part B Cybern., vol. 38, pp. 943-949, 2008.
- (2008) IEEE Trans. Syst. Man Cybern. Part B Cybern. , vol.38 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

15
- 33846781129
- Model-free q-learning designs for linear discrete-time zero-sum games with application to h-[infinity] control
- -, "Model-free q-learning designs for linear discrete-time zero-sum games with application to h-[infinity] control," Automatica, vol. 43, pp. 473-481, 2007.
- (2007) Automatica , vol.43 , pp. 473-481
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

16
- 0030196717
- Adaptive-critic-based neural networks for aircraft optimal control
- S. Balakrishnan, "Adaptive-critic-based neural networks for aircraft optimal control," J. Guid. Contr. Dynam., vol. 19, no. 4, pp. 893-898, 1996.
- (1996) J. Guid. Contr. Dynam. , vol.19 , Issue.4 , pp. 893-898
- Balakrishnan, S.¹

17
- 0033685661
- Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle
- G. Lendaris, L. Schultz, and T. Shannon, "Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle," in Int. Joint Conf. Neural Netw., 2000, pp. 73-78.
- (2000) Int. Joint Conf. Neural Netw. , pp. 73-78
- Lendaris, G.¹ Schultz, L.² Shannon, T.³

18
- 0036060633
- An adaptive critic global controller
- S. Ferrari and R. Stengel, "An adaptive critic global controller," in Proc. Am. Control Conf., vol. 4, 2002.
- (2002) Proc. Am. Control Conf. , vol.4
- Ferrari, S.¹ Stengel, R.²

19
- 0036641793
- State-constrained agile missile control with adaptive-critic-based neural networks
- D. Han and S. Balakrishnan, "State-constrained agile missile control with adaptive-critic-based neural networks," IEEE Trans. Control Syst. Technol., vol. 10, no. 4, pp. 481-489, 2002.
- (2002) IEEE Trans. Control Syst. Technol. , vol.10 , Issue.4 , pp. 481-489
- Han, D.¹ Balakrishnan, S.²

20
- 34047138362
- Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
- P. He and S. Jagannathan, "Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints," IEEE Trans. Syst. Man Cybern. Part B Cybern., vol. 37, no. 2, pp. 425-436, 2007.
- (2007) IEEE Trans. Syst. Man Cybern. Part B Cybern. , vol.37 , Issue.2 , pp. 425-436
- He, P.¹ Jagannathan, S.²

21
- 0004370245
- Wright Lab, Wright-Patterson Air Force Base, OH, Tech. Rep.
- L. Baird, "Advantage updating," Wright Lab, Wright-Patterson Air Force Base, OH, Tech. Rep., 1993.
- (1993) Advantage Updating
- Baird, L.¹

22
- 0033629916
- Reinforcement learning in continuous time and space
- K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
- (2000) Neural Comput. , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

23
- 0036588686
- Adaptive dynamic programming
- J. Murray, C. Cox, G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst. Man Cybern. Part C Appl. Rev., vol. 32, no. 2, pp. 140-153, 2002.
- (2002) IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. , vol.32 , Issue.2 , pp. 140-153
- Murray, J.¹ Cox, C.² Lendaris, G.³ Saeks, R.⁴

24
- 0031332446
- Galerkin approximations of the generalized hamilton-jacobi-bellman equation
- R. Beard, G. Saridis, and J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation," Automatica, vol. 33, pp. 2159-2178, 1997.
- (1997) Automatica , vol.33 , pp. 2159-2178
- Beard, R.¹ Saridis, G.² Wen, J.³

25
- 67349145396
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- D. Vrabie and F. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Networks, vol. 22, no. 3, pp. 237 - 246, 2009.
- (2009) Neural Networks , vol.22 , Issue.3 , pp. 237-246
- Vrabie, D.¹ Lewis, F.²

26
- 79953132013
- Online synchronous policy iteration method for optimal control
- Springer
- K. Vamvoudakis and F. Lewis, "Online synchronous policy iteration method for optimal control," in Recent Advances in Intelligent Control Systems. Springer, 2009, pp. 357-374.
- (2009) Recent Advances in Intelligent Control Systems , pp. 357-374
- Vamvoudakis, K.¹ Lewis, F.²

27
- 79953155097
- Online neural network solution of nonlinear two-player zero-sum games using synchronous policy iteration
- -, "Online neural network solution of nonlinear two-player zero-sum games using synchronous policy iteration," in Proc. IEEE Conf. Decis. Control, 2010.
- (2010) Proc. IEEE Conf. Decis. Control
- Vamvoudakis, K.¹ Lewis, F.²

28
- 79953151751
- A model-free robust policy iteration algorithm for optimal control of nonlinear systems
- S. Bhasin, M. Johnson, and W. E. Dixon, "A model-free robust policy iteration algorithm for optimal control of nonlinear systems," in Proc. IEEE Conf. Decis. Control, 2010, pp. 3060-3065.
- (2010) Proc. IEEE Conf. Decis. Control , pp. 3060-3065
- Bhasin, S.¹ Johnson, M.² Dixon, W.E.³

29
- 33847202724
- Learning to predict by the methods of temporal differences
- R. Sutton, "Learning to predict by the methods of temporal differences," Mach. Learn., vol. 3, no. 1, pp. 9-44, 1988.
- (1988) Mach. Learn. , vol.3 , Issue.1 , pp. 9-44
- Sutton, R.¹

30
- 0004163205
- John Wiley & Sons
- F. L. Lewis, Optimal Control. John Wiley & Sons, 1986.
- (1986) Optimal Control
- Lewis, F.L.¹

31
- 0026883666
- L2-gain analysis of nonlinear systems and nonlinear H-[infinity] control
- A. Van der Schaft, "L2-gain analysis of nonlinear systems and nonlinear H-[infinity] control," IEEE Trans. Autom. Control, vol. 37, no. 6, pp. 770-784, 1992.
- (1992) IEEE Trans. Autom. Control , vol.37 , Issue.6 , pp. 770-784
- Van Der Schaft, A.¹

32
- 24644490744
- Dover Pubns
- D. Kirk, Optimal Control Theory: An Introduction. Dover Pubns, 2004.
- (2004) Optimal Control Theory: An Introduction
- Kirk, D.¹

33
- 0003581164
- Systems Report 91-09-01, University of Southern California
- M. Polycarpou and P. Ioannou, "Identification and control of nonlinear systems using neural network models: Design and stability analysis," Systems Report 91-09-01, University of Southern California, 1991.
- (1991) Identification and Control of Nonlinear Systems Using Neural Network Models: Design and Stability Analysis
- Polycarpou, M.¹ Ioannou, P.²

34
- 0024861871
- Approximation by superpositions of a sigmoidal function
- G. Cybenko, "Approximation by superpositions of a sigmoidal function," Math. Control Signals Syst., vol. 2, pp. 303-314, 1989.
- (1989) Math. Control Signals Syst. , vol.2 , pp. 303-314
- Cybenko, G.¹

35
- 0042032055
- Philadelphia, PA, USA: Society for Industrial and Applied Mathematics
- F. L. Lewis, R. Selmic, and J. Campos, Neuro-Fuzzy Control of Industrial Systems with Actuator Nonlinearities. Philadelphia, PA, USA: Society for Industrial and Applied Mathematics, 2002.
- (2002) Neuro-fuzzy Control of Industrial Systems with Actuator Nonlinearities
- Lewis, F.L.¹ Selmic, R.² Campos, J.³

36
- 0000466705
- Nonlinear network structures for feedback control
- F. L. Lewis, "Nonlinear network structures for feedback control," Asian J. Control, vol. 1, no. 4, pp. 205-228, 1999.
- (1999) Asian J. Control , vol.1 , Issue.4 , pp. 205-228
- Lewis, F.L.¹

37
- 0004469897
- Neurons with graded response have collective computational properties like those of two-state neurons
- J. Hopfield, "Neurons with graded response have collective computational properties like those of two-state neurons," Proc. Nat. Acad. Sci. U.S.A., vol. 81, no. 10, p. 3088, 1984.
- (1984) Proc. Nat. Acad. Sci. U.S.A. , vol.81 , Issue.10 , pp. 3088
- Hopfield, J.¹

38
- 0013268990
- World Scientific Pub Co Inc
- A. Poznyak, E. Sanchez, and W. Yu, Differential neural networks for robust nonlinear control: identification, state estimation and trajectory tracking. World Scientific Pub Co Inc, 2001.
- (2001) Differential Neural Networks for Robust Nonlinear Control: Identification, State Estimation and Trajectory Tracking
- Poznyak, A.¹ Sanchez, E.² Yu, W.³

39
- 0004099251
- Prentice Hall
- J. Slotine and W. Li, Applied Nonlinear Control. Prentice Hall, 1991.
- (1991) Applied Nonlinear Control
- Slotine, J.¹ Li, W.²

40
- 0004178386
- 3rd ed. Prentice Hall
- H. K. Khalil, Nonlinear Systems, 3rd ed. Prentice Hall, 2002.
- (2002) Nonlinear Systems
- Khalil, H.K.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.