SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Conference on Decision and Control

Volumn , Issue , 2010, Pages 3066-3071

Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games

(2) Vrabie, Draguna a Lewis, Frank a

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALGEBRA; CONTINUOUS TIME SYSTEMS; DYNAMIC PROGRAMMING; FEEDBACK; GAME THEORY; ITERATIVE METHODS; REINFORCEMENT LEARNING; RICCATI EQUATIONS; SYSTEM THEORY;

CONTROL STRATEGIES; COUPLED ALGEBRAIC RICCATI EQUATIONS; DIFFERENTIAL GAMES; INFINITE HORIZONS; NONZERO-SUM DIFFERENTIAL GAME; OFF-LINE METHODS; ON-LINE ALGORITHMS; ONLINE COMPUTATIONS;

E-LEARNING;

EID: 79953133535 PISSN: 07431546 EISSN: 25762370 Source Type: Conference Proceeding
DOI: 10.1109/CDC.2010.5718152 Document Type: Conference Paper

Times cited : (67)

References (24)

1
- 0027677254
- Necessary and sufficient conditions for constant solutions of coupled Riccati equations in Nash games
- Abou-Kandil, H., Freiling, G., Jank, G., "Necessary and sufficient conditions for constant solutions of coupled Riccati equations in Nash games", Systems and Control Letters, 21, 2003, pp. 295-306.
- (2003) Systems and Control Letters , vol.21 , pp. 295-306
- Abou-Kandil, H.¹ Freiling, G.² Jank, G.³

2
- 0004071782
- nd ed., (Classics in Applied Mathematics; 23), SIAM
- nd ed., (Classics in Applied Mathematics; 23), SIAM, 1999.
- (1999) Dynamic Noncooperative Game Theory
- Basar, T.¹ Olsder, G.J.²

3
- 0031332446
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- Beard, R., Saridis, G., & Wen, J., "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation". Automatica, 33(11), pp. 2159-2177, 1997.
- (1997) Automatica , vol.33 , Issue.11 , pp. 2159-2177
- Beard, R.¹ Saridis, G.² Wen, J.³

4
- 0003487482
- Athena Scientific, MA
- Bertsekas D. P. and Tsitsiklis J. N., Neuro-Dynamic Programming, Athena Scientific, MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 0018011435
- Kronecker Products and Matrix Calculus in System Theory
- Brewer J. W., "Kronecker Products and Matrix Calculus in System Theory", IEEE Trans. on Circuit and System, 25(9), 1978, pp. 772-781.
- (1978) IEEE Trans. on Circuit and System , vol.25 , Issue.9 , pp. 772-781
- Brewer, J.W.¹

6
- 79960442625
- Iterative method for general algebraic Riccati equation
- Cherfi L., Abou-Kandil H., Bourles H., "Iterative method for general algebraic Riccati equation", Proc. ACSE'05, 2005.
- Proc. ACSE'05, 2005
- Cherfi, L.¹ Abou-Kandil, H.² Bourles, H.³

7
- 33847178757
- A new algorithm for solving coupled algebraic Riccati equations
- Cherfi L., Chitour Y., Abou-Kandil H., "A new algorithm for solving coupled algebraic Riccati equations", Proc. of CIMCA'05, 2005, pp. 83-88.
- Proc. of CIMCA'05, 2005 , pp. 83-88
- Cherfi, L.¹ Chitour, Y.² Abou-Kandil, H.³

8
- 33847180300
- Chichester: Wiley
- Engwerda J.C., LQ dynamic optimization and differential games, Chichester: Wiley, 2005.
- (2005) LQ Dynamic Optimization and Differential Games
- Engwerda, J.C.¹

9
- 0030086666
- On Global Existence of Solutions to Coupled Matrix Riccati Equations in Closed-Loop Nash Games
- Freiling G., Jank G., Abou-Kandil H., "On Global Existence of Solutions to Coupled Matrix Riccati Equations in Closed-Loop Nash Games", IEEE Transaction on Automatic Control, 41(2), 1996, pp.264-269.
- (1996) IEEE Transaction on Automatic Control , vol.41 , Issue.2 , pp. 264-269
- Freiling, G.¹ Jank, G.² Abou-Kandil, H.³

10
- 34249047468
- Continuous-time adaptive critics
- DOI 10.1109/TNN.2006.889499
- Hanselmann T., Noakes L., and Zaknich A., "Continuous-time adaptive critics", IEEE Transactions on Neural Networks, 18(3), 631-647, 2007. (Pubitemid 46778095)
- (2007) IEEE Transactions on Neural Networks , vol.18 , Issue.3 , pp. 631-647
- Hanselmann, T.¹ Noakes, L.² Zaknich, A.³

11
- 79953127250
- Solving Coupled Riccati Equations for Closed-Loop Nash Strategy, by Lack of Trust Approach
- Jungers M., De Pieri E., Abou-Kandil H., "Solving Coupled Riccati Equations for Closed-Loop Nash Strategy, by Lack of Trust Approach", International Journal of Tomography and Statistics, 7(F07), 2007, pp. 49-54.
- (2007) International Journal of Tomography and Statistics , vol.7 , Issue.F07 , pp. 49-54
- Jungers, M.¹ De Pieri, E.² Abou-Kandil, H.³

12
- 84914965022
- On an Iterative Technique for Riccati Equation Computations
- February
- Kleinman D., "On an Iterative Technique for Riccati Equation Computations", IEEE Trans. on Automatic Control, February, 1968.
- (1968) IEEE Trans. on Automatic Control
- Kleinman, D.¹

13
- 0000672181
- Lyapunov iterations for solving coupled algebraic Riccati equations of Nash differential games and algebraic Riccati equations of zero-sum games
- G. Olsder (ed.), Birkhauser
- Li, T. and Gajic, Z., "Lyapunov iterations for solving coupled algebraic Riccati equations of Nash differential games and algebraic Riccati equations of zero-sum games", in New Trends in Dynamic Games, G. Olsder (ed.), Birkhauser, 1995, pp. 333-351.
- (1995) New Trends in Dynamic Games , pp. 333-351
- Li, T.¹ Gajic, Z.²

14
- 33747840670
- Optimal numerical strategy for Nash games of weakly coupled large scale systems
- Mukaidani, H., "Optimal numerical strategy for Nash games of weakly coupled large scale systems", Dynamics of Continuous, Discrete and Impulsive Systems, Series B: Applications and Algorithms, 13, 2006, pp. 249-268.
- (2006) Dynamics of Continuous, Discrete and Impulsive Systems, Series B: Applications and Algorithms , vol.13 , pp. 249-268
- Mukaidani, H.¹

15
- 34247618255
- Newton's method for solving cross-coupled sign-indefinite algebraic Riccati equations for weakly coupled large-scale systems
- Mukaidani, H., "Newton's method for solving cross-coupled sign-indefinite algebraic Riccati equations for weakly coupled large-scale systems", Applied Mathematics and Computation, 188, 2007, pp. 103-115.
- (2007) Applied Mathematics and Computation , vol.188 , pp. 103-115
- Mukaidani, H.¹

16
- 33750905483
- Numerical computation of sign-indefinite linear quadratic differential games for weakly coupled large scale systems
- Mukaidani, H., "Numerical computation of sign-indefinite linear quadratic differential games for weakly coupled large scale systems", International Journal of Control, 80(1), 2007, pp. 75-86.
- (2007) International Journal of Control , vol.80 , Issue.1 , pp. 75-86
- Mukaidani, H.¹

17
- 0031236002
- Adaptive critic designs
- Prokhorov D., Wunsch D., "Adaptive critic designs," IEEE Trans. on Neural Networks, 8(5), 997-1007, 1997.
- (1997) IEEE Trans. on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.¹ Wunsch, D.²

18
- 4243352182
- M.S. thesis, Rutgers University
- Shah, V., Power control for wireless data services based on utility and pricing, M.S. thesis, Rutgers University, 1998.
- (1998) Power Control for Wireless Data Services Based on Utility and Pricing
- Shah, V.¹

19
- 34250487269
- Nonzero-sum differential games
- Starr, A. and Y. Ho, "Nonzero-sum differential games", J. Optimization Theory and Applications, 3(3), 1969, pp. 184-206.
- (1969) J. Optimization Theory and Applications , vol.3 , Issue.3 , pp. 184-206
- Starr, A.¹ Ho, Y.²

20
- 33847202724
- Learning to predict by the method of temporal differences
- Sutton, R., "Learning to predict by the method of temporal differences," Machine Learning, 3:9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.¹

21
- 58349110975
- Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration
- Vrabie D., Pastravanu O., Lewis F., Abu-Khalaf M., "Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration", Automatica, 45(2), 2009, pp. 477-484.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Lewis, F.³ Abu-Khalaf, M.⁴

22
- 0004049893
- Ph.D. Thesis, Cambridge University, Cambridge, England
- Watkins, C., Learning from Delayed Rewards, Ph.D. Thesis, Cambridge University, Cambridge, England, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

23
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- White, D. and Sofge D., Eds., New York: Van Nostrand
- Werbos, P. J., "Approximate dynamic programming for real-time control and neural modeling". In Handbook of Intelligent Control, Neural, Fuzzy, and, Adaptive Approaches, White, D. and Sofge D., Eds., New York: Van Nostrand, 1992.
- (1992) Handbook of Intelligent Control, Neural, Fuzzy, And, Adaptive Approaches
- Werbos, P.J.¹

24
- 49249124071
- A new approach to solve a class of continuous-time nonlinear quadratic zero-sum game using ADP
- Wei Q. and Zhang H., "A new approach to solve a class of continuous-time nonlinear quadratic zero-sum game using ADP", Proc. IEEE International Conference on Networking, Sensing and Control (ICNSC'08), 6(8) 2008, pp. 507-512.
- (2008) Proc. IEEE International Conference on Networking, Sensing and Control (ICNSC'08) , vol.6 , Issue.8 , pp. 507-512
- Wei, Q.¹ Zhang, H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.