메뉴 건너뛰기




Volumn , Issue , 2010, Pages 3066-3071

Integral reinforcement learning for online computation of feedback Nash strategies of nonzero-sum differential games

Author keywords

[No Author keywords available]

Indexed keywords

ALGEBRA; CONTINUOUS TIME SYSTEMS; DYNAMIC PROGRAMMING; FEEDBACK; GAME THEORY; ITERATIVE METHODS; REINFORCEMENT LEARNING; RICCATI EQUATIONS; SYSTEM THEORY;

EID: 79953133535     PISSN: 07431546     EISSN: 25762370     Source Type: Conference Proceeding    
DOI: 10.1109/CDC.2010.5718152     Document Type: Conference Paper
Times cited : (67)

References (24)
  • 1
    • 0027677254 scopus 로고    scopus 로고
    • Necessary and sufficient conditions for constant solutions of coupled Riccati equations in Nash games
    • Abou-Kandil, H., Freiling, G., Jank, G., "Necessary and sufficient conditions for constant solutions of coupled Riccati equations in Nash games", Systems and Control Letters, 21, 2003, pp. 295-306.
    • (2003) Systems and Control Letters , vol.21 , pp. 295-306
    • Abou-Kandil, H.1    Freiling, G.2    Jank, G.3
  • 3
    • 0031332446 scopus 로고    scopus 로고
    • Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
    • Beard, R., Saridis, G., & Wen, J., "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation". Automatica, 33(11), pp. 2159-2177, 1997.
    • (1997) Automatica , vol.33 , Issue.11 , pp. 2159-2177
    • Beard, R.1    Saridis, G.2    Wen, J.3
  • 5
    • 0018011435 scopus 로고
    • Kronecker Products and Matrix Calculus in System Theory
    • Brewer J. W., "Kronecker Products and Matrix Calculus in System Theory", IEEE Trans. on Circuit and System, 25(9), 1978, pp. 772-781.
    • (1978) IEEE Trans. on Circuit and System , vol.25 , Issue.9 , pp. 772-781
    • Brewer, J.W.1
  • 9
    • 0030086666 scopus 로고    scopus 로고
    • On Global Existence of Solutions to Coupled Matrix Riccati Equations in Closed-Loop Nash Games
    • Freiling G., Jank G., Abou-Kandil H., "On Global Existence of Solutions to Coupled Matrix Riccati Equations in Closed-Loop Nash Games", IEEE Transaction on Automatic Control, 41(2), 1996, pp.264-269.
    • (1996) IEEE Transaction on Automatic Control , vol.41 , Issue.2 , pp. 264-269
    • Freiling, G.1    Jank, G.2    Abou-Kandil, H.3
  • 11
    • 79953127250 scopus 로고    scopus 로고
    • Solving Coupled Riccati Equations for Closed-Loop Nash Strategy, by Lack of Trust Approach
    • Jungers M., De Pieri E., Abou-Kandil H., "Solving Coupled Riccati Equations for Closed-Loop Nash Strategy, by Lack of Trust Approach", International Journal of Tomography and Statistics, 7(F07), 2007, pp. 49-54.
    • (2007) International Journal of Tomography and Statistics , vol.7 , Issue.F07 , pp. 49-54
    • Jungers, M.1    De Pieri, E.2    Abou-Kandil, H.3
  • 12
    • 84914965022 scopus 로고
    • On an Iterative Technique for Riccati Equation Computations
    • February
    • Kleinman D., "On an Iterative Technique for Riccati Equation Computations", IEEE Trans. on Automatic Control, February, 1968.
    • (1968) IEEE Trans. on Automatic Control
    • Kleinman, D.1
  • 13
    • 0000672181 scopus 로고
    • Lyapunov iterations for solving coupled algebraic Riccati equations of Nash differential games and algebraic Riccati equations of zero-sum games
    • G. Olsder (ed.), Birkhauser
    • Li, T. and Gajic, Z., "Lyapunov iterations for solving coupled algebraic Riccati equations of Nash differential games and algebraic Riccati equations of zero-sum games", in New Trends in Dynamic Games, G. Olsder (ed.), Birkhauser, 1995, pp. 333-351.
    • (1995) New Trends in Dynamic Games , pp. 333-351
    • Li, T.1    Gajic, Z.2
  • 15
    • 34247618255 scopus 로고    scopus 로고
    • Newton's method for solving cross-coupled sign-indefinite algebraic Riccati equations for weakly coupled large-scale systems
    • Mukaidani, H., "Newton's method for solving cross-coupled sign-indefinite algebraic Riccati equations for weakly coupled large-scale systems", Applied Mathematics and Computation, 188, 2007, pp. 103-115.
    • (2007) Applied Mathematics and Computation , vol.188 , pp. 103-115
    • Mukaidani, H.1
  • 16
    • 33750905483 scopus 로고    scopus 로고
    • Numerical computation of sign-indefinite linear quadratic differential games for weakly coupled large scale systems
    • Mukaidani, H., "Numerical computation of sign-indefinite linear quadratic differential games for weakly coupled large scale systems", International Journal of Control, 80(1), 2007, pp. 75-86.
    • (2007) International Journal of Control , vol.80 , Issue.1 , pp. 75-86
    • Mukaidani, H.1
  • 20
    • 33847202724 scopus 로고
    • Learning to predict by the method of temporal differences
    • Sutton, R., "Learning to predict by the method of temporal differences," Machine Learning, 3:9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.1
  • 21
    • 58349110975 scopus 로고    scopus 로고
    • Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration
    • Vrabie D., Pastravanu O., Lewis F., Abu-Khalaf M., "Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration", Automatica, 45(2), 2009, pp. 477-484.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Lewis, F.3    Abu-Khalaf, M.4
  • 22
    • 0004049893 scopus 로고
    • Ph.D. Thesis, Cambridge University, Cambridge, England
    • Watkins, C., Learning from Delayed Rewards, Ph.D. Thesis, Cambridge University, Cambridge, England, 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.1
  • 23
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • White, D. and Sofge D., Eds., New York: Van Nostrand
    • Werbos, P. J., "Approximate dynamic programming for real-time control and neural modeling". In Handbook of Intelligent Control, Neural, Fuzzy, and, Adaptive Approaches, White, D. and Sofge D., Eds., New York: Van Nostrand, 1992.
    • (1992) Handbook of Intelligent Control, Neural, Fuzzy, And, Adaptive Approaches
    • Werbos, P.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.