메뉴 건너뛰기




Volumn , Issue , 2011, Pages 128-135

Policy iteration algorithm for distributed networks and graphical games

Author keywords

best response; cooperative Hamilton Jacobi equations; graphical games; Nash equilibrium; Policy Iteration

Indexed keywords

COMPUTATION THEORY; DYNAMICAL SYSTEMS; GRAPH ALGORITHMS; ITERATIVE METHODS; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING; TOPOLOGY;

EID: 84860652054     PISSN: 07431546     EISSN: 25762370     Source Type: Conference Proceeding    
DOI: 10.1109/CDC.2011.6160491     Document Type: Conference Paper
Times cited : (10)

References (38)
  • 4
    • 0018011435 scopus 로고
    • Kronecker products and matrix calculus in system theory
    • J.W. Brewer, "Kronecker products and matrix calculus in system theory," IEEE Transactions Circuits and Systems, vol. 25, 1978, pp. 772-781.
    • (1978) IEEE Transactions Circuits and Systems , vol.25 , pp. 772-781
    • Brewer, J.W.1
  • 6
    • 79953143055 scopus 로고    scopus 로고
    • Optimal control of affine nonlinear continuous-time systems using an online hamilton-jacobi-isaacs formulation1
    • Atlanta
    • T. Dierks and S. Jagannathan, Optimal Control of Affine Nonlinear Continuous-time Systems Using an Online Hamilton-Jacobi-Isaacs Formulation1, Proc. IEEE Conf Decision and Control, Atlanta, pp. 3048-3053, 2010.
    • (2010) Proc. IEEE Conf Decision and Control , pp. 3048-3053
    • Dierks, T.1    Jagannathan, S.2
  • 7
    • 4644317383 scopus 로고    scopus 로고
    • Information flow and cooperative control of vehicle formations
    • Sep.
    • J. Fax and R. Murray, "Information flow and cooperative control of vehicle formations," IEEE Trans. Autom. Control, vol. 49, no. 9, pp. 1465-1476, Sep. 2004.
    • (2004) IEEE Trans. Autom. Control , vol.49 , Issue.9 , pp. 1465-1476
    • Fax, J.1    Murray, R.2
  • 8
    • 0030086666 scopus 로고    scopus 로고
    • On global existence of solutions to coupled matrix riccati equations in closed loop nash games
    • G. Freiling, G. Jank, H. Abou-Kandil, "On global existence of Solutions to Coupled Matrix Riccati equations in closed loop Nash Games," IEEE Transactions on Automatic Control, vol. 41, no. 2, pp. 264-269, 2002.
    • (2002) IEEE Transactions on Automatic Control , vol.41 , Issue.2 , pp. 264-269
    • Freiling, G.1    Jank, G.2    Abou-Kandil, H.3
  • 9
    • 33646814428 scopus 로고    scopus 로고
    • Tracking control for multi-agent consensus with an active leader and variable topology
    • Y. Hong, J. Hu, and L. Gao, "Tracking control for multi-agent consensus with an active leader and variable topology," Automatica, vol. 42, no. 7, pp. 1177-1182, 2006.
    • (2006) Automatica , vol.42 , Issue.7 , pp. 1177-1182
    • Hong, Y.1    Hu, J.2    Gao, L.3
  • 10
    • 79960902622 scopus 로고
    • Simulation results for two new algorithms for solving coupled algebraic riccati equations
    • Sophia, Antipolis, France, June
    • Z. Gajic and T-Y. Li, "Simulation results for two new algorithms for solving coupled algebraic Riccati equations," Third Int. Symp. On Differential Games, Sophia, Antipolis, France, June 1988.
    • (1988) Third Int. Symp. on Differential Games
    • Gajic, Z.1    Li, T.-Y.2
  • 11
    • 0038548185 scopus 로고    scopus 로고
    • Coordination of groups of mobile autonomous agents using nearest neighbor rules
    • Jun.
    • A. Jadbabaie, J. Lin, and A. Morse, "Coordination of groups of mobile autonomous agents using nearest neighbor rules," IEEE Trans. Autom. Control, vol. 48, no. 6, pp. 988-1001, Jun. 2003.
    • (2003) IEEE Trans. Autom. Control , vol.48 , Issue.6 , pp. 988-1001
    • Jadbabaie, A.1    Lin, J.2    Morse, A.3
  • 15
    • 67349246510 scopus 로고    scopus 로고
    • Robust finite-time consensus tracking algorithm for multirobot systems
    • S. Khoo, L. Xie, and Z. Man, "Robust Finite-Time Consensus Tracking Algorithm for Multirobot Systems," IEEE Transactions on Mechatronics, vol. 14, pp. 219-228.
    • IEEE Transactions on Mechatronics , vol.14 , pp. 219-228
    • Khoo, S.1    Xie, L.2    Man, Z.3
  • 16
    • 0002526302 scopus 로고
    • Construction of suboptimal control sequences
    • R. J. Leake, Ruey-Wen Liu, "Construction of Suboptimal Control Sequences," J. SIAM Control, vol. 5, no, 1, pp. 54-63, 1967.
    • (1967) J. SIAM Control , vol.5 , Issue.1 , pp. 54-63
    • Leake, R.J.1    Liu, R.-W.2
  • 19
    • 7444271638 scopus 로고    scopus 로고
    • Pinning a complex dynamical network to its equilibrium
    • Oct.
    • X. Li, X. Wang, and G. Chen, "Pinning a complex dynamical network to its equilibrium," IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 51, no. 10, pp. 2074-2087, Oct. 2004.
    • (2004) IEEE Trans. Circuits Syst. I, Reg. Papers , vol.51 , Issue.10 , pp. 2074-2087
    • Li, X.1    Wang, X.2    Chen, G.3
  • 21
    • 64149119332 scopus 로고    scopus 로고
    • Consensus and cooperation in networked multi-agent systems
    • Jan.
    • R. Olfati-Saber, J. Fax, and R. Murray, "Consensus and cooperation in networked multi-agent systems," Proc. IEEE, vol. 95, no. 1, pp. 215-233, Jan. 2007.
    • (2007) Proc. IEEE , vol.95 , Issue.1 , pp. 215-233
    • Olfati-Saber, R.1    Fax, J.2    Murray, R.3
  • 22
    • 4644244041 scopus 로고    scopus 로고
    • Consensus problems in networks of agents with switching topology and time-delays
    • R. Olfati-Saber and R.M. Murray, "Consensus Problems in Networks of Agents with Switching Topology and Time-Delays," IEEE Transaction of Automatic Control, vol. 49, pp. 1520-1533, 2004.
    • (2004) IEEE Transaction of Automatic Control , vol.49 , pp. 1520-1533
    • Olfati-Saber, R.1    Murray, R.M.2
  • 24
    • 23944486457 scopus 로고    scopus 로고
    • A survey of consensus problems in multi-agent coordination
    • Portland, OR
    • W. Ren, R. Beard, and E. Atkins, "A survey of consensus problems in multi-agent coordination," in Proc. Amer. Control Conf, Portland, OR, pp. 1859-1864, 2005.
    • (2005) Proc. Amer. Control Conf , pp. 1859-1864
    • Ren, W.1    Beard, R.2    Atkins, E.3
  • 25
    • 20344399896 scopus 로고    scopus 로고
    • Consensus seeking in multiagent systems under dynamically changing interaction topologies
    • May
    • W. Ren and R. Beard, "Consensus seeking in multiagent systems under dynamically changing interaction topologies," IEEE Trans. Autom. Control, vol. 50, no. 5, pp. 655-661, May 2005.
    • (2005) IEEE Trans. Autom. Control , vol.50 , Issue.5 , pp. 655-661
    • Ren, W.1    Beard, R.2
  • 27
    • 35148881579 scopus 로고    scopus 로고
    • High-order and model reference consensus algorithms in cooperative control of multivehicle systems
    • W. Ren, K. Moore, and Y. Chen, "High-order and model reference consensus algorithms in cooperative control of multivehicle systems," J. Dynam. Syst., Meas., Control, vol. 129, no. 5, pp. 678-688, 2007.
    • (2007) J. Dynam. Syst., Meas., Control , vol.129 , Issue.5 , pp. 678-688
    • Ren, W.1    Moore, K.2    Chen, Y.3
  • 32
    • 77950630017 scopus 로고    scopus 로고
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • K.G Vamvoudakis, and F. L. Lewis, "Online Actor-Critic Algorithm to Solve the Continuous-Time Infinite Horizon Optimal Control Problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 33
    • 79960897012 scopus 로고    scopus 로고
    • Multi-player non-zero sum games: Online adaptive learning solution of coupled hamilton- Jacobi equations
    • K.G. Vamvoudakis, and F. L. Lewis, "Multi-Player Non-Zero Sum Games: Online Adaptive Learning Solution of Coupled Hamilton- Jacobi Equations," Automatica, vol. 47, no. 8, pp. 1556-1569, 2011.
    • (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 34
    • 58349110975 scopus 로고    scopus 로고
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • D. Vrabie, O. Pastravanu, F. L. Lewis, & M. Abu-Khalaf, "Adaptive Optimal Control for continuous-time linear systems based on policy iteration," Automatica, 45(2), 477-484, 2009
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Lewis, F.L.3    Abu-Khalaf, M.4
  • 36
    • 0037100101 scopus 로고    scopus 로고
    • Pinning control of scale-free dynamical networks
    • X. Wang and G. Chen, "Pinning control of scale-free dynamical networks," Physica A, vol. 310, no. 3-4, pp. 521-531, 2002.
    • (2002) Physica A , vol.310 , Issue.3-4 , pp. 521-531
    • Wang, X.1    Chen, G.2
  • 38
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • ed. D.A White and D.A. Sofge, New York: Van Nostrand Reinhold
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," Handbook of Intelligent Control, ed. D.A White and D.A. Sofge, New York: Van Nostrand Reinhold, 1992.
    • (1992) Handbook of Intelligent Control
    • Werbos, P.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.