SCOPUS 정보 검색 플랫폼

Proceedings of the IEEE Conference on Decision and Control

Volumn , Issue , 2011, Pages 128-135

Policy iteration algorithm for distributed networks and graphical games

(2) Vamvoudakis, Kyriakos G a Lewis, F L a

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

Author keywords

best response; cooperative Hamilton Jacobi equations; graphical games; Nash equilibrium; Policy Iteration

Indexed keywords

COMPUTATION THEORY; DYNAMICAL SYSTEMS; GRAPH ALGORITHMS; ITERATIVE METHODS; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING; TOPOLOGY;

BEST RESPONSE; COOPERATIVE HAMILTON-JACOBI EQUATIONS; GRAPHICAL GAMES; NASH EQUILIBRIA; POLICY ITERATION;

GAME THEORY;

EID: 84860652054 PISSN: 07431546 EISSN: 25762370 Source Type: Conference Proceeding
DOI: 10.1109/CDC.2011.6160491 Document Type: Conference Paper

Times cited : (10)

References (38)

1
- 4644305952
- Birkhäuser
- H. Abou-Kandil, G. Freiling, V. Ionescu, and G. Jank, Matrix Riccati Equations in Control and Systems Theory, Birkhäuser, 2003.
- (2003) Matrix Riccati Equations in Control and Systems Theory
- Abou-Kandil, H.¹ Freiling, G.² Ionescu, V.³ Jank, G.⁴

2
- 0004071782
- nd ed. Philadelphia, PA: SIAM
- nd ed. Philadelphia, PA: SIAM, 1999.
- (1999) Dynamic Noncooperative Game Theory
- Başar, T.¹ Olsder, G.J.²

3
- 0003487482
- Athena Scientific, MA
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, MA, 1996.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 0018011435
- Kronecker products and matrix calculus in system theory
- J.W. Brewer, "Kronecker products and matrix calculus in system theory," IEEE Transactions Circuits and Systems, vol. 25, 1978, pp. 772-781.
- (1978) IEEE Transactions Circuits and Systems , vol.25 , pp. 772-781
- Brewer, J.W.¹

5
- 40949147745
- A comprehensive survey of multi-agent reinforcement learning
- L. Busoniu, R. Babuska, B. De Schutter, "A Comprehensive Survey of Multi-Agent Reinforcement Learning," IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews, vol. 38, no. 2, pp. 156-172, 2008.
- (2008) IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

6
- 79953143055
- Optimal control of affine nonlinear continuous-time systems using an online hamilton-jacobi-isaacs formulation1
- Atlanta
- T. Dierks and S. Jagannathan, Optimal Control of Affine Nonlinear Continuous-time Systems Using an Online Hamilton-Jacobi-Isaacs Formulation1, Proc. IEEE Conf Decision and Control, Atlanta, pp. 3048-3053, 2010.
- (2010) Proc. IEEE Conf Decision and Control , pp. 3048-3053
- Dierks, T.¹ Jagannathan, S.²

7
- 4644317383
- Information flow and cooperative control of vehicle formations
- Sep.
- J. Fax and R. Murray, "Information flow and cooperative control of vehicle formations," IEEE Trans. Autom. Control, vol. 49, no. 9, pp. 1465-1476, Sep. 2004.
- (2004) IEEE Trans. Autom. Control , vol.49 , Issue.9 , pp. 1465-1476
- Fax, J.¹ Murray, R.²

8
- 0030086666
- On global existence of solutions to coupled matrix riccati equations in closed loop nash games
- G. Freiling, G. Jank, H. Abou-Kandil, "On global existence of Solutions to Coupled Matrix Riccati equations in closed loop Nash Games," IEEE Transactions on Automatic Control, vol. 41, no. 2, pp. 264-269, 2002.
- (2002) IEEE Transactions on Automatic Control , vol.41 , Issue.2 , pp. 264-269
- Freiling, G.¹ Jank, G.² Abou-Kandil, H.³

9
- 33646814428
- Tracking control for multi-agent consensus with an active leader and variable topology
- Y. Hong, J. Hu, and L. Gao, "Tracking control for multi-agent consensus with an active leader and variable topology," Automatica, vol. 42, no. 7, pp. 1177-1182, 2006.
- (2006) Automatica , vol.42 , Issue.7 , pp. 1177-1182
- Hong, Y.¹ Hu, J.² Gao, L.³

10
- 79960902622
- Simulation results for two new algorithms for solving coupled algebraic riccati equations
- Sophia, Antipolis, France, June
- Z. Gajic and T-Y. Li, "Simulation results for two new algorithms for solving coupled algebraic Riccati equations," Third Int. Symp. On Differential Games, Sophia, Antipolis, France, June 1988.
- (1988) Third Int. Symp. on Differential Games
- Gajic, Z.¹ Li, T.-Y.²

11
- 0038548185
- Coordination of groups of mobile autonomous agents using nearest neighbor rules
- Jun.
- A. Jadbabaie, J. Lin, and A. Morse, "Coordination of groups of mobile autonomous agents using nearest neighbor rules," IEEE Trans. Autom. Control, vol. 48, no. 6, pp. 988-1001, Jun. 2003.
- (2003) IEEE Trans. Autom. Control , vol.48 , Issue.6 , pp. 988-1001
- Jadbabaie, A.¹ Lin, J.² Morse, A.³

12
- 79953145736
- Asymptotic stackelberg optimal control design for an uncertain euler lagrange system
- M. Johnson, T. Hiramatsu, N. Fitz-Coy, and W. E. Dixon, "Asymptotic Stackelberg Optimal Control Design for an Uncertain Euler Lagrange System," IEEE Conference on Decision and Control, pp. 6686-6691, 2010
- (2010) IEEE Conference on Decision and Control , pp. 6686-6691
- Johnson, M.¹ Hiramatsu, T.² Fitz-Coy, N.³ Dixon, W.E.⁴

13
- 0242708835
- Correlated equilibria in graphical games
- S. Kakade, M. Kearns, J. Langford, and L. Ortiz, "Correlated equilibria in graphical games," Proc. 4th ACM Conference on Electronic Commerce, pp. 42-47, 2003.
- (2003) Proc. 4th ACM Conference on Electronic Commerce , pp. 42-47
- Kakade, S.¹ Kearns, M.² Langford, J.³ Ortiz, L.⁴

14
- 0141591857
- Graphical models for game theory
- th Annual Conference on Uncertainty in Artificial Intelligence, pp. 253-260, 2001.
- (2001) th Annual Conference on Uncertainty in Artificial Intelligence , pp. 253-260
- Kearns, M.¹ Littman, M.² Singh, S.³

15
- 67349246510
- Robust finite-time consensus tracking algorithm for multirobot systems
- S. Khoo, L. Xie, and Z. Man, "Robust Finite-Time Consensus Tracking Algorithm for Multirobot Systems," IEEE Transactions on Mechatronics, vol. 14, pp. 219-228.
- IEEE Transactions on Mechatronics , vol.14 , pp. 219-228
- Khoo, S.¹ Xie, L.² Man, Z.³

16
- 0002526302
- Construction of suboptimal control sequences
- R. J. Leake, Ruey-Wen Liu, "Construction of Suboptimal Control Sequences," J. SIAM Control, vol. 5, no, 1, pp. 54-63, 1967.
- (1967) J. SIAM Control , vol.5 , Issue.1 , pp. 54-63
- Leake, R.J.¹ Liu, R.-W.²

17
- 0004163205
- John Wiley
- F. L. Lewis, V. L. Syrmos, Optimal Control, John Wiley, 1995.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

18
- 0003502434
- New Jersey: Prentice-Hall
- F. Lewis, Applied Optimal Control and Estimation: Digital Design and Implementation, New Jersey: Prentice-Hall, 1992.
- (1992) Applied Optimal Control and Estimation: Digital Design and Implementation
- Lewis, F.¹

19
- 7444271638
- Pinning a complex dynamical network to its equilibrium
- Oct.
- X. Li, X. Wang, and G. Chen, "Pinning a complex dynamical network to its equilibrium," IEEE Trans. Circuits Syst. I, Reg. Papers, vol. 51, no. 10, pp. 2074-2087, Oct. 2004.
- (2004) IEEE Trans. Circuits Syst. I, Reg. Papers , vol.51 , Issue.10 , pp. 2074-2087
- Li, X.¹ Wang, X.² Chen, G.³

20
- 0001547175
- Value-function reinforcement learning in Markov games
- M.L. Littman, "Value-function reinforcement learning in Markov games," Journal of Cognitive Systems Research 1, 2001.
- (2001) Journal of Cognitive Systems Research , vol.1
- Littman, M.L.¹

21
- 64149119332
- Consensus and cooperation in networked multi-agent systems
- Jan.
- R. Olfati-Saber, J. Fax, and R. Murray, "Consensus and cooperation in networked multi-agent systems," Proc. IEEE, vol. 95, no. 1, pp. 215-233, Jan. 2007.
- (2007) Proc. IEEE , vol.95 , Issue.1 , pp. 215-233
- Olfati-Saber, R.¹ Fax, J.² Murray, R.³

22
- 4644244041
- Consensus problems in networks of agents with switching topology and time-delays
- R. Olfati-Saber and R.M. Murray, "Consensus Problems in Networks of Agents with Switching Topology and Time-Delays," IEEE Transaction of Automatic Control, vol. 49, pp. 1520-1533, 2004.
- (2004) IEEE Transaction of Automatic Control , vol.49 , pp. 1520-1533
- Olfati-Saber, R.¹ Murray, R.M.²

23
- 84895371400
- New York: Springer-Verlag
- Z. Qu, Cooperative Control of Dynamical Systems: Applications to Autonomous Vehicles, New York: Springer-Verlag, 2009.
- (2009) Cooperative Control of Dynamical Systems: Applications to Autonomous Vehicles
- Qu, Z.¹

24
- 23944486457
- A survey of consensus problems in multi-agent coordination
- Portland, OR
- W. Ren, R. Beard, and E. Atkins, "A survey of consensus problems in multi-agent coordination," in Proc. Amer. Control Conf, Portland, OR, pp. 1859-1864, 2005.
- (2005) Proc. Amer. Control Conf , pp. 1859-1864
- Ren, W.¹ Beard, R.² Atkins, E.³

25
- 20344399896
- Consensus seeking in multiagent systems under dynamically changing interaction topologies
- May
- W. Ren and R. Beard, "Consensus seeking in multiagent systems under dynamically changing interaction topologies," IEEE Trans. Autom. Control, vol. 50, no. 5, pp. 655-661, May 2005.
- (2005) IEEE Trans. Autom. Control , vol.50 , Issue.5 , pp. 655-661
- Ren, W.¹ Beard, R.²

26
- 85030956522
- Springer, Berlin
- W. Ren and R.W. Beard, Distributed Consensus in Multi-vehicle Cooperative Control, Springer, Berlin, 2008.
- (2008) Distributed Consensus in Multi-vehicle Cooperative Control
- Ren, W.¹ Beard, R.W.²

27
- 35148881579
- High-order and model reference consensus algorithms in cooperative control of multivehicle systems
- W. Ren, K. Moore, and Y. Chen, "High-order and model reference consensus algorithms in cooperative control of multivehicle systems," J. Dynam. Syst., Meas., Control, vol. 129, no. 5, pp. 678-688, 2007.
- (2007) J. Dynam. Syst., Meas., Control , vol.129 , Issue.5 , pp. 678-688
- Ren, W.¹ Moore, K.² Chen, Y.³

28
- 84924111881
- Cambridge University Press
- Y. Shoham, K. Leyton-Brown, Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations, Cambridge University Press, 2009.
- (2009) Multiagent Systems: Algorithmic, Game-theoretic, and Logical Foundations
- Shoham, Y.¹ Leyton-Brown, K.²

29
- 0004102479
- MIT Press, Cambridge, Massachusetts
- R. S. Sutton, A. G. Barto, Reinforcement Learning - An Introduction, MIT Press, Cambridge, Massachusetts, 1998.
- (1998) Reinforcement Learning - An Introduction
- Sutton, R.S.¹ Barto, A.G.²

30
- 3142784521
- Hindustan Book Agency, India
- S. Tijs, Introduction to Game Theory, Hindustan Book Agency, India, 2003.
- (2003) Introduction to Game Theory
- Tijs, S.¹

31
- 0011264117
- Ph.D. dissertation, Dept. Elect. Eng. and Comput. Sci., MIT, Cambridge, MA
- J. Tsitsiklis, "Problems in Decentralized Decision Making and Computation," Ph.D. dissertation, Dept. Elect. Eng. and Comput. Sci., MIT, Cambridge, MA 1984.
- (1984) Problems in Decentralized Decision Making and Computation
- Tsitsiklis, J.¹

32
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- K.G Vamvoudakis, and F. L. Lewis, "Online Actor-Critic Algorithm to Solve the Continuous-Time Infinite Horizon Optimal Control Problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

33
- 79960897012
- Multi-player non-zero sum games: Online adaptive learning solution of coupled hamilton- Jacobi equations
- K.G. Vamvoudakis, and F. L. Lewis, "Multi-Player Non-Zero Sum Games: Online Adaptive Learning Solution of Coupled Hamilton- Jacobi Equations," Automatica, vol. 47, no. 8, pp. 1556-1569, 2011.
- (2011) Automatica , vol.47 , Issue.8 , pp. 1556-1569
- Vamvoudakis, K.G.¹ Lewis, F.L.²

34
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, F. L. Lewis, & M. Abu-Khalaf, "Adaptive Optimal Control for continuous-time linear systems based on policy iteration," Automatica, 45(2), 477-484, 2009
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Lewis, F.L.³ Abu-Khalaf, M.⁴

35
- 49049094010
- Decentralized learning in Markov games
- August
- P. Vrancx, K. Verbeeck, and A. Nowe, "Decentralized learning in markov games," IEEE Transactions on Systems, Man and Cybernetics, vol. 38, no. 4, pp. 976-981, August 2008.
- (2008) IEEE Transactions on Systems, Man and Cybernetics , vol.38 , Issue.4 , pp. 976-981
- Vrancx, P.¹ Verbeeck, K.² Nowe, A.³

36
- 0037100101
- Pinning control of scale-free dynamical networks
- X. Wang and G. Chen, "Pinning control of scale-free dynamical networks," Physica A, vol. 310, no. 3-4, pp. 521-531, 2002.
- (2002) Physica A , vol.310 , Issue.3-4 , pp. 521-531
- Wang, X.¹ Chen, G.²

37
- 0003529238
- Ph.D. Thesis
- P.J. Werbos, Beyond Regression: New Tools for Prediction and Analysis in the Behavior Sciences, Ph.D. Thesis, 1974.
- (1974) Beyond Regression: New Tools for Prediction and Analysis in the Behavior Sciences
- Werbos, P.J.¹

38
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- ed. D.A White and D.A. Sofge, New York: Van Nostrand Reinhold
- P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," Handbook of Intelligent Control, ed. D.A White and D.A. Sofge, New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.