메뉴 건너뛰기




Volumn 59, Issue 3, 2014, Pages 629-644

Learning in mean-field games

Author keywords

Mean field game; nonlinear systems; phase transition; stochastic learning; synchronization

Indexed keywords

APPROXIMATION ALGORITHMS; CONTROL; CONTROL THEORY; GALERKIN METHODS; NONLINEAR SYSTEMS; PHASE TRANSITIONS; SYNCHRONIZATION;

EID: 84897679944     PISSN: 00189286     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAC.2013.2287733     Document Type: Article
Times cited : (32)

References (42)
  • 2
    • 84859709769 scopus 로고    scopus 로고
    • Synchronization of coupled oscillators is a game
    • Apr.
    • H. Yin, P. G. Mehta, S. P. Meyn, and U. V. Shanbhag, " Synchronization of coupled oscillators is a game," IEEE Trans. Autom. Control, vol. 57, no. 4, pp. 920-935, Apr. 2012.
    • (2012) IEEE Trans. Autom. Control , vol.57 , Issue.4 , pp. 920-935
    • Yin, H.1    Mehta, P.G.2    Meyn, S.P.3    Shanbhag, U.V.4
  • 3
    • 1542378994 scopus 로고    scopus 로고
    • Stochastic power control in wireless communication systems: Analysis, approximate control algorithms and state aggregation
    • M. Huang, R. P. Malhamé, and P. E. Caines, "Stochastic power control in wireless communication systems: Analysis, approximate control algorithms and state aggregation," in Proc. IEEE Conf. Decision and Control, 2003, pp. 4231-4236.
    • (2003) Proc. Conf. Decision and Control , pp. 4231-4236
    • Huang, M.1    Malhamé, R.P.2    Caines, P.E.3
  • 5
    • 39549087376 scopus 로고    scopus 로고
    • Large population stochastic dynamic games: Closed-loop Mckean-Vlasov systems and the Nash certainty equivalence principle
    • M. Huang, R. P. Malhamé, and P. E. Caines, "Large population stochastic dynamic games: Closed-loop Mckean-Vlasov systems and the Nash certainty equivalence principle," Commun. Inf. Syst., vol. 6, no. 3, pp. 221-251, 2006.
    • (2006) Commun. Inf. Syst. , vol.6 , Issue.3 , pp. 221-251
    • Huang, M.1    Malhamé, R.P.2    Caines, P.E.3
  • 6
    • 34648831837 scopus 로고    scopus 로고
    • Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ε-nash equilibria
    • DOI 10.1109/TAC.2007.904450
    • M. Huang, P. E. Caines, and R. P. Malhamé, "Large-population costcoupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized-Nash equilibria," IEEE Trans. Autom. Control, vol. 52, no. 9, pp. 1560-1571, Sep. 2007. (Pubitemid 47456068)
    • (2007) IEEE Transactions on Automatic Control , vol.52 , Issue.9 , pp. 1560-1571
    • Huang, M.1    Caines, P.E.2    Malhame, R.P.3
  • 7
    • 52249090671 scopus 로고    scopus 로고
    • Asymptotically optimal decentralized control for large population stochastic multiagent systems
    • Jul.
    • T. Li and J.-F. Zhang, "Asymptotically optimal decentralized control for large population stochastic multiagent systems," IEEE Trans. Autom. Control, vol. 53, no. 7, pp. 1643-1660, Jul. 2008.
    • (2008) IEEE Trans. Autom. Control , vol.53 , Issue.7 , pp. 1643-1660
    • Li, T.1    Zhang, J.-F.2
  • 10
    • 34047127341 scopus 로고    scopus 로고
    • Mean field games
    • J. M. Lasry and P. L. Lions, "Mean field games," Jpn. J. Math., vol. 2, pp. 229-260, 2007.
    • (2007) Jpn. J. Math. , vol.2 , pp. 229-260
    • Lasry, J.M.1    Lions, P.L.2
  • 11
    • 70349985676 scopus 로고    scopus 로고
    • Oblivious equilibrium:A mean field approximation for large-scale dynamic games
    • Cambridge, MA, USA: MIT Press
    • G.Y.Weintraub, L. Benkard, and B.V. Roy, "Oblivious equilibrium:A mean field approximation for large-scale dynamic games," in Advances in Neural Information Processing Systems. Cambridge, MA, USA: MIT Press, 2006, vol. 18.
    • (2006) Advances in Neural Information Processing Systems , vol.18
    • Weintraub, G.Y.1    Benkard, L.2    Roy, B.V.3
  • 12
    • 39449116294 scopus 로고    scopus 로고
    • Constrained cost-coupled stochastic games with independent state processes
    • DOI 10.1016/j.orl.2007.05.010, PII S0167637707000867
    • E. Altman, K. Avrachenkov, N. Bonneau, M. Debbah, R. El-Azouzi, and D. S. Menasche, "Constrained cost-coupled stochastic games with independent state processes," Oper. Res. Lett., vol. 36, no. 2, pp. 160-164, 2008. (Pubitemid 351265375)
    • (2008) Operations Research Letters , vol.36 , Issue.2 , pp. 160-164
    • Altman, E.1    Avrachenkov, K.2    Bonneau, N.3    Debbah, M.4    El-Azouzi, R.5    Sadoc Menasche, D.6
  • 14
    • 77958476920 scopus 로고    scopus 로고
    • Paris-Princeton Lectures on Mathematical Finance, R. Carmona, Ed. New York, NY, USA: Springer
    • O. Gueant, J. M. Lasry, and P. L. Lions, "Mean field games and applications," in Paris-Princeton Lectures on Mathematical Finance, R. Carmona, Ed. New York, NY, USA: Springer, 2010, pp. 205-266.
    • (2010) Mean Field Games and Applications , pp. 205-266
    • Gueant, O.1    Lasry, J.M.2    Lions, P.L.3
  • 15
    • 53649095038 scopus 로고    scopus 로고
    • A class of mean field interaction models for computer and communication systems
    • M. Benaim and J.-Y. Le Boudec, "A class of mean field interaction models for computer and communication systems," Perf. Eval., vol. 65, no. 11-12, pp. 823-838, 2008.
    • (2008) Perf. Eval. , vol.65 , Issue.11-12 , pp. 823-838
    • Benaim, M.1    Le Boudec, J.-Y.2
  • 17
    • 77956010519 scopus 로고    scopus 로고
    • Modeling crowd dynamics by the mean-field limit approach
    • Nov.
    • C. Dogbé, "Modeling crowd dynamics by the mean-field limit approach," Math. Comput. Model., vol. 52, no. 9-10, pp. 1506-1520, Nov. 2010.
    • (2010) Math. Comput. Model. , vol.52 , Issue.9-10 , pp. 1506-1520
    • Dogbé, C.1
  • 18
    • 80455173864 scopus 로고    scopus 로고
    • On a mean field game approach modeling congestion and aversion in pedestrian crowds
    • A. Lachapelle and M.-T. Wolfram, "On a mean field game approach modeling congestion and aversion in pedestrian crowds," Transport. Res. Part B: Methodological, vol. 45, no. 10, pp. 1572-1589, 2011.
    • (2011) IEEE Transport. Res. Part B: Methodological , vol.45 , Issue.10 , pp. 1572-1589
    • Lachapelle, A.1    Wolfram, M.-T.2
  • 19
    • 79953141436 scopus 로고    scopus 로고
    • Decentralized charging control for large populations of plug-in electric vehicles
    • Z.Ma, D. Callaway, and I. A. Hiskens, "Decentralized charging control for large populations of plug-in electric vehicles," in Proc. 49th IEEE Conf. Decision Control, 2010, pp. 206-212.
    • (2010) Proc. 49th Conf. Decision Control , pp. 206-212
    • Ma, Z.1    Callaway, D.2    Hiskens, I.A.3
  • 20
    • 84863510717 scopus 로고    scopus 로고
    • Electrical vehicles in the smart grid: A mean field game analysis
    • R. Couillet, S. M. Perlaza, H. Tembine, and M. Debbah, "Electrical vehicles in the smart grid: A mean field game analysis," IEEE J. Sel. Areas Commun., vol. 30, no. 6, pp. 1086-1096, 2012.
    • (2012) J. Sel. Areas Commun. , vol.30 , Issue.6 , pp. 1086-1096
    • Couillet, R.1    Perlaza, S.M.2    Tembine, H.3    Debbah, M.4
  • 22
    • 84869427574 scopus 로고    scopus 로고
    • Mean-field control for energy efficient buildings
    • Montreal, QC, Canada, Jun.
    • K. Deng, P. Barooah, and P. G. Mehta, "Mean-field control for energy efficient buildings," in Procs. Amer. Control Conf., Montreal, QC, Canada, Jun. 2012, pp. 3044-3049.
    • (2012) Procs. Amer. Control Conf. , pp. 3044-3049
    • Deng, K.1    Barooah, P.2    Mehta, P.G.3
  • 24
    • 62949192884 scopus 로고    scopus 로고
    • Distributed control for radial loss network systems via the Nash certainty equivalence (mean field) principle
    • Z. Ma, R. P. Malhamé, and P. E. Caines, "Distributed control for radial loss network systems via the Nash certainty equivalence (mean field) principle," in Procs. IEEE Conf. Decision and Control, 2008, pp. 3829-3834.
    • (2008) Procs. Conf. Decision and Control , pp. 3829-3834
    • Ma, Z.1    Malhamé, R.P.2    Caines, P.E.3
  • 26
    • 80052932467 scopus 로고    scopus 로고
    • Leaderfollower Cucker-Smale type flocking synthesized via mean field stochastic control theory
    • J. Angeles, B. Boulet, J. Clark, J. Kvecses, and K. Siddiqi, Eds. Heidelberg, Germany: Springer
    • M. Nourian, P. E. Caines, R. P. Malhamé, and M. Huang, "Leaderfollower Cucker-Smale type flocking synthesized via mean field stochastic control theory," in Brain, Body and Machine, ser. Advances in Soft Computing, J. Angeles, B. Boulet, J. Clark, J. Kvecses, and K. Siddiqi, Eds. Heidelberg, Germany: Springer, 2010, vol. 83, pp. 283-298.
    • (2010) Brain, Body and Machine, Ser. Advances in Soft Computing , vol.83 , pp. 283-298
    • Nourian, M.1    Caines, P.E.2    Malhamé, R.P.3    Huang, M.4
  • 27
    • 84874613599 scopus 로고    scopus 로고
    • Nash, social and centralized solutions to consensus problems via mean field control theory
    • Mar.
    • M. Nourian, P. E. Caines, R. P.Malhamé, and M. Huang, "Nash, social and centralized solutions to consensus problems via mean field control theory," IEEE Trans. Autom. Control, vol. 58, no. 3, pp. 639-653,Mar. 2013.
    • (2013) IEEE Trans. Autom. Control , vol.58 , Issue.3 , pp. 639-653
    • Nourian, M.1    Caines, P.E.2    Malhamé, R.P.3    Huang, M.4
  • 30
    • 58349110975 scopus 로고    scopus 로고
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.4
  • 31
    • 0000856704 scopus 로고
    • Self-entrainment of a population of coupled non-linear oscillators
    • H. Araki, Ed. Berlin, Germany: Springer Lecture Notes in Physics
    • Y. Kuramoto, "Self-entrainment of a population of coupled non-linear oscillators," in International Symposium on . Mathematical Problems in Theoretical Physics, H. Araki, Ed. Berlin, Germany: Springer, 1975, vol. 39, Lecture Notes in Physics.
    • (1975) International Symposium on . Mathematical Problems in Theoretical Physics , vol.39
    • Kuramoto, Y.1
  • 32
    • 1542351229 scopus 로고    scopus 로고
    • On the Phase Reduction and Response Dynamics of Neural Oscillator Populations
    • DOI 10.1162/089976604322860668
    • E. Brown, J. Moehlis, and P. Holmes, "On the phase reduction and response dynamics of neural oscillator populations," Neural Comput., vol. 16, no. 4, pp. 673-715, 2004. (Pubitemid 38318129)
    • (2004) Neural Computation , vol.16 , Issue.4 , pp. 673-715
    • Brown, E.1    Moehlis, J.2    Holmes, P.3
  • 33
    • 1542359162 scopus 로고    scopus 로고
    • Individual and mass behaviour in large population stochasticwireless power control problems: Centralized and nash equilibrium solutions
    • M. Huang, P. E. Caines, and R. P. Malhamé, "Individual and mass behaviour in large population stochasticwireless power control problems: Centralized and nash equilibrium solutions," in Proc. 42nd IEEE Conf. Decision and Control, 2003, vol. 1, pp. 98-103.
    • (2003) Proc. 42nd Conf. Decision and Control , vol.1 , pp. 98-103
    • Huang, M.1    Caines, P.E.2    Malhamé, R.P.3
  • 34
    • 56349169498 scopus 로고    scopus 로고
    • Markov perfect industry dynamics with many firms
    • G. Y. Weintraub, C. L. Benkard, and B. Van Roy, "Markov perfect industry dynamics with many firms," Econometrica, vol. 76, no. 6, pp. 1375-1411, 2008.
    • (2008) Econometrica , vol.76 , Issue.6 , pp. 1375-1411
    • Weintraub, G.Y.1    Benkard, C.L.2    Van Roy, B.3
  • 35
    • 7244251462 scopus 로고    scopus 로고
    • Plasticity in single neuron and circuit computations
    • DOI 10.1038/nature03011
    • A. Destexhe and E. Marder, "Plasticity in single neuron and circuit computations," Nature, vol. 431, no. 7010, pp. 789-795, Oct. 2004. (Pubitemid 39434068)
    • (2004) Nature , vol.431 , Issue.7010 , pp. 789-795
    • Destexhe, A.1    Marder, E.2
  • 37
    • 77950806766 scopus 로고    scopus 로고
    • Q-learning and Pontryagin's minimum principle
    • Dec.
    • P. G. Mehta and S. P. Meyn, "Q-learning and Pontryagin's minimum principle," in Proc. 48th IEEE Conf. Decision and Control, Dec. 2009, pp. 3598-3605.
    • (2009) Proc. 48th Conf. Decision and Control , pp. 3598-3605
    • Mehta, P.G.1    Meyn, S.P.2
  • 38
    • 80053146604 scopus 로고    scopus 로고
    • Mean field stochastic games: Convergence, Q/H-learning and optimality
    • H. Tembine,"Mean field stochastic games: Convergence, Q/H-learning and optimality," in Proc. 2011 Amer. Control Conf., 2011, pp. 2423-2428.
    • (2011) Proc. 2011 Amer. Control Conf. , pp. 2423-2428
    • Tembine, H.1
  • 39
    • 34249833101 scopus 로고
    • Learning
    • C. J. C. H.Watkins and P. Dayan, "Learning," Mach. Learn., vol. 8, no. 3-4, pp. 279-292, 1992.
    • (1992) Mach. Learn. , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, C.J.C.1    Dayan, P.2
  • 41
    • 0035041392 scopus 로고    scopus 로고
    • Consciousness and the brain: The thalamocortical dialogue in health and disease
    • R. Llinas and U. Ribary, "Consciousness and the brain. The thalamocortical dialogue in health and disease," Ann. New York Acad. Sci., vol. 929, pp. 166-175, 2001. (Pubitemid 32386058)
    • (2001) Annals of the New York Academy of Sciences , vol.929 , pp. 166-175
    • Llinas, R.1    Ribary, U.2
  • 42
    • 1542499302 scopus 로고
    • Stability of incoherence in a population of coupled oscillators
    • May
    • S. H. Strogatz and R. E.Mirollo, "Stability of incoherence in a population of coupled oscillators," J. Stat. Phys., vol. 63, pp. 613-635, May 1991.
    • (1991) J. Stat. Phys. , vol.63 , pp. 613-635
    • Strogatz, S.H.1    Mirollo, R.E.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.