SCOPUS 정보 검색 플랫폼

IEEE Transactions on Automatic Control

Volumn 57, Issue 9, 2012, Pages 2266-2280

Mean field for Markov decision processes: From discrete to continuous optimization

(3) Gast, Nicolas a Gaujal, Bruno b Le Boudec, Jean Yves a

a EPFL (Switzerland)

b INRIA RHÔNE ALPES (France)

Author keywords

Epidemic model; Hamilton Jacobi Bellman (HJB); Markov decision processes; mean field; optimal control

Indexed keywords

EPIDEMIC MODELS; HAMILTON-JACOBI-BELLMAN; MARKOV DECISION PROCESSES; MEAN FIELD; OPTIMAL CONTROLS;

DYNAMIC PROGRAMMING; MARKOV PROCESSES; OPTIMIZATION; ORDINARY DIFFERENTIAL EQUATIONS;

LEARNING ALGORITHMS;

EID: 84865675087 PISSN: 00189286 EISSN: None Source Type: Journal
DOI: 10.1109/TAC.2012.2186176 Document Type: Article

Times cited : (63)

References (25)

1
- 0001793657
- Dynamics of stochastic approximation algorithms
- Lecture Notes in Math
- M. Benaïm,"Dynamics of stochastic approximation algorithms," Séminaire de Probabilités XXXIII. Lecture Notes in Math, vol. 1709, pp. 1-68, 1999.
- (1999) Séminaire de Probabilités XXXIII , vol.1709 , pp. 1-68
- Benaïm, M.¹

2
- 53649095038
- A class of mean field interaction models for computer and communication systems
- M. Benaim and J.-Y. L. Boudec,"A class of mean field interaction models for computer and communication systems," Perform. Eval., vol. 65, no. 11-12, pp. 823-838, 2008.
- (2008) Perform. Eval. , vol.65 , Issue.11-12 , pp. 823-838
- Benaim, M.¹ L. Boudec, J.-Y.²

3
- 84865699023
- Deterministic approximation of stochastic evolution in games: A generalization
- M. Benaim and J. Weibull,"Deterministic Approximation of Stochastic Evolution in Games: A Generalization," Tech. Rep., mimeo, 2003.
- (2003) Tech. Rep., mimeo
- Benaim, M.¹ Weibull, J.²

4
- 0003778897
- New York: Springer-Verlag
- A. Benveniste, P. Priouret, and M. Métivier, Adaptive Algorithms and Stochastic Approximations. New York: Springer-Verlag, 1990.
- (1990) Adaptive Algorithms Stochastic Approximations
- Benveniste, A.¹ Priouret, P.² Métivier, M.³

5
- 37149025078
- Grid brokering for batch allocation using indexes
- of LNCS. New York: Springer
- V. G. BertenB,"Grid brokering for batch allocation using indexes," in Network Control and Optimization, volume 4465 of LNCS. New York: Springer, 2007.
- (2007) Network Control and Optimization , vol.4465
- Bertenb, V.G.¹

6
- 77950814003
- Approximate dynamic programming using fluid and diffusion approximations with applications to power management
- W. Chen, D. Huang, A. A. Kulkarni, J. Unnikrishnan, Q. Zhu, P. Mehta, S. Meyn, and A. Wierman,"Approximate dynamic programming using fluid and diffusion approximations with applications to power management," in Proc. 48th IEEE Conf. Decision Control. CDC/CCC, 2009, pp. 3575-3580.
- (2009) Proc. 48th IEEE Conf. Decision Control. CDC/CCC , pp. 3575-3580
- Chen, W.¹ Huang, D.² Kulkarni, A.A.³ Unnikrishnan, J.⁴ Zhu, Q.⁵ Mehta, P.⁶ Meyn, S.⁷ Wierman, A.⁸

7
- 34547189876
- Initial studies on worm propagation in manets for future army combat systems
- Pentagon Reports
- R. Cole,"Initial studies on worm propagation in manets for future army combat systems," Tech. Rep., Pentagon Reports, 2004.
- (2004) Tech. Rep.
- Cole, R.¹

8
- 0348090400
- The linear programming approach to approximate dynamic programming
- D. P. De Farias and B. Van Roy,"The linear programming approach to approximate dynamic programming," Operat. Res., vol. 51, no. 6, pp. 850-865, 2003.
- (2003) Operat. Res. , vol.51 , Issue.6 , pp. 850-865
- De Farias, D.P.¹ Van Roy, B.²

9
- 84861082568
- Mean field limit of non-smooth systems and differential inclusions
- N. Gast and B. Gaujal,"Mean field limit of non-smooth systems and differential inclusions," ACM SIGMETRICS Perform. Eval. Rev., vol. 38, no. 2, pp. 30-32, 2010.
- (2010) ACM SIGMETRICS Perform. Eval. Rev. , vol.38 , Issue.2 , pp. 30-32
- Gast, N.¹ Gaujal, B.²

10
- 79951558493
- A mean field approach for optimization in discrete time
- N. Gast and B. Gaujal,"A mean field approach for optimization in discrete time," Discrete Event Dynam. Syst., vol. 21, pp. 63-101, 2011.
- (2011) Discrete Event Dynam. Syst. , vol.21 , pp. 63-101
- Gast, N.¹ Gaujal, B.²

11
- 64749113126
- Deterministic approximation of best-response dynamics for the matching pennies game
- Z. Gorodeisky,"Deterministic approximation of best-response dynamics for the matching pennies game," Games Econ. Behav., vol. 66, no. 1, pp. 191-201, 2009.
- (2009) Games Econ. Behav. , vol.66 , Issue.1 , pp. 191-201
- Gorodeisky, Z.¹

12
- 39549087376
- Large population stochastic dynamic games: Closed-loop mckean-vlasov systems and the nash certainty equivalence principle
- M. Huang, P. E. Caines, and R. P. Malhame,"Large population stochastic dynamic games: Closed-loop Mckean-Vlasov systems and the Nash certainty equivalence principle," Com. Inf. Syst., vol. 6, pp. 221-252, 2006.
- (2006) Com. Inf. Syst. , vol.6 , pp. 221-252
- Huang, M.¹ Caines, P.E.² Malhame, R.P.³

13
- 34249105008
- Nash certainty equivalence in large population stochastic dynamic games: Connections with the physics of interacting particle systems
- 4177530, Proceedings of the 45th IEEE Conference on Decision and Control 2006, CDC
- M. Huang, P. E. Caines, and R. P. Malhame,"Nash certainty equivalence in large population stochastic dynamic games: Connections with the physics of interacting particle systems," in Proc. 45th IEEE Conf. Decision Control, San Diego, 2006, pp. 4921-4926. (Pubitemid 351283806)
- (2006) Proceedings of the IEEE Conference on Decision and Control , pp. 4921-4926
- Huang, M.¹ Malhame, R.P.² Caines, P.E.³

14
- 34648831837
- Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ε-nash equilibria
- DOI 10.1109/TAC.2007.904450
- M. Huang, P. E. Caines, and R. P. Malhame,"Large-population costcoupled lqg problems with nonuniform agents: individual-mass behavior and decentralized -Nash equilibria," IEEE Trans. Autom. Control, vol. 52, no. 9, pp. 1560-1571, Sep. 2007. (Pubitemid 47456068)
- (2007) IEEE Transactions on Automatic Control , vol.52 , Issue.9 , pp. 1560-1571
- Huang, M.¹ Caines, P.E.² Malhame, R.P.³

15
- 77953320145
- Maximum damage malware attack in mobile wireless networks
- San Diego, CA
- M. H. R. Khouzani, S. Sarkar, and E. Altman,"Maximum damage malware attack in mobile wireless networks," in Proc. IEEE Infocom, San Diego, CA, 2010, pp. 1-9.
- (2010) Proc. IEEE Infocom , pp. 1-9
- Khouzani, M.H.R.¹ Sarkar, S.² Altman, E.³

16
- 84976646341
- Costbenefit analysis of cloud computing versus desktop grids
- Rome, Italy
- D. Kondo, B. Javadi, P. Malecot, F. Cappello, and D. Erson, "Costbenefit analysis of cloud computing versus desktop grids," in Proc. 18th Int. Heterogeneity in Comput. Workshop, Rome, Italy, 2009.
- (2009) Proc. 18th Int. Heterogeneity in Comput. Workshop
- Kondo, D.¹ Javadi, B.² Malecot, P.³ Cappello, F.⁴ Erson, D.⁵

17
- 0002232633
- Solutions of ordinary differential equations as limits of pure jump markov processes
- T. Kurtz,"Solutions of ordinary differential equations as limits of pure jump Markov processes," J. Appl. Probab., vol. 7, pp. 49-58, 1970.
- (1970) J. Appl. Probab. , vol.7 , pp. 49-58
- Kurtz, T.¹

18
- 34047127341
- Mean field games
- J.-M. Lasry and P.-L. Lions,"Mean field games," Jpn. J. Math., 2007.
- (2007) Jpn. J. Math.
- Lasry, J.-M.¹ Lions, P.-L.²

19
- 47949103963
- A generic mean field convergence result for systems of interacting objects
- J. Y. Le Boudec, D. McDonald, and J. Mundinger,"A generic mean field convergence result for systems of interacting objects," Proc. QEST '07, pp. 3-18, 2007.
- (2007) Proc. QEST '07 , pp. 3-18
- Le Boudec, J.Y.¹ McDonald, D.² Mundinger, J.³

20
- 0032628612
- The complexity of optimal queuing network control
- C. H. Papadimitriou and J. N. Tsitsiklis,"The complexity of optimal queuing network control," Math. Oper. Res., vol. 24, no. 2, pp. 292-305, 1999.
- (1999) Math. Oper. Res. , vol.24 , Issue.2 , pp. 292-305
- Papadimitriou, C.H.¹ Tsitsiklis, J.N.²

21
- 0003998452
- New York: Wiley
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. New York: Wiley, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming.
- Puterman, M.L.¹

22
- 80053647353
- Vaccine:war of the worms in wired and wireless networks
- S. Tanachaiwiwat and A. Helmy,"Vaccine:War of the worms in wired and wireless networks," in Proc. IEEE INFOCOM, 2006.
- (2006) Proc. IEEE INFOCOM
- Tanachaiwiwat, S.¹ Helmy, A.²

23
- 70349977430
- Mean field asymptotic of markov decision evolutionary games and teams
- H. Tembine, J.-Y. Le Boudec, R. El-Azouzi, and E. Altman,"Mean field asymptotic of Markov decision evolutionary games and teams," Gamenets, 2009.
- (2009) Gamenets
- Tembine, H.¹ Le Boudec, J.-Y.² El-Azouzi, R.³ Altman, E.⁴

24
- 84865677406
- Tech. Rep. 342-P-11867-11883, Supelec
- H. Tembine, P. Vilanova, and M. Debbah,"Noisy mean field stochastic games with network applications," 2010, Tech. Rep. 342-P-11867-83, Supelec.
- (2010) Noisy Mean Field Stochastic Games with Network Applications
- Tembine, H.¹ Vilanova, P.² Debbah, M.³

25
- 0031143730
- An analysis of temporal-difference learning with function approximation
- PII S0018928697034375
- J. N. Tsitsiklis and B. V. Roy,"An analysis of temporal-difference learning with function approximation," IEEE Trans. Autom. Control, vol. 42, no. 5, pp. 674-690, May 1997. (Pubitemid 127760263)
- (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.5 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.