메뉴 건너뛰기




Volumn 56, Issue 2, 2006, Pages 285-298

Generalised weakened fictitious play

Author keywords

Actor critic process; Best response differential inclusion; Fictitious play; Stochastic approximation

Indexed keywords


EID: 33744514808     PISSN: 08998256     EISSN: 10902473     Source Type: Journal    
DOI: 10.1016/j.geb.2005.08.005     Document Type: Article
Times cited : (159)

References (33)
  • 1
    • 16244410118 scopus 로고    scopus 로고
    • On the convergence of reinforcement learning
    • Beggs A.W. On the convergence of reinforcement learning. J. Econ. Theory 122 (2005) 1-36
    • (2005) J. Econ. Theory , vol.122 , pp. 1-36
    • Beggs, A.W.1
  • 2
    • 0002277539 scopus 로고    scopus 로고
    • Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
    • Benaïm M., and Hirsch M.W. Mixed equilibria and dynamical systems arising from fictitious play in perturbed games. Games Econ. Behav. 29 (1999) 36-72
    • (1999) Games Econ. Behav. , vol.29 , pp. 36-72
    • Benaïm, M.1    Hirsch, M.W.2
  • 3
    • 33244474665 scopus 로고    scopus 로고
    • Stochastic approximation and differential inclusions
    • Benaïm M., Hofbauer J., and Sorin S. Stochastic approximation and differential inclusions. SIAM J. Control Optim. 44 (2005) 328-348
    • (2005) SIAM J. Control Optim. , vol.44 , pp. 328-348
    • Benaïm, M.1    Hofbauer, J.2    Sorin, S.3
  • 4
    • 33744508525 scopus 로고    scopus 로고
    • Berger, U., 2004. Two more classes of games with the fictitious play property. Mimeo. Vienna University of Economics
  • 5
    • 12444269117 scopus 로고    scopus 로고
    • Fictitious play in 2 × n games
    • Berger U. Fictitious play in 2 × n games. J. Econ. Theory 120 (2005) 139-154
    • (2005) J. Econ. Theory , vol.120 , pp. 139-154
    • Berger, U.1
  • 6
    • 0031281590 scopus 로고    scopus 로고
    • Learning through reinforcement and replicator dynamics
    • Börgers T., and Sarin R. Learning through reinforcement and replicator dynamics. J. Econ. Theory 77 (1997) 1-14
    • (1997) J. Econ. Theory , vol.77 , pp. 1-14
    • Börgers, T.1    Sarin, R.2
  • 7
    • 0002672918 scopus 로고
    • Iterative solution of games by fictitious play
    • Koopmans T.C. (Ed), Wiley, New York
    • Brown G.W. Iterative solution of games by fictitious play. In: Koopmans T.C. (Ed). Activity Analysis of Production and Allocation (1951), Wiley, New York 374-376
    • (1951) Activity Analysis of Production and Allocation , pp. 374-376
    • Brown, G.W.1
  • 8
    • 0003262604 scopus 로고
    • Isolated invariant sets and the Morse index
    • American Mathematical Society, Providence
    • Conley C.C. Isolated invariant sets and the Morse index. CBMS Regional Conference Series in Mathematics (1978), American Mathematical Society, Providence
    • (1978) CBMS Regional Conference Series in Mathematics
    • Conley, C.C.1
  • 9
    • 33744514488 scopus 로고    scopus 로고
    • Cowan, S., 1992. Dynamical systems arising from game theory. Ph.D. thesis. University of California, Berkeley
  • 10
    • 0141838158 scopus 로고    scopus 로고
    • Learning, hypothesis testing, and Nash equilibrium
    • Foster D.P., and Young H.P. Learning, hypothesis testing, and Nash equilibrium. Games Econ. Behav. 45 (2003) 73-96
    • (2003) Games Econ. Behav. , vol.45 , pp. 73-96
    • Foster, D.P.1    Young, H.P.2
  • 14
    • 0000730470 scopus 로고
    • Social stability and equilibrium
    • Gilboa I., and Matsui A. Social stability and equilibrium. Econometrica 59 (1991) 859-867
    • (1991) Econometrica , vol.59 , pp. 859-867
    • Gilboa, I.1    Matsui, A.2
  • 15
    • 0242684983 scopus 로고    scopus 로고
    • A reinforcement procedure leading to correlated equilibrium
    • Debreu G., Neuefeind W., and Trockel W. (Eds), Springer, New York
    • Hart S., and Mas-Colell A. A reinforcement procedure leading to correlated equilibrium. In: Debreu G., Neuefeind W., and Trockel W. (Eds). Economic Essays: A Festschrift for Werner Hildenbrand (2001), Springer, New York 181-200
    • (2001) Economic Essays: A Festschrift for Werner Hildenbrand , pp. 181-200
    • Hart, S.1    Mas-Colell, A.2
  • 16
    • 2942744741 scopus 로고    scopus 로고
    • Uncoupled dynamics cannot lead to Nash equilibrium
    • Hart S., and Mas-Colell A. Uncoupled dynamics cannot lead to Nash equilibrium. Amer. Econ. Rev. 93 (2003) 1830-1836
    • (2003) Amer. Econ. Rev. , vol.93 , pp. 1830-1836
    • Hart, S.1    Mas-Colell, A.2
  • 17
    • 33744508013 scopus 로고    scopus 로고
    • Hofbauer, J., 1995. Stability for the best response dynamics. Techical report. Institut für Mathematik, Universität Wien, Strudlhofgasse 4, A-1090 Vienna, Austria
  • 18
    • 20344390000 scopus 로고    scopus 로고
    • Learning in perturbed asymmetric games
    • Hofbauer J., and Hopkins E. Learning in perturbed asymmetric games. Games Econ. Behav. 52 (2005) 133-152
    • (2005) Games Econ. Behav. , vol.52 , pp. 133-152
    • Hofbauer, J.1    Hopkins, E.2
  • 19
    • 0036436650 scopus 로고    scopus 로고
    • On the global convergence of stochastic fictitious play
    • Hofbauer J., and Sandholm W.H. On the global convergence of stochastic fictitious play. Econometrica 70 (2002) 2265-2294
    • (2002) Econometrica , vol.70 , pp. 2265-2294
    • Hofbauer, J.1    Sandholm, W.H.2
  • 21
    • 33744549129 scopus 로고    scopus 로고
    • Attainability of boundary points under reinforcement learning
    • Hopkins E., and Posch M. Attainability of boundary points under reinforcement learning. Games Econ. Behav. 44 (2005) 459-514
    • (2005) Games Econ. Behav. , vol.44 , pp. 459-514
    • Hopkins, E.1    Posch, M.2
  • 22
    • 33744549925 scopus 로고    scopus 로고
    • Leslie, D.S., 2003. Reinforcement learning in games. Ph.D. thesis. University of Bristol
  • 23
    • 0346913265 scopus 로고    scopus 로고
    • Convergent multiple-timescales reinforcement learning algorithms in normal form games
    • Leslie D.S., and Collins E.J. Convergent multiple-timescales reinforcement learning algorithms in normal form games. Ann. Appl. Probability 13 (2003) 1231-1251
    • (2003) Ann. Appl. Probability , vol.13 , pp. 1231-1251
    • Leslie, D.S.1    Collins, E.J.2
  • 24
    • 33645029191 scopus 로고    scopus 로고
    • Individual Q-learning in normal form games
    • Leslie D.S., and Collins E.J. Individual Q-learning in normal form games. SIAM J. Control Optim 44 (2005) 459-514
    • (2005) SIAM J. Control Optim , vol.44 , pp. 459-514
    • Leslie, D.S.1    Collins, E.J.2
  • 25
    • 33744548852 scopus 로고    scopus 로고
    • Miyasawa, K., 1961. On the convergence of the learning process in a 2 × 2 non-zero-sum two-person game. Research Memorandum 33. Econometric Research Program, Princeton University, Princeton
  • 26
    • 0029690246 scopus 로고    scopus 로고
    • Fictitious play property for games with identical interests
    • Monderer D., and Shapley L.S. Fictitious play property for games with identical interests. J. Econ. Theory 68 (1996) 258-265
    • (1996) J. Econ. Theory , vol.68 , pp. 258-265
    • Monderer, D.1    Shapley, L.S.2
  • 27
    • 0001402950 scopus 로고
    • An iterative method of solving a game
    • Robinson J. An iterative method of solving a game. Ann. Math. 54 (1951) 296-301
    • (1951) Ann. Math. , vol.54 , pp. 296-301
    • Robinson, J.1
  • 28
    • 58149324992 scopus 로고
    • Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
    • Roth A.E., and Erev I. Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term. Games Econ. Behav. 8 (1995) 164-212
    • (1995) Games Econ. Behav. , vol.8 , pp. 164-212
    • Roth, A.E.1    Erev, I.2
  • 29
    • 0033901602 scopus 로고    scopus 로고
    • Convergence results for single-step on-policy reinforcement-learning algorithms
    • Singh S., Jaakkola T., Littman M.L., and Szepesvari C. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning 38 (2000) 287-308
    • (2000) Machine Learning , vol.38 , pp. 287-308
    • Singh, S.1    Jaakkola, T.2    Littman, M.L.3    Szepesvari, C.4
  • 31
    • 0017819644 scopus 로고
    • Evolutionarily stable strategies and game dynamics
    • Taylor P.D., and Jonker L.D. Evolutionarily stable strategies and game dynamics. Math. Biosc. 40 (1978) 145-146
    • (1978) Math. Biosc. , vol.40 , pp. 145-146
    • Taylor, P.D.1    Jonker, L.D.2
  • 32
    • 33744519260 scopus 로고    scopus 로고
    • A weakened form of fictitious play in two-person zero-sum games
    • Van der Genugten B. A weakened form of fictitious play in two-person zero-sum games. Int. Game Theory Rev. 2 (2000) 307-328
    • (2000) Int. Game Theory Rev. , vol.2 , pp. 307-328
    • Van der Genugten, B.1
  • 33
    • 0011628309 scopus 로고
    • Fictitious play applied to sequences of games and discounted stochastic games
    • Vrieze O.J., and Tijs S.H. Fictitious play applied to sequences of games and discounted stochastic games. Int. J. Game Theory 11 (1982) 71-85
    • (1982) Int. J. Game Theory , vol.11 , pp. 71-85
    • Vrieze, O.J.1    Tijs, S.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.