-
1
-
-
16244410118
-
On the convergence of reinforcement learning
-
Beggs A.W. On the convergence of reinforcement learning. J. Econ. Theory 122 (2005) 1-36
-
(2005)
J. Econ. Theory
, vol.122
, pp. 1-36
-
-
Beggs, A.W.1
-
2
-
-
0002277539
-
Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
-
Benaïm M., and Hirsch M.W. Mixed equilibria and dynamical systems arising from fictitious play in perturbed games. Games Econ. Behav. 29 (1999) 36-72
-
(1999)
Games Econ. Behav.
, vol.29
, pp. 36-72
-
-
Benaïm, M.1
Hirsch, M.W.2
-
3
-
-
33244474665
-
Stochastic approximation and differential inclusions
-
Benaïm M., Hofbauer J., and Sorin S. Stochastic approximation and differential inclusions. SIAM J. Control Optim. 44 (2005) 328-348
-
(2005)
SIAM J. Control Optim.
, vol.44
, pp. 328-348
-
-
Benaïm, M.1
Hofbauer, J.2
Sorin, S.3
-
4
-
-
33744508525
-
-
Berger, U., 2004. Two more classes of games with the fictitious play property. Mimeo. Vienna University of Economics
-
-
-
-
5
-
-
12444269117
-
Fictitious play in 2 × n games
-
Berger U. Fictitious play in 2 × n games. J. Econ. Theory 120 (2005) 139-154
-
(2005)
J. Econ. Theory
, vol.120
, pp. 139-154
-
-
Berger, U.1
-
6
-
-
0031281590
-
Learning through reinforcement and replicator dynamics
-
Börgers T., and Sarin R. Learning through reinforcement and replicator dynamics. J. Econ. Theory 77 (1997) 1-14
-
(1997)
J. Econ. Theory
, vol.77
, pp. 1-14
-
-
Börgers, T.1
Sarin, R.2
-
7
-
-
0002672918
-
Iterative solution of games by fictitious play
-
Koopmans T.C. (Ed), Wiley, New York
-
Brown G.W. Iterative solution of games by fictitious play. In: Koopmans T.C. (Ed). Activity Analysis of Production and Allocation (1951), Wiley, New York 374-376
-
(1951)
Activity Analysis of Production and Allocation
, pp. 374-376
-
-
Brown, G.W.1
-
8
-
-
0003262604
-
Isolated invariant sets and the Morse index
-
American Mathematical Society, Providence
-
Conley C.C. Isolated invariant sets and the Morse index. CBMS Regional Conference Series in Mathematics (1978), American Mathematical Society, Providence
-
(1978)
CBMS Regional Conference Series in Mathematics
-
-
Conley, C.C.1
-
9
-
-
33744514488
-
-
Cowan, S., 1992. Dynamical systems arising from game theory. Ph.D. thesis. University of California, Berkeley
-
-
-
-
10
-
-
0141838158
-
Learning, hypothesis testing, and Nash equilibrium
-
Foster D.P., and Young H.P. Learning, hypothesis testing, and Nash equilibrium. Games Econ. Behav. 45 (2003) 73-96
-
(2003)
Games Econ. Behav.
, vol.45
, pp. 73-96
-
-
Foster, D.P.1
Young, H.P.2
-
14
-
-
0000730470
-
Social stability and equilibrium
-
Gilboa I., and Matsui A. Social stability and equilibrium. Econometrica 59 (1991) 859-867
-
(1991)
Econometrica
, vol.59
, pp. 859-867
-
-
Gilboa, I.1
Matsui, A.2
-
15
-
-
0242684983
-
A reinforcement procedure leading to correlated equilibrium
-
Debreu G., Neuefeind W., and Trockel W. (Eds), Springer, New York
-
Hart S., and Mas-Colell A. A reinforcement procedure leading to correlated equilibrium. In: Debreu G., Neuefeind W., and Trockel W. (Eds). Economic Essays: A Festschrift for Werner Hildenbrand (2001), Springer, New York 181-200
-
(2001)
Economic Essays: A Festschrift for Werner Hildenbrand
, pp. 181-200
-
-
Hart, S.1
Mas-Colell, A.2
-
16
-
-
2942744741
-
Uncoupled dynamics cannot lead to Nash equilibrium
-
Hart S., and Mas-Colell A. Uncoupled dynamics cannot lead to Nash equilibrium. Amer. Econ. Rev. 93 (2003) 1830-1836
-
(2003)
Amer. Econ. Rev.
, vol.93
, pp. 1830-1836
-
-
Hart, S.1
Mas-Colell, A.2
-
17
-
-
33744508013
-
-
Hofbauer, J., 1995. Stability for the best response dynamics. Technical report. Institut für Mathematik, Universität Wien, Strudlhofgasse 4, A-1090 Vienna, Austria
-
-
-
-
18
-
-
20344390000
-
Learning in perturbed asymmetric games
-
Hofbauer J., and Hopkins E. Learning in perturbed asymmetric games. Games Econ. Behav. 52 (2005) 133-152
-
(2005)
Games Econ. Behav.
, vol.52
, pp. 133-152
-
-
Hofbauer, J.1
Hopkins, E.2
-
19
-
-
0036436650
-
On the global convergence of stochastic fictitious play
-
Hofbauer J., and Sandholm W.H. On the global convergence of stochastic fictitious play. Econometrica 70 (2002) 2265-2294
-
(2002)
Econometrica
, vol.70
, pp. 2265-2294
-
-
Hofbauer, J.1
Sandholm, W.H.2
-
21
-
-
33744549129
-
Attainability of boundary points under reinforcement learning
-
Hopkins E., and Posch M. Attainability of boundary points under reinforcement learning. Games Econ. Behav. 53 (2005) 110-125
-
(2005)
Games Econ. Behav.
, vol.53
, pp. 110-125
-
-
Hopkins, E.1
Posch, M.2
-
22
-
-
33744549925
-
-
Leslie, D.S., 2003. Reinforcement learning in games. Ph.D. thesis. University of Bristol
-
-
-
-
23
-
-
0346913265
-
Convergent multiple-timescales reinforcement learning algorithms in normal form games
-
Leslie D.S., and Collins E.J. Convergent multiple-timescales reinforcement learning algorithms in normal form games. Ann. Appl. Probability 13 (2003) 1231-1251
-
(2003)
Ann. Appl. Probability
, vol.13
, pp. 1231-1251
-
-
Leslie, D.S.1
Collins, E.J.2
-
24
-
-
33645029191
-
Individual Q-learning in normal form games
-
Leslie D.S., and Collins E.J. Individual Q-learning in normal form games. SIAM J. Control Optim. 44 (2005) 495-514
-
(2005)
SIAM J. Control Optim.
, vol.44
, pp. 495-514
-
-
Leslie, D.S.1
Collins, E.J.2
-
25
-
-
33744548852
-
-
Miyasawa, K., 1961. On the convergence of the learning process in a 2 × 2 non-zero-sum two-person game. Research Memorandum 33. Econometric Research Program, Princeton University, Princeton
-
-
-
-
26
-
-
0029690246
-
Fictitious play property for games with identical interests
-
Monderer D., and Shapley L.S. Fictitious play property for games with identical interests. J. Econ. Theory 68 (1996) 258-265
-
(1996)
J. Econ. Theory
, vol.68
, pp. 258-265
-
-
Monderer, D.1
Shapley, L.S.2
-
27
-
-
0001402950
-
An iterative method of solving a game
-
Robinson J. An iterative method of solving a game. Ann. Math. 54 (1951) 296-301
-
(1951)
Ann. Math.
, vol.54
, pp. 296-301
-
-
Robinson, J.1
-
28
-
-
58149324992
-
Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
-
Roth A.E., and Erev I. Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term. Games Econ. Behav. 8 (1995) 164-212
-
(1995)
Games Econ. Behav.
, vol.8
, pp. 164-212
-
-
Roth, A.E.1
Erev, I.2
-
29
-
-
0033901602
-
Convergence results for single-step on-policy reinforcement-learning algorithms
-
Singh S., Jaakkola T., Littman M.L., and Szepesvari C. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning 38 (2000) 287-308
-
(2000)
Machine Learning
, vol.38
, pp. 287-308
-
-
Singh, S.1
Jaakkola, T.2
Littman, M.L.3
Szepesvari, C.4
-
31
-
-
0017819644
-
Evolutionarily stable strategies and game dynamics
-
Taylor P.D., and Jonker L.D. Evolutionarily stable strategies and game dynamics. Math. Biosc. 40 (1978) 145-156
-
(1978)
Math. Biosc.
, vol.40
, pp. 145-156
-
-
Taylor, P.D.1
Jonker, L.D.2
-
32
-
-
33744519260
-
A weakened form of fictitious play in two-person zero-sum games
-
Van der Genugten B. A weakened form of fictitious play in two-person zero-sum games. Int. Game Theory Rev. 2 (2000) 307-328
-
(2000)
Int. Game Theory Rev.
, vol.2
, pp. 307-328
-
-
Van der Genugten, B.1
-
33
-
-
0011628309
-
Fictitious play applied to sequences of games and discounted stochastic games
-
Vrieze O.J., and Tijs S.H. Fictitious play applied to sequences of games and discounted stochastic games. Int. J. Game Theory 11 (1982) 71-85
-
(1982)
Int. J. Game Theory
, vol.11
, pp. 71-85
-
-
Vrieze, O.J.1
Tijs, S.H.2