



Volume 91, 2014, Pages 20-36

On learning dynamics underlying the evolution of learning rules

Author keywords

Evolutionary game theory; Fictitious play; Fluctuating environments; Producer scrounger game; Reinforcement learning; Stochastic approximation

Indexed keywords

ANIMAL; COMPUTER SIMULATION; EVOLUTIONARY BIOLOGY; GAME THEORY; LEARNING; NATURAL SELECTION; NUMERICAL MODEL; STOCHASTICITY

EID: 84891158799     PISSN: 00405809     EISSN: 10960325     Source Type: Journal    
DOI: 10.1016/j.tpb.2013.09.003     Document Type: Article
Times cited: 23

References (100)
  • 1
    • 33749864692
    • Optimal tuning of continual online exploration in reinforcement learning
    • Springer, Berlin / Heidelberg, S. Kollias, A. Stafylopatis, W. Duch, E. Oja (Eds.) Artificial Neural Networks-ICANN 2006
    • Achbany Y., Fouss F., Yen L., Pirotte A., Saerens M. Optimal tuning of continual online exploration in reinforcement learning. Lecture Notes in Computer Science 2006, vol. 4131:790-800. Springer, Berlin / Heidelberg. S. Kollias, A. Stafylopatis, W. Duch, E. Oja (Eds.).
    • (2006) Lecture Notes in Computer Science , vol.4131 , pp. 790-800
    • Achbany, Y.1    Fouss, F.2    Yen, L.3    Pirotte, A.4    Saerens, M.5
  • 3
    • 76249088580
    • The evolution of reciprocity: social types or social incentives?
    • André J.-B. The evolution of reciprocity: social types or social incentives? The American Naturalist 2010, 175:197-210.
    • (2010) The American Naturalist , vol.175 , pp. 197-210
    • André, J.-B.1
  • 5
    • 80054751337
    • Evolution of social learning when high expected payoffs are associated with high risk of failure
    • Arbilly M., Motro U., Feldman M.W., Lotem A. Evolution of social learning when high expected payoffs are associated with high risk of failure. Journal of The Royal Society Interface 2011, 8:1604-1615.
    • (2011) Journal of The Royal Society Interface , vol.8 , pp. 1604-1615
    • Arbilly, M.1    Motro, U.2    Feldman, M.W.3    Lotem, A.4
  • 6
    • 81155139598
    • Recombination and the evolution of coordinated phenotypic expression in a frequency-dependent game
    • Arbilly M., Motro U., Feldman M.W., Lotem A. Recombination and the evolution of coordinated phenotypic expression in a frequency-dependent game. Theoretical Population Biology 2011, 80:244-255.
    • (2011) Theoretical Population Biology , vol.80 , pp. 244-255
    • Arbilly, M.1    Motro, U.2    Feldman, M.W.3    Lotem, A.4
  • 7
    • 84949231290
    • Effective choice in the prisoner's dilemma
    • Axelrod R. Effective choice in the prisoner's dilemma. The Journal of Conflict Resolution 1980, 24:3-25.
    • (1980) The Journal of Conflict Resolution , vol.24 , pp. 3-25
    • Axelrod, R.1
  • 8
    • 0019480612
    • The evolution of cooperation
    • Axelrod R., Hamilton W.D. The evolution of cooperation. Science 1981, 211:1390-1396.
    • (1981) Science , vol.211 , pp. 1390-1396
    • Axelrod, R.1    Hamilton, W.D.2
  • 9
    • 0001793657
    • Dynamics of stochastic approximation algorithms
    • Springer, Berlin, J. Azéma (Ed.)
    • Benaïm M. Dynamics of stochastic approximation algorithms. Séminaire de Probabilités XXXIII, vol. 1709 1999, 1-68. Springer, Berlin. J. Azéma (Ed.).
    • (1999) Séminaire de Probabilités XXXIII, vol. 1709 , pp. 1-68
    • Benaïm, M.1
  • 10
    • 84891163591
    • Promenade Aléatoire: Chaînes de Markov et Simulations; Martingales et Stratégies. Editions de l'Ecole Polytechnique, Palaiseau, France
    • Benaïm, M., El Karoui, N., 2005. Promenade Aléatoire: Chaînes de Markov et Simulations; Martingales et Stratégies. Editions de l'Ecole Polytechnique, Palaiseau, France.
    • (2005)
    • Benaïm, M.1    El Karoui, N.2
  • 11
    • 0002277539
    • Mixed equilibria and dynamical systems arising from fictitious play in perturbed games
    • Benaïm M., Hirsch M.W. Mixed equilibria and dynamical systems arising from fictitious play in perturbed games. Games and Economic Behavior 1999, 29:36-72.
    • (1999) Games and Economic Behavior , vol.29 , pp. 36-72
    • Benaïm, M.1    Hirsch, M.W.2
  • 12
    • 0039521654
    • Stochastic approximation algorithms with constant step size whose average is cooperative
    • Benaïm M., Hirsch M.W. Stochastic approximation algorithms with constant step size whose average is cooperative. The Annals of Applied Probability 1999, 9:216-241.
    • (1999) The Annals of Applied Probability , vol.9 , pp. 216-241
    • Benaïm, M.1    Hirsch, M.W.2
  • 14
    • 0024234018
    • Individual decisions and the distribution of predators in a patchy environment
    • Bernstein C., Kacelnik A., Krebs J.R. Individual decisions and the distribution of predators in a patchy environment. Journal of Animal Ecology 1988, 57:1007-1026.
    • (1988) Journal of Animal Ecology , vol.57 , pp. 1007-1026
    • Bernstein, C.1    Kacelnik, A.2    Krebs, J.R.3
  • 15
    • 44049110303
    • Evolutionary stability in repeated games played by finite automata
    • Binmore K.G., Samuelson L. Evolutionary stability in repeated games played by finite automata. Journal of Economic Theory 1992, 57:278-305.
    • (1992) Journal of Economic Theory , vol.57 , pp. 278-305
    • Binmore, K.G.1    Samuelson, L.2
  • 16
    • 0031281590
    • Learning through reinforcement and replicator dynamics
    • Börgers T., Sarin R. Learning through reinforcement and replicator dynamics. Journal of Economic Theory 1997, 77:1-14.
    • (1997) Journal of Economic Theory , vol.77 , pp. 1-14
    • Börgers, T.1    Sarin, R.2
  • 19
  • 20
    • 0001491619
    • A mathematical model for simple learning
    • Bush R.R., Mosteller F. A mathematical model for simple learning. Psychological Review 1951, 58:313-323.
    • (1951) Psychological Review , vol.58 , pp. 313-323
    • Bush, R.R.1    Mosteller, F.2
  • 22
    • 18644365144
    • Experience-weighted attraction learning in normal form games
    • Camerer C., Ho T.H. Experience-weighted attraction learning in normal form games. Econometrica 1999, 67:827-874.
    • (1999) Econometrica , vol.67 , pp. 827-874
    • Camerer, C.1    Ho, T.H.2
  • 24
    • 0000076619
    • Do chimpanzees cooperate in a learning task?
    • Chalmeau R. Do chimpanzees cooperate in a learning task? Primates 1994, 35:385-392.
    • (1994) Primates , vol.35 , pp. 385-392
    • Chalmeau, R.1
  • 26
    • 27644515989
    • Learning aspiration in repeated games
    • Cho I.-K., Matsui A. Learning aspiration in repeated games. Journal of Economic Theory 2005, 124:171-201.
    • (2005) Journal of Economic Theory , vol.124 , pp. 171-201
    • Cho, I.-K.1    Matsui, A.2
  • 28
    • 79955607687
    • Physical constraints on the evolution of cooperation
    • Dijker A. Physical constraints on the evolution of cooperation. Evolutionary Biology 2011, 38:124-143.
    • (2011) Evolutionary Biology , vol.38 , pp. 124-143
    • Dijker, A.1
  • 31
    • 10344252314
    • The mentality of crows: Convergent evolution of intelligence in corvids and apes
    • Emery N.J., Clayton N.S. The mentality of crows: Convergent evolution of intelligence in corvids and apes. Science 2004, 306:1903-1907.
    • (2004) Science , vol.306 , pp. 1903-1907
    • Emery, N.J.1    Clayton, N.S.2
  • 33
    • 0038829878
    • Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
    • Erev I., Roth A.E. Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria. American Economic Review 1998, 88:848-881.
    • (1998) American Economic Review , vol.88 , pp. 848-881
    • Erev, I.1    Roth, A.E.2
  • 34
    • 84871188711
    • Exposing the behavioral gambit: the evolution of learning and decision rules
    • Fawcett T.W., Hamblin S., Giraldeau L.-A. Exposing the behavioral gambit: the evolution of learning and decision rules. Behavioral Ecology 2013, 24:2-11.
    • (2013) Behavioral Ecology , vol.24 , pp. 2-11
    • Fawcett, T.W.1    Hamblin, S.2    Giraldeau, L.-A.3
  • 35
    • 0030186388
    • Individual versus social learning: evolutionary analysis in a fluctuating environment
    • Feldman M., Aoki K., Kumm J. Individual versus social learning: evolutionary analysis in a fluctuating environment. Anthropological Science 1996, 104:209-232.
    • (1996) Anthropological Science , vol.104 , pp. 209-232
    • Feldman, M.1    Aoki, K.2    Kumm, J.3
  • 36
    • 0141838158
    • Learning, hypothesis testing, and Nash equilibrium
    • Foster D.P., Young H. Learning, hypothesis testing, and Nash equilibrium. Games and Economic Behavior 2003, 45:73-96.
    • (2003) Games and Economic Behavior , vol.45 , pp. 73-96
    • Foster, D.P.1    Young, H.2
  • 38
    • 78650868877
    • Heterogeneous beliefs and local information in stochastic fictitious play
    • Fudenberg D., Takahashi S. Heterogeneous beliefs and local information in stochastic fictitious play. Games and Economic Behavior 2011, 71:100-120.
    • (2011) Games and Economic Behavior , vol.71 , pp. 100-120
    • Fudenberg, D.1    Takahashi, S.2
  • 44
    • 70649083163
    • Finding the evolutionarily stable learning rule for frequency-dependent foraging
    • Hamblin S., Giraldeau L.-A. Finding the evolutionarily stable learning rule for frequency-dependent foraging. Animal Behaviour 2009, 78:1343-1350.
    • (2009) Animal Behaviour , vol.78 , pp. 1343-1350
    • Hamblin, S.1    Giraldeau, L.-A.2
  • 45
    • 84878444206
    • MIT Press, Cambridge, MA, P. Hammerstein, J.R. Stevens (Eds.)
    • Evolution and the Mechanisms of Decision Making 2012, MIT Press, Cambridge, MA. P. Hammerstein, J.R. Stevens (Eds.).
    • (2012) Evolution and the Mechanisms of Decision Making
  • 46
    • 0019885790
    • Learning the evolutionarily stable strategy
    • Harley C.B. Learning the evolutionarily stable strategy. Journal of Theoretical Biology 1981, 89:611-633.
    • (1981) Journal of Theoretical Biology , vol.89 , pp. 611-633
    • Harley, C.B.1
  • 49
    • 33847042809
    • Self-tuning experience weighted attraction learning in games
    • Ho T.H., Camerer C.F., Chong J.-K. Self-tuning experience weighted attraction learning in games. Journal of Economic Theory 2007, 133:177-198.
    • (2007) Journal of Economic Theory , vol.133 , pp. 177-198
    • Ho, T.H.1    Camerer, C.F.2    Chong, J.-K.3
  • 50
    • 0036436650
    • On the global convergence of stochastic fictitious play
    • Hofbauer J., Sandholm W.H. On the global convergence of stochastic fictitious play. Econometrica 2002, 70:2265-2294.
    • (2002) Econometrica , vol.70 , pp. 2265-2294
    • Hofbauer, J.1    Sandholm, W.H.2
  • 53
    • 0000359838
    • Pavlovian conditioning of aggressive behavior in blue gourami fish (Trichogaster trichopterus): winners become winners and losers stay losers
    • Hollis K.L., Dumas M.J., Singh P., Fackelman P. Pavlovian conditioning of aggressive behavior in blue gourami fish (Trichogaster trichopterus): winners become winners and losers stay losers. Journal of Comparative Psychology 1995, 109:123-133.
    • (1995) Journal of Comparative Psychology , vol.109 , pp. 123-133
    • Hollis, K.L.1    Dumas, M.J.2    Singh, P.3    Fackelman, P.4
  • 54
    • 0036434064
    • Two competing models of how people learn in games
    • Hopkins E. Two competing models of how people learn in games. Econometrica 2002, 70:2141-2166.
    • (2002) Econometrica , vol.70 , pp. 2141-2166
    • Hopkins, E.1
  • 55
    • 0021097184
    • Comments on "Learning the evolutionarily stable strategy"
    • Houston A.I. Comments on "Learning the evolutionarily stable strategy". Journal of Theoretical Biology 1983, 105:175-178.
    • (1983) Journal of Theoretical Biology , vol.105 , pp. 175-178
    • Houston, A.I.1
  • 58
    • 0002298153
    • Bayesian learning in normal form games
    • Jordan J.S. Bayesian learning in normal form games. Games and Economic Behavior 1991, 3:60-81.
    • (1991) Games and Economic Behavior , vol.3 , pp. 60-81
    • Jordan, J.S.1
  • 59
    • 41949122499
    • A numerical analysis of the evolutionary stability of learning rules
    • Josephson J. A numerical analysis of the evolutionary stability of learning rules. Journal of Economic Dynamics and Control 2008, 32:1569-1599.
    • (2008) Journal of Economic Dynamics and Control , vol.32 , pp. 1569-1599
    • Josephson, J.1
  • 65
    • 0017526570
    • Analysis of recursive stochastic algorithms
    • Ljung L. Analysis of recursive stochastic algorithms. IEEE Transactions on Automatic Control 1977, 22:551-575.
    • (1977) IEEE Transactions on Automatic Control , vol.22 , pp. 551-575
    • Ljung, L.1
  • 66
    • 84871250172
    • Learning to avoid the behavioral gambit
    • Lotem A. Learning to avoid the behavioral gambit. Behavioral Ecology 2013, 24:13.
    • (2013) Behavioral Ecology , vol.24 , pp. 13
    • Lotem, A.1
  • 70
    • 34548719708
    • The logic of animal conflict
    • Maynard Smith J., Price G.R. The logic of animal conflict. Nature 1973, 246:15-18.
    • (1973) Nature , vol.246 , pp. 15-18
    • Maynard Smith, J.1    Price, G.R.2
  • 71
    • 84891148861
    • Baryplot 1.0
    • McElreath, R., 2010. Baryplot 1.0.
    • (2010)
    • McElreath, R.1
  • 76
    • 67349283062
    • Reinforcement learning in the brain
    • Niv Y. Reinforcement learning in the brain. Journal of Mathematical Psychology 2009, 53:139-154.
    • (2009) Journal of Mathematical Psychology , vol.53 , pp. 139-154
    • Niv, Y.1
  • 77
    • 0002710392
    • Some convergence theorems for stochastic learning models with distance diminishing operators
    • Norman M.F. Some convergence theorems for stochastic learning models with distance diminishing operators. Journal of Mathematical Psychology 1968, 5:61-101.
    • (1968) Journal of Mathematical Psychology , vol.5 , pp. 61-101
    • Norman, M.F.1
  • 79
    • 84884285374
    • R Development Core Team
    • R Foundation for Statistical Computing, Vienna, Austria
    • R Development Core Team 2011. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria.
    • (2011) R: A Language and Environment for Statistical Computing
  • 81
    • 0002109138
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • Appleton-Century-Crofts, New York (NY), A.H. Black, W.F. Prokasy (Eds.)
    • Rescorla R.A., Wagner A.R. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. Classical Conditioning II: Current Research and Theory 1972, 64-99. Appleton-Century-Crofts, New York (NY). A.H. Black, W.F. Prokasy (Eds.).
    • (1972) Classical Conditioning II: Current Research and Theory , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2
  • 82
    • 0343329755
    • Density-dependent patch exploitation and acquisition of environmental information
    • Rodriguez-Gironés M.A., Vásquez R.A. Density-dependent patch exploitation and acquisition of environmental information. Theoretical Population Biology 1997, 52:32-42.
    • (1997) Theoretical Population Biology , vol.52 , pp. 32-42
    • Rodriguez-Gironés, M.A.1    Vásquez, R.A.2
  • 83
    • 84982364966
    • Does biology constrain culture?
    • Rogers A.R. Does biology constrain culture? American Anthropologist 1988, 90:819-831.
    • (1988) American Anthropologist , vol.90 , pp. 819-831
    • Rogers, A.R.1
  • 85
    • 0000861816
    • Why imitate, and if so, how?
    • Schlag K.H. Why imitate, and if so, how? Journal of Economic Theory 1998, 78:130-156.
    • (1998) Journal of Economic Theory , vol.78 , pp. 130-156
    • Schlag, K.H.1
  • 87
    • 0026300818
    • Change, regularity, and value in the evolution of animal learning
    • Stephens D.W. Change, regularity, and value in the evolution of animal learning. Behavioral Ecology 1991, 2:77-89.
    • (1991) Behavioral Ecology , vol.2 , pp. 77-89
    • Stephens, D.W.1
  • 91
    • 0029584479
    • Properties of evolutionarily stable learning rules
    • Tracy N.D., Seaman J.W. Properties of evolutionarily stable learning rules. Journal of Theoretical Biology 1995, 177:193-198.
    • (1995) Journal of Theoretical Biology , vol.177 , pp. 193-198
    • Tracy, N.D.1    Seaman, J.W.2
  • 93
    • 77949873664
    • Analyzing behavior implied by EWA learning: an emphasis on distinguishing reinforcement from belief learning
    • van der Horst W., van Assen M., Snijders C. Analyzing behavior implied by EWA learning: an emphasis on distinguishing reinforcement from belief learning. Journal of Mathematical Psychology 2010, 54:222-229.
    • (2010) Journal of Mathematical Psychology , vol.54 , pp. 222-229
    • van der Horst, W.1    van Assen, M.2    Snijders, C.3
  • 94
    • 0032014621
    • Pavlovian conditioning of social affiliative behavior in the Mongolian gerbil (Meriones unguiculatus)
    • Villarreal R., Domjan M. Pavlovian conditioning of social affiliative behavior in the Mongolian gerbil (Meriones unguiculatus). Journal of Comparative Psychology 1998, 112:26-35.
    • (1998) Journal of Comparative Psychology , vol.112 , pp. 26-35
    • Villarreal, R.1    Domjan, M.2
  • 96
    • 0034951517
    • A simple learning strategy that realizes robust cooperation better than Pavlov in iterated prisoners' dilemma
    • Wakano J.Y., Yamamura N. A simple learning strategy that realizes robust cooperation better than Pavlov in iterated prisoners' dilemma. Journal of Ethology 2001, 19:1-8.
    • (2001) Journal of Ethology , vol.19 , pp. 1-8
    • Wakano, J.Y.1    Yamamura, N.2
  • 97
    • 80051590423
    • Individuality in nest building: do southern masked weaver (Ploceus velatus) males vary in their nest-building behaviour?
    • Walsh P.T., Hansell M., Borello W.D., Healy S.D. Individuality in nest building: do southern masked weaver (Ploceus velatus) males vary in their nest-building behaviour? Behavioural Processes 2011, 88:1-6.
    • (2011) Behavioural Processes , vol.88 , pp. 1-6
    • Walsh, P.T.1    Hansell, M.2    Borello, W.D.3    Healy, S.D.4
  • 99
    • 84891160830
    • Wolfram Research, Inc. Mathematica, Version 8.0.4. Champaign, Illinois
    • Wolfram Research, Inc. 2011. Mathematica, Version 8.0.4. Champaign, Illinois.
    • (2011)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.