메뉴 건너뛰기




Volumn 104, Issue , 2015, Pages 87-114

A model for the evolution of reinforcement learning in fluctuating games

Author keywords

Evolution of cognition; Evolutionarily stable learning rules; Exploration exploitation trade off; Repeated games; Social interactions; Trial and error learning

Indexed keywords

BEHAVIORAL RESPONSE; COGNITION; ENVIRONMENTAL CHANGE; EVOLUTION; EVOLUTIONARY BIOLOGY; FITNESS; LEARNING; NATURAL SELECTION; POLYMORPHISM;

EID: 84927523336     PISSN: 00033472     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.anbehav.2015.01.037     Document Type: Article
Times cited : (16)

References (98)
  • 2
    • 33845321382 scopus 로고    scopus 로고
    • Decision-making in group foragers with incomplete information: test of individual-based model in geese
    • Amano T., Ushiyama K., Moriguchi S., Fujita G., Higuchi H. Decision-making in group foragers with incomplete information: test of individual-based model in geese. Ecological Monographs 2006, 76(4):601-616.
    • (2006) Ecological Monographs , vol.76 , Issue.4 , pp. 601-616
    • Amano, T.1    Ushiyama, K.2    Moriguchi, S.3    Fujita, G.4    Higuchi, H.5
  • 4
  • 5
    • 81155139598 scopus 로고    scopus 로고
    • Recombination and the evolution of coordinated phenotypic expression in a frequency-dependent game
    • Arbilly M., Motro U., Feldman M.W., Lotem A. Recombination and the evolution of coordinated phenotypic expression in a frequency-dependent game. Theoretical Population Biology 2011, 80(4):244-255.
    • (2011) Theoretical Population Biology , vol.80 , Issue.4 , pp. 244-255
    • Arbilly, M.1    Motro, U.2    Feldman, M.W.3    Lotem, A.4
  • 6
    • 0000036916 scopus 로고
    • The evolution of a special class of modifiable behaviors in relation to environmental pattern
    • Arnold S.J. The evolution of a special class of modifiable behaviors in relation to environmental pattern. The American Naturalist 1978, 112(984):415-427.
    • (1978) The American Naturalist , vol.112 , Issue.984 , pp. 415-427
    • Arnold, S.J.1
  • 7
    • 84949231290 scopus 로고
    • Effective choice in the prisoner's dilemma
    • Axelrod R. Effective choice in the prisoner's dilemma. The Journal of Conflict Resolution 1980, 24(1):3-25.
    • (1980) The Journal of Conflict Resolution , vol.24 , Issue.1 , pp. 3-25
    • Axelrod, R.1
  • 8
    • 0019480612 scopus 로고
    • The evolution of cooperation
    • Axelrod R., Hamilton W.D. The evolution of cooperation. Science 1981, 211:1390-1396.
    • (1981) Science , vol.211 , pp. 1390-1396
    • Axelrod, R.1    Hamilton, W.D.2
  • 10
    • 0001793657 scopus 로고    scopus 로고
    • Dynamics of stochastic approximation algorithms
    • Benaim M. Dynamics of stochastic approximation algorithms. Séminaire de Probabilités XXXIII 1999, vol. 1709:1-68.
    • (1999) Séminaire de Probabilités XXXIII , vol.1709 , pp. 1-68
    • Benaim, M.1
  • 12
    • 0024234018 scopus 로고
    • Individual decisions and the distribution of predators in a patchy environment
    • Bernstein C., Kacelnik A., Krebs J.R. Individual decisions and the distribution of predators in a patchy environment. Journal of Animal Ecology 1988, 57(3):1007-1026.
    • (1988) Journal of Animal Ecology , vol.57 , Issue.3 , pp. 1007-1026
    • Bernstein, C.1    Kacelnik, A.2    Krebs, J.R.3
  • 13
    • 44049110303 scopus 로고
    • Evolutionary stability in repeated games played by finite automata
    • Binmore K.G., Samuelson L. Evolutionary stability in repeated games played by finite automata. Journal of Economic Theory 1992, 57:278-305.
    • (1992) Journal of Economic Theory , vol.57 , pp. 278-305
    • Binmore, K.G.1    Samuelson, L.2
  • 15
    • 40449130711 scopus 로고    scopus 로고
    • Evolution of learning in fluctuating environments: when selection favors both social and exploratory individual learning
    • Borenstein E., Feldman M.W., Aoki K. Evolution of learning in fluctuating environments: when selection favors both social and exploratory individual learning. Evolution 2008, 62(3):586-602.
    • (2008) Evolution , vol.62 , Issue.3 , pp. 586-602
    • Borenstein, E.1    Feldman, M.W.2    Aoki, K.3
  • 18
    • 0002672918 scopus 로고
    • Iterative solution of games by fictitious play
    • Wiley, New York, NY, T. Koopmans (Ed.)
    • Brown G.W. Iterative solution of games by fictitious play. Activity analysis of production and allocation 1951, 374-376. Wiley, New York, NY. T. Koopmans (Ed.).
    • (1951) Activity analysis of production and allocation , pp. 374-376
    • Brown, G.W.1
  • 19
    • 0001491619 scopus 로고
    • Amathematical model for simple learning
    • Bush R.R., Mostelller F. Amathematical model for simple learning. Psychological Review 1951, 58(5):313-323.
    • (1951) Psychological Review , vol.58 , Issue.5 , pp. 313-323
    • Bush, R.R.1    Mostelller, F.2
  • 21
    • 18644365144 scopus 로고    scopus 로고
    • Experienced-weighted attraction learning in normal form games
    • Camerer C., Ho T.H. Experienced-weighted attraction learning in normal form games. Econometrica 1999, 67(4):827-874.
    • (1999) Econometrica , vol.67 , Issue.4 , pp. 827-874
    • Camerer, C.1    Ho, T.H.2
  • 22
    • 0016948894 scopus 로고
    • Optimal foraging, the marginal value theorem
    • Charnov E.L. Optimal foraging, the marginal value theorem. Theoretical Population Biology 1976, 9(2):129-136.
    • (1976) Theoretical Population Biology , vol.9 , Issue.2 , pp. 129-136
    • Charnov, E.L.1
  • 26
    • 79955607687 scopus 로고    scopus 로고
    • Physical constraints on the evolution of cooperation
    • Dijker A. Physical constraints on the evolution of cooperation. Evolutionary Biology 2011, 38(2):124-143.
    • (2011) Evolutionary Biology , vol.38 , Issue.2 , pp. 124-143
    • Dijker, A.1
  • 27
    • 84891158799 scopus 로고    scopus 로고
    • On learning dynamics underlying the evolution of learning rules
    • Dridi S., Lehmann L. On learning dynamics underlying the evolution of learning rules. Theoretical Population Biology 2014, 91:20-36.
    • (2014) Theoretical Population Biology , vol.91 , pp. 20-36
    • Dridi, S.1    Lehmann, L.2
  • 33
    • 10344252314 scopus 로고    scopus 로고
    • The mentality of crows: convergent evolution of intelligence in corvids and apes
    • Emery N.J., Clayton N.S. The mentality of crows: convergent evolution of intelligence in corvids and apes. Science 2004, 306(5703):1903-1907.
    • (2004) Science , vol.306 , Issue.5703 , pp. 1903-1907
    • Emery, N.J.1    Clayton, N.S.2
  • 36
    • 0038829878 scopus 로고    scopus 로고
    • Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria
    • Erev I., Roth A.E. Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria. American Economic Review 1998, 88(4):848-881.
    • (1998) American Economic Review , vol.88 , Issue.4 , pp. 848-881
    • Erev, I.1    Roth, A.E.2
  • 37
    • 84871188711 scopus 로고    scopus 로고
    • Exposing the behavioral gambit: the evolution of learning and decision rules
    • Fawcett T.W., Hamblin S., Giraldeau L.-A. Exposing the behavioral gambit: the evolution of learning and decision rules. Behavioral Ecology 2013, 24(1):2-11.
    • (2013) Behavioral Ecology , vol.24 , Issue.1 , pp. 2-11
    • Fawcett, T.W.1    Hamblin, S.2    Giraldeau, L.-A.3
  • 38
    • 0030186388 scopus 로고    scopus 로고
    • Individual versus social learning: evolutionary analysis in a fluctuating environment
    • Feldman M., Aoki K., Kumm J. Individual versus social learning: evolutionary analysis in a fluctuating environment. Anthropological Science 1996, 104:209-232.
    • (1996) Anthropological Science , vol.104 , pp. 209-232
    • Feldman, M.1    Aoki, K.2    Kumm, J.3
  • 39
    • 0001283474 scopus 로고    scopus 로고
    • Reinforcement-based versus belief-based learning models in experimental asymmetric-information games
    • Feltovich N. Reinforcement-based versus belief-based learning models in experimental asymmetric-information games. Econometrica 2000, 68(3):605-641.
    • (2000) Econometrica , vol.68 , Issue.3 , pp. 605-641
    • Feltovich, N.1
  • 41
    • 0038443886 scopus 로고    scopus 로고
    • Incompletely informed shorebirds that face a digestive constraint maximize net energy gain when exploiting patches
    • van Gils J.A., Schenk I.W., Bos O., Piersma T. Incompletely informed shorebirds that face a digestive constraint maximize net energy gain when exploiting patches. The American Naturalist 2003, 161(5):777-793.
    • (2003) The American Naturalist , vol.161 , Issue.5 , pp. 777-793
    • van Gils, J.A.1    Schenk, I.W.2    Bos, O.3    Piersma, T.4
  • 42
    • 70649083163 scopus 로고    scopus 로고
    • Finding the evolutionarily stable learning rule for frequency-dependent foraging
    • Hamblin S., Giraldeau L.-A. Finding the evolutionarily stable learning rule for frequency-dependent foraging. Animal Behaviour 2009, 78(6):1343-1350.
    • (2009) Animal Behaviour , vol.78 , Issue.6 , pp. 1343-1350
    • Hamblin, S.1    Giraldeau, L.-A.2
  • 43
    • 84878444206 scopus 로고    scopus 로고
    • MIT Press, Cambridge, MA, P. Hammerstein, J.R. Stevens (Eds.)
    • Evolution and the mechanisms of decision making 2012, MIT Press, Cambridge, MA. P. Hammerstein, J.R. Stevens (Eds.).
    • (2012) Evolution and the mechanisms of decision making
  • 44
    • 0019885790 scopus 로고
    • Learning the evolutionarily stable strategy
    • Harley C.B. Learning the evolutionarily stable strategy. Journal of Theoretical Biology 1981, 89(4):611-633.
    • (1981) Journal of Theoretical Biology , vol.89 , Issue.4 , pp. 611-633
    • Harley, C.B.1
  • 45
    • 0346401406 scopus 로고    scopus 로고
    • An evolutionary approach to learning in a changing environment
    • Heller D. An evolutionary approach to learning in a changing environment. Journal of Economic Theory 2004, 114(1):31-55.
    • (2004) Journal of Economic Theory , vol.114 , Issue.1 , pp. 31-55
    • Heller, D.1
  • 47
    • 33847042809 scopus 로고    scopus 로고
    • Self-tuning experience weighted attraction learning in games
    • Ho T.H., Camerer C.F., Chong J.-K. Self-tuning experience weighted attraction learning in games. Journal of Economic Theory 2007, 133(1):177-198.
    • (2007) Journal of Economic Theory , vol.133 , Issue.1 , pp. 177-198
    • Ho, T.H.1    Camerer, C.F.2    Chong, J.-K.3
  • 48
    • 0036436650 scopus 로고    scopus 로고
    • On the global convergence of stochastic fictitious play
    • Hofbauer J., Sandholm W.H. On the global convergence of stochastic fictitious play. Econometrica 2002, 70(6):2265-2294.
    • (2002) Econometrica , vol.70 , Issue.6 , pp. 2265-2294
    • Hofbauer, J.1    Sandholm, W.H.2
  • 50
    • 0036434064 scopus 로고    scopus 로고
    • Two competing models of how people learn in games
    • Hopkins E. Two competing models of how people learn in games. Econometrica 2002, 70(6):2141-2166.
    • (2002) Econometrica , vol.70 , Issue.6 , pp. 2141-2166
    • Hopkins, E.1
  • 51
  • 53
    • 0001872644 scopus 로고
    • Selective costs and benefits in the evolution of learning
    • Academic Press, San Diego, CA
    • Johnston T.D. Selective costs and benefits in the evolution of learning. Advances in the study of behavior 1982, Vol. 12:65-106. Academic Press, San Diego, CA.
    • (1982) Advances in the study of behavior , vol.12 , pp. 65-106
    • Johnston, T.D.1
  • 54
    • 41949122499 scopus 로고    scopus 로고
    • Anumerical analysis of the evolutionary stability of learning rules
    • Josephson J. Anumerical analysis of the evolutionary stability of learning rules. Journal of Economic Dynamics and Control 2008, 32(5):1569-1599.
    • (2008) Journal of Economic Dynamics and Control , vol.32 , Issue.5 , pp. 1569-1599
    • Josephson, J.1
  • 58
    • 68949177116 scopus 로고    scopus 로고
    • The evolution of social learning rules: payoff-biased and frequency-dependent biased transmission
    • Kendal J., Giraldeau L.-A., Laland K. The evolution of social learning rules: payoff-biased and frequency-dependent biased transmission. Journal of Theoretical Biology 2009, 260(2):210-219.
    • (2009) Journal of Theoretical Biology , vol.260 , Issue.2 , pp. 210-219
    • Kendal, J.1    Giraldeau, L.-A.2    Laland, K.3
  • 61
    • 2342458309 scopus 로고    scopus 로고
    • Social learning strategies
    • Laland K.N. Social learning strategies. Learning & Behavior 2004, 32(1):4-14.
    • (2004) Learning & Behavior , vol.32 , Issue.1 , pp. 4-14
    • Laland, K.N.1
  • 63
    • 0021549315 scopus 로고
    • Downy woodpecker foraging behavior: efficient sampling in simple stochastic environments
    • Lima S.L. Downy woodpecker foraging behavior: efficient sampling in simple stochastic environments. Ecology 1984, 65(1):166-174.
    • (1984) Ecology , vol.65 , Issue.1 , pp. 166-174
    • Lima, S.L.1
  • 64
    • 0033375391 scopus 로고    scopus 로고
    • Reproductive decision-making by female peacock wrasses: flexible versus fixed behavioral rules in variable environments
    • Luttbeg B., Warner R.R. Reproductive decision-making by female peacock wrasses: flexible versus fixed behavioral rules in variable environments. Behavioral Ecology 1999, 10(6):666-674.
    • (1999) Behavioral Ecology , vol.10 , Issue.6 , pp. 666-674
    • Luttbeg, B.1    Warner, R.R.2
  • 68
    • 0348166371 scopus 로고
    • Quantal response equilibria for normal form games
    • McKelvey R.D., Palfrey T.R. Quantal response equilibria for normal form games. Games and Economic Behavior 1995, 10(1):6-38.
    • (1995) Games and Economic Behavior , vol.10 , Issue.1 , pp. 6-38
    • McKelvey, R.D.1    Palfrey, T.R.2
  • 69
    • 33645124345 scopus 로고    scopus 로고
    • Bayes' theorem and its applications in animal behaviour
    • McNamara J.M., Green R.F., Olsson O. Bayes' theorem and its applications in animal behaviour. Oikos 2006, 112(2):243-251.
    • (2006) Oikos , vol.112 , Issue.2 , pp. 243-251
    • McNamara, J.M.1    Green, R.F.2    Olsson, O.3
  • 74
    • 84859888623 scopus 로고    scopus 로고
    • Evolution of theories of mind
    • Mohlin E. Evolution of theories of mind. Games and Economic Behavior 2012, 75(1):299-318.
    • (2012) Games and Economic Behavior , vol.75 , Issue.1 , pp. 299-318
    • Mohlin, E.1
  • 76
    • 67349283062 scopus 로고    scopus 로고
    • Reinforcement learning in the brain
    • Niv Y. Reinforcement learning in the brain. Journal of Mathematical Psychology 2009, 53(3):139-154.
    • (2009) Journal of Mathematical Psychology , vol.53 , Issue.3 , pp. 139-154
    • Niv, Y.1
  • 77
    • 0027336968 scopus 로고
    • Astrategy of win-stay, lose-shift that outperforms tit-for-tat in the prisoner's dilemma game
    • Nowak M.A., Sigmund K. Astrategy of win-stay, lose-shift that outperforms tit-for-tat in the prisoner's dilemma game. Nature 1993, 364:56-58.
    • (1993) Nature , vol.364 , pp. 56-58
    • Nowak, M.A.1    Sigmund, K.2
  • 78
    • 0001000786 scopus 로고
    • Nonconvergence to unstable points in urn models and stochastic approximations
    • Pemantle R. Nonconvergence to unstable points in urn models and stochastic approximations. The Annals of Probability 1990, 18(2):698-712.
    • (1990) The Annals of Probability , vol.18 , Issue.2 , pp. 698-712
    • Pemantle, R.1
  • 79
    • 0018050763 scopus 로고
    • Does the chimpanzee have a theory of mind?
    • Premack D., Woodruff G. Does the chimpanzee have a theory of mind?. Behavioral and Brain Sciences 1978, 1(04):515-526.
    • (1978) Behavioral and Brain Sciences , vol.1 , Issue.4 , pp. 515-526
    • Premack, D.1    Woodruff, G.2
  • 81
    • 77950817904 scopus 로고    scopus 로고
    • Why copy others? insights from the social learning strategies tournament
    • Rendell L., Boyd R., Cownden D., Enquist M., Eriksson K., Feldman M.W., et al. Why copy others? insights from the social learning strategies tournament. Science 2010, 328(5975):208-213.
    • (2010) Science , vol.328 , Issue.5975 , pp. 208-213
    • Rendell, L.1    Boyd, R.2    Cownden, D.3    Enquist, M.4    Eriksson, K.5    Feldman, M.W.6
  • 82
    • 0002109138 scopus 로고
    • Atheory of pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement
    • Appleton-Century-Crofts, New York, NY, A.H. Black, W.F. Prokasy (Eds.)
    • Rescorla R.A., Wagner A.R. Atheory of pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. Classical conditioning II: Current research and theory 1972, 64-99. Appleton-Century-Crofts, New York, NY. A.H. Black, W.F. Prokasy (Eds.).
    • (1972) Classical conditioning II: Current research and theory , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2
  • 83
    • 84982364966 scopus 로고
    • Does biology constrain culture?
    • Rogers A.R. Does biology constrain culture?. American Anthropologist 1988, 90(4):819-831.
    • (1988) American Anthropologist , vol.90 , Issue.4 , pp. 819-831
    • Rogers, A.R.1
  • 84
    • 0000861816 scopus 로고    scopus 로고
    • Why imitate, and if so, how?
    • Schlag K.H. Why imitate, and if so, how?. Journal of Economic Theory 1998, 78:130-156.
    • (1998) Journal of Economic Theory , vol.78 , pp. 130-156
    • Schlag, K.H.1
  • 85
  • 86
    • 84858956350 scopus 로고    scopus 로고
    • Reciprocal cooperation between unrelated rats depends on cost to donor and benefit to recipient
    • Schneeberger K., Dietz M., Taborsky M. Reciprocal cooperation between unrelated rats depends on cost to donor and benefit to recipient. BMC Evolutionary Biology 2012, 12(1):41.
    • (2012) BMC Evolutionary Biology , vol.12 , Issue.1 , pp. 41
    • Schneeberger, K.1    Dietz, M.2    Taborsky, M.3
  • 90
    • 0026300818 scopus 로고
    • Change, regularity, and value in the evolution of animal learning
    • Stephens D.W. Change, regularity, and value in the evolution of animal learning. Behavioral Ecology 1991, 2(1):77-89.
    • (1991) Behavioral Ecology , vol.2 , Issue.1 , pp. 77-89
    • Stephens, D.W.1
  • 91
    • 0001583241 scopus 로고    scopus 로고
    • Game theory and learning
    • Oxford University Press, New York, NY, L.A. Dugatkin, H.K. Reeve (Eds.)
    • Stephens D.W., Clements K.C. Game theory and learning. Game theory and animal behavior 1998, 239-260. Oxford University Press, New York, NY. L.A. Dugatkin, H.K. Reeve (Eds.).
    • (1998) Game theory and animal behavior , pp. 239-260
    • Stephens, D.W.1    Clements, K.C.2
  • 96
    • 33645113906 scopus 로고    scopus 로고
    • Are animals capable of bayesian updating? An empirical review
    • Valone T.J. Are animals capable of bayesian updating? An empirical review. Oikos 2006, 112(2):252-259.
    • (2006) Oikos , vol.112 , Issue.2 , pp. 252-259
    • Valone, T.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.