Volume 278, Issue 1, 2011, Pages 55-62

Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's dilemma

Author keywords

Cooperation; Direct reciprocity; Prisoner's dilemma; Reinforcement learning

Indexed keywords

ENVIRONMENTAL CUE; LEARNING; NUMERICAL METHOD; SOCIAL BEHAVIOR

EID: 79952856548     PISSN: 00225193     EISSN: 10958541     Source Type: Journal    
DOI: 10.1016/j.jtbi.2011.03.005     Document Type: Article
Times cited: 54

References (47)
  • 3
    • 0042571479
    • Naive reinforcement learning with endogenous aspirations
    • Börgers T., Sarin R. Naive reinforcement learning with endogenous aspirations. Int. Econ. Rev. 2000, 41(4):921-950.
    • (2000) Int. Econ. Rev. , vol.41 , Issue.4 , pp. 921-950
    • Börgers, T.1    Sarin, R.2
  • 6
    • 27644515989
    • Learning aspiration in repeated games
    • Cho I.K., Matsui A. Learning aspiration in repeated games. J. Econ. Theory 2005, 124(2):171-201.
    • (2005) J. Econ. Theory , vol.124 , Issue.2 , pp. 171-201
    • Cho, I.K.1    Matsui, A.2
  • 8
    • 33646492363
    • The computational neurobiology of learning and reward
    • Daw N.D., Doya K. The computational neurobiology of learning and reward. Curr. Opin. Neurobiol. 2006, 16:199-204.
    • (2006) Curr. Opin. Neurobiol. , vol.16 , pp. 199-204
    • Daw, N.D.1    Doya, K.2
  • 9
    • 0009649778
    • Keeping up with the joneses: competition and the evolution of collusion
    • Dixon H.D. Keeping up with the joneses: competition and the evolution of collusion. J. Econ. Behav. Organ. 2000, 43(2):223-238.
    • (2000) J. Econ. Behav. Organ. , vol.43 , Issue.2 , pp. 223-238
    • Dixon, H.D.1
  • 10
    • 0038829878
    • Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria
    • Erev I. Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria. Am. Econ. Rev. 1998, 88:848-881.
    • (1998) Am. Econ. Rev. , vol.88 , pp. 848-881
    • Erev, I.1
  • 11
    • 0038630244
    • Simple reinforcement learning models and reciprocation in the Prisoner's dilemma game
    • G. Gigerenzer, R. Selten (Eds.)
    • Erev I., Roth A.E. Simple reinforcement learning models and reciprocation in the Prisoner's dilemma game. The Adaptive Toolbox 2001, 215-231. G. Gigerenzer, R. Selten (Eds.).
    • (2001) The Adaptive Toolbox , pp. 215-231
    • Erev, I.1    Roth, A.E.2
  • 14
    • 4444363602
    • Cooperation in the iterated Prisoner's dilemma is learned by operant conditioning mechanisms
    • Gutnisky D.A., Zanutto B.S. Cooperation in the iterated Prisoner's dilemma is learned by operant conditioning mechanisms. Artif. Life 2004, 10(4):433-461.
    • (2004) Artif. Life , vol.10 , Issue.4 , pp. 433-461
    • Gutnisky, D.A.1    Zanutto, B.S.2
  • 15
    • 0036401210
    • Simple adaptive strategy wins the Prisoner's dilemma
    • Hauert C., Stenull O. Simple adaptive strategy wins the Prisoner's dilemma. J. Theor. Biol. 2002, 218(3):261-272.
    • (2002) J. Theor. Biol. , vol.218 , Issue.3 , pp. 261-272
    • Hauert, C.1    Stenull, O.2
  • 16
    • 35048853187
    • Transient and asymptotic dynamics of reinforcement learning in games
    • Izquierdo L.R., Izquierdo S.S., Gotts N.M., Polhill J.G. Transient and asymptotic dynamics of reinforcement learning in games. Games Econ. Behav. 2007, 61(2):259-276.
    • (2007) Games Econ. Behav. , vol.61 , Issue.2 , pp. 259-276
    • Izquierdo, L.R.1    Izquierdo, S.S.2    Gotts, N.M.3    Polhill, J.G.4
  • 19
    • 0033473849
    • Satisficing and optimality in 2×2 common interest games
    • Kim Y. Satisficing and optimality in 2×2 common interest games. Econ. Theory 1999, 13(2):365-375.
    • (1999) Econ. Theory , vol.13 , Issue.2 , pp. 365-375
    • Kim, Y.1
  • 20
    • 34249965940
    • Pavlov and the Prisoner's dilemma
    • Kraines D., Kraines V. Pavlov and the Prisoner's dilemma. Theory Decis. 1989, 26(1):47-79.
    • (1989) Theory Decis. , vol.26 , Issue.1 , pp. 47-79
    • Kraines, D.1    Kraines, V.2
  • 22
    • 0030522775
    • Natural selection and social learning in Prisoner's dilemma
    • Macy M. Natural selection and social learning in Prisoner's dilemma. Sociol. Meth. Res. 1996, 25(1):103-137.
    • (1996) Sociol. Meth. Res. , vol.25 , Issue.1 , pp. 103-137
    • Macy, M.1
  • 23
    • 84936823963
    • Learning to cooperate: stochastic and tacit collusion in social exchange
    • Macy M.W. Learning to cooperate: stochastic and tacit collusion in social exchange. Am. J. Sociol. 1991, 97(3):808-843.
    • (1991) Am. J. Sociol. , vol.97 , Issue.3 , pp. 808-843
    • Macy, M.W.1
  • 24
  • 25
    • 70350354956
    • A theoretical analysis of temporal difference learning in the iterated Prisoner's dilemma game
    • Masuda N., Ohtsuki H. A theoretical analysis of temporal difference learning in the iterated Prisoner's dilemma game. Bull. Math. Biol. 2009, 71:1818-1850.
    • (2009) Bull. Math. Biol. , vol.71 , pp. 1818-1850
    • Masuda, N.1    Ohtsuki, H.2
  • 26
    • 0037057753
    • Neural economics and the biological substrates of valuation
    • Montague P.R., Berns G.S. Neural economics and the biological substrates of valuation. Neuron 2002, 36:265-284.
    • (2002) Neuron , vol.36 , pp. 265-284
    • Montague, P.R.1    Berns, G.S.2
  • 27
    • 0037612081
    • Aspiration adaptation in the ultimatum minigame
    • Napel S. Aspiration adaptation in the ultimatum minigame. Games Econ. Behav. 2003, 43(1):86-106.
    • (2003) Games Econ. Behav. , vol.43 , Issue.1 , pp. 86-106
    • Napel, S.1
  • 28
    • 0003025773
    • Stochastic strategies in the Prisoner's dilemma
    • Nowak M. Stochastic strategies in the Prisoner's dilemma. Theor. Popul. Biol. 1990, 38:93-112.
    • (1990) Theor. Popul. Biol. , vol.38 , pp. 93-112
    • Nowak, M.1
  • 29
    • 38249022971
    • Game-dynamical aspects of the Prisoner's dilemma
    • Nowak M., Sigmund K. Game-dynamical aspects of the Prisoner's dilemma. Appl. Math. Comput. 1989, 30:191-213.
    • (1989) Appl. Math. Comput. , vol.30 , pp. 191-213
    • Nowak, M.1    Sigmund, K.2
  • 30
    • 0000316822
    • The evolution of stochastic strategies in the Prisoner's dilemma
    • Nowak M., Sigmund K. The evolution of stochastic strategies in the Prisoner's dilemma. Acta Applicandae Math. 1990, 20:247-265.
    • (1990) Acta Applicandae Math. , vol.20 , pp. 247-265
    • Nowak, M.1    Sigmund, K.2
  • 31
    • 0027336968
    • A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's dilemma game
    • Nowak M., Sigmund K. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's dilemma game. Nature 1993, 364:56-58.
    • (1993) Nature , vol.364 , pp. 56-58
    • Nowak, M.1    Sigmund, K.2
  • 32
    • 33751169535
    • The Belknap Press of Harvard University Press, MA
    • Nowak M.A. Evolutionary Dynamics 2006, The Belknap Press of Harvard University Press, MA.
    • (2006) Evolutionary Dynamics
    • Nowak, M.A.1
  • 33
    • 0026471294
    • Tit for tat in heterogeneous populations
    • Nowak M.A., Sigmund K. Tit for tat in heterogeneous populations. Nature 1992, 355:250-253.
    • (1992) Nature , vol.355 , pp. 250-253
    • Nowak, M.A.1    Sigmund, K.2
  • 34
    • 0000405246
    • Automata, repeated games and noise
    • Nowak M.A., Sigmund K., El-Sedy E. Automata, repeated games and noise. J. Math. Biol. 1995, 33(7):703-722.
    • (1995) J. Math. Biol. , vol.33 , Issue.7 , pp. 703-722
    • Nowak, M.A.1    Sigmund, K.2    El-Sedy, E.3
  • 35
    • 0036292535
    • Cooperation as a result of learning with aspiration levels
    • Oechssler J. Cooperation as a result of learning with aspiration levels. J. Econ. Behav. Organ. 2002, 49(3):405-409.
    • (2002) J. Econ. Behav. Organ. , vol.49 , Issue.3 , pp. 405-409
    • Oechssler, J.1
  • 36
    • 0033481949
    • Convergence of aspirations and (partial) cooperation in the Prisoner's dilemma
    • Palomino F., Vega-Redondo F. Convergence of aspirations and (partial) cooperation in the Prisoner's dilemma. Int. J. Game Theory 1999, 28(4):465-488.
    • (1999) Int. J. Game Theory , vol.28 , Issue.4 , pp. 465-488
    • Palomino, F.1    Vega-Redondo, F.2
  • 37
    • 0031329041
    • Satisficing leads to cooperation in mutual interests games
    • Pazgal A. Satisficing leads to cooperation in mutual interests games. Int. J. Game Theory 1997, 26(4):439-453.
    • (1997) Int. J. Game Theory , vol.26 , Issue.4 , pp. 439-453
    • Pazgal, A.1
  • 38
    • 0033595095
    • The efficiency of adapting aspiration levels
    • Posch M., Pichler A., Sigmund K. The efficiency of adapting aspiration levels. Proc. R. Soc. London B 1999, 266(1427):1427-1435.
    • (1999) Proc. R. Soc. London B , vol.266 , Issue.1427 , pp. 1427-1435
    • Posch, M.1    Pichler, A.2    Sigmund, K.3
  • 40
    • 0030050933
    • Multiagent reinforcement learning in the iterated Prisoner's dilemma
    • Sandholm T.W., Crites R.H. Multiagent reinforcement learning in the iterated Prisoner's dilemma. Biosystems 1996, 37(1-2):147-166.
    • (1996) Biosystems , vol.37 , Issue.1-2 , pp. 147-166
    • Sandholm, T.W.1    Crites, R.H.2
  • 41
    • 0030896968
    • A neural substrate of prediction and reward
    • Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science 1997, 275:1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 43
    • 0000629644
    • Theories of decision-making in economics and behavioral science
    • Simon H.A. Theories of decision-making in economics and behavioral science. Am. Econ. Rev. 1959, 49(3):253-283.
    • (1959) Am. Econ. Rev. , vol.49 , Issue.3 , pp. 253-283
    • Simon, H.A.1
  • 44
    • 0002180201
    • Evolutionary Prisoner's dilemma game on a square lattice
    • Szabó G., Tőke C. Evolutionary Prisoner's dilemma game on a square lattice. Phys. Rev. E 1998, 58(1):69-73.
    • (1998) Phys. Rev. E , vol.58 , Issue.1 , pp. 69-73
    • Szabó, G.1    Tőke, C.2
  • 45
    • 0348225593
    • Dynamics of internal models in game players
    • Taiji M., Ikegami T. Dynamics of internal models in game players. Physica D 1999, 134(2):253-266.
    • (1999) Physica D , vol.134 , Issue.2 , pp. 253-266
    • Taiji, M.1    Ikegami, T.2
  • 46
    • 33746227636
    • Stochastic dynamics of invasion and fixation
    • Traulsen A., Nowak M.A., Pacheco J.M. Stochastic dynamics of invasion and fixation. Phys. Rev. E 2006, 74(1):011909.
    • (2006) Phys. Rev. E , vol.74 , Issue.1 , pp. 011909
    • Traulsen, A.1    Nowak, M.A.2    Pacheco, J.M.3
  • 47
    • 0002414229
    • The evolution of reciprocal altruism
    • Trivers R.L. The evolution of reciprocal altruism. Q. Rev. Biol. 1971, 46:35-57.
    • (1971) Q. Rev. Biol. , vol.46 , pp. 35-57
    • Trivers, R.L.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.