메뉴 건너뛰기




Volumn 1, Issue 1, 2001, Pages

Reinforcement Learning in Repeated Interaction Games

Author keywords

aspirations; bounded rationality; cooperation; coordination; reinforcement learning

Indexed keywords


EID: 0038660317     PISSN: None     EISSN: 19351704     Source Type: Journal    
DOI: 10.2202/1534-5963.1008     Document Type: Article
Times cited : (28)

References (60)
  • 1
    • 0001784118 scopus 로고
    • On Designing Economic Agents that Behave like Human Agents
    • B. Arthur, "On Designing Economic Agents that Behave Like Human Agents, " Journal of Evolutionary Economics 3, 1993, 1-22.
    • (1993) Journal of Evolutionary Economics , vol.3 , pp. 1-22
    • Arthur, B.1
  • 3
    • 0010875005 scopus 로고
    • Discussion Paper No. 9442 Center for Economic Research, Tilburg University May Revised, mimeo, Department of Economics, Boston University 1995
    • J. Bendor, D. Mookherjee and D. Ray, "Aspirations, Adaptive Learning and Cooperation in Repeated Games", Discussion Paper No. 9442, Center for Economic Research, Tilburg University, May 1994. Revised, mimeo, Department of Economics, Boston University, 1995.
    • (1994) Aspirations, Adaptive Learning and Cooperation in Repeated Games
    • Bendor, J.1    Mookherjee, D.2    Ray, D.3
  • 5
    • 0031161454 scopus 로고    scopus 로고
    • Muddling Through: Noisy Equilibrium Selection
    • K. Binmore and L. Samuelson, "Muddling Through: Noisy Equilibrium Selection, " Journal of Economic Theory, 74, 1997, 235-265.
    • (1997) Journal of Economic Theory , vol.74 , pp. 235-265
    • Binmore, K.1    Samuelson, L.2
  • 6
    • 0031281590 scopus 로고    scopus 로고
    • Learning through Reinforcement and Replicator Dynamics
    • T. Börgers and R. Sarin, "Learning through Reinforcement and Replicator Dynamics, " Journal of Economic Theory 77, 1997, 1-14.
    • (1997) Journal of Economic Theory , vol.77 , pp. 1-14
    • Börgers, T.1    Sarin, R.2
  • 7
    • 0042571479 scopus 로고    scopus 로고
    • Naive Reinforcement Learning with Endogenous Aspirations
    • T. Börgers and R. Sarin, , "Naive Reinforcement Learning with Endogenous Aspirations, " International Economic Review 41, 2000, 921-950.
    • (2000) International Economic Review , vol.41 , pp. 921-950
    • Börgers, T.1    Sarin, R.2
  • 10
    • 29144481480 scopus 로고    scopus 로고
    • A formal structure for multiple choice situations
    • edited by R.M. Thrall, C.H. Coombs and R.L. Davis 1954, New York Wiley
    • R. Bush, F. Mosteller and G. Thompson, "A Formal Structure For Multiple Choice Situations, " in Decision Processes, edited by R.M. Thrall, C.H. Coombs and R.L. Davis, 1954, New York: Wiley.
    • Decision Processes
    • Bush, R.1    Mosteller, F.2    Thompson, G.3
  • 11
    • 18644365144 scopus 로고    scopus 로고
    • Experience-weighted Attraction Learning in Normal Form Games
    • C. Camerer and T. Ho, "Experience-weighted Attraction Learning in Normal Form Games, " Econometrica, 67(4), 1999, 827-874.
    • (1999) Econometrica , vol.67 , Issue.4 , pp. 827-874
    • Camerer, C.1    Ho, T.2
  • 12
    • 0000742255 scopus 로고
    • A Stochastic Learning Model of Economic Behavior
    • J. Cross, "A Stochastic Learning Model of Economic Behavior, " Quarterly Journal of Economics, 87 (1973), 239-266.
    • (1973) Quarterly Journal of Economics , vol.87 , pp. 239-266
    • Cross, J.1
  • 14
    • 0009649778 scopus 로고    scopus 로고
    • Keeping up with the Joneses: Competition and the Evolution of Collusion
    • forthcoming I. Erev and A. Roth, "On the Need for Low Rationality, Cognitive Game Theory: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria working paper, August 1995, Department of Economics University of Pittsburgh
    • H.D. Dixon, "Keeping up with the Joneses: Competition and the Evolution of Collusion", Journal of Economic Behavior and Organization, 2000, forthcoming. I. Erev and A. Roth, "On the Need for Low Rationality, Cognitive Game Theory: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria, " working paper, August 1995, Department of Economics, University of Pittsburgh.
    • (2000) Journal of Economic Behavior and Organization
    • Dixon, H.D.1
  • 15
    • 0038829878 scopus 로고    scopus 로고
    • Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique Mixed Strategy Equilibria
    • H.D. Dixon, "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique Mixed Strategy Equilibria, " American Economic Review, 88, 1998, 848-881.
    • (1998) American Economic Review , vol.88 , pp. 848-881
    • Dixon, H.D.1
  • 16
    • 0012861020 scopus 로고
    • Individual behavior in uncertain situations: An interpretation in terms of statistical association theory
    • edited by R.M. Thrall, C.H. Coombs and R.L. Davis, New York Wiley
    • W. Estes, "Individual Behavior in Uncertain Situations: An Interpretation in Terms of Statistical Association Theory, " in Decision Processes, edited by R.M. Thrall, C.H. Coombs and R.L. Davis, 1954, New York: Wiley.
    • (1954) Decision Processes
    • Estes, W.1
  • 22
    • 0000038653 scopus 로고
    • How to Decide How to Decide How to ..: Modeling Limited Rationality
    • B. Lipman (1991), "How to Decide How to Decide How to ..: Modeling Limited Rationality, " Econometrica 59, 1105-1125.
    • (1991) Econometrica , vol.59 , pp. 1105-1125
    • Lipman, B.1
  • 25
    • 0002053554 scopus 로고
    • Learning Behavior in an Experimental Matching Pennies Game
    • D. Mookherjee and B. Sopher, "Learning Behavior in an Experimental Matching Pennies Game, " Games and Economic Behavior 7, 1994, 62-91.
    • (1994) Games and Economic Behavior , vol.7 , pp. 62-91
    • Mookherjee, D.1    Sopher, B.2
  • 26
    • 0002159270 scopus 로고    scopus 로고
    • Learning and Decision Costs in Experimental Constant Sum Games
    • D. Mookherjee and B. Sopher, "Learning and Decision Costs in Experimental Constant Sum Games, " Games and Economic Behavior 19, 1997, 97-132.
    • (1997) Games and Economic Behavior , vol.19 , pp. 97-132
    • Mookherjee, D.1    Sopher, B.2
  • 27
    • 0020813107 scopus 로고
    • The Use of Learning Algorithms in Telephone Traffic Routing: A Methodology
    • K. Narendra and P. Mars, "The Use of Learning Algorithms in Telephone Traffic Routing: A Methodology, " Automatica, 19(5), 1983, 495-502.
    • (1983) Automatica , vol.19 , Issue.5 , pp. 495-502
    • Narendra, K.1    Mars, P.2
  • 31
    • 0038829878 scopus 로고    scopus 로고
    • Predicting how people play games: Reinforcement learning in experimental games with unique mixed strategy equilibria
    • I. Erev and A. Roth, "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique Mixed Strategy Equilibria, " American Economic Review, 88, 1998, 848-881.
    • (1998) American Economic Review , vol.88 , pp. 848-881
    • Erev, I.1    Roth, A.2
  • 32
    • 0012861020 scopus 로고
    • Individual behavior in uncertain situations: An interpretation in terms of statistical association theory
    • edited by R.M. Thrall, C.H. Coombs and R.L. Davis, New York Wiley
    • W. Estes, "Individual Behavior in Uncertain Situations: An Interpretation in Terms of Statistical Association Theory, " in Decision Processes, edited by R.M. Thrall, C.H. Coombs and R.L. Davis, 1954, New York: Wiley.
    • (1954) Decision Processes
    • Estes, W.1
  • 38
    • 0000038653 scopus 로고
    • How to Decide How to Decide How to ..: Modeling Limited Rationality
    • B. Lipman (1991), "How to Decide How to Decide How to ..: Modeling Limited Rationality, " Econometrica 59, 1105-1125.
    • (1991) Econometrica , vol.59 , pp. 1105-1125
    • Lipman, B.1
  • 41
    • 0002053554 scopus 로고
    • Learning behavior in an experimental matching pennies game
    • D. Mookherjee and B. Sopher, "Learning Behavior in an Experimental Matching Pennies Game, " Games and Economic Behavior 7, 1994, 62-91.
    • (1994) Games and Economic Behavior , vol.7 , pp. 62-91
    • Mookherjee, D.1    Sopher, B.2
  • 42
    • 0002159270 scopus 로고    scopus 로고
    • Learning and decision costs in experimental constant sum games
    • D. Mookherjee and B. Sopher, "Learning and Decision Costs in Experimental Constant Sum Games, " Games and Economic Behavior 19, 1997, 97-132.
    • (1997) Games and Economic Behavior , vol.19 , pp. 97-132
    • Mookherjee, D.1    Sopher, B.2
  • 43
    • 0020813107 scopus 로고
    • The use of learning algorithms in telephone traffic routing: A methodology
    • K. Narendra and P. Mars, "The Use of Learning Algorithms in Telephone Traffic Routing: A Methodology, " Automatica, 19(5), 1983, 495-502.
    • (1983) Automatica , vol.19 , Issue.5 , pp. 495-502
    • Narendra, K.1    Mars, P.2
  • 47
    • 0033481949 scopus 로고    scopus 로고
    • Convergence of Aspirations and (Partial) Cooperation in the Prisoner?s Dilemma
    • F. Palomino and F. Vega-Redondo, "Convergence of Aspirations and (Partial) Cooperation in the Prisoner?s Dilemma", International Journal of Game Theory, 28(4), 1999, 465-488.
    • (1999) International Journal of Game Theory , vol.28 , Issue.4 , pp. 465-488
    • Palomino, F.1    Vega-Redondo, F.2
  • 48
    • 0005107096 scopus 로고
    • Learning algorithms for repeated bimatrix nash games with incomplete information
    • G. Papavassilopoulos, "Learning Algorithms for Repeated Bimatrix Nash Games with Incomplete Information, " Journal of Optimization Theory and Applications, 62(3), 1989, 467-488.
    • (1989) Journal of Optimization Theory and Applications , vol.62 , Issue.3 , pp. 467-488
    • Papavassilopoulos, G.1
  • 50
    • 0031329041 scopus 로고    scopus 로고
    • Satisficing leads to cooperation in mutual interest games
    • A. Pazgal, "Satisficing Leads to Cooperation in Mutual Interest Games", International Journal of Game Theory, 26, 1997, 439-453.
    • (1997) International Journal of Game Theory , vol.26 , pp. 439-453
    • Pazgal, A.1
  • 51
    • 0030194779 scopus 로고    scopus 로고
    • Efficient equilibrium selection in evolutionary games with random matching
    • A. Robson and F. Vega-Redondo, "Efficient Equilibrium Selection in Evolutionary Games with Random Matching, " Journal of Economic Theory, 70(1), 1996, 65-92.
    • (1996) Journal of Economic Theory , vol.70 , Issue.1 , pp. 65-92
    • Robson, A.1    Vega-Redondo, F.2
  • 52
    • 58149324992 scopus 로고    scopus 로고
    • Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
    • A. Roth and I. Erev, Learning in extensive-form games: experimental data and simple dynamic models in the intermediate term, " Games Econ. Behav. 8, 1995, 164-212.
    • Games Econ. Behav. 8 , vol.1995 , pp. 164-212
    • Roth, A.1    Erev, I.2
  • 53
    • 0000319195 scopus 로고
    • The Chain Store Pradaox
    • R. Selten, "The Chain Store Pradaox, " Theory and Decision, 9, 1978, 127-159.
    • (1978) Theory and Decision , vol.9 , pp. 127-159
    • Selten, R.1
  • 54
    • 0003163893 scopus 로고
    • Evolution, Learning and Economic Behavior
    • R. Selten, "Evolution, Learning and Economic Behavior, " Games and Economic Behavior 3 (1991), 3-24.
    • (1991) Games and Economic Behavior , vol.3 , pp. 3-24
    • Selten, R.1
  • 55
    • 46149136660 scopus 로고
    • End behavior in sequences of finite prisoners? dilemma supergames
    • R. Selten and R. Stoecker, End behavior in sequences of finite Prisoners? Dilemma supergames, " J. Econ. Behav. Organ.7 (1986), 47-70.
    • (1986) J. Econ. Behav. Organ , vol.7 , pp. 47-70
    • Selten, R.1    Stoecker, R.2
  • 56
    • 84959810873 scopus 로고
    • A Behavioral Model of Rational Choice
    • H. Simon, "A Behavioral Model of Rational Choice, " Quarterly Journal of Economics, 69 (1955), 99-118.
    • (1955) Quarterly Journal of Economics , vol.69 , pp. 99-118
    • Simon, H.1
  • 58
    • 0000629644 scopus 로고
    • Theories of Decision Making in Economics and Behavioral Science
    • H. Simon, "Theories of Decision Making in Economics and Behavioral Science, " American Economic Review, 49(1), 1959, 253-283.
    • (1959) American Economic Review , vol.49 , Issue.1 , pp. 253-283
    • Simon, H.1
  • 59
    • 0004131079 scopus 로고
    • Markov Learning Models for Multiperson Interactions
    • University Press
    • P. Suppes and R. Atkinson, "Markov Learning Models for Multiperson Interactions, " Stanford: Stanford University Press, 1960.
    • (1960) Stanford: Stanford
    • Suppes, P.1    Atkinson, R.2
  • 60
    • 0001944917 scopus 로고
    • The Evolution of Conventions
    • P. Young, "The Evolution of Conventions, " Econometrica, 61(1), 1993, 57-84.
    • (1993) Econometrica , vol.61 , Issue.1 , pp. 57-84
    • Young, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.