SCOPUS Information Retrieval Platform

B.E. Journal of Theoretical Economics

Volume 1, Issue 1, 2001, Pages

Reinforcement Learning in Repeated Interaction Games

(3) Bendor, Jonathan a Mookherjee, Dilip b Ray, Debraj c

a STANFORD UNIVERSITY (United States)

b BOSTON UNIVERSITY (United States)

c NEW YORK UNIVERSITY (United States)

Author keywords

aspirations; bounded rationality; cooperation; coordination; reinforcement learning

Indexed keywords

EID: 0038660317 PISSN: None EISSN: 19351704 Source Type: Journal
DOI: 10.2202/1534-5963.1008 Document Type: Article

Times cited : (28)

References (60)

1
- 0001784118
- On Designing Economic Agents that Behave like Human Agents
- B. Arthur, "On Designing Economic Agents that Behave Like Human Agents, " Journal of Evolutionary Economics 3, 1993, 1-22.
- (1993) Journal of Evolutionary Economics , vol.3 , pp. 1-22
- Arthur, B.¹

2
- 85108918188
- Aspirations, Adaptive Learning and Cooperation in Repeated Games
- J. Bendor, D. Mookherjee and D. Ray, "Aspirations, Adaptive Learning and Cooperation in Repeated Games", Discussion Paper, Planning Unit, Indian Statistical Institute, New Delhi, 1992.
- (1992) Discussion Paper, Planning Unit, Indian Statistical Institute, New Delhi
- Bendor, J.¹ Mookherjee, D.² Ray, D.³

3
- 0010875005
- Aspirations, Adaptive Learning and Cooperation in Repeated Games
- J. Bendor, D. Mookherjee and D. Ray, "Aspirations, Adaptive Learning and Cooperation in Repeated Games," Discussion Paper No. 9442, Center for Economic Research, Tilburg University, May 1994. Revised, mimeo, Department of Economics, Boston University, 1995.
- (1994) Discussion Paper No. 9442, Center for Economic Research, Tilburg University; revised, mimeo, Department of Economics, Boston University, 1995
- Bendor, J.¹ Mookherjee, D.² Ray, D.³

4
- 85108904760
- Aspiration-Based Reinforcement Learning in Repeated Games: An Overview
- J. Bendor, D. Mookherjee and D. Ray, "Aspiration-Based Reinforcement Learning in Repeated Games: An Overview, " mimeo, Department of Economics, Boston University, 2000.
- (2000) Mimeo, Department of Economics, Boston University
- Bendor, J.¹ Mookherjee, D.² Ray, D.³

5
- 0031161454
- Muddling Through: Noisy Equilibrium Selection
- K. Binmore and L. Samuelson, "Muddling Through: Noisy Equilibrium Selection, " Journal of Economic Theory, 74, 1997, 235-265.
- (1997) Journal of Economic Theory , vol.74 , pp. 235-265
- Binmore, K.¹ Samuelson, L.²

6
- 0031281590
- Learning through Reinforcement and Replicator Dynamics
- T. Börgers and R. Sarin, "Learning through Reinforcement and Replicator Dynamics, " Journal of Economic Theory 77, 1997, 1-14.
- (1997) Journal of Economic Theory , vol.77 , pp. 1-14
- Börgers, T.¹ Sarin, R.²

7
- 0042571479
- Naive Reinforcement Learning with Endogenous Aspirations
- T. Börgers and R. Sarin, "Naive Reinforcement Learning with Endogenous Aspirations," International Economic Review 41, 2000, 921-950.
- (2000) International Economic Review , vol.41 , pp. 921-950
- Börgers, T.¹ Sarin, R.²

8
- 85108902620
- Simple Behavior Rules which Lead to Expected Payoff Maximizing Choices
- T. Börgers, A. Morales and R. Sarin, "Simple Behavior Rules which Lead to Expected Payoff Maximizing Choices, " mimeo, University College, London, 1998.
- (1998) Mimeo, University College, London
- Börgers, T.¹ Morales, A.² Sarin, R.³

9
- 0003781528
- New York: John Wiley and Sons
- R. Bush and F. Mosteller, Stochastic Models of Learning, New York: John Wiley and Sons, 1955.
- (1955) Stochastic Models of Learning
- Bush, R.¹ Mosteller, F.²

10
- 29144481480
- A formal structure for multiple choice situations
- edited by R.M. Thrall, C.H. Coombs and R.L. Davis, New York: Wiley
- R. Bush, F. Mosteller and G. Thompson, "A Formal Structure For Multiple Choice Situations," in Decision Processes, edited by R.M. Thrall, C.H. Coombs and R.L. Davis, 1954, New York: Wiley.
- (1954) Decision Processes
- Bush, R.¹ Mosteller, F.² Thompson, G.³

11
- 18644365144
- Experience-weighted Attraction Learning in Normal Form Games
- C. Camerer and T. Ho, "Experience-weighted Attraction Learning in Normal Form Games, " Econometrica, 67(4), 1999, 827-874.
- (1999) Econometrica , vol.67 , Issue.4 , pp. 827-874
- Camerer, C.¹ Ho, T.²

12
- 0000742255
- A Stochastic Learning Model of Economic Behavior
- J. Cross, "A Stochastic Learning Model of Economic Behavior, " Quarterly Journal of Economics, 87 (1973), 239-266.
- (1973) Quarterly Journal of Economics , vol.87 , pp. 239-266
- Cross, J.¹

13
- 0004192228
- R. Cyert and J. March, A Behavioral Theory of the Firm. Englewood-Cliffs, NJ: Prentice-Hall, 1963.
- (1963) A Behavioral Theory of the Firm. Englewood-Cliffs, NJ: Prentice-Hall
- Cyert, R.¹ March, J.²

14
- 0009649778
- Keeping up with the Joneses: Competition and the Evolution of Collusion
- H.D. Dixon, "Keeping up with the Joneses: Competition and the Evolution of Collusion," Journal of Economic Behavior and Organization, 2000, forthcoming.
- (2000) Journal of Economic Behavior and Organization
- Dixon, H.D.¹

15
- 0038829878
- Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique Mixed Strategy Equilibria
- I. Erev and A. Roth, "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique Mixed Strategy Equilibria," American Economic Review, 88, 1998, 848-881.
- (1998) American Economic Review , vol.88 , pp. 848-881
- Erev, I.¹ Roth, A.²

16
- 0012861020
- Individual behavior in uncertain situations: An interpretation in terms of statistical association theory
- edited by R.M. Thrall, C.H. Coombs and R.L. Davis, New York Wiley
- W. Estes, "Individual Behavior in Uncertain Situations: An Interpretation in Terms of Statistical Association Theory, " in Decision Processes, edited by R.M. Thrall, C.H. Coombs and R.L. Davis, 1954, New York: Wiley.
- (1954) Decision Processes
- Estes, W.¹

17
- 77952373089
- Case-based Decision Theory
- I. Gilboa and D. Schmeidler, "Case-based Decision Theory, " Quarterly Journal of Economics, 110, 1995, 605-640.
- (1995) Quarterly Journal of Economics , vol.110 , pp. 605-640
- Gilboa, I.¹ Schmeidler, D.²

18
- 0030191360
- Case-based Optimization
- I. Gilboa and D. Schmeidler, "Case-based Optimization, " Games and Economic Behavior, 15 (1996), 1-26.
- (1996) Games and Economic Behavior , vol.15 , pp. 1-26
- Gilboa, I.¹ Schmeidler, D.²

19
- 0000825694
- Evolving Aspirations and Cooperation
- R. Karandikar, D. Mookherjee, D. Ray and F. Vega-Redondo, "Evolving Aspirations and Cooperation," Journal of Economic Theory, 80, 1998, 292-331.
- (1998) Journal of Economic Theory , vol.80 , pp. 292-331
- Karandikar, R.¹ Mookherjee, D.² Ray, D.³ Vega-Redondo, F.⁴

20
- 85108914275
- Satisficing, Cooperation and Coordination
- Y. Kim, "Satisficing, Cooperation and Coordination, " mimeo, Department of Economics, Queen Mary and Westfield College, University of London, 1995a.
- (1995) Mimeo, Department of Economics, Queen Mary and Westfield College, University of London
- Kim, Y.¹

21
- 85108906059
- A Satisficing Model of Learning in Extensive Form Games
- Y. Kim, "A Satisficing Model of Learning in Extensive Form Games, " mimeo, Department of Economics, Yonsei University, Seoul, 1995b.
- (1995) Mimeo, Department of Economics, Yonsei University, Seoul
- Kim, Y.¹

22
- 0000038653
- How to Decide How to Decide How to ..: Modeling Limited Rationality
- B. Lipman (1991), "How to Decide How to Decide How to ..: Modeling Limited Rationality, " Econometrica 59, 1105-1125.
- (1991) Econometrica , vol.59 , pp. 1105-1125
- Lipman, B.¹

23
- 84943558017
- John Wiley
- R. Duncan Luce (1959), Individual Choice Behavior, John Wiley.
- (1959) Individual Choice Behavior
- Duncan Luce, R.¹

24
- 0003637131
- London, New York: Springer-Verlag
- S.P. Meyn and R.L. Tweedie (1993) Markov Chains and Stochastic Stability, London, New York: Springer-Verlag.
- (1993) Markov Chains and Stochastic Stability
- Meyn, S.P.¹ Tweedie, R.L.²

25
- 0002053554
- Learning Behavior in an Experimental Matching Pennies Game
- D. Mookherjee and B. Sopher, "Learning Behavior in an Experimental Matching Pennies Game, " Games and Economic Behavior 7, 1994, 62-91.
- (1994) Games and Economic Behavior , vol.7 , pp. 62-91
- Mookherjee, D.¹ Sopher, B.²

26
- 0002159270
- Learning and Decision Costs in Experimental Constant Sum Games
- D. Mookherjee and B. Sopher, "Learning and Decision Costs in Experimental Constant Sum Games, " Games and Economic Behavior 19, 1997, 97-132.
- (1997) Games and Economic Behavior , vol.19 , pp. 97-132
- Mookherjee, D.¹ Sopher, B.²

27
- 0020813107
- The Use of Learning Algorithms in Telephone Traffic Routing: A Methodology
- K. Narendra and P. Mars, "The Use of Learning Algorithms in Telephone Traffic Routing: A Methodology, " Automatica, 19(5), 1983, 495-502.
- (1983) Automatica , vol.19 , Issue.5 , pp. 495-502
- Narendra, K.¹ Mars, P.²

28
- 0003891507
- Englewood Cliffs: Prentice Hall
- K. Narendra and M. Thathachar, Learning Automata: An Introduction, Englewood Cliffs: Prentice Hall, 1989.
- (1989) Learning Automata: An Introduction
- Narendra, K.¹ Thathachar, M.²

29
- 0003831870
- Cambridge, Massachusetts: Harvard University Press
- R. Nelson and S. Winter, An Evolutionary Theory of Economic Change. Cambridge, Massachusetts: Harvard University Press, 1982.
- (1982) An Evolutionary Theory of Economic Change
- Nelson, R.¹ Winter, S.²

30
- 0003801740
- Department of Economics, University of Pittsburgh
- I. Erev and A. Roth, "On the Need for Low Rationality, Cognitive Game Theory: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria, " working paper, August 1995, Department of Economics, University of Pittsburgh.
- (1995) On the Need for Low Rationality, Cognitive Game Theory: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria Working Paper, August
- Erev, I.¹ Roth, A.²

31
- 0038829878
- Predicting how people play games: Reinforcement learning in experimental games with unique mixed strategy equilibria
- I. Erev and A. Roth, "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique Mixed Strategy Equilibria, " American Economic Review, 88, 1998, 848-881.
- (1998) American Economic Review , vol.88 , pp. 848-881
- Erev, I.¹ Roth, A.²

32
- 0012861020
- Individual behavior in uncertain situations: An interpretation in terms of statistical association theory
- edited by R.M. Thrall, C.H. Coombs and R.L. Davis, New York Wiley
- W. Estes, "Individual Behavior in Uncertain Situations: An Interpretation in Terms of Statistical Association Theory, " in Decision Processes, edited by R.M. Thrall, C.H. Coombs and R.L. Davis, 1954, New York: Wiley.
- (1954) Decision Processes
- Estes, W.¹

33
- 77952373089
- Case-based decision theory
- I. Gilboa and D. Schmeidler, "Case-based Decision Theory, " Quarterly Journal of Economics, 110, 1995, 605-640.
- (1995) Quarterly Journal of Economics , vol.110 , pp. 605-640
- Gilboa, I.¹ Schmeidler, D.²

34
- 0030191360
- Case-based Optimization
- I. Gilboa and D. Schmeidler, "Case-based Optimization, " Games and Economic Behavior, 15 (1996), 1-26.
- (1996) Games and Economic Behavior , vol.15 , pp. 1-26
- Gilboa, I.¹ Schmeidler, D.²

35
- 0000825694
- Evolving Aspirations and Cooperation
- R. Karandikar, D. Mookherjee, D. Ray and F. Vega-Redondo, "Evolving Aspirations and Cooperation," Journal of Economic Theory, 80, 1998, 292-331.
- (1998) Journal of Economic Theory , vol.80 , pp. 292-331
- Karandikar, R.¹ Mookherjee, D.² Ray, D.³ Vega-Redondo, F.⁴

36
- 85108914275
- Satisficing, Cooperation and Coordination
- Y. Kim, "Satisficing, Cooperation and Coordination, " mimeo, Department of Economics, Queen Mary and Westfield College, University of London, 1995a.
- (1995) Mimeo, Department of Economics, Queen Mary and Westfield College, University of London
- Kim, Y.¹

37
- 85108906059
- A satisficing model of learning in extensive form games
- Y. Kim, "A Satisficing Model of Learning in Extensive Form Games, " mimeo, Department of Economics, Yonsei University, Seoul, 1995b.
- (1995) Mimeo, Department of Economics, Yonsei University, Seoul
- Kim, Y.¹

38
- 0000038653
- How to Decide How to Decide How to ..: Modeling Limited Rationality
- B. Lipman (1991), "How to Decide How to Decide How to ..: Modeling Limited Rationality, " Econometrica 59, 1105-1125.
- (1991) Econometrica , vol.59 , pp. 1105-1125
- Lipman, B.¹

39
- 84943558017
- John Wiley
- R. Duncan Luce (1959), Individual Choice Behavior, John Wiley.
- (1959) Individual Choice Behavior
- Duncan Luce, R.¹

40
- 0003637131
- London, New York: Springer-Verlag
- S.P. Meyn and R.L. Tweedie (1993) Markov Chains and Stochastic Stability, London, New York: Springer-Verlag.
- (1993) Markov Chains and Stochastic Stability
- Meyn, S.P.¹ Tweedie, R.L.²

41
- 0002053554
- Learning behavior in an experimental matching pennies game
- D. Mookherjee and B. Sopher, "Learning Behavior in an Experimental Matching Pennies Game, " Games and Economic Behavior 7, 1994, 62-91.
- (1994) Games and Economic Behavior , vol.7 , pp. 62-91
- Mookherjee, D.¹ Sopher, B.²

42
- 0002159270
- Learning and decision costs in experimental constant sum games
- D. Mookherjee and B. Sopher, "Learning and Decision Costs in Experimental Constant Sum Games, " Games and Economic Behavior 19, 1997, 97-132.
- (1997) Games and Economic Behavior , vol.19 , pp. 97-132
- Mookherjee, D.¹ Sopher, B.²

43
- 0020813107
- The use of learning algorithms in telephone traffic routing: A methodology
- K. Narendra and P. Mars, "The Use of Learning Algorithms in Telephone Traffic Routing: A Methodology, " Automatica, 19(5), 1983, 495-502.
- (1983) Automatica , vol.19 , Issue.5 , pp. 495-502
- Narendra, K.¹ Mars, P.²

44
- 0003891507
- Englewood Cliffs: Prentice Hall
- K. Narendra and M. Thathachar, Learning Automata: An Introduction, Englewood Cliffs: Prentice Hall, 1989.
- (1989) Learning Automata: An Introduction
- Narendra, K.¹ Thathachar, M.²

45
- 0003831870
- Cambridge, Massachusetts: Harvard University Press
- R. Nelson and S. Winter, An Evolutionary Theory of Economic Change. Cambridge, Massachusetts: Harvard University Press, 1982.
- (1982) An Evolutionary Theory of Economic Change
- Nelson, R.¹ Winter, S.²

46
- 0003722746
- New York and London: Academic Press
- M. F. Norman, Markov Processes and Learning Models, New York and London: Academic Press, 1972.
- (1972) Markov Processes and Learning Models
- Norman, M.F.¹

47
- 0033481949
- Convergence of Aspirations and (Partial) Cooperation in the Prisoner's Dilemma
- F. Palomino and F. Vega-Redondo, "Convergence of Aspirations and (Partial) Cooperation in the Prisoner's Dilemma," International Journal of Game Theory, 28(4), 1999, 465-488.
- (1999) International Journal of Game Theory , vol.28 , Issue.4 , pp. 465-488
- Palomino, F.¹ Vega-Redondo, F.²

48
- 0005107096
- Learning algorithms for repeated bimatrix nash games with incomplete information
- G. Papavassilopoulos, "Learning Algorithms for Repeated Bimatrix Nash Games with Incomplete Information, " Journal of Optimization Theory and Applications, 62(3), 1989, 467-488.
- (1989) Journal of Optimization Theory and Applications , vol.62 , Issue.3 , pp. 467-488
- Papavassilopoulos, G.¹

49
- 0003883777
- New York: Academic Press
- K. R. Parthasarathy (1967), Probability Measures on Metric Spaces, New York: Academic Press.
- (1967) Probability Measures on Metric Spaces
- Parthasarathy, K.R.¹

50
- 0031329041
- Satisficing leads to cooperation in mutual interest games
- A. Pazgal, "Satisficing Leads to Cooperation in Mutual Interest Games", International Journal of Game Theory, 26, 1997, 439-453.
- (1997) International Journal of Game Theory , vol.26 , pp. 439-453
- Pazgal, A.¹

51
- 0030194779
- Efficient equilibrium selection in evolutionary games with random matching
- A. Robson and F. Vega-Redondo, "Efficient Equilibrium Selection in Evolutionary Games with Random Matching, " Journal of Economic Theory, 70(1), 1996, 65-92.
- (1996) Journal of Economic Theory , vol.70 , Issue.1 , pp. 65-92
- Robson, A.¹ Vega-Redondo, F.²

52
- 58149324992
- Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term
- A. Roth and I. Erev, "Learning in extensive-form games: experimental data and simple dynamic models in the intermediate term," Games Econ. Behav. 8, 1995, 164-212.
- (1995) Games and Economic Behavior , vol.8 , pp. 164-212
- Roth, A.¹ Erev, I.²

53
- 0000319195
- The Chain Store Paradox
- R. Selten, "The Chain Store Paradox," Theory and Decision, 9, 1978, 127-159.
- (1978) Theory and Decision , vol.9 , pp. 127-159
- Selten, R.¹

54
- 0003163893
- Evolution, Learning and Economic Behavior
- R. Selten, "Evolution, Learning and Economic Behavior, " Games and Economic Behavior 3 (1991), 3-24.
- (1991) Games and Economic Behavior , vol.3 , pp. 3-24
- Selten, R.¹

55
- 46149136660
- End behavior in sequences of finite prisoners' dilemma supergames
- R. Selten and R. Stoecker, "End behavior in sequences of finite Prisoners' Dilemma supergames," J. Econ. Behav. Organ. 7 (1986), 47-70.
- (1986) J. Econ. Behav. Organ , vol.7 , pp. 47-70
- Selten, R.¹ Stoecker, R.²

56
- 84959810873
- A Behavioral Model of Rational Choice
- H. Simon, "A Behavioral Model of Rational Choice, " Quarterly Journal of Economics, 69 (1955), 99-118.
- (1955) Quarterly Journal of Economics , vol.69 , pp. 99-118
- Simon, H.¹

57
- 0004184073
- New York
- H. Simon, Models of Man, 1957, New York.
- (1957) Models of Man
- Simon, H.¹

58
- 0000629644
- Theories of Decision Making in Economics and Behavioral Science
- H. Simon, "Theories of Decision Making in Economics and Behavioral Science, " American Economic Review, 49(1), 1959, 253-283.
- (1959) American Economic Review , vol.49 , Issue.1 , pp. 253-283
- Simon, H.¹

59
- 0004131079
- Markov Learning Models for Multiperson Interactions
- Stanford: Stanford University Press
- P. Suppes and R. Atkinson, Markov Learning Models for Multiperson Interactions, Stanford: Stanford University Press, 1960.
- (1960) Markov Learning Models for Multiperson Interactions
- Suppes, P.¹ Atkinson, R.²

60
- 0001944917
- The Evolution of Conventions
- P. Young, "The Evolution of Conventions, " Econometrica, 61(1), 1993, 57-84.
- (1993) Econometrica , vol.61 , Issue.1 , pp. 57-84
- Young, P.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.