메뉴 건너뛰기




Volumn 22, Issue 1, 2004, Pages 45-58

Reinforcement learning and decision making in monkeys during a competitive game

Author keywords

Game theory; Mixed strategy; Motivation; Prefrontal cortex; Reward; Zero sum game

Indexed keywords

ALGORITHM; ANIMAL EXPERIMENT; ARTICLE; COMPETITION; COMPUTER; COMPUTER PROGRAM; CONTROLLED STUDY; DECISION MAKING; GAME; HISTORY; MALE; MONKEY; NONHUMAN; OCULOMOTOR SYSTEM; PREDICTION; PRIORITY JOURNAL; PROBABILITY; REINFORCEMENT; REWARD; SACCADIC EYE MOVEMENT; VISUAL STIMULATION;

EID: 9244231144     PISSN: 09266410     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.cogbrainres.2004.07.007     Document Type: Article
Times cited : (120)

References (74)
  • 1
    • 0032197276 scopus 로고    scopus 로고
    • Random generation and the executive control of working memory
    • A. Baddeley, H. Emslie, J. Kolodny, and J. Duncan Random generation and the executive control of working memory Q. J. Exp. Psychol. 51 1998 819 852
    • (1998) Q. J. Exp. Psychol. , vol.51 , pp. 819-852
    • Baddeley, A.1    Emslie, H.2    Kolodny, J.3    Duncan, J.4
  • 2
    • 1842612383 scopus 로고    scopus 로고
    • Prefrontal cortex and decision making in a mixed-strategy game
    • D.J. Barraclough, M.L. Conroy, and D. Lee Prefrontal cortex and decision making in a mixed-strategy game Nat. Neurosci. 7 2004 404 410
    • (2004) Nat. Neurosci. , vol.7 , pp. 404-410
    • Barraclough, D.J.1    Conroy, M.L.2    Lee, D.3
  • 3
    • 0037908465 scopus 로고    scopus 로고
    • Does minimax work? An experimental study
    • K. Binmore, J. Swierzbinski, and C. Proulx Does minimax work? An experimental study Econ. J. 111 2001 445 464
    • (2001) Econ. J. , vol.111 , pp. 445-464
    • Binmore, K.1    Swierzbinski, J.2    Proulx, C.3
  • 4
    • 0034988599 scopus 로고    scopus 로고
    • Functional imaging of neural responses to expectancy and experience of monetary gains and losses
    • H.C. Breiter, I. Aharon, D. Kahneman, A. Dale, and P. Shizgal Functional imaging of neural responses to expectancy and experience of monetary gains and losses Neuron 30 2001 619 639
    • (2001) Neuron , vol.30 , pp. 619-639
    • Breiter, H.C.1    Aharon, I.2    Kahneman, D.3    Dale, A.4    Shizgal, P.5
  • 5
    • 0000417537 scopus 로고
    • Testing the minimax hypothesis: A re-examination of O'Neill's game experiment
    • J.N. Brown, and R.W. Rosenthal Testing the minimax hypothesis: a re-examination of O'Neill's game experiment Econometrica 58 1990 1065 1081
    • (1990) Econometrica , vol.58 , pp. 1065-1081
    • Brown, J.N.1    Rosenthal, R.W.2
  • 6
    • 84980140080 scopus 로고
    • Subjective randomization in one- and two-person games
    • D.V. Budescu, and A. Rapoport Subjective randomization in one- and two-person games J. Behav. Decis. Mak. 7 1994 261 278
    • (1994) J. Behav. Decis. Mak. , vol.7 , pp. 261-278
    • Budescu, D.V.1    Rapoport, A.2
  • 10
    • 0033249164 scopus 로고    scopus 로고
    • A neural mechanism that randomises behavior
    • R.H.S. Carpenter A neural mechanism that randomises behavior J. Conscious. Stud. 6 1999 13 22
    • (1999) J. Conscious. Stud. , vol.6 , pp. 13-22
    • Carpenter, R.H.S.1
  • 11
    • 9244228810 scopus 로고    scopus 로고
    • Boundedly rational Nash equilibrium: A probabilistic choice approach
    • H.-C. Chen, J.W. Friedman, and J.-F. Thisse Boundedly rational Nash equilibrium: a probabilistic choice approach Games Econ. Behav. 18 1996 1832 1854
    • (1996) Games Econ. Behav. , vol.18 , pp. 1832-1854
    • Chen, H.-C.1    Friedman, J.W.2    Thisse, J.-F.3
  • 12
    • 0038059890 scopus 로고    scopus 로고
    • Testing mixed-strategy equilibria when players are heterogeneous: The case of penalty kicks in soccer
    • P.-A. Chiappori, S. Levitt, and T. Groseclose Testing mixed-strategy equilibria when players are heterogeneous: the case of penalty kicks in soccer Am. Econ. Rev. 92 2002 1138 1151
    • (2002) Am. Econ. Rev. , vol.92 , pp. 1138-1151
    • Chiappori, P.-A.1    Levitt, S.2    Groseclose, T.3
  • 15
    • 0030663927 scopus 로고    scopus 로고
    • Differential neural response to positive and negative feedback in planning and guessing tasks
    • R. Elliott, C.D. Frith, and R.J. Dolan Differential neural response to positive and negative feedback in planning and guessing tasks Neuropsychologia 35 1997 1395 1404
    • (1997) Neuropsychologia , vol.35 , pp. 1395-1404
    • Elliott, R.1    Frith, C.D.2    Dolan, R.J.3
  • 16
    • 0037218711 scopus 로고    scopus 로고
    • Differential response patterns in the striatum and orbitofrontal cortex to financial reward in humans: A parametric functional magnetic resonance imaging study
    • R. Elliott, J.L. Newman, O.A. Longe, and J.F.W. Deakin Differential response patterns in the striatum and orbitofrontal cortex to financial reward in humans: a parametric functional magnetic resonance imaging study J. Neurosci. 23 2003 303 307
    • (2003) J. Neurosci. , vol.23 , pp. 303-307
    • Elliott, R.1    Newman, J.L.2    Longe, O.A.3    Deakin, J.F.W.4
  • 17
    • 0038829878 scopus 로고    scopus 로고
    • Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
    • I. Erev, and A.E. Roth Predicting how people play games: reinforcement learning in experimental games with unique, mixed strategy equilibria Am. Econ. Rev. 88 1998 848 881
    • (1998) Am. Econ. Rev. , vol.88 , pp. 848-881
    • Erev, I.1    Roth, A.E.2
  • 18
    • 1842855344 scopus 로고    scopus 로고
    • The evolution of brain lateralization: A game-theoretic analysis of population structure
    • S. Ghirlanda, and G. Vallortigara The evolution of brain lateralization: a game-theoretic analysis of population structure Proc. R. Soc. Lond., B 271 2004 853 857
    • (2004) Proc. R. Soc. Lond., B , vol.271 , pp. 853-857
    • Ghirlanda, S.1    Vallortigara, G.2
  • 19
    • 0042858574 scopus 로고    scopus 로고
    • The neurobiology of visual-saccadic decision making
    • P.W. Glimcher The neurobiology of visual-saccadic decision making Annu. Rev. Neurosci. 26 2003 133 179
    • (2003) Annu. Rev. Neurosci. , vol.26 , pp. 133-179
    • Glimcher, P.W.1
  • 20
    • 0035155538 scopus 로고    scopus 로고
    • Neural computations that underlie decisions about sensory stimuli
    • J.I. Gold, and M.N. Shadlen Neural computations that underlie decisions about sensory stimuli Trends Neurosci. 5 2001 10 16
    • (2001) Trends Neurosci. , vol.5 , pp. 10-16
    • Gold, J.I.1    Shadlen, M.N.2
  • 22
    • 0041519443 scopus 로고    scopus 로고
    • Reward-dependent gain and bias of visual responses in primate superior colliculus
    • T. Ikeda, and O. Hikosaka Reward-dependent gain and bias of visual responses in primate superior colliculus Neuron 39 2003 693 700
    • (2003) Neuron , vol.39 , pp. 693-700
    • Ikeda, T.1    Hikosaka, O.2
  • 23
    • 0141642194 scopus 로고    scopus 로고
    • Performance monitoring by the anterior cingulate cortex during saccade countermanding
    • S. Ito, V. Stuphorn, J.W. Brown, and J.D. Schall Performance monitoring by the anterior cingulate cortex during saccade countermanding Science 302 2003 120 122
    • (2003) Science , vol.302 , pp. 120-122
    • Ito, S.1    Stuphorn, V.2    Brown, J.W.3    Schall, J.D.4
  • 26
    • 9244246066 scopus 로고
    • Responsiveness in two-person zero-sum games
    • J. Kahan, and D.J. Goehring Responsiveness in two-person zero-sum games Behav. Sci. 18 1973 27 33
    • (1973) Behav. Sci. , vol.18 , pp. 27-33
    • Kahan, J.1    Goehring, D.J.2
  • 28
    • 0032150808 scopus 로고    scopus 로고
    • Expectation of reward modulates cognitive signals in the basal ganglia
    • R. Kawagoe, Y. Takikawa, and O. Hikosaka Expectation of reward modulates cognitive signals in the basal ganglia Nat. Neurosci. 1 1998 411 416
    • (1998) Nat. Neurosci. , vol.1 , pp. 411-416
    • Kawagoe, R.1    Takikawa, Y.2    Hikosaka, O.3
  • 30
    • 0033213255 scopus 로고    scopus 로고
    • Effect of expected reward magnitude on the response of neurons in the dorsolateral prefrontal cortex of the macaque
    • M.I. Leon, and M.N. Shadlen Effect of expected reward magnitude on the response of neurons in the dorsolateral prefrontal cortex of the macaque Neuron 24 1999 415 425
    • (1999) Neuron , vol.24 , pp. 415-425
    • Leon, M.I.1    Shadlen, M.N.2
  • 31
    • 0039448517 scopus 로고
    • Experimental studies of conflict in some two-person and three-person games
    • J.H. Criswell H. Solomon P. Suppes Stanford Univ. Press Stanford
    • B. Liberman Experimental studies of conflict in some two-person and three-person games J.H. Criswell H. Solomon P. Suppes Mathematical Methods in Small Group Processes 1962 Stanford Univ. Press Stanford 203 220
    • (1962) Mathematical Methods in Small Group Processes , pp. 203-220
    • Liberman, B.1
  • 32
    • 85149834820 scopus 로고
    • Markov games as a framework for multi-agent reinforcement learning
    • Morgan Kaufmann San Francisco
    • M.L. Littman Markov games as a framework for multi-agent reinforcement learning Machine Learning: Proc. 11th Int. Conf. 1994 Morgan Kaufmann San Francisco 157 163
    • (1994) Machine Learning: Proc. 11th Int. Conf. , pp. 157-163
    • Littman, M.L.1
  • 34
    • 0042571477 scopus 로고
    • The behavior of responsive individuals playing a two-person, zero-sum game requiring the use of mixed strategies
    • D. Malcolm, and B. Liberman The behavior of responsive individuals playing a two-person, zero-sum game requiring the use of mixed strategies Psychon. Sci. 2 1965 373 374
    • (1965) Psychon. Sci. , vol.2 , pp. 373-374
    • Malcolm, D.1    Liberman, B.2
  • 36
    • 0348166371 scopus 로고    scopus 로고
    • Quantal response equilibria for normal form games
    • R.D. McKelvey, and T.R. Palfrey Quantal response equilibria for normal form games Games Econ. Behav. 7 1996 6 38
    • (1996) Games Econ. Behav. , vol.7 , pp. 6-38
    • McKelvey, R.D.1    Palfrey, T.R.2
  • 37
    • 0014037177 scopus 로고
    • Interdependent decision strategies in zero-sum games: A computer-controlled study
    • D.M. Messick Interdependent decision strategies in zero-sum games: a computer-controlled study Behav. Sci. 12 1967 33 48
    • (1967) Behav. Sci. , vol.12 , pp. 33-48
    • Messick, D.M.1
  • 38
    • 0005770331 scopus 로고
    • Note on the bias of information estimates
    • H. Quastler Free Press Glencoe
    • G.A. Miller Note on the bias of information estimates H. Quastler Information Theory in Psychology 1955 Free Press Glencoe 95 100
    • (1955) Information Theory in Psychology , pp. 95-100
    • Miller, G.A.1
  • 39
    • 0002053554 scopus 로고
    • Learning behavior in an experimental matching pennies game
    • D. Mookherjee, and B. Sopher Learning behavior in an experimental matching pennies game Games Econ. Behav. 7 1994 62 91
    • (1994) Games Econ. Behav. , vol.7 , pp. 62-91
    • Mookherjee, D.1    Sopher, B.2
  • 40
    • 0002021736 scopus 로고
    • Equilibrium points in n-person games
    • J.F. Nash Equilibrium points in n-person games Proc. Natl. Acad. Sci. 36 1950 48 49
    • (1950) Proc. Natl. Acad. Sci. , vol.36 , pp. 48-49
    • Nash, J.F.1
  • 41
    • 0002990294 scopus 로고
    • Can people behave "randomly?": The role of feedback
    • A. Neuringer Can people behave "randomly?": the role of feedback J. Exp. Psychol. Gen. 115 1986 62 75
    • (1986) J. Exp. Psychol. Gen. , vol.115 , pp. 62-75
    • Neuringer, A.1
  • 42
    • 85047674337 scopus 로고    scopus 로고
    • The production and perception of randomness
    • R.S. Nickerson The production and perception of randomness Psychol. Rev. 109 2002 330 357
    • (2002) Psychol. Rev. , vol.109 , pp. 330-357
    • Nickerson, R.S.1
  • 43
    • 0009941386 scopus 로고
    • Games with unique, mixed strategy equilibria: An experimental study
    • J. Ochs Games with unique, mixed strategy equilibria: an experimental study Games Econ. Behav. 10 1995 202 217
    • (1995) Games Econ. Behav. , vol.10 , pp. 202-217
    • Ochs, J.1
  • 44
  • 45
    • 0023323633 scopus 로고
    • Nonmetric test of the minimax theory of two-person zerosum games
    • B. O'Neill Nonmetric test of the minimax theory of two-person zerosum games Proc. Natl. Acad. Sci. U. S. A. 84 1987 2106 2109
    • (1987) Proc. Natl. Acad. Sci. U. S. A. , vol.84 , pp. 2106-2109
    • O'Neill, B.1
  • 46
    • 0009428959 scopus 로고
    • Comments on Brown and Rosenthal's reexamination
    • B. O'Neill Comments on Brown and Rosenthal's reexamination Econometrica 59 1991 503 507
    • (1991) Econometrica , vol.59 , pp. 503-507
    • O'Neill, B.1
  • 47
    • 0038135321 scopus 로고    scopus 로고
    • Professionals play minimax
    • I. Palacios-Huerta Professionals play minimax Rev. Econ. Stud. 70 2003 395 415
    • (2003) Rev. Econ. Stud. , vol.70 , pp. 395-415
    • Palacios-Huerta, I.1
  • 48
    • 0033566079 scopus 로고    scopus 로고
    • Neural correlates of decision variables in parietal cortex
    • M.L. Platt, and P.W. Glimcher Neural correlates of decision variables in parietal cortex Nature 400 1999 233 238
    • (1999) Nature , vol.400 , pp. 233-238
    • Platt, M.L.1    Glimcher, P.W.2
  • 49
    • 0037370734 scopus 로고    scopus 로고
    • Instructed delay activity in the human prefrontal cortex is modulated by monetary reward expectation
    • N. Ramnani, and R.C. Miall Instructed delay activity in the human prefrontal cortex is modulated by monetary reward expectation Cereb. Cortex 13 2003 318 327
    • (2003) Cereb. Cortex , vol.13 , pp. 318-327
    • Ramnani, N.1    Miall, R.C.2
  • 50
    • 38249016013 scopus 로고
    • Mixed strategies in strictly competitive games: A further test of the minimax hypothesis
    • A. Rapoport, and R.B. Boebel Mixed strategies in strictly competitive games: a further test of the minimax hypothesis Games Econ. Behav. 4 1992 261 283
    • (1992) Games Econ. Behav. , vol.4 , pp. 261-283
    • Rapoport, A.1    Boebel, R.B.2
  • 51
    • 33748690481 scopus 로고
    • Generation of random series in two-person strictly competitive games
    • A. Rapoport, and D.V. Budescu Generation of random series in two-person strictly competitive games J. Exp. Psychol. Gen. 121 1992 352 363
    • (1992) J. Exp. Psychol. Gen. , vol.121 , pp. 352-363
    • Rapoport, A.1    Budescu, D.V.2
  • 52
    • 21744450698 scopus 로고    scopus 로고
    • Randomization in individual choice behavior
    • A. Rapoport, and D.V. Budescu Randomization in individual choice behavior Psychol. Rev. 104 1997 603 617
    • (1997) Psychol. Rev. , vol.104 , pp. 603-617
    • Rapoport, A.1    Budescu, D.V.2
  • 53
    • 0141565150 scopus 로고    scopus 로고
    • Impact of expected reward on neuronal activity in prefrontal cortex, frontal and supplementary eye fields and premotor cortex
    • M.R. Roesch, and C.R. Olson Impact of expected reward on neuronal activity in prefrontal cortex, frontal and supplementary eye fields and premotor cortex J. Neurophysiol. 90 2003 1766 1789
    • (2003) J. Neurophysiol. , vol.90 , pp. 1766-1789
    • Roesch, M.R.1    Olson, C.R.2
  • 54
    • 0033570330 scopus 로고    scopus 로고
    • Choosing between small, likely rewards and large, unlikely rewards activates inferior and orbital prefrontal cortex
    • R.D. Rogers, A.M. Owen, H.C. Middleton, E.J. Williams, J.D. Picard, B.J. Sakakian, and T.W. Robbins Choosing between small, likely rewards and large, unlikely rewards activates inferior and orbital prefrontal cortex J. Neurosci. 19 1999 9029 9038
    • (1999) J. Neurosci. , vol.19 , pp. 9029-9038
    • Rogers, R.D.1    Owen, A.M.2    Middleton, H.C.3    Williams, E.J.4    Picard, J.D.5    Sakakian, B.J.6    Robbins, T.W.7
  • 56
    • 0037366070 scopus 로고    scopus 로고
    • Flutter discrimination: Neural codes, perception, memory and decision making
    • R. Romo, and E. Salinas Flutter discrimination: neural codes, perception, memory and decision making Nat. Rev., Neurosci. 4 2003 203 218
    • (2003) Nat. Rev., Neurosci. , vol.4 , pp. 203-218
    • Romo, R.1    Salinas, E.2
  • 57
    • 0000135041 scopus 로고    scopus 로고
    • Predicting how people play games: A simple dynamic model of choice
    • R. Sarin, and F. Vahid Predicting how people play games: a simple dynamic model of choice Games Econ. Behav. 34 2001 104 122
    • (2001) Games Econ. Behav. , vol.34 , pp. 104-122
    • Sarin, R.1    Vahid, F.2
  • 58
    • 0037720181 scopus 로고    scopus 로고
    • Neural correlates of decision processes: Neural and mental chronometry
    • J.D. Schall Neural correlates of decision processes: neural and mental chronometry Curr. Opin. Neurobiol. 13 2003 182 186
    • (2003) Curr. Opin. Neurobiol. , vol.13 , pp. 182-186
    • Schall, J.D.1
  • 59
    • 0031867046 scopus 로고    scopus 로고
    • Predictive reward signal of dopamine neurons
    • W. Schultz Predictive reward signal of dopamine neurons J. Neurophysiol. 80 1998 1 27
    • (1998) J. Neurophysiol. , vol.80 , pp. 1-27
    • Schultz, W.1
  • 60
    • 0037057755 scopus 로고    scopus 로고
    • Getting formal with dopamine and reward
    • W. Schultz Getting formal with dopamine and reward Neuron 36 2002 241 263
    • (2002) Neuron , vol.36 , pp. 241-263
    • Schultz, W.1
  • 61
    • 1842684992 scopus 로고    scopus 로고
    • Neural coding of basic reward terms of animal learning theory, game theory, microeconomics, and behavioral ecology
    • W. Schultz Neural coding of basic reward terms of animal learning theory, game theory, microeconomics, and behavioral ecology Curr. Opin. Neurobiol. 14 2004 139 147
    • (2004) Curr. Opin. Neurobiol. , vol.14 , pp. 139-147
    • Schultz, W.1
  • 62
    • 0036271403 scopus 로고    scopus 로고
    • Mixed strategy play and the minimax hypothesis
    • J.M. Shachat Mixed strategy play and the minimax hypothesis J. Econ. Theory 104 2002 189 226
    • (2002) J. Econ. Theory , vol.104 , pp. 189-226
    • Shachat, J.M.1
  • 63
    • 0037205034 scopus 로고    scopus 로고
    • Anterior cingulate: Single neuronal signals related to degree of reward expectancy
    • M. Shidara, and B.J. Richmond Anterior cingulate: single neuronal signals related to degree of reward expectancy Science 296 2002 1709 1711
    • (2002) Science , vol.296 , pp. 1709-1711
    • Shidara, M.1    Richmond, B.J.2
  • 65
    • 0034649633 scopus 로고    scopus 로고
    • Performance monitoring by the supplementary eye field
    • V. Stuphorn, T.L. Taylor, and J.D. Schall Performance monitoring by the supplementary eye field Nature 408 2000 857 860
    • (2000) Nature , vol.408 , pp. 857-860
    • Stuphorn, V.1    Taylor, T.L.2    Schall, J.D.3
  • 66
    • 2942726234 scopus 로고    scopus 로고
    • Matching behavior and the representation of value in parietal cortex
    • L.P. Sugrue, G.S. Corrado, and W.T. Newsome Matching behavior and the representation of value in parietal cortex Science 304 2004 1782 1787
    • (2004) Science , vol.304 , pp. 1782-1787
    • Sugrue, L.P.1    Corrado, G.S.2    Newsome, W.T.3
  • 68
    • 84964109604 scopus 로고
    • Decision processes, expectations, and adoption of strategies in zero-sum games
    • J.C. Touhey Decision processes, expectations, and adoption of strategies in zero-sum games Hum. Relat. 27 1974 813 824
    • (1974) Hum. Relat. , vol.27 , pp. 813-824
    • Touhey, J.C.1
  • 69
    • 0034036369 scopus 로고    scopus 로고
    • Reward-related neuronal activity during go-no go task performance in primate orbitofrontal cortex
    • L. Tremblay, and W. Schultz Reward-related neuronal activity during go-no go task performance in primate orbitofrontal cortex J. Neurophysiol. 83 2000 1864 1876
    • (2000) J. Neurophysiol. , vol.83 , pp. 1864-1876
    • Tremblay, L.1    Schultz, W.2
  • 70
    • 0001749380 scopus 로고
    • A brief survey of variables that influence random generation
    • G.S. Tune A brief survey of variables that influence random generation Percept. Mot. Skills 18 1964 705 710
    • (1964) Percept. Mot. Skills , vol.18 , pp. 705-710
    • Tune, G.S.1
  • 72
    • 0001949035 scopus 로고
    • Generation of random sequences of human subjects: Critical survey of literature
    • W.A. Wagenaar Generation of random sequences of human subjects: critical survey of literature Psychol. Bull. 77 1972 65 72
    • (1972) Psychol. Bull. , vol.77 , pp. 65-72
    • Wagenaar, W.A.1
  • 73
    • 0037722335 scopus 로고    scopus 로고
    • Minimax play at Wimbledon
    • M. Walker, and J. Wooders Minimax play at Wimbledon Am. Econ. Rev. 91 2001 1521 1538
    • (2001) Am. Econ. Rev. , vol.91 , pp. 1521-1538
    • Walker, M.1    Wooders, J.2
  • 74
    • 0029782802 scopus 로고    scopus 로고
    • Reward expectancy in primate prefrontal cortex
    • M. Watanabe Reward expectancy in primate prefrontal cortex Nature 382 1996 629 632
    • (1996) Nature , vol.382 , pp. 629-632
    • Watanabe, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.