Volume 66, Issue 4, 2010, Pages 585-595

States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning

Author keywords

Sysneuro

Indexed keywords

ADULT; ARTICLE; BRAIN REGION; COMPUTER MODEL; CORPUS STRIATUM; DECISION MAKING; FEMALE; HUMAN; HUMAN EXPERIMENT; LEARNING; MALE; NERVE CELL NETWORK; NORMAL HUMAN; NUCLEAR MAGNETIC RESONANCE IMAGING; PREFRONTAL CORTEX; PRIORITY JOURNAL; PROBABILITY; REINFORCEMENT; SIGNAL DETECTION; SUBICULUM; TASK PERFORMANCE;

EID: 77953260848     PISSN: 08966273     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.neuron.2010.04.016     Document Type: Article
Times cited : (902)

References (55)
  • 1
    • 34547670815
    • The role of the dorsal striatum in reward and decision-making
    • Balleine B.W., Delgado M.R., Hikosaka O. The role of the dorsal striatum in reward and decision-making. J. Neurosci. 2007, 27:8161-8165.
    • (2007) J. Neurosci. , vol.27 , pp. 8161-8165
    • Balleine, B.W.1    Delgado, M.R.2    Hikosaka, O.3
  • 2
    • 1842612383
    • Prefrontal cortex and decision making in a mixed-strategy game
    • Barraclough D.J., Conroy M.L., Lee D. Prefrontal cortex and decision making in a mixed-strategy game. Nat. Neurosci. 2004, 7:404-410.
    • (2004) Nat. Neurosci. , vol.7 , pp. 404-410
    • Barraclough, D.J.1    Conroy, M.L.2    Lee, D.3
  • 3
    • 21544435722
    • Midbrain dopamine neurons encode a quantitative reward prediction error signal
    • Bayer H.M., Glimcher P.W. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 2005, 47:129-141.
    • (2005) Neuron , vol.47 , pp. 129-141
    • Bayer, H.M.1    Glimcher, P.W.2
  • 5
    • 0032189611
    • Removal of cholinergic input to rat posterior parietal cortex disrupts incremental processing of conditioned stimuli
    • Bucci D.J., Holland P.C., Gallagher M. Removal of cholinergic input to rat posterior parietal cortex disrupts incremental processing of conditioned stimuli. J. Neurosci. 1998, 18:8038-8046.
    • (1998) J. Neurosci. , vol.18 , pp. 8038-8046
    • Bucci, D.J.1    Holland, P.C.2    Gallagher, M.3
  • 6
    • 0001240712
    • Experience-Weighted Attraction Learning in Coordination Games: Probability Rules, Heterogeneity, and Time-Variation
    • Camerer C., Ho T.H. Experience-Weighted Attraction Learning in Coordination Games: Probability Rules, Heterogeneity, and Time-Variation. J. Math. Psychol. 1998, 42:305-326.
    • (1998) J. Math. Psychol. , vol.42 , pp. 305-326
    • Camerer, C.1    Ho, T.H.2
  • 7
    • 0034051066
    • Voluntary orienting is dissociated from target detection in human posterior parietal cortex
    • Corbetta M., Kincade J.M., Ollinger J.M., McAvoy M.P., Shulman G.L. Voluntary orienting is dissociated from target detection in human posterior parietal cortex. Nat. Neurosci. 2000, 3:292-297.
    • (2000) Nat. Neurosci. , vol.3 , pp. 292-297
    • Corbetta, M.1    Kincade, J.M.2    Ollinger, J.M.3    McAvoy, M.P.4    Shulman, G.L.5
  • 9
    • 35648930943
    • Posterior parietal cortex encodes autonomously selected motor plans
    • Cui H., Andersen R.A. Posterior parietal cortex encodes autonomously selected motor plans. Neuron 2007, 56:552-559.
    • (2007) Neuron , vol.56 , pp. 552-559
    • Cui, H.1    Andersen, R.A.2
  • 10
    • 40049086223
    • BOLD responses reflecting dopaminergic signals in the human ventral tegmental area
    • D'Ardenne K., McClure S.M., Nystrom L.E., Cohen J.D. BOLD responses reflecting dopaminergic signals in the human ventral tegmental area. Science 2008, 319:1264-1267.
    • (2008) Science , vol.319 , pp. 1264-1267
    • D'Ardenne, K.1    McClure, S.M.2    Nystrom, L.E.3    Cohen, J.D.4
  • 11
    • 28044450875
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 2005, 8:1704-1711.
    • (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 12
    • 33745223257
    • Cortical substrates for exploratory decisions in humans
    • Daw N.D., O'Doherty J.P., Dayan P., Seymour B., Dolan R.J. Cortical substrates for exploratory decisions in humans. Nature 2006, 441:876-879.
    • (2006) Nature , vol.441 , pp. 876-879
    • Daw, N.D.1    O'Doherty, J.P.2    Dayan, P.3    Seymour, B.4    Dolan, R.J.5
  • 13
    • 0041517689
    • Optimized EPI for fMRI studies of the orbitofrontal cortex
    • Deichmann R., Gottfried J.A., Hutton C., Turner R. Optimized EPI for fMRI studies of the orbitofrontal cortex. Neuroimage 2003, 19:430-441.
    • (2003) Neuroimage , vol.19 , pp. 430-441
    • Deichmann, R.1    Gottfried, J.A.2    Hutton, C.3    Turner, R.4
  • 14
    • 0033637899
    • Tracking the hemodynamic responses to reward and punishment in the striatum
    • Delgado M.R., Nystrom L.E., Fissell C., Noll D.C., Fiez J.A. Tracking the hemodynamic responses to reward and punishment in the striatum. J. Neurophysiol. 2000, 84:3072-3077.
    • (2000) J. Neurophysiol. , vol.84 , pp. 3072-3077
    • Delgado, M.R.1    Nystrom, L.E.2    Fissell, C.3    Noll, D.C.4    Fiez, J.A.5
  • 15
    • 48149098616
    • Regulating the expectation of reward via cognitive strategies
    • Delgado M.R., Gillis M.M., Phelps E.A. Regulating the expectation of reward via cognitive strategies. Nat. Neurosci. 2008, 11:880-881.
    • (2008) Nat. Neurosci. , vol.11 , pp. 880-881
    • Delgado, M.R.1    Gillis, M.M.2    Phelps, E.A.3
  • 16
    • 0035256938
    • Causal learning: an associative analysis
    • Dickinson A. Causal learning: an associative analysis. Q. J. Exp. Psychol. 2001, 54B:3-25.
    • (2001) Q. J. Exp. Psychol. , vol.54 B , pp. 3-25
    • Dickinson, A.1
  • 18
    • 0033213819
    • What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
    • Doya K. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Netw. 1999, 12:961-974.
    • (1999) Neural Netw. , vol.12 , pp. 961-974
    • Doya, K.1
  • 20
    • 0028818212
    • Changes in brain activity patterns in aging: the novelty oddball
    • Fabiani M., Friedman D. Changes in brain activity patterns in aging: the novelty oddball. Psychophysiology 1995, 32:579-594.
    • (1995) Psychophysiology , vol.32 , pp. 579-594
    • Fabiani, M.1    Friedman, D.2
  • 23
    • 70350521769
    • Human reinforcement learning subdivides structured action spaces by learning effector-specific values
    • Gershman S.J., Pesaran B., Daw N.D. Human reinforcement learning subdivides structured action spaces by learning effector-specific values. J. Neurosci. 2009, 29:13524-13531.
    • (2009) J. Neurosci. , vol.29 , pp. 13524-13531
    • Gershman, S.J.1    Pesaran, B.2    Daw, N.D.3
  • 24
    • 64049106340
    • Visualization of group inference data in functional neuroimaging
    • Gläscher J. Visualization of group inference data in functional neuroimaging. Neuroinformatics 2009, 7:73-82.
    • (2009) Neuroinformatics , vol.7 , pp. 73-82
    • Gläscher, J.1
  • 25
    • 58449113882
    • Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making
    • Gläscher J., Hampton A.N., O'Doherty J.P. Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making. Cereb. Cortex 2009, 19:483-495.
    • (2009) Cereb. Cortex , vol.19 , pp. 483-495
    • Gläscher, J.1    Hampton, A.N.2    O'Doherty, J.P.3
  • 26
    • 0037057757
    • Banburismus and the brain: decoding the relationship between sensory stimuli, decisions, and reward
    • Gold J.I., Shadlen M.N. Banburismus and the brain: decoding the relationship between sensory stimuli, decisions, and reward. Neuron 2002, 36:299-308.
    • (2002) Neuron , vol.36 , pp. 299-308
    • Gold, J.I.1    Shadlen, M.N.2
  • 27
    • 0347086138
    • The primate basal ganglia: parallel and integrative networks
    • Haber S.N. The primate basal ganglia: parallel and integrative networks. J. Chem. Neuroanat. 2003, 26:317-330.
    • (2003) J. Chem. Neuroanat. , vol.26 , pp. 317-330
    • Haber, S.N.1
  • 28
    • 33644858743
    • Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning
    • Haruno M., Kawato M. Different neural correlates of reward expectation and reward expectation error in the putamen and caudate nucleus during stimulus-action-reward association learning. J. Neurophysiol. 2006, 95:948-959.
    • (2006) J. Neurophysiol. , vol.95 , pp. 948-959
    • Haruno, M.1    Kawato, M.2
  • 29
    • 85047670409
    • The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity
    • Holroyd C.B., Coles M.G. The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. Psychol. Rev. 2002, 109:679-709.
    • (2002) Psychol. Rev. , vol.109 , pp. 679-709
    • Holroyd, C.B.1    Coles, M.G.2
  • 30
    • 0035882897
    • Anticipation of increasing monetary reward selectively recruits nucleus accumbens
    • Knutson B., Adams C.M., Fong G.W., Hommer D. Anticipation of increasing monetary reward selectively recruits nucleus accumbens. J. Neurosci. 2001, 21:RC159.
    • (2001) J. Neurosci. , vol.21
    • Knutson, B.1    Adams, C.M.2    Fong, G.W.3    Hommer, D.4
  • 32
    • 41149146041
    • Neural correlates of perceptual learning in a sensory-motor, but not a sensory, cortical area
    • Law C.T., Gold J.I. Neural correlates of perceptual learning in a sensory-motor, but not a sensory, cortical area. Nat. Neurosci. 2008, 11:505-513.
    • (2008) Nat. Neurosci. , vol.11 , pp. 505-513
    • Law, C.T.1    Gold, J.I.2
  • 34
    • 0035849892
    • Neurophysiological investigation of the basis of the fMRI signal
    • Logothetis N.K., Pauls J., Augath M., Trinath T., Oeltermann A. Neurophysiological investigation of the basis of the fMRI signal. Nature 2001, 412:150-157.
    • (2001) Nature , vol.412 , pp. 150-157
    • Logothetis, N.K.1    Pauls, J.2    Augath, M.3    Trinath, T.4    Oeltermann, A.5
  • 35
    • 0034625673
    • Dissociating the role of the dorsolateral prefrontal and anterior cingulate cortex in cognitive control
    • MacDonald A.W., Cohen J.D., Stenger V.A., Carter C.S. Dissociating the role of the dorsolateral prefrontal and anterior cingulate cortex in cognitive control. Science 2000, 288:1835-1838.
    • (2000) Science , vol.288 , pp. 1835-1838
    • MacDonald, A.W.1    Cohen, J.D.2    Stenger, V.A.3    Carter, C.S.4
  • 36
    • 0037650217
    • Temporal prediction errors in a passive learning task activate human striatum
    • McClure S.M., Berns G.S., Montague P.R. Temporal prediction errors in a passive learning task activate human striatum. Neuron 2003, 38:339-346.
    • (2003) Neuron , vol.38 , pp. 339-346
    • McClure, S.M.1    Berns, G.S.2    Montague, P.R.3
  • 37
  • 38
    • 33646431689
    • Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans
    • Mushiake H., Saito N., Sakamoto K., Itoyama Y., Tanji J. Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans. Neuron 2006, 50:631-641.
    • (2006) Neuron , vol.50 , pp. 631-641
    • Mushiake, H.1    Saito, N.2    Sakamoto, K.3    Itoyama, Y.4    Tanji, J.5
  • 40
    • 0037987978
    • Temporal difference models and reward-related learning in the human brain
    • O'Doherty J.P., Dayan P., Friston K., Critchley H., Dolan R.J. Temporal difference models and reward-related learning in the human brain. Neuron 2003, 38:329-337.
    • (2003) Neuron , vol.38 , pp. 329-337
    • O'Doherty, J.P.1    Dayan, P.2    Friston, K.3    Critchley, H.4    Dolan, R.J.5
  • 41
    • 0032796820
    • The functional neuroanatomy of novelty processing: integrating ERP and fMRI results
    • Opitz B., Mecklinger A., Friederici A.D., von Cramon D.Y. The functional neuroanatomy of novelty processing: integrating ERP and fMRI results. Cereb. Cortex 1999, 9:379-391.
    • (1999) Cereb. Cortex , vol.9 , pp. 379-391
    • Opitz, B.1    Mecklinger, A.2    Friederici, A.D.3    von Cramon, D.Y.4
  • 42
    • 0019089514
    • A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli
    • Pearce J.M., Hall G. A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychol. Rev. 1980, 87:532-552.
    • (1980) Psychol. Rev. , vol.87 , pp. 532-552
    • Pearce, J.M.1    Hall, G.2
  • 43
    • 0033566079
    • Neural correlates of decision variables in parietal cortex
    • Platt M.L., Glimcher P.W. Neural correlates of decision variables in parietal cortex. Nature 1999, 400:233-238.
    • (1999) Nature , vol.400 , pp. 233-238
    • Platt, M.L.1    Glimcher, P.W.2
  • 44
    • 34447648725
    • Multiple representations of belief states and action values in corticobasal ganglia loops
    • Samejima K., Doya K. Multiple representations of belief states and action values in corticobasal ganglia loops. Ann. N Y Acad. Sci. 2007, 1104:213-228.
    • (2007) Ann. N Y Acad. Sci. , vol.1104 , pp. 213-228
    • Samejima, K.1    Doya, K.2
  • 45
    • 0031867046
    • Predictive reward signal of dopamine neurons
    • Schultz W. Predictive reward signal of dopamine neurons. J. Neurophysiol. 1998, 80:1-27.
    • (1998) J. Neurophysiol. , vol.80 , pp. 1-27
    • Schultz, W.1
  • 46
    • 0030896968
    • A neural substrate of prediction and reward
    • Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science 1997, 275:1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 47
    • 34147151266
    • A common framework for perceptual learning
    • Seitz A.R., Dinse H.R. A common framework for perceptual learning. Curr. Opin. Neurobiol. 2007, 17:148-153.
    • (2007) Curr. Opin. Neurobiol. , vol.17 , pp. 148-153
    • Seitz, A.R.1    Dinse, H.R.2
  • 49
    • 40649105998
    • Novelty and target processing during an auditory novelty oddball: a simultaneous event-related potential and functional magnetic resonance imaging study
    • Strobel A., Debener S., Sorger B., Peters J.C., Kranczioch C., Hoechstetter K., Engel A.K., Brocke B., Goebel R. Novelty and target processing during an auditory novelty oddball: a simultaneous event-related potential and functional magnetic resonance imaging study. Neuroimage 2008, 40:869-883.
    • (2008) Neuroimage , vol.40 , pp. 869-883
    • Strobel, A.1    Debener, S.2    Sorger, B.3    Peters, J.C.4    Kranczioch, C.5    Hoechstetter, K.6    Engel, A.K.7    Brocke, B.8    Goebel, R.9
  • 50
    • 2942726234
    • Matching behavior and the representation of value in the parietal cortex
    • Sugrue L.P., Corrado G.S., Newsome W.T. Matching behavior and the representation of value in the parietal cortex. Science 2004, 304:1782-1787.
    • (2004) Science , vol.304 , pp. 1782-1787
    • Sugrue, L.P.1    Corrado, G.S.2    Newsome, W.T.3
  • 52
    • 0000887857
    • A Proof of the Law of Effect
    • Thorndike E.L. A Proof of the Law of Effect. Science 1933, 77:173-175.
    • (1933) Science , vol.77 , pp. 173-175
    • Thorndike, E.L.1
  • 53
    • 58149442669
    • Cognitive maps in rats and men
    • Tolman E.C. Cognitive maps in rats and men. Psychol. Rev. 1948, 55:189-208.
    • (1948) Psychol. Rev. , vol.55 , pp. 189-208
    • Tolman, E.C.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.