메뉴 건너뛰기




Volumn 35, Issue 7, 2012, Pages 987-990

Beyond simple reinforcement learning: The computational neurobiology of reward-learning and valuation

Author keywords

Basal ganglia; Computational neuroscience; Conditioning; Decision making; Prefrontal cortex

Indexed keywords

DOPAMINE;

EID: 84859311497     PISSN: 0953816X     EISSN: 14609568     Source Type: Journal    
DOI: 10.1111/j.1460-9568.2012.08074.x     Document Type: Article
Times cited : (31)

References (43)
  • 1
    • 84859369427 scopus 로고    scopus 로고
    • Neural control of dopamine neurotransmission: implications for reinforcement learning
    • Aggarwal, M., Hyland, B.I. & Wickens, J.R. (2012) Neural control of dopamine neurotransmission: implications for reinforcement learning. Euro. J. Neurosci., 35, 1115-1123.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1115-1123
    • Aggarwal, M.1    Hyland, B.I.2    Wickens, J.R.3
  • 2
    • 0031801210 scopus 로고    scopus 로고
    • Goal-directed instrumental action: contingency and incentive learning and their cortical substrates
    • Balleine, B.W. & Dickinson, A. (1998) Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology, 37, 407-419.
    • (1998) Neuropharmacology , vol.37 , pp. 407-419
    • Balleine, B.W.1    Dickinson, A.2
  • 3
    • 72049125602 scopus 로고    scopus 로고
    • Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
    • Balleine, B.W. & O'Doherty, J.P. (2010) Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology, 35, 48-69.
    • (2010) Neuropsychopharmacology , vol.35 , pp. 48-69
    • Balleine, B.W.1    O'Doherty, J.P.2
  • 4
    • 21544435722 scopus 로고    scopus 로고
    • Midbrain dopamine neurons encode a quantitative reward prediction error signal
    • Bayer, H.M. & Glimcher, P.W. (2005) Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron, 47, 129-141.
    • (2005) Neuron , vol.47 , pp. 129-141
    • Bayer, H.M.1    Glimcher, P.W.2
  • 6
    • 33847634405 scopus 로고    scopus 로고
    • The debate over dopamine's role in reward: the case for incentive salience
    • Berridge, K.C. (2007) The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacology, 191, 391-431.
    • (2007) Psychopharmacology , vol.191 , pp. 391-431
    • Berridge, K.C.1
  • 7
    • 84859343970 scopus 로고    scopus 로고
    • From prediction error to incentive salience: mesolimbic computation of reward motivation
    • Berridge, K.C. (2012) From prediction error to incentive salience: mesolimbic computation of reward motivation. Euro. J. Neurosci., 35, 1124-1143.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1124-1143
    • Berridge, K.C.1
  • 8
    • 0032423613 scopus 로고    scopus 로고
    • What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience?
    • Berridge, K.C. & Robinson, T.E. (1998) What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience? Brain Res. Brain Res. Rev., 28, 309-369.
    • (1998) Brain Res. Brain Res. Rev. , vol.28 , pp. 309-369
    • Berridge, K.C.1    Robinson, T.E.2
  • 9
    • 84859297479 scopus 로고    scopus 로고
    • Dissociating hippocampal and striatal contributions to sequential prediction learning
    • Bornstein, A.M. & Daw, N. (2012) Dissociating hippocampal and striatal contributions to sequential prediction learning. Euro. J. Neurosci., 35, 1011-1023.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1011-1023
    • Bornstein, A.M.1    Daw, N.2
  • 10
    • 70350566799 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective
    • Botvinick, M.M., Niv, Y. & Barto, A.C. (2009) Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition, 113, 262-280.
    • (2009) Cognition , vol.113 , pp. 262-280
    • Botvinick, M.M.1    Niv, Y.2    Barto, A.C.3
  • 11
    • 84859317336 scopus 로고    scopus 로고
    • How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis
    • Collins, A. & Frank, M.J. (2012) How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis. Euro. J. Neurosci., 35, 1024-1035.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1024-1035
    • Collins, A.1    Frank, M.J.2
  • 12
    • 0036835734 scopus 로고    scopus 로고
    • Long-term reward prediction in TD models of the dopamine system
    • Daw, N.D. & Touretzky, D.S. (2002) Long-term reward prediction in TD models of the dopamine system. Neural Comput., 14, 2567-2583.
    • (2002) Neural Comput. , vol.14 , pp. 2567-2583
    • Daw, N.D.1    Touretzky, D.S.2
  • 13
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw, N.D., Niv, Y. & Dayan, P. (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci., 8, 1704-1711.
    • (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 14
    • 84859315924 scopus 로고    scopus 로고
    • Instrumental vigour in punishment and reward
    • Dayan, P. (2012) Instrumental vigour in punishment and reward. Euro. J. Neurosci., 35, 1152-1167.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1152-1167
    • Dayan, P.1
  • 15
    • 84859341150 scopus 로고    scopus 로고
    • Habits, action sequences, and reinforcement learning
    • Dezfouli, A. & Balleine, B.W. (2012) Habits, action sequences, and reinforcement learning. Euro. J. Neurosci., 35, 1036-1051.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1036-1051
    • Dezfouli, A.1    Balleine, B.W.2
  • 16
  • 18
    • 84859353418 scopus 로고    scopus 로고
    • Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats
    • Funamizu, A., Ito, M., Doya, K., Kanzaki, R. & Takahashi, H. (2012) Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats. Euro. J. Neurosci., 35, 1179-1188.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1179-1188
    • Funamizu, A.1    Ito, M.2    Doya, K.3    Kanzaki, R.4    Takahashi, H.5
  • 19
    • 0042932360 scopus 로고    scopus 로고
    • Encoding predictive reward value in human amygdala and orbitofrontal cortex
    • Gottfried, J.A., O'Doherty, J. & Dolan, R.J. (2003) Encoding predictive reward value in human amygdala and orbitofrontal cortex. Science, 301, 1104-1107.
    • (2003) Science , vol.301 , pp. 1104-1107
    • Gottfried, J.A.1    O'Doherty, J.2    Dolan, R.J.3
  • 20
    • 33748188120 scopus 로고    scopus 로고
    • The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans
    • Hampton, A.N., Bossaerts, P. & O'Doherty, J.P. (2006) The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J. Neurosci., 26, 8360-8367.
    • (2006) J. Neurosci. , vol.26 , pp. 8360-8367
    • Hampton, A.N.1    Bossaerts, P.2    O'Doherty, J.P.3
  • 21
    • 84859338188 scopus 로고    scopus 로고
    • Different dorsal striatum circuits mediate action discrimination and action generalization
    • Hilario, M., Holloway, T., Jin, X. & Costa, R. (2012) Different dorsal striatum circuits mediate action discrimination and action generalization. Euro. J. Neurosci., 35, 1105-1114.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1105-1114
    • Hilario, M.1    Holloway, T.2    Jin, X.3    Costa, R.4
  • 23
    • 0242497620 scopus 로고    scopus 로고
    • The architecture of cognitive control in the human prefrontal cortex
    • Koechlin, E., Ody, C. & Kouneiher, F. (2003) The architecture of cognitive control in the human prefrontal cortex. Science, 302, 1181-1185.
    • (2003) Science , vol.302 , pp. 1181-1185
    • Koechlin, E.1    Ody, C.2    Kouneiher, F.3
  • 24
    • 84859368712 scopus 로고    scopus 로고
    • A Theoretical account of cognitive effects in delay discounting
    • Kurth-Nelson, Z., Bickel, W.K. & Redish, A.D. (2012) A Theoretical account of cognitive effects in delay discounting. Euro. J. Neurosci., 35, 1052-1064.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1052-1064
    • Kurth-Nelson, Z.1    Bickel, W.K.2    Redish, A.D.3
  • 26
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P.R., Dayan, P. & Sejnowski, T.J. (1996) A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci., 16, 1936-1947.
    • (1996) J. Neurosci. , vol.16 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 27
    • 33847675011 scopus 로고    scopus 로고
    • Tonic dopamine: opportunity costs and the control of response vigor
    • Niv, Y., Daw, N.D., Joel, D. & Dayan, P. (2007) Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology (Berl), 191, 507-520.
    • (2007) Psychopharmacology (Berl) , vol.191 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 28
    • 84859322403 scopus 로고    scopus 로고
    • Re-evaluating the role of orbitofrontal cortex in reward and reinforcement
    • Noonan, M.P., Kolling, N., Walton, M. & Rushworth, M. (2012) Re-evaluating the role of orbitofrontal cortex in reward and reinforcement. Euro. J. Neurosci., 35, 997-1010.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 997-1010
    • Noonan, M.P.1    Kolling, N.2    Walton, M.3    Rushworth, M.4
  • 29
    • 84859343955 scopus 로고    scopus 로고
    • How can a Bayesian approach inform neuroscience?
    • O'Reilly, J.X., Jbabdi, S. & Behrens, T.E. (2012) How can a Bayesian approach inform neuroscience? Euro. J. Neurosci., 35, 1168-1178.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1168-1178
    • O'Reilly, J.X.1    Jbabdi, S.2    Behrens, T.E.3
  • 30
    • 84859345638 scopus 로고    scopus 로고
    • Category representation and generalization in the prefrontal cortex
    • Pan, X. & Sakagami, M. (2012) Category representation and generalization in the prefrontal cortex. Euro. J. Neurosci., 35, 1083-1091.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1083-1091
    • Pan, X.1    Sakagami, M.2
  • 31
    • 32844469435 scopus 로고    scopus 로고
    • The primate amygdala represents the positive and negative value of visual stimuli during learning
    • Paton, J.J., Belova, M.A., Morrison, S.E. & Salzman, C.D. (2006) The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature, 439, 865-870.
    • (2006) Nature , vol.439 , pp. 865-870
    • Paton, J.J.1    Belova, M.A.2    Morrison, S.E.3    Salzman, C.D.4
  • 32
    • 0019089514 scopus 로고
    • A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli
    • Pearce, J.M. & Hall, G. (1980) A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychol. Rev., 87, 532-552.
    • (1980) Psychol. Rev. , vol.87 , pp. 532-552
    • Pearce, J.M.1    Hall, G.2
  • 33
    • 14044274312 scopus 로고    scopus 로고
    • Distinguishing whether dopamine regulates liking, wanting, and/or learning about rewards
    • Robinson, S., Sandstrom, S.M., Denenberg, V.H. & Palmiter, R.D. (2005) Distinguishing whether dopamine regulates liking, wanting, and/or learning about rewards. Behav. Neurosci., 119, 5-15.
    • (2005) Behav. Neurosci. , vol.119 , pp. 5-15
    • Robinson, S.1    Sandstrom, S.M.2    Denenberg, V.H.3    Palmiter, R.D.4
  • 34
    • 84859339780 scopus 로고    scopus 로고
    • Surprise! Neural correlates of Pearce-Hall and Rescorla-Wagner coexist within the brain
    • Roesch, M., Esber, G.R., Li, J., Daw, N. & Schoenbaum, G. (2012) Surprise! Neural correlates of Pearce-Hall and Rescorla-Wagner coexist within the brain. Euro. J. Neurosci., 35, 1189-1199.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1189-1199
    • Roesch, M.1    Esber, G.R.2    Li, J.3    Daw, N.4    Schoenbaum, G.5
  • 35
    • 28144449057 scopus 로고    scopus 로고
    • Representation of action-specific reward values in the striatum
    • Samejima, K., Ueda, Y., Doya, K. & Kimura, M. (2005) Representation of action-specific reward values in the striatum. Science, 310, 1337-1340.
    • (2005) Science , vol.310 , pp. 1337-1340
    • Samejima, K.1    Ueda, Y.2    Doya, K.3    Kimura, M.4
  • 36
    • 0032081988 scopus 로고    scopus 로고
    • Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning
    • Schoenbaum, G., Chiba, A.A. & Gallagher, M. (1998) Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning. Nat. Neurosci., 1, 155-159.
    • (1998) Nat. Neurosci. , vol.1 , pp. 155-159
    • Schoenbaum, G.1    Chiba, A.A.2    Gallagher, M.3
  • 37
    • 0031867046 scopus 로고    scopus 로고
    • Predictive reward signal of dopamine neurons
    • Schultz, W. (1998) Predictive reward signal of dopamine neurons. J. Neurophysiol., 80, 1-27.
    • (1998) J. Neurophysiol. , vol.80 , pp. 1-27
    • Schultz, W.1
  • 38
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz, W., Dayan, P. & Montague, P.R. (1997) A neural substrate of prediction and reward. Science, 275, 1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 39
    • 0742304657 scopus 로고    scopus 로고
    • The principal features and mechanisms of dopamine modulation in the prefrontal cortex
    • Seamans, J.K. & Yang, C.R. (2004) The principal features and mechanisms of dopamine modulation in the prefrontal cortex. Prog. Neurobiol., 74, 1-58.
    • (2004) Prog. Neurobiol. , vol.74 , pp. 1-58
    • Seamans, J.K.1    Yang, C.R.2
  • 40
    • 84859359283 scopus 로고    scopus 로고
    • Decomposing effects of dopaminergic medication in Parkinson's disease on probabilistic action selection: learning or performance?
    • Smittenaar, P., Chase, H.W., Aarts, E., Nusselein, B., Bloem, B.R. & Cools, R. (2012) Decomposing effects of dopaminergic medication in Parkinson's disease on probabilistic action selection: learning or performance? Euro. J. Neurosci., 35, 1144-1151.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1144-1151
    • Smittenaar, P.1    Chase, H.W.2    Aarts, E.3    Nusselein, B.4    Bloem, B.R.5    Cools, R.6
  • 42
    • 84859308709 scopus 로고    scopus 로고
    • Strategic control in decision making under uncertainty
    • Venkatraman, V. & Huettel, S. (2012) Strategic control in decision making under uncertainty. Euro. J. Neurosci., 35, 1075-1082.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1075-1082
    • Venkatraman, V.1    Huettel, S.2
  • 43
    • 84859339117 scopus 로고    scopus 로고
    • Generalization of value in reinforcement learning by humans
    • Wimmer, G.E., Daw, N.D. & Shohamy, D. (2012) Generalization of value in reinforcement learning by humans. Euro. J. Neurosci., 35, 1092-1104.
    • (2012) Euro. J. Neurosci. , vol.35 , pp. 1092-1104
    • Wimmer, G.E.1    Daw, N.D.2    Shohamy, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.