SCOPUS 정보 검색 플랫폼

European Journal of Neuroscience

Volumn 35, Issue 7, 2012, Pages 987-990

Beyond simple reinforcement learning: The computational neurobiology of reward-learning and valuation

(1) O'Doherty, John P a

a CALIFORNIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Basal ganglia; Computational neuroscience; Conditioning; Decision making; Prefrontal cortex

Indexed keywords

DOPAMINE;

ARTICLE; BAYES THEOREM; COGNITION; CORPUS STRIATUM; DOPAMINERGIC SYSTEM; DORSAL STRIATUM; HIPPOCAMPUS; HUMAN; LEARNING; NONHUMAN; ORBITAL CORTEX; PREFRONTAL CORTEX; PRIORITY JOURNAL; REINFORCEMENT; REWARD; REWARD LEARNING; STIMULUS GENERALIZATION; STIMULUS RESPONSE; TEMPORAL DIFFERENCE REINFORCEMENT LEARNING; VENTROMEDIAL PREFRONTAL CORTEX;

COMPUTATIONAL BIOLOGY; DOPAMINE; HUMANS; LEARNING; NEUROBIOLOGY; REINFORCEMENT (PSYCHOLOGY); REWARD; TIME FACTORS;

EID: 84859311497 PISSN: 0953816X EISSN: 14609568 Source Type: Journal
DOI: 10.1111/j.1460-9568.2012.08074.x Document Type: Article

Times cited : (31)

References (43)

1
- 84859369427
- Neural control of dopamine neurotransmission: implications for reinforcement learning
- Aggarwal, M., Hyland, B.I. & Wickens, J.R. (2012) Neural control of dopamine neurotransmission: implications for reinforcement learning. Euro. J. Neurosci., 35, 1115-1123.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1115-1123
- Aggarwal, M.¹ Hyland, B.I.² Wickens, J.R.³

2
- 0031801210
- Goal-directed instrumental action: contingency and incentive learning and their cortical substrates
- Balleine, B.W. & Dickinson, A. (1998) Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology, 37, 407-419.
- (1998) Neuropharmacology , vol.37 , pp. 407-419
- Balleine, B.W.¹ Dickinson, A.²

3
- 72049125602
- Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
- Balleine, B.W. & O'Doherty, J.P. (2010) Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology, 35, 48-69.
- (2010) Neuropsychopharmacology , vol.35 , pp. 48-69
- Balleine, B.W.¹ O'Doherty, J.P.²

4
- 21544435722
- Midbrain dopamine neurons encode a quantitative reward prediction error signal
- Bayer, H.M. & Glimcher, P.W. (2005) Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron, 47, 129-141.
- (2005) Neuron , vol.47 , pp. 129-141
- Bayer, H.M.¹ Glimcher, P.W.²

5
- 34548295327
- Learning the value of information in an uncertain world
- Behrens, T.E., Woolrich, M.W., Walton, M.E. & Rushworth, M.F. (2007) Learning the value of information in an uncertain world. Nat. Neurosci., 10, 1214-1221.
- (2007) Nat. Neurosci. , vol.10 , pp. 1214-1221
- Behrens, T.E.¹ Woolrich, M.W.² Walton, M.E.³ Rushworth, M.F.⁴

6
- 33847634405
- The debate over dopamine's role in reward: the case for incentive salience
- Berridge, K.C. (2007) The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacology, 191, 391-431.
- (2007) Psychopharmacology , vol.191 , pp. 391-431
- Berridge, K.C.¹

7
- 84859343970
- From prediction error to incentive salience: mesolimbic computation of reward motivation
- Berridge, K.C. (2012) From prediction error to incentive salience: mesolimbic computation of reward motivation. Euro. J. Neurosci., 35, 1124-1143.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1124-1143
- Berridge, K.C.¹

8
- 0032423613
- What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience?
- Berridge, K.C. & Robinson, T.E. (1998) What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience? Brain Res. Brain Res. Rev., 28, 309-369.
- (1998) Brain Res. Brain Res. Rev. , vol.28 , pp. 309-369
- Berridge, K.C.¹ Robinson, T.E.²

9
- 84859297479
- Dissociating hippocampal and striatal contributions to sequential prediction learning
- Bornstein, A.M. & Daw, N. (2012) Dissociating hippocampal and striatal contributions to sequential prediction learning. Euro. J. Neurosci., 35, 1011-1023.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1011-1023
- Bornstein, A.M.¹ Daw, N.²

10
- 70350566799
- Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective
- Botvinick, M.M., Niv, Y. & Barto, A.C. (2009) Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition, 113, 262-280.
- (2009) Cognition , vol.113 , pp. 262-280
- Botvinick, M.M.¹ Niv, Y.² Barto, A.C.³

11
- 84859317336
- How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis
- Collins, A. & Frank, M.J. (2012) How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis. Euro. J. Neurosci., 35, 1024-1035.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1024-1035
- Collins, A.¹ Frank, M.J.²

12
- 0036835734
- Long-term reward prediction in TD models of the dopamine system
- Daw, N.D. & Touretzky, D.S. (2002) Long-term reward prediction in TD models of the dopamine system. Neural Comput., 14, 2567-2583.
- (2002) Neural Comput. , vol.14 , pp. 2567-2583
- Daw, N.D.¹ Touretzky, D.S.²

13
- 28044450875
- Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
- Daw, N.D., Niv, Y. & Dayan, P. (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci., 8, 1704-1711.
- (2005) Nat. Neurosci. , vol.8 , pp. 1704-1711
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

14
- 84859315924
- Instrumental vigour in punishment and reward
- Dayan, P. (2012) Instrumental vigour in punishment and reward. Euro. J. Neurosci., 35, 1152-1167.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1152-1167
- Dayan, P.¹

15
- 84859341150
- Habits, action sequences, and reinforcement learning
- Dezfouli, A. & Balleine, B.W. (2012) Habits, action sequences, and reinforcement learning. Euro. J. Neurosci., 35, 1036-1051.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1036-1051
- Dezfouli, A.¹ Balleine, B.W.²

16
- 0036618011
- Multiple model-based reinforcement learning
- Doya, K., Samejima, K., Katagiri, K. & Kawato, M. (2002) Multiple model-based reinforcement learning. Neural Comput., 14, 1347-1369.
- (2002) Neural Comput. , vol.14 , pp. 1347-1369
- Doya, K.¹ Samejima, K.² Katagiri, K.³ Kawato, M.⁴

17
- 78650971061
- A selective role for dopamine in stimulus-reward learning
- Flagel, S.B., Clark, J.J., Robinson, T.E., Mayo, L., Czuj, A., Willuhn, I., Akers, C.A., Clinton, S.M., Phillips, P.E. & Akil, H. (2011) A selective role for dopamine in stimulus-reward learning. Nature, 469, 53-57.
- (2011) Nature , vol.469 , pp. 53-57
- Flagel, S.B.¹ Clark, J.J.² Robinson, T.E.³ Mayo, L.⁴ Czuj, A.⁵ Willuhn, I.⁶ Akers, C.A.⁷ Clinton, S.M.⁸ Phillips, P.E.⁹ Akil, H.¹⁰

18
- 84859353418
- Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats
- Funamizu, A., Ito, M., Doya, K., Kanzaki, R. & Takahashi, H. (2012) Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats. Euro. J. Neurosci., 35, 1179-1188.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1179-1188
- Funamizu, A.¹ Ito, M.² Doya, K.³ Kanzaki, R.⁴ Takahashi, H.⁵

19
- 0042932360
- Encoding predictive reward value in human amygdala and orbitofrontal cortex
- Gottfried, J.A., O'Doherty, J. & Dolan, R.J. (2003) Encoding predictive reward value in human amygdala and orbitofrontal cortex. Science, 301, 1104-1107.
- (2003) Science , vol.301 , pp. 1104-1107
- Gottfried, J.A.¹ O'Doherty, J.² Dolan, R.J.³

20
- 33748188120
- The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans
- Hampton, A.N., Bossaerts, P. & O'Doherty, J.P. (2006) The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J. Neurosci., 26, 8360-8367.
- (2006) J. Neurosci. , vol.26 , pp. 8360-8367
- Hampton, A.N.¹ Bossaerts, P.² O'Doherty, J.P.³

21
- 84859338188
- Different dorsal striatum circuits mediate action discrimination and action generalization
- Hilario, M., Holloway, T., Jin, X. & Costa, R. (2012) Different dorsal striatum circuits mediate action discrimination and action generalization. Euro. J. Neurosci., 35, 1105-1114.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1105-1114
- Hilario, M.¹ Holloway, T.² Jin, X.³ Costa, R.⁴

22
- 40849087850
- Integrating hippocampus and striatum in decision-making
- Johnson, A., van der Meer, M.A. & Redish, A.D. (2007) Integrating hippocampus and striatum in decision-making. Curr. Opin. Neurobiol., 17, 692-697.
- (2007) Curr. Opin. Neurobiol. , vol.17 , pp. 692-697
- Johnson, A.¹ van der Meer, M.A.² Redish, A.D.³

23
- 0242497620
- The architecture of cognitive control in the human prefrontal cortex
- Koechlin, E., Ody, C. & Kouneiher, F. (2003) The architecture of cognitive control in the human prefrontal cortex. Science, 302, 1181-1185.
- (2003) Science , vol.302 , pp. 1181-1185
- Koechlin, E.¹ Ody, C.² Kouneiher, F.³

24
- 84859368712
- A Theoretical account of cognitive effects in delay discounting
- Kurth-Nelson, Z., Bickel, W.K. & Redish, A.D. (2012) A Theoretical account of cognitive effects in delay discounting. Euro. J. Neurosci., 35, 1052-1064.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1052-1064
- Kurth-Nelson, Z.¹ Bickel, W.K.² Redish, A.D.³

25
- 84859323549
- Model-based learning and the contribution of the orbitofrontal cortex to the model-free world
- McDannald, M.A., Takahashi, Y., Lopatina, N., Pietras, B., Jones, J.L. & Schoenbaum, G. (2012) Model-based learning and the contribution of the orbitofrontal cortex to the model-free world. Euro. J. Neurosci., 35, 991-996.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 991-996
- McDannald, M.A.¹ Takahashi, Y.² Lopatina, N.³ Pietras, B.⁴ Jones, J.L.⁵ Schoenbaum, G.⁶

26
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive Hebbian learning
- Montague, P.R., Dayan, P. & Sejnowski, T.J. (1996) A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci., 16, 1936-1947.
- (1996) J. Neurosci. , vol.16 , pp. 1936-1947
- Montague, P.R.¹ Dayan, P.² Sejnowski, T.J.³

27
- 33847675011
- Tonic dopamine: opportunity costs and the control of response vigor
- Niv, Y., Daw, N.D., Joel, D. & Dayan, P. (2007) Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology (Berl), 191, 507-520.
- (2007) Psychopharmacology (Berl) , vol.191 , pp. 507-520
- Niv, Y.¹ Daw, N.D.² Joel, D.³ Dayan, P.⁴

28
- 84859322403
- Re-evaluating the role of orbitofrontal cortex in reward and reinforcement
- Noonan, M.P., Kolling, N., Walton, M. & Rushworth, M. (2012) Re-evaluating the role of orbitofrontal cortex in reward and reinforcement. Euro. J. Neurosci., 35, 997-1010.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 997-1010
- Noonan, M.P.¹ Kolling, N.² Walton, M.³ Rushworth, M.⁴

29
- 84859343955
- How can a Bayesian approach inform neuroscience?
- O'Reilly, J.X., Jbabdi, S. & Behrens, T.E. (2012) How can a Bayesian approach inform neuroscience? Euro. J. Neurosci., 35, 1168-1178.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1168-1178
- O'Reilly, J.X.¹ Jbabdi, S.² Behrens, T.E.³

30
- 84859345638
- Category representation and generalization in the prefrontal cortex
- Pan, X. & Sakagami, M. (2012) Category representation and generalization in the prefrontal cortex. Euro. J. Neurosci., 35, 1083-1091.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1083-1091
- Pan, X.¹ Sakagami, M.²

31
- 32844469435
- The primate amygdala represents the positive and negative value of visual stimuli during learning
- Paton, J.J., Belova, M.A., Morrison, S.E. & Salzman, C.D. (2006) The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature, 439, 865-870.
- (2006) Nature , vol.439 , pp. 865-870
- Paton, J.J.¹ Belova, M.A.² Morrison, S.E.³ Salzman, C.D.⁴

32
- 0019089514
- A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli
- Pearce, J.M. & Hall, G. (1980) A model for Pavlovian learning: variations in the effectiveness of conditioned but not of unconditioned stimuli. Psychol. Rev., 87, 532-552.
- (1980) Psychol. Rev. , vol.87 , pp. 532-552
- Pearce, J.M.¹ Hall, G.²

33
- 14044274312
- Distinguishing whether dopamine regulates liking, wanting, and/or learning about rewards
- Robinson, S., Sandstrom, S.M., Denenberg, V.H. & Palmiter, R.D. (2005) Distinguishing whether dopamine regulates liking, wanting, and/or learning about rewards. Behav. Neurosci., 119, 5-15.
- (2005) Behav. Neurosci. , vol.119 , pp. 5-15
- Robinson, S.¹ Sandstrom, S.M.² Denenberg, V.H.³ Palmiter, R.D.⁴

34
- 84859339780
- Surprise! Neural correlates of Pearce-Hall and Rescorla-Wagner coexist within the brain
- Roesch, M., Esber, G.R., Li, J., Daw, N. & Schoenbaum, G. (2012) Surprise! Neural correlates of Pearce-Hall and Rescorla-Wagner coexist within the brain. Euro. J. Neurosci., 35, 1189-1199.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1189-1199
- Roesch, M.¹ Esber, G.R.² Li, J.³ Daw, N.⁴ Schoenbaum, G.⁵

35
- 28144449057
- Representation of action-specific reward values in the striatum
- Samejima, K., Ueda, Y., Doya, K. & Kimura, M. (2005) Representation of action-specific reward values in the striatum. Science, 310, 1337-1340.
- (2005) Science , vol.310 , pp. 1337-1340
- Samejima, K.¹ Ueda, Y.² Doya, K.³ Kimura, M.⁴

36
- 0032081988
- Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning
- Schoenbaum, G., Chiba, A.A. & Gallagher, M. (1998) Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning. Nat. Neurosci., 1, 155-159.
- (1998) Nat. Neurosci. , vol.1 , pp. 155-159
- Schoenbaum, G.¹ Chiba, A.A.² Gallagher, M.³

37
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz, W. (1998) Predictive reward signal of dopamine neurons. J. Neurophysiol., 80, 1-27.
- (1998) J. Neurophysiol. , vol.80 , pp. 1-27
- Schultz, W.¹

38
- 0030896968
- A neural substrate of prediction and reward
- Schultz, W., Dayan, P. & Montague, P.R. (1997) A neural substrate of prediction and reward. Science, 275, 1593-1599.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.R.³

39
- 0742304657
- The principal features and mechanisms of dopamine modulation in the prefrontal cortex
- Seamans, J.K. & Yang, C.R. (2004) The principal features and mechanisms of dopamine modulation in the prefrontal cortex. Prog. Neurobiol., 74, 1-58.
- (2004) Prog. Neurobiol. , vol.74 , pp. 1-58
- Seamans, J.K.¹ Yang, C.R.²

40
- 84859359283
- Decomposing effects of dopaminergic medication in Parkinson's disease on probabilistic action selection: learning or performance?
- Smittenaar, P., Chase, H.W., Aarts, E., Nusselein, B., Bloem, B.R. & Cools, R. (2012) Decomposing effects of dopaminergic medication in Parkinson's disease on probabilistic action selection: learning or performance? Euro. J. Neurosci., 35, 1144-1151.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1144-1151
- Smittenaar, P.¹ Chase, H.W.² Aarts, E.³ Nusselein, B.⁴ Bloem, B.R.⁵ Cools, R.⁶

41
- 0004007508
- MIT Press, Cambridge, MA.
- Sutton, R.S. & Barto, A.G. (1998) Reinforcement Learning. MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

42
- 84859308709
- Strategic control in decision making under uncertainty
- Venkatraman, V. & Huettel, S. (2012) Strategic control in decision making under uncertainty. Euro. J. Neurosci., 35, 1075-1082.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1075-1082
- Venkatraman, V.¹ Huettel, S.²

43
- 84859339117
- Generalization of value in reinforcement learning by humans
- Wimmer, G.E., Daw, N.D. & Shohamy, D. (2012) Generalization of value in reinforcement learning by humans. Euro. J. Neurosci., 35, 1092-1104.
- (2012) Euro. J. Neurosci. , vol.35 , pp. 1092-1104
- Wimmer, G.E.¹ Daw, N.D.² Shohamy, D.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.