Volume 5, Issue MARCH2016, 2016, Pages

Midbrain dopamine neurons compute inferred and cached value prediction errors in a common framework

Author keywords

[No Author keywords available]

Indexed keywords

DOPAMINERGIC NERVE CELL; ERROR; MESENCEPHALON; MODEL; PREDICTION; REWARD; ANIMAL; ANIMAL BEHAVIOR; ASSOCIATION; BIOLOGICAL MODEL; FEMALE; INSTRUMENTAL CONDITIONING; LONG EVANS RAT; MALE; PHYSIOLOGY;

EID: 84964211344     PISSN: None     EISSN: 2050084X     Source Type: Journal    
DOI: 10.7554/eLife.13665     Document Type: Article
Times cited : (91)

References (57)
  • 1
    • 21544435722
    • Midbrain dopamine neurons encode a quantitative reward prediction error signal
    • Bayer HM, Glimcher PW. 2005. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47:129–141. doi: 10.1016/j.neuron.2005.05.020
    • (2005) Neuron , vol.47 , pp. 129-141
    • Bayer, H.M.1    Glimcher, P.W.2
  • 3
    • 78649966665
    • Dopamine in Motivational Control: Rewarding, Aversive, and Alerting
    • Bromberg-Martin ES, Matsumoto M, Hikosaka O. 2010a. Dopamine in Motivational Control: Rewarding, Aversive, and Alerting. Neuron 68:815–834. doi: 10.1016/j.neuron.2010.11.022
    • (2010) Neuron , vol.68 , pp. 815-834
    • Bromberg-Martin, E.S.1    Matsumoto, M.2    Hikosaka, O.3
  • 4
  • 5
    • 0001491619
    • A mathematical model for simple learning
    • Bush RR, Mosteller F. 1951. A mathematical model for simple learning. Psychological Review 58:313–323. doi: 10.1037/h0054388
    • (1951) Psychological Review , vol.58 , pp. 313-323
    • Bush, R.R.1    Mosteller, F.2
  • 6
    • 84953837909
    • Brief optogenetic inhibition of dopamine neurons mimics endogenous negative reward prediction errors
    • Chang CY, Esber GR, Marrero-Garcia Y, Yau H-J, Bonci A, Schoenbaum G. 2016. Brief optogenetic inhibition of dopamine neurons mimics endogenous negative reward prediction errors. Nature Neuroscience 19:111–116. doi: 10.1038/nn.4191
    • (2016) Nature Neuroscience , vol.19 , pp. 111-116
    • Chang, C.Y.1    Esber, G.R.2    Marrero-Garcia, Y.3    Yau, H.-J.4    Bonci, A.5    Schoenbaum, G.6
  • 7
    • 84878176981
    • Pavlovian valuation systems in learning and decision making
    • Clark JJ, Hollon NG, Phillips PEM. 2012. Pavlovian valuation systems in learning and decision making. Current Opinion in Neurobiology 22:1054–1061. doi: 10.1016/j.conb.2012.06.004
    • (2012) Current Opinion in Neurobiology , vol.22 , pp. 1054-1061
    • Clark, J.J.1    Hollon, N.G.2    Phillips, P.E.M.3
  • 8
    • 84856431209
    • Neuron-type-specific signals for reward and punishment in the ventral tegmental area
    • Cohen JY, Haesler S, Vong L, Lowell BB, Uchida N. 2012. Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature 482:85–88. doi: 10.1038/nature10754
    • (2012) Nature , vol.482 , pp. 85-88
    • Cohen, J.Y.1    Haesler, S.2    Vong, L.3    Lowell, B.B.4    Uchida, N.5
  • 10
    • 79952746011
    • Model-based influences on humans’ choices and striatal prediction errors
    • Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ. 2011. Model-based influences on humans’ choices and striatal prediction errors. Neuron 69:1204–1215. doi: 10.1016/j.neuron.2011.02.027
    • (2011) Neuron , vol.69 , pp. 1204-1215
    • Daw, N.D.1    Gershman, S.J.2    Seymour, B.3    Dayan, P.4    Dolan, R.J.5
  • 11
    • 28044450875
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw ND, Niv Y, Dayan P. 2005. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience 8:1704–1711. doi: 10.1038/nn1560
    • (2005) Nature Neuroscience , vol.8 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 13
    • 84956825053
    • Variability in dopamine genes dissociates model-based and model-free reinforcement learning
    • Doll BB, Bath KG, Daw ND, Frank MJ. 2016. Variability in dopamine genes dissociates model-based and model-free reinforcement learning. Journal of Neuroscience 36:1211–1222. doi: 10.1523/JNEUROSCI.1901-15.2016
    • (2016) Journal of Neuroscience , vol.36 , pp. 1211-1222
    • Doll, B.B.1    Bath, K.G.2    Daw, N.D.3    Frank, M.J.4
  • 14
    • 84872761547
    • The ubiquity of model-based reinforcement learning
    • Doll BB, Simon DA, Daw ND. 2012. The ubiquity of model-based reinforcement learning. Current Opinion in Neurobiology 22:1075–1081. doi: 10.1016/j.conb.2012.08.003
    • (2012) Current Opinion in Neurobiology , vol.22 , pp. 1075-1081
    • Doll, B.B.1    Simon, D.A.2    Daw, N.D.3
  • 15
    • 84941212802
    • Arithmetic and local circuitry underlying dopamine prediction errors
    • Eshel N, Bukwich M, Rao V, Hemmelder V, Tian J, Uchida N. 2015. Arithmetic and local circuitry underlying dopamine prediction errors. Nature 525:243–246. doi: 10.1038/nature14855
    • (2015) Nature , vol.525 , pp. 243-246
    • Eshel, N.1    Bukwich, M.2    Rao, V.3    Hemmelder, V.4    Tian, J.5    Uchida, N.6
  • 16
    • 80053152388
    • Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis
    • Glimcher PW. 2011. Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis. Proceedings of the National Academy of Sciences of the United States of America 108: 15647–15654. doi: 10.1073/pnas.1014269108
    • (2011) Proceedings of the National Academy of Sciences of the United States of America , vol.108 , pp. 15647-15654
    • Glimcher, P.W.1
  • 17
    • 0029812404
    • Learning about associatively activated stimulus representations: Implications for acquired equivalence and perceptual learning
    • Hall G. 1996. Learning about associatively activated stimulus representations: Implications for acquired equivalence and perceptual learning. Animal Learning & Behavior 24:233–255. doi: 10.3758/BF03198973
    • (1996) Animal Learning & Behavior , vol.24 , pp. 233-255
    • Hall, G.1
  • 19
    • 84892388605
    • Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term
    • Hart AS, Rutledge RB, Glimcher PW, Phillips PEM. 2014. Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term. Journal of Neuroscience 34:698–704. doi: 10.1523/JNEUROSCI.2489-13.2014
    • (2014) Journal of Neuroscience , vol.34 , pp. 698-704
    • Hart, A.S.1    Rutledge, R.B.2    Glimcher, P.W.3    Phillips, P.E.M.4
  • 20
    • 0018288266
    • Differential effects of two ways of devaluing the unconditioned stimulus after Pavlovian appetitive conditioning
    • Holland PC, Straub JJ. 1979. Differential effects of two ways of devaluing the unconditioned stimulus after Pavlovian appetitive conditioning. Journal of Experimental Psychology: Animal Behavior Processes 5:65–78. doi: 10.1037/0097-7403.5.1.65
    • (1979) Journal of Experimental Psychology: Animal Behavior Processes , vol.5 , pp. 65-78
    • Holland, P.C.1    Straub, J.J.2
  • 21
    • 0025520061
    • Event representation in Pavlovian conditioning: Image and action
    • Holland PC. 1990. Event representation in Pavlovian conditioning: Image and action. Cognition 37:105–131. doi: 10.1016/0010-0277(90)90020-K
    • (1990) Cognition , vol.37 , pp. 105-131
    • Holland, P.C.1
  • 22
    • 33644688754
    • Dopamine neurons report an error in the temporal prediction of reward during learning
    • Hollerman JR, Schultz W. 1998. Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience 1:304–309. doi: 10.1038/1124
    • (1998) Nature Neuroscience , vol.1 , pp. 304-309
    • Hollerman, J.R.1    Schultz, W.2
  • 23
    • 84883065020
    • Prolonged dopamine signalling in striatum signals proximity and value of distant rewards
    • Howe MW, Tierney PL, Sandberg SG, Phillips PEM, Graybiel AM. 2013. Prolonged dopamine signalling in striatum signals proximity and value of distant rewards. Nature 500:575–579. doi: 10.1038/nature12475
    • (2013) Nature , vol.500 , pp. 575-579
    • Howe, M.W.1    Tierney, P.L.2    Sandberg, S.G.3    Phillips, P.E.M.4    Graybiel, A.M.5
  • 24
    • 77954925944
    • Start/stop signals emerge in nigrostriatal circuits during sequence learning
    • Jin X, Costa RM. 2010. Start/stop signals emerge in nigrostriatal circuits during sequence learning. Nature 466: 457–462. doi: 10.1038/nature09263
    • (2010) Nature , vol.466 , pp. 457-462
    • Jin, X.1    Costa, R.M.2
  • 25
    • 84877276141
    • Effects of prefrontal cortical inactivation on neural activity in the ventral tegmental area
    • Jo YS, Lee J, Mizumori SJY. 2013. Effects of prefrontal cortical inactivation on neural activity in the ventral tegmental area. Journal of Neuroscience 33:8159–8171. doi: 10.1523/JNEUROSCI.0118-13.2013
    • (2013) Journal of Neuroscience , vol.33 , pp. 8159-8171
    • Jo, Y.S.1    Lee, J.2    Mizumori, S.J.Y.3
  • 27
    • 0036592029
    • Dopamine: Generalization and bonuses
    • Kakade S, Dayan P. 2002. Dopamine: generalization and bonuses. Neural Networks 15:549–559. doi: 10.1016/S0893-6080(02)00048-5
    • (2002) Neural Networks , vol.15 , pp. 549-559
    • Kakade, S.1    Dayan, P.2
  • 31
    • 33845305449
    • The ventral tegmental area revisited: Is there an electrophysiological marker for dopaminergic neurons?
    • Margolis EB, Lock H, Hjelmstad GO, Fields HL. 2006. The ventral tegmental area revisited: is there an electrophysiological marker for dopaminergic neurons? The Journal of Physiology 577:907–924. doi: 10.1113/jphysiol.2006.117069
    • (2006) The Journal of Physiology , vol.577 , pp. 907-924
    • Margolis, E.B.1    Lock, H.2    Hjelmstad, G.O.3    Fields, H.L.4
  • 32
    • 84964229084
    • Phasic dopamine transmission following state-based reinforcer devaluation in a dual-reward detection task
    • Martinez V, Walton ME, Gan JO, Phillips PEM. 2008. Phasic dopamine transmission following state-based reinforcer devaluation in a dual-reward detection task. Society for Neuroscience Abstracts.
    • (2008) Society for Neuroscience Abstracts
    • Martinez, V.1    Walton, M.E.2    Gan, J.O.3    Phillips, P.E.M.4
  • 33
    • 67349098495
    • Two types of dopamine neuron distinctly convey positive and negative motivational signals
    • Matsumoto M, Hikosaka O. 2009. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 459:837–841. doi: 10.1038/nature08028
    • (2009) Nature , vol.459 , pp. 837-841
    • Matsumoto, M.1    Hikosaka, O.2
  • 34
    • 0027964829
    • Importance of unpredictability for reward responses in primate dopamine neurons
    • Mirenowicz J, Schultz W. 1994. Importance of unpredictability for reward responses in primate dopamine neurons. Journal of Neurophysiology 72:1024–1027.
    • (1994) Journal of Neurophysiology , vol.72 , pp. 1024-1027
    • Mirenowicz, J.1    Schultz, W.2
  • 35
    • 33747585633
    • Midbrain dopamine neurons encode decisions for future action
    • Morris G, Nevet A, Arkadir D, Vaadia E, Bergman H. 2006. Midbrain dopamine neurons encode decisions for future action. Nature Neuroscience 9:1057–1063. doi: 10.1038/nn1743
    • (2006) Nature Neuroscience , vol.9 , pp. 1057-1063
    • Morris, G.1    Nevet, A.2    Arkadir, D.3    Vaadia, E.4    Bergman, H.5
  • 36
    • 33847675011
    • Tonic dopamine: Opportunity costs and the control of response vigor
    • Niv Y, Daw ND, Joel D, Dayan P. 2007. Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology 191:507–520. doi: 10.1007/s00213-006-0502-4
    • (2007) Psychopharmacology , vol.191 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 37
    • 45949092119
    • Dialogues on prediction errors
    • Niv Y, Schoenbaum G. 2008. Dialogues on prediction errors. Trends in Cognitive Sciences 12:265–272. doi: 10.1016/j.tics.2008.03.006
    • (2008) Trends in Cognitive Sciences , vol.12 , pp. 265-272
    • Niv, Y.1    Schoenbaum, G.2
  • 38
    • 21544455210
    • Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward-learning network
    • Pan WX, Schmidt R, Wickens JR, Hyland BI. 2005. Dopamine cells respond to predicted events during classical conditioning: evidence for eligibility traces in the reward-learning network. Journal of Neuroscience 25:6235–6242. doi: 10.1523/JNEUROSCI.1478-05.2005
    • (2005) Journal of Neuroscience , vol.25 , pp. 6235-6242
    • Pan, W.X.1    Schmidt, R.2    Wickens, J.R.3    Hyland, B.I.4
  • 39
    • 0001578011
    • The extinction of within-compound flavor associations
    • Rescorla RA, Freberg L. 1978. The extinction of within-compound flavor associations. Learning and Motivation 9: 411–427. doi: 10.1016/0023-9690(78)90003-6
    • (1978) Learning and Motivation , vol.9 , pp. 411-427
    • Rescorla, R.A.1    Freberg, L.2
  • 41
    • 36448968271
    • Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
    • Roesch MR, Calu DJ, Schoenbaum G. 2007. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience 10:1615–1624. doi: 10.1038/nn2013
    • (2007) Nature Neuroscience , vol.10 , pp. 1615-1624
    • Roesch, M.R.1    Calu, D.J.2    Schoenbaum, G.3
  • 42
    • 84949322784
    • Phasic dopamine signals: From subjective reward value to formal economic utility
    • Schultz W, Carelli RM, Wightman RM. 2015. Phasic dopamine signals: from subjective reward value to formal economic utility. Current Opinion in Behavioral Sciences 5:147–154. doi: 10.1016/j.cobeha.2015.09.006
    • (2015) Current Opinion in Behavioral Sciences , vol.5 , pp. 147-154
    • Schultz, W.1    Carelli, R.M.2    Wightman, R.M.3
  • 43
    • 0030896968
    • A neural substrate of prediction and reward
    • Schultz W, Dayan P, Montague PR. 1997. A neural substrate of prediction and reward. Science 275:1593–1599. doi: 10.1126/science.275.5306.1593
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 44
    • 0037057755
    • Getting formal with dopamine and reward
    • Schultz W. 2002. Getting formal with dopamine and reward. Neuron 36:241–263. doi: 10.1016/S0896-6273(02)00967-4
    • (2002) Neuron , vol.36 , pp. 241-263
    • Schultz, W.1
  • 45
    • 84959902999
    • Dopamine selectively remediates ‘model-based’ reward learning: A computational approach
    • Sharp ME, Foerde K, Daw ND, Shohamy D. 2016. Dopamine selectively remediates ‘model-based’ reward learning: a computational approach. Brain 139:355–364. doi: 10.1093/brain/awv347
    • (2016) Brain , vol.139 , pp. 355-364
    • Sharp, M.E.1    Foerde, K.2    Daw, N.D.3    Shohamy, D.4
  • 48
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Sutton RS. 1988. Learning to predict by the methods of temporal differences. Machine Learning 3:9–44. doi: 10.1007/BF00115009
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 52
    • 0345255891
    • Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm
    • Tobler PN, Dickinson A, Schultz W. 2003. Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm. Journal of Neuroscience 23:10402–10410.
    • (2003) Journal of Neuroscience , vol.23 , pp. 10402-10410
    • Tobler, P.N.1    Dickinson, A.2    Schultz, W.3
  • 53
    • 0035811464
    • Dopamine responses comply with basic assumptions of formal learning theory
    • Waelti P, Dickinson A, Schultz W. 2001. Dopamine responses comply with basic assumptions of formal learning theory. Nature 412:43–48. doi: 10.1038/35083500
    • (2001) Nature , vol.412 , pp. 43-48
    • Waelti, P.1    Dickinson, A.2    Schultz, W.3
  • 55
    • 84867287309
    • Preference by Association: How Memory Mechanisms in the Hippocampus Bias Decisions
    • Wimmer GE, Shohamy D. 2012. Preference by Association: How Memory Mechanisms in the Hippocampus Bias Decisions. Science 338:270–273. doi: 10.1126/science.1223252
    • (2012) Science , vol.338 , pp. 270-273
    • Wimmer, G.E.1    Shohamy, D.2
  • 56
    • 84864935116
    • Dopamine enhances model-based over model-free choice behavior
    • Wunderlich K, Smittenaar P, Dolan RJ. 2012. Dopamine enhances model-based over model-free choice behavior. Neuron 75:418–424. doi: 10.1016/j.neuron.2012.03.042
    • (2012) Neuron , vol.75 , pp. 418-424
    • Wunderlich, K.1    Smittenaar, P.2    Dolan, R.J.3
  • 57
    • 0032509798
    • Increased extracellular dopamine in the nucleus accumbens of the rat during associative learning of neutral stimuli
    • Young AMJ, Ahier RG, Upton RL, Joseph MH, Gray JA. 1998. Increased extracellular dopamine in the nucleus accumbens of the rat during associative learning of neutral stimuli. Neuroscience 83:1175–1183. doi: 10.1016/S0306-4522(97)00483-1
    • (1998) Neuroscience , vol.83 , pp. 1175-1183
    • Young, A.M.J.1    Ahier, R.G.2    Upton, R.L.3    Joseph, M.H.4    Gray, J.A.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.