메뉴 건너뛰기




Volumn 53, Issue 3, 2009, Pages 139-154

Reinforcement learning in the brain

Author keywords

[No Author keywords available]

Indexed keywords


EID: 67349283062     PISSN: 00222496     EISSN: 10960880     Source Type: Journal    
DOI: 10.1016/j.jmp.2008.12.005     Document Type: Article
Times cited : (522)

References (174)
  • 1
    • 84946268134 scopus 로고
    • Variations in the sensitivity of instrumental responding to reinforcer devaluation
    • Adams C.D. Variations in the sensitivity of instrumental responding to reinforcer devaluation. Quarterly Journal of Experimental Psychology 34B (1982) 77-98
    • (1982) Quarterly Journal of Experimental Psychology , vol.34 B , pp. 77-98
    • Adams, C.D.1
  • 3
    • 0033939592 scopus 로고    scopus 로고
    • Dopamine and synaptic plasticity in the neostriatum
    • Arbuthnott G.W., Ingham C.A., and Wickens J.R. Dopamine and synaptic plasticity in the neostriatum. Journal of Anatomy 196 Pt 4 (2000) 587-596
    • (2000) Journal of Anatomy , vol.196 , Issue.PART 4 , pp. 587-596
    • Arbuthnott, G.W.1    Ingham, C.A.2    Wickens, J.R.3
  • 4
    • 0036889792 scopus 로고    scopus 로고
    • The neural basis of functional brain imaging signals
    • Attwell D., and Iadecola C. The neural basis of functional brain imaging signals. Trends in Neuroscience 25 12 (2002) 621-625
    • (2002) Trends in Neuroscience , vol.25 , Issue.12 , pp. 621-625
    • Attwell, D.1    Iadecola, C.2
  • 5
    • 0004370245 scopus 로고
    • Advantage updating
    • Tech. rep. no. WL-TR-93-1146. Dayton, OH: Wright-Patterson Air Force Base
    • Baird, L. C. (1993). Advantage updating. Tech. rep. no. WL-TR-93-1146. Dayton, OH: Wright-Patterson Air Force Base
    • (1993)
    • Baird, L.C.1
  • 6
    • 85151728371 scopus 로고
    • Residual algorithms: Reinforcement learning with function approximation
    • Prieditis A., and Russell S. (Eds), Morgan Kaufman, San Mateo, CA
    • Baird L.C. Residual algorithms: Reinforcement learning with function approximation. In: Prieditis A., and Russell S. (Eds). Proceedings of the 12th international conference on machine learning (1995), Morgan Kaufman, San Mateo, CA 30-37
    • (1995) Proceedings of the 12th international conference on machine learning , pp. 30-37
    • Baird, L.C.1
  • 7
    • 28444472936 scopus 로고    scopus 로고
    • Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits
    • Balleine B.W. Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits. Physiology and Behaviour 86 5 (2005) 717-730
    • (2005) Physiology and Behaviour , vol.86 , Issue.5 , pp. 717-730
    • Balleine, B.W.1
  • 8
    • 0031801210 scopus 로고    scopus 로고
    • Goal-directed instrumental action: Contingency and incentive learning and their cortical substrates
    • Balleine B.W., and Dickinson A. Goal-directed instrumental action: Contingency and incentive learning and their cortical substrates. Neuropharmacology 37 4-5 (1998) 407-419
    • (1998) Neuropharmacology , vol.37 , Issue.4-5 , pp. 407-419
    • Balleine, B.W.1    Dickinson, A.2
  • 10
    • 0028575929 scopus 로고
    • Reinforcement learning control
    • Barto A.G. Reinforcement learning control. Current Opinion in Neurobiology 4 6 (1994) 888-893
    • (1994) Current Opinion in Neurobiology , vol.4 , Issue.6 , pp. 888-893
    • Barto, A.G.1
  • 11
    • 0000541213 scopus 로고
    • Adaptive critic and the basal ganglia
    • Houk J.C., Davis J.L., and Beiser D.G. (Eds), MIT Press, Cambridge
    • Barto A.G. Adaptive critic and the basal ganglia. In: Houk J.C., Davis J.L., and Beiser D.G. (Eds). Models of information processing in the basal ganglia (1995), MIT Press, Cambridge 215-232
    • (1995) Models of information processing in the basal ganglia , pp. 215-232
    • Barto, A.G.1
  • 12
    • 33847006386 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Barto A.G., and Mahadevan S. Recent advances in hierarchical reinforcement learning. Discrete Event Systems Journal 13 (2003) 44-77
    • (2003) Discrete Event Systems Journal , vol.13 , pp. 44-77
    • Barto, A.G.1    Mahadevan, S.2
  • 16
    • 21544435722 scopus 로고    scopus 로고
    • Midbrain dopamine neurons encode a quantitative reward prediction error signal
    • Bayer H.M., and Glimcher P.W. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47 1 (2005) 129-141
    • (2005) Neuron , vol.47 , Issue.1 , pp. 129-141
    • Bayer, H.M.1    Glimcher, P.W.2
  • 17
    • 34548778113 scopus 로고    scopus 로고
    • Statistics of midbrain dopamine neuron spike trains in the awake primate
    • Bayer H.M., Lau B., and Glimcher P.W. Statistics of midbrain dopamine neuron spike trains in the awake primate. Journal of Neurophysiology 98 3 (2007) 1428-1439
    • (2007) Journal of Neurophysiology , vol.98 , Issue.3 , pp. 1428-1439
    • Bayer, H.M.1    Lau, B.2    Glimcher, P.W.3
  • 19
    • 0003787146 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Bellman R.E. Dynamic programming (1957), Princeton University Press, Princeton, NJ
    • (1957) Dynamic programming
    • Bellman, R.E.1
  • 20
    • 0344442860 scopus 로고    scopus 로고
    • 'Passive stabilization' of striatal extracellular dopamine across the lesion spectrum encompassing the presymptomatic phase of Parkinson's disease: A voltammetric study in the 6-OHDA lesioned rat
    • Bergstrom B.P., and Garris P.A. 'Passive stabilization' of striatal extracellular dopamine across the lesion spectrum encompassing the presymptomatic phase of Parkinson's disease: A voltammetric study in the 6-OHDA lesioned rat. Journal of Neurochemistry 87 5 (2003) 1224-1236
    • (2003) Journal of Neurochemistry , vol.87 , Issue.5 , pp. 1224-1236
    • Bergstrom, B.P.1    Garris, P.A.2
  • 22
    • 14044268843 scopus 로고    scopus 로고
    • Espresso reward learning, hold the dopamine: Theoretical comment on robinson et al. (2005)
    • Berridge K.C. Espresso reward learning, hold the dopamine: Theoretical comment on robinson et al. (2005). Behavioral Neuroscience 119 1 (2005) 336-341
    • (2005) Behavioral Neuroscience , vol.119 , Issue.1 , pp. 336-341
    • Berridge, K.C.1
  • 23
    • 33847634405 scopus 로고    scopus 로고
    • The debate over dopamine's role in reward: The case for incentive salience
    • Berridge K.C. The debate over dopamine's role in reward: The case for incentive salience. Psychopharmacology (Berl) 191 3 (2007) 391-431
    • (2007) Psychopharmacology (Berl) , vol.191 , Issue.3 , pp. 391-431
    • Berridge, K.C.1
  • 24
    • 0032423613 scopus 로고    scopus 로고
    • What is the role of dopamine in reward: Hedonic impact, reward learning, or incentive salience?
    • Berridge K.C., and Robinson T.E. What is the role of dopamine in reward: Hedonic impact, reward learning, or incentive salience?. Brain Research Review 28 (1998) 309-369
    • (1998) Brain Research Review , vol.28 , pp. 309-369
    • Berridge, K.C.1    Robinson, T.E.2
  • 26
    • 43049099970 scopus 로고    scopus 로고
    • Hierarchical models of behavior and prefrontal function
    • Botvinick M.M. Hierarchical models of behavior and prefrontal function. Trends in Cognitive Sciences 12 5 (2008) 201-208
    • (2008) Trends in Cognitive Sciences , vol.12 , Issue.5 , pp. 201-208
    • Botvinick, M.M.1
  • 27
    • 67349105150 scopus 로고    scopus 로고
    • Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
    • Botvinick M.M., Niv Y., and Barto A.C. Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition (2008)
    • (2008) Cognition
    • Botvinick, M.M.1    Niv, Y.2    Barto, A.C.3
  • 28
    • 0001491619 scopus 로고
    • A mathematical model for simple learning
    • Bush R.R., and Mosteller F. A mathematical model for simple learning. Psychological Review 58 (1951) 313-323
    • (1951) Psychological Review , vol.58 , pp. 313-323
    • Bush, R.R.1    Mosteller, F.2
  • 30
    • 0022644104 scopus 로고
    • Stimulation of the lateral habenula inhibits dopamine-containing neurons in the substantia nigra and ventral tegmental area of the rat
    • Christoph G.R., Leonzio R.J., and Wilcox K.S. Stimulation of the lateral habenula inhibits dopamine-containing neurons in the substantia nigra and ventral tegmental area of the rat. Journal of Neuroscience 6 3 (1986) 613-619
    • (1986) Journal of Neuroscience , vol.6 , Issue.3 , pp. 613-619
    • Christoph, G.R.1    Leonzio, R.J.2    Wilcox, K.S.3
  • 31
    • 33749591279 scopus 로고    scopus 로고
    • Rapid alterations in corticostriatal ensemble coordination during acute dopamine-dependent motor dysfunction
    • Costa R.M., Lin S.-C., Sotnikova T.D., Cyr M., Gainetdinov R.R., Caron M.G., et al. Rapid alterations in corticostriatal ensemble coordination during acute dopamine-dependent motor dysfunction. Neuron 52 2 (2006) 359-369
    • (2006) Neuron , vol.52 , Issue.2 , pp. 359-369
    • Costa, R.M.1    Lin, S.-C.2    Sotnikova, T.D.3    Cyr, M.4    Gainetdinov, R.R.5    Caron, M.G.6
  • 32
    • 85156225263 scopus 로고    scopus 로고
    • Timing and partial observability in the dopamine system
    • Dietterich T., Becker S., and Ghahramani Z. (Eds), MIT Press, Cambridge, MA
    • Daw N.D., Courville A.C., and Touretzky D.S. Timing and partial observability in the dopamine system. In: Dietterich T., Becker S., and Ghahramani Z. (Eds). Advances in neural information processing systems Vol. 14 (2002), MIT Press, Cambridge, MA
    • (2002) Advances in neural information processing systems , vol.14
    • Daw, N.D.1    Courville, A.C.2    Touretzky, D.S.3
  • 33
    • 33745787929 scopus 로고    scopus 로고
    • Representation and timing in theories of the dopamine system
    • Daw N.D., Courville A.C., and Touretzky D.S. Representation and timing in theories of the dopamine system. Neural Computation 18 (2006) 1637-1677
    • (2006) Neural Computation , vol.18 , pp. 1637-1677
    • Daw, N.D.1    Courville, A.C.2    Touretzky, D.S.3
  • 34
    • 0036592008 scopus 로고    scopus 로고
    • Opponent interactions between serotonin and dopamine
    • Daw N.D., Kakade S., and Dayan P. Opponent interactions between serotonin and dopamine. Neural Networks 15 4-6 (2002) 603-616
    • (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 603-616
    • Daw, N.D.1    Kakade, S.2    Dayan, P.3
  • 35
    • 28044450875 scopus 로고    scopus 로고
    • Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    • Daw N.D., Niv Y., and Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience 8 12 (2005) 1704-1711
    • (2005) Nature Neuroscience , vol.8 , Issue.12 , pp. 1704-1711
    • Daw, N.D.1    Niv, Y.2    Dayan, P.3
  • 36
    • 33745223257 scopus 로고    scopus 로고
    • Cortical substrates for exploratory decisions in humans
    • Daw N.D., O'Doherty J.P., Dayan P., Seymour B., and Dolan R.J. Cortical substrates for exploratory decisions in humans. Nature 441 7095 (2006) 876-879
    • (2006) Nature , vol.441 , Issue.7095 , pp. 876-879
    • Daw, N.D.1    O'Doherty, J.P.2    Dayan, P.3    Seymour, B.4    Dolan, R.J.5
  • 37
    • 0036835734 scopus 로고    scopus 로고
    • Long-term reward prediction in TD models of the dopamine system
    • Daw N.D., and Touretzky D.S. Long-term reward prediction in TD models of the dopamine system. Neural Computation 14 11 (2002) 2567-2583
    • (2002) Neural Computation , vol.14 , Issue.11 , pp. 2567-2583
    • Daw, N.D.1    Touretzky, D.S.2
  • 38
    • 34547536392 scopus 로고    scopus 로고
    • Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens
    • Day J.J., Roitman M.F., Wightman R.M., and Carelli R.M. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nature Neuroscience 10 8 (2007) 1020-1028
    • (2007) Nature Neuroscience , vol.10 , Issue.8 , pp. 1020-1028
    • Day, J.J.1    Roitman, M.F.2    Wightman, R.M.3    Carelli, R.M.4
  • 41
    • 52049107354 scopus 로고    scopus 로고
    • Reinforcement learning: The good, the bad and the ugly
    • Dayan P., and Niv Y. Reinforcement learning: The good, the bad and the ugly. Current Opinion in Neurobiology 18 2 (2008) 185-196
    • (2008) Current Opinion in Neurobiology , vol.18 , Issue.2 , pp. 185-196
    • Dayan, P.1    Niv, Y.2
  • 42
    • 33749055062 scopus 로고    scopus 로고
    • The misbehavior of value and the discipline of the will
    • Dayan P., Niv Y., Seymour B., and Daw N.D. The misbehavior of value and the discipline of the will. Neural Networks 19 8 (2006) 1153-1160
    • (2006) Neural Networks , vol.19 , Issue.8 , pp. 1153-1160
    • Dayan, P.1    Niv, Y.2    Seymour, B.3    Daw, N.D.4
  • 43
    • 0042697569 scopus 로고    scopus 로고
    • Dorsal striatum responses to reward and punishment: Effects of valence and magnitude manipulations
    • Delgado M.R., Locke H.M., Stenger V.A., and Fiez J.A. Dorsal striatum responses to reward and punishment: Effects of valence and magnitude manipulations. Cognitive Affect Behavioural Neuroscience 3 1 (2003) 27-38
    • (2003) Cognitive Affect Behavioural Neuroscience , vol.3 , Issue.1 , pp. 27-38
    • Delgado, M.R.1    Locke, H.M.2    Stenger, V.A.3    Fiez, J.A.4
  • 45
    • 0043250430 scopus 로고    scopus 로고
    • The role of learning in the operation of motivational systems
    • Gallistel C.R. (Ed), John Wiley & Sons, New York
    • Dickinson A., and Balleine B.W. The role of learning in the operation of motivational systems. In: Gallistel C.R. (Ed). Learning, motivation and emotion Vol. 3 (2002), John Wiley & Sons, New York 497-533
    • (2002) Learning, motivation and emotion , vol.3 , pp. 497-533
    • Dickinson, A.1    Balleine, B.W.2
  • 46
    • 0001860705 scopus 로고
    • The effect of instrumental training contingency on susceptibility to reinforcer devaluation
    • Dickinson A., Nicholas D.J., and Adams C.D. The effect of instrumental training contingency on susceptibility to reinforcer devaluation. Quarterly Journal of Experimental Psychology 35B (1983) 35-51
    • (1983) Quarterly Journal of Experimental Psychology , vol.35 B , pp. 35-51
    • Dickinson, A.1    Nicholas, D.J.2    Adams, C.D.3
  • 47
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • Doya K. Reinforcement learning in continuous time and space. Neural Computation 12 1 (2000) 219-245
    • (2000) Neural Computation , vol.12 , Issue.1 , pp. 219-245
    • Doya, K.1
  • 48
    • 0037459319 scopus 로고    scopus 로고
    • Discrete coding of reward probability and uncertainty by dopamine neurons
    • Fiorillo C.D., Tobler P.N., and Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299 5614 (2003) 1898-1902
    • (2003) Science , vol.299 , Issue.5614 , pp. 1898-1902
    • Fiorillo, C.D.1    Tobler, P.N.2    Schultz, W.3
  • 49
    • 0041859307 scopus 로고    scopus 로고
    • Afferent modulation of dopamine neuron firing differentially regulates tonic and phasic dopamine transmission
    • Floresco S.B., West A.R., Ash B., Moore H., and Grace A.A. Afferent modulation of dopamine neuron firing differentially regulates tonic and phasic dopamine transmission. Nature Neuroscience 6 9 (2003) 968-973
    • (2003) Nature Neuroscience , vol.6 , Issue.9 , pp. 968-973
    • Floresco, S.B.1    West, A.R.2    Ash, B.3    Moore, H.4    Grace, A.A.5
  • 51
    • 23744454605 scopus 로고    scopus 로고
    • Afferents of the ventral tegmental area in the rat-anatomical substratum for integrative functions
    • Geisler S., and Zahm D.S. Afferents of the ventral tegmental area in the rat-anatomical substratum for integrative functions. The Journal of Comparative Neurology 490 3 (2005) 270-294
    • (2005) The Journal of Comparative Neurology , vol.490 , Issue.3 , pp. 270-294
    • Geisler, S.1    Zahm, D.S.2
  • 52
    • 0344787262 scopus 로고
    • Scalar Expectancy Theory and Weber's law in animal timing
    • Gibbon J. Scalar Expectancy Theory and Weber's law in animal timing. Psychological Review 84 3 (1977) 279-325
    • (1977) Psychological Review , vol.84 , Issue.3 , pp. 279-325
    • Gibbon, J.1
  • 53
    • 22544464049 scopus 로고    scopus 로고
    • Dopaminergic modulation of limbic and cortical drive of nucleus accumbens in goal-directed behavior
    • Goto Y., and Grace A.A. Dopaminergic modulation of limbic and cortical drive of nucleus accumbens in goal-directed behavior. Nature Neuroscience 8 (2005) 805-812
    • (2005) Nature Neuroscience , vol.8 , pp. 805-812
    • Goto, Y.1    Grace, A.A.2
  • 54
    • 0026059697 scopus 로고
    • Phasic versus tonic dopamine release and the modulation of dopamine system responsivity: A hypothesis for the etiology of schizophrenia
    • Grace A.A. Phasic versus tonic dopamine release and the modulation of dopamine system responsivity: A hypothesis for the etiology of schizophrenia. Neuroscience 41 1 (1991) 1-24
    • (1991) Neuroscience , vol.41 , Issue.1 , pp. 1-24
    • Grace, A.A.1
  • 55
    • 33748188120 scopus 로고    scopus 로고
    • The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans
    • Hampton A.N., Bossaerts P., and O'Doherty J.P. The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. Journal of Neuroscience 26 32 (2006) 8360-8367
    • (2006) Journal of Neuroscience , vol.26 , Issue.32 , pp. 8360-8367
    • Hampton, A.N.1    Bossaerts, P.2    O'Doherty, J.P.3
  • 56
    • 33846635110 scopus 로고    scopus 로고
    • Decoding the neural substrates of reward-related decision making with functional mri
    • Hampton A.N., and O'Doherty J.P. Decoding the neural substrates of reward-related decision making with functional mri. Proceedings of the National Academy of Sciences USA 104 4 (2007) 1377-1382
    • (2007) Proceedings of the National Academy of Sciences USA , vol.104 , Issue.4 , pp. 1377-1382
    • Hampton, A.N.1    O'Doherty, J.P.2
  • 57
    • 0032984336 scopus 로고    scopus 로고
    • Amygdala circuitry in attentional and representational processes
    • Holland P.C., and Gallagher M. Amygdala circuitry in attentional and representational processes. Trends in Cognitive Sciences 3 2 (1999) 65-73
    • (1999) Trends in Cognitive Sciences , vol.3 , Issue.2 , pp. 65-73
    • Holland, P.C.1    Gallagher, M.2
  • 58
    • 33644688754 scopus 로고    scopus 로고
    • Dopamine neurons report an error in the temporal prediction of reward during learning
    • Hollerman J.R., and Schultz W. Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience 1 (1998) 304-309
    • (1998) Nature Neuroscience , vol.1 , pp. 304-309
    • Hollerman, J.R.1    Schultz, W.2
  • 59
    • 0034061668 scopus 로고    scopus 로고
    • Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events
    • Horvitz J.C. Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events. Neuroscience 96 4 (2000) 651-656
    • (2000) Neuroscience , vol.96 , Issue.4 , pp. 651-656
    • Horvitz, J.C.1
  • 60
    • 0002861883 scopus 로고
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • Houk J.C., Davis J.L., and Beiser D.G. (Eds), MIT Press, Cambridge
    • Houk J.C., Adams J.L., and Barto A.G. A model of how the basal ganglia generate and use neural signals that predict reinforcement. In: Houk J.C., Davis J.L., and Beiser D.G. (Eds). Models of information processing in the basal ganglia (1995), MIT Press, Cambridge 249-270
    • (1995) Models of information processing in the basal ganglia , pp. 249-270
    • Houk, J.C.1    Adams, J.L.2    Barto, A.G.3
  • 62
    • 0033461157 scopus 로고    scopus 로고
    • The role of nucleus accumbens dopamine in motivated behavior: A unifying interpretation with special reference to reward-seeking
    • Ikemoto S., and Panksepp J. The role of nucleus accumbens dopamine in motivated behavior: A unifying interpretation with special reference to reward-seeking. Brain Research Reviews 31 (1999) 6-41
    • (1999) Brain Research Reviews , vol.31 , pp. 6-41
    • Ikemoto, S.1    Panksepp, J.2
  • 64
    • 0036592026 scopus 로고    scopus 로고
    • Actor-critic models of the basal ganglia: New anatomical and computational perspectives
    • Joel D., Niv Y., and Ruppin E. Actor-critic models of the basal ganglia: New anatomical and computational perspectives. Neural Networks 15 (2002) 535-547
    • (2002) Neural Networks , vol.15 , pp. 535-547
    • Joel, D.1    Niv, Y.2    Ruppin, E.3
  • 65
    • 0027986867 scopus 로고
    • The organization of the basal ganglia-thalamocortical cicuits: open interconnected rather than closed segregated
    • Joel D., and Weiner I. The organization of the basal ganglia-thalamocortical cicuits: open interconnected rather than closed segregated. Neuroscience 63 (1994) 363-379
    • (1994) Neuroscience , vol.63 , pp. 363-379
    • Joel, D.1    Weiner, I.2
  • 66
    • 0002875876 scopus 로고    scopus 로고
    • Striatal contention scheduling and the split circuit scheme of basal ganglia-thalamocortical circuitry: From anatomy to behaviour
    • Miller R., and Wickens J. (Eds), Harwood Academic Publishers
    • Joel D., and Weiner I. Striatal contention scheduling and the split circuit scheme of basal ganglia-thalamocortical circuitry: From anatomy to behaviour. In: Miller R., and Wickens J. (Eds). Conceptual advances in brain research: Brain dynamics and the striatal complex (1999), Harwood Academic Publishers 209-236
    • (1999) Conceptual advances in brain research: Brain dynamics and the striatal complex , pp. 209-236
    • Joel, D.1    Weiner, I.2
  • 67
    • 0031309579 scopus 로고    scopus 로고
    • Normative and descriptive models of decision making: Time discounting and risk sensitivity
    • Bock G.R., and Cardew G. (Eds), Wiley, Chichester
    • Kacelnik A. Normative and descriptive models of decision making: Time discounting and risk sensitivity. In: Bock G.R., and Cardew G. (Eds). Characterizing human psychological adaptations: Ciba Foundation symposium 208 (1997), Wiley, Chichester 51-70
    • (1997) Characterizing human psychological adaptations: Ciba Foundation symposium 208 , pp. 51-70
    • Kacelnik, A.1
  • 68
    • 0036592029 scopus 로고    scopus 로고
    • Dopamine: Generalization and bonuses
    • Kakade S., and Dayan P. Dopamine: Generalization and bonuses. Neural Networks 15 4-6 (2002) 549-559
    • (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 549-559
    • Kakade, S.1    Dayan, P.2
  • 69
    • 0002981963 scopus 로고
    • Predictability, surprise, attention, and conditioning
    • Campbell B.A., and Church R.M. (Eds), Appleton-Century-Crofts, New York
    • Kamin L.J. Predictability, surprise, attention, and conditioning. In: Campbell B.A., and Church R.M. (Eds). Punishment and aversive behavior (1969), Appleton-Century-Crofts, New York 242-259
    • (1969) Punishment and aversive behavior , pp. 242-259
    • Kamin, L.J.1
  • 71
    • 0012686548 scopus 로고    scopus 로고
    • Associative representationsof emotionally significant outcomes
    • Moore S., and Oaksford M. (Eds), John Benjamins Publishing Company, Amsterdam, Philadelphia
    • Killcross S., and Blundell P. Associative representationsof emotionally significant outcomes. In: Moore S., and Oaksford M. (Eds). Emotional cognition. from brain to behaviour Vol. 44 (2002), John Benjamins Publishing Company, Amsterdam, Philadelphia 35-73
    • (2002) Emotional cognition. from brain to behaviour , vol.44 , pp. 35-73
    • Killcross, S.1    Blundell, P.2
  • 72
    • 0037382264 scopus 로고    scopus 로고
    • Coordination of actions and habits in the medial prefrontal conrtex of rats
    • Killcross S., and Coutureau E. Coordination of actions and habits in the medial prefrontal conrtex of rats. Cerebral Cortex 13 (2003) 400-408
    • (2003) Cerebral Cortex , vol.13 , pp. 400-408
    • Killcross, S.1    Coutureau, E.2
  • 73
    • 0035882897 scopus 로고    scopus 로고
    • Anticipation of increasing monetary reward selectively recruits nucleus accumbens
    • Knutson B., Adams C.M., Fong G.W., and Hommer D. Anticipation of increasing monetary reward selectively recruits nucleus accumbens. Journal of Neuroscience 21 16 (2001) RC159
    • (2001) Journal of Neuroscience , vol.21 , Issue.16
    • Knutson, B.1    Adams, C.M.2    Fong, G.W.3    Hommer, D.4
  • 74
    • 67349191143 scopus 로고    scopus 로고
    • Representation of subjective value in the striatum
    • Glimcher P.W., Camerer C., Fehr E., and Poldrack R. (Eds), Academic Press, New York, NY
    • Knutson B., Delgado M.R., and Philips P.E.M. Representation of subjective value in the striatum. In: Glimcher P.W., Camerer C., Fehr E., and Poldrack R. (Eds). Neuroeconomics: Decision making and the brain (2008), Academic Press, New York, NY
    • (2008) Neuroeconomics: Decision making and the brain
    • Knutson, B.1    Delgado, M.R.2    Philips, P.E.M.3
  • 75
    • 0035807944 scopus 로고    scopus 로고
    • Dissociation of reward anticipation and outcome with event-related fMRI
    • Knutson B., Fong G.W., Adams C.M., Varner J.L., and Hommer D. Dissociation of reward anticipation and outcome with event-related fMRI. Neuroreport 12 17 (2001) 3683-3687
    • (2001) Neuroreport , vol.12 , Issue.17 , pp. 3683-3687
    • Knutson, B.1    Fong, G.W.2    Adams, C.M.3    Varner, J.L.4    Hommer, D.5
  • 76
    • 0037328795 scopus 로고    scopus 로고
    • A region of mesial prefrontal cortex tracks monetarily rewarding outcomes: Characterization with rapid event-related fMRI
    • Knutson B., Fong G.W., Bennett S.M., Adams C.M., and Hommer D. A region of mesial prefrontal cortex tracks monetarily rewarding outcomes: Characterization with rapid event-related fMRI. Neuroimage 18 2 (2003) 263-272
    • (2003) Neuroimage , vol.18 , Issue.2 , pp. 263-272
    • Knutson, B.1    Fong, G.W.2    Bennett, S.M.3    Adams, C.M.4    Hommer, D.5
  • 77
    • 33847651376 scopus 로고    scopus 로고
    • Linking nucleus accumbens dopamine and blood oxygenation
    • Knutson B., and Gibbs S.E.B. Linking nucleus accumbens dopamine and blood oxygenation. Psychopharmacology (Berl) 191 3 (2007) 813-822
    • (2007) Psychopharmacology (Berl) , vol.191 , Issue.3 , pp. 813-822
    • Knutson, B.1    Gibbs, S.E.B.2
  • 78
    • 34447637794 scopus 로고    scopus 로고
    • Reward prediction error computation in the pedunculopontine tegmental nucleus neurons
    • Kobayashi Y., and Okada K.-I. Reward prediction error computation in the pedunculopontine tegmental nucleus neurons. Annals of the New York Academy of Science 1104 (2007) 310-323
    • (2007) Annals of the New York Academy of Science , vol.1104 , pp. 310-323
    • Kobayashi, Y.1    Okada, K.-I.2
  • 81
    • 0017915793 scopus 로고
    • The Rescorla-Wagner model: Losses in associative strength in compound conditioned stimuli
    • Kremer E.F. The Rescorla-Wagner model: Losses in associative strength in compound conditioned stimuli. Journal of Experimental Psychology: Animal Behavior Processes 4 1 (1978) 22-36
    • (1978) Journal of Experimental Psychology: Animal Behavior Processes , vol.4 , Issue.1 , pp. 22-36
    • Kremer, E.F.1
  • 83
    • 41549139223 scopus 로고    scopus 로고
    • Differential tonic influence of lateral habenula on prefrontal cortex and nucleus accumbens dopamine release
    • Lecourtier L., Defrancesco A., and Moghaddam B. Differential tonic influence of lateral habenula on prefrontal cortex and nucleus accumbens dopamine release. European Journal of Neuroscience 27 7 (2008) 1755-1762
    • (2008) European Journal of Neuroscience , vol.27 , Issue.7 , pp. 1755-1762
    • Lecourtier, L.1    Defrancesco, A.2    Moghaddam, B.3
  • 85
    • 0026505520 scopus 로고
    • Responses of monkey dopamine neurons during learning of behavioral reactions
    • Ljungberg T., Apicella P., and Schultz W. Responses of monkey dopamine neurons during learning of behavioral reactions. Journal of Neurophysiology 67 1 (1992) 145-163
    • (1992) Journal of Neurophysiology , vol.67 , Issue.1 , pp. 145-163
    • Ljungberg, T.1    Apicella, P.2    Schultz, W.3
  • 86
    • 35548949006 scopus 로고    scopus 로고
    • Genetic control of instrumental conditioning by striatopallidal neuron-specific S1P receptor Gpr6
    • Lobo M.K., Cui Y., Ostlund S.B., Balleine B.W., and Yang X.W. Genetic control of instrumental conditioning by striatopallidal neuron-specific S1P receptor Gpr6. Nature Neuroscience 10 11 (2007) 1395-1397
    • (2007) Nature Neuroscience , vol.10 , Issue.11 , pp. 1395-1397
    • Lobo, M.K.1    Cui, Y.2    Ostlund, S.B.3    Balleine, B.W.4    Yang, X.W.5
  • 87
    • 0038719640 scopus 로고    scopus 로고
    • The underpinnings of the BOLD functional magnetic resonance imaging signal
    • Logothetis N.K. The underpinnings of the BOLD functional magnetic resonance imaging signal. Journal of Neuroscience 23 10 (2003) 3963-3971
    • (2003) Journal of Neuroscience , vol.23 , Issue.10 , pp. 3963-3971
    • Logothetis, N.K.1
  • 90
    • 34347343926 scopus 로고    scopus 로고
    • Lateral habenula as a source of negative reward signals in dopamine neurons
    • Matsumoto M., and Hikosaka O. Lateral habenula as a source of negative reward signals in dopamine neurons. Nature 447 (2007) 1111-1115
    • (2007) Nature , vol.447 , pp. 1111-1115
    • Matsumoto, M.1    Hikosaka, O.2
  • 91
    • 0037650217 scopus 로고    scopus 로고
    • Temporal prediction errors in a passive learning task activate human striatum
    • McClure S.M., Berns G.S., and Montague P.R. Temporal prediction errors in a passive learning task activate human striatum. Neuron 38 2 (2003) 339-346
    • (2003) Neuron , vol.38 , Issue.2 , pp. 339-346
    • McClure, S.M.1    Berns, G.S.2    Montague, P.R.3
  • 92
    • 0142058800 scopus 로고    scopus 로고
    • A computational substrate for incentive salience
    • McClure S.M., Daw N.D., and Montague P.R. A computational substrate for incentive salience. Trends in Neuroscience 26 8 (2003) 423-428
    • (2003) Trends in Neuroscience , vol.26 , Issue.8 , pp. 423-428
    • McClure, S.M.1    Daw, N.D.2    Montague, P.R.3
  • 93
    • 5144221542 scopus 로고    scopus 로고
    • Neural correlates of behavioral preference for culturally familiar drinks
    • McClure S.M., Li J., Tomlin D., Cypert K.S., Montague L.M., and Montague P.R. Neural correlates of behavioral preference for culturally familiar drinks. Neuron 44 2 (2004) 379-387
    • (2004) Neuron , vol.44 , Issue.2 , pp. 379-387
    • McClure, S.M.1    Li, J.2    Tomlin, D.3    Cypert, K.S.4    Montague, L.M.5    Montague, P.R.6
  • 94
    • 34548542836 scopus 로고    scopus 로고
    • Temporal difference modeling of the blood-oxygen level dependent response during aversive conditioning in humans: Effects of dopaminergic modulation
    • Menon M., Jensen J., Vitcu I., Graff-Guerrero A., Crawley A., Smith M.A., et al. Temporal difference modeling of the blood-oxygen level dependent response during aversive conditioning in humans: Effects of dopaminergic modulation. Biological Psychiatry 62 7 (2007) 765-772
    • (2007) Biological Psychiatry , vol.62 , Issue.7 , pp. 765-772
    • Menon, M.1    Jensen, J.2    Vitcu, I.3    Graff-Guerrero, A.4    Crawley, A.5    Smith, M.A.6
  • 95
    • 0002179838 scopus 로고
    • Corticostriatal cell assemblies in selective attention and in representation of predictable and controllable events
    • Miller R., and Wickens J.R. Corticostriatal cell assemblies in selective attention and in representation of predictable and controllable events. Concepts in Neuroscience 2 1 (1991) 65-95
    • (1991) Concepts in Neuroscience , vol.2 , Issue.1 , pp. 65-95
    • Miller, R.1    Wickens, J.R.2
  • 96
    • 0030026069 scopus 로고    scopus 로고
    • Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli
    • Mirenowicz J., and Schultz W. Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli. Nature 379 (1996) 449-451
    • (1996) Nature , vol.379 , pp. 449-451
    • Mirenowicz, J.1    Schultz, W.2
  • 97
    • 0000708595 scopus 로고
    • Using aperiodic reinforcement for directed self-organization
    • Giles C.L., Hanson S.J., and Cowan J.D. (Eds), Morgan Kaufmann, San Mateo, CA
    • Montague P.R., Dayan P., Nowlan S.J., Pouget A., and Sejnowski T.J. Using aperiodic reinforcement for directed self-organization. In: Giles C.L., Hanson S.J., and Cowan J.D. (Eds). Advances in neural information processing systems Vol. 5 (1993), Morgan Kaufmann, San Mateo, CA 969-976
    • (1993) Advances in neural information processing systems , vol.5 , pp. 969-976
    • Montague, P.R.1    Dayan, P.2    Nowlan, S.J.3    Pouget, A.4    Sejnowski, T.J.5
  • 98
    • 0028972278 scopus 로고
    • Bee foraging in uncertain environments using predictive Hebbian learning
    • Montague P.R., Dayan P., Person C., and Sejnowski T.J. Bee foraging in uncertain environments using predictive Hebbian learning. Nature 377 (1995) 725-728
    • (1995) Nature , vol.377 , pp. 725-728
    • Montague, P.R.1    Dayan, P.2    Person, C.3    Sejnowski, T.J.4
  • 99
    • 0003368918 scopus 로고
    • Foraging in an uncertain environments using predictive hebbian learning
    • Tesauro, and Cowan J.D. (Eds)
    • Montague P.R., Dayan P., and Sejnowski T.J. Foraging in an uncertain environments using predictive hebbian learning. In: Tesauro, and Cowan J.D. (Eds). Advances in neural information processing systems Vol. 6 (1994) 598-605
    • (1994) Advances in neural information processing systems , vol.6 , pp. 598-605
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 100
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive hebbian learning
    • Montague P.R., Dayan P., and Sejnowski T.J. A framework for mesencephalic dopamine systems based on predictive hebbian learning. Journal of Neuroscience 16 5 (1996) 1936-1947
    • (1996) Journal of Neuroscience , vol.16 , Issue.5 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejnowski, T.J.3
  • 102
    • 3242673464 scopus 로고    scopus 로고
    • Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons
    • Morris G., Arkadir D., Nevet A., Vaadia E., and Bergman H. Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons. Neuron 43 1 (2004) 133-143
    • (2004) Neuron , vol.43 , Issue.1 , pp. 133-143
    • Morris, G.1    Arkadir, D.2    Nevet, A.3    Vaadia, E.4    Bergman, H.5
  • 103
  • 104
    • 45549109997 scopus 로고    scopus 로고
    • Reward-dependent modulation of neuronal activity in the primate dorsal raphe nucleus
    • Nakamura K., Matsumoto M., and Hikosaka O. Reward-dependent modulation of neuronal activity in the primate dorsal raphe nucleus. Journal of Neuroscience 28 20 (2008) 5331-5343
    • (2008) Journal of Neuroscience , vol.28 , Issue.20 , pp. 5331-5343
    • Nakamura, K.1    Matsumoto, M.2    Hikosaka, O.3
  • 105
    • 17544368654 scopus 로고    scopus 로고
    • Dopaminergic modulation of neuronal excitability in the striatum and nucleus accumbens
    • Nicola S.M., Surmeier J., and Malenka R.C. Dopaminergic modulation of neuronal excitability in the striatum and nucleus accumbens. Annual Reviews in Neuroscience 23 (2000) 185-215
    • (2000) Annual Reviews in Neuroscience , vol.23 , pp. 185-215
    • Nicola, S.M.1    Surmeier, J.2    Malenka, R.C.3
  • 107
    • 33745774340 scopus 로고    scopus 로고
    • How fast to work: Response vigor, motivation and tonic dopamine
    • Weiss Y., Schölkopf B., and Platt J. (Eds), MIT Press
    • Niv Y., Daw N.D., and Dayan P. How fast to work: Response vigor, motivation and tonic dopamine. In: Weiss Y., Schölkopf B., and Platt J. (Eds). Advances in neural information processing systems: Vol. 18 (2005), MIT Press 1019-1026
    • (2005) Advances in neural information processing systems: Vol. 18 , pp. 1019-1026
    • Niv, Y.1    Daw, N.D.2    Dayan, P.3
  • 109
    • 33847675011 scopus 로고    scopus 로고
    • Tonic dopamine: Opportunity costs and the control of response vigor
    • Niv Y., Daw N.D., Joel D., and Dayan P. Tonic dopamine: Opportunity costs and the control of response vigor. Psychopharmacology (Berl) 191 3 (2007) 507-520
    • (2007) Psychopharmacology (Berl) , vol.191 , Issue.3 , pp. 507-520
    • Niv, Y.1    Daw, N.D.2    Joel, D.3    Dayan, P.4
  • 111
  • 114
    • 0037987978 scopus 로고    scopus 로고
    • Temporal difference learning model accounts for responses in human ventral striatum and orbitofrontal cortex during Pavlovian appetitive learning
    • O'Doherty J., Dayan P., Friston K., Critchley H., and Dolan R. Temporal difference learning model accounts for responses in human ventral striatum and orbitofrontal cortex during Pavlovian appetitive learning. Neuron 38 (2003) 329-337
    • (2003) Neuron , vol.38 , pp. 329-337
    • O'Doherty, J.1    Dayan, P.2    Friston, K.3    Critchley, H.4    Dolan, R.5
  • 115
    • 1942520195 scopus 로고    scopus 로고
    • Dissociable roles of ventral and dorsal striatum in instrumental conditioning
    • O'Doherty J.P., Dayan P., Schultz J., Deichmann R., Friston K., and Dolan R.J. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 304 5669 (2004) 452-454
    • (2004) Science , vol.304 , Issue.5669 , pp. 452-454
    • O'Doherty, J.P.1    Dayan, P.2    Schultz, J.3    Deichmann, R.4    Friston, K.5    Dolan, R.J.6
  • 116
    • 0037186052 scopus 로고    scopus 로고
    • Neural responses during anticipation of a primary taste reward
    • O'Doherty J.P., Deichmann R., Critchley H.D., and Dolan R.J. Neural responses during anticipation of a primary taste reward. Neuron 33 5 (2002) 815-826
    • (2002) Neuron , vol.33 , Issue.5 , pp. 815-826
    • O'Doherty, J.P.1    Deichmann, R.2    Critchley, H.D.3    Dolan, R.J.4
  • 117
    • 0036159133 scopus 로고    scopus 로고
    • Activity in human ventral striatum locked to errors of reward prediction
    • Pagnoni G., Zink C.F., Montague P.R., and Berns G.S. Activity in human ventral striatum locked to errors of reward prediction. Nature Neuroscience 5 2 (2002) 97-98
    • (2002) Nature Neuroscience , vol.5 , Issue.2 , pp. 97-98
    • Pagnoni, G.1    Zink, C.F.2    Montague, P.R.3    Berns, G.S.4
  • 118
    • 0027450928 scopus 로고
    • Anatomical aspects of information processing in primate basal ganglia
    • Parent A., and Hazrati L.N. Anatomical aspects of information processing in primate basal ganglia. Trends in Neurosciences 16 3 (1993) 111-116
    • (1993) Trends in Neurosciences , vol.16 , Issue.3 , pp. 111-116
    • Parent, A.1    Hazrati, L.N.2
  • 120
    • 33748302924 scopus 로고    scopus 로고
    • Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans
    • Pessiglione M., Seymour B., Flandin G., Dolan R.J., and Frith C.D. Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature 442 7106 (2006) 1042-1045
    • (2006) Nature , vol.442 , Issue.7106 , pp. 1042-1045
    • Pessiglione, M.1    Seymour, B.2    Flandin, G.3    Dolan, R.J.4    Frith, C.D.5
  • 121
    • 33746711623 scopus 로고    scopus 로고
    • Neural differentiation of expected reward and risk in human subcortical structures
    • Preuschoff K., Bossaerts P., and Quartz S.R. Neural differentiation of expected reward and risk in human subcortical structures. Neuron 51 3 (2006) 381-390
    • (2006) Neuron , vol.51 , Issue.3 , pp. 381-390
    • Preuschoff, K.1    Bossaerts, P.2    Quartz, S.R.3
  • 122
    • 33751184634 scopus 로고    scopus 로고
    • The short-latency dopamine signal: A role in discovering novel actions?
    • Redgrave P., and Gurney K. The short-latency dopamine signal: A role in discovering novel actions?. Nature Reviews Neuroscience 7 12 (2006) 967-975
    • (2006) Nature Reviews Neuroscience , vol.7 , Issue.12 , pp. 967-975
    • Redgrave, P.1    Gurney, K.2
  • 123
    • 0033119561 scopus 로고    scopus 로고
    • Is the short-latency dopamine response too short to signal reward error?
    • Redgrave P., Prescott T.J., and Gurney K. Is the short-latency dopamine response too short to signal reward error?. Trends in Neurosciences 22 4 (1999) 146-151
    • (1999) Trends in Neurosciences , vol.22 , Issue.4 , pp. 146-151
    • Redgrave, P.1    Prescott, T.J.2    Gurney, K.3
  • 124
    • 34548837994 scopus 로고    scopus 로고
    • Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling
    • Redish A.D., Jensen S., Johnson A., and Kurth-Nelson Z. Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling. Psychological Review 114 3 (2007) 784-805
    • (2007) Psychological Review , vol.114 , Issue.3 , pp. 784-805
    • Redish, A.D.1    Jensen, S.2    Johnson, A.3    Kurth-Nelson, Z.4
  • 125
    • 0000636183 scopus 로고
    • Reduction in effectiveness of reinforcement after prior excitatory conditioning
    • Rescorla R.A. Reduction in effectiveness of reinforcement after prior excitatory conditioning. Learning and Motivation 1 (1970) 372-381
    • (1970) Learning and Motivation , vol.1 , pp. 372-381
    • Rescorla, R.A.1
  • 127
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • Black A.H., and Prokasy W.F. (Eds), Appleton-Century-Crofts, New York, NY
    • Rescorla R.A., and Wagner A.R. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In: Black A.H., and Prokasy W.F. (Eds). Classical conditioning II: Current research and theory (1972), Appleton-Century-Crofts, New York, NY 64-99
    • (1972) Classical conditioning II: Current research and theory , pp. 64-99
    • Rescorla, R.A.1    Wagner, A.R.2
  • 129
    • 0035817882 scopus 로고    scopus 로고
    • A cellular mechanism of reward-related learning
    • Reynolds J.N., Hyland B.I., and Wickens J.R. A cellular mechanism of reward-related learning. Nature 413 6851 (2001) 67-70
    • (2001) Nature , vol.413 , Issue.6851 , pp. 67-70
    • Reynolds, J.N.1    Hyland, B.I.2    Wickens, J.R.3
  • 130
    • 36448968271 scopus 로고    scopus 로고
    • Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
    • Roesch M.R., Calu D.J., and Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience 10 12 (2007) 1615-1624
    • (2007) Nature Neuroscience , vol.10 , Issue.12 , pp. 1615-1624
    • Roesch, M.R.1    Calu, D.J.2    Schoenbaum, G.3
  • 131
    • 0025247726 scopus 로고
    • Dopamine neurons of the monkey midbrain: Contingencies of responses to active touch during self-intiated arm movements
    • Romo R., and Schultz W. Dopamine neurons of the monkey midbrain: Contingencies of responses to active touch during self-intiated arm movements. The Journal of Neurophysiology 63 (1990) 592-606
    • (1990) The Journal of Neurophysiology , vol.63 , pp. 592-606
    • Romo, R.1    Schultz, W.2
  • 132
    • 0037010742 scopus 로고    scopus 로고
    • Motivational views of reinforcement: Implications for understanding the behavioral functions of nucleus accumbens dopamine
    • Salamone J.D., and Correa M. Motivational views of reinforcement: Implications for understanding the behavioral functions of nucleus accumbens dopamine. Behavioural Brain Research 137 (2002) 3-25
    • (2002) Behavioural Brain Research , vol.137 , pp. 3-25
    • Salamone, J.D.1    Correa, M.2
  • 133
    • 28144449057 scopus 로고    scopus 로고
    • Representation of action-specific reward values in the striatum
    • Samejima K., Ueda Y., Doya K., and Kimura M. Representation of action-specific reward values in the striatum. Science 310 5752 (2005) 1337-1340
    • (2005) Science , vol.310 , Issue.5752 , pp. 1337-1340
    • Samejima, K.1    Ueda, Y.2    Doya, K.3    Kimura, M.4
  • 134
    • 0001201756 scopus 로고
    • Some studies in machine learning using the game of checkers
    • Samuel A.L. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development 3 (1959) 210-229
    • (1959) IBM Journal of Research and Development , vol.3 , pp. 210-229
    • Samuel, A.L.1
  • 135
    • 36348966690 scopus 로고    scopus 로고
    • Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making
    • Schönberg T., Daw N.D., Joel D., and O'Doherty J.P. Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making. Journal of Neuroscience 27 47 (2007) 12860-12867
    • (2007) Journal of Neuroscience , vol.27 , Issue.47 , pp. 12860-12867
    • Schönberg, T.1    Daw, N.D.2    Joel, D.3    O'Doherty, J.P.4
  • 136
    • 0031867046 scopus 로고    scopus 로고
    • Predictive reward signal of dopamine neurons
    • Schultz W. Predictive reward signal of dopamine neurons. Journal of Neurophysiology 80 (1998) 1-27
    • (1998) Journal of Neurophysiology , vol.80 , pp. 1-27
    • Schultz, W.1
  • 137
    • 0037057755 scopus 로고    scopus 로고
    • Getting formal with dopamine and reward
    • Schultz W. Getting formal with dopamine and reward. Neuron 36 2 (2002) 241-263
    • (2002) Neuron , vol.36 , Issue.2 , pp. 241-263
    • Schultz, W.1
  • 138
    • 0027468102 scopus 로고
    • Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task
    • Schultz W., Apicella P., and Ljungberg T. Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. Journal of Neuroscience 13 3 (1993) 900-913
    • (1993) Journal of Neuroscience , vol.13 , Issue.3 , pp. 900-913
    • Schultz, W.1    Apicella, P.2    Ljungberg, T.3
  • 139
    • 0026442752 scopus 로고
    • Neuronal activity in monkey ventral striatum related to the expectation of reward
    • Schultz W., Apicella P., Scarnati E., and Ljungberg T. Neuronal activity in monkey ventral striatum related to the expectation of reward. Journal of Neuroscience 12 12 (1992) 4595-4610
    • (1992) Journal of Neuroscience , vol.12 , Issue.12 , pp. 4595-4610
    • Schultz, W.1    Apicella, P.2    Scarnati, E.3    Ljungberg, T.4
  • 140
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz W., Dayan P., and Montague P.R. A neural substrate of prediction and reward. Science 275 (1997) 1593-1599
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 141
    • 61449124981 scopus 로고
    • Thinking locally to act globally: A novel approach to reinforcement learning
    • Hillsdale, NJ: Lawrence Erlbaum Associates
    • Schwartz, A. (1993). Thinking locally to act globally: A novel approach to reinforcement learning. In Proceedings of the fifteenth annual conference of the cognitive science society (pp. 906-911). Hillsdale, NJ: Lawrence Erlbaum Associates
    • (1993) Proceedings of the fifteenth annual conference of the cognitive science society , pp. 906-911
    • Schwartz, A.1
  • 142
    • 2942617032 scopus 로고    scopus 로고
    • Temporal difference models describe higher-order learning in humans
    • Seymour B., O'Doherty J.P., Dayan P., Koltzenburg M., Jones A.K., Dolan R.J., et al. Temporal difference models describe higher-order learning in humans. Nature 429 6992 (2004) 664-667
    • (2004) Nature , vol.429 , Issue.6992 , pp. 664-667
    • Seymour, B.1    O'Doherty, J.P.2    Dayan, P.3    Koltzenburg, M.4    Jones, A.K.5    Dolan, R.J.6
  • 143
    • 14344261491 scopus 로고    scopus 로고
    • Using relative novelty to identify useful temporal abstractions in reinforcement learning
    • Simsek, O., & Barto, A. G. (2004). Using relative novelty to identify useful temporal abstractions in reinforcement learning. In 21st international conference on machine learning
    • (2004) 21st international conference on machine learning
    • Simsek, O.1    Barto, A.G.2
  • 144
    • 0037480664 scopus 로고
    • Two types of conditioned reflex and a pseudo type
    • Skinner B.F. Two types of conditioned reflex and a pseudo type. Journal of General Psychology 12 (1935) 66-77
    • (1935) Journal of General Psychology , vol.12 , pp. 66-77
    • Skinner, B.F.1
  • 147
    • 33847202724 scopus 로고
    • Learning to predict by the method of temporal difference
    • Sutton R.S. Learning to predict by the method of temporal difference. Machine Learning 3 (1988) 9-44
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 151
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton R.S., Precup D., and Singh S. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112 (1999) 181-211
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 152
    • 4644290200 scopus 로고    scopus 로고
    • A possible role of midbrain dopamine neurons in short- and long-term adaptation of saccades to position-reward mapping
    • Takikawa Y., Kawagoe R., and Hikosaka O. A possible role of midbrain dopamine neurons in short- and long-term adaptation of saccades to position-reward mapping. Journal of Neurophysiology 92 (2004) 2520-2529
    • (2004) Journal of Neurophysiology , vol.92 , pp. 2520-2529
    • Takikawa, Y.1    Kawagoe, R.2    Hikosaka, O.3
  • 154
    • 0345255891 scopus 로고    scopus 로고
    • Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm
    • Tobler P.N., Dickinson A., and Schultz W. Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm. Journal of Neuroscience 23 32 (2003) 10402-10410
    • (2003) Journal of Neuroscience , vol.23 , Issue.32 , pp. 10402-10410
    • Tobler, P.N.1    Dickinson, A.2    Schultz, W.3
  • 155
    • 14844349975 scopus 로고    scopus 로고
    • Adaptive coding of reward value by dopamine neurons
    • Tobler P.N., Fiorillo C.D., and Schultz W. Adaptive coding of reward value by dopamine neurons. Science 307 5715 (2005) 1642-1645
    • (2005) Science , vol.307 , Issue.5715 , pp. 1642-1645
    • Tobler, P.N.1    Fiorillo, C.D.2    Schultz, W.3
  • 156
    • 33846587385 scopus 로고    scopus 로고
    • The neural basis of loss aversion in decision-making under risk
    • Tom S.M., Fox C.R., Trepel C., and Poldrack R.A. The neural basis of loss aversion in decision-making under risk. Science 315 5811 (2007) 515-518
    • (2007) Science , vol.315 , Issue.5811 , pp. 515-518
    • Tom, S.M.1    Fox, C.R.2    Trepel, C.3    Poldrack, R.A.4
  • 157
    • 1642404961 scopus 로고    scopus 로고
    • Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli
    • Ungless M.A., Magill P.J., and Bolam J.P. Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli. Science 303 5666 (2004) 2040-2042
    • (2004) Science , vol.303 , Issue.5666 , pp. 2040-2042
    • Ungless, M.A.1    Magill, P.J.2    Bolam, J.P.3
  • 158
    • 34247147767 scopus 로고    scopus 로고
    • Determining the neural substrates of goal-directed learning in the human brain
    • Valentin V.V., Dickinson A., and O'Doherty J.P. Determining the neural substrates of goal-directed learning in the human brain. Journal of Neuroscience 27 15 (2007) 4019-4026
    • (2007) Journal of Neuroscience , vol.27 , Issue.15 , pp. 4019-4026
    • Valentin, V.V.1    Dickinson, A.2    O'Doherty, J.P.3
  • 159
    • 0035811464 scopus 로고    scopus 로고
    • Dopamine responses comply with basic assumptions of formal learning theory
    • Waelti P., Dickinson A., and Schultz W. Dopamine responses comply with basic assumptions of formal learning theory. Nature 412 (2001) 43-48
    • (2001) Nature , vol.412 , pp. 43-48
    • Waelti, P.1    Dickinson, A.2    Schultz, W.3
  • 161
    • 0004049895 scopus 로고
    • Unpublished doctoral dissertation, Cambridge University, Cambridge, UK
    • Watkins, C. J. C. H. (1989). Learning with delayed rewards. Unpublished doctoral dissertation, Cambridge University, Cambridge, UK
    • (1989) Learning with delayed rewards
    • Watkins, C.J.C.H.1
  • 162
    • 0013150608 scopus 로고    scopus 로고
    • Dopamine in schizophrenia: Dysfunctional information processing in basal ganglia-thalamocortical split circuits
    • Chiara G.D. (Ed), Springer-Verlag, Berlin
    • Weiner I., and Joel D. Dopamine in schizophrenia: Dysfunctional information processing in basal ganglia-thalamocortical split circuits. In: Chiara G.D. (Ed). Handbook of experimental pharmacology vol. 154/II, dopamine in the CNS II (2002), Springer-Verlag, Berlin 417-472
    • (2002) Handbook of experimental pharmacology vol. 154/II, dopamine in the CNS II , pp. 417-472
    • Weiner, I.1    Joel, D.2
  • 163
    • 0002557583 scopus 로고
    • Advanced forecasting methods for global crisis warning and models of intelligence
    • Werbos P.J. Advanced forecasting methods for global crisis warning and models of intelligence. General Systems Yearbook 22 (1977) 25-38
    • (1977) General Systems Yearbook , vol.22 , pp. 25-38
    • Werbos, P.J.1
  • 164
    • 0029655991 scopus 로고    scopus 로고
    • Dopamine reverses the depression of rat corticostriatal synapses which normally follows high frequency stimulation of cortex in vitro
    • Wickens J.R., Begg A.J., and Arbuthnott G.W. Dopamine reverses the depression of rat corticostriatal synapses which normally follows high frequency stimulation of cortex in vitro. Neuroscience 70 1 (1996) 1-5
    • (1996) Neuroscience , vol.70 , Issue.1 , pp. 1-5
    • Wickens, J.R.1    Begg, A.J.2    Arbuthnott, G.W.3
  • 165
    • 0001785024 scopus 로고
    • Cellular models of reinforcement
    • Houk J.C., Davis J.L., and Beiser D.G. (Eds), MIT Press
    • Wickens J.R., and Kötter R. Cellular models of reinforcement. In: Houk J.C., Davis J.L., and Beiser D.G. (Eds). Models of information processing in the basal ganglia (1995), MIT Press 187-214
    • (1995) Models of information processing in the basal ganglia , pp. 187-214
    • Wickens, J.R.1    Kötter, R.2
  • 166
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8 3 (1992) 229-256
    • (1992) Machine Learning , vol.8 , Issue.3 , pp. 229-256
    • Williams, R.J.1
  • 167
    • 0018100734 scopus 로고
    • Neuroleptic-induced "anhedonia" in rats: pimozide blocks reward quality of food
    • Wise R.A., Spindler J., de Wit H., and Gerberg G.J. Neuroleptic-induced "anhedonia" in rats: pimozide blocks reward quality of food. Science 201 4352 (1978) 262-264
    • (1978) Science , vol.201 , Issue.4352 , pp. 262-264
    • Wise, R.A.1    Spindler, J.2    de Wit, H.3    Gerberg, G.J.4
  • 168
    • 0017976943 scopus 로고
    • Major attenuation of food reward with performance-sparing doses of pimozide in the rat
    • Wise R.A., Spindler J., and Legault L. Major attenuation of food reward with performance-sparing doses of pimozide in the rat. Canadian Journal of Psychology 32 (1978) 77-85
    • (1978) Canadian Journal of Psychology , vol.32 , pp. 77-85
    • Wise, R.A.1    Spindler, J.2    Legault, L.3
  • 169
    • 45249097567 scopus 로고    scopus 로고
    • Striatal activity underlies novelty-based choice in humans
    • Wittmann B.C., Daw N.D., Seymour B., and Dolan R.J. Striatal activity underlies novelty-based choice in humans. Neuron 58 6 (2008) 967-973
    • (2008) Neuron , vol.58 , Issue.6 , pp. 967-973
    • Wittmann, B.C.1    Daw, N.D.2    Seymour, B.3    Dolan, R.J.4
  • 170
    • 0008652822 scopus 로고
    • The method of Pawlow in animal psychology
    • Yerkes R.M., and Morgulis S. The method of Pawlow in animal psychology. Psychological Bulletin 6 (1909) 257-273
    • (1909) Psychological Bulletin , vol.6 , pp. 257-273
    • Yerkes, R.M.1    Morgulis, S.2
  • 171
    • 1642580578 scopus 로고    scopus 로고
    • Lesions of the dosolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning
    • Yin H.H., Knowlton B.J., and Balleine B.W. Lesions of the dosolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. European Journal of Neuroscience 19 (2004) 181-189
    • (2004) European Journal of Neuroscience , vol.19 , pp. 181-189
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3
  • 172
    • 23244461369 scopus 로고    scopus 로고
    • Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning
    • Yin H.H., Knowlton B.J., and Balleine B.W. Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. European Journal of Neuroscience 22 2 (2005) 505-512
    • (2005) European Journal of Neuroscience , vol.22 , Issue.2 , pp. 505-512
    • Yin, H.H.1    Knowlton, B.J.2    Balleine, B.W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.