-
1
-
-
85151728371
-
Residual algorithms: reinforcement learning with function approximation
-
A. Prieditis and S. Russell (eds), Proceedings of the 12th International Conference on Machine Learning (IMLL 95). San Mateo, CA: Morgan Kaufman
-
Baird, L.C. (1995). Residual algorithms: reinforcement learning with function approximation. In: A. Prieditis and S. Russell (eds), Proceedings of the 12th International Conference on Machine Learning (IMLL 95). San Mateo, CA: Morgan Kaufman, pp. 30-37.
-
(1995)
, pp. 30-37
-
-
Baird, L.C.1
-
2
-
-
0000541213
-
Adaptive critic and the basal ganglia
-
MIT Press, Cambridge, MA, J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
-
Barto A.G. Adaptive critic and the basal ganglia. Models of Information Processing in the Basal Ganglia 1995, 215-232. MIT Press, Cambridge, MA. J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
-
(1995)
Models of Information Processing in the Basal Ganglia
, pp. 215-232
-
-
Barto, A.G.1
-
3
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Barto A.G., Sutton R.S., Anderson C.W. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans. Systems Man Cyber. 1983, 13:834-846.
-
(1983)
IEEE Trans. Systems Man Cyber.
, vol.13
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
4
-
-
0008840282
-
Sequential decision problems and neural networks
-
MIT Press, Cambride, MA, D.S. Touretzky (Ed.)
-
Barto A.G., Sutton R.S., Watkins C.J.C.H. Sequential decision problems and neural networks. Advances in Neural Information Processing Systems 2 1989, 686-693. MIT Press, Cambride, MA. D.S. Touretzky (Ed.).
-
(1989)
Advances in Neural Information Processing Systems 2
, pp. 686-693
-
-
Barto, A.G.1
Sutton, R.S.2
Watkins, C.J.C.H.3
-
5
-
-
0002201501
-
Learning and sequential decision making
-
MIT Press, Cambridge, MA, M. Gabriel, J. Moore (Eds.)
-
Barto A.G., Sutton R.S., Watkins C.J.C.H. Learning and sequential decision making. Learning and Computational Neuroscience: Foundations of Adaptive Networks 1990, 539-602. MIT Press, Cambridge, MA. M. Gabriel, J. Moore (Eds.).
-
(1990)
Learning and Computational Neuroscience: Foundations of Adaptive Networks
, pp. 539-602
-
-
Barto, A.G.1
Sutton, R.S.2
Watkins, C.J.C.H.3
-
6
-
-
21544435722
-
Midbrain dopamine neurons encode a quantitative reward prediction error signal
-
Bayer H.M., Glimcher P.W. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 2005, 47:129-141.
-
(2005)
Neuron
, vol.47
, pp. 129-141
-
-
Bayer, H.M.1
Glimcher, P.W.2
-
7
-
-
34548778113
-
Statistics of midbrain dopamine neuron spike trains in the awake primate
-
Bayer H.M., Lau B., Glimcher P.W. Statistics of midbrain dopamine neuron spike trains in the awake primate. J. Neurophysiol. 2007, 98:1428-1439.
-
(2007)
J. Neurophysiol.
, vol.98
, pp. 1428-1439
-
-
Bayer, H.M.1
Lau, B.2
Glimcher, P.W.3
-
8
-
-
34548295327
-
Learning the value of information in an uncertain world
-
Behrens T.E.J., Woolrich M.W., Walton M.E., Rushworth M.F.S. Learning the value of information in an uncertain world. Nat. Neurosci. 2007, 10:1214-1221.
-
(2007)
Nat. Neurosci.
, vol.10
, pp. 1214-1221
-
-
Behrens, T.E.J.1
Woolrich, M.W.2
Walton, M.E.3
Rushworth, M.F.S.4
-
9
-
-
0003787146
-
-
Princeton University Press, Princeton, NJ
-
Bellman R.E. Dynamic Programming 1957, Princeton University Press, Princeton, NJ.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
10
-
-
0344442860
-
"Passive stabilization" of striatal extracellular dopamine across the lesion spectrum encompassing the presymptomatic phase of Parkinson's disease: a voltammetric study in the 6-OHDA lesioned rat
-
Bergstrom B.P., Garris P.A. "Passive stabilization" of striatal extracellular dopamine across the lesion spectrum encompassing the presymptomatic phase of Parkinson's disease: a voltammetric study in the 6-OHDA lesioned rat. J. Neurochem. 2003, 87:1224-1236.
-
(2003)
J. Neurochem.
, vol.87
, pp. 1224-1236
-
-
Bergstrom, B.P.1
Garris, P.A.2
-
11
-
-
0035871327
-
Predictability modulates human brain response to reward
-
Berns G.S., McClure S.M., Pagnoni G., Montague P.R. Predictability modulates human brain response to reward. J. Neurosci. 2001, 21:2793-2798.
-
(2001)
J. Neurosci.
, vol.21
, pp. 2793-2798
-
-
Berns, G.S.1
McClure, S.M.2
Pagnoni, G.3
Montague, P.R.4
-
12
-
-
33847634405
-
The debate over dopamine's role in reward: the case for incentive salience
-
Berridge K.C. The debate over dopamine's role in reward: the case for incentive salience. Psychopharmacol. (Berl.) 2007, 191:391-431.
-
(2007)
Psychopharmacol. (Berl.)
, vol.191
, pp. 391-431
-
-
Berridge, K.C.1
-
13
-
-
0032423613
-
What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience?
-
Berridge K.C., Robinson T.E. What is the role of dopamine in reward: hedonic impact, reward learning, or incentive salience?. Brain Res. Rev. 1998, 28:309-369.
-
(1998)
Brain Res. Rev.
, vol.28
, pp. 309-369
-
-
Berridge, K.C.1
Robinson, T.E.2
-
15
-
-
0001491619
-
A mathematical model for simple learning
-
Bush R.R., Mosteller F. A mathematical model for simple learning. Psychol. Rev. 1951, 58:313-323.
-
(1951)
Psychol. Rev.
, vol.58
, pp. 313-323
-
-
Bush, R.R.1
Mosteller, F.2
-
16
-
-
0022644104
-
Stimulation of the lateral habenula inhibits dopamine-containing neurons in the substantia nigra and ventral tegmental area of the rat
-
Christoph G.R., Leonzio R.J., Wilcox K.S. Stimulation of the lateral habenula inhibits dopamine-containing neurons in the substantia nigra and ventral tegmental area of the rat. J. Neurosci. 1986, 6:613-619.
-
(1986)
J. Neurosci.
, vol.6
, pp. 613-619
-
-
Christoph, G.R.1
Leonzio, R.J.2
Wilcox, K.S.3
-
17
-
-
33646149370
-
Nociceptive responses of midbrain dopaminergic neurones are modulated by the superior colliculus in the rat
-
Coizet V., Dommett E.J., Redgrave P., Overton P.G. Nociceptive responses of midbrain dopaminergic neurones are modulated by the superior colliculus in the rat. Neuroscience 2006, 139:1479-1493.
-
(2006)
Neuroscience
, vol.139
, pp. 1479-1493
-
-
Coizet, V.1
Dommett, E.J.2
Redgrave, P.3
Overton, P.G.4
-
18
-
-
0033722074
-
Behavioral results suggest an average reward TD model of dopamine function
-
Daw N.D., Touretzky D.S. Behavioral results suggest an average reward TD model of dopamine function. Neurocomputing 2000, 32:679-684.
-
(2000)
Neurocomputing
, vol.32
, pp. 679-684
-
-
Daw, N.D.1
Touretzky, D.S.2
-
19
-
-
0036835734
-
Long-term reward prediction in TD models of the dopamine system
-
Daw N.D., Touretzky D.S. Long-term reward prediction in TD models of the dopamine system. Neural Computation 2002, 14:2567-2583.
-
(2002)
Neural Computation
, vol.14
, pp. 2567-2583
-
-
Daw, N.D.1
Touretzky, D.S.2
-
20
-
-
0036592008
-
Opponent interactions between serotonin and dopamine
-
Daw N.D., Kakade S., Dayan P. Opponent interactions between serotonin and dopamine. Neural Networks 2002, 15:603-616.
-
(2002)
Neural Networks
, vol.15
, pp. 603-616
-
-
Daw, N.D.1
Kakade, S.2
Dayan, P.3
-
21
-
-
33745223257
-
Cortical substrates for exploratory decisions in humans
-
Daw N.D., O'Doherty J.P., Dayan P., et al. Cortical substrates for exploratory decisions in humans. Nature 2006, 441:876-879.
-
(2006)
Nature
, vol.441
, pp. 876-879
-
-
Daw, N.D.1
O'Doherty, J.P.2
Dayan, P.3
-
22
-
-
34547536392
-
Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens
-
Day J.J., Roitman M.F., Wightman R.M., Carelli R.M. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nat. Neurosci. 2007, 10:1020-1028.
-
(2007)
Nat. Neurosci.
, vol.10
, pp. 1020-1028
-
-
Day, J.J.1
Roitman, M.F.2
Wightman, R.M.3
Carelli, R.M.4
-
24
-
-
0041997010
-
Explaining away in weight space
-
MIT Press, Cambridge, MA, T. Leen, T. Dietterich, V. Tresp (Eds.)
-
Dayan P., Kakade S. Explaining away in weight space. Advances in Neural Information Processing Systems 2000, Vol. 12:24-30. MIT Press, Cambridge, MA. T. Leen, T. Dietterich, V. Tresp (Eds.).
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 24-30
-
-
Dayan, P.1
Kakade, S.2
-
26
-
-
0042697569
-
Dorsal striatum responses to reward and punishment: effects of valence and magnitude manipulations
-
Delgado M.R., Locke H.M., Stenger V.A., Fiez J.A. Dorsal striatum responses to reward and punishment: effects of valence and magnitude manipulations. Cogn. Affect. Behav. Neurosci. 2003, 3:27-38.
-
(2003)
Cogn. Affect. Behav. Neurosci.
, vol.3
, pp. 27-38
-
-
Delgado, M.R.1
Locke, H.M.2
Stenger, V.A.3
Fiez, J.A.4
-
27
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
Doya K. Reinforcement learning in continuous time and space. Neural Computation 2000, 12:219-245.
-
(2000)
Neural Computation
, vol.12
, pp. 219-245
-
-
Doya, K.1
-
28
-
-
0036592023
-
Metalearning and neuromodulation
-
Doya K. Metalearning and neuromodulation. Neural Networks 2002, 15:495-506.
-
(2002)
Neural Networks
, vol.15
, pp. 495-506
-
-
Doya, K.1
-
29
-
-
0037459319
-
Discrete coding of reward probability and uncertainty by dopamine neurons
-
Fiorillo C.D., Tobler P.N., Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 2003, 299:1898-1902.
-
(2003)
Science
, vol.299
, pp. 1898-1902
-
-
Fiorillo, C.D.1
Tobler, P.N.2
Schultz, W.3
-
30
-
-
0041859307
-
Afferent modulation of dopamine neuron firing differentially regulates tonic and phasic dopamine transmission
-
Floresco S.B., West A.R., Ash B., et al. Afferent modulation of dopamine neuron firing differentially regulates tonic and phasic dopamine transmission. Nat. Neurosci. 2003, 6:968-973.
-
(2003)
Nat. Neurosci.
, vol.6
, pp. 968-973
-
-
Floresco, S.B.1
West, A.R.2
Ash, B.3
-
31
-
-
0018427295
-
Pimozide-induced extinction in rats: stimulus control of responding rules out motor deficit
-
Franklin K.B.J., McCoy S.N. Pimozide-induced extinction in rats: stimulus control of responding rules out motor deficit. Pharmacol. Biochem. Behav. 1979, 11:71-75.
-
(1979)
Pharmacol. Biochem. Behav.
, vol.11
, pp. 71-75
-
-
Franklin, K.B.J.1
McCoy, S.N.2
-
32
-
-
23744454605
-
Afferents of the ventral tegmental area in the rat-anatomical substratum for integrative functions
-
Geisler S., Zahm D.S. Afferents of the ventral tegmental area in the rat-anatomical substratum for integrative functions. J. Comp. Neurol. 2005, 490:270-294.
-
(2005)
J. Comp. Neurol.
, vol.490
, pp. 270-294
-
-
Geisler, S.1
Zahm, D.S.2
-
33
-
-
22544464049
-
Dopaminergic modulation of limbic and cortical drive of nucleus accumbens in goal-directed behavior
-
Goto Y., Grace A.A. Dopaminergic modulation of limbic and cortical drive of nucleus accumbens in goal-directed behavior. Nat. Neurosci. 2005, 8:805-812.
-
(2005)
Nat. Neurosci.
, vol.8
, pp. 805-812
-
-
Goto, Y.1
Grace, A.A.2
-
34
-
-
0026059697
-
Phasic versus tonic dopamine release and the modulation of dopamine system responsivity: a hypothesis for the etiology of schizophrenia
-
Grace A.A. Phasic versus tonic dopamine release and the modulation of dopamine system responsivity: a hypothesis for the etiology of schizophrenia. Neuroscience 1991, 41:1-24.
-
(1991)
Neuroscience
, vol.41
, pp. 1-24
-
-
Grace, A.A.1
-
35
-
-
33748188120
-
The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans
-
Hampton A.N., Bossaerts P., O'Doherty J.P. The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. J. Neurosci. 2006, 26:8360-8367.
-
(2006)
J. Neurosci.
, vol.26
, pp. 8360-8367
-
-
Hampton, A.N.1
Bossaerts, P.2
O'Doherty, J.P.3
-
36
-
-
33644688754
-
Dopamine neurons report an error in the temporal prediction of reward during learning
-
Hollerman J.R., Schultz W. Dopamine neurons report an error in the temporal prediction of reward during learning. Nat. Neurosci. 1998, 1:304-309.
-
(1998)
Nat. Neurosci.
, vol.1
, pp. 304-309
-
-
Hollerman, J.R.1
Schultz, W.2
-
37
-
-
0034061668
-
Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events
-
Horvitz J.C. Mesolimbocortical and nigrostriatal dopamine responses to salient non-reward events. Neuroscience 2000, 96:651-656.
-
(2000)
Neuroscience
, vol.96
, pp. 651-656
-
-
Horvitz, J.C.1
-
38
-
-
0002861883
-
A model of how the basal ganglia generate and use neural signals that predict reinforcement
-
MIT Press, Cambridge, MA, J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
-
Houk J.C., Adams J.L., Barto A.G. A model of how the basal ganglia generate and use neural signals that predict reinforcement. Models of Information Processing in the Basal Ganglia 1995, 249-270. MIT Press, Cambridge, MA. J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
-
(1995)
Models of Information Processing in the Basal Ganglia
, pp. 249-270
-
-
Houk, J.C.1
Adams, J.L.2
Barto, A.G.3
-
40
-
-
0033461157
-
The role of nucleus accumbens dopamine in motivated behavior: a unifying interpretation with special reference to reward-seeking
-
Ikemoto S., Panksepp J. The role of nucleus accumbens dopamine in motivated behavior: a unifying interpretation with special reference to reward-seeking. Brain Res. Rev. 1999, 31:6-41.
-
(1999)
Brain Res. Rev.
, vol.31
, pp. 6-41
-
-
Ikemoto, S.1
Panksepp, J.2
-
41
-
-
34047222753
-
Separate brain regions code for salience vs valence during reward prediction in humans
-
Jensen J., Smith A.J., Willeit M., et al. Separate brain regions code for salience vs valence during reward prediction in humans. Hum. Brain Mapp. 2007, 28:294-302.
-
(2007)
Hum. Brain Mapp.
, vol.28
, pp. 294-302
-
-
Jensen, J.1
Smith, A.J.2
Willeit, M.3
-
42
-
-
0002875876
-
Striatal contention scheduling and the split circuit scheme of basal ganglia-thalamocortical circuitry: from anatomy to behaviour
-
Harwood Academic Publishers, New York, NY, R. Miller, J. Wickens (Eds.)
-
Joel D., Weiner I. Striatal contention scheduling and the split circuit scheme of basal ganglia-thalamocortical circuitry: from anatomy to behaviour. Conceptual Advances in Brain Research: Brain Dynamics and the Striatal Complex 1999, 209-236. Harwood Academic Publishers, New York, NY. R. Miller, J. Wickens (Eds.).
-
(1999)
Conceptual Advances in Brain Research: Brain Dynamics and the Striatal Complex
, pp. 209-236
-
-
Joel, D.1
Weiner, I.2
-
43
-
-
0036592026
-
Actor-Critic models of the basal ganglia: new anatomical and computational perspectives
-
Joel D., Niv Y., Ruppin E. Actor-Critic models of the basal ganglia: new anatomical and computational perspectives. Neural Networks 2002, 15:535-547.
-
(2002)
Neural Networks
, vol.15
, pp. 535-547
-
-
Joel, D.1
Niv, Y.2
Ruppin, E.3
-
44
-
-
0031309579
-
Normative and descriptive models of decision making: time discounting and risk sensitivity
-
Wiley, Chichester, G.R. Bock, G. Cardew (Eds.)
-
Kacelnik A. Normative and descriptive models of decision making: time discounting and risk sensitivity. Characterizing Human Psychological Adaptations: Ciba Foundation Symposium 208 1997, 51-70. Wiley, Chichester. G.R. Bock, G. Cardew (Eds.).
-
(1997)
Characterizing Human Psychological Adaptations: Ciba Foundation Symposium 208
, pp. 51-70
-
-
Kacelnik, A.1
-
45
-
-
0036592029
-
Dopamine: generalization and bonuses
-
Kakade S., Dayan P. Dopamine: generalization and bonuses. Neural Networks 2002, 15:549-559.
-
(2002)
Neural Networks
, vol.15
, pp. 549-559
-
-
Kakade, S.1
Dayan, P.2
-
46
-
-
0002981963
-
Predictability, surprise, attention, and conditioning
-
Appleton Century Crofts, New York, NY, B.A. Campbell, R.M. Church (Eds.)
-
Kamin L.J. Predictability, surprise, attention, and conditioning. Punishment and Aversive Behavior 1969, 242-259. Appleton Century Crofts, New York, NY. B.A. Campbell, R.M. Church (Eds.).
-
(1969)
Punishment and Aversive Behavior
, pp. 242-259
-
-
Kamin, L.J.1
-
47
-
-
67349199780
-
Effects of serial compound stimuli on stimulus selection in classical conditioning of the rabbit nictitating membrane response
-
PhD thesis, university of Iowa.
-
Kehoe, E.J. (1977). Effects of serial compound stimuli on stimulus selection in classical conditioning of the rabbit nictitating membrane response. PhD thesis, university of Iowa.
-
(1977)
-
-
Kehoe, E.J.1
-
48
-
-
33847651376
-
Linking nucleus accumbens dopamine and blood oxygenation
-
Knutson B., Gibbs S.E.B. Linking nucleus accumbens dopamine and blood oxygenation. Psychopharmacol (Berl.), 2007, 191:813-822.
-
(2007)
Psychopharmacol (Berl.)
, vol.191
, pp. 813-822
-
-
Knutson, B.1
Gibbs, S.E.B.2
-
49
-
-
0035882897
-
Anticipation of increasing monetary reward selectively recruits nucleus accumbens
-
Knutson B., Adams C.M., Fong G.W., Hommer D. Anticipation of increasing monetary reward selectively recruits nucleus accumbens. J. Neurosci. 2001, 21:RC159.
-
(2001)
J. Neurosci.
, vol.21
-
-
Knutson, B.1
Adams, C.M.2
Fong, G.W.3
Hommer, D.4
-
50
-
-
0035807944
-
Dissociation of reward anticipation and outcome with event-related fmri
-
Knutson B., Fong G.W., Adams C.M., et al. Dissociation of reward anticipation and outcome with event-related fmri. NeuroReport 2001, 12:3683-3687.
-
(2001)
NeuroReport
, vol.12
, pp. 3683-3687
-
-
Knutson, B.1
Fong, G.W.2
Adams, C.M.3
-
51
-
-
0037328795
-
A region of mesial prefrontal cortex tracks monetarily rewarding outcomes: characterization with rapid event-related fmri
-
Knutson B., Fong G.W., Bennett S.M., et al. A region of mesial prefrontal cortex tracks monetarily rewarding outcomes: characterization with rapid event-related fmri. NeuroImage 2003, 18:263-272.
-
(2003)
NeuroImage
, vol.18
, pp. 263-272
-
-
Knutson, B.1
Fong, G.W.2
Bennett, S.M.3
-
52
-
-
34447637794
-
Reward prediction error computation in the pedunculo-pontine tegmental nucleus neurons
-
Kobayashi Y., Okada K.-I. Reward prediction error computation in the pedunculo-pontine tegmental nucleus neurons. Ann. N.Y. Acad. Sci. 2007, 1104:310-323.
-
(2007)
Ann. N.Y. Acad. Sci.
, vol.1104
, pp. 310-323
-
-
Kobayashi, Y.1
Okada, K.-I.2
-
55
-
-
0017915793
-
The Rescorla-Wagner model: losses in associative strength in compound conditioned stimuli
-
Kremer E.F. The Rescorla-Wagner model: losses in associative strength in compound conditioned stimuli. J. Exp. Psychol. Animal Behav. Proc. 1978, 4:22-36.
-
(1978)
J. Exp. Psychol. Animal Behav. Proc.
, vol.4
, pp. 22-36
-
-
Kremer, E.F.1
-
56
-
-
0032606945
-
A probabilistic framework for the adaptation and comparison of image codes
-
Lewicki M.S., Olshausen B.A. A probabilistic framework for the adaptation and comparison of image codes. J. Opt. Soc. Am. A 1999, 16:1587-1601.
-
(1999)
J. Opt. Soc. Am. A
, vol.16
, pp. 1587-1601
-
-
Lewicki, M.S.1
Olshausen, B.A.2
-
58
-
-
0026505520
-
Responses of monkey dopaminergic neurons during learning of behavioral reactions
-
Ljungberg T., Apicella P., Schultz W. Responses of monkey dopaminergic neurons during learning of behavioral reactions. J. Neurophysiol. 1992, 67:145-163.
-
(1992)
J. Neurophysiol.
, vol.67
, pp. 145-163
-
-
Ljungberg, T.1
Apicella, P.2
Schultz, W.3
-
59
-
-
0038719640
-
The underpinnings of the BOLD functional magnetic resonance imaging signal
-
Logothetis N.K. The underpinnings of the BOLD functional magnetic resonance imaging signal. J. Neurosci. 2003, 23:3963-3971.
-
(2003)
J. Neurosci.
, vol.23
, pp. 3963-3971
-
-
Logothetis, N.K.1
-
60
-
-
34547144795
-
Neural signature of fictive learning signals in a sequential investment task
-
Lohrenz T., McCabe K., Camerer C.F., Montague P.R. Neural signature of fictive learning signals in a sequential investment task. Proc. Nat. Acad. Sci. USA 2007, 104:9493-9498.
-
(2007)
Proc. Nat. Acad. Sci. USA
, vol.104
, pp. 9493-9498
-
-
Lohrenz, T.1
McCabe, K.2
Camerer, C.F.3
Montague, P.R.4
-
62
-
-
34347343926
-
Lateral habenula as a source of negative reward signals in dopamine neurons
-
Matsumoto M., Hikosaka O. Lateral habenula as a source of negative reward signals in dopamine neurons. Nature 2007, 447:1111-1115.
-
(2007)
Nature
, vol.447
, pp. 1111-1115
-
-
Matsumoto, M.1
Hikosaka, O.2
-
64
-
-
0037650217
-
Temporal prediction errors in a passive learning task activate human striatum
-
McClure S.M., Berns G.S., Montague P.R. Temporal prediction errors in a passive learning task activate human striatum. Neuron 2003, 38:339-346.
-
(2003)
Neuron
, vol.38
, pp. 339-346
-
-
McClure, S.M.1
Berns, G.S.2
Montague, P.R.3
-
65
-
-
5144221542
-
Neural correlates of behavioral preference for culturally familiar drinks
-
McClure S.M., Li J., Tomlin D., et al. Neural correlates of behavioral preference for culturally familiar drinks. Neuron 2004, 44:379-387.
-
(2004)
Neuron
, vol.44
, pp. 379-387
-
-
McClure, S.M.1
Li, J.2
Tomlin, D.3
-
66
-
-
34548542836
-
Temporal difference modeling of the blood-oxygen level dependent response during aversive conditioning in humans: effects of dopaminergic modulation
-
Menon M., Jensen J., Vitcu I., et al. Temporal difference modeling of the blood-oxygen level dependent response during aversive conditioning in humans: effects of dopaminergic modulation. Biol. Psych. 2007, 62:765-772.
-
(2007)
Biol. Psych.
, vol.62
, pp. 765-772
-
-
Menon, M.1
Jensen, J.2
Vitcu, I.3
-
67
-
-
0002179838
-
Corticostriatal cell assemblies in selective attention and in representation of predictable and controllable events
-
Miller R., Wickens J.R. Corticostriatal cell assemblies in selective attention and in representation of predictable and controllable events. Concepts Neurosci. 1991, 2:65-95.
-
(1991)
Concepts Neurosci.
, vol.2
, pp. 65-95
-
-
Miller, R.1
Wickens, J.R.2
-
68
-
-
0030026069
-
Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli
-
Mirenowicz J., Schultz W. Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli. Nature 1996, 379:449-451.
-
(1996)
Nature
, vol.379
, pp. 449-451
-
-
Mirenowicz, J.1
Schultz, W.2
-
69
-
-
0000708595
-
Using aperiodic reinforcement for directed self-organization
-
Morgan Kaufmann, San Mateo, CA, C.L. Giles, S.J. Hanson, J.D. Cowan (Eds.)
-
Montague P.R., Dayan P., Nowlan S.J., et al. Using aperiodic reinforcement for directed self-organization. Advances in Neural Information Processing Systems 1993, Vol. 5:969-976. Morgan Kaufmann, San Mateo, CA. C.L. Giles, S.J. Hanson, J.D. Cowan (Eds.).
-
(1993)
Advances in Neural Information Processing Systems
, vol.5
, pp. 969-976
-
-
Montague, P.R.1
Dayan, P.2
Nowlan, S.J.3
-
70
-
-
0003368918
-
Foraging in an uncertain environments using predictive hebbian learning
-
Morgan Kaufmann, San Mateo, CA, T. Tesauro, J.D. Cowan (Eds.)
-
Montague P.R., Dayan P., Sejnowski T.J. Foraging in an uncertain environments using predictive hebbian learning. Advances in Neural Information Processing Systems 1994, Vol. 6:598-605. Morgan Kaufmann, San Mateo, CA. T. Tesauro, J.D. Cowan (Eds.).
-
(1994)
Advances in Neural Information Processing Systems
, vol.6
, pp. 598-605
-
-
Montague, P.R.1
Dayan, P.2
Sejnowski, T.J.3
-
71
-
-
0028972278
-
Bee foraging in uncertain environments using predictive Hebbian learning
-
Montague P.R., Dayan P., Person C., Sejnowski T.J. Bee foraging in uncertain environments using predictive Hebbian learning. Nature 1995, 377:725-728.
-
(1995)
Nature
, vol.377
, pp. 725-728
-
-
Montague, P.R.1
Dayan, P.2
Person, C.3
Sejnowski, T.J.4
-
72
-
-
0029981543
-
A framework for mesencephalic dopamine systems based on predictive hebbian learning
-
Montague P.R., Dayan P., Sejnowski T.J. A framework for mesencephalic dopamine systems based on predictive hebbian learning. J. Neurosci. 1996, 16:1936-1947.
-
(1996)
J. Neurosci.
, vol.16
, pp. 1936-1947
-
-
Montague, P.R.1
Dayan, P.2
Sejnowski, T.J.3
-
73
-
-
1242341959
-
Dynamic gain control of dopamine delivery in freely moving animals
-
Montague P.R., McClure S.M., Baldwin P.R., et al. Dynamic gain control of dopamine delivery in freely moving animals. J. Neurosci. 2004, 24:1754-1759.
-
(2004)
J. Neurosci.
, vol.24
, pp. 1754-1759
-
-
Montague, P.R.1
McClure, S.M.2
Baldwin, P.R.3
-
74
-
-
3242673464
-
Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons
-
Morris G., Arkadir D., Nevet A. Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons. Neuron 2004, 43:133-143.
-
(2004)
Neuron
, vol.43
, pp. 133-143
-
-
Morris, G.1
Arkadir, D.2
Nevet, A.3
-
75
-
-
33747585633
-
Midbrain dopamine neurons encode decisions for future action
-
Morris G., Nevet A., Arkadir D., et al. Midbrain dopamine neurons encode decisions for future action. Nat. Neurosci. 2006, 9:1057-1063.
-
(2006)
Nat. Neurosci.
, vol.9
, pp. 1057-1063
-
-
Morris, G.1
Nevet, A.2
Arkadir, D.3
-
76
-
-
0141596576
-
Policy invariance under reward transformations: Theory and application to reward shaping
-
Proceedings of the Sixteenth International Conference on Machine Learning. San Francisco, CA: Morgan Kaufmann
-
Ng, A.Y., Harada, D., and Russell, S. (1999). Policy invariance under reward transformations: Theory and application to reward shaping. In: Proceedings of the Sixteenth International Conference on Machine Learning. San Francisco, CA: Morgan Kaufmann, pp. 278-287.
-
(1999)
, pp. 278-287
-
-
Ng, A.Y.1
Harada, D.2
Russell, S.3
-
77
-
-
17544368654
-
Dopaminergic modulation of neuronal excitability in the striatum and nucleus accumbens
-
Nicola S.M., Surmeier J., Malenka R.C. Dopaminergic modulation of neuronal excitability in the striatum and nucleus accumbens. Annu. Rev. Neurosci. 2000, 23:185-215.
-
(2000)
Annu. Rev. Neurosci.
, vol.23
, pp. 185-215
-
-
Nicola, S.M.1
Surmeier, J.2
Malenka, R.C.3
-
78
-
-
34447641086
-
Cost, benefit, tonic, phasic: what do response rates tell us about dopamine and motivation?
-
Niv Y. Cost, benefit, tonic, phasic: what do response rates tell us about dopamine and motivation?. Ann. NY Acad. Sci. 2007, 1104:357-376.
-
(2007)
Ann. NY Acad. Sci.
, vol.1104
, pp. 357-376
-
-
Niv, Y.1
-
79
-
-
34447648284
-
The Effects of Motivation on Habitual Instrumental Behavior
-
Unpublished doctoral dissertation, The Hebrew University of Jerusalem.
-
Niv, Y. (2007b). The Effects of Motivation on Habitual Instrumental Behavior. Unpublished doctoral dissertation, The Hebrew University of Jerusalem.
-
(2007)
-
-
Niv, Y.1
-
80
-
-
33745774340
-
How fast to work: response vigor, motivation and tonic dopamine
-
MIT Press, Cambridge, MA, Y. Weiss, B. Schölkopf, J. Platt (Eds.)
-
Niv Y., Daw N.D., Dayan P. How fast to work: response vigor, motivation and tonic dopamine. Advances in Neural Information Processing Systems 2005, Vol. 18:1019-1026. MIT Press, Cambridge, MA. Y. Weiss, B. Schölkopf, J. Platt (Eds.).
-
(2005)
Advances in Neural Information Processing Systems
, vol.18
, pp. 1019-1026
-
-
Niv, Y.1
Daw, N.D.2
Dayan, P.3
-
81
-
-
26444446315
-
Dopamine, uncertainty and TD learning
-
Niv Y., Duff M.O., Dayan P. Dopamine, uncertainty and TD learning. Behav. Brain Func. 2005, 1:6.
-
(2005)
Behav. Brain Func.
, vol.1
, pp. 6
-
-
Niv, Y.1
Duff, M.O.2
Dayan, P.3
-
82
-
-
33746257297
-
A normative perspective on motivation
-
Niv Y., Joel D., Dayan P. A normative perspective on motivation. Trends Cogn. Science, 2006, 10:375-381.
-
(2006)
Trends Cogn. Science
, vol.10
, pp. 375-381
-
-
Niv, Y.1
Joel, D.2
Dayan, P.3
-
84
-
-
33847675011
-
Tonic dopamine: opportunity costs and the control of response vigor
-
Niv Y., Daw N.D., Joel D., Dayan P. Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacol. (Berl.), 2007, 191:507-520.
-
(2007)
Psychopharmacol. (Berl.)
, vol.191
, pp. 507-520
-
-
Niv, Y.1
Daw, N.D.2
Joel, D.3
Dayan, P.4
-
86
-
-
0037186052
-
Neural responses during anticipation of a primary taste reward
-
O'Doherty J.P., Deichmann R., Critchley H.D., Dolan R.J. Neural responses during anticipation of a primary taste reward. Neuron 2002, 33:815-826.
-
(2002)
Neuron
, vol.33
, pp. 815-826
-
-
O'Doherty, J.P.1
Deichmann, R.2
Critchley, H.D.3
Dolan, R.J.4
-
87
-
-
0037987978
-
Temporal difference learning model accounts for responses in human ventral striatum and orbitofrontal cortex during Pavlovian appetitive learning
-
O'Doherty J., Dayan P., Friston K., et al. Temporal difference learning model accounts for responses in human ventral striatum and orbitofrontal cortex during Pavlovian appetitive learning. Neuron 2003, 38:329-337.
-
(2003)
Neuron
, vol.38
, pp. 329-337
-
-
O'Doherty, J.1
Dayan, P.2
Friston, K.3
-
88
-
-
1942520195
-
Dissociable roles of ventral and dorsal striatum in instrumental conditioning
-
O'Doherty J.P., Dayan P., Schultz J., et al. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 2004, 304:452-454.
-
(2004)
Science
, vol.304
, pp. 452-454
-
-
O'Doherty, J.P.1
Dayan, P.2
Schultz, J.3
-
89
-
-
0036159133
-
Activity in human ventral striatum locked to errors of reward prediction
-
Pagnoni G., Zink C.F., Montague P.R., Berns G.S. Activity in human ventral striatum locked to errors of reward prediction. Nat. Neurosci. 2002, 5:97-98.
-
(2002)
Nat. Neurosci.
, vol.5
, pp. 97-98
-
-
Pagnoni, G.1
Zink, C.F.2
Montague, P.R.3
Berns, G.S.4
-
90
-
-
33748302924
-
Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans
-
Pessiglione M., Seymour B., Flandin G., et al. Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature 2006, 442:1042-1045.
-
(2006)
Nature
, vol.442
, pp. 1042-1045
-
-
Pessiglione, M.1
Seymour, B.2
Flandin, G.3
-
91
-
-
33746711623
-
Neural differentiation of expected reward and risk in human subcortical structures
-
Preuschoff K., Bossaerts P., Quartz S.R. Neural differentiation of expected reward and risk in human subcortical structures. Neuron 2006, 51:381-390.
-
(2006)
Neuron
, vol.51
, pp. 381-390
-
-
Preuschoff, K.1
Bossaerts, P.2
Quartz, S.R.3
-
92
-
-
33751184634
-
The short-latency dopamine signal: a role in discovering novel actions?
-
Redgrave P., Gurney K. The short-latency dopamine signal: a role in discovering novel actions?. Nat. Rev. Neurosci. 2006, 7:967-975.
-
(2006)
Nat. Rev. Neurosci.
, vol.7
, pp. 967-975
-
-
Redgrave, P.1
Gurney, K.2
-
93
-
-
0033119561
-
Is the short-latency dopamine response too short to signal reward error?
-
Redgrave P., Prescott T.J., Gurney K. Is the short-latency dopamine response too short to signal reward error?. Trends Neurosci. 1999, 22:146-151.
-
(1999)
Trends Neurosci.
, vol.22
, pp. 146-151
-
-
Redgrave, P.1
Prescott, T.J.2
Gurney, K.3
-
94
-
-
0000636183
-
Reduction in effectiveness of reinforcement after prior excitatory conditioning
-
Rescorla R.A. Reduction in effectiveness of reinforcement after prior excitatory conditioning. Learning Motiv. 1970, 1:372-381.
-
(1970)
Learning Motiv.
, vol.1
, pp. 372-381
-
-
Rescorla, R.A.1
-
96
-
-
0002109138
-
A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement
-
Appleton Century Crofts, New York, NY, A.H. Black, W.F. Prokasy (Eds.)
-
Rescorla R.A., Wagner A.R. A theory of Pavlovian conditioning: variations in the effectiveness of reinforcement and nonreinforcement. Classical Conditioning II: Current Research and Theory 1972, 64-99. Appleton Century Crofts, New York, NY. A.H. Black, W.F. Prokasy (Eds.).
-
(1972)
Classical Conditioning II: Current Research and Theory
, pp. 64-99
-
-
Rescorla, R.A.1
Wagner, A.R.2
-
97
-
-
0000707760
-
Attention in the pigeon
-
Reynolds G.S. Attention in the pigeon. J. Exp. Anal. Behav. 1961, 4:203-208.
-
(1961)
J. Exp. Anal. Behav.
, vol.4
, pp. 203-208
-
-
Reynolds, G.S.1
-
98
-
-
36448968271
-
Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
-
Roesch M.R., Calu D.J., Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neurosci. 2007, 10:1615-1624.
-
(2007)
Nature Neurosci.
, vol.10
, pp. 1615-1624
-
-
Roesch, M.R.1
Calu, D.J.2
Schoenbaum, G.3
-
99
-
-
0025247726
-
Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-intiated arm movements
-
Romo R., Schultz W. Dopamine neurons of the monkey midbrain: contingencies of responses to active touch during self-intiated arm movements. J. Neurophysiol. 1990, 63:592-606.
-
(1990)
J. Neurophysiol.
, vol.63
, pp. 592-606
-
-
Romo, R.1
Schultz, W.2
-
100
-
-
0037010742
-
Motivational views of reinforcement: implications for understanding the behavioral functions of nucleus accumbens dopamine
-
Salamone J.D., Correa M. Motivational views of reinforcement: implications for understanding the behavioral functions of nucleus accumbens dopamine. Behav. Brain Res. 2002, 137:3-25.
-
(2002)
Behav. Brain Res.
, vol.137
, pp. 3-25
-
-
Salamone, J.D.1
Correa, M.2
-
101
-
-
28144449057
-
Representation of action-specific reward values in the striatum
-
Samejima K., Ueda Y., Doya K., Kimura M. Representation of action-specific reward values in the striatum. Science 2005, 310:1337-1340.
-
(2005)
Science
, vol.310
, pp. 1337-1340
-
-
Samejima, K.1
Ueda, Y.2
Doya, K.3
Kimura, M.4
-
102
-
-
0001201756
-
Some studies in machine learning using the game of checkers
-
Samuel A.L. Some studies in machine learning using the game of checkers. IBM J. Res. Dev. 1959, 3:210-229.
-
(1959)
IBM J. Res. Dev.
, vol.3
, pp. 210-229
-
-
Samuel, A.L.1
-
103
-
-
36348966690
-
Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making
-
Schönberg T., Daw N.D., Joel D., O'Doherty J.P. Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making. J. Neurosci. 2007, 27:12860-12867.
-
(2007)
J. Neurosci.
, vol.27
, pp. 12860-12867
-
-
Schönberg, T.1
Daw, N.D.2
Joel, D.3
O'Doherty, J.P.4
-
104
-
-
0031867046
-
Predictive reward signal of dopamine neurons
-
Schultz W. Predictive reward signal of dopamine neurons. J. Neurophysiol. 1998, 80:1-27.
-
(1998)
J. Neurophysiol.
, vol.80
, pp. 1-27
-
-
Schultz, W.1
-
105
-
-
0037057755
-
Getting formal with dopamine and reward
-
Schultz W. Getting formal with dopamine and reward. Neuron 2002, 36:241-263.
-
(2002)
Neuron
, vol.36
, pp. 241-263
-
-
Schultz, W.1
-
106
-
-
0026442752
-
Neuronal activity in monkey ventral striatum related to the expectation of reward
-
Schultz W., Apicella P., Scarnati E., Ljungberg T. Neuronal activity in monkey ventral striatum related to the expectation of reward. J. Neurosci. 1992, 12:4595-4610.
-
(1992)
J. Neurosci.
, vol.12
, pp. 4595-4610
-
-
Schultz, W.1
Apicella, P.2
Scarnati, E.3
Ljungberg, T.4
-
107
-
-
0027468102
-
Responses of monkey dopamine neurons to reward and conditioned stimuli during succesive steps of learning a delayed response task
-
Schultz W., Apicella P., Ljungberg T. Responses of monkey dopamine neurons to reward and conditioned stimuli during succesive steps of learning a delayed response task. J. Neurosci. 1993, 13:900-913.
-
(1993)
J. Neurosci.
, vol.13
, pp. 900-913
-
-
Schultz, W.1
Apicella, P.2
Ljungberg, T.3
-
108
-
-
0030896968
-
A neural substrate of prediction and reward
-
Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science 1997, 275:1593-1599.
-
(1997)
Science
, vol.275
, pp. 1593-1599
-
-
Schultz, W.1
Dayan, P.2
Montague, P.R.3
-
110
-
-
2942617032
-
Temporal difference models describe higher order learning in humans
-
Seymour B., O'Doherty J.P., Dayan P., et al. Temporal difference models describe higher order learning in humans. Nature 2004, 429:664-667.
-
(2004)
Nature
, vol.429
, pp. 664-667
-
-
Seymour, B.1
O'Doherty, J.P.2
Dayan, P.3
-
111
-
-
67349161611
-
A Unified Theory of Expectation in Classical and Instrumental Conditioning
-
Unpublished Bsc thesis, Stanford University.
-
Sutton, R.S. (1978). A Unified Theory of Expectation in Classical and Instrumental Conditioning. Unpublished Bsc thesis, Stanford University.
-
(1978)
-
-
Sutton, R.S.1
-
112
-
-
33847202724
-
Learning to predict by the method of temporal difference
-
Sutton R.S. Learning to predict by the method of temporal difference. Machine Learning 1988, 3:9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
113
-
-
0003066891
-
Time-derivative models of Pavlovian reinforcement
-
MIT Press, Cambridge, MA, M. Gabriel, J. Moore (Eds.)
-
Sutton R.S., Barto A.G. Time-derivative models of Pavlovian reinforcement. Learning and Computational Neuroscience: Foundations of Adaptive Networks 1990, 497-537. MIT Press, Cambridge, MA. M. Gabriel, J. Moore (Eds.).
-
(1990)
Learning and Computational Neuroscience: Foundations of Adaptive Networks
, pp. 497-537
-
-
Sutton, R.S.1
Barto, A.G.2
-
115
-
-
4644290200
-
A possible role of midbrain dopamine neurons in short- and long-term adaptation of saccades to position-reward mapping
-
Takikawa Y., Kawagoe R., Hikosaka O. A possible role of midbrain dopamine neurons in short- and long-term adaptation of saccades to position-reward mapping. J. Neurophysiol. 2004, 92:2520-2529.
-
(2004)
J. Neurophysiol.
, vol.92
, pp. 2520-2529
-
-
Takikawa, Y.1
Kawagoe, R.2
Hikosaka, O.3
-
116
-
-
0345255891
-
Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm
-
Tobler P.N., Dickinson A., Schultz W. Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm. J. Neurosci. 2003, 23:10402-10410.
-
(2003)
J. Neurosci.
, vol.23
, pp. 10402-10410
-
-
Tobler, P.N.1
Dickinson, A.2
Schultz, W.3
-
117
-
-
14844349975
-
Adaptive coding of reward value by dopamine neurons
-
Tobler P.N., Fiorillo C.D., Schultz W. Adaptive coding of reward value by dopamine neurons. Science 2005, 307:1642-1645.
-
(2005)
Science
, vol.307
, pp. 1642-1645
-
-
Tobler, P.N.1
Fiorillo, C.D.2
Schultz, W.3
-
118
-
-
33846587385
-
The neural basis of loss aversion in decision-making under risk
-
Tom S.M., Fox C.R., Trepel C., Poldrack R.A. The neural basis of loss aversion in decision-making under risk. Science 2007, 315:515-518.
-
(2007)
Science
, vol.315
, pp. 515-518
-
-
Tom, S.M.1
Fox, C.R.2
Trepel, C.3
Poldrack, R.A.4
-
119
-
-
1642404961
-
Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli
-
Ungless M.A., Magill P.J., Bolam J.P. Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli. Science 2004, 303:2040-2042.
-
(2004)
Science
, vol.303
, pp. 2040-2042
-
-
Ungless, M.A.1
Magill, P.J.2
Bolam, J.P.3
-
120
-
-
0035811464
-
Dopamine responses comply with basic assumptions of formal learning theory
-
Waelti P., Dickinson A., Schultz W. Dopamine responses comply with basic assumptions of formal learning theory. Nature 2001, 412:43-48.
-
(2001)
Nature
, vol.412
, pp. 43-48
-
-
Waelti, P.1
Dickinson, A.2
Schultz, W.3
-
121
-
-
0004049895
-
Learning with Delayed Rewards
-
Unpublished doctoral dissertation, Cambridge University, Cambridge.
-
Watkins, C.J.C.H. (1989). Learning with Delayed Rewards. Unpublished doctoral dissertation, Cambridge University, Cambridge.
-
(1989)
-
-
Watkins, C.J.C.H.1
-
122
-
-
0013150608
-
Dopamine in schizophrenia: dysfunctional information processing in basal ganglia-thalamocortical split circuits
-
Springer Verlag, Berlin, G.D. Chiara (Ed.)
-
Weiner I., Joel D. Dopamine in schizophrenia: dysfunctional information processing in basal ganglia-thalamocortical split circuits. Handbook of Experimental Pharmacology Vol. 154/II, Dopamine in the CNS II 2002, 417-472. Springer Verlag, Berlin. G.D. Chiara (Ed.).
-
(2002)
Handbook of Experimental Pharmacology Vol. 154/II, Dopamine in the CNS II
, pp. 417-472
-
-
Weiner, I.1
Joel, D.2
-
123
-
-
0002557583
-
Advanced forecasting methods for global crisis warning and models of intelligence
-
Werbos P.J. Advanced forecasting methods for global crisis warning and models of intelligence. General Systems Yearbook 1977, 22:25-38.
-
(1977)
General Systems Yearbook
, vol.22
, pp. 25-38
-
-
Werbos, P.J.1
-
124
-
-
0001785024
-
Cellular models of reinforcement
-
MIT Press, Cambridge, MA, J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
-
Wickens J., Kötter R. Cellular models of reinforcement. Models of Information Processing in the Basal Ganglia 1995, 187-214. MIT Press, Cambridge, MA. J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
-
(1995)
Models of Information Processing in the Basal Ganglia
, pp. 187-214
-
-
Wickens, J.1
Kötter, R.2
-
125
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 1992, 8:229-256.
-
(1992)
Machine Learning
, vol.8
, pp. 229-256
-
-
Williams, R.J.1
-
126
-
-
0023493224
-
Effects of amphetamine and pimozide on reinforcement and motor parameters in variable-interval performance
-
Willner P., Towell A., Muscat R. Effects of amphetamine and pimozide on reinforcement and motor parameters in variable-interval performance. J. Psychopharmacol. 1987, 1:140-153.
-
(1987)
J. Psychopharmacol.
, vol.1
, pp. 140-153
-
-
Willner, P.1
Towell, A.2
Muscat, R.3
-
127
-
-
84973986000
-
Neuroleptics and operant behavior: the anhedonia hypothesis
-
Wise R.A. Neuroleptics and operant behavior: the anhedonia hypothesis. Behav. Brain Sci. 1982, 5:39-53.
-
(1982)
Behav. Brain Sci.
, vol.5
, pp. 39-53
-
-
Wise, R.A.1
-
128
-
-
2642519680
-
Dopamine, learning and motivation
-
Wise R.A. Dopamine, learning and motivation. Nat. Rev. Neurosci. 2004, 5:483-495.
-
(2004)
Nat. Rev. Neurosci.
, vol.5
, pp. 483-495
-
-
Wise, R.A.1
-
129
-
-
0018100734
-
Neuroleptic-induced "anhedonia" in rats: pimozide blocks reward quality of food
-
Wise R.A., Spindler J., de Wit H., Gerberg G.J. Neuroleptic-induced "anhedonia" in rats: pimozide blocks reward quality of food. Science 1978, 201:262-264.
-
(1978)
Science
, vol.201
, pp. 262-264
-
-
Wise, R.A.1
Spindler, J.2
de Wit, H.3
Gerberg, G.J.4
-
130
-
-
0017976943
-
Major attenuation of food reward with performance-sparing doses of pimozide in the rat
-
Wise R.A., Spindler J., Legault L. Major attenuation of food reward with performance-sparing doses of pimozide in the rat. Can. J. Psychol. 1978, 32:77-85.
-
(1978)
Can. J. Psychol.
, vol.32
, pp. 77-85
-
-
Wise, R.A.1
Spindler, J.2
Legault, L.3
-
131
-
-
0036592033
-
Acetylcholine in cortical inference
-
Yu A.J., Dayan P. Acetylcholine in cortical inference. Neural Networks 2002, 15:719-730.
-
(2002)
Neural Networks
, vol.15
, pp. 719-730
-
-
Yu, A.J.1
Dayan, P.2
-
132
-
-
20444388016
-
Uncertainty, neuromodulation, and attention
-
Yu A.J., Dayan P. Uncertainty, neuromodulation, and attention. Neuron 2005, 46:681-692.
-
(2005)
Neuron
, vol.46
, pp. 681-692
-
-
Yu, A.J.1
Dayan, P.2
-
133
-
-
2442506963
-
Dopamine transmission in the human striatum during monetary reward tasks
-
Zald D.H., Boileau I., El-Dearedy W., et al. Dopamine transmission in the human striatum during monetary reward tasks. J. Neurosci. 2004, 24:4105-4112.
-
(2004)
J. Neurosci.
, vol.24
, pp. 4105-4112
-
-
Zald, D.H.1
Boileau, I.2
El-Dearedy, W.3
|