-
1
-
-
0004245883
-
-
Cambridge University Press, Cambridge, UK
-
Ainslie G. Breakdown of will 2001, Cambridge University Press, Cambridge, UK.
-
(2001)
Breakdown of will
-
-
Ainslie, G.1
-
2
-
-
0025321039
-
Functional architecture of basal ganglia circuits: Neural substrates of parallel processing
-
Alexander G.E., Crutcher M.D. Functional architecture of basal ganglia circuits: Neural substrates of parallel processing. Trends in Neurosciences 1990, 13(7):266-271.
-
(1990)
Trends in Neurosciences
, vol.13
, Issue.7
, pp. 266-271
-
-
Alexander, G.E.1
Crutcher, M.D.2
-
3
-
-
28444472936
-
Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits
-
Balleine B.W. Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits. Physiology & Behavior 2005, 86(5):717-730.
-
(2005)
Physiology & Behavior
, vol.86
, Issue.5
, pp. 717-730
-
-
Balleine, B.W.1
-
4
-
-
72049125602
-
Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action
-
Balleine B.W., O'Doherty J.P. Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 2010, 35(1):48-69.
-
(2010)
Neuropsychopharmacology
, vol.35
, Issue.1
, pp. 48-69
-
-
Balleine, B.W.1
O'Doherty, J.P.2
-
5
-
-
0000541213
-
Adaptive critics and the basal ganglia
-
MIT Press, Cambridge MA, J. Houk, J. Davis, D. Beiser (Eds.)
-
Barto A. Adaptive critics and the basal ganglia. Models of information processing in the Basal Ganglia 1995, 215-232. MIT Press, Cambridge MA. J. Houk, J. Davis, D. Beiser (Eds.).
-
(1995)
Models of information processing in the Basal Ganglia
, pp. 215-232
-
-
Barto, A.1
-
6
-
-
0020970738
-
Neuron-like adaptive elements that can solve difficult learning control problems
-
Barto A., Sutton R., Anderson C. Neuron-like adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics 1983, 13(5):834-846.
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.13
, Issue.5
, pp. 834-846
-
-
Barto, A.1
Sutton, R.2
Anderson, C.3
-
7
-
-
33749651693
-
Intrinsically motivated learning of hierarchical collections of skills
-
Proceedings of international conference of developmental learning, San Diego, CA.
-
Barto, A., Singh, S., & Chentanez, N. (2004). Intrinsically motivated learning of hierarchical collections of skills. In Proceedings of international conference of developmental learning, San Diego, CA.
-
(2004)
-
-
Barto, A.1
Singh, S.2
Chentanez, N.3
-
8
-
-
2442701355
-
Motivation concepts in behavioral neuroscience
-
Berridge K.C. Motivation concepts in behavioral neuroscience. Physiology & Behavior 2004, 81:179-209.
-
(2004)
Physiology & Behavior
, vol.81
, pp. 179-209
-
-
Berridge, K.C.1
-
9
-
-
33847634405
-
The debate over dopamine's role in reward: The case for incentive salience
-
Berridge K.C. The debate over dopamine's role in reward: The case for incentive salience. Psychopharmacology (Berl) 2007, 191(3):391-431.
-
(2007)
Psychopharmacology (Berl)
, vol.191
, Issue.3
, pp. 391-431
-
-
Berridge, K.C.1
-
10
-
-
59349111600
-
Dissecting components of reward: "Liking", "wanting", and learning
-
Berridge K.C., Robinson T.E., Aldridge J.W. Dissecting components of reward: "Liking", "wanting", and learning. Current Opinion in Pharmacology 2009, 9(1):65-73.
-
(2009)
Current Opinion in Pharmacology
, vol.9
, Issue.1
, pp. 65-73
-
-
Berridge, K.C.1
Robinson, T.E.2
Aldridge, J.W.3
-
12
-
-
33750347385
-
The physics of optimal decision making: A formal analysis of models of performance in two-alternative forced-choice tasks
-
Bogacz R., Brown E., Moehlis J., Holmes P., Cohen J.D. The physics of optimal decision making: A formal analysis of models of performance in two-alternative forced-choice tasks. Psychological Review 2006, 113(4):700-765.
-
(2006)
Psychological Review
, vol.113
, Issue.4
, pp. 700-765
-
-
Bogacz, R.1
Brown, E.2
Moehlis, J.3
Holmes, P.4
Cohen, J.D.5
-
13
-
-
58149417523
-
Species-specific defense reactions and avoidance learning
-
Bolles R.C. Species-specific defense reactions and avoidance learning. Psychological Review 1970, 77:32-48.
-
(1970)
Psychological Review
, vol.77
, pp. 32-48
-
-
Bolles, R.C.1
-
14
-
-
78649651245
-
Opponency revisited: Competition and cooperation between dopamine and serotonin
-
Boureau Y.-L., Dayan P. Opponency revisited: Competition and cooperation between dopamine and serotonin. Neuropsychopharmacology 2011, 36:74-97.
-
(2011)
Neuropsychopharmacology
, vol.36
, pp. 74-97
-
-
Boureau, Y.-L.1
Dayan, P.2
-
15
-
-
77956971930
-
Pavlovian processes in consumer choice: The physical presence of a good increases willingness-to-pay
-
Bushong B., King L., Camerer C., Rangel A. Pavlovian processes in consumer choice: The physical presence of a good increases willingness-to-pay. American Economic Review 2010, 100:1-18.
-
(2010)
American Economic Review
, vol.100
, pp. 1-18
-
-
Bushong, B.1
King, L.2
Camerer, C.3
Rangel, A.4
-
16
-
-
0036114221
-
Emotion and motivation: The role of the amygdala, ventral striatum, and prefrontal cortex
-
Cardinal R.N., Parkinson J.A., Hall J., Everitt B.J. Emotion and motivation: The role of the amygdala, ventral striatum, and prefrontal cortex. Neuroscience and Biobehavioral Reviews 2002, 26(3):321-352.
-
(2002)
Neuroscience and Biobehavioral Reviews
, vol.26
, Issue.3
, pp. 321-352
-
-
Cardinal, R.N.1
Parkinson, J.A.2
Hall, J.3
Everitt, B.J.4
-
17
-
-
34247842923
-
Western scrub-jays anticipate future needs independently of their current motivational state
-
Correia S.P.C., Dickinson A., Clayton N.S. Western scrub-jays anticipate future needs independently of their current motivational state. Current Biology 2007, 17(10):856-861.
-
(2007)
Current Biology
, vol.17
, Issue.10
, pp. 856-861
-
-
Correia, S.P.C.1
Dickinson, A.2
Clayton, N.S.3
-
18
-
-
79952746011
-
Model-based influences on humans' choices and striatal prediction errors
-
Daw N.D., Gershman S.J., Seymour B., Dayan P., Dolan R.J. Model-based influences on humans' choices and striatal prediction errors. Neuron 2011, 69(6):1204-1215.
-
(2011)
Neuron
, vol.69
, Issue.6
, pp. 1204-1215
-
-
Daw, N.D.1
Gershman, S.J.2
Seymour, B.3
Dayan, P.4
Dolan, R.J.5
-
19
-
-
28044450875
-
Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
-
Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience 2005, 8(12):1704-1711.
-
(2005)
Nature Neuroscience
, vol.8
, Issue.12
, pp. 1704-1711
-
-
Daw, N.D.1
Niv, Y.2
Dayan, P.3
-
20
-
-
33749055062
-
The misbehavior of value and the discipline of the will
-
Dayan P., Niv Y., Seymour B., Daw N.D. The misbehavior of value and the discipline of the will. Neural Networks 2006, 19(8):1153-1160.
-
(2006)
Neural Networks
, vol.19
, Issue.8
, pp. 1153-1160
-
-
Dayan, P.1
Niv, Y.2
Seymour, B.3
Daw, N.D.4
-
21
-
-
77954761462
-
The role of value systems in decision-making
-
MIT Press, Ernst Strüngmann Forum, Cambridge, MA, C. Engel, W. Singer (Eds.)
-
Dayan P. The role of value systems in decision-making. Better than conscious: Decision making, the human mind, and implications for institutions 2008, 51-70. MIT Press, Ernst Strüngmann Forum, Cambridge, MA. C. Engel, W. Singer (Eds.).
-
(2008)
Better than conscious: Decision making, the human mind, and implications for institutions
, pp. 51-70
-
-
Dayan, P.1
-
22
-
-
60749114870
-
Decision theory, reinforcement learning, and the brain
-
Dayan P., Daw N.D. Decision theory, reinforcement learning, and the brain. Cognitive, Affective & Behavioral Neuroscience 2008, 8(4):429-453.
-
(2008)
Cognitive, Affective & Behavioral Neuroscience
, vol.8
, Issue.4
, pp. 429-453
-
-
Dayan, P.1
Daw, N.D.2
-
23
-
-
40149109071
-
Serotonin, inhibition, and negative mood
-
Dayan P., Huys Q.J.M. Serotonin, inhibition, and negative mood. PLoS Computational Biology 2008, 4(2):e4.
-
(2008)
PLoS Computational Biology
, vol.4
, Issue.2
-
-
Dayan, P.1
Huys, Q.J.M.2
-
24
-
-
84882564459
-
Values and actions in aversion
-
Academic Press, New York, NY, P. Glimcher, C. Camerer, R. Poldrack, E. Fehr (Eds.)
-
Dayan P., Seymour B. Values and actions in aversion. Neuroeconomics: Decision making and the brain 2008, 175-191. Academic Press, New York, NY. P. Glimcher, C. Camerer, R. Poldrack, E. Fehr (Eds.).
-
(2008)
Neuroeconomics: Decision making and the brain
, pp. 175-191
-
-
Dayan, P.1
Seymour, B.2
-
26
-
-
33746896990
-
Frames, biases, and rational decision-making in the human brain
-
De Martino B., Kumaran D., Seymour B., Dolan R.J. Frames, biases, and rational decision-making in the human brain. Science 2006, 313(5787):684-687.
-
(2006)
Science
, vol.313
, Issue.5787
, pp. 684-687
-
-
De Martino, B.1
Kumaran, D.2
Seymour, B.3
Dolan, R.J.4
-
27
-
-
0031619316
-
Bayesian Q-learning
-
Proceedings of the fifteenth National/tenth Conference on Artificial intelligence/Innovative Applications of Artificial Intelligence Table of Contents, Menlo Park, CA: American Association for Artificial Intelligence.
-
Dearden, R., Friedman, N., & Russell, S. (1998). Bayesian Q-learning. In Proceedings of the fifteenth National/tenth Conference on Artificial intelligence/Innovative Applications of Artificial Intelligence Table of Contents, pp. 761-768. Menlo Park, CA: American Association for Artificial Intelligence.
-
(1998)
, pp. 761-768
-
-
Dearden, R.1
Friedman, N.2
Russell, S.3
-
29
-
-
0043250430
-
The role of learning in the operation of motivational systems
-
Wiley, New York, NY
-
Dickinson A., Balleine B. The role of learning in the operation of motivational systems. Stevens' handbook of experimental psychology 2002, Vol. 3:497-534. Wiley, New York, NY.
-
(2002)
Stevens' handbook of experimental psychology
, vol.3
, pp. 497-534
-
-
Dickinson, A.1
Balleine, B.2
-
31
-
-
49049085303
-
Mesolimbic dopamine in desire and dread: Enabling motivation to be generated by localized glutamate disruptions in nucleus accumbens
-
Faure A., Reynolds S.M., Richard J.M., Berridge K.C. Mesolimbic dopamine in desire and dread: Enabling motivation to be generated by localized glutamate disruptions in nucleus accumbens. Journal of Neuroscience 2008, 28(28):7184-7192.
-
(2008)
Journal of Neuroscience
, vol.28
, Issue.28
, pp. 7184-7192
-
-
Faure, A.1
Reynolds, S.M.2
Richard, J.M.3
Berridge, K.C.4
-
32
-
-
33645458694
-
Reverse replay of behavioural sequences in hippocampal place cells during the awake state
-
Foster D.J., Wilson M.A. Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature 2006, 440(7084):680-683.
-
(2006)
Nature
, vol.440
, Issue.7084
, pp. 680-683
-
-
Foster, D.J.1
Wilson, M.A.2
-
33
-
-
10344250993
-
By carrot or by stick: Cognitive reinforcement learning in parkinsonism
-
Frank M.J., Seeberger L.C., O'Reilly R.C. By carrot or by stick: Cognitive reinforcement learning in parkinsonism. Science 2004, 306(5703):1940-1943.
-
(2004)
Science
, vol.306
, Issue.5703
, pp. 1940-1943
-
-
Frank, M.J.1
Seeberger, L.C.2
O'Reilly, R.C.3
-
34
-
-
33744550336
-
Anatomy of a decision: Striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal
-
Frank M.J., Claus E.D. Anatomy of a decision: Striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychological Review 2006, 113(2):300-326.
-
(2006)
Psychological Review
, vol.113
, Issue.2
, pp. 300-326
-
-
Frank, M.J.1
Claus, E.D.2
-
36
-
-
77953260848
-
States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
-
Gläscher J., Daw N., Dayan P., O'Doherty J.P. States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 2010, 66(4):585-595.
-
(2010)
Neuron
, vol.66
, Issue.4
, pp. 585-595
-
-
Gläscher, J.1
Daw, N.2
Dayan, P.3
O'Doherty, J.P.4
-
37
-
-
77649151242
-
Hippocampal replay is not a simple function of experience
-
Gupta A.S., van der Meer M.A.A., Touretzky D.S., Redish A.D. Hippocampal replay is not a simple function of experience. Neuron 2010, 65(5):695-705.
-
(2010)
Neuron
, vol.65
, Issue.5
, pp. 695-705
-
-
Gupta, A.S.1
van der Meer, M.A.A.2
Touretzky, D.S.3
Redish, A.D.4
-
38
-
-
34548566262
-
Towards an executive without a homunculus: Computational models of the prefrontal cortex/basal ganglia system
-
Hazy T.E., Frank M.J., O'Reilly R.C. Towards an executive without a homunculus: Computational models of the prefrontal cortex/basal ganglia system. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences 2007, 362(1485):1601-1613.
-
(2007)
Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences
, vol.362
, Issue.1485
, pp. 1601-1613
-
-
Hazy, T.E.1
Frank, M.J.2
O'Reilly, R.C.3
-
39
-
-
0022979089
-
An approach through the looking-glass
-
Hershberger W.A. An approach through the looking-glass. Animal Learning & Behavior 1986, 14:443-451.
-
(1986)
Animal Learning & Behavior
, vol.14
, pp. 443-451
-
-
Hershberger, W.A.1
-
40
-
-
4043119771
-
Decisions from experience and the effect of rare events in risky choice
-
Hertwig R., Barron G., Weber E.U., Erev I. Decisions from experience and the effect of rare events in risky choice. Psychological Science 2004, 15(8):534-539.
-
(2004)
Psychological Science
, vol.15
, Issue.8
, pp. 534-539
-
-
Hertwig, R.1
Barron, G.2
Weber, E.U.3
Erev, I.4
-
41
-
-
70449671239
-
The description-experience gap in risky choice
-
Hertwig R., Erev I. The description-experience gap in risky choice. Trends in Cognitive Sciences 2009, 13:517-523.
-
(2009)
Trends in Cognitive Sciences
, vol.13
, pp. 517-523
-
-
Hertwig, R.1
Erev, I.2
-
42
-
-
1842853951
-
Relations between Pavlovian-instrumental transfer and reinforcer devaluation
-
Holland P.C. Relations between Pavlovian-instrumental transfer and reinforcer devaluation. Journal of Experimental Psychology. Animal Behavior Processes 2004, 30(2):104-117.
-
(2004)
Journal of Experimental Psychology. Animal Behavior Processes
, vol.30
, Issue.2
, pp. 104-117
-
-
Holland, P.C.1
-
43
-
-
70350570499
-
A Bayesian formulation of behavioral control
-
Huys Q.J.M., Dayan P. A Bayesian formulation of behavioral control. Cognition 2009, 113:314-328.
-
(2009)
Cognition
, vol.113
, pp. 314-328
-
-
Huys, Q.J.M.1
Dayan, P.2
-
44
-
-
0242267471
-
A perspective on judgment and choice: Mapping bounded rationality
-
Kahneman D. A perspective on judgment and choice: Mapping bounded rationality. American Psychologist 2003, 58(9):697-720.
-
(2003)
American Psychologist
, vol.58
, Issue.9
, pp. 697-720
-
-
Kahneman, D.1
-
45
-
-
33846565849
-
Frames and brains: Elicitation and control of response tendencies
-
Kahneman D., Frederick S. Frames and brains: Elicitation and control of response tendencies. Trends in Cognitive Sciences 2007, 11:45-46.
-
(2007)
Trends in Cognitive Sciences
, vol.11
, pp. 45-46
-
-
Kahneman, D.1
Frederick, S.2
-
46
-
-
0037382264
-
Coordination of actions and habits in the medial prefrontal cortex of rats
-
Killcross S., Coutureau E. Coordination of actions and habits in the medial prefrontal cortex of rats. Cerebral Cortex 2003, 13(4):400-408.
-
(2003)
Cerebral Cortex
, vol.13
, Issue.4
, pp. 400-408
-
-
Killcross, S.1
Coutureau, E.2
-
49
-
-
0029981543
-
A framework for mesencephalic dopamine systems based on predictive hebbian learning
-
Montague P.R., Dayan P., Sejnowski T.J. A framework for mesencephalic dopamine systems based on predictive hebbian learning. Journal of Neuroscience 1996, 16(5):1936-1947.
-
(1996)
Journal of Neuroscience
, vol.16
, Issue.5
, pp. 1936-1947
-
-
Montague, P.R.1
Dayan, P.2
Sejnowski, T.J.3
-
50
-
-
33747585633
-
Midbrain dopamine neurons encode decisions for future action
-
Morris G., Nevet A., Arkadir D., Vaadia E., Bergman H. Midbrain dopamine neurons encode decisions for future action. Nature Neuroscience 2006, 9(8):1057-1063.
-
(2006)
Nature Neuroscience
, vol.9
, Issue.8
, pp. 1057-1063
-
-
Morris, G.1
Nevet, A.2
Arkadir, D.3
Vaadia, E.4
Bergman, H.5
-
52
-
-
67349283062
-
Reinforcement learning in the brain
-
Niv Y. Reinforcement learning in the brain. Journal of Mathematical Psychology 2009, 53(3):139-154.
-
(2009)
Journal of Mathematical Psychology
, vol.53
, Issue.3
, pp. 139-154
-
-
Niv, Y.1
-
53
-
-
33847675011
-
Tonic dopamine: Opportunity costs and the control of response vigor
-
Niv Y., Daw N.D., Joel D., Dayan P. Tonic dopamine: Opportunity costs and the control of response vigor. Psychopharmacology (Berl) 2007, 191(3):507-520.
-
(2007)
Psychopharmacology (Berl)
, vol.191
, Issue.3
, pp. 507-520
-
-
Niv, Y.1
Daw, N.D.2
Joel, D.3
Dayan, P.4
-
54
-
-
9644310472
-
Reward representations and reward-related learning in the human brain: Insights from neuroimaging
-
O'Doherty J.P. Reward representations and reward-related learning in the human brain: Insights from neuroimaging. Current Opinion in Neurobiology 2004, 14(6):769-776.
-
(2004)
Current Opinion in Neurobiology
, vol.14
, Issue.6
, pp. 769-776
-
-
O'Doherty, J.P.1
-
55
-
-
37549066620
-
Lights, camembert, action! The role of human orbitofrontal cortex in encoding stimuli, rewards, and choices
-
O'Doherty J.P. Lights, camembert, action! The role of human orbitofrontal cortex in encoding stimuli, rewards, and choices. Annals of the New York Academy of Sciences, USA 2007, 1121:254-272.
-
(2007)
Annals of the New York Academy of Sciences, USA
, vol.1121
, pp. 254-272
-
-
O'Doherty, J.P.1
-
58
-
-
45349095604
-
Opioid reward "liking" and "wanting" in the nucleus accumbens
-
Peciña S. Opioid reward "liking" and "wanting" in the nucleus accumbens. Physiology & Behavior 2008, 94(5):675-680.
-
(2008)
Physiology & Behavior
, vol.94
, Issue.5
, pp. 675-680
-
-
Peciña, S.1
-
59
-
-
30744457109
-
Hedonic hot spot in nucleus accumbens shell: Where do mu-opioids cause increased hedonic impact of sweetness?
-
Peciña S., Berridge K.C. Hedonic hot spot in nucleus accumbens shell: Where do mu-opioids cause increased hedonic impact of sweetness? Journal of Neuroscience 2005, 25(50):11777-11786.
-
(2005)
Journal of Neuroscience
, vol.25
, Issue.50
, pp. 11777-11786
-
-
Peciña, S.1
Berridge, K.C.2
-
61
-
-
0002109138
-
A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement
-
Appleton-Century-Crofts, New York, NY, A.H. Black, W.F. Prokasy (Eds.)
-
Rescorla R.A., Wagner A.R. A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and non-reinforcement. Classical conditioning II: Current theory and research 1972, 64-99. Appleton-Century-Crofts, New York, NY. A.H. Black, W.F. Prokasy (Eds.).
-
(1972)
Classical conditioning II: Current theory and research
, pp. 64-99
-
-
Rescorla, R.A.1
Wagner, A.R.2
-
62
-
-
0035341482
-
Fear and feeding in the nucleus accumbens shell: Rostrocaudal segregation of GABA-elicited defensive behavior versus eating behavior
-
Reynolds S.M., Berridge K.C. Fear and feeding in the nucleus accumbens shell: Rostrocaudal segregation of GABA-elicited defensive behavior versus eating behavior. Journal of Neuroscience 2001, 21(9):3261-3270.
-
(2001)
Journal of Neuroscience
, vol.21
, Issue.9
, pp. 3261-3270
-
-
Reynolds, S.M.1
Berridge, K.C.2
-
63
-
-
0037104732
-
Positive and negative motivation in nucleus accumbens shell: Bivalent rostrocaudal gradients for GABA-elicited eating, taste "liking"/"disliking" reactions, place preference/avoidance, and fear
-
Reynolds S.M., Berridge K.C. Positive and negative motivation in nucleus accumbens shell: Bivalent rostrocaudal gradients for GABA-elicited eating, taste "liking"/"disliking" reactions, place preference/avoidance, and fear. Journal of Neuroscience 2002, 22(16):7308-7320.
-
(2002)
Journal of Neuroscience
, vol.22
, Issue.16
, pp. 7308-7320
-
-
Reynolds, S.M.1
Berridge, K.C.2
-
64
-
-
36448968271
-
Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
-
Roesch M.R., Calu D.J., Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience 2007, 10(12):1615-1624.
-
(2007)
Nature Neuroscience
, vol.10
, Issue.12
, pp. 1615-1624
-
-
Roesch, M.R.1
Calu, D.J.2
Schoenbaum, G.3
-
65
-
-
0003636089
-
-
Cambridge University, Cambridge, UK, Technical Report CUED/F-INFENG-TR 166
-
Rummery G., Niranjan M. On-line Q-learning using connectionist systems 1994, Cambridge University, Cambridge, UK, Technical Report CUED/F-INFENG-TR 166.
-
(1994)
On-line Q-learning using connectionist systems
-
-
Rummery, G.1
Niranjan, M.2
-
66
-
-
28144449057
-
Representation of action-specific reward values in the striatum
-
Samejima K., Ueda Y., Doya K., Kimura M. Representation of action-specific reward values in the striatum. Science 2005, 310(5752):1337-1340.
-
(2005)
Science
, vol.310
, Issue.5752
, pp. 1337-1340
-
-
Samejima, K.1
Ueda, Y.2
Doya, K.3
Kimura, M.4
-
67
-
-
0001201756
-
Some studies in machine learning using the game of checkers
-
Samuel A. Some studies in machine learning using the game of checkers. IBM Journal of Research and Development 1959, 3:210-229.
-
(1959)
IBM Journal of Research and Development
, vol.3
, pp. 210-229
-
-
Samuel, A.1
-
68
-
-
0037057755
-
Getting formal with dopamine and reward
-
Schultz W. Getting formal with dopamine and reward. Neuron 2002, 36(2):241-263.
-
(2002)
Neuron
, vol.36
, Issue.2
, pp. 241-263
-
-
Schultz, W.1
-
69
-
-
0002193484
-
Relation between classical conditioning and instrumental learning
-
Appleton-Century-Crofts, New York, NY, W. Prokasy (Ed.)
-
Sheffield F. Relation between classical conditioning and instrumental learning. Classical conditioning 1965, 302-322. Appleton-Century-Crofts, New York, NY. W. Prokasy (Ed.).
-
(1965)
Classical conditioning
, pp. 302-322
-
-
Sheffield, F.1
-
70
-
-
84899031920
-
Intrinsically motivated reinforcement learning
-
MIT Press, Cambridge, MA
-
Singh S., Barto A., Chentanez N. Intrinsically motivated reinforcement learning. Advances in Neural Information Processing Systems 2005, 1281-1288. MIT Press, Cambridge, MA.
-
(2005)
Advances in Neural Information Processing Systems
, pp. 1281-1288
-
-
Singh, S.1
Barto, A.2
Chentanez, N.3
-
71
-
-
77955909363
-
-
Where do rewards come from? In Proceedings of the thirty-first Annual Conference of the Cognitive Science Society Amsterdam, The Netherlands.
-
Singh, S., Lewis, R., & Barto, A. (2009). Where do rewards come from? In Proceedings of the thirty-first Annual Conference of the Cognitive Science Society (pp. 2601-2606). Amsterdam, The Netherlands.
-
(2009)
, pp. 2601-2606
-
-
Singh, S.1
Lewis, R.2
Barto, A.3
-
72
-
-
53149107120
-
Striatal and extrastriatal dopamine in the basal ganglia: An overview of its anatomical organization in normal and parkinsonian brains
-
Smith Y., Villalba R. Striatal and extrastriatal dopamine in the basal ganglia: An overview of its anatomical organization in normal and parkinsonian brains. Movement Disorders 2008, 23(Suppl 3):S534-S547.
-
(2008)
Movement Disorders
, vol.23
, Issue.SUPPL.3
-
-
Smith, Y.1
Villalba, R.2
-
73
-
-
0031137174
-
Mental time travel and the evolution of the human mind
-
Suddendorf T., Corballis M.C. Mental time travel and the evolution of the human mind. Genetic, Social, and General Psychology Monographs 1997, 123(2):133-167.
-
(1997)
Genetic, Social, and General Psychology Monographs
, vol.123
, Issue.2
, pp. 133-167
-
-
Suddendorf, T.1
Corballis, M.C.2
-
74
-
-
0032930935
-
A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
-
Suri R.E., Schultz W. A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience 1999, 91(3):871-890.
-
(1999)
Neuroscience
, vol.91
, Issue.3
, pp. 871-890
-
-
Suri, R.E.1
Schultz, W.2
-
75
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton R. Learning to predict by the methods of temporal differences. Machine Learning 1988, 3(1):9-44.
-
(1988)
Machine Learning
, vol.3
, Issue.1
, pp. 9-44
-
-
Sutton, R.1
-
76
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
Proceedings of the seventh international conference on machine learning
-
Sutton, R. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proceedings of the seventh international conference on machine learning, pp. 216-224.
-
(1990)
, pp. 216-224
-
-
Sutton, R.1
-
77
-
-
0004102479
-
-
The MIT Press, Cambridge, MA, Adaptive Computation and Machine Learning
-
Sutton R.S., Barto A.G. Reinforcement learning: An introduction 1998, The MIT Press, Cambridge, MA, Adaptive Computation and Machine Learning.
-
(1998)
Reinforcement learning: An introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
79
-
-
0001461525
-
There is more than one kind of learning
-
Tolman E. There is more than one kind of learning. Psychological Review 1949, 56:144-155.
-
(1949)
Psychological Review
, vol.56
, pp. 144-155
-
-
Tolman, E.1
-
82
-
-
85047685362
-
The time course of perceptual choice: The leaky, competing accumulator model
-
Usher M., McClelland J.L. The time course of perceptual choice: The leaky, competing accumulator model. Psychological Review 2001, 108(3):550-592.
-
(2001)
Psychological Review
, vol.108
, Issue.3
, pp. 550-592
-
-
Usher, M.1
McClelland, J.L.2
-
83
-
-
34247147767
-
Determining the neural substrates of goal-directed learning in the human brain
-
Valentin V.V., Dickinson A., O'Doherty J.P. Determining the neural substrates of goal-directed learning in the human brain. Journal of Neuroscience 2007, 27(15):4019-4026.
-
(2007)
Journal of Neuroscience
, vol.27
, Issue.15
, pp. 4019-4026
-
-
Valentin, V.V.1
Dickinson, A.2
O'Doherty, J.P.3
-
85
-
-
84882553319
-
-
Learning from delayed rewards. PhD thesis, Cambridge, UK: University of Cambridge.
-
Watkins, C. (1989). Learning from delayed rewards. PhD thesis, Cambridge, UK: University of Cambridge.
-
(1989)
-
-
Watkins, C.1
-
86
-
-
1942443226
-
Predicting risk sensitivity in humans and lower animals: Risk as variance or coefficient of variation
-
Weber E.U., Shafir S., Blais A.-R. Predicting risk sensitivity in humans and lower animals: Risk as variance or coefficient of variation. Psychological Review 2004, 111(2):430-445.
-
(2004)
Psychological Review
, vol.111
, Issue.2
, pp. 430-445
-
-
Weber, E.U.1
Shafir, S.2
Blais, A.-R.3
-
87
-
-
0002278965
-
Adaptive switching circuits. In Western Electric Show and Convention Record
-
New York, NY.
-
Widrow, B., & Hoff, M. (1960). Adaptive switching circuits. In Western Electric Show and Convention Record (Vol. 4, pp. 96-104). New York, NY.
-
(1960)
, vol.4
, pp. 96-104
-
-
Widrow, B.1
Hoff, M.2
-
88
-
-
84989993724
-
Auto-maintenance in the pigeon: Sustained pecking despite contingent non-reinforcement
-
Williams D.R., Williams H. Auto-maintenance in the pigeon: Sustained pecking despite contingent non-reinforcement. Journal of the Experimental Analysis of Behavior 1969, 12(4):511-520.
-
(1969)
Journal of the Experimental Analysis of Behavior
, vol.12
, Issue.4
, pp. 511-520
-
-
Williams, D.R.1
Williams, H.2
-
89
-
-
45249097567
-
Striatal activity underlies novelty-based choice in humans
-
Wittmann B.C., Daw N.D., Seymour B., Dolan R.J. Striatal activity underlies novelty-based choice in humans. Neuron 2008, 58(6):967-973.
-
(2008)
Neuron
, vol.58
, Issue.6
, pp. 967-973
-
-
Wittmann, B.C.1
Daw, N.D.2
Seymour, B.3
Dolan, R.J.4