-
1
-
-
33646833114
-
Prediction error as a linear function of reward probability is coded in human nucleus accumbens
-
Abler, B., Walter, H., Erk, S., Kammerer, H., & Spitzer, M. (2006). Prediction error as a linear function of reward probability is coded in human nucleus accumbens. NeuroImage, 31, 790-795
-
(2006)
NeuroImage
, vol.31
, pp. 790-795
-
-
Abler, B.1
Walter, H.2
Erk, S.3
Kammerer, H.4
Spitzer, M.5
-
4
-
-
23944453331
-
Tracing problem solving in real time: fMRI analysis of the subject-paced tower of Hanoi
-
Anderson, J. R., Albert, M. V., & Fincham, J. M. (2005). Tracing problem solving in real time: fMRI analysis of the subject-paced tower of Hanoi. Journal of Cognitive Neuroscience, 17, 1261-1274
-
(2005)
Journal of Cognitive Neuroscience
, vol.17
, pp. 1261-1274
-
-
Anderson, J.R.1
Albert, M.V.2
Fincham, J.M.3
-
5
-
-
0033929121
-
Task-specific neural activity in the primate prefrontal cortex
-
Asaad, W. F., Rainer, G., & Miller, E. K. (2000). Task-specific neural activity in the primate prefrontal cortex. Journal of Neurophysiology, 84, 451-459
-
(2000)
Journal of Neurophysiology
, vol.84
, pp. 451-459
-
-
Asaad, W.F.1
Rainer, G.2
Miller, E.K.3
-
6
-
-
67651120119
-
Which way do I go? Neural activation in response to feedback and spatial processing in a virtual T-Maze
-
Baker, T. E., & Holroyd, C. B. (2009). Which way do I go? Neural activation in response to feedback and spatial processing in a virtual T-Maze. Cerebral Cortex, 19, 1708-1722
-
(2009)
Cerebral Cortex
, vol.19
, pp. 1708-1722
-
-
Baker, T.E.1
Holroyd, C.B.2
-
7
-
-
28444472936
-
Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits
-
Balleine, B. W. (2005). Neural bases of food-seeking: Affect, arousal and reward in corticostriatolimbic circuits. Physiology & Behavior, 86, 717- 730
-
(2005)
Physiology & Behavior
, vol.86
, pp. 717-730
-
-
Balleine, B.W.1
-
8
-
-
0000171584
-
Motivational control of heterogeneous instrumental chains
-
Balleine, B. W., Garner, C., Gonzalez, F., & Dickinson, A. (1995). Motivational control of heterogeneous instrumental chains. Journal of Experimental Psychology: Animal Behavior Processes, 21, 203-217
-
(1995)
Journal of Experimental Psychology: Animal Behavior Processes
, vol.21
, pp. 203-217
-
-
Balleine, B.W.1
Garner, C.2
Gonzalez, F.3
Dickinson, A.4
-
9
-
-
72049125602
-
Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action
-
Balleine, B. W., & O'doherty, J. P. (2010). Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology, 35, 48-69
-
(2010)
Neuropsychopharmacology
, vol.35
, pp. 48-69
-
-
Balleine, B.W.1
O'doherty, J.P.2
-
11
-
-
0035871327
-
Predictability modulates human brain response to reward
-
Berns, G. S., Mcclure, S. M., Pagnoni, G., & Montague, P. R. (2001). Predictability modulates human brain response to reward. The Journal of Neuroscience, 21, 2793-2798.
-
(2001)
The Journal of Neuroscience
, vol.21
, pp. 2793-2798
-
-
Berns, G.S.1
Mcclure, S.M.2
Pagnoni, G.3
Montague, P.R.4
-
12
-
-
33847634405
-
The debate over dopamine's role in reward: The case for incentive salience
-
Berridge, K. C. (2007). The debate over dopamine's role in reward: The case for incentive salience. Psychopharmacology, 191, 391-431
-
(2007)
Psychopharmacology
, vol.191
, pp. 391-431
-
-
Berridge, K.C.1
-
13
-
-
0000040523
-
The effect of the introduction of reward upon the maze performance of rats
-
Blodgett, H. C. (1929). The effect of the introduction of reward upon the maze performance of rats. University of California Publications in Psychology, 4, 113-134.
-
(1929)
University of California Publications in Psychology
, vol.4
, pp. 113-134
-
-
Blodgett, H.C.1
-
14
-
-
34248999741
-
Short-term memory traces for action bias in human reinforcement learning
-
Bogacz, R., Mcclure, S. M., Li, J., Cohen, J. D., & Montague, P. R. (2007). Short-term memory traces for action bias in human reinforcement learning. Brain Research, 1153, 111-121
-
(2007)
Brain Research
, vol.1153
, pp. 111-121
-
-
Bogacz, R.1
Mcclure, S.M.2
Li, J.3
Cohen, J.D.4
Montague, P.R.5
-
15
-
-
37549047252
-
Conflict monitoring and decision making: Reconciling two perspectives on anterior cingulate function
-
Botvinick, M. (2007). Conflict monitoring and decision making: Reconciling two perspectives on anterior cingulate function. Cognitive, Affective & Behavioral Neuroscience, 7, 356-366
-
(2007)
Cognitive, Affective & Behavioral Neuroscience
, vol.7
, pp. 356-366
-
-
Botvinick, M.1
-
16
-
-
84944628858
-
Conflict monitoring and cognitive control
-
Botvinick, M. M., Braver, T. S., Barch, D. M., Carter, C. S., & Cohen, J. D. (2001). Conflict monitoring and cognitive control. Psychological Review, 108, 624-652
-
(2001)
Psychological Review
, vol.108
, pp. 624-652
-
-
Botvinick, M.M.1
Braver, T.S.2
Barch, D.M.3
Carter, C.S.4
Cohen, J.D.5
-
17
-
-
70350566799
-
Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective
-
Botvinick, M. M., Niv, Y., & Barto, A. C. (2009). Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition, 113, 262-280
-
(2009)
Cognition
, vol.113
, pp. 262-280
-
-
Botvinick, M.M.1
Niv, Y.2
Barto, A.C.3
-
18
-
-
77952979246
-
Human medial orbitofrontal cortex is recruited during experience of imagined and real rewards
-
Bray, S., Shimojo, S., & O'Doherty, J. P. (2010). Human medial orbitofrontal cortex is recruited during experience of imagined and real rewards. Journal of Neurophysiology, 103, 2506-2512
-
(2010)
Journal of Neurophysiology
, vol.103
, pp. 2506-2512
-
-
Bray, S.1
Shimojo, S.2
O'doherty, J.P.3
-
20
-
-
13844309349
-
Learned predictions of error likelihood in the anterior cingulate cortex
-
Brown, J. W., & Braver, T. S. (2005). Learned predictions of error likelihood in the anterior cingulate cortex. Science, 307, 1118-1121
-
(2005)
Science
, vol.307
, pp. 1118-1121
-
-
Brown, J.W.1
Braver, T.S.2
-
21
-
-
58149439823
-
Differential errors in animal mazes
-
Buel, J. (1935). Differential errors in animal mazes. Psychological Bulletin, 32, 67-99
-
(1935)
Psychological Bulletin
, vol.32
, pp. 67-99
-
-
Buel, J.1
-
22
-
-
14844315691
-
How we use rules to select actions: A review of evidence from cognitive neuroscience
-
Bunge, S. A. (2004). How we use rules to select actions: A review of evidence from cognitive neuroscience. Cognitive, Affective & Behavioral Neuroscience, 4, 564-579
-
(2004)
Cognitive, Affective & Behavioral Neuroscience
, vol.4
, pp. 564-579
-
-
Bunge, S.A.1
-
23
-
-
0030023405
-
Conservation of hippocampal memory function in rats and humans
-
Bunsey, M., & Eichenbaum, H. (1996). Conservation of hippocampal memory function in rats and humans. Nature, 379, 255-257
-
(1996)
Nature
, vol.379
, pp. 255-257
-
-
Bunsey, M.1
Eichenbaum, H.2
-
24
-
-
67349227786
-
Theoretical tools for understanding and aiding dynamic decision making
-
Busemeyer, J. R., & Pleskac, T. J. (2009). Theoretical tools for understanding and aiding dynamic decision making. Journal of Mathematical Psychology, 53, 126-138
-
(2009)
Journal of Mathematical Psychology
, vol.53
, pp. 126-138
-
-
Busemeyer, J.R.1
Pleskac, T.J.2
-
26
-
-
0034801578
-
The role of ventral and orbital prefrontal cortex in conditional visuomotor learning and strategy use in Rhesus monkeys (Macaca mulatta)
-
Bussey, T. J., Wise, S. P., & Murray, E. A. (2001). The role of ventral and orbital prefrontal cortex in conditional visuomotor learning and strategy use in Rhesus monkeys (Macaca mulatta). Behavioral Neuroscience, 115, 971-982
-
(2001)
Behavioral Neuroscience
, vol.115
, pp. 971-982
-
-
Bussey, T.J.1
Wise, S.P.2
Murray, E.A.3
-
27
-
-
33644480565
-
The effect of different amounts of alternating partial reinforcement on resistance to extinction
-
Capaldi, E. J. (1957). The effect of different amounts of alternating partial reinforcement on resistance to extinction. The American Journal of Psychology, 70, 451-452
-
(1957)
The American Journal of Psychology
, vol.70
, pp. 451-452
-
-
Capaldi, E.J.1
-
28
-
-
0036114221
-
Emotion and motivation: The role of the amygdala, ventral striatum, and prefrontal cortex
-
Cardinal, R. N., Parkinson, J. A., Hall, J., & Everitt, B. J. (2002). Emotion and motivation: The role of the amygdala, ventral striatum, and prefrontal cortex. Neuroscience and Biobehavioral Reviews, 26, 321-352
-
(2002)
Neuroscience and Biobehavioral Reviews
, vol.26
, pp. 321-352
-
-
Cardinal, R.N.1
Parkinson, J.A.2
Hall, J.3
Everitt, B.J.4
-
29
-
-
0032076255
-
Anterior cingulate cortex, error detection, and the online monitoring of performance
-
Carter, C. S., Braver, T. S., Barch, D. M., Botvinick, M. M., Noll, D., & Cohen, J. D. (1998). Anterior cingulate cortex, error detection, and the online monitoring of performance. Science, 280, 747-749
-
(1998)
Science
, vol.280
, pp. 747-749
-
-
Carter, C.S.1
Braver, T.S.2
Barch, D.M.3
Botvinick, M.M.4
Noll, D.5
Cohen, J.D.6
-
30
-
-
33846225079
-
Reinforcement learning signals predict future decisions
-
Cohen, M. X., & Ranganath, C. (2007). Reinforcement learning signals predict future decisions. The Journal of Neuroscience, 27, 371-378
-
(2007)
The Journal of Neuroscience
, vol.27
, pp. 371-378
-
-
Cohen, M.X.1
Ranganath, C.2
-
31
-
-
0003459801
-
Memory, amnesia, and the hippocampal system
-
Cambridge, MA: MIT Press
-
Cohen, N. J., & Eichenbaum, H. (1993). Memory, amnesia, and the hippocampal system. Cambridge, MA: MIT Press.
-
(1993)
-
-
Cohen, N.J.1
Eichenbaum, H.2
-
32
-
-
33746365099
-
Bayesian theories of conditioning in a changing world
-
Courville, A. C., Daw, N. D., & Touretzky, D. S. (2006). Bayesian theories of conditioning in a changing world. Trends in Cognitive Sciences, 10, 295-300
-
(2006)
Trends in Cognitive Sciences
, vol.10
, pp. 295-300
-
-
Courville, A.C.1
Daw, N.D.2
Touretzky, D.S.3
-
33
-
-
79952746011
-
Model-based influences on humans' choices and striatal prediction errors
-
Daw, N. D., Gershman, S. J., Seymour, B., Dayan, P., & Dolan, R. J. (2011). Model-based influences on humans' choices and striatal prediction errors. Neuron, 69, 1204-1215
-
(2011)
Neuron
, vol.69
, pp. 1204-1215
-
-
Daw, N.D.1
Gershman, S.J.2
Seymour, B.3
Dayan, P.4
Dolan, R.J.5
-
34
-
-
28044450875
-
Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
-
Daw, N. D., Niv, Y., & Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience, 8, 1704-1711
-
(2005)
Nature Neuroscience
, vol.8
, pp. 1704-1711
-
-
Daw, N.D.1
Niv, Y.2
Dayan, P.3
-
35
-
-
33745223257
-
Cortical substrates for exploratory decisions in humans
-
Daw, N. D., O'doherty, J. P., Dayan, P., Seymour, B., & Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441, 876-879
-
(2006)
Nature
, vol.441
, pp. 876-879
-
-
Daw, N.D.1
O'doherty, J.P.2
Dayan, P.3
Seymour, B.4
Dolan, R.J.5
-
36
-
-
84859315924
-
Instrumental vigour in punishment and reward
-
Dayan, P. (2012). Instrumental vigour in punishment and reward. European Journal of Neuroscience, 35, 1152-1168
-
(2012)
European Journal of Neuroscience
, vol.35
, pp. 1152-1168
-
-
Dayan, P.1
-
37
-
-
60749114870
-
Decision theory, reinforcement learning, and the brain
-
Dayan, P., & Daw, N. D. (2008). Decision theory, reinforcement learning, and the brain. Cognitive, Affective & Behavioral Neuroscience, 8, 429- 453
-
(2008)
Cognitive, Affective & Behavioral Neuroscience
, vol.8
, pp. 429-453
-
-
Dayan, P.1
Daw, N.D.2
-
38
-
-
52049107354
-
Reinforcement learning: The good, the bad and the ugly
-
Dayan, P., & Niv, Y. (2008). Reinforcement learning: The good, the bad and the ugly. Current Opinion in Neurobiology, 18, 185-196
-
(2008)
Current Opinion in Neurobiology
, vol.18
, pp. 185-196
-
-
Dayan, P.1
Niv, Y.2
-
39
-
-
0004217226
-
-
(2nd ed.) The Hague, the Netherlands: Mouton. (Original work published 1946)
-
de Groot, A. D. (1978). Thought and choice in chess (2nd ed.). The Hague, the Netherlands: Mouton. (Original work published 1946).
-
(1978)
Thought and choice in chess
-
-
de Groot, A.D.1
-
40
-
-
0042697569
-
Dorsal striatum responses to reward and punishment: Effects of valence and magnitude manipulations
-
Delgado, M. R., Locke, H. M., Stenger, V. A., & Fiez, J. A. (2003). Dorsal striatum responses to reward and punishment: Effects of valence and magnitude manipulations. Cognitive, Affective & Behavioral Neuroscience, 3, 27-38.
-
(2003)
Cognitive, Affective & Behavioral Neuroscience
, vol.3
, pp. 27-38
-
-
Delgado, M.R.1
Locke, H.M.2
Stenger, V.A.3
Fiez, J.A.4
-
41
-
-
0032371180
-
Omission learning after instrumental pretraining
-
Dickinson, A., Squire, S., Varga, Z., & Smith, J. W. (1998). Omission learning after instrumental pretraining. The Quarterly Journal of Experimental Psychology B: Comparative and Physiological Psychology, 51B, 271-286.
-
(1998)
The Quarterly Journal of Experimental Psychology B: Comparative and Physiological Psychology
, vol.51 B
, pp. 271-286
-
-
Dickinson, A.1
Squire, S.2
Varga, Z.3
Smith, J.W.4
-
42
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
43
-
-
0027634299
-
A selectionist approach to reinforcement
-
Donahoe, J. W., Burgos, J. E., & Palmer, D. C. (1993). A selectionist approach to reinforcement. Journal of the Experimental Analysis of Behavior, 60, 17-40.
-
(1993)
Journal of the Experimental Analysis of Behavior
, vol.60
, pp. 17-40
-
-
Donahoe, J.W.1
Burgos, J.E.2
Palmer, D.C.3
-
44
-
-
85006547536
-
A comparison of human and agent reinforcement learning
-
L. Carlson, C. Hoelscher, & T. F. Shipley (Eds.), Austin, TX: Cognitive Sciences Society
-
Doshi-Velez, F., & Ghahramani, Z. (2011). A comparison of human and agent reinforcement learning. In L. Carlson, C. Hoelscher, & T. F. Shipley (Eds.), Proceedings of the 33rd annual conference of the Cognitive Science Society (pp. 2703-2708). Austin, TX: Cognitive Sciences Society.
-
(2011)
Proceedings of the 33rd annual conference of the Cognitive Science Society
, pp. 2703-2708
-
-
Doshi-Velez, F.1
Ghahramani, Z.2
-
45
-
-
0033213819
-
What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Networks
-
Doya, K. (1999). What are the computations of the cerebellum, the basal ganglia and the cerebral cortex? Neural Networks, 12, 961-974.
-
(1999)
, vol.12
, pp. 961-974
-
-
Doya, K.1
-
46
-
-
0002337786
-
Metalearning, neuromodulation, and emotion
-
G. Hatano, N. Okada, & H. Tanabe (Eds.), Amsterdam, the Netherlands: Elsevier Science
-
Doya, K. (2000). Metalearning, neuromodulation, and emotion. In G. Hatano, N. Okada, & H. Tanabe (Eds.), Affective minds (pp. 101-104). Amsterdam, the Netherlands: Elsevier Science.
-
(2000)
Affective minds (pp. 101-104)
-
-
Doya, K.1
-
48
-
-
37349015901
-
Error-related negativities elicited by monetary loss and cues that predict loss
-
Dunning, J. P., & Hajcak, G. (2007). Error-related negativities elicited by monetary loss and cues that predict loss. NeuroReport, 18, 1875-1878.
-
(2007)
NeuroReport
, vol.18
, pp. 1875-1878
-
-
Dunning, J.P.1
Hajcak, G.2
-
49
-
-
1542316149
-
Instrumental responding for rewards is associated with enhanced neuronal response in subcortical reward systems
-
Elliott, R., Newman, J. L., Longe, O. A., & Deakin, J. F. W. (2004). Instrumental responding for rewards is associated with enhanced neuronal response in subcortical reward systems. NeuroImage, 21, 984- 990.
-
(2004)
NeuroImage
, vol.21
, pp. 984-990
-
-
Elliott, R.1
Newman, J.L.2
Longe, O.A.3
Deakin, J.F.W.4
-
50
-
-
0025853659
-
Effects of crossmodal divided attention on late ERP components: II
-
Falkenstein, M., Hohnsbein, J., Hoormann, J., & Blanke, L. (1991). Effects of crossmodal divided attention on late ERP components: II. Error processing in choice reaction tasks. Electroencephalography & Clinical Neurophysiology, 78, 447-455.
-
(1991)
Error processing in choice reaction tasks. Electroencephalography & Clinical Neurophysiology
, vol.78
, pp. 447-455
-
-
Falkenstein, M.1
Hohnsbein, J.2
Hoormann, J.3
Blanke, L.4
-
51
-
-
78649604962
-
Evidence for model-based action planning in a sequential finger movement task
-
Fermin, A., Yoshida, T., Ito, M., Yoshimoto, J., & Doya, K. (2010). Evidence for model-based action planning in a sequential finger movement task. Journal of Motor Behavior, 42, 371-379.
-
(2010)
Journal of Motor Behavior
, vol.42
, pp. 371-379
-
-
Fermin, A.1
Yoshida, T.2
Ito, M.3
Yoshimoto, J.4
Doya, K.5
-
52
-
-
0037459319
-
Discrete coding of reward probability and uncertainty by dopamine neurons
-
Fiorillo, C. D., Tobler, P. N., & Schultz, W. (2003). Discrete coding of reward probability and uncertainty by dopamine neurons. Science, 299, 1898-1902.
-
(2003)
Science
, vol.299
, pp. 1898-1902
-
-
Fiorillo, C.D.1
Tobler, P.N.2
Schultz, W.3
-
53
-
-
33645458694
-
Reverse replay of behavioural sequences in hippocampal place cells during the awake state
-
Foster, D. J., & Wilson, M. A. (2006). Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature, 440, 680-683.
-
(2006)
Nature
, vol.440
, pp. 680-683
-
-
Foster, D.J.1
Wilson, M.A.2
-
54
-
-
33744550336
-
Anatomy of a decision: Striatoorbitofrontal interactions in reinforcement learning, decision making, and reversal
-
Frank, M. J., & Claus, E. D. (2006). Anatomy of a decision: Striatoorbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychological Review, 113, 300-326.
-
(2006)
Psychological Review
, vol.113
, pp. 300-326
-
-
Frank, M.J.1
Claus, E.D.2
-
55
-
-
0041109884
-
Time discounting and time preference: A critical review
-
Frederick, S., Loewenstein, G., & O'Donoghue, T. (2002). Time discounting and time preference: A critical review. Journal of Economic Literature, 40, 351-401.
-
(2002)
Journal of Economic Literature
, vol.40
, pp. 351-401
-
-
Frederick, S.1
Loewenstein, G.2
O'Donoghue, T.3
-
56
-
-
33745108748
-
From recurrent choice to skill learning: A reinforcement-learning model
-
Fu, W. T., & Anderson, J. R. (2006). From recurrent choice to skill learning: A reinforcement-learning model. Journal of Experimental Psychology: General, 135, 184-206.
-
(2006)
Journal of Experimental Psychology: General
, vol.135
, pp. 184-206
-
-
Fu, W.T.1
Anderson, J.R.2
-
58
-
-
40949160181
-
Solving the credit assignment problem: Explicit and implicit learning of action sequences with probabilistic outcomes
-
Fu, W. T., & Anderson, J. R. (2008b). Solving the credit assignment problem: Explicit and implicit learning of action sequences with probabilistic outcomes. Psychological Research, 72, 321-330.
-
(2008)
Psychological Research
, vol.72
, pp. 321-330
-
-
Fu, W.T.1
Anderson, J.R.2
-
59
-
-
10244241691
-
Resolving the paradox of the active user: Stable suboptimal performance in interactive tasks
-
Fu, W. T., & Gray, W. D. (2004). Resolving the paradox of the active user: Stable suboptimal performance in interactive tasks. Cognitive Science, [28,] 901-935.
-
(2004)
Cognitive Science
, vol.28
, pp. 901-935
-
-
Fu, W.T.1
Gray, W.D.2
-
60
-
-
0003966168
-
The prefrontal cortex: Anatomy, physiology, and neuropsychology of the frontal lobe
-
Philadelphia, PA: Lippincott- Raven
-
Fuster, J. M. (1997). The prefrontal cortex: Anatomy, physiology, and neuropsychology of the frontal lobe. Philadelphia, PA: Lippincott- Raven.
-
(1997)
-
-
Fuster, J.M.1
-
61
-
-
84965511150
-
A neural system for error detection and compensation
-
Gehring, W. J., Goss, B., Coles, M. G. H., Meyer, D. E., & Donchin, E. (1993). A neural system for error detection and compensation. Psychological Science, 4, 385-390.
-
(1993)
Psychological Science
, vol.4
, pp. 385-390
-
-
Gehring, W.J.1
Goss, B.2
Coles, M.G.H.3
Meyer, D.E.4
Donchin, E.5
-
62
-
-
74049117596
-
Context, learning, and extinction
-
Gershman, S. J., Blei, D., & Niv, Y. (2010). Context, learning, and extinction. Psychological Review, 117, 197-209.
-
(2010)
Psychological Review
, vol.117
, pp. 197-209
-
-
Gershman, S.J.1
Blei, D.2
Niv, Y.3
-
64
-
-
84870955069
-
Exploring a latent cause theory of classical conditioning
-
Gershman, S. J., & Niv, Y. (2012). Exploring a latent cause theory of classical conditioning. Learning & Behavior, 40, 255-268.
-
(2012)
Learning & Behavior
, vol.40
, pp. 255-268
-
-
Gershman, S.J.1
Niv, Y.2
-
65
-
-
77953260848
-
States versus rewards: Dissociable neural prediction error signals underlying modelbased and model-free reinforcement learning
-
Gläscher, J., Daw, N., Dayan, P., & O'doherty, J. P. (2010). States versus rewards: Dissociable neural prediction error signals underlying modelbased and model-free reinforcement learning. Neuron, 66, 585-595.
-
(2010)
Neuron
, vol.66
, pp. 585-595
-
-
Gläscher, J.1
Daw, N.2
Dayan, P.3
O'doherty, J.P.4
-
66
-
-
58449113882
-
Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making
-
Gläscher, J., Hampton, A. N., & O'doherty, J. P. (2008). Determining a role for ventromedial prefrontal cortex in encoding action-based value signals during reward-related decision making. Cerebral Cortex, 19, 483-495.
-
(2008)
Cerebral Cortex
, vol.19
, pp. 483-495
-
-
Gläscher, J.1
Hampton, A.N.2
O'doherty, J.P.3
-
67
-
-
33746347681
-
The soft constraints hypothesis: A rational analysis approach to resource allocation for interactive behavior
-
Gray, W. D., Sims, C. R., Fu, W. T., & Schoelles, M. J. (2006). The soft constraints hypothesis: A rational analysis approach to resource allocation for interactive behavior. Psychological Review, 113, 461-482.
-
(2006)
Psychological Review
, vol.113
, pp. 461-482
-
-
Gray, W.D.1
Sims, C.R.2
Fu, W.T.3
Schoelles, M.J.4
-
68
-
-
70350572378
-
Short-term gains, long-term pains: How cues about state aid learning in dynamic environments
-
Gureckis, T. M., & Love, B. C. (2009). Short-term gains, long-term pains: How cues about state aid learning in dynamic environments. Cognition, 113, 293-313.
-
(2009)
Cognition
, vol.113
, pp. 293-313
-
-
Gureckis, T.M.1
Love, B.C.2
-
69
-
-
0034654526
-
Striatonigrostriatal pathways in primates form an ascending spiral from the shell to the dorsolateral striatum
-
Haber, S. N., Fudge, J. L., & McFarland, N. R. (2000). Striatonigrostriatal pathways in primates form an ascending spiral from the shell to the dorsolateral striatum. Journal of Neuroscience, 20, 2369-2382.
-
(2000)
Journal of Neuroscience
, vol.20
, pp. 2369-2382
-
-
Haber, S.N.1
Fudge, J.L.2
McFarland, N.R.3
-
70
-
-
0025264150
-
Topographic organization of the ventral striatal efferent projections in the rhesus monkey: An anterograde tracing study
-
Haber, S. N., Lynd-Balta, E., Klein, C., & Groenewegen, H. J. (1990). Topographic organization of the ventral striatal efferent projections in the rhesus monkey: An anterograde tracing study. Journal of Comparative Neurology, 293, 282-298.
-
(1990)
Journal of Comparative Neurology
, vol.293
, pp. 282-298
-
-
Haber, S.N.1
Lynd-Balta, E.2
Klein, C.3
Groenewegen, H.J.4
-
71
-
-
33748188120
-
The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans
-
Hampton, A. N., Bossaerts, P., & O'doherty, J. P. (2006). The role of the ventromedial prefrontal cortex in abstract state-based inference during decision making in humans. The Journal of Neuroscience, 26, 8360- 8367.
-
(2006)
The Journal of Neuroscience
, vol.26
, pp. 8360-8367
-
-
Hampton, A.N.1
Bossaerts, P.2
O'doherty, J.P.3
-
73
-
-
84980138473
-
Utility maximization and melioration: Internalities in individual choice
-
Herrnstein, R. J., Loewenstein, G. F., Prelec, D., & Vaughan, W. (1993). Utility maximization and melioration: Internalities in individual choice. Journal of Behavioral Decision Making, 6, 149-185.
-
(1993)
Journal of Behavioral Decision Making
, vol.6
, pp. 149-185
-
-
Herrnstein, R.J.1
Loewenstein, G.F.2
Prelec, D.3
Vaughan, W.4
-
75
-
-
0036644494
-
Decision biases and persistent illicit drug use: An experimental study of distributed choice and addiction
-
Heyman, G. M., & Dunn, B. (2002). Decision biases and persistent illicit drug use: An experimental study of distributed choice and addiction. Drug and Alcohol Dependence, 67, 193-203.
-
(2002)
Drug and Alcohol Dependence
, vol.67
, pp. 193-203
-
-
Heyman, G.M.1
Dunn, B.2
-
76
-
-
0016564510
-
The effect of two ways of devaluing the unconditioned stimulus after first- and second-order appetitive conditioning
-
Holland, P. C., & Rescorla, R. (1975). The effect of two ways of devaluing the unconditioned stimulus after first- and second-order appetitive conditioning. Journal of Experimental Psychology: Animal Behavior Processes, 1, 355-363.
-
(1975)
Journal of Experimental Psychology: Animal Behavior Processes
, vol.1
, pp. 355-363
-
-
Holland, P.C.1
Rescorla, R.2
-
77
-
-
85047670409
-
The neural basis of human error processing: Reinforcement learning, dopamine, and the error-relatednegativity
-
Holroyd, C. B., & Coles, M. G. H. (2002). The neural basis of human error processing: Reinforcement learning, dopamine, and the error-relatednegativity. Psychological Review, 109, 679-709.
-
(2002)
Psychological Review
, vol.109
, pp. 679-709
-
-
Holroyd, C.B.1
Coles, M.G.H.2
-
78
-
-
79952906182
-
Reward positivity elicited by predictive cues
-
Holroyd, C. B., Krigolson, O. E., & Lee, S. (2011). Reward positivity elicited by predictive cues. NeuroReport, 22, 249-252.
-
(2011)
NeuroReport
, vol.22
, pp. 249-252
-
-
Holroyd, C.B.1
Krigolson, O.E.2
Lee, S.3
-
79
-
-
0034077644
-
Neuronal activity in the primate prefrontal cortex in the process of motor selection based on two behavioral rules
-
Hoshi, E., Shima, K., & Tanji, J. (2000). Neuronal activity in the primate prefrontal cortex in the process of motor selection based on two behavioral rules. Journal of Neurophysiology, 83, 2355-2373.
-
(2000)
Journal of Neurophysiology
, vol.83
, pp. 2355-2373
-
-
Hoshi, E.1
Shima, K.2
Tanji, J.3
-
80
-
-
0003090736
-
The goal gradient hypothesis and maze learning
-
Hull, C. L.. (1932) The goal gradient hypothesis and maze learning. Psychological Review, 39, 25-43.
-
(1932)
Psychological Review
, vol.39
, pp. 25-43
-
-
Hull, C.L.1
-
82
-
-
84859371025
-
Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees
-
Huys, Q. J. M., Eshel, N., O'Nions, E., Sheridan, L., Dayan, P., & Roiser, J. P.. (2012). Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLOS Computational Biology, 8, e1002410.
-
(2012)
PLOS Computational Biology
, vol.8
-
-
Huys, Q.J.M.1
Eshel, N.2
O'Nions, E.3
Sheridan, L.4
Dayan, P.5
Roiser, J.P.6
-
83
-
-
0035489031
-
Addiction and the brain: The neurobiology of compulsion and its persistence
-
Hyman, S. E., & Malenka, R. C.. (2001). Addiction and the brain: The neurobiology of compulsion and its persistence. Nature Reviews Neuroscience, 2, 695-703.
-
(2001)
Nature Reviews Neuroscience
, vol.2
, pp. 695-703
-
-
Hyman, S.E.1
Malenka, R.C.2
-
84
-
-
67449138590
-
Brain mechanisms for predictive control by switching internal models: Implications for higher-order cognitive functions
-
Imamizu, H., & Kawato, M.. (2009). Brain mechanisms for predictive control by switching internal models: Implications for higher-order cognitive functions. Psychological Research, 73, 527-544.
-
(2009)
Psychological Research
, vol.73
, pp. 527-544
-
-
Imamizu, H.1
Kawato, M.2
-
85
-
-
41149173145
-
Control of mental activities by internal models in the cerebellum
-
Ito, M. (2008). Control of mental activities by internal models in the cerebellum. Nature Reviews Neuroscience, 9, 304-313.
-
(2008)
Nature Reviews Neuroscience
, vol.9
, pp. 304-313
-
-
Ito, M.1
-
86
-
-
4444358622
-
Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency
-
Izquierdo, A. D., Suda, R. K., & Murray, E. A.. (2004). Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency. The Journal of Neuroscience, 24, 7540-7548.
-
(2004)
The Journal of Neuroscience
, vol.24
, pp. 7540-7548
-
-
Izquierdo, A.D.1
Suda, R.K.2
Murray, E.A.3
-
87
-
-
0003891643
-
-
New York, NY: Dover. (Original work published 1890)
-
James, W.. (1950). The principles of psychology. New York, NY: Dover. (Original work published 1890).
-
(1950)
The principles of psychology
-
-
James, W.1
-
88
-
-
84857974114
-
When, what, and how much to reward in reinforcement learning-based models of cognition
-
Janssen, C. P., & Gray, W. D.. (2012). When, what, and how much to reward in reinforcement learning-based models of cognition. Cognitive Science, 36, 333-358.
-
(2012)
Cognitive Science
, vol.36
, pp. 333-358
-
-
Janssen, C.P.1
Gray, W.D.2
-
89
-
-
0036592026
-
Actor-critic models of the basal ganglia: New anatomical and computational perspectives
-
Joel, D., Niv, Y., & Ruppin, E.. (2002). Actor-critic models of the basal ganglia: New anatomical and computational perspectives. Neural Networks, 15, 535-547.
-
(2002)
Neural Networks
, vol.15
, pp. 535-547
-
-
Joel, D.1
Niv, Y.2
Ruppin, E.3
-
90
-
-
36048937548
-
Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point
-
Johnson, A., & Redish, A. D.. (2007). Neural ensembles in CA3 transiently encode paths forward of the animal at a decision point. The Journal of Neuroscience, 27, 12176 -12189.
-
(2007)
The Journal of Neuroscience
, vol.27
-
-
Johnson, A.1
Redish, A.D.2
-
91
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
Kaelbling, L. P., Littman, M. L., & Cassandra, A. R.. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
-
(1998)
Artificial Intelligence
, vol.101
, pp. 99-134
-
-
Kaelbling, L.P.1
Littman, M.L.2
Cassandra, A.R.3
-
93
-
-
33745565701
-
Optimal decision making and the anterior cingulate cortex
-
Kennerley, S. W., Walton, M. E., Behrens, T. E. J., Buckley, M. J., & Rushworth, M. F. S.. (2006). Optimal decision making and the anterior cingulate cortex. Nature Neuroscience, 9, 940-947.
-
(2006)
Nature Neuroscience
, vol.9
, pp. 940-947
-
-
Kennerley, S.W.1
Walton, M.E.2
Behrens, T.E.J.3
Buckley, M.J.4
Rushworth, M.F.S.5
-
94
-
-
79958143780
-
Speed/accuracy trade-off between the habitual and the goal-directed processes
-
Keramati, M., Dezfouli, A., & Piray, P.. (2011). Speed/accuracy trade-off between the habitual and the goal-directed processes. PLOS Computational Biology, 7, e1002055.
-
(2011)
PLOS Computational Biology
, vol.7
-
-
Keramati, M.1
Dezfouli, A.2
Piray, P.3
-
95
-
-
1142286344
-
Anterior cingulate conflict monitoring and adjustments in control
-
Kerns, J. G., Cohen, J. D., Macdonald, A. W., Cho, R. Y., Stenger, V. A., & Carter, C. S. (2004). Anterior cingulate conflict monitoring and adjustments in control. Science, 303, 1023-1026.
-
(2004)
Science
, vol.303
, pp. 1023-1026
-
-
Kerns, J.G.1
Cohen, J.D.2
Macdonald, A.W.3
Cho, R.Y.4
Stenger, V.A.5
Carter, C.S.6
-
96
-
-
0037382264
-
Coordination of actions and habits in the medial prefrontal cortex of rats
-
Killcross, S., & Coutureau, E. (2003). Coordination of actions and habits in the medial prefrontal cortex of rats. Cerebral Cortex, 13, 400-408.
-
(2003)
Cerebral Cortex
, vol.13
, pp. 400-408
-
-
Killcross, S.1
Coutureau, E.2
-
97
-
-
18644371864
-
Distributed neural representation of expected value
-
Knutson, B., Taylor, J., Kaufman, M., Peterson, R., & Glover, G. (2005). Distributed neural representation of expected value. The Journal of Neuroscience, 25, 4806-4812.
-
(2005)
The Journal of Neuroscience
, vol.25
, pp. 4806-4812
-
-
Knutson, B.1
Taylor, J.2
Kaufman, M.3
Peterson, R.4
Glover, G.5
-
98
-
-
50349093022
-
Influences of reward delays on responses of dopamine neurons
-
Kobayashi, S., & Schultz, W. (2008). Influences of reward delays on responses of dopamine neurons. The Journal of Neuroscience, 28, 7837- 7846.
-
(2008)
The Journal of Neuroscience
, vol.28
, pp. 7837-7846
-
-
Kobayashi, S.1
Schultz, W.2
-
99
-
-
80052700236
-
This ought to be good: Brain activity accompanying positive and negative expectations and outcomes
-
Liao, Y., Gramann, K., Feng, W., Deák, G. O., & Li, H. (2011). This ought to be good: Brain activity accompanying positive and negative expectations and outcomes. Psychophysiology, 48, 1412-1419.
-
(2011)
Psychophysiology
, vol.48
, pp. 1412-1419
-
-
Liao, Y.1
Gramann, K.2
Feng, W.3
Deák, G.O.4
Li, H.5
-
100
-
-
79951839136
-
Neural correlates of instrumental contingency learning: Differential effects of action-reward conjunction and disjunction
-
Liljeholm, M., Tricomi, E., O'doherty, J. P., & Balleine, B. W. (2011). Neural correlates of instrumental contingency learning: Differential effects of action-reward conjunction and disjunction. The Journal of Neuroscience, 31, 2474-2480.
-
(2011)
The Journal of Neuroscience
, vol.31
, pp. 2474-2480
-
-
Liljeholm, M.1
Tricomi, E.2
O'doherty, J.P.3
Balleine, B.W.4
-
101
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
Lin, L. J. (1992). Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8, 293-321.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.J.1
-
102
-
-
0012327484
-
Using eligibility traces to find the best memoryless policy in partially observable Markov decision processes
-
J. W. Shavlik (Ed.), San Francisco, CA: Morgan Kaufmann
-
Loch, J., & Singh, S. (1998). Using eligibility traces to find the best memoryless policy in partially observable Markov decision processes. In J. W. Shavlik (Ed.), Proceedings of the fifteenth international conference on machine learning (pp. 323-331). San Francisco, CA: Morgan Kaufmann.
-
(1998)
Proceedings of the fifteenth international conference on machine learning
, pp. 32333
-
-
Loch, J.1
Singh, S.2
-
104
-
-
0000603047
-
The choice axiom after twenty years
-
Luce, R. D. (1977). The choice axiom after twenty years. Journal of Mathematical Psychology, 15, 215-233.
-
(1977)
Journal of Mathematical Psychology
, vol.15
, pp. 215-233
-
-
Luce, R.D.1
-
105
-
-
0030789031
-
Impulsive and self-control choices in opioid-dependent patients and non-drug-using control participants: Drug and monetary rewards
-
Madden, G. J., Petry, N. M., Badger, G. J., & Bickel, W. K. (1997). Impulsive and self-control choices in opioid-dependent patients and non-drug-using control participants: Drug and monetary rewards. Experimental and Clinical Psychopharmacology, 5, 256-262.
-
(1997)
Experimental and Clinical Psychopharmacology
, vol.5
, pp. 256-262
-
-
Madden, G.J.1
Petry, N.M.2
Badger, G.J.3
Bickel, W.K.4
-
106
-
-
77953156256
-
Fear conditioning and social groups: Statistics, not genetics
-
Maia, T. V. (2009). Fear conditioning and social groups: Statistics, not genetics. Cognitive Science, 33, 1232-1251.
-
(2009)
Cognitive Science
, vol.33
, pp. 1232-1251
-
-
Maia, T.V.1
-
107
-
-
77949897253
-
Two-factor theory, the actor-critic model, and conditioned avoidance
-
Maia, T. V. (2010). Two-factor theory, the actor-critic model, and conditioned avoidance. Learning & Behavior, 38, 50-67.
-
(2010)
Learning & Behavior
, vol.38
, pp. 50-67
-
-
Maia, T.V.1
-
108
-
-
79251569290
-
From reinforcement learning models to psychiatric and neurological disorders
-
Maia, T. V., & Frank, M. J. (2011). From reinforcement learning models to psychiatric and neurological disorders. Nature Neuroscience, 14, 154-162.
-
(2011)
Nature Neuroscience
, vol.14
, pp. 154-162
-
-
Maia, T.V.1
Frank, M.J.2
-
109
-
-
33644820167
-
Prefrontal cell activities related to monkeys' success and failure in adapting to rule changes in a Wisconsin Card Sorting test analog
-
Mansouri, F. A., Matsumoto, K., & Tanaka, K. (2006). Prefrontal cell activities related to monkeys' success and failure in adapting to rule changes in a Wisconsin Card Sorting test analog. The Journal of Neuroscience, 26, 2745-2756.
-
(2006)
The Journal of Neuroscience
, vol.26
, pp. 2745-2756
-
-
Mansouri, F.A.1
Matsumoto, K.2
Tanaka, K.3
-
110
-
-
0001657237
-
Instance-based utile distinctions for reinforcement learning with hidden states
-
A. Prieditis & S. J. Russell (Eds.), San Francisco, CA: Morgan Kaufmann
-
Mccallum, R. A. (1995). Instance-based utile distinctions for reinforcement learning with hidden states. In A. Prieditis & S. J. Russell (Eds.), The proceedings of the twelfth international machine learning conference (pp. 387-395). San Francisco, CA: Morgan Kaufmann.
-
(1995)
The proceedings of the twelfth international machine learning conference
, pp. 387395
-
-
Mccallum, R.A.1
-
111
-
-
0037650217
-
Temporal prediction errors in a passive learning task activate human striatum
-
Mcclure, S. M., Berns, G. S., & Montague, P. R. (2003). Temporal prediction errors in a passive learning task activate human striatum. Neuron, 38, 339-346.
-
(2003)
Neuron
, vol.38
, pp. 339-346
-
-
Mcclure, S.M.1
Berns, G.S.2
Montague, P.R.3
-
112
-
-
79951823576
-
Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning
-
McDannald, M. A., Lucantonio, F., Burke, K. A., Niv, Y., & Schoenbaum, G. (2011). Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning. The Journal of Neuroscience, 31, 2700-2705.
-
(2011)
The Journal of Neuroscience
, vol.31
, pp. 2700-2705
-
-
McDannald, M.A.1
Lucantonio, F.2
Burke, K.A.3
Niv, Y.4
Schoenbaum, G.5
-
114
-
-
0031436055
-
Event-related brain potentials following incorrect feedback in a time-estimation task: Evidence for a "generic" neural system for error detection
-
Miltner, W. H. R., Braun, C. H., & Coles, M. G. H. (1997). Event-related brain potentials following incorrect feedback in a time-estimation task: Evidence for a "generic" neural system for error detection. Journal of Cognitive Neuroscience, 9, 788-798.
-
(1997)
Journal of Cognitive Neuroscience
, vol.9
, pp. 788-798
-
-
Miltner, W.H.R.1
Braun, C.H.2
Coles, M.G.H.3
-
115
-
-
0002936464
-
Steps toward artificial intelligence
-
E. A. Feigenbaum & J. Feldman (Eds.), New York, NY: McGraw-Hill
-
Minsky, M. (1963). Steps toward artificial intelligence. In E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought (pp. 406-450). New York, NY: McGraw-Hill.
-
(1963)
Computers and thought
, pp. 406450
-
-
Minsky, M.1
-
116
-
-
0033662350
-
Effects of central 5-hydroxytrptamine depletion on sensitivity to delayed and probabilistic reinforcement
-
Mobini, S., Chiang, T. J., Ho, M. Y., Bradshaw, C. M., & Szabadi, E. (2000). Effects of central 5-hydroxytrptamine depletion on sensitivity to delayed and probabilistic reinforcement. Psychopharmacology, 152, 390-397.
-
(2000)
Psychopharmacology
, vol.152
, pp. 390-397
-
-
Mobini, S.1
Chiang, T.J.2
Ho, M.Y.3
Bradshaw, C.M.4
Szabadi, E.5
-
117
-
-
0037057753
-
Neural economics and the biological substrates of valuation
-
Montague, P. R., & Berns, G. S. (2002). Neural economics and the biological substrates of valuation. Neuron, 36, 265-284.
-
(2002)
Neuron
, vol.36
, pp. 265-284
-
-
Montague, P.R.1
Berns, G.S.2
-
118
-
-
0029981543
-
A framework for mesencephalic dopamine systems based on predictive Hebbian learning
-
Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of Neuroscience, 16, 1936-1947.
-
(1996)
The Journal of Neuroscience
, vol.16
, pp. 1936-1947
-
-
Montague, P.R.1
Dayan, P.2
Sejnowski, T.J.3
-
119
-
-
7244240565
-
Computational roles for dopamine in behavioural control
-
Montague, P. R., Hyman, S. E., & Cohen, J. D. (2004). Computational roles for dopamine in behavioural control. Nature, 431, 760-767.
-
(2004)
Nature
, vol.431
, pp. 760-767
-
-
Montague, P.R.1
Hyman, S.E.2
Cohen, J.D.3
-
120
-
-
33747585633
-
Midbrain dopamine neurons encode decisions for future action
-
Morris, G., Nevet, A., Arkadir, D., Vaadia, E., & Bergman, H. (2006). Midbrain dopamine neurons encode decisions for future action. Nature Neuroscience, 9, 1057-1063.
-
(2006)
Nature Neuroscience
, vol.9
, pp. 1057-1063
-
-
Morris, G.1
Nevet, A.2
Arkadir, D.3
Vaadia, E.4
Bergman, H.5
-
121
-
-
33745978411
-
A comparison of abstract rules in the prefrontal cortex, premotor cortex, inferior temporal cortex, and striatum
-
Muhammad, R., Wallis, J. D., & Miller, E. K. (2006). A comparison of abstract rules in the prefrontal cortex, premotor cortex, inferior temporal cortex, and striatum. Journal of Cognitive Neuroscience, 18, 974-989.
-
(2006)
Journal of Cognitive Neuroscience
, vol.18
, pp. 974-989
-
-
Muhammad, R.1
Wallis, J.D.2
Miller, E.K.3
-
122
-
-
33646431689
-
Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans
-
Mushiake, H., Saito, N., Sakamoto, K., Itoyama, Y., & Tanji, J. (2006). Activity in the lateral prefrontal cortex reflects multiple steps of future events in action plans. Neuron, 50, 631-641.
-
(2006)
Neuron
, vol.50
, pp. 631-641
-
-
Mushiake, H.1
Saito, N.2
Sakamoto, K.3
Itoyama, Y.4
Tanji, J.5
-
124
-
-
67349283062
-
Reinforcement learning in the brain
-
Niv, Y. (2009). Reinforcement learning in the brain. Journal of Mathematical Psychology, 53, 139-154.
-
(2009)
Journal of Mathematical Psychology
, vol.53
, pp. 139-154
-
-
Niv, Y.1
-
125
-
-
0037987978
-
Temporal difference models and reward-related learning in the human brain
-
O'doherty, J. P., Dayan, P., Friston, K., Critchley, H., & Dolan, R. J. (2003). Temporal difference models and reward-related learning in the human brain. Neuron, 38, 329 -337.
-
(2003)
Neuron
, vol.38
-
-
O'doherty, J.P.1
Dayan, P.2
Friston, K.3
Critchley, H.4
Dolan, R.J.5
-
126
-
-
1942520195
-
Dissociable roles of ventral and dorsal striatum in instrumental conditioning
-
O'doherty, J. P., Dayan, P., Schultz, J., Deichmann, R., Friston, K., & Dolan, R. J. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304, 452-454.
-
(2004)
Science
, vol.304
, pp. 452-454
-
-
O'doherty, J.P.1
Dayan, P.2
Schultz, J.3
Deichmann, R.4
Friston, K.5
Dolan, R.J.6
-
127
-
-
34447643062
-
Model-based fMRI and its application to reward learning and decision making
-
O'doherty, J. P., Hampton, A., & Kim, H. (2007). Model-based fMRI and its application to reward learning and decision making. Annals of the New York Academy of Science, 1104, 35-53.
-
(2007)
Annals of the New York Academy of Science
, vol.1104
, pp. 35-53
-
-
O'doherty, J.P.1
Hampton, A.2
Kim, H.3
-
129
-
-
33644927837
-
Making working memory work: A computational model of learning in prefrontal cortex and basal ganglia
-
O'Reilly, R. C., & Frank, M. J. (2006). Making working memory work: A computational model of learning in prefrontal cortex and basal ganglia. Neural Computation, 18, 283-328.
-
(2006)
Neural Computation
, vol.18
, pp. 283-328
-
-
O'Reilly, R.C.1
Frank, M.J.2
-
130
-
-
23944507547
-
Lesions of medial prefrontal cortex disrupt the acquisition but not the expression of goal-directed learning
-
Ostlund, S. B., & Balleine, B. W. (2005). Lesions of medial prefrontal cortex disrupt the acquisition but not the expression of goal-directed learning. The Journal of Neuroscience, 25, 7763-7770.
-
(2005)
The Journal of Neuroscience
, vol.25
, pp. 7763-7770
-
-
Ostlund, S.B.1
Balleine, B.W.2
-
131
-
-
84894429027
-
(in press)
-
Otto, A. R., Gershman, S. J., Markman, A. B., & Daw, N. D. (in press). The curse of planning: Dissecting multiple reinforcement learning systems by taxing the central executive. Psychological Science.
-
The curse of planning: Dissecting multiple reinforcement learning systems by taxing the central executive. Psychological Science.
-
-
Otto, A.R.1
Gershman, S.J.2
Markman, A.B.3
Daw, N.D.4
-
132
-
-
0030722121
-
Cognitive planning in humans: Neuropsychological, neuroanatomical and neuropharmacological perspectives
-
Owen, A. M. (1997). Cognitive planning in humans: Neuropsychological, neuroanatomical and neuropharmacological perspectives. Progress in Neurobiology, 53, 431-450.
-
(1997)
Progress in Neurobiology
, vol.53
, pp. 431-450
-
-
Owen, A.M.1
-
133
-
-
0028907108
-
Dopamine-dependent fronto-striatalplanning deficits in early Parkinson's disease
-
Owen, A. M., Sahakian, B. J., Hodges, J. R., Summers, B. A., Polkey, C. E., & Robbins, T. W. (1995). Dopamine-dependent fronto-striatalplanning deficits in early Parkinson's disease. Neuropsychology, 9, 126-140.
-
(1995)
Neuropsychology
, vol.9
, pp. 126-140
-
-
Owen, A.M.1
Sahakian, B.J.2
Hodges, J.R.3
Summers, B.A.4
Polkey, C.E.5
Robbins, T.W.6
-
134
-
-
0036308524
-
Learning and memory functions of the basal ganglia
-
Packard, M. G., & Knowlton, B. J. (2002). Learning and memory functions of the basal ganglia. Annual Review of Neuroscience, 25, 563-593.
-
(2002)
Annual Review of Neuroscience
, vol.25
, pp. 563-593
-
-
Packard, M.G.1
Knowlton, B.J.2
-
135
-
-
0036159133
-
Activity in human ventral striatum locked to errors of reward prediction
-
Pagnoni, G., Zink, C. F., Montague, P. R., & Berns, G. S. (2002). Activity in human ventral striatum locked to errors of reward prediction. Nature Neuroscience, 5, 97-98.
-
(2002)
Nature Neuroscience
, vol.5
, pp. 97-98
-
-
Pagnoni, G.1
Zink, C.F.2
Montague, P.R.3
Berns, G.S.4
-
136
-
-
21544455210
-
Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward learning network
-
Pan, W. X., Schmidt, R., Wickens, J. R., & Hyland, B. I. (2005). Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward learning network. The Journal of Neuroscience, 25, 6235-6242.
-
(2005)
The Journal of Neuroscience
, vol.25
, pp. 6235-6242
-
-
Pan, W.X.1
Schmidt, R.2
Wickens, J.R.3
Hyland, B.I.4
-
138
-
-
34548651404
-
Orbitofrontal cortex encodes willingness to pay in everyday economic transactions
-
Plassmann, H., O'doherty, J., & Rangel, A. (2007). Orbitofrontal cortex encodes willingness to pay in everyday economic transactions. The Journal of Neuroscience, 27, 9984-9988.
-
(2007)
The Journal of Neuroscience
, vol.27
, pp. 9984-9988
-
-
Plassmann, H.1
O'doherty, J.2
Rangel, A.3
-
139
-
-
0002253315
-
Psychophysiology of N200/N400: A review and classification scheme
-
J. R. Jennings, P. K. Ackles, & M. G. H. Coles (Eds.), London, England: Jessica Kingsley
-
Pritchard, W. S., Shappell, S. A., & Brandt, M. E. (1991). Psychophysiology of N200/N400: A review and classification scheme. In J. R. Jennings, P. K. Ackles, & M. G. H. Coles (Eds.), Advances in psychophysiology (Vol. 4, pp. 43-106). London, England: Jessica Kingsley.
-
(1991)
Advances in psychophysiology
, vol.4
, pp. 43-106
-
-
Pritchard, W.S.1
Shappell, S.A.2
Brandt, M.E.3
-
140
-
-
0029018495
-
Self-control: Beyond commitment
-
Rachlin, H. (1995). Self-control: Beyond commitment. Behavioral and Brain Sciences, 18, 109-159.
-
(1995)
Behavioral and Brain Sciences
, vol.18
, pp. 109-159
-
-
Rachlin, H.1
-
141
-
-
45749098894
-
A framework for studying the neurobiology of value-based decision making
-
Rangel, A., Camerer, C., & Montague, P. R. (2008). A framework for studying the neurobiology of value-based decision making. Nature Reviews Neuroscience, 9, 545-556.
-
(2008)
Nature Reviews Neuroscience
, vol.9
, pp. 545-556
-
-
Rangel, A.1
Camerer, C.2
Montague, P.R.3
-
142
-
-
79960241771
-
Decision making under uncertainty: A neural model based on partially observable Markov decision processes
-
Rao, R. P. N. (2010). Decision making under uncertainty: A neural model based on partially observable Markov decision processes. Frontiers in Computational Neuroscience, 4, 146.
-
(2010)
Frontiers in Computational Neuroscience
, vol.4
, pp. 146
-
-
Rao, R.P.N.1
-
143
-
-
77958465056
-
Goal-directed and habitual control in the basal ganglia: Implications for Parkinson's disease
-
Redgrave, P., Rodriguez, M., Smith, Y., Rodriguez-Oroz, M. C., Lehericy, S., Bergman, H.,... Obeso, J. A. (2010). Goal-directed and habitual control in the basal ganglia: Implications for Parkinson's disease. Nature Reviews Neuroscience, 11, 760-772.
-
(2010)
Nature Reviews Neuroscience
, vol.11
, pp. 760-772
-
-
Redgrave, P.1
Rodriguez, M.2
Smith, Y.3
Rodriguez-Oroz, M.C.4
Lehericy, S.5
Bergman, H.6
Obeso, J.A.7
-
144
-
-
48349092693
-
A unified framework for addiction: Vulnerabilities in the decision process
-
Redish, A. D., Jensen, S., & Johnson, A. (2008). A unified framework for addiction: Vulnerabilities in the decision process. Behavioral and Brain Sciences, 31, 415-437.
-
(2008)
Behavioral and Brain Sciences
, vol.31
, pp. 415-437
-
-
Redish, A.D.1
Jensen, S.2
Johnson, A.3
-
145
-
-
34548837994
-
Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling
-
Redish, A. D., Jensen, S., Johnson, A., & Kurth-Nelson, Z. (2007). Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling. Psychological Review, 114, 784-805.
-
(2007)
Psychological Review
, vol.114
, pp. 784-805
-
-
Redish, A.D.1
Jensen, S.2
Johnson, A.3
Kurth-Nelson, Z.4
-
146
-
-
0002109138
-
A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
-
A. H. Black & W. F. Prokasy (Eds.), New York, NY: Appleton-Century-Crofts
-
Rescorla, R. A., & Wagner, A. R. (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In A. H. Black & W. F. Prokasy (Eds.), Classical conditioning II: Current research and theory (pp. 64-99). New York, NY: Appleton-Century-Crofts.
-
(1972)
Classical conditioning II: Current research and theory
, pp. 6499
-
-
Rescorla, R.A.1
Wagner, A.R.2
-
147
-
-
33751168257
-
A review of delay-discounting research with humans: Relations to drug use and gambling
-
Reynolds, B. (2006). A review of delay-discounting research with humans: Relations to drug use and gambling. Behavioural Pharmacology, 17, 651-667.
-
(2006)
Behavioural Pharmacology
, vol.17
, pp. 651-667
-
-
Reynolds, B.1
-
148
-
-
0035817882
-
A cellular mechanism of reward-related learning
-
Reynolds, J. N. J., Hyland, B. I., & Wickens, J. R. (2001). A cellular mechanism of reward-related learning. Nature, 413, 67-70.
-
(2001)
Nature
, vol.413
, pp. 67-70
-
-
Reynolds, J.N.J.1
Hyland, B.I.2
Wickens, J.R.3
-
149
-
-
0036592025
-
Dopamine-dependent plasticity of corticostriatal synapses
-
Reynolds, J. N. J., & Wickens, J. R. (2002). Dopamine-dependent plasticity of corticostriatal synapses. Neural Networks, 15, 507-521
-
(2002)
Neural Networks
, vol.15
, pp. 507-521
-
-
Reynolds, J.N.J.1
Wickens, J.R.2
-
150
-
-
79960637995
-
A neural signature of hierarchical reinforcement learning
-
Ribas-Fernandes, J. J. F., Solway, A., Diuk, C., McGuire, J. T., Barto, A. G., Niv, Y., & Botvinick, M. M. (2011). A neural signature of hierarchical reinforcement learning. Neuron, 71, 370-379.
-
(2011)
Neuron
, vol.71
, pp. 370-379
-
-
Ribas-Fernandes, J.J.F.1
Solway, A.2
Diuk, C.3
McGuire, J.T.4
Barto, A.G.5
Niv, Y.6
Botvinick, M.M.7
-
152
-
-
36448968271
-
Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
-
Roesch, M. R., Calu, D. J., & Schoenbaum, G. (2007). Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience, 10, 1615-1624.
-
(2007)
Nature Neuroscience
, vol.10
, pp. 1615-1624
-
-
Roesch, M.R.1
Calu, D.J.2
Schoenbaum, G.3
-
153
-
-
0042380155
-
-
Rolls, E. T., Kringelbach, M. L., & de Araujo, I. E. T. (2003). Different representations of pleasant and unpleasant odours in the human brain. European Journal of Neuroscience, 18, 695-703.
-
(2003)
European Journal of Neuroscience
, vol.18
, pp. 695-703
-
-
Rolls, E.T.1
Kringelbach, M.L.2
de Araujo, I.E.T.3
-
154
-
-
77957728784
-
Testing the reward prediction error hypothesis with an axiomatic model
-
Rutledge, R. B., Dean, M., Caplin, A., & Glimcher, P. W. (2010). Testing the reward prediction error hypothesis with an axiomatic model. The Journal of Neuroscience, 30, 13525-13536.
-
(2010)
The Journal of Neuroscience
, vol.30
, pp. 13525-13536
-
-
Rutledge, R.B.1
Dean, M.2
Caplin, A.3
Glimcher, P.W.4
-
155
-
-
25144449580
-
Representation of immediate and final behavioral goals in the monkey prefrontal cortex during an instructed delay period
-
Saito, N., Mushiake, H., Sakamoto, K., Itoyama, Y., & Tanji, J. (2005). Representation of immediate and final behavioral goals in the monkey prefrontal cortex during an instructed delay period. Cerebral Cortex, 15, 1535-1546.
-
(2005)
Cerebral Cortex
, vol.15
, pp. 1535-1546
-
-
Saito, N.1
Mushiake, H.2
Sakamoto, K.3
Itoyama, Y.4
Tanji, J.5
-
156
-
-
0003297918
-
Some studies in machine learning using the game of checkers
-
In E. A. Feigenbaum & J. Feldman (Eds.) New York, NY: McGraw-Hill. (Reprinted from 1959, IBM Journal of Research and Development, 3, pp. 211-229)
-
Samuel, A. L. (1995). Some studies in machine learning using the game of checkers. In E. A. Feigenbaum & J. Feldman (Eds.), Computers and thought (pp. 71-105). New York, NY: McGraw-Hill. (Reprinted from 1959, IBM Journal of Research and Development, 3, pp. 211-229)
-
(1995)
Computers and thought
, pp. 71-105
-
-
Samuel, A.L.1
-
157
-
-
0242440823
-
Correlated coding of motivation and outcome of decision by dopamine neurons
-
Satoh, T., Nakai, S., Sato, T., & Kimura, M. (2003). Correlated coding of motivation and outcome of decision by dopamine neurons. The Journal of Neuroscience, 23, 9913-9923.
-
(2003)
The Journal of Neuroscience
, vol.23
, pp. 9913-9923
-
-
Satoh, T.1
Nakai, S.2
Sato, T.3
Kimura, M.4
-
158
-
-
34548013298
-
Remembering the past to imagine the future: The prospective brain
-
Schacter, D. L., Addis, D. R., & Buckner, R. L. (2007). Remembering the past to imagine the future: The prospective brain. Nature Reviews Neuroscience, 8, 657-661.
-
(2007)
Nature Reviews Neuroscience
, vol.8
, pp. 657-661
-
-
Schacter, D.L.1
Addis, D.R.2
Buckner, R.L.3
-
159
-
-
0031867046
-
Predictive reward signal of dopamine neurons
-
Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology, 80, 1-27.
-
(1998)
Journal of Neurophysiology
, vol.80
, pp. 1-27
-
-
Schultz, W.1
-
160
-
-
0027468102
-
Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task
-
Schultz, W., Apicella, P., & Ljungberg, T. (1993). Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. The Journal of Neuroscience, 13, 900-913.
-
(1993)
The Journal of Neuroscience
, vol.13
, pp. 900-913
-
-
Schultz, W.1
Apicella, P.2
Ljungberg, T.3
-
161
-
-
0030896968
-
A neural substrate of prediction and reward
-
Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1599.
-
(1997)
Science
, vol.275
, pp. 1593-1599
-
-
Schultz, W.1
Dayan, P.2
Montague, P.R.3
-
162
-
-
43749107069
-
Low-serotonin levels increase delayed reward discounting in humans
-
Schweighofer, N., Bertin, M., Shishida, K., Okamoto, Y., Tanaka, S. C., Yamawaki, S., & Doya, K. (2008). Low-serotonin levels increase delayed reward discounting in humans. The Journal of Neuroscience, 28, 4528-4532.
-
(2008)
The Journal of Neuroscience
, vol.28
, pp. 4528-4532
-
-
Schweighofer, N.1
Bertin, M.2
Shishida, K.3
Okamoto, Y.4
Tanaka, S.C.5
Yamawaki, S.6
Doya, K.7
-
164
-
-
79955709936
-
Neural correlates of forward planning in a spatial decision task in humans
-
Simon, D. A., & Daw, N. D. (2011). Neural correlates of forward planning in a spatial decision task in humans. The Journal of Neuroscience, 31, 5526-5539.
-
(2011)
The Journal of Neuroscience
, vol.31
, pp. 5526-5539
-
-
Simon, D.A.1
Daw, N.D.2
-
165
-
-
84881118309
-
Melioration as rational choice: Sequential decision making in uncertain environments
-
Sims, C. R., Neth, H., Jacobs, R. A., & Gray, W. D. (2013). Melioration as rational choice: Sequential decision making in uncertain environments. Psychological Review, 120, 139-154.
-
(2013)
Psychological Review
, vol.120
, pp. 139-154
-
-
Sims, C.R.1
Neth, H.2
Jacobs, R.A.3
Gray, W.D.4
-
166
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
Singh, S. P., & Sutton, R. S. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22, 123-158.
-
(1996)
Machine Learning
, vol.22
, pp. 123-158
-
-
Singh, S.P.1
Sutton, R.S.2
-
167
-
-
84894466619
-
The behavior of organisms: An experimental analysis
-
Skinner, B. F. (1938). The behavior of organisms: An experimental analysis. Oxford, England: Appleton-Century.
-
(1938)
Oxford, England
, pp. 61-84
-
-
Skinner, B.F.1
-
168
-
-
33646230819
-
Dopamine, prediction error and associative learning: A model-based account
-
Smith, A., Li, M., Becker, S., & Kapur, S. (2006). Dopamine, prediction error and associative learning: A model-based account. Network: Computation in Neural Systems, 17, 61- 84.
-
(2006)
Network: Computation in Neural Systems
, vol.17
, pp. 61-84
-
-
Smith, A.1
Li, M.2
Becker, S.3
Kapur, S.4
-
169
-
-
84859737036
-
Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates
-
Solway, A., & Botvinick, M. M. (2012). Goal-directed decision making as probabilistic inference: A computational framework and potential neural correlates. Psychological Review, 119, 120-154.
-
(2012)
Psychological Review
, vol.119
, pp. 120-154
-
-
Solway, A.1
Botvinick, M.M.2
-
170
-
-
33745078823
-
The order of eliminating blinds in maze learning by the rat
-
Spence, K. W. (1932). The order of eliminating blinds in maze learning by the rat. Journal of Comparative Psychology, 14, 9-27.
-
(1932)
Journal of Comparative Psychology
, vol.14
, pp. 9-27
-
-
Spence, K.W.1
-
172
-
-
33646848669
-
Lost is virtual space: Studies in human and ideal spatial navigation
-
Stankiewicz, B. J., Legge, G. E., Mansfield, J. S., & Schlicht, E. J. (2006). Lost is virtual space: Studies in human and ideal spatial navigation. Journal of Experimental Psychology: Human Perception and Performance, 32, 688-704.
-
(2006)
Journal of Experimental Psychology: Human Perception and Performance
, vol.32
, pp. 688-704
-
-
Stankiewicz, B.J.1
Legge, G.E.2
Mansfield, J.S.3
Schlicht, E.J.4
-
173
-
-
77953152738
-
Conditional routing of information to the cortex: A model of the basal ganglia's role in cognitive coordination
-
Stocco, A., Lebiere, C., & Anderson, J. R. (2010). Conditional routing of information to the cortex: A model of the basal ganglia's role in cognitive coordination. Psychological Review, 117, 541-574.
-
(2010)
Psychological Review
, vol.117
, pp. 541-574
-
-
Stocco, A.1
Lebiere, C.2
Anderson, J.R.3
-
174
-
-
84864680808
-
The cerebellum and cognition: Evidence from functional imaging studies
-
Stoodley, C. J. (2012). The cerebellum and cognition: Evidence from functional imaging studies. The Cerebellum, 11, 352-365.
-
(2012)
The Cerebellum
, vol.11
, pp. 352-365
-
-
Stoodley, C.J.1
-
175
-
-
67651027237
-
Cerebellum and nonmotor function
-
Strick, P. L., Dum, R. P., & Fiez, J. A. (2009). Cerebellum and nonmotor function. Annual Review of Neuroscience, 32, 413-434.
-
(2009)
Annual Review of Neuroscience
, vol.32
, pp. 413-434
-
-
Strick, P.L.1
Dum, R.P.2
Fiez, J.A.3
-
176
-
-
0002995053
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
B. W. Porter & R. J. Mooney (Eds.), San Francisco, CA: Morgan Kaufmann
-
Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In B. W. Porter & R. J. Mooney (Eds.), Proceedings of the seventh international conference on machine learning (pp. 216-224). San Francisco, CA: Morgan Kaufmann.
-
(1990)
Proceedings of the seventh international conference on machine learning
, pp. 216224
-
-
Sutton, R.S.1
-
177
-
-
0003066891
-
Time-derivative models of Pavlovian reinforcement
-
M. Gabriel & J. Moore (Eds.), Cambridge, MA: MIT Press
-
Sutton, R. S., & Barto, A. G. (1990). Time-derivative models of Pavlovian reinforcement. In M. Gabriel & J. Moore (Eds.), Learning and computational neuroscience: Foundations of adaptive networks (pp. 497-537). Cambridge, MA: MIT Press.
-
(1990)
Learning and computational neuroscience: Foundations of adaptive networks
, pp. 497537
-
-
Sutton, R.S.1
Barto, A.G.2
-
179
-
-
51449103900
-
The acquisition of robust and flexible cognitive skills
-
Taatgen, N. A., Huss, D., Dickison, D., & Anderson, J. R. (2008). The acquisition of robust and flexible cognitive skills. Journal of Experimental Psychology: General, 137, 548-565.
-
(2008)
Journal of Experimental Psychology: General
, vol.137
, pp. 548-565
-
-
Taatgen, N.A.1
Huss, D.2
Dickison, D.3
Anderson, J.R.4
-
180
-
-
48549088919
-
Calculating consequences: Brain systems that encode the causal effects of actions
-
Tanaka, S. C., Balleine, B. W., & O'doherty, J. P. (2008). Calculating consequences: Brain systems that encode the causal effects of actions. The Journal of Neuroscience, 28, 6750-6755.
-
(2008)
The Journal of Neuroscience
, vol.28
, pp. 6750-6755
-
-
Tanaka, S.C.1
Balleine, B.W.2
O'doherty, J.P.3
-
181
-
-
72449125194
-
Serotonin affects association of aversive outcomes to past actions
-
Tanaka, S. C., Shishida, K., Schweighofer, N., Okamoto, Y., Yamawaki, S., & Doya, K. (2009). Serotonin affects association of aversive outcomes to past actions. The Journal of Neuroscience, 29, 15669-15674.
-
(2009)
The Journal of Neuroscience
, vol.29
, pp. 15669-15674
-
-
Tanaka, S.C.1
Shishida, K.2
Schweighofer, N.3
Okamoto, Y.4
Yamawaki, S.5
Doya, K.6
-
182
-
-
0003033145
-
A critical review of latent learning and related experiments
-
Thistlethwaite, D. (1951). A critical review of latent learning and related experiments. Psychological Bulletin, 48, 97-129.
-
(1951)
Psychological Bulletin
, vol.48
, pp. 97-129
-
-
Thistlethwaite, D.1
-
183
-
-
0002210775
-
The role of exploration in learning control
-
D. A. White & D. A. Sofge (Eds.), Florence, KY: Van Nostrand Reinhold
-
Thrun, S. B. (1992). The role of exploration in learning control. In D. A. White & D. A. Sofge (Eds.), Handbook of intelligent control: Neural, fuzzy and adaptive approaches (pp. 527-554). Florence, KY: Van Nostrand Reinhold.
-
(1992)
Handbook of intelligent control: Neural, fuzzy and adaptive approaches
, pp. 527554
-
-
Thrun, S.B.1
-
184
-
-
14844349975
-
Adaptive coding of reward value by dopamine neurons
-
Tobler, P. N., Fiorillo, C. D., & Schultz, W. (2005). Adaptive coding of reward value by dopamine neurons. Science, 307, 1642-1645.
-
(2005)
Science
, vol.307
, pp. 1642-1645
-
-
Tobler, P.N.1
Fiorillo, C.D.2
Schultz, W.3
-
185
-
-
33644806981
-
Human neural learning depends on reward prediction errors in the blocking paradigm
-
Tobler, P. N., O'doherty, J. P., Dolan, R. J., & Schultz, W. (2005). Human neural learning depends on reward prediction errors in the blocking paradigm. Journal of Neurophysiology, 95, 301-310.
-
(2005)
Journal of Neurophysiology
, vol.95
, pp. 301-310
-
-
Tobler, P.N.1
O'doherty, J.P.2
Dolan, R.J.3
Schultz, W.4
-
186
-
-
77549088095
-
Learning to use working memory in partially observable environments through dopaminergic reinforcement
-
D. Koller, D. Schuurmans, Y. Bengio, & L. Bottou (Eds.), Cambridge, MA: MIT Press
-
Todd, M. T., Niv, Y., & Cohen, J. D. (2009). Learning to use working memory in partially observable environments through dopaminergic reinforcement. In D. Koller, D. Schuurmans, Y. Bengio, & L. Bottou (Eds.), Advances in neural information processing systems (pp. 1689- 1696). Cambridge, MA: MIT Press.
-
(2009)
Advances in neural information processing systems
-
-
Todd, M.T.1
Niv, Y.2
Cohen, J.D.3
-
188
-
-
33244490051
-
Degrees of hunger, reward and non-reward, and maze learning in rats
-
Tolman, E. C., & Honzik, C. H. (1930). Degrees of hunger, reward and non-reward, and maze learning in rats. University of California Publications in Psychology, 4, 241-256.
-
(1930)
University of California Publications in Psychology
, vol.4
, pp. 241-256
-
-
Tolman, E.C.1
Honzik, C.H.2
-
189
-
-
66449119919
-
A specific role for posterior dorsolateral striatum in human habit learning
-
Tricomi, E., Balleine, B. W., & O'doherty, J. P. (2009). A specific role for posterior dorsolateral striatum in human habit learning. European Journal of Neuroscience, 29, 2225-2232.
-
(2009)
European Journal of Neuroscience
, vol.29
, pp. 2225-2232
-
-
Tricomi, E.1
Balleine, B.W.2
O'doherty, J.P.3
-
190
-
-
1642534402
-
Modulation of caudate activity by action contingency
-
Tricomi, E. M., Delgado, M. R., & Fiez, J. A. (2004). Modulation of caudate activity by action contingency. Neuron, 41, 281-292.
-
(2004)
Neuron
, vol.41
, pp. 281-292
-
-
Tricomi, E.M.1
Delgado, M.R.2
Fiez, J.A.3
-
192
-
-
34247147767
-
Determining the neural substrates of goal-directed learning in the human brain
-
Valentin, V. V., Dickinson, A., & O'doherty, J. P. (2007). Determining the neural substrates of goal-directed learning in the human brain. The Journal of Neuroscience, 27, 4019-4026
-
(2007)
The Journal of Neuroscience
, vol.27
, pp. 4019-4026
-
-
Valentin, V.V.1
Dickinson, A.2
O'doherty, J.P.3
-
194
-
-
0037092472
-
The timing of action-monitoring processes in the anterior cingulate cortex
-
van Veen, V., & Carter, C. S. (2002). The timing of action-monitoring processes in the anterior cingulate cortex. Journal of Cognitive Neuroscience, 14, 593-602.
-
(2002)
Journal of Cognitive Neuroscience
, vol.14
, pp. 593-602
-
-
van Veen, V.1
Carter, C.S.2
-
195
-
-
0035811464
-
Dopamine responses comply with basic assumptions of formal learning theory
-
Waelti, P., Dickinson, A., & Schultz, W. (2001). Dopamine responses comply with basic assumptions of formal learning theory. Nature, 412, 43-48.
-
(2001)
Nature
, vol.412
, pp. 43-48
-
-
Waelti, P.1
Dickinson, A.2
Schultz, W.3
-
196
-
-
63149124215
-
The strategic nature of changing your mind
-
Walsh, M. M., & Anderson, J. R. (2009). The strategic nature of changing your mind. Cognitive Psychology, 58, 416-440.
-
(2009)
Cognitive Psychology
, vol.58
, pp. 416-440
-
-
Walsh, M.M.1
Anderson, J.R.2
-
197
-
-
80051661786
-
Learning from delayed feedback: Neural responses in temporal credit assignment
-
Walsh, M. M., & Anderson, J. R. (2011a). Learning from delayed feedback: Neural responses in temporal credit assignment. Cognitive, Affective & Behavioral Neuroscience, 11, 131-143.
-
(2011)
Cognitive, Affective & Behavioral Neuroscience
, vol.11
, pp. 131-143
-
-
Walsh, M.M.1
Anderson, J.R.2
-
199
-
-
84864813064
-
Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice
-
Walsh, M. M., & Anderson, J. R. (2012). Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice. Neuroscience and Biobehavioral Reviews, 36, 1870-1884.
-
(2012)
Neuroscience and Biobehavioral Reviews
, vol.36
, pp. 1870-1884
-
-
Walsh, M.M.1
Anderson, J.R.2
-
200
-
-
84875959090
-
The importance of action history in decision making and reinforcement learning
-
R. L. Lewis, T. A. Polk, & J. E. Laird (Eds.), Ann Arbor, MI
-
Wang, Y., & Laird, J. E. (2007). The importance of action history in decision making and reinforcement learning. In R. L. Lewis, T. A. Polk, & J. E. Laird (Eds.), Proceedings of the eighth international conference on cognitive modeling (pp. 85-90). Ann Arbor, MI.
-
(2007)
Proceedings of the eighth international conference on cognitive modeling (pp. 85-90)
-
-
Wang, Y.1
Laird, J.E.2
-
201
-
-
0008573205
-
When more means less: Factors affecting human self-control in a local versus global choice paradigm
-
Warry, C. J., Remington, B., & Sonuga-Barke, E. J. S. (1999). When more means less: Factors affecting human self-control in a local versus global choice paradigm. Learning and Motivation, 30, 53-73.
-
(1999)
Learning and Motivation
, vol.30
, pp. 53-73
-
-
Warry, C.J.1
Remington, B.2
Sonuga-Barke, E.J.S.3
-
202
-
-
84860166687
-
Phasic mesolimbic dopamine signaling precedes and predicts performance of a self-initiated action sequence task
-
Wassum, K. M., Ostlund, S. B., & Maidment, N. T. (2012). Phasic mesolimbic dopamine signaling precedes and predicts performance of a self-initiated action sequence task. Biological Psychiatry, 71, 846-854.
-
(2012)
Biological Psychiatry
, vol.71
, pp. 846-854
-
-
Wassum, K.M.1
Ostlund, S.B.2
Maidment, N.T.3
-
203
-
-
0033006462
-
Rule-dependent neuronal activity in the prefrontal cortex. Experimental
-
White, I. M., & Wise, S. P. (1999). Rule-dependent neuronal activity in the prefrontal cortex. Experimental Brain Research, 126, 315-335.
-
(1999)
Brain Research
, vol.126
, pp. 315-335
-
-
White, I.M.1
Wise, S.P.2
-
204
-
-
0029655991
-
Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex in vitro
-
Wickens, J. R., Begg, A. J., & Arbuthnott, G. W. (1996). Dopamine reverses the depression of rat corticostriatal synapses which normally follows high-frequency stimulation of cortex in vitro. Neuroscience, 70, 1-5.
-
(1996)
Neuroscience
, vol.70
, pp. 1-5
-
-
Wickens, J.R.1
Begg, A.J.2
Arbuthnott, G.W.3
-
205
-
-
84860307045
-
Mapping value based planning and extensively trained choice in the human brain
-
Wunderlich, K., Dayan, P., & Dolan, R. J. (2012). Mapping value based planning and extensively trained choice in the human brain. Nature Neuroscience, 15, 786-791.
-
(2012)
Nature Neuroscience
, vol.15
, pp. 786-791
-
-
Wunderlich, K.1
Dayan, P.2
Dolan, R.J.3
-
206
-
-
33644767291
-
Prefrontal brain activity predicts temporally extended decision-making behavior
-
Yarkoni, T., Braver, T. S., Gray, J. R., & Green, L. (2005). Prefrontal brain activity predicts temporally extended decision-making behavior. Journal of the Experimental Analysis of Behavior, 84, 537-554.
-
(2005)
Journal of the Experimental Analysis of Behavior
, vol.84
, pp. 537-554
-
-
Yarkoni, T.1
Braver, T.S.2
Gray, J.R.3
Green, L.4
-
207
-
-
1442274999
-
Melioration and the transition from touch-typing training to everyday use
-
Yechiam, E., Erev, I., Yehene, V., & Gopher, D. (2003). Melioration and the transition from touch-typing training to everyday use. Human Factors, 45, 671-684.
-
(2003)
Human Factors
, vol.45
, pp. 671-684
-
-
Yechiam, E.1
Erev, I.2
Yehene, V.3
Gopher, D.4
-
208
-
-
3042570744
-
The neural basis of error detection: Conflict monitoring and the error-related negativity
-
Yeung, N., Botvinick, M. M., & Cohen, J. D. (2004). The neural basis of error detection: Conflict monitoring and the error-related negativity. Psychological Review, 111, 931-959.
-
(2004)
Psychological Review
, vol.111
, pp. 931-959
-
-
Yeung, N.1
Botvinick, M.M.2
Cohen, J.D.3
-
209
-
-
1642580578
-
Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning
-
Yin, H. H., Knowlton, B. J., & Balleine, B. W. (2004). Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. European Journal of Neuroscience, 19, 181-189.
-
(2004)
European Journal of Neuroscience
, vol.19
, pp. 181-189
-
-
Yin, H.H.1
Knowlton, B.J.2
Balleine, B.W.3
-
210
-
-
33646853495
-
Resolution of uncertainty in prefrontal cortex
-
Yoshida, W., & Ishii, S. (2006). Resolution of uncertainty in prefrontal cortex. Neuron, 50, 781-789.
-
(2006)
Neuron
, vol.50
, pp. 781-789
-
-
Yoshida, W.1
Ishii, S.2
|