-
1
-
-
78651226963
-
Structure learning in human sequential decision-making
-
Acuna D.E., Schrater P. Structure learning in human sequential decision-making. PLoS Comput. Biol. 2010, 6(12):e1001003.
-
(2010)
PLoS Comput. Biol.
, vol.6
, Issue.12
-
-
Acuna, D.E.1
Schrater, P.2
-
2
-
-
84882523833
-
-
Elsevier, Amsterdam
-
Balleine B.W., Daw N., O'Doherty J.P. Multiple Forms of Value Learning and the Function of Dopamine, Neuroeconomics Decision Making and the Brain 2008, Elsevier, Amsterdam, pp. 367-387.
-
(2008)
Multiple Forms of Value Learning and the Function of Dopamine, Neuroeconomics Decision Making and the Brain
, pp. 367-387
-
-
Balleine, B.W.1
Daw, N.2
O'Doherty, J.P.3
-
3
-
-
0035811508
-
Cortical remodelling induced by activity of ventral tegmental dopamine neurons
-
Bao S., Chan V.T., Merzenich M.M. Cortical remodelling induced by activity of ventral tegmental dopamine neurons. Nature 2001, 412(6842):79-83.
-
(2001)
Nature
, vol.412
, Issue.6842
, pp. 79-83
-
-
Bao, S.1
Chan, V.T.2
Merzenich, M.M.3
-
4
-
-
33746618859
-
FMRI investigation of cortical and subcortical networks in the learning of abstract and effector-specific representations of motor sequences
-
Bapi R.S., Miyapuram K.P., Graydon F.X., Doya K. fMRI investigation of cortical and subcortical networks in the learning of abstract and effector-specific representations of motor sequences. NeuroImage 2006, 32(2):714-727.
-
(2006)
NeuroImage
, vol.32
, Issue.2
, pp. 714-727
-
-
Bapi, R.S.1
Miyapuram, K.P.2
Graydon, F.X.3
Doya, K.4
-
5
-
-
0000541213
-
Adaptive critics and the basal ganglia
-
The MIT Press, Cambridge, MA, J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
-
Barto A. Adaptive critics and the basal ganglia. Models of Information Processing in the Basal Ganglia 1994, 12-31. The MIT Press, Cambridge, MA. J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
-
(1994)
Models of Information Processing in the Basal Ganglia
, pp. 12-31
-
-
Barto, A.1
-
6
-
-
21544435722
-
Midbrain dopamine neurons encode a quantitative reward prediction error signal
-
Bayer H., Glimcher P. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 2005, 47(1):129-141.
-
(2005)
Neuron
, vol.47
, Issue.1
, pp. 129-141
-
-
Bayer, H.1
Glimcher, P.2
-
7
-
-
63849268432
-
Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli
-
Brischoux F., Chakraborty S., Brierley D.I., Ungless M.A. Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli. Proc. Natl. Acad. Sci. U.S.A. 2009, 106(12):4894-4899.
-
(2009)
Proc. Natl. Acad. Sci. U.S.A.
, vol.106
, Issue.12
, pp. 4894-4899
-
-
Brischoux, F.1
Chakraborty, S.2
Brierley, D.I.3
Ungless, M.A.4
-
8
-
-
68349115012
-
Midbrain dopamine neurons signal preference for advance information about upcoming rewards
-
Bromberg-Martin E.S., Hikosaka O. Midbrain dopamine neurons signal preference for advance information about upcoming rewards. Neuron 2009, 63(1):119-126.
-
(2009)
Neuron
, vol.63
, Issue.1
, pp. 119-126
-
-
Bromberg-Martin, E.S.1
Hikosaka, O.2
-
9
-
-
77954484934
-
Distinct tonic and phasic anticipatory activity in lateral habenula and dopamine neurons
-
Bromberg-Martin E.S., Matsumoto M., Hikosaka O. Distinct tonic and phasic anticipatory activity in lateral habenula and dopamine neurons. Neuron 2010, 67(1):144-155.
-
(2010)
Neuron
, vol.67
, Issue.1
, pp. 144-155
-
-
Bromberg-Martin, E.S.1
Matsumoto, M.2
Hikosaka, O.3
-
10
-
-
77955458973
-
Multiple timescales of memory in lateral habenula and dopamine neurons
-
Bromberg-Martin E.S., Matsumoto M., Nakahara H., Hikosaka O. Multiple timescales of memory in lateral habenula and dopamine neurons. Neuron 2010, 67(3):499-510.
-
(2010)
Neuron
, vol.67
, Issue.3
, pp. 499-510
-
-
Bromberg-Martin, E.S.1
Matsumoto, M.2
Nakahara, H.3
Hikosaka, O.4
-
11
-
-
84856431209
-
Neuron-type-specific signals for reward and punishment in the ventral tegmental area
-
Cohen J.Y., Haesler S., Vong L., Lowell B.B., Uchida N. Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature 2012, 482(7383):85-88.
-
(2012)
Nature
, vol.482
, Issue.7383
, pp. 85-88
-
-
Cohen, J.Y.1
Haesler, S.2
Vong, L.3
Lowell, B.B.4
Uchida, N.5
-
12
-
-
80052944607
-
A selectionist account of de novo action learning
-
Costa R.M. A selectionist account of de novo action learning. Curr. Opin. Neurobiol. 2011, 21(4):579-586.
-
(2011)
Curr. Opin. Neurobiol.
, vol.21
, Issue.4
, pp. 579-586
-
-
Costa, R.M.1
-
13
-
-
33746365099
-
Bayesian theories of conditioning in a changing world
-
Courville A., Daw N., Touretzky D. Bayesian theories of conditioning in a changing world. Trends Cogn. Sci. 2006, 10(7):294-300.
-
(2006)
Trends Cogn. Sci.
, vol.10
, Issue.7
, pp. 294-300
-
-
Courville, A.1
Daw, N.2
Touretzky, D.3
-
14
-
-
33745787929
-
Representation and timing in theories of the dopamine system
-
Daw N.D., Courville A.C., Touretzky D.S. Representation and timing in theories of the dopamine system. Neural Comput. 2006, 18(7):1637-1677.
-
(2006)
Neural Comput.
, vol.18
, Issue.7
, pp. 1637-1677
-
-
Daw, N.D.1
Courville, A.C.2
Touretzky, D.S.3
-
15
-
-
79952746011
-
Model-based influences on humans' choices and striatal prediction errors
-
Daw N.D., Gershman S.J., Seymour B., Dayan P., Dolan R.J. Model-based influences on humans' choices and striatal prediction errors. Neuron 2011, 69(6):1204-1215.
-
(2011)
Neuron
, vol.69
, Issue.6
, pp. 1204-1215
-
-
Daw, N.D.1
Gershman, S.J.2
Seymour, B.3
Dayan, P.4
Dolan, R.J.5
-
16
-
-
28044450875
-
Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
-
Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 2005, 8(12):1704-1711.
-
(2005)
Nat. Neurosci.
, vol.8
, Issue.12
, pp. 1704-1711
-
-
Daw, N.D.1
Niv, Y.2
Dayan, P.3
-
17
-
-
0001158047
-
Improving generalization for temporal difference learning: the successor representation
-
Dayan P. Improving generalization for temporal difference learning: the successor representation. Neural Comput. 1993, 5:613-624.
-
(1993)
Neural Comput.
, vol.5
, pp. 613-624
-
-
Dayan, P.1
-
18
-
-
52049107354
-
Reinforcement learning: the good, the bad and the ugly
-
Dayan P.G., Niv Y. Reinforcement learning: the good, the bad and the ugly. Curr. Opin. Neurobiol. 2008, 18(2):185-196.
-
(2008)
Curr. Opin. Neurobiol.
, vol.18
, Issue.2
, pp. 185-196
-
-
Dayan, P.G.1
Niv, Y.2
-
19
-
-
0033213819
-
What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
-
Doya K. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?. Neural Netw. 1999, 12(7-8):961-974.
-
(1999)
Neural Netw.
, vol.12
, Issue.7-8
, pp. 961-974
-
-
Doya, K.1
-
20
-
-
34547679813
-
Reinforcement learning: computational theory and biological mechanisms
-
Doya K. Reinforcement learning: computational theory and biological mechanisms. HFSP J. 2007, 1(1):30-40.
-
(2007)
HFSP J.
, vol.1
, Issue.1
, pp. 30-40
-
-
Doya, K.1
-
21
-
-
80053075057
-
Dopamine neurons learn to encode the long-term value of multiple future rewards
-
Enomoto K., Matsumoto N., Nakai S., Satoh T., Sato T.K., Ueda Y., Inokawa H., Haruno M., Kimura M. Dopamine neurons learn to encode the long-term value of multiple future rewards. Proc. Natl. Acad. Sci. U.S.A. 2011, 108(37):15462-15467.
-
(2011)
Proc. Natl. Acad. Sci. U.S.A.
, vol.108
, Issue.37
, pp. 15462-15467
-
-
Enomoto, K.1
Matsumoto, N.2
Nakai, S.3
Satoh, T.4
Sato, T.K.5
Ueda, Y.6
Inokawa, H.7
Haruno, M.8
Kimura, M.9
-
22
-
-
0037459319
-
Discrete coding of reward probability and uncertainty by dopamine neurons
-
Fiorillo C.D., Tobler P.N., Schultz J. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 2003, 299:1898-1902.
-
(2003)
Science
, vol.299
, pp. 1898-1902
-
-
Fiorillo, C.D.1
Tobler, P.N.2
Schultz, J.3
-
23
-
-
77449154520
-
Do substantia nigra dopaminergic neurons differentiate between reward and punishment?
-
Frank M.J., Surmeier D.J. Do substantia nigra dopaminergic neurons differentiate between reward and punishment?. J. Mol. Cell Biol. 2009, 1(1):15-16.
-
(2009)
J. Mol. Cell Biol.
, vol.1
, Issue.1
, pp. 15-16
-
-
Frank, M.J.1
Surmeier, D.J.2
-
24
-
-
84859353418
-
Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats
-
Funamizu A., Ito M., Doya K., Kanzaki R., Takahashi H. Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats. Eur. J. Neurosci. 2012, 35(7):1180-1189.
-
(2012)
Eur. J. Neurosci.
, vol.35
, Issue.7
, pp. 1180-1189
-
-
Funamizu, A.1
Ito, M.2
Doya, K.3
Kanzaki, R.4
Takahashi, H.5
-
25
-
-
84865306110
-
The successor representation and temporal context
-
Gershman S.J., Moore C.D., Todd M.T., Norman K.N., Sederberg P.B. The successor representation and temporal context. Neural Comput. 2012, 24:1-16.
-
(2012)
Neural Comput.
, vol.24
, pp. 1-16
-
-
Gershman, S.J.1
Moore, C.D.2
Todd, M.T.3
Norman, K.N.4
Sederberg, P.B.5
-
26
-
-
77952541839
-
Learning latent structure: carving nature at its joints
-
Gershman S.J., Niv Y. Learning latent structure: carving nature at its joints. Curr. Opin. Neurobiol. 2010, 20(2):251-256.
-
(2010)
Curr. Opin. Neurobiol.
, vol.20
, Issue.2
, pp. 251-256
-
-
Gershman, S.J.1
Niv, Y.2
-
27
-
-
70350521769
-
Human reinforcement learning subdivides structured action spaces by learning effector-specific values
-
Gershman S.J., Pesaran B., Daw N.D. Human reinforcement learning subdivides structured action spaces by learning effector-specific values. J. Neurosci. 2009, 29(43):13524-13531.
-
(2009)
J. Neurosci.
, vol.29
, Issue.43
, pp. 13524-13531
-
-
Gershman, S.J.1
Pesaran, B.2
Daw, N.D.3
-
28
-
-
77953260848
-
States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
-
Gläscher J., Daw N., Dayan P., O'Doherty J.P. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 2010, 66(4):585-595.
-
(2010)
Neuron
, vol.66
, Issue.4
, pp. 585-595
-
-
Gläscher, J.1
Daw, N.2
Dayan, P.3
O'Doherty, J.P.4
-
29
-
-
80053152388
-
Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis
-
Glimcher P.W. Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis. Proc. Natl. Acad. Sci. U.S.A. 2011, 108(Suppl. 3):15647-15654.
-
(2011)
Proc. Natl. Acad. Sci. U.S.A.
, vol.108
, Issue.SUPPL. 3
, pp. 15647-15654
-
-
Glimcher, P.W.1
-
30
-
-
77958005364
-
Alterations in choice behavior by manipulations of world model
-
Green C.S., Benson C., Kersten D., Schrater P. Alterations in choice behavior by manipulations of world model. Proc. Natl. Acad. Sci. U.S.A. 2010, 107(37):16401-16406.
-
(2010)
Proc. Natl. Acad. Sci. U.S.A.
, vol.107
, Issue.37
, pp. 16401-16406
-
-
Green, C.S.1
Benson, C.2
Kersten, D.3
Schrater, P.4
-
31
-
-
33646496765
-
Dopamine modulation in the basal ganglia locks the gate to working memory
-
Gruber A.J., Dayan P., Gutkin B.S., Solla S.A. Dopamine modulation in the basal ganglia locks the gate to working memory. J. Comput. Neurosci. 2006, 20(2):153-166.
-
(2006)
J. Comput. Neurosci.
, vol.20
, Issue.2
, pp. 153-166
-
-
Gruber, A.J.1
Dayan, P.2
Gutkin, B.S.3
Solla, S.A.4
-
32
-
-
77749341538
-
Neurons in anterior cingulate cortex multiplex information about reward and action
-
Hayden B.Y., Heilbronner S., Pearson J., Platt M.L. Neurons in anterior cingulate cortex multiplex information about reward and action. J. Neurosci. 2010, 30(9):3339-3346.
-
(2010)
J. Neurosci.
, vol.30
, Issue.9
, pp. 3339-3346
-
-
Hayden, B.Y.1
Heilbronner, S.2
Pearson, J.3
Platt, M.L.4
-
33
-
-
79959678641
-
Neuronal basis of sequential foraging decisions in a patchy environment
-
Hayden B.Y., Pearson J.M., Platt M.L. Neuronal basis of sequential foraging decisions in a patchy environment. Nat. Neurosci. 2011, 14(7):933-939.
-
(2011)
Nat. Neurosci.
, vol.14
, Issue.7
, pp. 933-939
-
-
Hayden, B.Y.1
Pearson, J.M.2
Platt, M.L.3
-
34
-
-
0033214899
-
Parallel neural networks for learning sequential procedures
-
Hikosaka O., Nakahara H., Rand M.K., Sakai K., Lu X., Nakamura K., Miyachi S., Doya K. Parallel neural networks for learning sequential procedures. Trends Neurosci. 1999, 22(10):464-471.
-
(1999)
Trends Neurosci.
, vol.22
, Issue.10
, pp. 464-471
-
-
Hikosaka, O.1
Nakahara, H.2
Rand, M.K.3
Sakai, K.4
Lu, X.5
Nakamura, K.6
Miyachi, S.7
Doya, K.8
-
36
-
-
0002861883
-
A model of how the basal ganglia generate and use neural signals that predict reinforcement
-
The MIT Press, Cambridge, MA, J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
-
Houk J.C., Adams J.L., Barto A. A model of how the basal ganglia generate and use neural signals that predict reinforcement. Models of Information Processing in the Basal Ganglia 1994, 249-252. The MIT Press, Cambridge, MA. J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
-
(1994)
Models of Information Processing in the Basal Ganglia
, pp. 249-252
-
-
Houk, J.C.1
Adams, J.L.2
Barto, A.3
-
37
-
-
61349171920
-
Encoding of probabilistic rewarding and aversive events by pallidal and nigral neurons
-
Joshua M., Adler A., Rosin B., Vaadia E., Bergman H. Encoding of probabilistic rewarding and aversive events by pallidal and nigral neurons. J. Neurophysiol. 2009, 101(2):758-772.
-
(2009)
J. Neurophysiol.
, vol.101
, Issue.2
, pp. 758-772
-
-
Joshua, M.1
Adler, A.2
Rosin, B.3
Vaadia, E.4
Bergman, H.5
-
38
-
-
0036592029
-
Dopamine: generalization and bonuses
-
Kakade S., Dayan P. Dopamine: generalization and bonuses. Neural Netw. 2002, 15(4-6):549-559.
-
(2002)
Neural Netw.
, vol.15
, Issue.4-6
, pp. 549-559
-
-
Kakade, S.1
Dayan, P.2
-
39
-
-
84859496054
-
Neural mechanisms of foraging
-
Kolling N., Behrens T.E., Mars R.B., Rushworth M.F. Neural mechanisms of foraging. Science 2012, 336(6077):95-98.
-
(2012)
Science
, vol.336
, Issue.6077
, pp. 95-98
-
-
Kolling, N.1
Behrens, T.E.2
Mars, R.B.3
Rushworth, M.F.4
-
40
-
-
40249097514
-
Unique properties of mesoprefrontal neurons within a dual mesocorticolimbic dopamine system
-
Lammel S., Hetzel A., Häckel O., Jones I., Liss B., Roeper J. Unique properties of mesoprefrontal neurons within a dual mesocorticolimbic dopamine system. Neuron 2008, 57(5):760-773.
-
(2008)
Neuron
, vol.57
, Issue.5
, pp. 760-773
-
-
Lammel, S.1
Hetzel, A.2
Häckel, O.3
Jones, I.4
Liss, B.5
Roeper, J.6
-
41
-
-
57349130536
-
Stimulus representation and the timing of reward-prediction errors in models of the dopamine system
-
Ludvig E.A., Sutton R.S., Kehoe E.J. Stimulus representation and the timing of reward-prediction errors in models of the dopamine system. Neural Comput. 2008, 20(12):3034-3054.
-
(2008)
Neural Comput.
, vol.20
, Issue.12
, pp. 3034-3054
-
-
Ludvig, E.A.1
Sutton, R.S.2
Kehoe, E.J.3
-
42
-
-
67349098495
-
Two types of dopamine neuron distinctly convey positive and negative motivational signals
-
Matsumoto M., Hikosaka O. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 2009, 459(7248):837-841.
-
(2009)
Nature
, vol.459
, Issue.7248
, pp. 837-841
-
-
Matsumoto, M.1
Hikosaka, O.2
-
43
-
-
79951823576
-
Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning
-
McDannald M.A., Lucantonio F., Burke K.A., Niv Y., Schoenbaum G. Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning. J. Neurosci. 2011, 31(7):2700-2705.
-
(2011)
J. Neurosci.
, vol.31
, Issue.7
, pp. 2700-2705
-
-
McDannald, M.A.1
Lucantonio, F.2
Burke, K.A.3
Niv, Y.4
Schoenbaum, G.5
-
44
-
-
84859323549
-
Model-based learning and the contribution of the orbitofrontal cortex to the model-free world
-
McDannald M.A., Takahashi Y.K., Lopatina N., Pietras B.W., Jones J.L., Schoenbaum G. Model-based learning and the contribution of the orbitofrontal cortex to the model-free world. Eur. J. Neurosci. 2012, 35(7):991-996.
-
(2012)
Eur. J. Neurosci.
, vol.35
, Issue.7
, pp. 991-996
-
-
McDannald, M.A.1
Takahashi, Y.K.2
Lopatina, N.3
Pietras, B.W.4
Jones, J.L.5
Schoenbaum, G.6
-
45
-
-
0029981543
-
A framework for mesencephalic dopamine systems based on predictive Hebbian learning
-
Montague P., Dayan P., Sejnowski T. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci. 1996, 16(5):1936-1947.
-
(1996)
J. Neurosci.
, vol.16
, Issue.5
, pp. 1936-1947
-
-
Montague, P.1
Dayan, P.2
Sejnowski, T.3
-
46
-
-
7244240565
-
Computational roles for dopamine in behavioural control
-
Montague P.R., Hyman S.E., Cohen J.D. Computational roles for dopamine in behavioural control. Nature 2004, 431(7010):760-767.
-
(2004)
Nature
, vol.431
, Issue.7010
, pp. 760-767
-
-
Montague, P.R.1
Hyman, S.E.2
Cohen, J.D.3
-
48
-
-
33747585633
-
Midbrain dopamine neurons encode decisions for future action
-
Morris G., Nevet A., Arkadir D., Vaadia E., Bergman H. Midbrain dopamine neurons encode decisions for future action. Nat. Neurosci. 2006, 9(8):1057-1063.
-
(2006)
Nat. Neurosci.
, vol.9
, Issue.8
, pp. 1057-1063
-
-
Morris, G.1
Nevet, A.2
Arkadir, D.3
Vaadia, E.4
Bergman, H.5
-
49
-
-
0035399093
-
Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences - a computational approach
-
Nakahara H., Doya K., Hikosaka O. Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences - a computational approach. J. Cogn. Neurosci. 2001, 13(5):626-647.
-
(2001)
J. Cogn. Neurosci.
, vol.13
, Issue.5
, pp. 626-647
-
-
Nakahara, H.1
Doya, K.2
Hikosaka, O.3
-
50
-
-
1642575165
-
Dopamine neurons can represent context-dependent prediction error
-
Nakahara H., Itoh H., Kawagoe R., Takikawa Y., Hikosaka O. Dopamine neurons can represent context-dependent prediction error. Neuron 2004, 41:269-280.
-
(2004)
Neuron
, vol.41
, pp. 269-280
-
-
Nakahara, H.1
Itoh, H.2
Kawagoe, R.3
Takikawa, Y.4
Hikosaka, O.5
-
51
-
-
78649667137
-
Internal-time temporal difference model for neural value-based decision making
-
Nakahara H., Kaveri S. Internal-time temporal difference model for neural value-based decision making. Neural Comput. 2010, 22(12):3062-3106.
-
(2010)
Neural Comput.
, vol.22
, Issue.12
, pp. 3062-3106
-
-
Nakahara, H.1
Kaveri, S.2
-
52
-
-
77956209239
-
Temporally extended dopamine responses to perceptually demanding reward-predictive stimuli
-
Nomoto K., Schultz W., Watanabe T., Sakagami M. Temporally extended dopamine responses to perceptually demanding reward-predictive stimuli. J. Neurosci. 2010, 30(32):10692-10702.
-
(2010)
J. Neurosci.
, vol.30
, Issue.32
, pp. 10692-10702
-
-
Nomoto, K.1
Schultz, W.2
Watanabe, T.3
Sakagami, M.4
-
53
-
-
70350558451
-
Brain hemispheres selectively track the expected value of contralateral options
-
Palminteri S., Boraud T., Lafargue G., Dubois B., Pessiglione M. Brain hemispheres selectively track the expected value of contralateral options. J. Neurosci. 2009, 29(43):13465-13472.
-
(2009)
J. Neurosci.
, vol.29
, Issue.43
, pp. 13465-13472
-
-
Palminteri, S.1
Boraud, T.2
Lafargue, G.3
Dubois, B.4
Pessiglione, M.5
-
54
-
-
34547982545
-
-
Analyzing feature generation for value-function approximation. In: ICML-07, Oregon, USA
-
Parr, R., Painter-Wakefield, C., Li, L., Littman, M., 2007. Analyzing feature generation for value-function approximation. In: ICML-07, Oregon, USA, pp.737-744.
-
(2007)
, pp. 737-744
-
-
Parr, R.1
Painter-Wakefield, C.2
Li, L.3
Littman, M.4
-
55
-
-
45749098894
-
A framework for studying the neurobiology of value-based decision making
-
Rangel A., Camerer C., Montague P.R. A framework for studying the neurobiology of value-based decision making. Nat. Rev. Neurosci. 2008, 9(7):545-556.
-
(2008)
Nat. Rev. Neurosci.
, vol.9
, Issue.7
, pp. 545-556
-
-
Rangel, A.1
Camerer, C.2
Montague, P.R.3
-
56
-
-
79960241771
-
Decision making under uncertainty: a neural model based on partially observable markov decision processes
-
Rao R.P. Decision making under uncertainty: a neural model based on partially observable markov decision processes. Front. Comput. Neurosci. 2010, 4:146.
-
(2010)
Front. Comput. Neurosci.
, vol.4
, pp. 146
-
-
Rao, R.P.1
-
57
-
-
33751184634
-
The short-latency dopamine signal: a role in discovering novel actions?
-
Redgrave P., Gurney K. The short-latency dopamine signal: a role in discovering novel actions?. Nat. Rev. Neurosci. 2006, 7(12):967-975.
-
(2006)
Nat. Rev. Neurosci.
, vol.7
, Issue.12
, pp. 967-975
-
-
Redgrave, P.1
Gurney, K.2
-
58
-
-
34548837994
-
Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling
-
Redish A.D., Jensen S., Johnson A., Kurth-Nelson Z. Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. Psychol. Rev. 2007, 114(3):784-805.
-
(2007)
Psychol. Rev.
, vol.114
, Issue.3
, pp. 784-805
-
-
Redish, A.D.1
Jensen, S.2
Johnson, A.3
Kurth-Nelson, Z.4
-
59
-
-
79953798456
-
Cortical map plasticity improves learning but is not necessary for improved performance
-
Reed A., Riley J., Carraway R., Carrasco A., Perez C., Jakkamsetti V., Kilgard M.P. Cortical map plasticity improves learning but is not necessary for improved performance. Neuron 2011, 70(1):121-131.
-
(2011)
Neuron
, vol.70
, Issue.1
, pp. 121-131
-
-
Reed, A.1
Riley, J.2
Carraway, R.3
Carrasco, A.4
Perez, C.5
Jakkamsetti, V.6
Kilgard, M.P.7
-
60
-
-
0036592025
-
Dopamine-dependent plasticity of corticostriatal synapses
-
Reynolds J.N., Wickens J.R. Dopamine-dependent plasticity of corticostriatal synapses. Neural Netw. 2002, 15(4-6):507-521.
-
(2002)
Neural Netw.
, vol.15
, Issue.4-6
, pp. 507-521
-
-
Reynolds, J.N.1
Wickens, J.R.2
-
61
-
-
79960637995
-
A neural signature of hierarchical reinforcement learning
-
Ribas-Fernandes J.J.F., Solway A., Diuk C., McGuire J.T., Barto A.G., Niv Y., Botvinick M.M. A neural signature of hierarchical reinforcement learning. Neuron 2011, 71(2):370-379.
-
(2011)
Neuron
, vol.71
, Issue.2
, pp. 370-379
-
-
Ribas-Fernandes, J.J.F.1
Solway, A.2
Diuk, C.3
McGuire, J.T.4
Barto, A.G.5
Niv, Y.6
Botvinick, M.M.7
-
62
-
-
77249084637
-
Neural correlates of variations in event processing during learning in basolateral amygdala
-
Roesch M.R., Calu D.J., Esber G.R., Schoenbaum G. Neural correlates of variations in event processing during learning in basolateral amygdala. J. Neurosci. 2010, 30(7):2464-2471.
-
(2010)
J. Neurosci.
, vol.30
, Issue.7
, pp. 2464-2471
-
-
Roesch, M.R.1
Calu, D.J.2
Esber, G.R.3
Schoenbaum, G.4
-
63
-
-
36448968271
-
Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
-
Roesch M.R., Calu D.J., Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nat. Neurosci. 2007, 10(12):1615-1624.
-
(2007)
Nat. Neurosci.
, vol.10
, Issue.12
, pp. 1615-1624
-
-
Roesch, M.R.1
Calu, D.J.2
Schoenbaum, G.3
-
64
-
-
84878178249
-
Valuation and decision-making in frontal cortex: one or many serial or parallel systems?
-
(Epub ahead of print)
-
Rushworth M.F., Kolling N., Sallet J., Mars R.B. Valuation and decision-making in frontal cortex: one or many serial or parallel systems?. Curr. Opin. Neurobiol. 2012, (Epub ahead of print).
-
(2012)
Curr. Opin. Neurobiol.
-
-
Rushworth, M.F.1
Kolling, N.2
Sallet, J.3
Mars, R.B.4
-
65
-
-
0242440823
-
Correlated coding of motivation and outcome of decision by dopamine neurons
-
Satoh T., Nakai S., Sato T., Kimura M. Correlated coding of motivation and outcome of decision by dopamine neurons. J. Neurosci. 2003, 23(30):9913-9923.
-
(2003)
J. Neurosci.
, vol.23
, Issue.30
, pp. 9913-9923
-
-
Satoh, T.1
Nakai, S.2
Sato, T.3
Kimura, M.4
-
66
-
-
0031867046
-
Predictive reward signal of dopamine neurons
-
Schultz W. Predictive reward signal of dopamine neurons. J. Neurophysiol. 1998, 80:1-27.
-
(1998)
J. Neurophysiol.
, vol.80
, pp. 1-27
-
-
Schultz, W.1
-
67
-
-
0030896968
-
A neural substrate of prediction and reward
-
Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science 1997, 275(5306):1593-1599.
-
(1997)
Science
, vol.275
, Issue.5306
, pp. 1593-1599
-
-
Schultz, W.1
Dayan, P.2
Montague, P.R.3
-
68
-
-
34147151266
-
A common framework for perceptual learning
-
Seitz A.R., Dinse H.R. A common framework for perceptual learning. Curr. Opin. Neurobiol. 2007, 17(2):148-153.
-
(2007)
Curr. Opin. Neurobiol.
, vol.17
, Issue.2
, pp. 148-153
-
-
Seitz, A.R.1
Dinse, H.R.2
-
71
-
-
0003066891
-
Time-derivative models of pavlovian reinforcement
-
The MIT Press, Cambridge, MA, M. Gabriel, J. Moore (Eds.)
-
Sutton R.S., Barto A.G. Time-derivative models of pavlovian reinforcement. Learning and Computational Neuroscience: Foundations of Adaptive Networks 1990, 497-537. The MIT Press, Cambridge, MA. M. Gabriel, J. Moore (Eds.).
-
(1990)
Learning and Computational Neuroscience: Foundations of Adaptive Networks
, pp. 497-537
-
-
Sutton, R.S.1
Barto, A.G.2
-
72
-
-
71149099079
-
Fast gradient-descent methods for temporal-difference learning with linear function approximation
-
Sutton R.S., Maei H.R., Precup D., Bhatnagar S., Silver D., Szepesvari C., Wiewiora E. Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML-09 2009, 993-1000.
-
(2009)
ICML-09
, pp. 993-1000
-
-
Sutton, R.S.1
Maei, H.R.2
Precup, D.3
Bhatnagar, S.4
Silver, D.5
Szepesvari, C.6
Wiewiora, E.7
-
73
-
-
84899464022
-
Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction
-
Sutton R.S., Modayil J., Delp M., Degris T., Pilarski P.M., White A. Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction. AAMAS'11 The 10th International Conference on Autonomous Agents and Multiagent Systems - vol. 2 2011, 761-768.
-
(2011)
AAMAS'11 The 10th International Conference on Autonomous Agents and Multiagent Systems - vol. 2
, pp. 761-768
-
-
Sutton, R.S.1
Modayil, J.2
Delp, M.3
Degris, T.4
Pilarski, P.M.5
White, A.6
-
74
-
-
84862673530
-
Learning to simulate others' decisions
-
Suzuki S., Harasawa N., Ueno K., Gardner J.L., Ichinohe N., Haruno M., Cheng K., Nakahara H. Learning to simulate others' decisions. Neuron 2012, 74:1125-1137.
-
(2012)
Neuron
, vol.74
, pp. 1125-1137
-
-
Suzuki, S.1
Harasawa, N.2
Ueno, K.3
Gardner, J.L.4
Ichinohe, N.5
Haruno, M.6
Cheng, K.7
Nakahara, H.8
-
75
-
-
84871936763
-
Learning to use working memory in partially observable environments through dopaminergic reinforcement
-
Todd M., Niv Y., Cohen J.D. Learning to use working memory in partially observable environments through dopaminergic reinforcement. Advances in Neural Information Processing Systems (NIPS) 2009, 21:1-8.
-
(2009)
Advances in Neural Information Processing Systems (NIPS)
, vol.21
, pp. 1-8
-
-
Todd, M.1
Niv, Y.2
Cohen, J.D.3
-
76
-
-
78751687449
-
The neural basis of intuitive best next-move generation in board game experts
-
Wan X., Nakatani H., Ueno K., Asamizuya T., Cheng K., Tanaka K. The neural basis of intuitive best next-move generation in board game experts. Science 2011, 331(6015):341-346.
-
(2011)
Science
, vol.331
, Issue.6015
, pp. 341-346
-
-
Wan, X.1
Nakatani, H.2
Ueno, K.3
Asamizuya, T.4
Cheng, K.5
Tanaka, K.6
-
77
-
-
84860307045
-
Mapping value based planning and extensively trained choice in the human brain
-
Wunderlich K., Dayan P., Dolan R.J. Mapping value based planning and extensively trained choice in the human brain. Nat. Neurosci. 2012, 1-19.
-
(2012)
Nat. Neurosci.
, pp. 1-19
-
-
Wunderlich, K.1
Dayan, P.2
Dolan, R.J.3
-
78
-
-
80055117878
-
Prediction error associated with the perceptual segmentation of naturalistic events
-
Zacks J.M., Kurby C.A., Eisenberg M.L., Haroutunian N. Prediction error associated with the perceptual segmentation of naturalistic events. J. Cogn. Neurosci. 2011, 23(12):4057-4066.
-
(2011)
J. Cogn. Neurosci.
, vol.23
, Issue.12
, pp. 4057-4066
-
-
Zacks, J.M.1
Kurby, C.A.2
Eisenberg, M.L.3
Haroutunian, N.4
|