SCOPUS Information Retrieval Platform

Neuroscience Research

Volume 74, Issue 3-4, 2012, Pages 177-183

Learning to represent reward structure: A key to adapting to complex environments

(2) Nakahara, Hiroyuki a Hikosaka, Okihide b

a RIKEN BRAIN SCIENCE INSTITUTE (Japan)

b NATIONAL EYE INSTITUTE (United States)

Author keywords

Decision; Dopamine; Reinforcement learning; Reward; Salience; Structure; Value

Indexed keywords

DOPAMINE;

ARTICLE; ASSOCIATION; DOPAMINERGIC ACTIVITY; LEARNING; PREDICTION; PRIORITY JOURNAL; REINFORCEMENT; REWARD;

ANIMALS; BRAIN; DOPAMINE; HUMANS; LEARNING; MODELS, NEUROLOGICAL; REINFORCEMENT (PSYCHOLOGY); REWARD;

EID: 84871931160 PISSN: 01680102 EISSN: 18728111 Source Type: Journal
DOI: 10.1016/j.neures.2012.09.007 Document Type: Article

Times cited : (20)

References (78)

1
- 78651226963
- Structure learning in human sequential decision-making
- Acuna D.E., Schrater P. Structure learning in human sequential decision-making. PLoS Comput. Biol. 2010, 6(12):e1001003.
- (2010) PLoS Comput. Biol. , vol.6 , Issue.12
- Acuna, D.E.¹ Schrater, P.²

2
- 84882523833
- Elsevier, Amsterdam
- Balleine B.W., Daw N., O'Doherty J.P. Multiple Forms of Value Learning and the Function of Dopamine, Neuroeconomics Decision Making and the Brain 2008, Elsevier, Amsterdam, pp. 367-387.
- (2008) Multiple Forms of Value Learning and the Function of Dopamine, Neuroeconomics Decision Making and the Brain , pp. 367-387
- Balleine, B.W.¹ Daw, N.² O'Doherty, J.P.³

3
- 0035811508
- Cortical remodelling induced by activity of ventral tegmental dopamine neurons
- Bao S., Chan V.T., Merzenich M.M. Cortical remodelling induced by activity of ventral tegmental dopamine neurons. Nature 2001, 412(6842):79-83.
- (2001) Nature , vol.412 , Issue.6842 , pp. 79-83
- Bao, S.¹ Chan, V.T.² Merzenich, M.M.³

4
- 33746618859
- fMRI investigation of cortical and subcortical networks in the learning of abstract and effector-specific representations of motor sequences
- Bapi R.S., Miyapuram K.P., Graydon F.X., Doya K. fMRI investigation of cortical and subcortical networks in the learning of abstract and effector-specific representations of motor sequences. NeuroImage 2006, 32(2):714-727.
- (2006) NeuroImage , vol.32 , Issue.2 , pp. 714-727
- Bapi, R.S.¹ Miyapuram, K.P.² Graydon, F.X.³ Doya, K.⁴

5
- 0000541213
- Adaptive critics and the basal ganglia
- The MIT Press, Cambridge, MA, J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
- Barto A. Adaptive critics and the basal ganglia. Models of Information Processing in the Basal Ganglia 1994, 12-31. The MIT Press, Cambridge, MA. J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
- (1994) Models of Information Processing in the Basal Ganglia , pp. 12-31
- Barto, A.¹

6
- 21544435722
- Midbrain dopamine neurons encode a quantitative reward prediction error signal
- Bayer H., Glimcher P. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 2005, 47(1):129-141.
- (2005) Neuron , vol.47 , Issue.1 , pp. 129-141
- Bayer, H.¹ Glimcher, P.²

7
- 63849268432
- Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli
- Brischoux F., Chakraborty S., Brierley D.I., Ungless M.A. Phasic excitation of dopamine neurons in ventral VTA by noxious stimuli. Proc. Natl. Acad. Sci. U.S.A. 2009, 106(12):4894-4899.
- (2009) Proc. Natl. Acad. Sci. U.S.A. , vol.106 , Issue.12 , pp. 4894-4899
- Brischoux, F.¹ Chakraborty, S.² Brierley, D.I.³ Ungless, M.A.⁴

8
- 68349115012
- Midbrain dopamine neurons signal preference for advance information about upcoming rewards
- Bromberg-Martin E.S., Hikosaka O. Midbrain dopamine neurons signal preference for advance information about upcoming rewards. Neuron 2009, 63(1):119-126.
- (2009) Neuron , vol.63 , Issue.1 , pp. 119-126
- Bromberg-Martin, E.S.¹ Hikosaka, O.²

9
- 77954484934
- Distinct tonic and phasic anticipatory activity in lateral habenula and dopamine neurons
- Bromberg-Martin E.S., Matsumoto M., Hikosaka O. Distinct tonic and phasic anticipatory activity in lateral habenula and dopamine neurons. Neuron 2010, 67(1):144-155.
- (2010) Neuron , vol.67 , Issue.1 , pp. 144-155
- Bromberg-Martin, E.S.¹ Matsumoto, M.² Hikosaka, O.³

10
- 77955458973
- Multiple timescales of memory in lateral habenula and dopamine neurons
- Bromberg-Martin E.S., Matsumoto M., Nakahara H., Hikosaka O. Multiple timescales of memory in lateral habenula and dopamine neurons. Neuron 2010, 67(3):499-510.
- (2010) Neuron , vol.67 , Issue.3 , pp. 499-510
- Bromberg-Martin, E.S.¹ Matsumoto, M.² Nakahara, H.³ Hikosaka, O.⁴

11
- 84856431209
- Neuron-type-specific signals for reward and punishment in the ventral tegmental area
- Cohen J.Y., Haesler S., Vong L., Lowell B.B., Uchida N. Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature 2012, 482(7383):85-88.
- (2012) Nature , vol.482 , Issue.7383 , pp. 85-88
- Cohen, J.Y.¹ Haesler, S.² Vong, L.³ Lowell, B.B.⁴ Uchida, N.⁵

12
- 80052944607
- A selectionist account of de novo action learning
- Costa R.M. A selectionist account of de novo action learning. Curr. Opin. Neurobiol. 2011, 21(4):579-586.
- (2011) Curr. Opin. Neurobiol. , vol.21 , Issue.4 , pp. 579-586
- Costa, R.M.¹

13
- 33746365099
- Bayesian theories of conditioning in a changing world
- Courville A., Daw N., Touretzky D. Bayesian theories of conditioning in a changing world. Trends Cogn. Sci. 2006, 10(7):294-300.
- (2006) Trends Cogn. Sci. , vol.10 , Issue.7 , pp. 294-300
- Courville, A.¹ Daw, N.² Touretzky, D.³

14
- 33745787929
- Representation and timing in theories of the dopamine system
- Daw N.D., Courville A.C., Touretzky D.S. Representation and timing in theories of the dopamine system. Neural Comput. 2006, 18(7):1637-1677.
- (2006) Neural Comput. , vol.18 , Issue.7 , pp. 1637-1677
- Daw, N.D.¹ Courville, A.C.² Touretzky, D.S.³

15
- 79952746011
- Model-based influences on humans' choices and striatal prediction errors
- Daw N.D., Gershman S.J., Seymour B., Dayan P., Dolan R.J. Model-based influences on humans' choices and striatal prediction errors. Neuron 2011, 69(6):1204-1215.
- (2011) Neuron , vol.69 , Issue.6 , pp. 1204-1215
- Daw, N.D.¹ Gershman, S.J.² Seymour, B.³ Dayan, P.⁴ Dolan, R.J.⁵

16
- 28044450875
- Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
- Daw N.D., Niv Y., Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 2005, 8(12):1704-1711.
- (2005) Nat. Neurosci. , vol.8 , Issue.12 , pp. 1704-1711
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

17
- 0001158047
- Improving generalization for temporal difference learning: the successor representation
- Dayan P. Improving generalization for temporal difference learning: the successor representation. Neural Comput. 1993, 5:613-624.
- (1993) Neural Comput. , vol.5 , pp. 613-624
- Dayan, P.¹

18
- 52049107354
- Reinforcement learning: the good, the bad and the ugly
- Dayan P.G., Niv Y. Reinforcement learning: the good, the bad and the ugly. Curr. Opin. Neurobiol. 2008, 18(2):185-196.
- (2008) Curr. Opin. Neurobiol. , vol.18 , Issue.2 , pp. 185-196
- Dayan, P.G.¹ Niv, Y.²

19
- 0033213819
- What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?
- Doya K. What are the computations of the cerebellum, the basal ganglia and the cerebral cortex?. Neural Netw. 1999, 12(7-8):961-974.
- (1999) Neural Netw. , vol.12 , Issue.7-8 , pp. 961-974
- Doya, K.¹

20
- 34547679813
- Reinforcement learning: computational theory and biological mechanisms
- Doya K. Reinforcement learning: computational theory and biological mechanisms. HFSP J. 2007, 1(1):30-40.
- (2007) HFSP J. , vol.1 , Issue.1 , pp. 30-40
- Doya, K.¹

21
- 80053075057
- Dopamine neurons learn to encode the long-term value of multiple future rewards
- Enomoto K., Matsumoto N., Nakai S., Satoh T., Sato T.K., Ueda Y., Inokawa H., Haruno M., Kimura M. Dopamine neurons learn to encode the long-term value of multiple future rewards. Proc. Natl. Acad. Sci. U.S.A. 2011, 108(37):15462-15467.
- (2011) Proc. Natl. Acad. Sci. U.S.A. , vol.108 , Issue.37 , pp. 15462-15467
- Enomoto, K.¹ Matsumoto, N.² Nakai, S.³ Satoh, T.⁴ Sato, T.K.⁵ Ueda, Y.⁶ Inokawa, H.⁷ Haruno, M.⁸ Kimura, M.⁹

22
- 0037459319
- Discrete coding of reward probability and uncertainty by dopamine neurons
- Fiorillo C.D., Tobler P.N., Schultz W. Discrete coding of reward probability and uncertainty by dopamine neurons. Science 2003, 299:1898-1902.
- (2003) Science , vol.299 , pp. 1898-1902
- Fiorillo, C.D.¹ Tobler, P.N.² Schultz, W.³

23
- 77449154520
- Do substantia nigra dopaminergic neurons differentiate between reward and punishment?
- Frank M.J., Surmeier D.J. Do substantia nigra dopaminergic neurons differentiate between reward and punishment?. J. Mol. Cell Biol. 2009, 1(1):15-16.
- (2009) J. Mol. Cell Biol. , vol.1 , Issue.1 , pp. 15-16
- Frank, M.J.¹ Surmeier, D.J.²

24
- 84859353418
- Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats
- Funamizu A., Ito M., Doya K., Kanzaki R., Takahashi H. Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats. Eur. J. Neurosci. 2012, 35(7):1180-1189.
- (2012) Eur. J. Neurosci. , vol.35 , Issue.7 , pp. 1180-1189
- Funamizu, A.¹ Ito, M.² Doya, K.³ Kanzaki, R.⁴ Takahashi, H.⁵

25
- 84865306110
- The successor representation and temporal context
- Gershman S.J., Moore C.D., Todd M.T., Norman K.A., Sederberg P.B. The successor representation and temporal context. Neural Comput. 2012, 24:1-16.
- (2012) Neural Comput. , vol.24 , pp. 1-16
- Gershman, S.J.¹ Moore, C.D.² Todd, M.T.³ Norman, K.A.⁴ Sederberg, P.B.⁵

26
- 77952541839
- Learning latent structure: carving nature at its joints
- Gershman S.J., Niv Y. Learning latent structure: carving nature at its joints. Curr. Opin. Neurobiol. 2010, 20(2):251-256.
- (2010) Curr. Opin. Neurobiol. , vol.20 , Issue.2 , pp. 251-256
- Gershman, S.J.¹ Niv, Y.²

27
- 70350521769
- Human reinforcement learning subdivides structured action spaces by learning effector-specific values
- Gershman S.J., Pesaran B., Daw N.D. Human reinforcement learning subdivides structured action spaces by learning effector-specific values. J. Neurosci. 2009, 29(43):13524-13531.
- (2009) J. Neurosci. , vol.29 , Issue.43 , pp. 13524-13531
- Gershman, S.J.¹ Pesaran, B.² Daw, N.D.³

28
- 77953260848
- States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning
- Gläscher J., Daw N., Dayan P., O'Doherty J.P. States versus rewards: dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 2010, 66(4):585-595.
- (2010) Neuron , vol.66 , Issue.4 , pp. 585-595
- Gläscher, J.¹ Daw, N.² Dayan, P.³ O'Doherty, J.P.⁴

29
- 80053152388
- Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis
- Glimcher P.W. Understanding dopamine and reinforcement learning: the dopamine reward prediction error hypothesis. Proc. Natl. Acad. Sci. U.S.A. 2011, 108(Suppl. 3):15647-15654.
- (2011) Proc. Natl. Acad. Sci. U.S.A. , vol.108 , Issue.SUPPL. 3 , pp. 15647-15654
- Glimcher, P.W.¹

30
- 77958005364
- Alterations in choice behavior by manipulations of world model
- Green C.S., Benson C., Kersten D., Schrater P. Alterations in choice behavior by manipulations of world model. Proc. Natl. Acad. Sci. U.S.A. 2010, 107(37):16401-16406.
- (2010) Proc. Natl. Acad. Sci. U.S.A. , vol.107 , Issue.37 , pp. 16401-16406
- Green, C.S.¹ Benson, C.² Kersten, D.³ Schrater, P.⁴

31
- 33646496765
- Dopamine modulation in the basal ganglia locks the gate to working memory
- Gruber A.J., Dayan P., Gutkin B.S., Solla S.A. Dopamine modulation in the basal ganglia locks the gate to working memory. J. Comput. Neurosci. 2006, 20(2):153-166.
- (2006) J. Comput. Neurosci. , vol.20 , Issue.2 , pp. 153-166
- Gruber, A.J.¹ Dayan, P.² Gutkin, B.S.³ Solla, S.A.⁴

32
- 77749341538
- Neurons in anterior cingulate cortex multiplex information about reward and action
- Hayden B.Y., Heilbronner S., Pearson J., Platt M.L. Neurons in anterior cingulate cortex multiplex information about reward and action. J. Neurosci. 2010, 30(9):3339-3346.
- (2010) J. Neurosci. , vol.30 , Issue.9 , pp. 3339-3346
- Hayden, B.Y.¹ Heilbronner, S.² Pearson, J.³ Platt, M.L.⁴

33
- 79959678641
- Neuronal basis of sequential foraging decisions in a patchy environment
- Hayden B.Y., Pearson J.M., Platt M.L. Neuronal basis of sequential foraging decisions in a patchy environment. Nat. Neurosci. 2011, 14(7):933-939.
- (2011) Nat. Neurosci. , vol.14 , Issue.7 , pp. 933-939
- Hayden, B.Y.¹ Pearson, J.M.² Platt, M.L.³

34
- 0033214899
- Parallel neural networks for learning sequential procedures
- Hikosaka O., Nakahara H., Rand M.K., Sakai K., Lu X., Nakamura K., Miyachi S., Doya K. Parallel neural networks for learning sequential procedures. Trends Neurosci. 1999, 22(10):464-471.
- (1999) Trends Neurosci. , vol.22 , Issue.10 , pp. 464-471
- Hikosaka, O.¹ Nakahara, H.² Rand, M.K.³ Sakai, K.⁴ Lu, X.⁵ Nakamura, K.⁶ Miyachi, S.⁷ Doya, K.⁸

35
- 33644865763
- Basal ganglia orient eyes to reward
- Hikosaka O., Nakamura K., Nakahara H. Basal ganglia orient eyes to reward. J. Neurophysiol. 2006, 95(2):567-584.
- (2006) J. Neurophysiol. , vol.95 , Issue.2 , pp. 567-584
- Hikosaka, O.¹ Nakamura, K.² Nakahara, H.³

36
- 0002861883
- A model of how the basal ganglia generate and use neural signals that predict reinforcement
- The MIT Press, Cambridge, MA, J.C. Houk, J.L. Davis, D.G. Beiser (Eds.)
- Houk J.C., Adams J.L., Barto A. A model of how the basal ganglia generate and use neural signals that predict reinforcement. Models of Information Processing in the Basal Ganglia 1994, 249-252. The MIT Press, Cambridge, MA. J.C. Houk, J.L. Davis, D.G. Beiser (Eds.).
- (1994) Models of Information Processing in the Basal Ganglia , pp. 249-252
- Houk, J.C.¹ Adams, J.L.² Barto, A.³

37
- 61349171920
- Encoding of probabilistic rewarding and aversive events by pallidal and nigral neurons
- Joshua M., Adler A., Rosin B., Vaadia E., Bergman H. Encoding of probabilistic rewarding and aversive events by pallidal and nigral neurons. J. Neurophysiol. 2009, 101(2):758-772.
- (2009) J. Neurophysiol. , vol.101 , Issue.2 , pp. 758-772
- Joshua, M.¹ Adler, A.² Rosin, B.³ Vaadia, E.⁴ Bergman, H.⁵

38
- 0036592029
- Dopamine: generalization and bonuses
- Kakade S., Dayan P. Dopamine: generalization and bonuses. Neural Netw. 2002, 15(4-6):549-559.
- (2002) Neural Netw. , vol.15 , Issue.4-6 , pp. 549-559
- Kakade, S.¹ Dayan, P.²

39
- 84859496054
- Neural mechanisms of foraging
- Kolling N., Behrens T.E., Mars R.B., Rushworth M.F. Neural mechanisms of foraging. Science 2012, 336(6077):95-98.
- (2012) Science , vol.336 , Issue.6077 , pp. 95-98
- Kolling, N.¹ Behrens, T.E.² Mars, R.B.³ Rushworth, M.F.⁴

40
- 40249097514
- Unique properties of mesoprefrontal neurons within a dual mesocorticolimbic dopamine system
- Lammel S., Hetzel A., Häckel O., Jones I., Liss B., Roeper J. Unique properties of mesoprefrontal neurons within a dual mesocorticolimbic dopamine system. Neuron 2008, 57(5):760-773.
- (2008) Neuron , vol.57 , Issue.5 , pp. 760-773
- Lammel, S.¹ Hetzel, A.² Häckel, O.³ Jones, I.⁴ Liss, B.⁵ Roeper, J.⁶

41
- 57349130536
- Stimulus representation and the timing of reward-prediction errors in models of the dopamine system
- Ludvig E.A., Sutton R.S., Kehoe E.J. Stimulus representation and the timing of reward-prediction errors in models of the dopamine system. Neural Comput. 2008, 20(12):3034-3054.
- (2008) Neural Comput. , vol.20 , Issue.12 , pp. 3034-3054
- Ludvig, E.A.¹ Sutton, R.S.² Kehoe, E.J.³

42
- 67349098495
- Two types of dopamine neuron distinctly convey positive and negative motivational signals
- Matsumoto M., Hikosaka O. Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 2009, 459(7248):837-841.
- (2009) Nature , vol.459 , Issue.7248 , pp. 837-841
- Matsumoto, M.¹ Hikosaka, O.²

43
- 79951823576
- Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning
- McDannald M.A., Lucantonio F., Burke K.A., Niv Y., Schoenbaum G. Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning. J. Neurosci. 2011, 31(7):2700-2705.
- (2011) J. Neurosci. , vol.31 , Issue.7 , pp. 2700-2705
- McDannald, M.A.¹ Lucantonio, F.² Burke, K.A.³ Niv, Y.⁴ Schoenbaum, G.⁵

44
- 84859323549
- Model-based learning and the contribution of the orbitofrontal cortex to the model-free world
- McDannald M.A., Takahashi Y.K., Lopatina N., Pietras B.W., Jones J.L., Schoenbaum G. Model-based learning and the contribution of the orbitofrontal cortex to the model-free world. Eur. J. Neurosci. 2012, 35(7):991-996.
- (2012) Eur. J. Neurosci. , vol.35 , Issue.7 , pp. 991-996
- McDannald, M.A.¹ Takahashi, Y.K.² Lopatina, N.³ Pietras, B.W.⁴ Jones, J.L.⁵ Schoenbaum, G.⁶

45
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive Hebbian learning
- Montague P., Dayan P., Sejnowski T. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci. 1996, 16(5):1936-1947.
- (1996) J. Neurosci. , vol.16 , Issue.5 , pp. 1936-1947
- Montague, P.¹ Dayan, P.² Sejnowski, T.³

46
- 7244240565
- Computational roles for dopamine in behavioural control
- Montague P.R., Hyman S.E., Cohen J.D. Computational roles for dopamine in behavioural control. Nature 2004, 431(7010):760-767.
- (2004) Nature , vol.431 , Issue.7010 , pp. 760-767
- Montague, P.R.¹ Hyman, S.E.² Cohen, J.D.³

47
- 33748337293
- Imaging valuation models in human choice
- Montague P.R., King-Casas B., Cohen J.D. Imaging valuation models in human choice. Annu. Rev. Neurosci. 2006, 29:417-448.
- (2006) Annu. Rev. Neurosci. , vol.29 , pp. 417-448
- Montague, P.R.¹ King-Casas, B.² Cohen, J.D.³

48
- 33747585633
- Midbrain dopamine neurons encode decisions for future action
- Morris G., Nevet A., Arkadir D., Vaadia E., Bergman H. Midbrain dopamine neurons encode decisions for future action. Nat. Neurosci. 2006, 9(8):1057-1063.
- (2006) Nat. Neurosci. , vol.9 , Issue.8 , pp. 1057-1063
- Morris, G.¹ Nevet, A.² Arkadir, D.³ Vaadia, E.⁴ Bergman, H.⁵

49
- 0035399093
- Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences - a computational approach
- Nakahara H., Doya K., Hikosaka O. Parallel cortico-basal ganglia mechanisms for acquisition and execution of visuomotor sequences - a computational approach. J. Cogn. Neurosci. 2001, 13(5):626-647.
- (2001) J. Cogn. Neurosci. , vol.13 , Issue.5 , pp. 626-647
- Nakahara, H.¹ Doya, K.² Hikosaka, O.³

50
- 1642575165
- Dopamine neurons can represent context-dependent prediction error
- Nakahara H., Itoh H., Kawagoe R., Takikawa Y., Hikosaka O. Dopamine neurons can represent context-dependent prediction error. Neuron 2004, 41:269-280.
- (2004) Neuron , vol.41 , pp. 269-280
- Nakahara, H.¹ Itoh, H.² Kawagoe, R.³ Takikawa, Y.⁴ Hikosaka, O.⁵

51
- 78649667137
- Internal-time temporal difference model for neural value-based decision making
- Nakahara H., Kaveri S. Internal-time temporal difference model for neural value-based decision making. Neural Comput. 2010, 22(12):3062-3106.
- (2010) Neural Comput. , vol.22 , Issue.12 , pp. 3062-3106
- Nakahara, H.¹ Kaveri, S.²

52
- 77956209239
- Temporally extended dopamine responses to perceptually demanding reward-predictive stimuli
- Nomoto K., Schultz W., Watanabe T., Sakagami M. Temporally extended dopamine responses to perceptually demanding reward-predictive stimuli. J. Neurosci. 2010, 30(32):10692-10702.
- (2010) J. Neurosci. , vol.30 , Issue.32 , pp. 10692-10702
- Nomoto, K.¹ Schultz, W.² Watanabe, T.³ Sakagami, M.⁴

53
- 70350558451
- Brain hemispheres selectively track the expected value of contralateral options
- Palminteri S., Boraud T., Lafargue G., Dubois B., Pessiglione M. Brain hemispheres selectively track the expected value of contralateral options. J. Neurosci. 2009, 29(43):13465-13472.
- (2009) J. Neurosci. , vol.29 , Issue.43 , pp. 13465-13472
- Palminteri, S.¹ Boraud, T.² Lafargue, G.³ Dubois, B.⁴ Pessiglione, M.⁵

54
- 34547982545
- Analyzing feature generation for value-function approximation. In: ICML-07, Oregon, USA
- Parr, R., Painter-Wakefield, C., Li, L., Littman, M., 2007. Analyzing feature generation for value-function approximation. In: ICML-07, Oregon, USA, pp.737-744.
- (2007) , pp. 737-744
- Parr, R.¹ Painter-Wakefield, C.² Li, L.³ Littman, M.⁴

55
- 45749098894
- A framework for studying the neurobiology of value-based decision making
- Rangel A., Camerer C., Montague P.R. A framework for studying the neurobiology of value-based decision making. Nat. Rev. Neurosci. 2008, 9(7):545-556.
- (2008) Nat. Rev. Neurosci. , vol.9 , Issue.7 , pp. 545-556
- Rangel, A.¹ Camerer, C.² Montague, P.R.³

56
- 79960241771
- Decision making under uncertainty: a neural model based on partially observable markov decision processes
- Rao R.P. Decision making under uncertainty: a neural model based on partially observable markov decision processes. Front. Comput. Neurosci. 2010, 4:146.
- (2010) Front. Comput. Neurosci. , vol.4 , pp. 146
- Rao, R.P.¹

57
- 33751184634
- The short-latency dopamine signal: a role in discovering novel actions?
- Redgrave P., Gurney K. The short-latency dopamine signal: a role in discovering novel actions?. Nat. Rev. Neurosci. 2006, 7(12):967-975.
- (2006) Nat. Rev. Neurosci. , vol.7 , Issue.12 , pp. 967-975
- Redgrave, P.¹ Gurney, K.²

58
- 34548837994
- Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling
- Redish A.D., Jensen S., Johnson A., Kurth-Nelson Z. Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. Psychol. Rev. 2007, 114(3):784-805.
- (2007) Psychol. Rev. , vol.114 , Issue.3 , pp. 784-805
- Redish, A.D.¹ Jensen, S.² Johnson, A.³ Kurth-Nelson, Z.⁴

59
- 79953798456
- Cortical map plasticity improves learning but is not necessary for improved performance
- Reed A., Riley J., Carraway R., Carrasco A., Perez C., Jakkamsetti V., Kilgard M.P. Cortical map plasticity improves learning but is not necessary for improved performance. Neuron 2011, 70(1):121-131.
- (2011) Neuron , vol.70 , Issue.1 , pp. 121-131
- Reed, A.¹ Riley, J.² Carraway, R.³ Carrasco, A.⁴ Perez, C.⁵ Jakkamsetti, V.⁶ Kilgard, M.P.⁷

60
- 0036592025
- Dopamine-dependent plasticity of corticostriatal synapses
- Reynolds J.N., Wickens J.R. Dopamine-dependent plasticity of corticostriatal synapses. Neural Netw. 2002, 15(4-6):507-521.
- (2002) Neural Netw. , vol.15 , Issue.4-6 , pp. 507-521
- Reynolds, J.N.¹ Wickens, J.R.²

61
- 79960637995
- A neural signature of hierarchical reinforcement learning
- Ribas-Fernandes J.J.F., Solway A., Diuk C., McGuire J.T., Barto A.G., Niv Y., Botvinick M.M. A neural signature of hierarchical reinforcement learning. Neuron 2011, 71(2):370-379.
- (2011) Neuron , vol.71 , Issue.2 , pp. 370-379
- Ribas-Fernandes, J.J.F.¹ Solway, A.² Diuk, C.³ McGuire, J.T.⁴ Barto, A.G.⁵ Niv, Y.⁶ Botvinick, M.M.⁷

62
- 77249084637
- Neural correlates of variations in event processing during learning in basolateral amygdala
- Roesch M.R., Calu D.J., Esber G.R., Schoenbaum G. Neural correlates of variations in event processing during learning in basolateral amygdala. J. Neurosci. 2010, 30(7):2464-2471.
- (2010) J. Neurosci. , vol.30 , Issue.7 , pp. 2464-2471
- Roesch, M.R.¹ Calu, D.J.² Esber, G.R.³ Schoenbaum, G.⁴

63
- 36448968271
- Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards
- Roesch M.R., Calu D.J., Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nat. Neurosci. 2007, 10(12):1615-1624.
- (2007) Nat. Neurosci. , vol.10 , Issue.12 , pp. 1615-1624
- Roesch, M.R.¹ Calu, D.J.² Schoenbaum, G.³

64
- 84878178249
- Valuation and decision-making in frontal cortex: one or many serial or parallel systems?
- (Epub ahead of print)
- Rushworth M.F., Kolling N., Sallet J., Mars R.B. Valuation and decision-making in frontal cortex: one or many serial or parallel systems?. Curr. Opin. Neurobiol. 2012, (Epub ahead of print).
- (2012) Curr. Opin. Neurobiol.
- Rushworth, M.F.¹ Kolling, N.² Sallet, J.³ Mars, R.B.⁴

65
- 0242440823
- Correlated coding of motivation and outcome of decision by dopamine neurons
- Satoh T., Nakai S., Sato T., Kimura M. Correlated coding of motivation and outcome of decision by dopamine neurons. J. Neurosci. 2003, 23(30):9913-9923.
- (2003) J. Neurosci. , vol.23 , Issue.30 , pp. 9913-9923
- Satoh, T.¹ Nakai, S.² Sato, T.³ Kimura, M.⁴

66
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz W. Predictive reward signal of dopamine neurons. J. Neurophysiol. 1998, 80:1-27.
- (1998) J. Neurophysiol. , vol.80 , pp. 1-27
- Schultz, W.¹

67
- 0030896968
- A neural substrate of prediction and reward
- Schultz W., Dayan P., Montague P.R. A neural substrate of prediction and reward. Science 1997, 275(5306):1593-1599.
- (1997) Science , vol.275 , Issue.5306 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.R.³

68
- 34147151266
- A common framework for perceptual learning
- Seitz A.R., Dinse H.R. A common framework for perceptual learning. Curr. Opin. Neurobiol. 2007, 17(2):148-153.
- (2007) Curr. Opin. Neurobiol. , vol.17 , Issue.2 , pp. 148-153
- Seitz, A.R.¹ Dinse, H.R.²

69
- 84899031920
- Intrinsically Motivated Reinforcement Learning
- Singh S., Barto A.G., Chentanez N. Intrinsically Motivated Reinforcement Learning. Advances in Neural Information Processing Systems (NIPS) 2005, 17:1281-1288.
- (2005) Advances in Neural Information Processing Systems (NIPS) , vol.17 , pp. 1281-1288
- Singh, S.¹ Barto, A.G.² Chentanez, N.³

70
- 0004102479
- The MIT Press, Cambridge, MA
- Sutton R., Barto A.G. Reinforcement Learning: An Introduction 1998, The MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.G.²

71
- 0003066891
- Time-derivative models of pavlovian reinforcement
- The MIT Press, Cambridge, MA, M. Gabriel, J. Moore (Eds.)
- Sutton R.S., Barto A.G. Time-derivative models of pavlovian reinforcement. Learning and Computational Neuroscience: Foundations of Adaptive Networks 1990, 497-537. The MIT Press, Cambridge, MA. M. Gabriel, J. Moore (Eds.).
- (1990) Learning and Computational Neuroscience: Foundations of Adaptive Networks , pp. 497-537
- Sutton, R.S.¹ Barto, A.G.²

72
- 71149099079
- Fast gradient-descent methods for temporal-difference learning with linear function approximation
- Sutton R.S., Maei H.R., Precup D., Bhatnagar S., Silver D., Szepesvari C., Wiewiora E. Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML-09 2009, 993-1000.
- (2009) ICML-09 , pp. 993-1000
- Sutton, R.S.¹ Maei, H.R.² Precup, D.³ Bhatnagar, S.⁴ Silver, D.⁵ Szepesvari, C.⁶ Wiewiora, E.⁷

73
- 84899464022
- Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction
- Sutton R.S., Modayil J., Delp M., Degris T., Pilarski P.M., White A. Horde: A Scalable Real-time Architecture for Learning Knowledge from Unsupervised Sensorimotor Interaction. AAMAS'11 The 10th International Conference on Autonomous Agents and Multiagent Systems - vol. 2 2011, 761-768.
- (2011) AAMAS'11 The 10th International Conference on Autonomous Agents and Multiagent Systems - vol. 2 , pp. 761-768
- Sutton, R.S.¹ Modayil, J.² Delp, M.³ Degris, T.⁴ Pilarski, P.M.⁵ White, A.⁶

74
- 84862673530
- Learning to simulate others' decisions
- Suzuki S., Harasawa N., Ueno K., Gardner J.L., Ichinohe N., Haruno M., Cheng K., Nakahara H. Learning to simulate others' decisions. Neuron 2012, 74:1125-1137.
- (2012) Neuron , vol.74 , pp. 1125-1137
- Suzuki, S.¹ Harasawa, N.² Ueno, K.³ Gardner, J.L.⁴ Ichinohe, N.⁵ Haruno, M.⁶ Cheng, K.⁷ Nakahara, H.⁸

75
- 84871936763
- Learning to use working memory in partially observable environments through dopaminergic reinforcement
- Todd M., Niv Y., Cohen J.D. Learning to use working memory in partially observable environments through dopaminergic reinforcement. Advances in Neural Information Processing Systems (NIPS) 2009, 21:1-8.
- (2009) Advances in Neural Information Processing Systems (NIPS) , vol.21 , pp. 1-8
- Todd, M.¹ Niv, Y.² Cohen, J.D.³

76
- 78751687449
- The neural basis of intuitive best next-move generation in board game experts
- Wan X., Nakatani H., Ueno K., Asamizuya T., Cheng K., Tanaka K. The neural basis of intuitive best next-move generation in board game experts. Science 2011, 331(6015):341-346.
- (2011) Science , vol.331 , Issue.6015 , pp. 341-346
- Wan, X.¹ Nakatani, H.² Ueno, K.³ Asamizuya, T.⁴ Cheng, K.⁵ Tanaka, K.⁶

77
- 84860307045
- Mapping value based planning and extensively trained choice in the human brain
- Wunderlich K., Dayan P., Dolan R.J. Mapping value based planning and extensively trained choice in the human brain. Nat. Neurosci. 2012, 1-19.
- (2012) Nat. Neurosci. , pp. 1-19
- Wunderlich, K.¹ Dayan, P.² Dolan, R.J.³

78
- 80055117878
- Prediction error associated with the perceptual segmentation of naturalistic events
- Zacks J.M., Kurby C.A., Eisenberg M.L., Haroutunian N. Prediction error associated with the perceptual segmentation of naturalistic events. J. Cogn. Neurosci. 2011, 23(12):4057-4066.
- (2011) J. Cogn. Neurosci. , vol.23 , Issue.12 , pp. 4057-4066
- Zacks, J.M.¹ Kurby, C.A.² Eisenberg, M.L.³ Haroutunian, N.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.