SCOPUS 정보 검색 플랫폼

PLoS ONE

Volumn 4, Issue 10, 2009, Pages

Temporal-difference reinforcement learning with distributed representations

(2) Kurth Nelson, Zeb a Redish, A David a

a UNIVERSITY OF MINNESOTA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DOPAMINE;

ARTICLE; BEHAVIORAL RESEARCH; CONDITIONING; CONTROLLED STUDY; DELAY CONDITIONING; LEARNING ALGORITHM; MATHEMATICAL COMPUTING; REINFORCEMENT; REWARD; STIMULUS RESPONSE; TEMPORAL DIFFERENCE ALGORITHM; TRACE CONDITIONING; ALGORITHM; ANIMAL; ARTIFICIAL NEURAL NETWORK; BIOLOGICAL MODEL; COMPUTER PROGRAM; COMPUTER SIMULATION; HUMAN; LEARNING; PROBABILITY; REPRODUCIBILITY; STATISTICAL MODEL; TIME;

ANIMALIA;

ALGORITHMS; ANIMALS; COMPUTER SIMULATION; CONDITIONING (PSYCHOLOGY); HUMANS; LEARNING; MARKOV CHAINS; MODELS, NEUROLOGICAL; MODELS, STATISTICAL; NEURAL NETWORKS (COMPUTER); REINFORCEMENT (PSYCHOLOGY); REPRODUCIBILITY OF RESULTS; REWARD; SOFTWARE; TIME FACTORS;

EID: 70449382577 PISSN: None EISSN: 19326203 Source Type: Journal
DOI: 10.1371/journal.pone.0007362 Document Type: Article

Times cited : (56)

References (113)

1
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive Hebbian learning
- Montague PR, Dayan P, Sejnowski TJ (1996) A framework for mesencephalic dopamine systems based on predictive Hebbian learning. Journal of Neuroscience 16: 1936-1947.
- (1996) Journal of Neuroscience , vol.16 , pp. 1936-1947
- Montague, P.R.¹ Dayan, P.² Sejnowski, T.J.³

2
- 0030896968
- A neural substrate of prediction and reward
- Schultz W, Dayan P, Montague R (1997) A neural substrate of prediction and reward. Science 275: 1593-1599.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, R.³

3
- 0002337786
- Metalearning, neuromodulation, and emotion
- Hatano G, Okada N, Tanabe H, eds, Elsevier
- Doya K (2000) Metalearning, neuromodulation, and emotion. In: Hatano G, Okada N, Tanabe H, eds. Affective Minds, Elsevier.
- (2000) Affective Minds
- Doya, K.¹

4
- 0004102479
- Cambridge MA: MIT Press
- Sutton RS, Barto AG (1998) Reinforcement Learning: An introduction. Cambridge MA: MIT Press.
- (1998) Reinforcement Learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

5
- 33745787929
- Representation and timing in theories of the dopamine system
- Daw ND, Courville AC, Touretzky DS (2006) Representation and timing in theories of the dopamine system. Neural Computation 18: 1637-1677.
- (2006) Neural Computation , vol.18 , pp. 1637-1677
- Daw, N.D.¹ Courville, A.C.² Touretzky, D.S.³

6
- 34548837994
- Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling
- Redish AD, Jensen S, Johnson A, Kurth-Nelson Z (2007) Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling. Psychological Review 114: 784-805.
- (2007) Psychological Review , vol.114 , pp. 784-805
- Redish, A.D.¹ Jensen, S.² Johnson, A.³ Kurth-Nelson, Z.⁴

7
- 70449457869
- Sutton RS, ed (1992) Special issue on reinforcement learning, 8(3/ 4) of Machine Learning. Boston: Kluwer Academic Publishers.
- Sutton RS, ed (1992) Special issue on reinforcement learning, volume 8(3/ 4) of Machine Learning. Boston: Kluwer Academic Publishers.

8
- 13244266174
- Ph.D. thesis, Carnegie Mellon University
- Daw ND (2003) Reinforcement learning models of the dopamine system and their behavioral implications. Ph.D. thesis, Carnegie Mellon University.
- (2003) Reinforcement learning models of the dopamine system and their behavioral implications
- Daw, N.D.¹

9
- 0002109138
- A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
- Black AH, Prokesy WF, eds, Current Research and Theory. New York: Appleton Century Crofts. pp
- Rescorla RA, Wagner AR (1972) A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In: Black AH, Prokesy WF, eds. Classical Conditioning II: Current Research and Theory. New York: Appleton Century Crofts. pp 64-99.
- (1972) Classical Conditioning II , pp. 64-99
- Rescorla, R.A.¹ Wagner, A.R.²

10
- 0019537951
- Toward a modern theory of adaptive networks: Expectation and prediction
- Sutton RS, Barto AG (1981) Toward a modern theory of adaptive networks: Expectation and prediction. Psychological Review 88: 135-170.
- (1981) Psychological Review , vol.88 , pp. 135-170
- Sutton, R.S.¹ Barto, A.G.²

11
- 0000541213
- Houk JC, Davis JL, Beiser DG, eds. Models of Information Processing in the Basal Ganglia. Cambridge MA: MIT Press. pp
- Barto AG (1995) Adaptive critics and the basal ganglia. In: Houk JC, Davis JL, Beiser DG, eds. Models of Information Processing in the Basal Ganglia. Cambridge MA: MIT Press. pp 215-232.
- (1995) Adaptive critics and the basal ganglia , pp. 215-232
- Barto, A.G.¹

12
- 0034078011
- Neuronal coding of prediction errors
- Schultz W, Dickinson A (2000) Neuronal coding of prediction errors. Annual Review of Neuroscience 23: 473-500.
- (2000) Annual Review of Neuroscience , vol.23 , pp. 473-500
- Schultz, W.¹ Dickinson, A.²

13
- 0037057755
- Getting formal with dopamine and reward
- Schultz W (2002) Getting formal with dopamine and reward. Neuron 36: 241-263.
- (2002) Neuron , vol.36 , pp. 241-263
- Schultz, W.¹

14
- 10344225664
- Addiction as a computational process gone awry
- Redish AD (2004) Addiction as a computational process gone awry. Science 306: 1944-1947.
- (2004) Science , vol.306 , pp. 1944-1947
- Redish, A.D.¹

15
- 0036592029
- Dopamine: Generalization and bonuses
- Kakade S, Dayan P (2002) Dopamine: generalization and bonuses. Neural Networks 15: 549-599.
- (2002) Neural Networks , vol.15 , pp. 549-599
- Kakade, S.¹ Dayan, P.²

16
- 0037987978
- Temporal difference models and reward-related learning in the human brain
- O'Doherty JP, PeterDayan, KF, Critchley H, Dolan RJ (2003) Temporal difference models and reward-related learning in the human brain. Neuron 38: 329-337.
- (2003) Neuron , vol.38 , pp. 329-337
- O'Doherty, J.P.¹ PeterDayan, K.F.² Critchley, H.³ Dolan, R.J.⁴

17
- 9644310472
- Reward representations and reward-related learning in the human brain: Insights from neuroimaging
- O'Doherty JP (2004) Reward representations and reward-related learning in the human brain: insights from neuroimaging. Current Opinion in Neurobiology 14: 769-776.
- (2004) Current Opinion in Neurobiology , vol.14 , pp. 769-776
- O'Doherty, J.P.¹

18
- 21544435722
- Midbrain dopamine neurons encode a quantitative reward prediction error signal
- Bayer HM, Glimcher P (2005) Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47: 129-141.
- (2005) Neuron , vol.47 , pp. 129-141
- Bayer, H.M.¹ Glimcher, P.²

19
- 21544455210
- Dopamine Cells Respond to Predicted Events during Classical Conditioning: Evidence for Eligibility Traces in the Reward-Learning Network
- Pan WX, Schmidt R, Wickens JR, Hyland BI (2005) Dopamine Cells Respond to Predicted Events during Classical Conditioning: Evidence for Eligibility Traces in the Reward-Learning Network. J Neurosci 25: 6235-6242.
- (2005) J Neurosci , vol.25 , pp. 6235-6242
- Pan, W.X.¹ Schmidt, R.² Wickens, J.R.³ Hyland, B.I.⁴

20
- 20444397095
- Extinction of cocaine self-administration reveals functionally and temporally distinct dopaminergic signals in the nucleus accumbens
- Stuber GD, Wightman RM, Carelli RM (2005) Extinction of cocaine self-administration reveals functionally and temporally distinct dopaminergic signals in the nucleus accumbens. Neuron 46: 661-669.
- (2005) Neuron , vol.46 , pp. 661-669
- Stuber, G.D.¹ Wightman, R.M.² Carelli, R.M.³

21
- 34547536392
- Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens
- Day JJ, Roitman MF, Wightman RM, Carelli RM (2007) Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nature Neuroscience 10: 1020-1028.
- (2007) Nature Neuroscience , vol.10 , pp. 1020-1028
- Day, J.J.¹ Roitman, M.F.² Wightman, R.M.³ Carelli, R.M.⁴

22
- 34548778113
- Statistics of midbrain dopamine neuron spike trains in the awake primate
- Bayer HM, Lau B, Glimcher PW (2007) Statistics of midbrain dopamine neuron spike trains in the awake primate. J Neurophysiol 98: 1428-1439.
- (2007) J Neurophysiol , vol.98 , pp. 1428-1439
- Bayer, H.M.¹ Lau, B.² Glimcher, P.W.³

23
- 0036618011
- Multiple model-based reinforcement learning
- Doya K, Samejima K, Katagiri KI, Kawato M (2002) Multiple model-based reinforcement learning. Neural Computation 14: 1347-1369.
- (2002) Neural Computation , vol.14 , pp. 1347-1369
- Doya, K.¹ Samejima, K.² Katagiri, K.I.³ Kawato, M.⁴

24
- 34547742206
- Multiple model-based reinforcement learning explains dopamine neuronal activity
- Bertin M, Schweighofer N, Doya K (2007) Multiple model-based reinforcement learning explains dopamine neuronal activity. Neural Networks 20: 668-675.
- (2007) Neural Networks , vol.20 , pp. 668-675
- Bertin, M.¹ Schweighofer, N.² Doya, K.³

25
- 57349130536
- Stimulus representation and the timing of reward-prediction errors in models of the dopamine system
- Ludvig EA, Sutton RS, Kehoe EJ (2008) Stimulus representation and the timing of reward-prediction errors in models of the dopamine system. Neural Computation 20: 3034-3054.
- (2008) Neural Computation , vol.20 , pp. 3034-3054
- Ludvig, E.A.¹ Sutton, R.S.² Kehoe, E.J.³

26
- 77549086811
- Koller D, Schuurmans D, Bengio Y, Bottou L, eds. Advances in Neural Information Processing Systems
- Ludvig EA, Sutton RS, Verbeek E, Kehoe EJ (2009) A computational model of hippocampal function in trace conditioning. In: Koller D, Schuurmans D, Bengio Y, Bottou L, eds. Advances in Neural Information Processing Systems 21. pp 993-1000.
- (2009) A computational model of hippocampal function in trace conditioning , vol.21 , pp. 993-1000
- Ludvig, E.A.¹ Sutton, R.S.² Verbeek, E.³ Kehoe, E.J.⁴

27
- 0022930826
- Parallel organization of functionally segregated circuits linking basal ganglia and cortex
- Alexander GE, DeLong MR, Strick PL (1986) Parallel organization of functionally segregated circuits linking basal ganglia and cortex. Annual Reviews Neuroscience 9: 357-381.
- (1986) Annual Reviews Neuroscience , vol.9 , pp. 357-381
- Alexander, G.E.¹ DeLong, M.R.² Strick, P.L.³

28
- 0001317998
- Houk JC, Davis JL, Beiser DG, eds. Models of Information Processing in the Basal Ganglia, MIT Pres. pp
- Strick PL, Dum RP, Picard N (1995) Macro-organization of the circuts connecting the basal ganglia with the cortical motor areas. In: Houk JC, Davis JL, Beiser DG, eds. Models of Information Processing in the Basal Ganglia, MIT Pres. pp 117-130.
- (1995) Macro-organization of the circuts connecting the basal ganglia with the cortical motor areas , pp. 117-130
- Strick, P.L.¹ Dum, R.P.² Picard, N.³

29
- 0034654526
- Striatonigrostriatal pathways in primates form an ascending spiral from the shell to the dorsolateral striatum
- Haber SN, Fudge JL, McFarland NR (2000) Striatonigrostriatal pathways in primates form an ascending spiral from the shell to the dorsolateral striatum. Journal of Neuroscience 20: 2369-2382.
- (2000) Journal of Neuroscience , vol.20 , pp. 2369-2382
- Haber, S.N.¹ Fudge, J.L.² McFarland, N.R.³

30
- 3343026029
- Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops
- Tanaka SC, Doya K, Okada G, Ueda K, Okamoto Y, et al. (2004) Prediction of immediate and future rewards differentially recruits cortico-basal ganglia loops. Nature Neuroscience 7: 887-893.
- (2004) Nature Neuroscience , vol.7 , pp. 887-893
- Tanaka, S.C.¹ Doya, K.² Okada, G.³ Ueda, K.⁴ Okamoto, Y.⁵

31
- 43749107069
- Low-serotonin levels increase delayed reward discounting in humans
- Schweighofer N, Bertin M, Shishida K, Okamoto Y, Tanaka SC, et al. (2008) Low-serotonin levels increase delayed reward discounting in humans. Journal of Neuroscience 28: 4528-4532.
- (2008) Journal of Neuroscience , vol.28 , pp. 4528-4532
- Schweighofer, N.¹ Bertin, M.² Shishida, K.³ Okamoto, Y.⁴ Tanaka, S.C.⁵

32
- 0026505520
- Responses of monkey dopamine neurons during learning of behavioral reactions
- Ljungberg T, Apicella P, Schultz W (1992) Responses of monkey dopamine neurons during learning of behavioral reactions. Journal of Neurophysiology 67: 145-163.
- (1992) Journal of Neurophysiology , vol.67 , pp. 145-163
- Ljungberg, T.¹ Apicella, P.² Schultz, W.³

33
- 33644688754
- Dopamine neurons report an error in the temporal prediction of reward during learning
- Hollerman JR, Schultz W (1998) Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience 1: 304-309.
- (1998) Nature Neuroscience , vol.1 , pp. 304-309
- Hollerman, J.R.¹ Schultz, W.²

34
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz W (1998) Predictive reward signal of dopamine neurons. Journal of Neurophysiology 80: 1-27.
- (1998) Journal of Neurophysiology , vol.80 , pp. 1-27
- Schultz, W.¹

35
- 1842684992
- Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioural ecology
- Schultz W (2004) Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioural ecology. Current Opinion in Neurobiology 14: 139-147.
- (2004) Current Opinion in Neurobiology , vol.14 , pp. 139-147
- Schultz, W.¹

36
- 0004287950
- Princeton
- Stephens DW, Krebs JR (1987) Foraging Theory. Princeton.
- (1987) Foraging Theory
- Stephens, D.W.¹ Krebs, J.R.²

37
- 70449450852
- Redish AD, Kurth-Nelson Z (2010) Neural models of temporal discounting. In: Madden G, Bickel W, eds. Impulsivity: The Behavioral and Neurological Science of Discounting, APA books. pp 123-158.
- Redish AD, Kurth-Nelson Z (2010) Neural models of temporal discounting. In: Madden G, Bickel W, eds. Impulsivity: The Behavioral and Neurological Science of Discounting, APA books. pp 123-158.

38
- 0003934109
- Cambridge Univ Press
- Ainslie G (1992) Picoeconomics. Cambridge Univ Press.
- (1992) Picoeconomics
- Ainslie, G.¹

39
- 0030920737
- Choice, delay, probability and conditioned reinforcement
- Mazur J (1997) Choice, delay, probability and conditioned reinforcement. Animal Learning and Behavior 25: 131-147.
- (1997) Animal Learning and Behavior , vol.25 , pp. 131-147
- Mazur, J.¹

40
- 0035229091
- Hyperbolic value addition and general models of animal choice
- Mazur JE (2001) Hyperbolic value addition and general models of animal choice. Psychological Review 108: 96-112.
- (2001) Psychological Review , vol.108 , pp. 96-112
- Mazur, J.E.¹

41
- 0004245883
- Cambridge Univ Press
- Ainslie G (2001) Breakdown of Will. Cambridge Univ Press.
- (2001) Breakdown of Will
- Ainslie, G.¹

42
- 48349146537
- Madden G, Bickel W, Critchfield T, eds in press, APA books
- Madden G, Bickel W, Critchfield T, eds (in press) Impulsivity: Theory, Science, and Neuroscience of Discounting. APA books.
- Impulsivity: Theory, Science, and Neuroscience of Discounting

43
- 0002610737
- On a routing problem
- Bellman R (1958) On a routing problem. Quarterly Journal of Applied Mathematics 16: 87-90.
- (1958) Quarterly Journal of Applied Mathematics , vol.16 , pp. 87-90
- Bellman, R.¹

44
- 84921399937
- Si J, Barto AG, Powell WB, Wuncsch II D, eds , Wiley: IEEE Press
- Si J, Barto AG, Powell WB, Wuncsch II D, eds (2004) Handbook of learning and approximate dynamic programming. Wiley: IEEE Press.
- (2004) Handbook of learning and approximate dynamic programming

45
- 0346706306
- One hundred years of forgetting: A quantitative description of retention
- Rubin DC, Wenzel AE (1996) One hundred years of forgetting: A quantitative description of retention. Psyhcological Review 103: 734-760.
- (1996) Psyhcological Review , vol.103 , pp. 734-760
- Rubin, D.C.¹ Wenzel, A.E.²

46
- 0033465537
- The precise time course of retention
- Rubin DC, Hinton S, Wenzel A (1999) The precise time course of retention. Journal of Experimental Psychology: Learning, Memory, and Cognition 25: 1161-1176.
- (1999) Journal of Experimental Psychology: Learning, Memory, and Cognition , vol.25 , pp. 1161-1176
- Rubin, D.C.¹ Hinton, S.² Wenzel, A.³

47
- 0003799951
- Harvard Univ Press
- Herrnstein RJ (1997) The Matching Law. Harvard Univ Press.
- (1997) The Matching Law
- Herrnstein, R.J.¹

48
- 0032812326
- Discounting of delayed rewards in opioid-dependent outpatients exponential or hyperbolic discounting functions?
- Madden GJ, Bickel WK, Jacobs EA (1999) Discounting of delayed rewards in opioid-dependent outpatients exponential or hyperbolic discounting functions? Experimental and Clinical Psychopharmacology 7: 284-293.
- (1999) Experimental and Clinical Psychopharmacology , vol.7 , pp. 284-293
- Madden, G.J.¹ Bickel, W.K.² Jacobs, E.A.³

49
- 0035638928
- Is time-discounting hyperbolic or subadditive?
- Read D (2001) Is time-discounting hyperbolic or subadditive? Journal of Risk and Uncertainty 23: 5-32.
- (2001) Journal of Risk and Uncertainty , vol.23 , pp. 5-32
- Read, D.¹

50
- 0031912057
- Polydrug abuse in heroin addicts: A behavioral economic analysis
- Petry NM, Bickel WK (1998) Polydrug abuse in heroin addicts: a behavioral economic analysis. Addiction 93: 321-335.
- (1998) Addiction , vol.93 , pp. 321-335
- Petry, N.M.¹ Bickel, W.K.²

51
- 0032743153
- Measures of impulsivity in cigarette smokers and non-smokers
- Mitchell SH (1999) Measures of impulsivity in cigarette smokers and non-smokers. Psychopharmacology 146: 455-464.
- (1999) Psychopharmacology , vol.146 , pp. 455-464
- Mitchell, S.H.¹

52
- 0036672359
- Discounting of delayed health gains and losses by current, never- and ex-smokers of cigarettes
- Odum AL, Madden GJ, Bickel WK (2002) Discounting of delayed health gains and losses by current, never- and ex-smokers of cigarettes. Nicotine and Tobacco Research 4: 295-303.
- (2002) Nicotine and Tobacco Research , vol.4 , pp. 295-303
- Odum, A.L.¹ Madden, G.J.² Bickel, W.K.³

53
- 0142155119
- Pathological gambling severity is associated with impulsivity in a delay discounting procedure
- Alessi SM, Petry NM (2003) Pathological gambling severity is associated with impulsivity in a delay discounting procedure. Behavioural Processes 64: 345-354.
- (2003) Behavioural Processes , vol.64 , pp. 345-354
- Alessi, S.M.¹ Petry, N.M.²

54
- 33751168257
- A review of delay-discounting research with humans: Relations to drug use and gambling
- Reynolds B (2006) A review of delay-discounting research with humans: relations to drug use and gambling. Behavioural Pharmacology 17: 651-667.
- (2006) Behavioural Pharmacology , vol.17 , pp. 651-667
- Reynolds, B.¹

55
- 1942436827
- Memory traces of trace memories: Neurogenesis, synaptogenesis and awareness
- Shors TJ (2004) Memory traces of trace memories: neurogenesis, synaptogenesis and awareness. Trends in Neurosciences 27: 250-256.
- (2004) Trends in Neurosciences , vol.27 , pp. 250-256
- Shors, T.J.¹

56
- 0242600534
- Subsecond dopamine release promotes cocaine seeking
- Phillips PEM, Stuber GD, Heien MLAV, Wightman RM, Carelli RM (2003) Subsecond dopamine release promotes cocaine seeking. Nature 422: 614-618.
- (2003) Nature , vol.422 , pp. 614-618
- Phillips, P.E.M.¹ Stuber, G.D.² Heien, M.L.A.V.³ Wightman, R.M.⁴ Carelli, R.M.⁵

57
- 1242269217
- Dopamine operates as a subsecond modulator of food seeking
- Roitman MF, Stuber GD, Phillips PEM, Wightman RM, Carelli RM (2004) Dopamine operates as a subsecond modulator of food seeking. Journal of Neuroscience 24: 1265-1271.
- (2004) Journal of Neuroscience , vol.24 , pp. 1265-1271
- Roitman, M.F.¹ Stuber, G.D.² Phillips, P.E.M.³ Wightman, R.M.⁴ Carelli, R.M.⁵

58
- 0004281531
- Oxford Univ Press
- Pavlov I (1927) Conditioned Reflexes. Oxford Univ Press.
- (1927) Conditioned Reflexes
- Pavlov, I.¹

59
- 0023035964
- Hippocampus and trace conditioning of the rabbit's classically conditioned nictitating membrane response
- Solomon PR, Schaaf ERV, Thompson RF, Weisz DJ (1986) Hippocampus and trace conditioning of the rabbit's classically conditioned nictitating membrane response. Behavioral Neuroscience 100: 729-744.
- (1986) Behavioral Neuroscience , vol.100 , pp. 729-744
- Solomon, P.R.¹ Schaaf, E.R.V.² Thompson, R.F.³ Weisz, D.J.⁴

60
- 0035662094
- The role of the hippocampus in trace conditioning: Temporal discontinuity or task difficulty?
- Beylin AV, Gandhi CC, Wood GE, Talk AC, Matzel LD, et al. (2001) The role of the hippocampus in trace conditioning: Temporal discontinuity or task difficulty? Neurobiology of Learning and Memory 76: 447-461.
- (2001) Neurobiology of Learning and Memory , vol.76 , pp. 447-461
- Beylin, A.V.¹ Gandhi, C.C.² Wood, G.E.³ Talk, A.C.⁴ Matzel, L.D.⁵

61
- 35148864530
- Dorsal, ventral, and complete excitotoxic lesions of the hippocampus in rats failed to impair appetitive trace conditioning
- Thibaudeau G, Potvin O, Allen K, Dore FY, Goulet S (2007) Dorsal, ventral, and complete excitotoxic lesions of the hippocampus in rats failed to impair appetitive trace conditioning. Behavioural Brain Research 185: 9-20.
- (2007) Behavioural Brain Research , vol.185 , pp. 9-20
- Thibaudeau, G.¹ Potvin, O.² Allen, K.³ Dore, F.Y.⁴ Goulet, S.⁵

62
- 20644435564
- The formation of neural codes in the hippocampus: Trace conditioning as a prototypical paradigm for studying the random recoding hypothesis
- Levy WB, Sanyal A, Rodriguez P, Sullivan DW, Wu XB (2005) The formation of neural codes in the hippocampus: trace conditioning as a prototypical paradigm for studying the random recoding hypothesis. Biol Cybern 92: 409-426.
- (2005) Biol Cybern , vol.92 , pp. 409-426
- Levy, W.B.¹ Sanyal, A.² Rodriguez, P.³ Sullivan, D.W.⁴ Wu, X.B.⁵

63
- 51149102880
- Internally generated cell assembly sequences in the rat hippocampus
- Pastalkova E, Itskov V, Amarasingham A, Buzsaki G (2008) Internally generated cell assembly sequences in the rat hippocampus. Science 321: 1322-1327.
- (2008) Science , vol.321 , pp. 1322-1327
- Pastalkova, E.¹ Itskov, V.² Amarasingham, A.³ Buzsaki, G.⁴

64
- 0021137772
- Bridging temporal gaps between cs and us in autoshaping: A test of a local context hypothesis
- Kaplan PS (1984) Bridging temporal gaps between cs and us in autoshaping: A test of a local context hypothesis. Animal Learning and Behavior 12: 142-148.
- (1984) Animal Learning and Behavior , vol.12 , pp. 142-148
- Kaplan, P.S.¹

65
- 0037431291
- Dopamine as chicken and egg
- Self D (2003) Dopamine as chicken and egg. Nature 422: 573-574.
- (2003) Nature , vol.422 , pp. 573-574
- Self, D.¹

66
- 0030026069
- Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli
- Mirenowicz J, Schultz W (1996) Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli. Nature 379: 449-451.
- (1996) Nature , vol.379 , pp. 449-451
- Mirenowicz, J.¹ Schultz, W.²

67
- 0035315989
- Temporal difference model reproduces anticipatory neural activity
- Suri RE, Schultz W (2001) Temporal difference model reproduces anticipatory neural activity. Neural Computation 13: 841-862.
- (2001) Neural Computation , vol.13 , pp. 841-862
- Suri, R.E.¹ Schultz, W.²

68
- 0037459319
- Discrete coding of reward probability and uncertainty by dopamine neurons
- Fiorillo CD, Tobler PN, Schultz W (2003) Discrete coding of reward probability and uncertainty by dopamine neurons. Science 299: 1898-1902.
- (2003) Science , vol.299 , pp. 1898-1902
- Fiorillo, C.D.¹ Tobler, P.N.² Schultz, W.³

69
- 0027964829
- Importance of unpredictability for reward responses in primate dopamine neurons
- Mirenowicz J, Schultz W (1994) Importance of unpredictability for reward responses in primate dopamine neurons. Journal of Neurophysiology 72: 1024-1027.
- (1994) Journal of Neurophysiology , vol.72 , pp. 1024-1027
- Mirenowicz, J.¹ Schultz, W.²

70
- 13244267004
- Temporal sequence learning, prediction, and control - a review of different models and their relation to biological mechanisms
- Wörgötter F, Porr B (2005) Temporal sequence learning, prediction, and control - a review of different models and their relation to biological mechanisms. Neural Computation 17: 245-319.
- (2005) Neural Computation , vol.17 , pp. 245-319
- Wörgötter, F.¹ Porr, B.²

71
- 0036592008
- Opponent interactions between serotonin and dopamine
- Daw ND, Kakade S, Dayan P (2002) Opponent interactions between serotonin and dopamine. Neural Networks 15: 603-616.
- (2002) Neural Networks , vol.15 , pp. 603-616
- Daw, N.D.¹ Kakade, S.² Dayan, P.³

72
- 7044239264
- Behavior: A marketplace in the brain?
- Ainslie G, Monterosso J (2004) Behavior: A marketplace in the brain? Science 306: 421-423.
- (2004) Science , vol.306 , pp. 421-423
- Ainslie, G.¹ Monterosso, J.²

73
- 0032558817
- On hyperbolic discounting and uncertain hazard rates
- Sozou PD (1998) On hyperbolic discounting and uncertain hazard rates. The Royal Society London B 265: 2015-2020.
- (1998) The Royal Society London , vol.B 265 , pp. 2015-2020
- Sozou, P.D.¹

74
- 0031309579
- Kacelnik A (1997) Normative and descriptive models of decision making: time discounting and risk sensitivity. In: Bock GR, Cardew G, eds. Characterizing Human Psychological Adaptations. Chichester UK: Wiley, 208 of Ciba Foundation Symposia. pp 51-66. Discussion 67-70.
- Kacelnik A (1997) Normative and descriptive models of decision making: time discounting and risk sensitivity. In: Bock GR, Cardew G, eds. Characterizing Human Psychological Adaptations. Chichester UK: Wiley, volume 208 of Ciba Foundation Symposia. pp 51-66. Discussion 67-70.

75
- 70449389676
- An economic perspective on addiction and matching
- Laibson DI (1996) An economic perspective on addiction and matching. Behavioral and Brain Sciences 19: 583-584.
- (1996) Behavioral and Brain Sciences , vol.19 , pp. 583-584
- Laibson, D.I.¹

76
- 5144224271
- Separate neural systems value immediate and delayed monetary rewards
- McClure SM, Laibson DI, Loewenstein G, Cohen JD (2004) Separate neural systems value immediate and delayed monetary rewards. Science 306: 503-507.
- (2004) Science , vol.306 , pp. 503-507
- McClure, S.M.¹ Laibson, D.I.² Loewenstein, G.³ Cohen, J.D.⁴

77
- 33644512423
- Neuroeconomics: Cross-currents in research on decision-making
- Sanfey AG, Loewenstein G, McClure SM, Cohen JD (2006) Neuroeconomics: cross-currents in research on decision-making. Trends in Cognitive Sciences 10: 108-116.
- (2006) Trends in Cognitive Sciences , vol.10 , pp. 108-116
- Sanfey, A.G.¹ Loewenstein, G.² McClure, S.M.³ Cohen, J.D.⁴

78
- 0035968007
- Impulsive choice induced in rats by lesion of the nucleus accumbens core
- Cardinal RN, Pennicott DR, Sugathapala CL, Robbins TW, Everitt BJ (2001) Impulsive choice induced in rats by lesion of the nucleus accumbens core. Science 292: 2499-2501.
- (2001) Science , vol.292 , pp. 2499-2501
- Cardinal, R.N.¹ Pennicott, D.R.² Sugathapala, C.L.³ Robbins, T.W.⁴ Everitt, B.J.⁵

79
- 0141619496
- Operant conditioning
- Staddon JER, Cerutti DT (2003) Operant conditioning. Annual Reviews of Psychology 54: 115-144.
- (2003) Annual Reviews of Psychology , vol.54 , pp. 115-144
- Staddon, J.E.R.¹ Cerutti, D.T.²

80
- 39149087042
- Is a bird in the hand worth two in the future? the neuroeconomics of intertemporal decision-making
- Kalenscher T, Pennartz CMA (2008) Is a bird in the hand worth two in the future? the neuroeconomics of intertemporal decision-making. Progress in Neurobiology 84: 284-315.
- (2008) Progress in Neurobiology , vol.84 , pp. 284-315
- Kalenscher, T.¹ Pennartz, C.M.A.²

81
- 0023776798
- Scalar expectancy theory and choice between delayed rewards
- Gibbon J, Church RM, Fairhurst S, Kacelnik A (1988) Scalar expectancy theory and choice between delayed rewards. Psychological Review 95: 102-114.
- (1988) Psychological Review , vol.95 , pp. 102-114
- Gibbon, J.¹ Church, R.M.² Fairhurst, S.³ Kacelnik, A.⁴

82
- 0034169238
- Time, rate, and conditioning
- Gallistel CR, Gibbon J (2000) Time, rate, and conditioning. Psychological Review 107: 289-344.
- (2000) Psychological Review , vol.107 , pp. 289-344
- Gallistel, C.R.¹ Gibbon, J.²

83
- 0742324926
- Inter-module credit assignment in modular reinforcement learning
- Samejima K, Doya K, Kawato M (2003) Inter-module credit assignment in modular reinforcement learning. Neural Networks 16: 985-994.
- (2003) Neural Networks , vol.16 , pp. 985-994
- Samejima, K.¹ Doya, K.² Kawato, M.³

84
- 34848829141
- Dopamine release is heterogeneous within microenvironments of the rat nucleus accumbens
- Wightman RM, Heien MLAV, Wassum KM, Sombers LA, Aragona BJ, et al. (2007) Dopamine release is heterogeneous within microenvironments of the rat nucleus accumbens. European Journal of Neuroscience 26: 2046-2054.
- (2007) European Journal of Neuroscience , vol.26 , pp. 2046-2054
- Wightman, R.M.¹ Heien, M.L.A.V.² Wassum, K.M.³ Sombers, L.A.⁴ Aragona, B.J.⁵

85
- 0030513846
- A sequence predicting CA3 is a flexible associator that learns and uses context to solve hippocampal-like tasks
- Levy WB (1996) A sequence predicting CA3 is a flexible associator that learns and uses context to solve hippocampal-like tasks. Hippocampus 6: 579-591.
- (1996) Hippocampus , vol.6 , pp. 579-591
- Levy, W.B.¹

86
- 0032519055
- Probabilistic interpretation of population codes
- Zemel RS, Dayan P, Pouget A (1998) Probabilistic interpretation of population codes. Neural Computation 10: 403-430.
- (1998) Neural Computation , vol.10 , pp. 403-430
- Zemel, R.S.¹ Dayan, P.² Pouget, A.³

87
- 0004291629
- MIT Press
- Dayan P, Abbott LF (2001) Theoretical Neuroscience. MIT Press.
- (2001) Theoretical Neuroscience
- Dayan, P.¹ Abbott, L.F.²

88
- 0347625931
- Detecting dynamical changes within a simulated neural ensemble using a measure of representational quality
- Jackson JC, Redish AD (2003) Detecting dynamical changes within a simulated neural ensemble using a measure of representational quality. Network: Computation in Neural Systems 14: 629-645.
- (2003) Network: Computation in Neural Systems , vol.14 , pp. 629-645
- Jackson, J.C.¹ Redish, A.D.²

89
- 14844299356
- Reconstruction of the postsubiculum head direction signal from neural ensembles
- Johnson A, Seeland KD, Redish AD (2005) Reconstruction of the postsubiculum head direction signal from neural ensembles. Hippocampus 15: 86-96.
- (2005) Hippocampus , vol.15 , pp. 86-96
- Johnson, A.¹ Seeland, K.D.² Redish, A.D.³

90
- 84928291969
- Hölscher C, Munk MHJ, eds. Mechanisms of information processing in the Brain: Encoding of information in neural populations and networks, Cambridge University Press. pp
- Johnson A, Jackson J, Redish AD (2008) Measuring distributed properties of neural representations beyond the decoding of local variables - implications for cognition. In: Hölscher C, Munk MHJ, eds. Mechanisms of information processing in the Brain: Encoding of information in neural populations and networks, Cambridge University Press. pp 95-119.
- (2008) Measuring distributed properties of neural representations beyond the decoding of local variables - implications for cognition , pp. 95-119
- Johnson, A.¹ Jackson, J.² Redish, A.D.³

91
- 84899017487
- Dietterich TG, Becker S, Ghahramani Z, eds. Advances in Neural Information Processing Systems, Cambridge, MA: MIT Press
- Dayan P (2002) Motivated reinforcement learning. In: Dietterich TG, Becker S, Ghahramani Z, eds. Advances in Neural Information Processing Systems 14. Cambridge, MA: MIT Press.
- (2002) Motivated reinforcement learning , pp. 14
- Dayan, P.¹

92
- 0032930935
- A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
- Suri RE, Schultz W (1999) A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience 91: 871-890.
- (1999) Neuroscience , vol.91 , pp. 871-890
- Suri, R.E.¹ Schultz, W.²

93
- 0003781238
- New York: Cambridge University Press
- Norris JR (1997) Markov Chains. New York: Cambridge University Press.
- (1997) Markov Chains
- Norris, J.R.¹

94
- 0003618624
- New York: Springer
- Brémaud P (1999) Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues. New York: Springer.
- (1999) Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues
- Brémaud, P.¹

95
- 0038503086
- Dopamine and inference about timing
- Daw ND, Courville AC, Touretzky DS (2002) Dopamine and inference about timing. Proceedings of the Second International Conference on Development and Learning.
- (2002) Proceedings of the Second International Conference on Development and Learning
- Daw, N.D.¹ Courville, A.C.² Touretzky, D.S.³

96
- 26444446315
- Dopamine, uncertainty, and TD learning
- Niv Y, Duff MO, Dayan P (2005) Dopamine, uncertainty, and TD learning. Behavioral and Brain Functions 1: 6.
- (2005) Behavioral and Brain Functions , vol.1 , pp. 6
- Niv, Y.¹ Duff, M.O.² Dayan, P.³

97
- 70449457868
- Tesauro G, Touretzky D, Leen T, eds. Advances in Neural Information Processing, MIT Press
- Badtke SJ, Duff MO (1995) Reinforcement-learning methods for continuoustime Markov decision problems. In: Tesauro G, Touretzky D, Leen T, eds. Advances in Neural Information Processing 7, MIT Press.
- (1995) Reinforcement-learning methods for continuoustime Markov decision problems , pp. 7
- Badtke, S.J.¹ Duff, M.O.²

98
- 0032643313
- Solving semi-markov decision problems using average reward reinforcement learning
- Das T, Gosavi A, Mahadevan S, Marchalleck N (1999) Solving semi-markov decision problems using average reward reinforcement learning. Management Science 45: 575-596.
- (1999) Management Science , vol.45 , pp. 575-596
- Das, T.¹ Gosavi, A.² Mahadevan, S.³ Marchalleck, N.⁴

99
- 0001235758
- Houk JC, Davis JL, Beiser DG, eds. Models of Information Processing in the Basal Ganglia. Cambridge MA: MIT Press. pp
- Schultz W, Romo R, Ljungberg T, Mirenowicz J, hollerman JR, et al. (1995) Reward-related signals carried by dopamine neurons. In: Houk JC, Davis JL, Beiser DG, eds. Models of Information Processing in the Basal Ganglia. Cambridge MA: MIT Press. pp 233-248.
- (1995) Reward-related signals carried by dopamine neurons , pp. 233-248
- Schultz, W.¹ Romo, R.² Ljungberg, T.³ Mirenowicz, J.⁴ hollerman, J.R.⁵

100
- 33646467129
- Evidence that the delay-period activity of dopamine neurons corresponds to reward uncertainty rather than backpropogating TD errors
- Fiorillo CD, Tobler PN, Schultz W (2005) Evidence that the delay-period activity of dopamine neurons corresponds to reward uncertainty rather than backpropogating TD errors. Behavioral and Brain Functions 1: 7.
- (2005) Behavioral and Brain Functions , vol.1 , pp. 7
- Fiorillo, C.D.¹ Tobler, P.N.² Schultz, W.³

101
- 34147168649
- Coordinated accumbal dopamine release and neural activity drive goal-directed behavior
- Cheer JF, Aragona BJ, Heien MLAV, Seipel AT, Carelli RM, et al. (2007) Coordinated accumbal dopamine release and neural activity drive goal-directed behavior. Neuron 54: 237-244.
- (2007) Neuron , vol.54 , pp. 237-244
- Cheer, J.F.¹ Aragona, B.J.² Heien, M.L.A.V.³ Seipel, A.T.⁴ Carelli, R.M.⁵

102
- 0004212914
- Academic Press
- Mackintosh NJ (1974) The Psychology of Animal Learning. Academic Press.
- (1974) The Psychology of Animal Learning
- Mackintosh, N.J.¹

103
- 48149101941
- The temporal precision of reward prediction in dopamine neurons
- Fiorillo CD, Newsome WT, Schultz W (2008) The temporal precision of reward prediction in dopamine neurons. Nature Neuroscience 11: 966-973.
- (2008) Nature Neuroscience , vol.11 , pp. 966-973
- Fiorillo, C.D.¹ Newsome, W.T.² Schultz, W.³

104
- 0033213819
- What are the computations of the cerebellum, the basal ganglia, and the cerebral cortex?
- Doya K (1999) What are the computations of the cerebellum, the basal ganglia, and the cerebral cortex? Neural networks 12: 961-974.
- (1999) Neural networks , vol.12 , pp. 961-974
- Doya, K.¹

105
- 0034524427
- Complementary roles of basal ganglia and cerebellum in learning and motor control
- Doya K (2000) Complementary roles of basal ganglia and cerebellum in learning and motor control. Current Opinion in Neurobiology 10: 732-739.
- (2000) Current Opinion in Neurobiology , vol.10 , pp. 732-739
- Doya, K.¹

106
- 28144449057
- Representation of action-specific reward values in the striatum
- Samejima K, Ueda Y, Doya K, Kimura M (2005) Representation of action-specific reward values in the striatum. Science 310: 1337-1340.
- (2005) Science , vol.310 , pp. 1337-1340
- Samejima, K.¹ Ueda, Y.² Doya, K.³ Kimura, M.⁴

107
- 34147191094
- Efficient reinforcement learning: Computational theories, neuroscience and robotics
- Kawato M, Samejima K (2007) Efficient reinforcement learning: computational theories, neuroscience and robotics. Current Opinion in Neurobiology 17: 205-212.
- (2007) Current Opinion in Neurobiology , vol.17 , pp. 205-212
- Kawato, M.¹ Samejima, K.²

108
- 0025321039
- Functional architecture of basal ganglia circuits: Neural substrates of parallel processing
- Alexander GE, Crutcher MD (1990) Functional architecture of basal ganglia circuits: Neural substrates of parallel processing. Trends in Neurosciences 13: 266-271.
- (1990) Trends in Neurosciences , vol.13 , pp. 266-271
- Alexander, G.E.¹ Crutcher, M.D.²

109
- 0002063951
- Striosomes and matrisomes
- Bernardi G, Carpenter MB, Di Chiara G, eds, Plenum
- Graybiel AM, Flaherty AW, Giménez-Amaya JM (1991) Striosomes and matrisomes. In: Bernardi G, Carpenter MB, Di Chiara G, eds. The Basal Ganglia III, Plenum.
- (1991) The Basal Ganglia III
- Graybiel, A.M.¹ Flaherty, A.W.² Giménez-Amaya, J.M.³

110
- 0031873240
- Hyperbolic temporal discounting in social drinkers and problem drinkers
- Vuchinich RE, Simpson CA (1998) Hyperbolic temporal discounting in social drinkers and problem drinkers. Experimental and Clinical Psychopharmacology 6: 292-305.
- (1998) Experimental and Clinical Psychopharmacology , vol.6 , pp. 292-305
- Vuchinich, R.E.¹ Simpson, C.A.²

111
- 34447630083
- Schweighofer N, Tanaka SC, Doya K (2007) Serotonin and the evaluation of future rewards. theory, experiments, and possible neural mechanisms. Annals of the New York Academy of Sciences 1104: 289-300.
- Schweighofer N, Tanaka SC, Doya K (2007) Serotonin and the evaluation of future rewards. theory, experiments, and possible neural mechanisms. Annals of the New York Academy of Sciences 1104: 289-300.

112
- 70449457651
- An fMRI study of the delay discounting of reward after tryptophan depletion and loading
- Society for Neuroscience Abstracts
- Tanaka SC, Schweighofer N, Asahi S, Okamoto Y, Doya K (2004) An fMRI study of the delay discounting of reward after tryptophan depletion and loading. 2: reward-expectation. Society for Neuroscience Abstracts.
- (2004) 2: Reward-expectation
- Tanaka, S.C.¹ Schweighofer, N.² Asahi, S.³ Okamoto, Y.⁴ Doya, K.⁵

113
- 33751392881
- Humans can adopt optimal discounting strategy under real-time constraints
- Schweighofer N, Shishida K, Han CE, Yamawaki YOSCTS, Doya K (2006) Humans can adopt optimal discounting strategy under real-time constraints. PLoS Computational Biology 2: e152.
- (2006) PLoS Computational Biology , vol.2
- Schweighofer, N.¹ Shishida, K.² Han, C.E.³ YOSCTS, Y.⁴ Doya, K.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.