SCOPUS 정보 검색 플랫폼

Neural Networks

Volumn 15, Issue 4-6, 2002, Pages 549-559

Dopamine: Generalization and bonuses

(2) Kakade, Sham a Dayan, Peter a

a UNIVERSITY COLLEGE LONDON (United Kingdom)

Author keywords

Dopamine; Exploration; Generalization; Reinforcement learning; Temporal difference

Indexed keywords

CALCULATIONS; ERRORS; NEUROLOGY;

DOPAMINE;

NEURAL NETWORKS;

DOPAMINE;

DOPAMINERGIC NERVE CELL; DOPAMINERGIC SYSTEM; EXPLORATORY BEHAVIOR; HUMAN; INFORMATION PROCESSING; LEARNING; NONHUMAN; PRIORITY JOURNAL; REVIEW; REWARD; THEORETICAL MODEL; ANIMAL; PHYSIOLOGY;

ANIMALS; DOPAMINE; EXPLORATORY BEHAVIOR; GENERALIZATION (PSYCHOLOGY); HUMANS; REWARD;

EID: 0036592029 PISSN: 08936080 EISSN: None Source Type: Journal
DOI: 10.1016/S0893-6080(02)00048-5 Document Type: Article

Times cited : (361)

References (67)

1
- 0029922576
- Psychobiology of novelty seeking and drug seeking behavior
- (1996) Behavioural Brain Research , vol.77 , pp. 23-43
- Bardo, M.T.¹ Donohew, R.L.² Harrington, N.G.³

2
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- (1983) IEEE Transaction on Systems, Man and Cybernetics , vol.13 SMC , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

3
- 2142658982
- Neuro-dynamic programming, Cambridge, MA: Athena Scientific
- (1996)
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 84880854156
- (2001) R-MAX-A general polynomial time algorithm for near-optimal reinforcement learning , vol.IJCAI , pp. 953-958
- Brafman, R.I.¹ Tennenholtz, M.²

5
- 0032765150
- Cognition and control in schizophrenia: A computational model of dopamine and prefrontal function
- (1999) Biological Psychiatry , vol.46 , pp. 312-328
- Braver, T.S.¹ Barch, D.M.² Cohen, J.D.³

6
- 0000696066
- The misbehavior of organisms
- (1961) American Psychologist , vol.16 , pp. 681-684
- Breland, K.¹ Breland, M.²

7
- 0021284218
- Properties of the internal clock
- (1984) Annals of the New York Academy of Sciences , vol.423 , pp. 566-582
- Church, R.M.¹

8
- 2142767000
- Roberts A.C., Robbins T.W. (Eds.), The prefrontal cortex, executive and cognitive functions, Oxford: OUP
- (1998)
- Cohen, J.D.¹ Braver, T.S.² O'Reilly, R.C.³

9
- 0033722074
- Behavioral considerations suggest an average reward TD model of the dopamine system
- (2000) Neurocomputing , vol.32 , pp. 679-684
- Daw, N.D.¹ Touretzky, D.S.²

10
- 0036592008
- Opponent interactions between serotonin and dopamine
- (2002) Neural Networks , vol.15 , Issue.4-5
- Daw, N.D.¹ Kakade, S.² Dayan, P.³

11
- 84899017487
- Motivated reinforcement learning
- Dietterich T.G., Becker S., Ghahramani Z. (Eds.), Cambridge, MA: MIT Press
- (2000) Neural information processing systems, 14
- Dayan, P.¹

12
- 2142819598
- Theoretical neuroscience, Cambridge, MA: MIT Press
- (2001)
- Dayan, P.¹ Abbott, L.F.²

13
- 0030260201
- Exploration bonuses and dual control
- (1996) Machine Learning , vol.25 , pp. 5-22
- Dayan, P.¹ Sejnowski, T.J.²

14
- 0043250430
- The role of learning in motivation
- Gallistel C.R. (Ed.), Steven's handbook of experimental psychology, 3rd ed, New York: Wiley
- (2001) Steven's handbook of experimental psychology , vol.3 Vol.
- Dickinson, A.¹ Balleine, B.²

15
- 2142751213
- Reinforcement learning in continuous time and space
- (1999) Neural Computation , vol.12 , pp. 243-269
- Doya, K.¹

16
- 0032835152
- Association between novelty seeking and type 4 dopamine receptor gene in a large Finnish cohort sample
- (1999) American Journal of Psychiatry , vol.156 , pp. 1453-1455
- Ekelund, J.¹ Lichtermann, D.² Jaervelin, M.R.³ Peltonen, L.⁴

17
- 0033179417
- Orbitofrontal cortex and representation of incentive value in associative learning
- (1999) Journal of Neuroscience , vol.19 , pp. 6610-6614
- Gallagher, M.¹ McMahan, R.W.² Schoenbaum, G.³

18
- 0030989710
- Toward a neurobiology of temporal cognition: Advances and challenges
- (1997) Current Opinion in Neurobiology , vol.7 , pp. 170-184
- Gibbon, J.¹ Malapani, C.² Dale, C.L.³ Gallistel, C.R.⁴

19
- 0031589689
- Dopamine's role
- (1997) Science , vol.278 , pp. 1548-1549
- Gray, J.A.¹ Young, A.M.² Joseph, M.H.³

20
- 0000146022
- Neural dynamics of attentionally modulated pavlovian conditioning: Conditioned reinforcement, inhibition, and opponent processing
- (1987) Psychobiology , vol.15 , pp. 195-240
- Grossberg, S.¹ Schmajuk, N.A.²

21
- 0024775767
- Neural dynamics of adaptive timing and temporal discrimination during associative learning
- (1989) Neural Networks , vol.2 , pp. 79-102
- Grossberg, S.¹ Schmajuk, N.A.²

22
- 0032914820
- An electrophysiological characterization of ventral tegmental area dopaminergic neurons during differential pavlovian fear conditioning in the awake rabbit
- (1999) Behavioural Brain Research , vol.99 , pp. 169-179
- Guarraci, F.A.¹ Kapp, B.S.²

23
- 0030920752
- The role of an amygdalo-nigrostriatal pathway in associative learning
- (1997) Journal of Neuroscience , vol.17 , pp. 3913-3919
- Han, J.S.¹ McMahan, R.W.² Holland, P.³ Gallagher, M.⁴

24
- 0029758993
- Neurotoxic lesions of basolateral, but not central, amygdala interfere with Pavlovian second-order conditioning and reinforcer devaluation effects
- (1996) Journal of Neuroscience , vol.16 , pp. 5256-5265
- Hatfield, T.¹ Han, J.S.² Conley, M.³ Gallagher, M.⁴ Holland, P.⁵

25
- 0032984336
- Amygdala circuitry in attentional and representational processes
- (1999) Trends in Cognitive Sciences , vol.3 , pp. 65-73
- Holland, P.C.¹ Gallagher, M.²

26
- 0028235161
- Involvement of dopamine and excitatory amino acid transmission in novelty-induced motor activity
- (1994) Journal of Pharmacology, Experimental Therapeutics , vol.269 , pp. 976-988
- Hooks, M.S.¹ Kalivas, P.W.²

27
- 0030757872
- Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat
- (1997) Brain Research , vol.759 , pp. 251-258
- Horvitz, J.C.¹ Stewart, T.² Jacobs, B.³

28
- 0002861883
- A model of how the basal ganglia generate and use neural signals that predict reinforcement
- Houk J.C., Davis J.L., Beiser D.G. (Eds.), Models of information processing in the basal ganglia, Cambridge, MA: MIT Press
- (1995) , pp. 249-270
- Houk, J.C.¹ Adams, J.L.² Barto, A.G.³

29
- 2142806930
- Principles of behavior, New York: Appleton (Century)
- (1943)
- Hull, C.L.¹

30
- 0033461157
- (1999) Brain Research Reviews , vol.31 , pp. 6-41
- Ikemoto, S.¹ Panksepp, J.²

31
- 2142818206
- Leen T.K., Dietterich T.G., Tresp V. (Eds.), Dopamine bonuses, NIPS
- (2000)
- Kakade, S.¹ Dayan, P.²

32
- 2142647467
- Kehoe, E.J (1977). Effects of serial compound stimuli on stimulus selection in classical conditioning of the rabbit nictitating membrane response. PhD Thesis, University of Iowa.

33
- 0027964829
- Importance of unpredictability for reward responses in primate dopamine neurons
- (1994) Journal of Neurophysiology , vol.72 , pp. 1024-1027
- Mirenowicz, J.¹ Schultz, W.²

34
- 0030026069
- Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli
- (1996) Nature , vol.379 , pp. 449-451
- Mirenowicz, J.¹ Schultz, W.²

35
- 0028972278
- Bee foraging in uncertain environments using predictive hebbian learning
- (1995) Nature , vol.377 , pp. 725-728
- Montague, P.R.¹ Dayan, P.² Person, C.³ Sejnowski, T.J.⁴

36
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive hebbian learning
- (1996) Journal of Neuroscience , vol.16 , pp. 1936-1947
- Montague, P.R.¹ Dayan, P.² Sejnowski, T.J.³

37
- 2142661774
- Ng, A. Y., Harada, D., & Russell, S (1999). Policy invariance under reward transformations: Theory and application to reward shaping. Proceedings of the 16th International Conference on Machine Learning.

38
- 0035152958
- Abstract reward and punishment representations in the human orbitofrontal cortex
- (2001) Nature Neuroscience , vol.4 , pp. 95-102
- O'Doherty, J.¹ Kringelbach, M.L.² Rolls, E.T.³ Hornak, J.⁴ Andrews, C.⁵

39
- 0035931930
- Temporal dynamics of a neural solution to the aperture problem in visual area MT of macaque brain
- (2001) Nature , vol.409 , pp. 1040-1042
- Pack, C.C.¹ Born, R.T.²

40
- 0033055130
- Dopamine D4 receptor gene: Novelty or nonsense?
- (1999) Neuropsychopharmacology , vol.21 , pp. 3-16
- Paterson, A.D.¹ Sunohara, G.A.² Kennedy, J.L.³

41
- 0033480246
- The influence of background stimuli on summation in autoshaping
- (1999) Quarterly Journal of Experimental Psychology, Comparative, Physiological Psychology , vol.52 , pp. 53-74
- Pearce, J.M.¹ George, D.N.² Redhead, E.S.³ Aydin, A.⁴ Wynne, C.⁵

42
- 0033119561
- Is the short-latency dopamine response too short to signal reward error?
- (1999) Trends in Neurosciences , vol.22 , pp. 146-151
- Redgrave, P.¹ Prescott, T.² Gurney, K.³

43
- 0030023561
- Intrinsic reinforcing properties of putatively neutral stimuli in an instrumental two-lever discrimination task
- (1996) Animal Learning and Behavior , vol.24 , pp. 38-45
- Reed, P.¹ Mitchell, C.² Nokes, T.³

44
- 0002109138
- A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement
- Black A.H., Prokasy W.F. (Eds.), Classical conditioning II, current research and theory, New York: Aleton (Century/Crofts)
- (1972) , pp. 64-69
- Rescorla, R.A.¹ Wagner, A.R.²

45
- 0034017604
- The orbitofrontal cortex and reward
- (2000) Cerebral Cortex , vol.10 , pp. 284-294
- Rolls, E.T.¹

46
- 0028194966
- The involvement of nucleus accumbens dopamine in appetitive and aversive motivation
- (1994) Behavioural Brain Research , vol.61 , pp. 117-133
- Salamone, J.D.¹

47
- 0032081988
- Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning
- (1998) Nature Neuroscience , vol.1 , pp. 155-159
- Schoenbaum, G.¹ Chiba, A.A.² Gallagher, M.³

48
- 0033103761
- Neural encoding in orbitofrontal cortex and basolateral amygdala during olfactory discrimination learning
- (1999) Journal of Neuroscience , vol.19 , pp. 1876-1884
- Schoenbaum, G.¹ Chiba, A.A.² Gallagher, M.³

49
- 0000444602
- Activity of dopamine neurons in the behaving primate
- (1992) Seminars in the Neurosciences , vol.4 , pp. 129-138
- Schultz, W.¹

50
- 0031867046
- Predictive reward signal of dopamine neurons
- (1998) Journal of Neurophysiology , vol.80 , pp. 1-27
- Schultz, W.¹

51
- 0025216214
- Dopamine neurons of the monkey midbrain, contingencies of responses to stimuli eliciting immediate behavioral reactions
- (1990) Journal of Neuroscience , vol.63 , pp. 607-624
- Schultz, W.¹ Romo, R.²

52
- 0027468102
- Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task
- (1993) Journal of Neuroscience , vol.13 , pp. 900-913
- Schultz, W.¹ Apicella, P.² Ljungberg, T.³

53
- 0030896968
- A neural substrate of prediction and reward
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.R.³

54
- 0034061495
- Reward processing in primate orbitofrontal cortex and basal ganglia
- (2000) Cerebral Cortex , vol.10 , pp. 272-283
- Schultz, W.¹ Tremblay, L.² Hollerman, J.R.³

55
- 0016045280
- An opponent-process theory of motivation. I. Temporal dynamics of affect
- (1974) Psychological Review , vol.81 , pp. 119-145
- Solomon, R.L.¹ Corbit, J.D.²

56
- 0036592034
- TD models of reward predictive responses in dopamine neurons
- (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 523-533
- Suri, R.E.¹

57
- 0032930935
- A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
- (1999) Neuroscience , vol.91 , pp. 871-890
- Suri, R.E.¹ Schultz, W.²

58
- 33847202724
- Learning to predict by the methods of temporal difference
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

59
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- (1990) Machine Learning, Proceedings of the Seventh International Conference , pp. 216-224
- Sutton, R.S.¹

60
- 2142828109
- Reinforcement learning: An introduction, Cambridge, MA: MIT Press
- (1998)
- Sutton, R.S.¹ Barto, A.G.²

61
- 0034036369
- Reward-related neuronal activity during go-nogo task performance in primate orbitofrontal cortex
- (2000) Journal of Neurophysiology , vol.83 , pp. 1864-1876
- Tremblay, L.¹ Schultz, W.²

62
- 0034117036
- Modifications of reward expectation-related neuronal activity during learning in primate orbitofrontal cortex
- (2000) Journal of Neurophysiology , vol.83 , pp. 1877-1885
- Tremblay, L.¹ Schultz, W.²

63
- 0035811464
- Dopamine responses comply with basic assumptions of formal learning theory
- (2001) Nature , vol.412 , pp. 43-48
- Waelti, P.¹ Dickinson, A.² Schultz, W.³

64
- 0029985962
- Covert orienting of attention in the rat and the role of striatal dopamine
- (1996) Journal of Neuroscience , vol.16 , pp. 3082-3088
- Ward, N.M.¹ Brown, V.J.²

65
- 2142660332
- Watkins, C. J. C. H (1989). Learning from delayed rewards. PhD dissertation, University of Cambridge.

66
- 0029851654
- Excitotoxic lesions of the basolateral amygdala impair the acquisition of cocaine-seeking behaviour under a second-order schedule of reinforcement
- (1996) Psychopharmacology , vol.127 , pp. 213-224
- Whitelaw, R.B.¹ Markou, A.² Robbins, T.W.³ Everitt, B.J.⁴

67
- 0029116316
- Modulation of memory fields by dopamine D1 receptors in prefrontal cortex
- (1995) Nature , vol.376 , pp. 572-575
- Williams, G.V.¹ Goldman-Rakic, P.S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.