SCOPUS 정보 검색 플랫폼

Neural Computation

Volumn 18, Issue 7, 2006, Pages 1637-1677

Representation and timing in theories of the dopamine system

(3) Daw, Nathaniel D a Courville, Aaron C b Touretzky, David S c

a UNIVERSITY COLLEGE LONDON (United Kingdom)

b CARNEGIE MELLON UNIVERSITY (United States)

c CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DOPAMINE;

ALGORITHM; ANIMAL; ARTICLE; ARTIFICIAL NEURAL NETWORK; BIOLOGICAL MODEL; COMPARATIVE STUDY; HUMAN; NERVE CELL; NERVE CELL NETWORK; PHYSIOLOGY; REACTION TIME; REWARD; STATISTICAL MODEL; TIME;

ALGORITHMS; ANIMALS; DOPAMINE; HUMANS; MODELS, NEUROLOGICAL; MODELS, STATISTICAL; NERVE NET; NEURAL NETWORKS (COMPUTER); NEURONS; REACTION TIME; REWARD; TIME FACTORS;

EID: 33745787929 PISSN: 08997667 EISSN: 1530888X Source Type: Journal
DOI: 10.1162/neco.2006.18.7.1637 Document Type: Article

Times cited : (128)

References (92)

1
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- Baum, L. E., Petrie, T., Soulds, G., & Weiss, N. (1970). A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Annals of Mathematical Statistics, 41, 164-171.
- (1970) Annals of Mathematical Statistics , vol.41 , pp. 164-171
- Baum, L.E.¹ Petrie, T.² Soulds, G.³ Weiss, N.⁴

2
- 21544435722
- Midbrain dopamine neurons encode a quantitative reward prediction error signal
- Bayer, H. M., & Glimcher, P. W. (2005). Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron, 47, 129-141.
- (2005) Neuron , vol.47 , pp. 129-141
- Bayer, H.M.¹ Glimcher, P.W.²

3
- 0006101544
- Mechanisms of feature-positive and feature-negative discrimination learning in an appetitive conditioning paradigm
- N. A. Schmajuk & P. C. Holland (Eds.). Washington, DC: American Psychological Association
- Bouton, M. E., & Nelson, J. B. (1998). Mechanisms of feature-positive and feature-negative discrimination learning in an appetitive conditioning paradigm. In N. A. Schmajuk & P. C. Holland (Eds.), Occasion setting: Associative learning and cognition in animals (pp. 69-112). Washington, DC: American Psychological Association.
- (1998) Occasion Setting: Associative Learning and Cognition in Animals , pp. 69-112
- Bouton, M.E.¹ Nelson, J.B.²

4
- 85150714688
- Reinforcement learning methods for continuous-time Markov decision problems
- G. Tesauro, D. S. Touretzky, & T. K. Leen (Eds.). Cambridge, MA: MIT Press
- Bradtke, S. J., & Duff, M. O. (1995). Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. S. Touretzky, & T. K. Leen (Eds.), Advances in neural information processing systems, 7 (pp. 393-400). Cambridge, MA: MIT Press.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 393-400
- Bradtke, S.J.¹ Duff, M.O.²

5
- 0033508899
- How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues
- Brown, J., Bullock, D., & Grossberg, S. (1999). How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues. Journal of Neuroscience, 19(23), 10502-10511.
- (1999) Journal of Neuroscience , vol.19 , Issue.23 , pp. 10502-10511
- Brown, J.¹ Bullock, D.² Grossberg, S.³

6
- 0026998041
- Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
- San Jose, CA: AAAI Press
- Chrisman, L. (1992). Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In Proceedings of the Tenth National Conference on Artificial Intelligence (AAAI-92) (pp. 183-188). San Jose, CA: AAAI Press.
- (1992) Proceedings of the Tenth National Conference on Artificial Intelligence (AAAI-92) , pp. 183-188
- Chrisman, L.¹

7
- 33746365200
- Model uncertainty in classical conditioning
- S. Thrun, L. K. Saul, & B. Schölkopf (Eds.), Cambridge, MA: MIT Press
- Courville, A. C., Daw, N. D., Gordon, G. J., & Touretzky, D. S. (2003). Model uncertainty in classical conditioning. In S. Thrun, L. K. Saul, & B. Schölkopf (Eds.), Advances in neural information processing systems, 16 Cambridge, MA: MIT Press.
- (2003) Advances in Neural Information Processing Systems , vol.16
- Courville, A.C.¹ Daw, N.D.² Gordon, G.J.³ Touretzky, D.S.⁴

8
- 33750189183
- Similarity and discrimination in classical conditioning: A latent variable account
- L. K. Saul, Y. Weiss, & L. Bottou (Eds.). Cambridge, MA: MIT Press
- Courville, A. C., Daw, N. D., & Touretzky, D. S. (2004). Similarity and discrimination in classical conditioning: A latent variable account. In L. K. Saul, Y. Weiss, & L. Bottou (Eds.), Advances in neural information processing systems, 17. Cambridge, MA: MIT Press.
- (2004) Advances in Neural Information Processing Systems , vol.17
- Courville, A.C.¹ Daw, N.D.² Touretzky, D.S.³

9
- 84899024060
- Modeling temporal structure in classical conditioning
- T. G. Dietterich, S. Becker, & Z. Ghahramani (Eds.). Cambridge, MA: MIT Press
- Courville, A. C., & Touretzky, D. S. (2001). Modeling temporal structure in classical conditioning. In T. G. Dietterich, S. Becker, & Z. Ghahramani (Eds.), Advances in neural information processing systems, 14 (pp. 3-10). Cambridge, MA: MIT Press.
- (2001) Advances in Neural Information Processing Systems , vol.14 , pp. 3-10
- Courville, A.C.¹ Touretzky, D.S.²

10
- 0032643313
- Solving semi-Markov decision problems using average reward reinforcement learning
- Das, T., Gosavi, A., Mahadevan, S., & Marchalleck, N. (1999). Solving semi-Markov decision problems using average reward reinforcement learning. Management Science, 45, 560-574.
- (1999) Management Science , vol.45 , pp. 560-574
- Das, T.¹ Gosavi, A.² Mahadevan, S.³ Marchalleck, N.⁴

11
- 13244266174
- Unpublished doctoral dissertation, School of Computer Science, Carnegie Mellon University
- Daw, N. D. (2003). Reinforcement learning models of the dopamine system and their behavioral implications. Unpublished doctoral dissertation, School of Computer Science, Carnegie Mellon University.
- (2003) Reinforcement Learning Models of the Dopamine System and Their Behavioral Implications
- Daw, N.D.¹

12
- 0036592008
- Opponent interactions between serotonin and dopamine
- Daw, N. D., Kakade, S., & Dayan, P. (2002). Opponent interactions between serotonin and dopamine. Neural Networks, 15, 603-616.
- (2002) Neural Networks , vol.15 , pp. 603-616
- Daw, N.D.¹ Kakade, S.² Dayan, P.³

13
- 33644763667
- Actions, values, policies, and the basal ganglia
- (in press). E. Bezard (Ed.). New York: Nova Science
- Daw, N. D., Niv, Y., & Dayan, P. (in press). Actions, values, policies, and the basal ganglia. In E. Bezard (Ed.), Recent breakthroughs in basal ganglia research. New York: Nova Science.
- Recent Breakthroughs in Basal Ganglia Research
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

14
- 28044450875
- Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
- Daw, N. D., Niv, Y., & Dayan, P. (2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience.
- (2005) Nature Neuroscience
- Daw, N.D.¹ Niv, Y.² Dayan, P.³

15
- 0036835734
- Long-term reward prediction in TD models of the dopamine system
- Daw, N. D., & Touretzky, D. S. (2002). Long-term reward prediction in TD models of the dopamine system. Neural Computation, 14, 2567-2583.
- (2002) Neural Computation , vol.14 , pp. 2567-2583
- Daw, N.D.¹ Touretzky, D.S.²

16
- 33745795105
- Contrasting neuronal correlates between dorsal and ventral striatum in the rat
- Daw, N., Touretzky, D., & Skaggs, W. (2004). Contrasting neuronal correlates between dorsal and ventral striatum in the rat. In Cosyne04 Computational and Systems Neuroscience Abstracts, Vol. 1.
- (2004) Cosyne04 Computational and Systems Neuroscience Abstracts , vol.1
- Daw, N.¹ Touretzky, D.² Skaggs, W.³

17
- 84899017487
- Motivated reinforcement learning
- T. Dietterich, S. Decker, & Z. Ghahramani (Eds.). Cambridge, MA: MIT Press
- Dayan, P. (2002). Motivated reinforcement learning. In T. Dietterich, S. Decker, & Z. Ghahramani (Eds.), Advances in neural information processing systems, 14 (pp. 11-18). Cambridge, MA: MIT Press.
- (2002) Advances in Neural Information Processing Systems , vol.14 , pp. 11-18
- Dayan, P.¹

18
- 0037057808
- Reward, motivation and reinforcement learning
- Dayan, P., & Balleine, B. W. (2002). Reward, motivation and reinforcement learning. Neuron, 36, 285-298.
- (2002) Neuron , vol.36 , pp. 285-298
- Dayan, P.¹ Balleine, B.W.²

19
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society B, 39, 1-38.
- (1977) Journal of the Royal Statistical Society B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

20
- 35648958957
- Bayesian inference in spiking neurons
- L. K. Saul, Y. Weiss, & L. Bottou (Eds.). Cambridge, MA: MIT Press
- Deneve, S. (2004). Bayesian inference in spiking neurons. In L. K. Saul, Y. Weiss, & L. Bottou (Eds.), Advances in neural information processing systems, 17. Cambridge, MA: MIT Press.
- (2004) Advances in Neural Information Processing Systems , vol.17
- Deneve, S.¹

21
- 0043250430
- The role of learning in motivation
- C. R. Gallistel (Ed.). New York: Wiley
- Dickinson, A., & Balleine, B. (2002). The role of learning in motivation. In C. R. Gallistel (Ed.), Stevens' handbook of experimental psychology (3rd ed.), Vol. 3: Learning, motivation and emotion (pp. 497-533). New York: Wiley.
- (2002) Stevens' Handbook of Experimental Psychology (3rd Ed.), Vol. 3: Learning, Motivation and Emotion , vol.3 , pp. 497-533
- Dickinson, A.¹ Balleine, B.²

22
- 0000068150
- Surprise and the attenuation of blocking
- Dickinson, A., Hall, G., & Mackintosh, N. J. (1976). Surprise and the attenuation of blocking. Journal of Experimental Psychology: Animal Behavior Processes, 2, 313-322.
- (1976) Journal of Experimental Psychology: Animal Behavior Processes , vol.2 , pp. 313-322
- Dickinson, A.¹ Hall, G.² Mackintosh, N.J.³

23
- 0007567512
- Reinforcer specificity in the enhancement of conditioning by posttrial surprise
- Dickinson, A., & Mackintosh, N. J. (1979). Reinforcer specificity in the enhancement of conditioning by posttrial surprise. Journal of Experimental Psychology: Animal Behavior Processes, 5, 162-177.
- (1979) Journal of Experimental Psychology: Animal Behavior Processes , vol.5 , pp. 162-177
- Dickinson, A.¹ Mackintosh, N.J.²

24
- 0033913868
- Dissociation of Pavlovian and instrumental incentive learning under dopamine antagonists
- Dickinson, A., Smith, J., & Mirenowicz, J. (2000). Dissociation of Pavlovian and instrumental incentive learning under dopamine antagonists. Behavioral Neuroscience, 114, 468-483.
- (2000) Behavioral Neuroscience , vol.114 , pp. 468-483
- Dickinson, A.¹ Smith, J.² Mirenowicz, J.³

25
- 0033213819
- What are the computations in the cerebellum, the basal ganglia, and the cerebral cortex?
- Doya, K. (1999). What are the computations in the cerebellum, the basal ganglia, and the cerebral cortex? Neural Networks, 12, 961-974.
- (1999) Neural Networks , vol.12 , pp. 961-974
- Doya, K.¹

26
- 0034524427
- Complementary roles of basal ganglia and cerebellum in learning and motor control
- Doya, K. (2000). Complementary roles of basal ganglia and cerebellum in learning and motor control. Current Opinion in Neurobiology, 10, 732-739.
- (2000) Current Opinion in Neurobiology , vol.10 , pp. 732-739
- Doya, K.¹

27
- 15244346900
- Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation
- Faure, A., Haberland, U., Condé, F., & Massioui, N. E. (2005). Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation. Journal of Neuroscience, 25, 2771-2780.
- (2005) Journal of Neuroscience , vol.25 , pp. 2771-2780
- Faure, A.¹ Haberland, U.² Condé, F.³ Massioui, N.E.⁴

28
- 33745774575
- The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence
- Fiorillo, C. D., & Schultz, W. (2001). The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence. In Society for Neuroscience Abstracts, 27, 827.5.
- (2001) Society for Neuroscience Abstracts , vol.27
- Fiorillo, C.D.¹ Schultz, W.²

29
- 0037459319
- Discrete coding of reward probability and uncertainty by dopamine neurons
- Fiorillo, C. D., Tobler, P. N., & Schultz, W. (2003). Discrete coding of reward probability and uncertainty by dopamine neurons. Science, 299, 1898-1902.
- (2003) Science , vol.299 , pp. 1898-1902
- Fiorillo, C.D.¹ Tobler, P.N.² Schultz, W.³

30
- 0034169238
- Time, rate and conditioning
- Gallistel, C. R., & Gibbon, J. (2000). Time, rate and conditioning. Psychological Review, 107(2), 289-344.
- (2000) Psychological Review , vol.107 , Issue.2 , pp. 289-344
- Gallistel, C.R.¹ Gibbon, J.²

31
- 1642496799
- Sources of variability and systematic error in mouse timing behavior
- Gallistel, C. R., King, A., & McDonald, R. (2004). Sources of variability and systematic error in mouse timing behavior. Journal of Experimental Psychology: Animal Behavior Processes, 30, 3-16.
- (2004) Journal of Experimental Psychology: Animal Behavior Processes , vol.30 , pp. 3-16
- Gallistel, C.R.¹ King, A.² McDonald, R.³

32
- 0344787262
- Scalar expectancy theory and Weber's law in animal timing
- Gibbon, J. (1977). Scalar expectancy theory and Weber's law in animal timing. Psychological Review, 84, 279-325.
- (1977) Psychological Review , vol.84 , pp. 279-325
- Gibbon, J.¹

33
- 0037057757
- Banburismus and the brain: Decoding the relationship between sensory stimuli, decisions, and reward
- Gold, J. I., & Shadlen, M. N. (2002). Banburismus and the brain: Decoding the relationship between sensory stimuli, decisions, and reward. Neuron, 36, 299-308.
- (2002) Neuron , vol.36 , pp. 299-308
- Gold, J.I.¹ Shadlen, M.N.²

34
- 0000272386
- Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme
- Guedon, Y., & Cocozza-Thivent, C. (1990). Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme. Computer Speech and Language, 4, 167-192.
- (1990) Computer Speech and Language , vol.4 , pp. 167-192
- Guedon, Y.¹ Cocozza-Thivent, C.²

35
- 0024040120
- Excitation and inhibition in unblocking
- Holland, P. C. (1988). Excitation and inhibition in unblocking. Journal of Experimental Psychology: Animal Behavior Processes, 14, 261-279.
- (1988) Journal of Experimental Psychology: Animal Behavior Processes , vol.14 , pp. 261-279
- Holland, P.C.¹

36
- 18744369990
- Variations in unconditioned stimulus processing in unblocking
- Holland, P. C., & Kenmuir, C. (2005). Variations in unconditioned stimulus processing in unblocking. Journal of Experimental Psychology: Animal Behavior Processes, 31, 155-171.
- (2005) Journal of Experimental Psychology: Animal Behavior Processes , vol.31 , pp. 155-171
- Holland, P.C.¹ Kenmuir, C.²

37
- 0032942327
- Hippocampal lesions interfere with Pavlovian negative occasion setting
- Holland, P. C., Lamoureux, J. A., Han, J., & Gallagher, M. (1999). Hippocampal lesions interfere with Pavlovian negative occasion setting. Hippocampus, 9, 143-157.
- (1999) Hippocampus , vol.9 , pp. 143-157
- Holland, P.C.¹ Lamoureux, J.A.² Han, J.³ Gallagher, M.⁴

38
- 33644688754
- Dopamine neurons report an error in the temporal prediction of reward during learning
- Hollerman, J. R., & Schultz, W. (1998). Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience, 1, 304-309.
- (1998) Nature Neuroscience , vol.1 , pp. 304-309
- Hollerman, J.R.¹ Schultz, W.²

39
- 0002861883
- A model of how the basal ganglia generate and use neural signals that predict reinforcement
- J. C. Houk, J. L. Davis, & D. G. Beiser (Eds.). Cambridge, MA: MIT Press
- Houk, J. C., Adams, J. L., & Barto, A. G. (1995). A model of how the basal ganglia generate and use neural signals that predict reinforcement. In J. C. Houk, J. L. Davis, & D. G. Beiser (Eds.), Models of information processing in the basal ganglia (pp. 249-270). Cambridge, MA: MIT Press.
- (1995) Models of Information Processing in the Basal Ganglia , pp. 249-270
- Houk, J.C.¹ Adams, J.L.² Barto, A.G.³

40
- 0036592026
- Actor-critic models of the basal ganglia: New anatomical and computational perspectives
- Joel, D., Niv, Y., & Ruppin, E. (2002). Actor-critic models of the basal ganglia: New anatomical and computational perspectives. Neural Networks, 15, 535-547.
- (2002) Neural Networks , vol.15 , pp. 535-547
- Joel, D.¹ Niv, Y.² Ruppin, E.³

41
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L. P., Littman, M. L., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

42
- 84898950247
- Acquisition in autoshaping
- S. A. Solla, T. K. Leen, & K. R. Muller (Eds.). Cambridge, MA: MIT Press
- Kakade, S., & Dayan, P. (2000). Acquisition in autoshaping. In S. A. Solla, T. K. Leen, & K. R. Muller (Eds.), Advances in neural information processing systems, 12. Cambridge, MA: MIT Press.
- (2000) Advances in Neural Information Processing Systems , vol.12
- Kakade, S.¹ Dayan, P.²

43
- 85047672086
- Acquisition and extinction in autoshaping
- Kakade, S., & Dayan, P. (2002). Acquisition and extinction in autoshaping. Psychological Review, 109, 533-544.
- (2002) Psychological Review , vol.109 , pp. 533-544
- Kakade, S.¹ Dayan, P.²

44
- 0023988664
- A behavioral theory of timing
- Killeen, P. R., & Fetterman, J. G. (1988). A behavioral theory of timing. Psychological Review, 95, 274-295.
- (1988) Psychological Review , vol.95 , pp. 274-295
- Killeen, P.R.¹ Fetterman, J.G.²

45
- 33745784599
- Action-selection in temporally dependent phenomena using temporal difference learning over a collective belief structure
- Kurth-Nelson, Z., & Redish, A. (2004). μagents: Action-selection in temporally dependent phenomena using temporal difference learning over a collective belief structure. Society for Neuroscience Abstracts, 30, 207.1.
- (2004) Society for Neuroscience Abstracts , vol.30
- Kurth-Nelson, Z.¹ Redish, A.²

46
- 0022685753
- Continuously variable duration hidden Markov models for automatic speech recognition
- Levinson, S. E. (1986). Continuously variable duration hidden Markov models for automatic speech recognition. Computer Speech and Language, 1, 29-45.
- (1986) Computer Speech and Language , vol.1 , pp. 29-45
- Levinson, S.E.¹

47
- 0036212160
- Efficient coding of natural sounds
- Lewicki, M. S. (2002). Efficient coding of natural sounds. Nature Neuroscience, 5, 356-363.
- (2002) Nature Neuroscience , vol.5 , pp. 356-363
- Lewicki, M.S.¹

48
- 0032606945
- A probabilistic framework for the adaptation and comparison of image codes
- Lewicki, M. S., & Olshausen, B. A. (1999). A probabilistic framework for the adaptation and comparison of image codes. Journal of the Optical Society of America A: Optics, Image Science, and Vision, 16, 1587-1601.
- (1999) Journal of the Optical Society of America A: Optics, Image Science, and Vision , vol.16 , pp. 1587-1601
- Lewicki, M.S.¹ Olshausen, B.A.²

49
- 0026505520
- Responses of monkey dopamine neurons during learning of behavioral reactions
- Ljungberg, T., Apicella, P., & Schultz, W. (1992). Responses of monkey dopamine neurons during learning of behavioral reactions. Journal of Neurophysiology, 67, 145-163.
- (1992) Journal of Neurophysiology , vol.67 , pp. 145-163
- Ljungberg, T.¹ Apicella, P.² Schultz, W.³

50
- 0031110595
- Learning the temporal dynamics of behavior
- Machado, A. (1997). Learning the temporal dynamics of behavior. Psychological Review, 104, 241-265.
- (1997) Psychological Review , vol.104 , pp. 241-265
- Machado, A.¹

51
- 0001963197
- Self-improving factory simulation using continuous-time average-reward reinforcement learning
- San Mateo, CA: Morgan Kaufmann
- Mahadevan, S., Marchalleck, N., Das, T., & Gosavi, A. (1997). Self-improving factory simulation using continuous-time average-reward reinforcement learning. In Proceedings of the 14th International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann.
- (1997) Proceedings of the 14th International Conference on Machine Learning
- Mahadevan, S.¹ Marchalleck, N.² Das, T.³ Gosavi, A.⁴

52
- 0033013403
- Reinforcement-induced within-trial resetting of an internal clock
- Matell, M. S., & Meek, W. H. (1999). Reinforcement-induced within-trial resetting of an internal clock. Behavioural Processes, 45, 159-171.
- (1999) Behavioural Processes , vol.45 , pp. 159-171
- Matell, M.S.¹ Meek, W.H.²

53
- 0142058800
- A computational substrate for incentive salience
- McClure, S. M., Daw, N. D., & Montague, P. R. (2003). A computational substrate for incentive salience. Trends in Neurosciences, 26, 423-428.
- (2003) Trends in Neurosciences , vol.26 , pp. 423-428
- McClure, S.M.¹ Daw, N.D.² Montague, P.R.³

54
- 0029156181
- Timing processes in the reinforcement-omission effect
- Mellon, R. C., Leak, T. M., Fairhurst, S., & Gibbon, J. (1995). Timing processes in the reinforcement-omission effect. Animal Learning and Behavior, 23, 286-296.
- (1995) Animal Learning and Behavior , vol.23 , pp. 286-296
- Mellon, R.C.¹ Leak, T.M.² Fairhurst, S.³ Gibbon, J.⁴

55
- 0030026069
- Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli
- Mirenowicz, J., & Schultz, W. (1996). Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli. Nature, 379, 449-451.
- (1996) Nature , vol.379 , pp. 449-451
- Mirenowicz, J.¹ Schultz, W.²

56
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive Hebbian learning
- Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. Journal of Neuroscience, 16, 1936-1947.
- (1996) Journal of Neuroscience , vol.16 , pp. 1936-1947
- Montague, P.R.¹ Dayan, P.² Sejnowski, T.J.³

57
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- Moore, A. W., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13, 103-130.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

58
- 3242673464
- Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons
- Morris, G., Arkadir, D., Nevet, A., Vaadia, E., & Bergman, H. (2004). Coincident but distinct messages of midbrain dopamine and striatal tonically active neurons. Neuron, 43, 133-143.
- (2004) Neuron , vol.43 , pp. 133-143
- Morris, G.¹ Arkadir, D.² Nevet, A.³ Vaadia, E.⁴ Bergman, H.⁵

59
- 33745774340
- How fast to work: Response vigor, motivation, and tonic dopamine
- L. K. Saul, Y. Weiss, & L. Bottou (Eds.). Cambridge, MA: MIT Press
- Niv, Y., Daw, N. D., & Dayan, P. (2005). How fast to work: Response vigor, motivation, and tonic dopamine. In L. K. Saul, Y. Weiss, & L. Bottou (Eds.), Advances in neural information processing systems, 17. Cambridge, MA: MIT Press.
- (2005) Advances in Neural Information Processing Systems , vol.17
- Niv, Y.¹ Daw, N.D.² Dayan, P.³

60
- 33745773269
- The effects of uncertainty on TD learning
- Niv, Y., Duff, M. O., & Dayan, P. (2004). The effects of uncertainty on TD learning. In Cosyne04 - Computational and Systems Neuroscience Abstracts, vol. 1.
- (2004) Cosyne04 - Computational and Systems Neuroscience Abstracts , vol.1
- Niv, Y.¹ Duff, M.O.² Dayan, P.³

61
- 26444446315
- Dopamine, uncertainty, and TD learning
- Niv, Y., Duff, M. O., & Dayan, P. (2005). Dopamine, uncertainty, and TD learning. Behavioral and Brain Functions, 1, 6.
- (2005) Behavioral and Brain Functions , vol.1 , pp. 6
- Niv, Y.¹ Duff, M.O.² Dayan, P.³

62
- 1942520195
- Dissociable roles of ventral and dorsal striatum in instrumental conditioning
- O'Doherty, J., Dayan, P., Schultz, J., Deichmann, R., Fristen, K., & Dolan, R. J. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304, 452-154.
- (2004) Science , vol.304 , pp. 452-1154
- O'Doherty, J.¹ Dayan, P.² Schultz, J.³ Deichmann, R.⁴ Fristen, K.⁵ Dolan, R.J.⁶

63
- 0030722121
- Cognitive planning in humans: Neuropsychological, neuroanatomical and neuropharmacological perspectives
- Owen, A. M. (1997). Cognitive planning in humans: Neuropsychological, neuroanatomical and neuropharmacological perspectives. Progress in Neurobiology, 53, 431-150.
- (1997) Progress in Neurobiology , vol.53 , pp. 431-1150
- Owen, A.M.¹

64
- 21544455210
- Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward-learning network
- Pan, W. X., Schmidt, R., Wickens, J., & Hyland, B. (2005). Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward-learning network. Journal of Neuroscience, 25, 6235-6242.
- (2005) Journal of Neuroscience , vol.25 , pp. 6235-6242
- Pan, W.X.¹ Schmidt, R.² Wickens, J.³ Hyland, B.⁴

65
- 0037010809
- Nucleus accumbens dopamine depletion impairs both acquisition and performance of appetitive Pavlovian approach behaviour: Implications for mesoaccumbens dopamine function
- Parkinson, J. A., Dalley, J. W., Cardinal, R. N., Bamford, A., Fehnert, B., Lachenal, G., Rudarakanchana, N., Halkerston, K., Robbins, T. W., & Everitt, B. J. (2002). Nucleus accumbens dopamine depletion impairs both acquisition and performance of appetitive Pavlovian approach behaviour: Implications for mesoaccumbens dopamine function. Behavioral Brain Research, 137, 149-163.
- (2002) Behavioral Brain Research , vol.137 , pp. 149-163
- Parkinson, J.A.¹ Dalley, J.W.² Cardinal, R.N.³ Bamford, A.⁴ Fehnert, B.⁵ Lachenal, G.⁶ Rudarakanchana, N.⁷ Halkerston, K.⁸ Robbins, T.W.⁹ Everitt, B.J.¹⁰

66
- 23844511443
- Hierarchical Bayesian inference in networks of spiking neurons
- L. K. Saul, Y. Weiss, & L. Bottou (Eds.). Cambridge, MA: MIT Press
- Rao, R. P. N. (2004). Hierarchical Bayesian inference in networks of spiking neurons. In L. K. Saul, Y. Weiss, & L. Bottou (Eds.), Advances in neural information processing systems, 17. Cambridge, MA: MIT Press.
- (2004) Advances in Neural Information Processing Systems , vol.17
- Rao, R.P.N.¹

67
- 0012586376
- Cambridge, MA: MIT Press
- Rao, R. P. N., Olshausen, B. A., & Lewicki, M. S. (2002). Probabilistic models of the brain: Perception and neural function. Cambridge, MA: MIT Press.
- (2002) Probabilistic Models of the Brain: Perception and Neural Function
- Rao, R.P.N.¹ Olshausen, B.A.² Lewicki, M.S.³

68
- 0242440823
- Correlated coding of motivation and outcome of decision by dopamine neurons
- Satoh, T., Nakai, S., Sato, T., & Kimura, M. (2003). Correlated coding of motivation and outcome of decision by dopamine neurons. Journal of Neuroscience, 23, 9913-9923.
- (2003) Journal of Neuroscience , vol.23 , pp. 9913-9923
- Satoh, T.¹ Nakai, S.² Sato, T.³ Kimura, M.⁴

69
- 0031867046
- Predictive reward signal of dopamine neurons
- Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neuro-physiology, 80, 1-27.
- (1998) Journal of Neuro-physiology , vol.80 , pp. 1-27
- Schultz, W.¹

70
- 0027468102
- Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task
- Schultz, W., Apicella, P., & Ljungberg, T. (1993). Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. Journal of Neuroscience, 13, 900-913.
- (1993) Journal of Neuroscience , vol.13 , pp. 900-913
- Schultz, W.¹ Apicella, P.² Ljungberg, T.³

71
- 0030896968
- A neural substrate of prediction and reward
- Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1599.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.R.³

72
- 0025216214
- Dopamine neurons of the monkey midbrain: Contingencies of responses to stimuli eliciting immediate behavioral reactions
- Schultz, W., & Romo, R. (1990). Dopamine neurons of the monkey midbrain: Contingencies of responses to stimuli eliciting immediate behavioral reactions. Journal of Neurophysiology, 63, 607-624.
- (1990) Journal of Neurophysiology , vol.63 , pp. 607-624
- Schultz, W.¹ Romo, R.²

73
- 13244284646
- A computational model of the functional role of the ventral-striatal D2 receptor in the expression of previously acquired behaviors
- Smith, A. J., Decker, S., & Kapur, S. (2005). A computational model of the functional role of the ventral-striatal D2 receptor in the expression of previously acquired behaviors. Neural Computation, 17, 361-395.
- (2005) Neural Computation , vol.17 , pp. 361-395
- Smith, A.J.¹ Decker, S.² Kapur, S.³

74
- 0141619496
- Operant conditioning
- Staddon, J. E. R., & Cerutti, D. T. (2003). Operant conditioning. Annual Reviews of Psychology, 54, 115-144.
- (2003) Annual Reviews of Psychology , vol.54 , pp. 115-144
- Staddon, J.E.R.¹ Cerutti, D.T.²

75
- 0033089272
- Time and memory: Towards a pacemaker-free theory of interval timing
- Staddon, J. E. R., & Higa, J. J. (1999). Time and memory: Towards a pacemaker-free theory of interval timing. Journal of the Experimental Analysis of Behavior, 71, 215-251.
- (1999) Journal of the Experimental Analysis of Behavior , vol.71 , pp. 215-251
- Staddon, J.E.R.¹ Higa, J.J.²

76
- 84990001255
- Reinforcement omission on fixed-interval schedules
- Staddon, J. E., & Innis, N. K. (1969). Reinforcement omission on fixed-interval schedules. Journal of the Experimental Analysis of Behavior, 12, 689-700.
- (1969) Journal of the Experimental Analysis of Behavior , vol.12 , pp. 689-700
- Staddon, J.E.¹ Innis, N.K.²

77
- 0035726809
- Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model
- Suri, R. E. (2001). Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model. Experimental Brain Research, 140, 234-240.
- (2001) Experimental Brain Research , vol.140 , pp. 234-240
- Suri, R.E.¹

78
- 0031854385
- Learning of sequential movements with dopamine-like reinforcement signal in neural network model
- Suri, R. E., & Schultz, W. (1998). Learning of sequential movements with dopamine-like reinforcement signal in neural network model. Experimental Brain Research, 121, 350-354.
- (1998) Experimental Brain Research , vol.121 , pp. 350-354
- Suri, R.E.¹ Schultz, W.²

79
- 0032930935
- A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task
- Suri, R. E., & Schultz, W. (1999). A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience, 91, 871-890.
- (1999) Neuroscience , vol.91 , pp. 871-890
- Suri, R.E.¹ Schultz, W.²

80
- 0003617454
- Unpublished doctoral dissertation, University of Massachusetts
- Sutton, R. S. (1984). Temporal credit assignment in reinforcement learning. Unpublished doctoral dissertation, University of Massachusetts.
- (1984) Temporal Credit Assignment in Reinforcement Learning
- Sutton, R.S.¹

81
- 33847202724
- Learning to predict by the method of temporal differences
- Sutton, R. S. (1988). Learning to predict by the method of temporal differences. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

82
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- San Mateo, CA: Morgan Kaufmann
- Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proceedings of the Seventh International Conference on Machine Learning, (pp. 216-224). San Mateo, CA: Morgan Kaufmann.
- (1990) Proceedings of the Seventh International Conference on Machine Learning , pp. 216-224
- Sutton, R.S.¹

83
- 0003066891
- Time-derivative models of Pavlovian reinforcement
- M. Gabriel & J. Moore (Eds.). Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1990). Time-derivative models of Pavlovian reinforcement. In M. Gabriel & J. Moore (Eds.), Learning and computational neuroscience: Foundations of adaptive networks (pp. 497-537). Cambridge, MA: MIT Press.
- (1990) Learning and Computational Neuroscience: Foundations of Adaptive Networks , pp. 497-537
- Sutton, R.S.¹ Barto, A.G.²

84
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

85
- 0742306447
- Kalman filter control embedded into the reinforcement learning framework
- Szita, I., & Lorincz, S. (2004). Kalman filter control embedded into the reinforcement learning framework. Neural Computation, 16, 491-499.
- (2004) Neural Computation , vol.16 , pp. 491-499
- Szita, I.¹ Lorincz, S.²

86
- 0345255891
- Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm
- Tobler, P., Dickinson, A., & Schultz, W. (2003). Coding of predicted reward omission by dopamine neurons in a conditioned inhibition paradigm. Journal of Neuro-science, 23, 10402-10410.
- (2003) Journal of Neuro-science , vol.23 , pp. 10402-10410
- Tobler, P.¹ Dickinson, A.² Schultz, W.³

87
- 0036832957
- On average versus discounted reward temporal-difference learning
- Tsitsiklis, J. N., & Van Roy, B. (2002). On average versus discounted reward temporal-difference learning. Machine Learning, 49, 179-191.
- (2002) Machine Learning , vol.49 , pp. 179-191
- Tsitsiklis, J.N.¹ Van Roy, B.²

88
- 1642404961
- Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli
- Ungless, M. A., Magill, P. J., & Bolam, J. P. (2004). Uniform inhibition of dopamine neurons in the ventral tegmental area by aversive stimuli. Science, 303, 2040-2042.
- (2004) Science , vol.303 , pp. 2040-2042
- Ungless, M.A.¹ Magill, P.J.² Bolam, J.P.³

89
- 3242669835
- Putting a spin on the dorsal-ventral divide of the striatum
- Voorn, P., Vanderschuren, L. J., Groenewegen, H. J., Robbins, T. W., & Pennartz, C. M. (2004). Putting a spin on the dorsal-ventral divide of the striatum. Trends in Neuroscience, 27, 468-474.
- (2004) Trends in Neuroscience , vol.27 , pp. 468-474
- Voorn, P.¹ Vanderschuren, L.J.² Groenewegen, H.J.³ Robbins, T.W.⁴ Pennartz, C.M.⁵

90
- 0035811464
- Dopamine responses comply with basic assumptions of formal learning theory
- Waelti, P., Dickinson, A., & Schultz, W. (2001). Dopamine responses comply with basic assumptions of formal learning theory. Nature, 412, 43-48.
- (2001) Nature , vol.412 , pp. 43-48
- Waelti, P.¹ Dickinson, A.² Schultz, W.³

91
- 0028520185
- Second-order conditioning and Pavlovian conditioned inhibition: Operational similarities and differences
- Yin, H., Barnet, R. C., & Miller, R. R. (1994). Second-order conditioning and Pavlovian conditioned inhibition: Operational similarities and differences. Journal of Experimental Psychology: Animal Behavior Processes, 20, 419-428.
- (1994) Journal of Experimental Psychology: Animal Behavior Processes , vol.20 , pp. 419-428
- Yin, H.¹ Barnet, R.C.² Miller, R.R.³

92
- 33745791519
- Probabilistic computation in spiking neurons
- L. K. Saul, Y. Weiss, & L. Bottou (Eds.). Cambridge, MA: MIT Press
- Zemel, R., Huys, Q., Natarajan, R., & Dayan, P. (2004). Probabilistic computation in spiking neurons. In L. K. Saul, Y. Weiss, & L. Bottou (Eds.), Advances in neural information processing systems, 17. Cambridge, MA: MIT Press.
- (2004) Advances in Neural Information Processing Systems , vol.17
- Zemel, R.¹ Huys, Q.² Natarajan, R.³ Dayan, P.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.