메뉴 건너뛰기




Volumn 15, Issue 4-6, 2002, Pages 549-559

Dopamine: Generalization and bonuses

Author keywords

Dopamine; Exploration; Generalization; Reinforcement learning; Temporal difference

Indexed keywords

CALCULATIONS; ERRORS; NEUROLOGY;

EID: 0036592029     PISSN: 08936080     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0893-6080(02)00048-5     Document Type: Article
Times cited : (361)

References (67)
  • 20
    • 0000146022 scopus 로고
    • Neural dynamics of attentionally modulated pavlovian conditioning: Conditioned reinforcement, inhibition, and opponent processing
    • (1987) Psychobiology , vol.15 , pp. 195-240
    • Grossberg, S.1    Schmajuk, N.A.2
  • 28
    • 0002861883 scopus 로고
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • Houk J.C., Davis J.L., Beiser D.G. (Eds.), Models of information processing in the basal ganglia, Cambridge, MA: MIT Press
    • (1995) , pp. 249-270
    • Houk, J.C.1    Adams, J.L.2    Barto, A.G.3
  • 29
    • 2142806930 scopus 로고
    • Principles of behavior, New York: Appleton (Century)
    • (1943)
    • Hull, C.L.1
  • 31
    • 2142818206 scopus 로고    scopus 로고
    • Leen T.K., Dietterich T.G., Tresp V. (Eds.), Dopamine bonuses, NIPS
    • (2000)
    • Kakade, S.1    Dayan, P.2
  • 32
    • 2142647467 scopus 로고    scopus 로고
    • Kehoe, E.J (1977). Effects of serial compound stimuli on stimulus selection in classical conditioning of the rabbit nictitating membrane response. PhD Thesis, University of Iowa.
  • 34
    • 0030026069 scopus 로고    scopus 로고
    • Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli
    • (1996) Nature , vol.379 , pp. 449-451
    • Mirenowicz, J.1    Schultz, W.2
  • 37
    • 2142661774 scopus 로고    scopus 로고
    • Ng, A. Y., Harada, D., & Russell, S (1999). Policy invariance under reward transformations: Theory and application to reward shaping. Proceedings of the 16th International Conference on Machine Learning.
  • 39
    • 0035931930 scopus 로고    scopus 로고
    • Temporal dynamics of a neural solution to the aperture problem in visual area MT of macaque brain
    • (2001) Nature , vol.409 , pp. 1040-1042
    • Pack, C.C.1    Born, R.T.2
  • 44
    • 0002109138 scopus 로고
    • A theory of pavlovian conditioning: The effectiveness of reinforcement and non-reinforcement
    • Black A.H., Prokasy W.F. (Eds.), Classical conditioning II, current research and theory, New York: Aleton (Century/Crofts)
    • (1972) , pp. 64-69
    • Rescorla, R.A.1    Wagner, A.R.2
  • 56
    • 0036592034 scopus 로고    scopus 로고
    • TD models of reward predictive responses in dopamine neurons
    • (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 523-533
    • Suri, R.E.1
  • 57
    • 0032930935 scopus 로고    scopus 로고
    • A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
    • (1999) Neuroscience , vol.91 , pp. 871-890
    • Suri, R.E.1    Schultz, W.2
  • 65
    • 2142660332 scopus 로고    scopus 로고
    • Watkins, C. J. C. H (1989). Learning from delayed rewards. PhD dissertation, University of Cambridge.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.