Volume , Issue , 2002, Pages 83-90

Timing and Partial Observability in the Dopamine System

Author keywords

[No Author keywords available]

Indexed keywords

AMINES; MARKOV PROCESSES; NEUROPHYSIOLOGY; PHYSIOLOGICAL MODELS; TIMING CIRCUITS;

EID: 85156225263     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (20)
  • 1
    • 0002861883
    • A model of how the basal ganglia generate and use neural signals that predict reinforcement
    • MIT Press
    • JC Houk, JL Adams, and AG Barto. A model of how the basal ganglia generate and use neural signals that predict reinforcement. In JC Houk, JL Davis, and DG Beiser, editors, Models of Information Processing in the Basal Ganglia, pages 249-270. MIT Press, 1995.
    • (1995) Models of Information Processing in the Basal Ganglia , pp. 249-270
    • Houk, JC1    Adams, JL2    Barto, AG3
  • 2
    • 0029981543
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • PR Montague, P Dayan, and TJ Sejnowski. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci, 16:1936-1947, 1996.
    • (1996) J Neurosci , vol.16 , pp. 1936-1947
    • Montague, PR1    Dayan, P2    Sejnowski, TJ3
  • 3
    • 0030896968
    • A neural substrate of prediction and reward
    • W Schultz, P Dayan, and PR Montague. A neural substrate of prediction and reward. Science, 275:1593-1599, 1997.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W1    Dayan, P2    Montague, PR3
  • 4
    • 0032930935
    • A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task
    • RE Suri and W Schultz. A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task. Neurosci, 91:871-890, 1999.
    • (1999) Neurosci , vol.91 , pp. 871-890
    • Suri, RE1    Schultz, W2
  • 5
    • 0036835734
    • Long-term reward prediction in TD models of the dopamine system
    • ND Daw and DS Touretzky. Long-term reward prediction in TD models of the dopamine system. Neural Comp, 14:2567-2583, 2002.
    • (2002) Neural Comp , vol.14 , pp. 2567-2583
    • Daw, ND1    Touretzky, DS2
  • 6
    • 33847202724
    • Learning to predict by the method of temporal differences
    • RS Sutton. Learning to predict by the method of temporal differences. Machine Learning, 3:9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, RS1
  • 7
    • 0031867046
    • Predictive reward signal of dopamine neurons
    • W Schultz. Predictive reward signal of dopamine neurons. J Neurophys, 80:1-27, 1998.
    • (1998) J Neurophys , vol.80 , pp. 1-27
    • Schultz, W1
  • 9
    • 33644688754
    • Dopamine neurons report an error in the temporal prediction of reward during learning
    • JR Hollerman and W Schultz. Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neurosci, 1:304-309, 1998.
    • (1998) Nature Neurosci , vol.1 , pp. 304-309
    • Hollerman, JR1    Schultz, W2
  • 10
    • 70350584243
    • Combining configural and TD learning on a robot
    • IEEE Computer Society
    • DS Touretzky, ND Daw, and EJ Tira-Thompson. Combining configural and TD learning on a robot. In ICDL 2, pages 47-52. IEEE Computer Society, 2002.
    • (2002) ICDL , vol.2 , pp. 47-52
    • Touretzky, DS1    Daw, ND2    Tira-Thompson, EJ3
  • 11
    • 33745774575
    • The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence
    • CD Fiorillo and W Schultz. The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence. In Soc. Neurosci. Abstracts, volume 27: 827.5, 2001.
    • (2001) Soc. Neurosci. Abstracts , vol.27 , pp. 827
    • Fiorillo, CD1    Schultz, W2
  • 12
    • 0032073263
    • Planning and acting in partially observable stochastic domains
    • LP Kaelbling, ML Littman, and AR Cassandra. Planning and acting in partially observable stochastic domains. Artif Intell, 101:99-134, 1998.
    • (1998) Artif Intell , vol.101 , pp. 99-134
    • Kaelbling, LP1    Littman, ML2    Cassandra, AR3
  • 13
    • 85150714688
    • Reinforcement learning methods for continuous-time Markov Decision Problems
    • MIT Press
    • SJ Bradtke and MO Duff. Reinforcement learning methods for continuous-time Markov Decision Problems. In NIPS 7, pages 393-400. MIT Press, 1995.
    • (1995) NIPS , vol.7 , pp. 393-400
    • Bradtke, SJ1    Duff, MO2
  • 14
    • 0026998041
    • Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
    • L Chrisman. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In AAAI 10, pages 183-188, 1992.
    • (1992) AAAI , vol.10 , pp. 183-188
    • Chrisman, L1
  • 15
    • 84899024060
    • Modeling temporal structure in classical conditioning
    • MIT Press
    • AC Courville and DS Touretzky. Modeling temporal structure in classical conditioning. In NIPS 14, pages 3-10. MIT Press, 2001.
    • (2001) NIPS , vol.14 , pp. 3-10
    • Courville, AC1    Touretzky, DS2
  • 16
    • 84898950247
    • Acquisition in autoshaping
    • MIT Press
    • S Kakade and P Dayan. Acquisition in autoshaping. In NIPS 12, pages 24-30. MIT Press, 2000.
    • (2000) NIPS , vol.12 , pp. 24-30
    • Kakade, S1    Dayan, P2
  • 17
    • 0000272386
    • Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme
    • Y Guedon and C Cocozza-Thivent. Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme. Comp Speech and Lang, 4:167-192, 1990.
    • (1990) Comp Speech and Lang , vol.4 , pp. 167-192
    • Guedon, Y1    Cocozza-Thivent, C2
  • 18
    • 0034169238
    • Time, rate and conditioning
    • CR Gallistel and J Gibbon. Time, rate and conditioning. Psych Rev, 107(2):289-344, 2000.
    • (2000) Psych Rev , vol.107 , Issue.2 , pp. 289-344
    • Gallistel, CR1    Gibbon, J2
  • 19
    • 0035726809
    • Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model
    • RE Suri. Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model. Exp Brain Research, 140:234-240, 2001.
    • (2001) Exp Brain Research , vol.140 , pp. 234-240
    • Suri, RE1
  • 20
    • 0011072195
    • Motivated reinforcement learning
    • MIT Press
    • P Dayan. Motivated reinforcement learning. In NIPS 14, pages 11-18. MIT Press, 2001.
    • (2001) NIPS , vol.14 , pp. 11-18
    • Dayan, P1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.