SCOPUS 정보 검색 플랫폼

NIPS 2002: Proceedings of the 15th International Conference on Neural Information Processing Systems

Volumn , Issue , 2002, Pages 83-90

Timing and Partial Observability in the Dopamine System

(3) Daw, Nathaniel D a,c Courville, Aaron C b,c Touretzky, David S a,c

a Carnegie Mellon University (United States)

b CARNEGIE MELLON UNIVERSITY (United States)

c CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AMINES; MARKOV PROCESSES; NEUROPHYSIOLOGY; PHYSIOLOGICAL MODELS; TIMING CIRCUITS;

DOPAMINE; DOPAMINE NEURONS; HIDDEN STATE; MODEL USE; PARTIAL OBSERVABILITY; REWARD-PREDICTION ERROR; SEMI MARKOV PROCESS; TEMPORAL DIFFERENCES; TEMPORAL VARIABILITY; TEMPORAL-DIFFERENCE ALGORITHM;

FORECASTING;

EID: 85156225263 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (5)

References (20)

1
- 0002861883
- A model of how the basal ganglia generate and use neural signals that predict reinforcement
- JC Houk, JL Davis, and DG Beiser, editors, pages MIT Press
- JC Houk, JL Adams, and AG Barto. A model of how the basal ganglia generate and use neural signals that predict reinforcement. In JC Houk, JL Davis, and DG Beiser, editors, Models of Information Processing in the Basal Ganglia, pages 249-270. MIT Press, 1995.
- (1995) Models of Information Processing in the Basal Ganglia , pp. 249-270
- Houk, JC¹ Adams, JL² Barto, AG³

2
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive Hebbian learning
- PR Montague, P Dayan, and TJ Sejnowski. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci, 16:1936-1947, 1996.
- (1996) J Neurosci , vol.16 , pp. 1936-1947
- Montague, PR¹ Dayan, P² Sejnowski, TJ³

3
- 0030896968
- A neural substrate of prediction and reward
- W Schultz, P Dayan, and PR Montague. A neural substrate of prediction and reward. Science, 275:1593-1599, 1997.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W¹ Dayan, P² Montague, PR³

4
- 0032930935
- A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task
- RE Suri and W Schultz. A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task. Neurosci, 91:871-890, 1999.
- (1999) Neurosci , vol.91 , pp. 871-890
- Suri, RE¹ Schultz, W²

5
- 0036835734
- Long-term reward prediction in TD models of the dopamine system
- ND Daw and DS Touretzky. Long-term reward prediction in TD models of the dopamine system. Neural Comp, 14:2567-2583, 2002.
- (2002) Neural Comp , vol.14 , pp. 2567-2583
- Daw, ND¹ Touretzky, DS²

6
- 33847202724
- Learning to predict by the method of temporal differences
- RS Sutton. Learning to predict by the method of temporal differences. Machine Learning, 3:9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, RS¹

7
- 0031867046
- Predictive reward signal of dopamine neurons
- W Schultz. Predictive reward signal of dopamine neurons. J Neurophys, 80:1-27, 1998.
- (1998) J Neurophys , vol.80 , pp. 1-27
- Schultz, W¹

8
- 0003066891
- Time-derivative models of Pavlovian reinforcement
- M Gabriel and J Moore, editors, pages MIT Press
- RS Sutton and AG Barto. Time-derivative models of Pavlovian reinforcement. In M Gabriel and J Moore, editors, Learning and Computational Neuroscience: Foundations of Adaptive Networks, pages 497-537. MIT Press, 1990.
- (1990) Learning and Computational Neuroscience: Foundations of Adaptive Networks , pp. 497-537
- Sutton, RS¹ Barto, AG²

9
- 33644688754
- Dopamine neurons report an error in the temporal prediction of reward during learning
- JR Hollerman and W Schultz. Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neurosci, 1:304-309, 1998.
- (1998) Nature Neurosci , vol.1 , pp. 304-309
- Hollerman, JR¹ Schultz, W²

10
- 70350584243
- Combining configural and TD learning on a robot
- IEEE Computer Society
- DS Touretzky, ND Daw, and EJ Tira-Thompson. Combining configural and TD learning on a robot. In ICDL 2, pages 47-52. IEEE Computer Society, 2002.
- (2002) ICDL , vol.2 , pp. 47-52
- Touretzky, DS¹ Daw, ND² Tira-Thompson, EJ³

11
- 33745774575
- The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence
- 5
- CD Fiorillo and W Schultz. The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence. In Soc. Neurosci. Abstracts, volume 27: 827.5, 2001.
- (2001) Soc. Neurosci. Abstracts , vol.27 , pp. 827
- Fiorillo, CD¹ Schultz, W²

12
- 0032073263
- Planning and acting in partially observable stochastic domains
- LP Kaelbling, ML Littman, and AR Cassandra. Planning and acting in partially observable stochastic domains. Artif Intell, 101:99-134, 1998.
- (1998) Artif Intell , vol.101 , pp. 99-134
- Kaelbling, LP¹ Littman, ML² Cassandra, AR³

13
- 85150714688
- Reinforcement learning methods for continuous-time Markov Decision Problems
- MIT Press
- SJ Bradtke and MO Duff. Reinforcement learning methods for continuous-time Markov Decision Problems. In NIPS 7, pages 393-400. MIT Press, 1995.
- (1995) NIPS , vol.7 , pp. 393-400
- Bradtke, SJ¹ Duff, MO²

14
- 0026998041
- Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
- L Chrisman. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In AAAI 10, pages 183-188, 1992.
- (1992) AAAI , vol.10 , pp. 183-188
- Chrisman, L¹

15
- 84899024060
- Modeling temporal structure in classical conditioning
- MIT Press
- AC Courville and DS Touretzky. Modeling temporal structure in classical conditioning. In NIPS 14, pages 3-10. MIT Press, 2001.
- (2001) NIPS , vol.14 , pp. 3-10
- Courville, AC¹ Touretzky, DS²

16
- 84898950247
- Acquisition in autoshaping
- MIT Press
- S Kakade and P Dayan. Acquisition in autoshaping. In NIPS 12, pages 24-30. MIT Press, 2000.
- (2000) NIPS , vol.12 , pp. 24-30
- Kakade, S¹ Dayan, P²

17
- 0000272386
- Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme
- Y Guedon and C Cocozza-Thivent. Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme. Comp Speech and Lang, 4:167-192, 1990.
- (1990) Comp Speech and Lang , vol.4 , pp. 167-192
- Guedon, Y¹ Cocozza-Thivent, C²

18
- 0034169238
- Time, rate and conditioning
- CR Gallistel and J Gibbon. Time, rate and conditioning. Psych Rev, 107(2):289-344, 2000.
- (2000) Psych Rev , vol.107 , Issue.2 , pp. 289-344
- Gallistel, CR¹ Gibbon, J²

19
- 0035726809
- Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model
- RE Suri. Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model. Exp Brain Research, 140:234-240, 2001.
- (2001) Exp Brain Research , vol.140 , pp. 234-240
- Suri, RE¹

20
- 0011072195
- Motivated reinforcement learning
- MIT Press
- P Dayan. Motivated reinforcement learning. In NIPS 14, pages 11-18. MIT Press, 2001.
- (2001) NIPS , vol.14 , pp. 11-18
- Dayan, P¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.