-
1
-
-
0002861883
-
A model of how the basal ganglia generate and use neural signals that predict reinforcement
-
JC Houk, JL Davis, and DG Beiser, editors, pages MIT Press
-
JC Houk, JL Adams, and AG Barto. A model of how the basal ganglia generate and use neural signals that predict reinforcement. In JC Houk, JL Davis, and DG Beiser, editors, Models of Information Processing in the Basal Ganglia, pages 249-270. MIT Press, 1995.
-
(1995)
Models of Information Processing in the Basal Ganglia
, pp. 249-270
-
-
Houk, JC1
Adams, JL2
Barto, AG3
-
2
-
-
0029981543
-
A framework for mesencephalic dopamine systems based on predictive Hebbian learning
-
PR Montague, P Dayan, and TJ Sejnowski. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci, 16:1936-1947, 1996.
-
(1996)
J Neurosci
, vol.16
, pp. 1936-1947
-
-
Montague, PR1
Dayan, P2
Sejnowski, TJ3
-
3
-
-
0030896968
-
A neural substrate of prediction and reward
-
W Schultz, P Dayan, and PR Montague. A neural substrate of prediction and reward. Science, 275:1593-1599, 1997.
-
(1997)
Science
, vol.275
, pp. 1593-1599
-
-
Schultz, W1
Dayan, P2
Montague, PR3
-
4
-
-
0032930935
-
A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task
-
RE Suri and W Schultz. A neural network with dopamine-like reinforcement signal that learns a spatial delayed response task. Neurosci, 91:871-890, 1999.
-
(1999)
Neurosci
, vol.91
, pp. 871-890
-
-
Suri, RE1
Schultz, W2
-
5
-
-
0036835734
-
Long-term reward prediction in TD models of the dopamine system
-
ND Daw and DS Touretzky. Long-term reward prediction in TD models of the dopamine system. Neural Comp, 14:2567-2583, 2002.
-
(2002)
Neural Comp
, vol.14
, pp. 2567-2583
-
-
Daw, ND1
Touretzky, DS2
-
6
-
-
33847202724
-
Learning to predict by the method of temporal differences
-
RS Sutton. Learning to predict by the method of temporal differences. Machine Learning, 3:9-44, 1988.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, RS1
-
7
-
-
0031867046
-
Predictive reward signal of dopamine neurons
-
W Schultz. Predictive reward signal of dopamine neurons. J Neurophys, 80:1-27, 1998.
-
(1998)
J Neurophys
, vol.80
, pp. 1-27
-
-
Schultz, W1
-
9
-
-
33644688754
-
Dopamine neurons report an error in the temporal prediction of reward during learning
-
JR Hollerman and W Schultz. Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neurosci, 1:304-309, 1998.
-
(1998)
Nature Neurosci
, vol.1
, pp. 304-309
-
-
Hollerman, JR1
Schultz, W2
-
10
-
-
70350584243
-
Combining configural and TD learning on a robot
-
IEEE Computer Society
-
DS Touretzky, ND Daw, and EJ Tira-Thompson. Combining configural and TD learning on a robot. In ICDL 2, pages 47-52. IEEE Computer Society, 2002.
-
(2002)
ICDL
, vol.2
, pp. 47-52
-
-
Touretzky, DS1
Daw, ND2
Tira-Thompson, EJ3
-
11
-
-
33745774575
-
The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence
-
5
-
CD Fiorillo and W Schultz. The reward responses of dopamine neurons persist when prediction of reward is probabilistic with respect to time or occurrence. In Soc. Neurosci. Abstracts, volume 27: 827.5, 2001.
-
(2001)
Soc. Neurosci. Abstracts
, vol.27
, pp. 827
-
-
Fiorillo, CD1
Schultz, W2
-
12
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
LP Kaelbling, ML Littman, and AR Cassandra. Planning and acting in partially observable stochastic domains. Artif Intell, 101:99-134, 1998.
-
(1998)
Artif Intell
, vol.101
, pp. 99-134
-
-
Kaelbling, LP1
Littman, ML2
Cassandra, AR3
-
13
-
-
85150714688
-
Reinforcement learning methods for continuous-time Markov Decision Problems
-
MIT Press
-
SJ Bradtke and MO Duff. Reinforcement learning methods for continuous-time Markov Decision Problems. In NIPS 7, pages 393-400. MIT Press, 1995.
-
(1995)
NIPS
, vol.7
, pp. 393-400
-
-
Bradtke, SJ1
Duff, MO2
-
14
-
-
0026998041
-
Reinforcement learning with perceptual aliasing: The perceptual distinctions approach
-
L Chrisman. Reinforcement learning with perceptual aliasing: The perceptual distinctions approach. In AAAI 10, pages 183-188, 1992.
-
(1992)
AAAI
, vol.10
, pp. 183-188
-
-
Chrisman, L1
-
15
-
-
84899024060
-
Modeling temporal structure in classical conditioning
-
MIT Press
-
AC Courville and DS Touretzky. Modeling temporal structure in classical conditioning. In NIPS 14, pages 3-10. MIT Press, 2001.
-
(2001)
NIPS
, vol.14
, pp. 3-10
-
-
Courville, AC1
Touretzky, DS2
-
16
-
-
84898950247
-
Acquisition in autoshaping
-
MIT Press
-
S Kakade and P Dayan. Acquisition in autoshaping. In NIPS 12, pages 24-30. MIT Press, 2000.
-
(2000)
NIPS
, vol.12
, pp. 24-30
-
-
Kakade, S1
Dayan, P2
-
17
-
-
0000272386
-
Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme
-
Y Guedon and C Cocozza-Thivent. Explicit state occupancy modeling by hidden semi-Markov models: Application of Derin's scheme. Comp Speech and Lang, 4:167-192, 1990.
-
(1990)
Comp Speech and Lang
, vol.4
, pp. 167-192
-
-
Guedon, Y1
Cocozza-Thivent, C2
-
18
-
-
0034169238
-
Time, rate and conditioning
-
CR Gallistel and J Gibbon. Time, rate and conditioning. Psych Rev, 107(2):289-344, 2000.
-
(2000)
Psych Rev
, vol.107
, Issue.2
, pp. 289-344
-
-
Gallistel, CR1
Gibbon, J2
-
19
-
-
0035726809
-
Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model
-
RE Suri. Anticipatory responses of dopamine neurons and cortical neurons reproduced by internal model. Exp Brain Research, 140:234-240, 2001.
-
(2001)
Exp Brain Research
, vol.140
, pp. 234-240
-
-
Suri, RE1
-
20
-
-
0011072195
-
Motivated reinforcement learning
-
MIT Press
-
P Dayan. Motivated reinforcement learning. In NIPS 14, pages 11-18. MIT Press, 2001.
-
(2001)
NIPS
, vol.14
, pp. 11-18
-
-
Dayan, P1
|