-
1
-
-
0001771345
-
Linear least-squares algorithms for temporal difference learning
-
Bradtke, S. J., & Barto, A. G. (1996). Linear least-squares algorithms for temporal difference learning. Machine Learning, 22, 33-57.
-
(1996)
Machine Learning
, vol.22
, pp. 33-57
-
-
Bradtke, S.J.1
Barto, A.G.2
-
2
-
-
0025168366
-
More psychophysics of the pigeon's use of landmarks
-
Cheng, K. (1990). More psychophysics of the pigeon's use of landmarks. Journal of Comparative Physiology A, 166, 857-863.
-
(1990)
Journal of Comparative Physiology A
, vol.166
, pp. 857-863
-
-
Cheng, K.1
-
3
-
-
0000174737
-
Error is proportional to distance measured by honeybees: Weber's law in the odometer
-
Cheng, K., Srinivasan, M., & Zhang, S. (1999). Error is proportional to distance measured by honeybees: Weber's law in the odometer. Animal Cognition, 2, 11-16.
-
(1999)
Animal Cognition
, vol.2
, pp. 11-16
-
-
Cheng, K.1
Srinivasan, M.2
Zhang, S.3
-
4
-
-
33745787929
-
Representation and timing in theories of the dopamine system
-
Daw, N. D., Courville, A. C., & Touretzky, D. S. (2006). Representation and timing in theories of the dopamine system. Neural Computation, 18, 1637-1677.
-
(2006)
Neural Computation
, vol.18
, pp. 1637-1677
-
-
Daw, N.D.1
Courville, A.C.2
Touretzky, D.S.3
-
5
-
-
0033968832
-
A model of hippocampally dependent navigation, using the temporal difference learning rule
-
Foster, D., Morris, R., & Dayan, P. (2000). A model of hippocampally dependent navigation, using the temporal difference learning rule. Hippocampus, 10, 1-16.
-
(2000)
Hippocampus
, vol.10
, pp. 1-16
-
-
Foster, D.1
Morris, R.2
Dayan, P.3
-
6
-
-
80053152388
-
Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis
-
Glimcher, P. W. (2011). Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis. Proceedings of the National Academy of Sciences, 108, 15647-15654.
-
(2011)
Proceedings of the National Academy of Sciences
, vol.108
, pp. 15647-15654
-
-
Glimcher, P.W.1
-
7
-
-
80055084825
-
Grid cells, place cells, and geodesic generalization for spatial reinforcement learning
-
Gustafson, N. J., & Daw, N. D. (2011). Grid cells, place cells, and geodesic generalization for spatial reinforcement learning. PLoS Computational Biology, 7(10), e1002235.
-
(2011)
PLoS Computational Biology
, Issue.10
-
-
Gustafson, N.J.1
Daw, N.D.2
-
8
-
-
84883065020
-
Prolonged dopamine signalling in striatum signals proximity and value of distant rewards
-
Howe, M.W., Tierney, P. L., Sandberg, S. G., Phillips, P. E., & Graybiel, A. M. (2013). Prolonged dopamine signalling in striatum signals proximity and value of distant rewards. Nature, 500, 575-579.
-
(2013)
Nature
, vol.500
, pp. 575-579
-
-
Howe, M.W.1
Tierney, P.L.2
Sandberg, S.G.3
Phillips, P.E.4
Graybiel, A.M.5
-
9
-
-
70449382577
-
Temporal-difference reinforcement learning with distributed representations
-
Kurth-Nelson, Z., & Redish, A. D. (2009). Temporal-difference reinforcement learning with distributed representations. PloS One, 4, e7362.
-
(2009)
PloS One
, vol.4
-
-
Kurth-Nelson, Z.1
Redish, A.D.2
-
10
-
-
57349130536
-
Stimulus representation and the timing of reward-prediction errors in models of the dopamine system
-
Ludvig, E. A., Sutton, R. S., & Kehoe, E. J. (2008). Stimulus representation and the timing of reward-prediction errors in models of the dopamine system. Neural Computation, 20, 3034-3054.
-
(2008)
Neural Computation
, vol.20
, pp. 3034-3054
-
-
Ludvig, E.A.1
Sutton, R.S.2
Kehoe, E.J.3
-
11
-
-
84883087774
-
Neuroscience: Dopamine ramps up
-
Niv, Y. (2013). Neuroscience: Dopamine ramps up. Nature, 500, 533-535.
-
(2013)
Nature
, vol.500
, pp. 533-535
-
-
Niv, Y.1
-
13
-
-
0029999805
-
Geometric determinants of the place fields of hippocampal neurons
-
O'Keefe, J., & Burgess, N. (1996). Geometric determinants of the place fields of hippocampal neurons. Nature, 381, 425-428.
-
(1996)
Nature
, vol.381
, pp. 425-428
-
-
O'Keefe, J.1
Burgess, N.2
-
14
-
-
0030896968
-
A neural substrate of prediction and reward
-
Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1599.
-
(1997)
Science
, vol.275
, pp. 1593-1599
-
-
Schultz, W.1
Dayan, P.2
Montague, P.R.3
|