-
1
-
-
67349283062
-
Reinforcement learning in the brain
-
Niv Y. Reinforcement learning in the brain. J. Math. Psychol. 2009, 53:139-154.
-
(2009)
J. Math. Psychol.
, vol.53
, pp. 139-154
-
-
Niv, Y.1
-
2
-
-
72049125602
-
Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action
-
Balleine B.W., O'Doherty J.P. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 2010, 35:48-69.
-
(2010)
Neuropsychopharmacology
, vol.35
, pp. 48-69
-
-
Balleine, B.W.1
O'Doherty, J.P.2
-
3
-
-
84863416482
-
Temporal-difference search in computer Go
-
Silver D., et al. Temporal-difference search in computer Go. Mach. Learn. 2012, 87:183-219.
-
(2012)
Mach. Learn.
, vol.87
, pp. 183-219
-
-
Silver, D.1
-
5
-
-
84859737036
-
Goal directed decision making as probabilistic inference: A computational framework and potential neural correlates
-
Solway A., Botvinick M.M. Goal directed decision making as probabilistic inference: A computational framework and potential neural correlates. Psychol. Rev. 2012, 119:120-154.
-
(2012)
Psychol. Rev.
, vol.119
, pp. 120-154
-
-
Solway, A.1
Botvinick, M.M.2
-
6
-
-
84877282363
-
On stochastic optimal control and reinforcement learning by approximate inference
-
(Roy, N., ed), MIT Press (in press)
-
Rawlik, K. et al. On stochastic optimal control and reinforcement learning by approximate inference. In Proceedings, International Conference on Robotics Science and Systems (RSS 2012) (Roy, N., ed), MIT Press (in press).
-
Proceedings, International Conference on Robotics Science and Systems (RSS 2012)
-
-
Rawlik, K.1
-
8
-
-
84862024986
-
Optimal control as a graphical model inference problem
-
Kappen H.J., et al. Optimal control as a graphical model inference problem. Mach. Learn. 2012, 87:159-182.
-
(2012)
Mach. Learn.
, vol.87
, pp. 159-182
-
-
Kappen, H.J.1
-
9
-
-
77952091673
-
Action and behavior: a free-energy formulation
-
Friston K., et al. Action and behavior: a free-energy formulation. Biol. Cybern. 2010, 102:227-260.
-
(2010)
Biol. Cybern.
, vol.102
, pp. 227-260
-
-
Friston, K.1
-
11
-
-
34347361793
-
The neural basis of decision making
-
Gold J.I., Shadlen M.N. The neural basis of decision making. Annu. Rev. Neurosci. 2007, 30:535-574.
-
(2007)
Annu. Rev. Neurosci.
, vol.30
, pp. 535-574
-
-
Gold, J.I.1
Shadlen, M.N.2
-
12
-
-
78249247078
-
Rational approximations to rational models: alternative algorithms for category learning
-
Sanborn A.N., et al. Rational approximations to rational models: alternative algorithms for category learning. Psychol. Rev. 2010, 117:1144-1167.
-
(2010)
Psychol. Rev.
, vol.117
, pp. 1144-1167
-
-
Sanborn, A.N.1
-
13
-
-
81355133300
-
Neural dynamics as sampling: a model for stochastic computation in recurrent networks of spiking neurons
-
Buesing L., et al. Neural dynamics as sampling: a model for stochastic computation in recurrent networks of spiking neurons. PLoS Comput. Biol. 2011, 7:e1002211.
-
(2011)
PLoS Comput. Biol.
, vol.7
-
-
Buesing, L.1
-
14
-
-
42949178622
-
Value representations in the primate striatum during matching behavior
-
Lau B., Glimcher P.W. Value representations in the primate striatum during matching behavior. Neuron 2008, 58:451-463.
-
(2008)
Neuron
, vol.58
, pp. 451-463
-
-
Lau, B.1
Glimcher, P.W.2
-
15
-
-
33646566317
-
Neurons in the orbitofrontal cortex encode economic value
-
Padoa-Schioppa C., Assad J.A. Neurons in the orbitofrontal cortex encode economic value. Nature 2006, 441:223-226.
-
(2006)
Nature
, vol.441
, pp. 223-226
-
-
Padoa-Schioppa, C.1
Assad, J.A.2
|