-
1
-
-
84898958374
-
Gradient descent for general reinforcement learning
-
Baird L, Moore A. 1999. Gradient descent for general reinforcement learning. Advances in Neural Information Processing Systems 11:968–974 https://papers.nips.cc/paper/1576-gradient-descent-for-general-reinforcement-learning.pdf.
-
(1999)
Advances in Neural Information Processing Systems
, vol.11
, pp. 968-974
-
-
Baird, L.1
Moore, A.2
-
2
-
-
84875920967
-
From fixed points to chaos: Three models of delayed discrimination
-
PMID: 23438479
-
Barak O, Sussillo D, Romo R, Tsodyks M, Abbott LF. 2013. From fixed points to chaos: three models of delayed discrimination. Progress in Neurobiology 103:214–222. doi: 10.1016/j.pneurobio.2013.02.002, PMID: 23438479.
-
(2013)
Progress in Neurobiology
, vol.103
, pp. 214-222
-
-
Barak, O.1
Sussillo, D.2
Romo, R.3
Tsodyks, M.4
Abbott, L.F.5
-
3
-
-
0020970738
-
Neuronlike adaptive elements that can solve difficult learning control problems
-
Barto AG, Sutton RS, Anderson CW. 1983. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics SMC-13:834–846. doi: 10.1109/TSMC.1983.6313077.
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics SMC-13
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Erson, C.W.3
-
5
-
-
21544435722
-
Midbrain dopamine neurons encode a quantitative reward prediction error signal
-
PMID: 15996553
-
Bayer HM, Glimcher PW. 2005. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47:129–141. doi: 10.1016/j.neuron.2005.05.020, PMID: 15996553.
-
(2005)
Neuron
, vol.47
, pp. 129-141
-
-
Bayer, H.M.1
Glimcher, P.W.2
-
6
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
PMID: 18267787
-
Bengio Y, Simard P, Frasconi P. 1994. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks 5:157–166. doi: 10.1109/72.279181, PMID: 18267787.
-
(1994)
IEEE Transactions on Neural Networks
, vol.5
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
7
-
-
79952006391
-
A reservoir of time constants for memory traces in cortical neurons
-
PMID: 21317906
-
Bernacchia A, Seo H, Lee D, Wang XJ. 2011. A reservoir of time constants for memory traces in cortical neurons. Nature Neuroscience 14:366–372. doi: 10.1038/nn.2752, PMID: 21317906.
-
(2011)
Nature Neuroscience
, vol.14
, pp. 366-372
-
-
Bernacchia, A.1
Seo, H.2
Lee, D.3
Wang, X.J.4
-
8
-
-
84971251166
-
Does computational neuroscience need new synaptic learning paradigms?
-
Brea J, Gerstner W. 2016. Does computational neuroscience need new synaptic learning paradigms? Current Opinion in Behavioral Sciences 11:61–66. doi: 10.1016/j.cobeha.2016.05.012.
-
(2016)
Current Opinion in Behavioral Sciences
, vol.11
, pp. 61-66
-
-
Brea, J.1
Gerstner, W.2
-
9
-
-
84946086073
-
Reinforcement learning of linking and tracing contours in recurrent neural networks.
-
Brosch T, Neumann H, Roelfsema PR. 2015. Reinforcement learning of linking and tracing contours in recurrent neural networks.. PLoS Computational Biology 11:e1004489. doi: 10.1371/journal.pcbi.1004489.
-
(2015)
Plos Computational Biology
, vol.11
-
-
Brosch, T.1
Neumann, H.2
Roelfsema, P.R.3
-
10
-
-
84919607718
-
Deep neural networks rival the representation of primate IT cortex for core visual object recognition
-
Cadieu CF, Hong H, Yamins DLK, Pinto N, Ardila D, Solomon EA, Majaj NJ, DiCarlo JJ. 2014. Deep neural networks rival the representation of primate IT cortex for core visual object recognition. PLoS Computational Biology 10:e1003963. doi: 10.1371/journal.pcbi.1003963.
-
(2014)
Plos Computational Biology
, vol.10
-
-
Cadieu, C.F.1
Hong, H.2
Yamins, D.3
Pinto, N.4
Ardila, D.5
Solomon, E.A.6
Majaj, N.J.7
Dicarlo, J.J.8
-
11
-
-
84930243913
-
Dynamic control of response criterion in premotor cortex during perceptual detection under temporal uncertainty
-
PMID: 25959731
-
Carnevale F, de Lafuente V, Romo R, Barak O, Parga N. 2015. Dynamic control of response criterion in premotor cortex during perceptual detection under temporal uncertainty. Neuron 86:1067–1077. doi: 10.1016/j.neuron.2015.04.014, PMID: 25959731.
-
(2015)
Neuron
, vol.86
, pp. 1067-1077
-
-
Carnevale, F.1
De Lafuente, V.2
Romo, R.3
Barak, O.4
Parga, N.5
-
12
-
-
84961291190
-
-
arXiv
-
Cho K, van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv. http://arxiv.org/abs/1406.1078.
-
(2014)
Learning Phrase Representations Using RNN Encoder-Decoder for Statistical Machine Translation
-
-
Cho, K.1
Van Merrienboer, B.2
Gulcehre, C.3
Bahdanau, D.4
Bougares, F.5
Schwenk, H.6
Bengio, Y.7
-
14
-
-
0037057808
-
Reward, motivation, and reinforcement learning
-
PMID: 12383782
-
Dayan P, Balleine BW. 2002. Reward, motivation, and reinforcement learning. Neuron 36:285–298. doi: 10.1016/S0896-6273(02)00963-7, PMID: 12383782.
-
(2002)
Neuron
, vol.36
, pp. 285-298
-
-
Dayan, P.1
Balleine, B.W.2
-
15
-
-
60749114870
-
Decision theory, reinforcement learning, and the brain. Cognitive
-
PMID: 19033240
-
Dayan P, Daw ND. 2008. Decision theory, reinforcement learning, and the brain. Cognitive, Affective, & Behavioral Neuroscience 8:429–453. doi: 10.3758/CABN.8.4.429, PMID: 19033240.
-
(2008)
Affective, & Behavioral Neuroscience
, vol.8
, pp. 429-453
-
-
Dayan, P.1
Daw, N.D.2
-
16
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
PMID: 10636940
-
Doya K. 2000. Reinforcement learning in continuous time and space. Neural Computation 12:219–245. doi: 10.1162/089976600300015961, PMID: 10636940.
-
(2000)
Neural Computation
, vol.12
, pp. 219-245
-
-
Doya, K.1
-
17
-
-
77049204393
-
Cholinergic and inhibitory synapses in a pathway from motor-axon collaterals to motoneurones
-
PMID: 13222354
-
Eccles JC, Fatt P, Koketsu K. 1954. Cholinergic and inhibitory synapses in a pathway from motor-axon collaterals to motoneurones. The Journal of Physiology 126:524–562. doi: 10.1113/jphysiol.1954.sp005226, PMID: 13222354.
-
(1954)
The Journal of Physiology
, vol.126
, pp. 524-562
-
-
Eccles, J.C.1
Fatt, P.2
Koketsu, K.3
-
18
-
-
84924778329
-
Choice-correlated activity fluctuations underlie learning of neuronal category representation
-
PMID: 25759251
-
Engel TA, Chaisangmongkon W, Freedman DJ, Wang XJ. 2015. Choice-correlated activity fluctuations underlie learning of neuronal category representation. Nature Communications 6:6454. doi: 10.1038/ncomms7454, PMID: 25759251.
-
(2015)
Nature Communications
, vol.6
, pp. 6454
-
-
Engel, T.A.1
Chaisangmongkon, W.2
Freedman, D.J.3
Wang, X.J.4
-
19
-
-
33746652644
-
Gradient learning in spiking neural networks by dynamic perturbation of conductances
-
PMID: 16907616
-
Fiete IR, Seung HS. 2006. Gradient learning in spiking neural networks by dynamic perturbation of conductances. Physical Review Letters 97:048104. doi: 10.1103/PhysRevLett.97.048104, PMID: 16907616.
-
(2006)
Physical Review Letters
, vol.97
-
-
Fiete, I.R.1
Seung, H.S.2
-
20
-
-
35348872545
-
Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances
-
PMID: 17652414
-
Fiete IR, Fee MS, Seung HS. 2007. Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. Journal of Neurophysiology 98:2038–2057. doi: 10.1152/jn.01311.2006, PMID: 17652414.
-
(2007)
Journal of Neurophysiology
, vol.98
, pp. 2038-2057
-
-
Fiete, I.R.1
Fee, M.S.2
Seung, H.S.3
-
21
-
-
33744550336
-
Anatomy of a decision: Striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal
-
PMID: 16637763
-
Frank MJ, Claus ED. 2006. Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychological Review 113:300–326. doi: 10.1037/0033-295X.113.2.300, PMID: 16637763.
-
(2006)
Psychological Review
, vol.113
, pp. 300-326
-
-
Frank, M.J.1
Claus, E.D.2
-
22
-
-
84957812846
-
Goal-Directed decision making with spiking neurons
-
PMID: 26843636
-
Friedrich J, Lengyel M. 2016. Goal-Directed decision making with spiking neurons. Journal of Neuroscience 36: 1529–1546. doi: 10.1523/JNEUROSCI.2854-15.2016, PMID: 26843636.
-
(2016)
Journal of Neuroscience
, vol.36
, pp. 1529-1546
-
-
Friedrich, J.1
Lengyel, M.2
-
23
-
-
77957731196
-
Functional requirements for reward-modulated spike-timing-dependent plasticity
-
PMID: 20926659
-
Frémaux N, Sprekeler H, Gerstner W. 2010. Functional requirements for reward-modulated spike-timing-dependent plasticity. Journal of Neuroscience 30:13326–13337. doi: 10.1523/JNEUROSCI.6249-09.2010, PMID: 20926659.
-
(2010)
Journal of Neuroscience
, vol.30
, pp. 13326-13337
-
-
Frémaux, N.1
Sprekeler, H.2
Gerstner, W.3
-
24
-
-
84928527851
-
On simplicity and complexity in the brave new world of large-scale neuroscience
-
PMID: 25932978
-
Gao P, Ganguli S. 2015. On simplicity and complexity in the brave new world of large-scale neuroscience. Current Opinion in Neurobiology 32:148–155. doi: 10.1016/j.conb.2015.04.003, PMID: 25932978.
-
(2015)
Current Opinion in Neurobiology
, vol.32
, pp. 148-155
-
-
Gao, P.1
Ganguli, S.2
-
25
-
-
34347361793
-
The neural basis of decision making
-
PMID: 17600525
-
Gold JI, Shadlen MN. 2007. The neural basis of decision making. Annual Review of Neuroscience 30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038, PMID: 17600525.
-
(2007)
Annual Review of Neuroscience
, vol.30
, pp. 535-574
-
-
Gold, J.I.1
Shadlen, M.N.2
-
27
-
-
84871756682
-
A survey of actor-critic reinforcement learning: Standard and natural policy gradients
-
Grondman I, Busoniu L, Lopes GAD, Babuska R. 2012. A survey of actor-critic reinforcement learning: Standard and natural policy gradients. IEEE Transactions on Systems, Man, and Cybernetics, Part C 42:1291–1307. doi: 10.1109/TSMCC.2012.2218595.
-
(2012)
IEEE Transactions on Systems, Man, and Cybernetics, Part C
, vol.42
, pp. 1291-1307
-
-
Grondman, I.1
Busoniu, L.2
Lopes, G.3
Babuska, R.4
-
28
-
-
84902438429
-
Optimal control of transient dynamics in balanced networks supports generation of complex movements
-
PMID: 24945778
-
Hennequin G, Vogels TP, Gerstner W. 2014. Optimal control of transient dynamics in balanced networks supports generation of complex movements. Neuron 82:1394–1406. doi: 10.1016/j.neuron.2014.04.045, PMID: 24945778.
-
(2014)
Neuron
, vol.82
, pp. 1394-1406
-
-
Hennequin, G.1
Vogels, T.P.2
Gerstner, W.3
-
29
-
-
84904700222
-
Basal ganglia circuits for reward value-guided behavior
-
PMID: 25032497
-
Hikosaka O, Kim HF, Yasuda M, Yamamoto S. 2014. Basal ganglia circuits for reward value-guided behavior. Annual Review of Neuroscience 37:289–306. doi: 10.1146/annurev-neuro-071013-013924, PMID: 25032497.
-
(2014)
Annual Review of Neuroscience
, vol.37
, pp. 289-306
-
-
Hikosaka, O.1
Kim, H.F.2
Yasuda, M.3
Yamamoto, S.4
-
30
-
-
84894276345
-
Emergence of complex computational structures from chaotic neural networks through reward-modulated hebbian learning
-
PMID: 23146969
-
Hoerzer GM, Legenstein R, Maass W. 2014. Emergence of complex computational structures from chaotic neural networks through reward-modulated hebbian learning. Cerebral Cortex 24:677–690. doi: 10.1093/cercor/bhs348, PMID: 23146969.
-
(2014)
Cerebral Cortex
, vol.24
, pp. 677-690
-
-
Hoerzer, G.M.1
Legenstein, R.2
Maass, W.3
-
31
-
-
84959087247
-
Explicit information for category-orthogonal object properties increases along the ventral stream
-
PMID: 26900926
-
Hong H, Yamins DL, Majaj NJ, DiCarlo JJ. 2016. Explicit information for category-orthogonal object properties increases along the ventral stream. Nature Neuroscience 19:613–622. doi: 10.1038/nn.4247, PMID: 26900926.
-
(2016)
Nature Neuroscience
, vol.19
, pp. 613-622
-
-
Hong, H.1
Yamins, D.L.2
Majaj, N.J.3
Dicarlo, J.J.4
-
32
-
-
0002861883
-
A model of how the basal ganglia generates and uses neural signals that predict reinforcement
-
Houk J. C, Davis J. L, Beisberb D. G, Cambridge, MA: MIT Press
-
Houk JC, Adams JL, Barto AG. 1995. A model of how the basal ganglia generates and uses neural signals that predict reinforcement. In: Houk J. C, Davis J. L, Beisberb D. G (Eds). Models of Information Processing in the Basal Ganglia. Cambridge, MA: MIT Press. p 249–274.
-
(1995)
Models of Information Processing in the Basal Ganglia
, pp. 249-274
-
-
Houk, J.C.1
Adams, J.L.2
Barto, A.G.3
-
33
-
-
34948906745
-
Solving the distal reward problem through linkage of STDP and dopamine signaling
-
PMID: 17220510
-
Izhikevich EM. 2007. Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral Cortex 17:2443–2452. doi: 10.1093/cercor/bhl152, PMID: 17220510.
-
(2007)
Cerebral Cortex
, vol.17
, pp. 2443-2452
-
-
Izhikevich, E.M.1
-
34
-
-
84989328143
-
-
arXiv
-
Jaderberg M, Czarnecki WM, Osindero S, Vinyals O, Graves A, Kavukcuoglu K. 2016. Decoupled neural interfaces using synthetic gradients. arXiv. http://arxiv.org/abs/1608.05343.
-
(2016)
Decoupled Neural Interfaces Using Synthetic Gradients
-
-
Jaderberg, M.1
Czarnecki, W.M.2
Osindero, S.3
Vinyals, O.4
Graves, A.5
Kavukcuoglu, K.6
-
35
-
-
0036592026
-
Actor-critic models of the basal ganglia: New anatomical and computational perspectives
-
PMID: 12371510
-
Joel D, Niv Y, Ruppin E. 2002. Actor-critic models of the basal ganglia: new anatomical and computational perspectives. Neural Networks 15:535–547. doi: 10.1016/S0893-6080(02)00047-3, PMID: 12371510.
-
(2002)
Neural Networks
, vol.15
, pp. 535-547
-
-
Joel, D.1
Niv, Y.2
Ruppin, E.3
-
36
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
Kaelbling LP, Littman ML, Cassandra AR. 1998. Planning and acting in partially observable stochastic domains. Artificial Intelligence 101:99–134. doi: 10.1016/S0004-3702(98)00023-X
-
(1998)
Artificial Intelligence
, vol.101
, pp. 99-134
-
-
Kaelbling, L.P.1
Littman, M.L.2
Cassandra, A.R.3
-
37
-
-
51649116802
-
Neural correlates, computation and behavioural impact of decision confidence
-
PMID: 18690210
-
Kepecs A, Uchida N, Zariwala HA, Mainen ZF. 2008. Neural correlates, computation and behavioural impact of decision confidence. Nature 455:227–231. doi: 10.1038/nature07200, PMID: 18690210.
-
(2008)
Nature
, vol.455
, pp. 227-231
-
-
Kepecs, A.1
Uchida, N.2
Zariwala, H.A.3
Mainen, Z.F.4
-
38
-
-
41149144550
-
Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment
-
PMID: 18354005
-
Kiani R, Hanks TD, Shadlen MN. 2008. Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment. Journal of Neuroscience 28:3017–3029. doi: 10.1523/JNEUROSCI.4761-07.2008, PMID: 18354005.
-
(2008)
Journal of Neuroscience
, vol.28
, pp. 3017-3029
-
-
Kiani, R.1
Hanks, T.D.2
Shadlen, M.N.3
-
39
-
-
65649149780
-
Representation of confidence associated with a decision by neurons in the parietal cortex
-
PMID: 19423820
-
Kiani R, Shadlen MN. 2009. Representation of confidence associated with a decision by neurons in the parietal cortex. Science 324:759–764. doi: 10.1126/science.1169405, PMID: 19423820.
-
(2009)
Science
, vol.324
, pp. 759-764
-
-
Kiani, R.1
Shadlen, M.N.2
-
40
-
-
85083951076
-
Adam: A method for stochastic optimization
-
arXiv
-
Kingma DP, Ba JL. 2015. Adam: A method for stochastic optimization. Int. Conf. Learn. Represent. arXiv. https://arxiv.org/abs/1412.6980.
-
(2015)
Int. Conf. Learn. Represent
-
-
Kingma, D.P.1
Ba, J.L.2
-
41
-
-
84879686840
-
Robust timing and motor patterns by taming chaos in recurrent neural networks
-
PMID: 23708144
-
Laje R, Buonomano DV. 2013. Robust timing and motor patterns by taming chaos in recurrent neural networks. Nature Neuroscience 16:925–933. doi: 10.1038/nn.3405, PMID: 23708144.
-
(2013)
Nature Neuroscience
, vol.16
, pp. 925-933
-
-
Laje, R.1
Buonomano, D.V.2
-
42
-
-
84907966818
-
Orbitofrontal cortex is required for optimal waiting based on decision confidence
-
PMID: 25242219
-
Lak A, Costa GM, Romberg E, Koulakov AA, Mainen ZF, Kepecs A. 2014. Orbitofrontal cortex is required for optimal waiting based on decision confidence. Neuron 84:190–201. doi: 10.1016/j.neuron.2014.08.039, PMID: 25242219.
-
(2014)
Neuron
, vol.84
, pp. 190-201
-
-
Lak, A.1
Costa, G.M.2
Romberg, E.3
Koulakov, A.A.4
Mainen, Z.F.5
Kepecs, A.6
-
43
-
-
79955721719
-
Signals in human striatum are appropriate for policy update rather than value prediction
-
PMID: 21471387
-
Li J, Daw ND. 2011. Signals in human striatum are appropriate for policy update rather than value prediction. Journal of Neuroscience 31:5504–5511. doi: 10.1523/JNEUROSCI.6316-10.2011, PMID: 21471387.
-
(2011)
Journal of Neuroscience
, vol.31
, pp. 5504-5511
-
-
Li, J.1
Daw, N.D.2
-
44
-
-
84994417427
-
Random synaptic feedback weights support error backpropagation for deep learning
-
PMID: 27 824044
-
Lillicrap TP, Cownden D, Tweed DB, Akerman CJ. 2016. Random synaptic feedback weights support error backpropagation for deep learning. Nature Communications 7:13276. doi: 10.1038/ncomms13276, PMID: 27 824044.
-
(2016)
Nature Communications
, vol.7
, pp. 13276
-
-
Lillicrap, T.P.1
Cownden, D.2
Tweed, D.B.3
Akerman, C.J.4
-
45
-
-
74849088855
-
Functional, but not anatomical, separation of “what” and “when” in prefrontal cortex
-
PMID: 20053 916
-
Machens CK, Romo R, Brody CD. 2010. Functional, but not anatomical, separation of “what” and “when” in prefrontal cortex. Journal of Neuroscience 30:350–360. doi: 10.1523/JNEUROSCI.3276-09.2010, PMID: 20053 916.
-
(2010)
Journal of Neuroscience
, vol.30
, pp. 350-360
-
-
Machens, C.K.1
Romo, R.2
Brody, C.D.3
-
46
-
-
77949897253
-
Two-factor theory, the actor-critic model, and conditioned avoidance
-
PMID: 20065349
-
Maia TV. 2010. Two-factor theory, the actor-critic model, and conditioned avoidance. Learning & Behavior 38: 50–67. doi: 10.3758/LB.38.1.50, PMID: 20065349.
-
(2010)
Learning & Behavior
, vol.38
, pp. 50-67
-
-
Maia, T.V.1
-
47
-
-
84887390404
-
Context-dependent computation by recurrent dynamics in prefrontal cortex
-
PMID: 24201281
-
Mante V, Sussillo D, Shenoy KV, Newsome WT. 2013. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature 503:78–84. doi: 10.1038/nature12742, PMID: 24201281.
-
(2013)
Nature
, vol.503
, pp. 78-84
-
-
Mante, V.1
Sussillo, D.2
Shenoy, K.V.3
Newsome, W.T.4
-
51
-
-
0142219854
-
A role for neural integrators in perceptual decision making
-
PMID: 14576217
-
Mazurek ME, Roitman JD, Ditterich J, Shadlen MN. 2003. A role for neural integrators in perceptual decision making. Cerebral Cortex 13:1257–1269. doi: 10.1093/cercor/bhg097, PMID: 14576217.
-
(2003)
Cerebral Cortex
, vol.13
, pp. 1257-1269
-
-
Mazurek, M.E.1
Roitman, J.D.2
Ditterich, J.3
Shadlen, M.N.4
-
54
-
-
84971448181
-
-
arXiv
-
Mnih V, Mirza M, Graves A, Harley T, Lillicrap TP, Silver D. 2016. Asynchronous methods for deep reinforcement learning. arXiv. http://arxiv.org/abs/1602.01783.
-
(2016)
Asynchronous Methods for Deep Reinforcement Learning
-
-
Mnih, V.1
Mirza, M.2
Graves, A.3
Harley, T.4
Lillicrap, T.P.5
Silver, D.6
-
56
-
-
1942520195
-
Dissociable roles of ventral and dorsal striatum in instrumental conditioning
-
PMID: 15087550
-
O’Doherty J, Dayan P, Schultz J, Deichmann R, Friston K, Dolan RJ. 2004. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 304:452–454. doi: 10.1126/science.1094285, PMID: 15087550.
-
(2004)
Science
, vol.304
, pp. 452-454
-
-
O’Doherty, J.1
Dayan, P.2
Schultz, J.3
Deichmann, R.4
Friston, K.5
Dolan, R.J.6
-
57
-
-
33646566317
-
Neurons in the orbitofrontal cortex encode economic value
-
PMID: 16633341
-
Padoa-Schioppa C, Assad JA. 2006. Neurons in the orbitofrontal cortex encode economic value. Nature 441: 223–226. doi: 10.1038/nature04676, PMID: 16633341.
-
(2006)
Nature
, vol.441
, pp. 223-226
-
-
Padoa-Schioppa, C.1
Assad, J.A.2
-
60
-
-
44949241322
-
Reinforcement learning of motor skills with policy gradients
-
PMID: 18482830
-
Peters J, Schaal S. 2008. Reinforcement learning of motor skills with policy gradients. Neural Networks 21:682–697. doi: 10.1016/j.neunet.2008.02.003, PMID: 18482830.
-
(2008)
Neural Networks
, vol.21
, pp. 682-697
-
-
Peters, J.1
Schaal, S.2
-
61
-
-
84959861453
-
Recurrent network models of sequence generation and memory
-
Rajan K, Harvey CD, Tank DW. 2015. Recurrent network models of sequence generation and memory. Neuron 90:128–142. doi: 10.1016/j.neuron.2016.02.009.
-
(2015)
Neuron
, vol.90
, pp. 128-142
-
-
Rajan, K.1
Harvey, C.D.2
Tank, D.W.3
-
63
-
-
79960241771
-
Decision making under uncertainty: A neural model based on partially observable markov decision processes
-
PMID: 21152255
-
Rao RP. 2010. Decision making under uncertainty: a neural model based on partially observable markov decision processes. Frontiers in Computational Neuroscience 4:146. doi: 10.3389/fncom.2010.00146, PMID: 21152255.
-
(2010)
Frontiers in Computational Neuroscience
, vol.4
, pp. 146
-
-
Rao, R.P.1
-
64
-
-
84858019519
-
Multisensory decision-making in rats and humans
-
PMID: 22423093
-
Raposo D, Sheppard JP, Schrater PR, Churchland AK. 2012. Multisensory decision-making in rats and humans. Journal of Neuroscience 32:3726–3735. doi: 10.1523/JNEUROSCI.4998-11.2012, PMID: 22423093.
-
(2012)
Journal of Neuroscience
, vol.32
, pp. 3726-3735
-
-
Raposo, D.1
Sheppard, J.P.2
Schrater, P.R.3
Churchland, A.K.4
-
65
-
-
84926204605
-
A category-free neural population supports evolving demands during decision-making
-
PMID: 25383902
-
Raposo D, Kaufman MT, Churchland AK. 2014. A category-free neural population supports evolving demands during decision-making. Nature Neuroscience 17:1784–1792. doi: 10.1038/nn.3865, PMID: 25383902.
-
(2014)
Nature Neuroscience
, vol.17
, pp. 1784-1792
-
-
Raposo, D.1
Kaufman, M.T.2
Churchland, A.K.3
-
66
-
-
84863881230
-
Internal representation of task rules by recurrent dynamics: The importance of the diversity of neural responses
-
PMID: 21048899
-
Rigotti M, Ben Dayan Rubin D, Wang XJ, Fusi S. 2010. Internal representation of task rules by recurrent dynamics: the importance of the diversity of neural responses. Frontiers in Computational Neuroscience 4:24. doi: 10.3389/fncom.2010.00024, PMID: 21048899.
-
(2010)
Frontiers in Computational Neuroscience
, vol.4
, pp. 24
-
-
Rigotti, M.1
Ben Dayan Rubin, D.2
Wang, X.J.3
Fusi, S.4
-
67
-
-
84878390558
-
The importance of mixed selectivity in complex cognitive tasks
-
PMID: 23685452
-
Rigotti M, Barak O, Warden MR, Wang XJ, Daw ND, Miller EK, Fusi S. 2013. The importance of mixed selectivity in complex cognitive tasks. Nature 497:585–590. doi: 10.1038/nature12160, PMID: 23685452.
-
(2013)
Nature
, vol.497
, pp. 585-590
-
-
Rigotti, M.1
Barak, O.2
Warden, M.R.3
Wang, X.J.4
Daw, N.D.5
Miller, E.K.6
Fusi, S.7
-
68
-
-
0036850727
-
Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task
-
PMID: 12417672
-
Roitman JD, Shadlen MN. 2002. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. Journal of Neuroscience 22:9475–9489. PMID: 12417672.
-
(2002)
Journal of Neuroscience
, vol.22
, pp. 9475-9489
-
-
Roitman, J.D.1
Shadlen, M.N.2
-
69
-
-
0033519704
-
Neuronal correlates of parametric working memory in the prefrontal cortex
-
PMID: 10365959
-
Romo R, Brody CD, Hernández A, Lemus L. 1999. Neuronal correlates of parametric working memory in the prefrontal cortex. Nature 399:470–473. doi: 10.1038/20939, PMID: 10365959.
-
(1999)
Nature
, vol.399
, pp. 470-473
-
-
Romo, R.1
Brody, C.D.2
Hernández, A.3
Lemus, L.4
-
70
-
-
0000646059
-
Learning internal representations by error propagation
-
Rumelhart DE, McClelland JL (Eds), Cambridge, MA: MIT Press
-
Rumelhart DE, Hinton GE, Williams RJ. 1986. Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL (Eds). Parallel Distributed Processing. Cambridge, MA: MIT Press. 1 p 318–362.
-
(1986)
Parallel Distributed Processing
, vol.1
, pp. 318-362
-
-
Rumelhart, D.E.1
Hinton, G.E.2
Williams, R.J.3
-
72
-
-
83155184719
-
Does the orbitofrontal cortex signal value?
-
PMID: 22145878
-
Schoenbaum G, Takahashi Y, Liu TL, McDannald MA. 2011. Does the orbitofrontal cortex signal value? Annals of the New York Academy of Sciences 1239:87–99. doi: 10.1111/j.1749-6632.2011.06210.x, PMID: 22145878.
-
(2011)
Annals of the New York Academy of Sciences
, vol.1239
, pp. 87-99
-
-
Schoenbaum, G.1
Takahashi, Y.2
Liu, T.L.3
McDannald, M.A.4
-
73
-
-
0030896968
-
A neural substrate of prediction and reward
-
PMID: 9054347
-
Schultz W, Dayan P, Montague PR. 1997. A neural substrate of prediction and reward. Science 275:1593–1599. doi: 10.1126/science.275.5306.1593, PMID: 9054347.
-
(1997)
Science
, vol.275
, pp. 1593-1599
-
-
Schultz, W.1
Dayan, P.2
Montague, P.R.3
-
74
-
-
0034061495
-
Reward processing in primate orbitofrontal cortex and basal ganglia
-
PMID: 10731222
-
Schultz W, Tremblay L, Hollerman JR. 2000. Reward processing in primate orbitofrontal cortex and basal ganglia. Cerebral Cortex 10:272–283. doi: 10.1093/cercor/10.3.272, PMID: 10731222.
-
(2000)
Cerebral Cortex
, vol.10
, pp. 272-283
-
-
Schultz, W.1
Tremblay, L.2
Hollerman, J.R.3
-
75
-
-
0347362917
-
Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
-
PMID: 14687542
-
Seung HS. 2003. Learning in spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron 40:1063–1073. doi: 10.1016/S0896-6273(03)00761-X, PMID: 14687542.
-
(2003)
Neuron
, vol.40
, pp. 1063-1073
-
-
Seung, H.S.1
-
76
-
-
33748999594
-
Neural mechanism for stochastic behaviour during a competitive game
-
PMID: 17015181
-
Soltani A, Lee D, Wang XJ. 2006. Neural mechanism for stochastic behaviour during a competitive game. Neural Networks 19:1075–1090. doi: 10.1016/j.neunet.2006.05.044, PMID: 17015181.
-
(2006)
Neural Networks
, vol.19
, pp. 1075-1090
-
-
Soltani, A.1
Lee, D.2
Wang, X.J.3
-
77
-
-
73949142431
-
Synaptic computation underlying probabilistic inference
-
PMID: 20010823
-
Soltani A, Wang XJ. 2010. Synaptic computation underlying probabilistic inference. Nature Neuroscience 13: 112–119. doi: 10.1038/nn.2450, PMID: 20010823.
-
(2010)
Nature Neuroscience
, vol.13
, pp. 112-119
-
-
Soltani, A.1
Wang, X.J.2
-
78
-
-
84959494188
-
Training excitatory-inhibitory recurrent neural networks for cognitive tasks: A simple and flexible framework
-
PMID: 26928718
-
Song HF, Yang GR, Wang XJ. 2016. Training excitatory-inhibitory recurrent neural networks for cognitive tasks: A simple and flexible framework. PLoS Computational Biology 12:e1004792. doi: 10.1371/journal.pcbi.1004792, PMID: 26928718.
-
(2016)
Plos Computational Biology
, vol.12
-
-
Song, H.F.1
Yang, G.R.2
Wang, X.J.3
-
79
-
-
84928639374
-
What the orbitofrontal cortex does not do
-
PMID: 25919962
-
Stalnaker TA, Cooch NK, Schoenbaum G. 2015. What the orbitofrontal cortex does not do. Nature Neuroscience 18:620–627. doi: 10.1038/nn.3982, PMID: 25919962.
-
(2015)
Nature Neuroscience
, vol.18
, pp. 620-627
-
-
Stalnaker, T.A.1
Cooch, N.K.2
Schoenbaum, G.3
-
80
-
-
17844396920
-
Choosing the greater of two goods: Neural currencies for valuation and decision making
-
PMID: 15832198
-
Sugrue LP, Corrado GS, Newsome WT. 2005. Choosing the greater of two goods: neural currencies for valuation and decision making. Nature Reviews Neuroscience 6:363–375. doi: 10.1038/nrn1666, PMID: 15832198.
-
(2005)
Nature Reviews Neuroscience
, vol.6
, pp. 363-375
-
-
Sugrue, L.P.1
Corrado, G.S.2
Newsome, W.T.3
-
81
-
-
68949147577
-
Generating coherent patterns of activity from chaotic neural networks
-
PMID: 19709635
-
Sussillo D, Abbott LF. 2009. Generating coherent patterns of activity from chaotic neural networks. Neuron 63: 544–557. doi: 10.1016/j.neuron.2009.07.018, PMID: 19709635.
-
(2009)
Neuron
, vol.63
, pp. 544-557
-
-
Sussillo, D.1
Abbott, L.F.2
-
82
-
-
84877827546
-
Opening the black box: Low-dimensional dynamics in high-dimensional recurrent neural networks
-
PMID: 23272922
-
Sussillo D, Barak O. 2013. Opening the black box: low-dimensional dynamics in high-dimensional recurrent neural networks. Neural Computation 25:626–649. doi: 10.1162/NECO_a_00409, PMID: 23272922.
-
(2013)
Neural Computation
, vol.25
, pp. 626-649
-
-
Sussillo, D.1
Barak, O.2
-
83
-
-
84893503924
-
Neural circuits as computational dynamical systems
-
PMID: 24509098
-
Sussillo D. 2014. Neural circuits as computational dynamical systems. Current Opinion in Neurobiology 25:156–163. doi: 10.1016/j.conb.2014.01.008, PMID: 24509098.
-
(2014)
Current Opinion in Neurobiology
, vol.25
, pp. 156-163
-
-
Sussillo, D.1
-
84
-
-
84933280082
-
A neural network that finds a naturalistic solution for the production of muscle activity
-
PMID: 26075643
-
Sussillo D, Churchland MM, Kaufman MT, Shenoy KV. 2015. A neural network that finds a naturalistic solution for the production of muscle activity. Nature Neuroscience 18:1025–1033. doi: 10.1038/nn.4042, PMID: 26075643.
-
(2015)
Nature Neuroscience
, vol.18
, pp. 1025-1033
-
-
Sussillo, D.1
Churchland, M.M.2
Kaufman, M.T.3
Shenoy, K.V.4
-
86
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
Sutton RS, Mcallester D, Singh S, Mansour Y. 2000. Policy gradient methods for reinforcement learning with function approximation. Advances in neural information processing systems 12:1057–1063 http://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximation.pdf.
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 1057-1063
-
-
Sutton, R.S.1
McAllester, D.2
Singh, S.3
Mansour, Y.4
-
87
-
-
67651147037
-
Silencing the critics: Understanding the effects of cocaine sensitization on dorsolateral and ventral striatum in the context of an actor/critic model
-
PMID: 18982111
-
Takahashi Y, Schoenbaum G, Niv Y. 2008. Silencing the critics: understanding the effects of cocaine sensitization on dorsolateral and ventral striatum in the context of an actor/critic model. Frontiers in Neuroscience 2:86–99. doi: 10.3389/neuro.01.014.2008, PMID: 18982111.
-
(2008)
Frontiers in Neuroscience
, vol.2
, pp. 86-99
-
-
Takahashi, Y.1
Schoenbaum, G.2
Niv, Y.3
-
88
-
-
82255179147
-
Expectancy-related changes in firing of dopamine neurons depend on orbitofrontal cortex
-
PMID: 22037501
-
Takahashi YK, Roesch MR, Wilson RC, Toreson K, O’Donnell P, Niv Y, Schoenbaum G. 2011. Expectancy-related changes in firing of dopamine neurons depend on orbitofrontal cortex. Nature Neuroscience 14:1590–1597. doi: 10.1038/nn.2957, PMID: 22037501.
-
(2011)
Nature Neuroscience
, vol.14
, pp. 1590-1597
-
-
Takahashi, Y.K.1
Roesch, M.R.2
Wilson, R.C.3
Toreson, K.4
O’Donnell, P.5
Niv, Y.6
Schoenbaum, G.7
-
90
-
-
77549088095
-
Learning to use working memory in partially observable environments through dopaminergic reinforcement
-
Todd MT, Niv Y, Cohen JD. 2008. Learning to use working memory in partially observable environments through dopaminergic reinforcement. Advances in Neural Information Processing Systems. http://papers.nips.cc/paper/3508-learning-to-use-working-memory-in-partially-observable-environments-through-dopaminergic-reinforcement.pdf.
-
(2008)
Advances in Neural Information Processing Systems
-
-
Todd, M.T.1
Niv, Y.2
Cohen, J.D.3
-
91
-
-
78651481149
-
Basal ganglia contributions to motor control: A vigorous tutor
-
PMID: 20850966
-
Turner RS, Desmurget M. 2010. Basal ganglia contributions to motor control: a vigorous tutor. Current Opinion in Neurobiology 20:704–716. doi: 10.1016/j.conb.2010.08.022, PMID: 20850966.
-
(2010)
Current Opinion in Neurobiology
, vol.20
, pp. 704-716
-
-
Turner, R.S.1
Desmurget, M.2
-
92
-
-
60749100305
-
Reinforcement learning in populations of spiking neurons
-
PMID: 19219040
-
Urbanczik R, Senn W. 2009. Reinforcement learning in populations of spiking neurons. Nature Neuroscience 12: 250–252. doi: 10.1038/nn.2264, PMID: 19219040.
-
(2009)
Nature Neuroscience
, vol.12
, pp. 250-252
-
-
Urbanczik, R.1
Senn, W.2
-
93
-
-
34547674638
-
Orbitofrontal cortex and its contribution to decision-making
-
PMID: 17417936
-
Wallis JD. 2007. Orbitofrontal cortex and its contribution to decision-making. Annual Review of Neuroscience 30:31–56. doi: 10.1146/annurev.neuro.30.051606.094334, PMID: 17417936.
-
(2007)
Annual Review of Neuroscience
, vol.30
, pp. 31-56
-
-
Wallis, J.D.1
-
94
-
-
0037028039
-
Probabilistic decision making by slow reverberation in cortical circuits
-
PMID: 12467598
-
Wang XJ. 2002. Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36:955–968. doi: 10.1016/S0896-6273(02)01092-9, PMID: 12467598.
-
(2002)
Neuron
, vol.36
, pp. 955-968
-
-
Wang, X.J.1
-
95
-
-
53849125053
-
Decision making in recurrent neuronal circuits
-
PMID: 18957215
-
Wang XJ. 2008. Decision making in recurrent neuronal circuits. Neuron 60:215–234. doi: 10.1016/j.neuron.2008.09.034, PMID: 18957215.
-
(2008)
Neuron
, vol.60
, pp. 215-234
-
-
Wang, X.J.1
-
96
-
-
84939794227
-
Confidence estimation as a stochastic process in a neurodynamical system of decision making
-
PMID: 25948870
-
Wei Z, Wang XJ. 2015. Confidence estimation as a stochastic process in a neurodynamical system of decision making. Journal of Neurophysiology 114:99–113. doi: 10.1152/jn.00793.2014, PMID: 25948870.
-
(2015)
Journal of Neurophysiology
, vol.114
, pp. 99-113
-
-
Wei, Z.1
Wang, X.J.2
-
98
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Williams RJ. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8:229–256. doi: 10.1007/BF00992696.
-
(1992)
Machine Learning
, vol.8
, pp. 229-256
-
-
Williams, R.J.1
-
99
-
-
32544439341
-
A recurrent network mechanism of time integration in perceptual decisions
-
PMID: 16436619
-
Wong KF, Wang XJ. 2006. A recurrent network mechanism of time integration in perceptual decisions. Journal of Neuroscience 26:1314–1328. doi: 10.1523/JNEUROSCI.3733-05.2006, PMID: 16436619.
-
(2006)
Journal of Neuroscience
, vol.26
, pp. 1314-1328
-
-
Wong, K.F.1
Wang, X.J.2
-
100
-
-
84970002232
-
Show, attend and tell: Neural image caption generation with visual attention
-
Xu K, Ba JL, Kiros R, Cho K, Courville A, Salakhutdinov R, Zemel RS, Bengio Y. 2015. Show, attend and tell: Neural image caption generation with visual attention. Proceedings of the 32 nd International Conference on Machine Learning. http://jmlr.org/proceedings/papers/v37/xuc15.pdf.
-
(2015)
Proceedings of the 32 Nd International Conference on Machine Learning
-
-
Xu, K.1
Ba, J.L.2
Kiros, R.3
Cho, K.4
Courville, A.5
Salakhutdinov, R.6
Zemel, R.S.7
Bengio, Y.8
-
101
-
-
84902213589
-
Performance-optimized hierarchical models predict neural responses in higher visual cortex
-
PMID: 24812127
-
Yamins DL, Hong H, Cadieu CF, Solomon EA, Seibert D, DiCarlo JJ. 2014. Performance-optimized hierarchical models predict neural responses in higher visual cortex. PNAS 111:8619–8624. doi: 10.1073/pnas.1403112111, PMID: 24812127.
-
(2014)
PNAS
, vol.111
, pp. 8619-8624
-
-
Yamins, D.L.1
Hong, H.2
Cadieu, C.F.3
Solomon, E.A.4
Seibert, D.5
Dicarlo, J.J.6
-
103
-
-
0023877474
-
A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons
-
PMID: 3344044
-
Zipser D, Andersen RA. 1988. A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons. Nature 331:679–684. doi: 10.1038/331679a0, PMID: 3344044
-
(1988)
Nature
, vol.331
, pp. 679-684
-
-
Zipser, D.1
Ersen, R.A.2
|