메뉴 건너뛰기




Volumn 6, Issue , 2017, Pages

Reward-based training of recurrent neural networks for cognitive and value-based tasks

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING; MODEL; NERVOUS SYSTEM; REWARD; ANIMAL; ANIMAL BEHAVIOR; BIOLOGICAL MODEL; COGNITION; CONDITIONING; DECISION MAKING;

EID: 85012005486     PISSN: None     EISSN: 2050084X     Source Type: Journal    
DOI: 10.7554/eLife.21492     Document Type: Article
Times cited : (120)

References (103)
  • 1
    • 84898958374 scopus 로고    scopus 로고
    • Gradient descent for general reinforcement learning
    • Baird L, Moore A. 1999. Gradient descent for general reinforcement learning. Advances in Neural Information Processing Systems 11:968–974 https://papers.nips.cc/paper/1576-gradient-descent-for-general-reinforcement-learning.pdf.
    • (1999) Advances in Neural Information Processing Systems , vol.11 , pp. 968-974
    • Baird, L.1    Moore, A.2
  • 2
    • 84875920967 scopus 로고    scopus 로고
    • From fixed points to chaos: Three models of delayed discrimination
    • PMID: 23438479
    • Barak O, Sussillo D, Romo R, Tsodyks M, Abbott LF. 2013. From fixed points to chaos: three models of delayed discrimination. Progress in Neurobiology 103:214–222. doi: 10.1016/j.pneurobio.2013.02.002, PMID: 23438479.
    • (2013) Progress in Neurobiology , vol.103 , pp. 214-222
    • Barak, O.1    Sussillo, D.2    Romo, R.3    Tsodyks, M.4    Abbott, L.F.5
  • 5
    • 21544435722 scopus 로고    scopus 로고
    • Midbrain dopamine neurons encode a quantitative reward prediction error signal
    • PMID: 15996553
    • Bayer HM, Glimcher PW. 2005. Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47:129–141. doi: 10.1016/j.neuron.2005.05.020, PMID: 15996553.
    • (2005) Neuron , vol.47 , pp. 129-141
    • Bayer, H.M.1    Glimcher, P.W.2
  • 6
    • 0028392483 scopus 로고
    • Learning long-term dependencies with gradient descent is difficult
    • PMID: 18267787
    • Bengio Y, Simard P, Frasconi P. 1994. Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks 5:157–166. doi: 10.1109/72.279181, PMID: 18267787.
    • (1994) IEEE Transactions on Neural Networks , vol.5 , pp. 157-166
    • Bengio, Y.1    Simard, P.2    Frasconi, P.3
  • 7
    • 79952006391 scopus 로고    scopus 로고
    • A reservoir of time constants for memory traces in cortical neurons
    • PMID: 21317906
    • Bernacchia A, Seo H, Lee D, Wang XJ. 2011. A reservoir of time constants for memory traces in cortical neurons. Nature Neuroscience 14:366–372. doi: 10.1038/nn.2752, PMID: 21317906.
    • (2011) Nature Neuroscience , vol.14 , pp. 366-372
    • Bernacchia, A.1    Seo, H.2    Lee, D.3    Wang, X.J.4
  • 8
    • 84971251166 scopus 로고    scopus 로고
    • Does computational neuroscience need new synaptic learning paradigms?
    • Brea J, Gerstner W. 2016. Does computational neuroscience need new synaptic learning paradigms? Current Opinion in Behavioral Sciences 11:61–66. doi: 10.1016/j.cobeha.2016.05.012.
    • (2016) Current Opinion in Behavioral Sciences , vol.11 , pp. 61-66
    • Brea, J.1    Gerstner, W.2
  • 9
    • 84946086073 scopus 로고    scopus 로고
    • Reinforcement learning of linking and tracing contours in recurrent neural networks.
    • Brosch T, Neumann H, Roelfsema PR. 2015. Reinforcement learning of linking and tracing contours in recurrent neural networks.. PLoS Computational Biology 11:e1004489. doi: 10.1371/journal.pcbi.1004489.
    • (2015) Plos Computational Biology , vol.11
    • Brosch, T.1    Neumann, H.2    Roelfsema, P.R.3
  • 11
    • 84930243913 scopus 로고    scopus 로고
    • Dynamic control of response criterion in premotor cortex during perceptual detection under temporal uncertainty
    • PMID: 25959731
    • Carnevale F, de Lafuente V, Romo R, Barak O, Parga N. 2015. Dynamic control of response criterion in premotor cortex during perceptual detection under temporal uncertainty. Neuron 86:1067–1077. doi: 10.1016/j.neuron.2015.04.014, PMID: 25959731.
    • (2015) Neuron , vol.86 , pp. 1067-1077
    • Carnevale, F.1    De Lafuente, V.2    Romo, R.3    Barak, O.4    Parga, N.5
  • 14
    • 0037057808 scopus 로고    scopus 로고
    • Reward, motivation, and reinforcement learning
    • PMID: 12383782
    • Dayan P, Balleine BW. 2002. Reward, motivation, and reinforcement learning. Neuron 36:285–298. doi: 10.1016/S0896-6273(02)00963-7, PMID: 12383782.
    • (2002) Neuron , vol.36 , pp. 285-298
    • Dayan, P.1    Balleine, B.W.2
  • 15
    • 60749114870 scopus 로고    scopus 로고
    • Decision theory, reinforcement learning, and the brain. Cognitive
    • PMID: 19033240
    • Dayan P, Daw ND. 2008. Decision theory, reinforcement learning, and the brain. Cognitive, Affective, & Behavioral Neuroscience 8:429–453. doi: 10.3758/CABN.8.4.429, PMID: 19033240.
    • (2008) Affective, & Behavioral Neuroscience , vol.8 , pp. 429-453
    • Dayan, P.1    Daw, N.D.2
  • 16
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • PMID: 10636940
    • Doya K. 2000. Reinforcement learning in continuous time and space. Neural Computation 12:219–245. doi: 10.1162/089976600300015961, PMID: 10636940.
    • (2000) Neural Computation , vol.12 , pp. 219-245
    • Doya, K.1
  • 17
    • 77049204393 scopus 로고
    • Cholinergic and inhibitory synapses in a pathway from motor-axon collaterals to motoneurones
    • PMID: 13222354
    • Eccles JC, Fatt P, Koketsu K. 1954. Cholinergic and inhibitory synapses in a pathway from motor-axon collaterals to motoneurones. The Journal of Physiology 126:524–562. doi: 10.1113/jphysiol.1954.sp005226, PMID: 13222354.
    • (1954) The Journal of Physiology , vol.126 , pp. 524-562
    • Eccles, J.C.1    Fatt, P.2    Koketsu, K.3
  • 18
    • 84924778329 scopus 로고    scopus 로고
    • Choice-correlated activity fluctuations underlie learning of neuronal category representation
    • PMID: 25759251
    • Engel TA, Chaisangmongkon W, Freedman DJ, Wang XJ. 2015. Choice-correlated activity fluctuations underlie learning of neuronal category representation. Nature Communications 6:6454. doi: 10.1038/ncomms7454, PMID: 25759251.
    • (2015) Nature Communications , vol.6 , pp. 6454
    • Engel, T.A.1    Chaisangmongkon, W.2    Freedman, D.J.3    Wang, X.J.4
  • 19
    • 33746652644 scopus 로고    scopus 로고
    • Gradient learning in spiking neural networks by dynamic perturbation of conductances
    • PMID: 16907616
    • Fiete IR, Seung HS. 2006. Gradient learning in spiking neural networks by dynamic perturbation of conductances. Physical Review Letters 97:048104. doi: 10.1103/PhysRevLett.97.048104, PMID: 16907616.
    • (2006) Physical Review Letters , vol.97
    • Fiete, I.R.1    Seung, H.S.2
  • 20
    • 35348872545 scopus 로고    scopus 로고
    • Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances
    • PMID: 17652414
    • Fiete IR, Fee MS, Seung HS. 2007. Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. Journal of Neurophysiology 98:2038–2057. doi: 10.1152/jn.01311.2006, PMID: 17652414.
    • (2007) Journal of Neurophysiology , vol.98 , pp. 2038-2057
    • Fiete, I.R.1    Fee, M.S.2    Seung, H.S.3
  • 21
    • 33744550336 scopus 로고    scopus 로고
    • Anatomy of a decision: Striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal
    • PMID: 16637763
    • Frank MJ, Claus ED. 2006. Anatomy of a decision: striato-orbitofrontal interactions in reinforcement learning, decision making, and reversal. Psychological Review 113:300–326. doi: 10.1037/0033-295X.113.2.300, PMID: 16637763.
    • (2006) Psychological Review , vol.113 , pp. 300-326
    • Frank, M.J.1    Claus, E.D.2
  • 22
    • 84957812846 scopus 로고    scopus 로고
    • Goal-Directed decision making with spiking neurons
    • PMID: 26843636
    • Friedrich J, Lengyel M. 2016. Goal-Directed decision making with spiking neurons. Journal of Neuroscience 36: 1529–1546. doi: 10.1523/JNEUROSCI.2854-15.2016, PMID: 26843636.
    • (2016) Journal of Neuroscience , vol.36 , pp. 1529-1546
    • Friedrich, J.1    Lengyel, M.2
  • 23
    • 77957731196 scopus 로고    scopus 로고
    • Functional requirements for reward-modulated spike-timing-dependent plasticity
    • PMID: 20926659
    • Frémaux N, Sprekeler H, Gerstner W. 2010. Functional requirements for reward-modulated spike-timing-dependent plasticity. Journal of Neuroscience 30:13326–13337. doi: 10.1523/JNEUROSCI.6249-09.2010, PMID: 20926659.
    • (2010) Journal of Neuroscience , vol.30 , pp. 13326-13337
    • Frémaux, N.1    Sprekeler, H.2    Gerstner, W.3
  • 24
    • 84928527851 scopus 로고    scopus 로고
    • On simplicity and complexity in the brave new world of large-scale neuroscience
    • PMID: 25932978
    • Gao P, Ganguli S. 2015. On simplicity and complexity in the brave new world of large-scale neuroscience. Current Opinion in Neurobiology 32:148–155. doi: 10.1016/j.conb.2015.04.003, PMID: 25932978.
    • (2015) Current Opinion in Neurobiology , vol.32 , pp. 148-155
    • Gao, P.1    Ganguli, S.2
  • 25
    • 34347361793 scopus 로고    scopus 로고
    • The neural basis of decision making
    • PMID: 17600525
    • Gold JI, Shadlen MN. 2007. The neural basis of decision making. Annual Review of Neuroscience 30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038, PMID: 17600525.
    • (2007) Annual Review of Neuroscience , vol.30 , pp. 535-574
    • Gold, J.I.1    Shadlen, M.N.2
  • 28
    • 84902438429 scopus 로고    scopus 로고
    • Optimal control of transient dynamics in balanced networks supports generation of complex movements
    • PMID: 24945778
    • Hennequin G, Vogels TP, Gerstner W. 2014. Optimal control of transient dynamics in balanced networks supports generation of complex movements. Neuron 82:1394–1406. doi: 10.1016/j.neuron.2014.04.045, PMID: 24945778.
    • (2014) Neuron , vol.82 , pp. 1394-1406
    • Hennequin, G.1    Vogels, T.P.2    Gerstner, W.3
  • 29
    • 84904700222 scopus 로고    scopus 로고
    • Basal ganglia circuits for reward value-guided behavior
    • PMID: 25032497
    • Hikosaka O, Kim HF, Yasuda M, Yamamoto S. 2014. Basal ganglia circuits for reward value-guided behavior. Annual Review of Neuroscience 37:289–306. doi: 10.1146/annurev-neuro-071013-013924, PMID: 25032497.
    • (2014) Annual Review of Neuroscience , vol.37 , pp. 289-306
    • Hikosaka, O.1    Kim, H.F.2    Yasuda, M.3    Yamamoto, S.4
  • 30
    • 84894276345 scopus 로고    scopus 로고
    • Emergence of complex computational structures from chaotic neural networks through reward-modulated hebbian learning
    • PMID: 23146969
    • Hoerzer GM, Legenstein R, Maass W. 2014. Emergence of complex computational structures from chaotic neural networks through reward-modulated hebbian learning. Cerebral Cortex 24:677–690. doi: 10.1093/cercor/bhs348, PMID: 23146969.
    • (2014) Cerebral Cortex , vol.24 , pp. 677-690
    • Hoerzer, G.M.1    Legenstein, R.2    Maass, W.3
  • 31
    • 84959087247 scopus 로고    scopus 로고
    • Explicit information for category-orthogonal object properties increases along the ventral stream
    • PMID: 26900926
    • Hong H, Yamins DL, Majaj NJ, DiCarlo JJ. 2016. Explicit information for category-orthogonal object properties increases along the ventral stream. Nature Neuroscience 19:613–622. doi: 10.1038/nn.4247, PMID: 26900926.
    • (2016) Nature Neuroscience , vol.19 , pp. 613-622
    • Hong, H.1    Yamins, D.L.2    Majaj, N.J.3    Dicarlo, J.J.4
  • 32
    • 0002861883 scopus 로고
    • A model of how the basal ganglia generates and uses neural signals that predict reinforcement
    • Houk J. C, Davis J. L, Beisberb D. G, Cambridge, MA: MIT Press
    • Houk JC, Adams JL, Barto AG. 1995. A model of how the basal ganglia generates and uses neural signals that predict reinforcement. In: Houk J. C, Davis J. L, Beisberb D. G (Eds). Models of Information Processing in the Basal Ganglia. Cambridge, MA: MIT Press. p 249–274.
    • (1995) Models of Information Processing in the Basal Ganglia , pp. 249-274
    • Houk, J.C.1    Adams, J.L.2    Barto, A.G.3
  • 33
    • 34948906745 scopus 로고    scopus 로고
    • Solving the distal reward problem through linkage of STDP and dopamine signaling
    • PMID: 17220510
    • Izhikevich EM. 2007. Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral Cortex 17:2443–2452. doi: 10.1093/cercor/bhl152, PMID: 17220510.
    • (2007) Cerebral Cortex , vol.17 , pp. 2443-2452
    • Izhikevich, E.M.1
  • 35
    • 0036592026 scopus 로고    scopus 로고
    • Actor-critic models of the basal ganglia: New anatomical and computational perspectives
    • PMID: 12371510
    • Joel D, Niv Y, Ruppin E. 2002. Actor-critic models of the basal ganglia: new anatomical and computational perspectives. Neural Networks 15:535–547. doi: 10.1016/S0893-6080(02)00047-3, PMID: 12371510.
    • (2002) Neural Networks , vol.15 , pp. 535-547
    • Joel, D.1    Niv, Y.2    Ruppin, E.3
  • 36
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • Kaelbling LP, Littman ML, Cassandra AR. 1998. Planning and acting in partially observable stochastic domains. Artificial Intelligence 101:99–134. doi: 10.1016/S0004-3702(98)00023-X
    • (1998) Artificial Intelligence , vol.101 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 37
    • 51649116802 scopus 로고    scopus 로고
    • Neural correlates, computation and behavioural impact of decision confidence
    • PMID: 18690210
    • Kepecs A, Uchida N, Zariwala HA, Mainen ZF. 2008. Neural correlates, computation and behavioural impact of decision confidence. Nature 455:227–231. doi: 10.1038/nature07200, PMID: 18690210.
    • (2008) Nature , vol.455 , pp. 227-231
    • Kepecs, A.1    Uchida, N.2    Zariwala, H.A.3    Mainen, Z.F.4
  • 38
    • 41149144550 scopus 로고    scopus 로고
    • Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment
    • PMID: 18354005
    • Kiani R, Hanks TD, Shadlen MN. 2008. Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment. Journal of Neuroscience 28:3017–3029. doi: 10.1523/JNEUROSCI.4761-07.2008, PMID: 18354005.
    • (2008) Journal of Neuroscience , vol.28 , pp. 3017-3029
    • Kiani, R.1    Hanks, T.D.2    Shadlen, M.N.3
  • 39
    • 65649149780 scopus 로고    scopus 로고
    • Representation of confidence associated with a decision by neurons in the parietal cortex
    • PMID: 19423820
    • Kiani R, Shadlen MN. 2009. Representation of confidence associated with a decision by neurons in the parietal cortex. Science 324:759–764. doi: 10.1126/science.1169405, PMID: 19423820.
    • (2009) Science , vol.324 , pp. 759-764
    • Kiani, R.1    Shadlen, M.N.2
  • 40
    • 85083951076 scopus 로고    scopus 로고
    • Adam: A method for stochastic optimization
    • arXiv
    • Kingma DP, Ba JL. 2015. Adam: A method for stochastic optimization. Int. Conf. Learn. Represent. arXiv. https://arxiv.org/abs/1412.6980.
    • (2015) Int. Conf. Learn. Represent
    • Kingma, D.P.1    Ba, J.L.2
  • 41
    • 84879686840 scopus 로고    scopus 로고
    • Robust timing and motor patterns by taming chaos in recurrent neural networks
    • PMID: 23708144
    • Laje R, Buonomano DV. 2013. Robust timing and motor patterns by taming chaos in recurrent neural networks. Nature Neuroscience 16:925–933. doi: 10.1038/nn.3405, PMID: 23708144.
    • (2013) Nature Neuroscience , vol.16 , pp. 925-933
    • Laje, R.1    Buonomano, D.V.2
  • 42
    • 84907966818 scopus 로고    scopus 로고
    • Orbitofrontal cortex is required for optimal waiting based on decision confidence
    • PMID: 25242219
    • Lak A, Costa GM, Romberg E, Koulakov AA, Mainen ZF, Kepecs A. 2014. Orbitofrontal cortex is required for optimal waiting based on decision confidence. Neuron 84:190–201. doi: 10.1016/j.neuron.2014.08.039, PMID: 25242219.
    • (2014) Neuron , vol.84 , pp. 190-201
    • Lak, A.1    Costa, G.M.2    Romberg, E.3    Koulakov, A.A.4    Mainen, Z.F.5    Kepecs, A.6
  • 43
    • 79955721719 scopus 로고    scopus 로고
    • Signals in human striatum are appropriate for policy update rather than value prediction
    • PMID: 21471387
    • Li J, Daw ND. 2011. Signals in human striatum are appropriate for policy update rather than value prediction. Journal of Neuroscience 31:5504–5511. doi: 10.1523/JNEUROSCI.6316-10.2011, PMID: 21471387.
    • (2011) Journal of Neuroscience , vol.31 , pp. 5504-5511
    • Li, J.1    Daw, N.D.2
  • 44
    • 84994417427 scopus 로고    scopus 로고
    • Random synaptic feedback weights support error backpropagation for deep learning
    • PMID: 27 824044
    • Lillicrap TP, Cownden D, Tweed DB, Akerman CJ. 2016. Random synaptic feedback weights support error backpropagation for deep learning. Nature Communications 7:13276. doi: 10.1038/ncomms13276, PMID: 27 824044.
    • (2016) Nature Communications , vol.7 , pp. 13276
    • Lillicrap, T.P.1    Cownden, D.2    Tweed, D.B.3    Akerman, C.J.4
  • 45
    • 74849088855 scopus 로고    scopus 로고
    • Functional, but not anatomical, separation of “what” and “when” in prefrontal cortex
    • PMID: 20053 916
    • Machens CK, Romo R, Brody CD. 2010. Functional, but not anatomical, separation of “what” and “when” in prefrontal cortex. Journal of Neuroscience 30:350–360. doi: 10.1523/JNEUROSCI.3276-09.2010, PMID: 20053 916.
    • (2010) Journal of Neuroscience , vol.30 , pp. 350-360
    • Machens, C.K.1    Romo, R.2    Brody, C.D.3
  • 46
    • 77949897253 scopus 로고    scopus 로고
    • Two-factor theory, the actor-critic model, and conditioned avoidance
    • PMID: 20065349
    • Maia TV. 2010. Two-factor theory, the actor-critic model, and conditioned avoidance. Learning & Behavior 38: 50–67. doi: 10.3758/LB.38.1.50, PMID: 20065349.
    • (2010) Learning & Behavior , vol.38 , pp. 50-67
    • Maia, T.V.1
  • 47
    • 84887390404 scopus 로고    scopus 로고
    • Context-dependent computation by recurrent dynamics in prefrontal cortex
    • PMID: 24201281
    • Mante V, Sussillo D, Shenoy KV, Newsome WT. 2013. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature 503:78–84. doi: 10.1038/nature12742, PMID: 24201281.
    • (2013) Nature , vol.503 , pp. 78-84
    • Mante, V.1    Sussillo, D.2    Shenoy, K.V.3    Newsome, W.T.4
  • 51
    • 0142219854 scopus 로고    scopus 로고
    • A role for neural integrators in perceptual decision making
    • PMID: 14576217
    • Mazurek ME, Roitman JD, Ditterich J, Shadlen MN. 2003. A role for neural integrators in perceptual decision making. Cerebral Cortex 13:1257–1269. doi: 10.1093/cercor/bhg097, PMID: 14576217.
    • (2003) Cerebral Cortex , vol.13 , pp. 1257-1269
    • Mazurek, M.E.1    Roitman, J.D.2    Ditterich, J.3    Shadlen, M.N.4
  • 56
    • 1942520195 scopus 로고    scopus 로고
    • Dissociable roles of ventral and dorsal striatum in instrumental conditioning
    • PMID: 15087550
    • O’Doherty J, Dayan P, Schultz J, Deichmann R, Friston K, Dolan RJ. 2004. Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science 304:452–454. doi: 10.1126/science.1094285, PMID: 15087550.
    • (2004) Science , vol.304 , pp. 452-454
    • O’Doherty, J.1    Dayan, P.2    Schultz, J.3    Deichmann, R.4    Friston, K.5    Dolan, R.J.6
  • 57
    • 33646566317 scopus 로고    scopus 로고
    • Neurons in the orbitofrontal cortex encode economic value
    • PMID: 16633341
    • Padoa-Schioppa C, Assad JA. 2006. Neurons in the orbitofrontal cortex encode economic value. Nature 441: 223–226. doi: 10.1038/nature04676, PMID: 16633341.
    • (2006) Nature , vol.441 , pp. 223-226
    • Padoa-Schioppa, C.1    Assad, J.A.2
  • 60
    • 44949241322 scopus 로고    scopus 로고
    • Reinforcement learning of motor skills with policy gradients
    • PMID: 18482830
    • Peters J, Schaal S. 2008. Reinforcement learning of motor skills with policy gradients. Neural Networks 21:682–697. doi: 10.1016/j.neunet.2008.02.003, PMID: 18482830.
    • (2008) Neural Networks , vol.21 , pp. 682-697
    • Peters, J.1    Schaal, S.2
  • 61
    • 84959861453 scopus 로고    scopus 로고
    • Recurrent network models of sequence generation and memory
    • Rajan K, Harvey CD, Tank DW. 2015. Recurrent network models of sequence generation and memory. Neuron 90:128–142. doi: 10.1016/j.neuron.2016.02.009.
    • (2015) Neuron , vol.90 , pp. 128-142
    • Rajan, K.1    Harvey, C.D.2    Tank, D.W.3
  • 63
    • 79960241771 scopus 로고    scopus 로고
    • Decision making under uncertainty: A neural model based on partially observable markov decision processes
    • PMID: 21152255
    • Rao RP. 2010. Decision making under uncertainty: a neural model based on partially observable markov decision processes. Frontiers in Computational Neuroscience 4:146. doi: 10.3389/fncom.2010.00146, PMID: 21152255.
    • (2010) Frontiers in Computational Neuroscience , vol.4 , pp. 146
    • Rao, R.P.1
  • 64
  • 65
    • 84926204605 scopus 로고    scopus 로고
    • A category-free neural population supports evolving demands during decision-making
    • PMID: 25383902
    • Raposo D, Kaufman MT, Churchland AK. 2014. A category-free neural population supports evolving demands during decision-making. Nature Neuroscience 17:1784–1792. doi: 10.1038/nn.3865, PMID: 25383902.
    • (2014) Nature Neuroscience , vol.17 , pp. 1784-1792
    • Raposo, D.1    Kaufman, M.T.2    Churchland, A.K.3
  • 66
    • 84863881230 scopus 로고    scopus 로고
    • Internal representation of task rules by recurrent dynamics: The importance of the diversity of neural responses
    • PMID: 21048899
    • Rigotti M, Ben Dayan Rubin D, Wang XJ, Fusi S. 2010. Internal representation of task rules by recurrent dynamics: the importance of the diversity of neural responses. Frontiers in Computational Neuroscience 4:24. doi: 10.3389/fncom.2010.00024, PMID: 21048899.
    • (2010) Frontiers in Computational Neuroscience , vol.4 , pp. 24
    • Rigotti, M.1    Ben Dayan Rubin, D.2    Wang, X.J.3    Fusi, S.4
  • 67
    • 84878390558 scopus 로고    scopus 로고
    • The importance of mixed selectivity in complex cognitive tasks
    • PMID: 23685452
    • Rigotti M, Barak O, Warden MR, Wang XJ, Daw ND, Miller EK, Fusi S. 2013. The importance of mixed selectivity in complex cognitive tasks. Nature 497:585–590. doi: 10.1038/nature12160, PMID: 23685452.
    • (2013) Nature , vol.497 , pp. 585-590
    • Rigotti, M.1    Barak, O.2    Warden, M.R.3    Wang, X.J.4    Daw, N.D.5    Miller, E.K.6    Fusi, S.7
  • 68
    • 0036850727 scopus 로고    scopus 로고
    • Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task
    • PMID: 12417672
    • Roitman JD, Shadlen MN. 2002. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. Journal of Neuroscience 22:9475–9489. PMID: 12417672.
    • (2002) Journal of Neuroscience , vol.22 , pp. 9475-9489
    • Roitman, J.D.1    Shadlen, M.N.2
  • 69
    • 0033519704 scopus 로고    scopus 로고
    • Neuronal correlates of parametric working memory in the prefrontal cortex
    • PMID: 10365959
    • Romo R, Brody CD, Hernández A, Lemus L. 1999. Neuronal correlates of parametric working memory in the prefrontal cortex. Nature 399:470–473. doi: 10.1038/20939, PMID: 10365959.
    • (1999) Nature , vol.399 , pp. 470-473
    • Romo, R.1    Brody, C.D.2    Hernández, A.3    Lemus, L.4
  • 70
    • 0000646059 scopus 로고
    • Learning internal representations by error propagation
    • Rumelhart DE, McClelland JL (Eds), Cambridge, MA: MIT Press
    • Rumelhart DE, Hinton GE, Williams RJ. 1986. Learning internal representations by error propagation. In: Rumelhart DE, McClelland JL (Eds). Parallel Distributed Processing. Cambridge, MA: MIT Press. 1 p 318–362.
    • (1986) Parallel Distributed Processing , vol.1 , pp. 318-362
    • Rumelhart, D.E.1    Hinton, G.E.2    Williams, R.J.3
  • 73
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • PMID: 9054347
    • Schultz W, Dayan P, Montague PR. 1997. A neural substrate of prediction and reward. Science 275:1593–1599. doi: 10.1126/science.275.5306.1593, PMID: 9054347.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 74
    • 0034061495 scopus 로고    scopus 로고
    • Reward processing in primate orbitofrontal cortex and basal ganglia
    • PMID: 10731222
    • Schultz W, Tremblay L, Hollerman JR. 2000. Reward processing in primate orbitofrontal cortex and basal ganglia. Cerebral Cortex 10:272–283. doi: 10.1093/cercor/10.3.272, PMID: 10731222.
    • (2000) Cerebral Cortex , vol.10 , pp. 272-283
    • Schultz, W.1    Tremblay, L.2    Hollerman, J.R.3
  • 75
    • 0347362917 scopus 로고    scopus 로고
    • Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
    • PMID: 14687542
    • Seung HS. 2003. Learning in spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron 40:1063–1073. doi: 10.1016/S0896-6273(03)00761-X, PMID: 14687542.
    • (2003) Neuron , vol.40 , pp. 1063-1073
    • Seung, H.S.1
  • 76
    • 33748999594 scopus 로고    scopus 로고
    • Neural mechanism for stochastic behaviour during a competitive game
    • PMID: 17015181
    • Soltani A, Lee D, Wang XJ. 2006. Neural mechanism for stochastic behaviour during a competitive game. Neural Networks 19:1075–1090. doi: 10.1016/j.neunet.2006.05.044, PMID: 17015181.
    • (2006) Neural Networks , vol.19 , pp. 1075-1090
    • Soltani, A.1    Lee, D.2    Wang, X.J.3
  • 77
    • 73949142431 scopus 로고    scopus 로고
    • Synaptic computation underlying probabilistic inference
    • PMID: 20010823
    • Soltani A, Wang XJ. 2010. Synaptic computation underlying probabilistic inference. Nature Neuroscience 13: 112–119. doi: 10.1038/nn.2450, PMID: 20010823.
    • (2010) Nature Neuroscience , vol.13 , pp. 112-119
    • Soltani, A.1    Wang, X.J.2
  • 78
    • 84959494188 scopus 로고    scopus 로고
    • Training excitatory-inhibitory recurrent neural networks for cognitive tasks: A simple and flexible framework
    • PMID: 26928718
    • Song HF, Yang GR, Wang XJ. 2016. Training excitatory-inhibitory recurrent neural networks for cognitive tasks: A simple and flexible framework. PLoS Computational Biology 12:e1004792. doi: 10.1371/journal.pcbi.1004792, PMID: 26928718.
    • (2016) Plos Computational Biology , vol.12
    • Song, H.F.1    Yang, G.R.2    Wang, X.J.3
  • 79
    • 84928639374 scopus 로고    scopus 로고
    • What the orbitofrontal cortex does not do
    • PMID: 25919962
    • Stalnaker TA, Cooch NK, Schoenbaum G. 2015. What the orbitofrontal cortex does not do. Nature Neuroscience 18:620–627. doi: 10.1038/nn.3982, PMID: 25919962.
    • (2015) Nature Neuroscience , vol.18 , pp. 620-627
    • Stalnaker, T.A.1    Cooch, N.K.2    Schoenbaum, G.3
  • 80
    • 17844396920 scopus 로고    scopus 로고
    • Choosing the greater of two goods: Neural currencies for valuation and decision making
    • PMID: 15832198
    • Sugrue LP, Corrado GS, Newsome WT. 2005. Choosing the greater of two goods: neural currencies for valuation and decision making. Nature Reviews Neuroscience 6:363–375. doi: 10.1038/nrn1666, PMID: 15832198.
    • (2005) Nature Reviews Neuroscience , vol.6 , pp. 363-375
    • Sugrue, L.P.1    Corrado, G.S.2    Newsome, W.T.3
  • 81
    • 68949147577 scopus 로고    scopus 로고
    • Generating coherent patterns of activity from chaotic neural networks
    • PMID: 19709635
    • Sussillo D, Abbott LF. 2009. Generating coherent patterns of activity from chaotic neural networks. Neuron 63: 544–557. doi: 10.1016/j.neuron.2009.07.018, PMID: 19709635.
    • (2009) Neuron , vol.63 , pp. 544-557
    • Sussillo, D.1    Abbott, L.F.2
  • 82
    • 84877827546 scopus 로고    scopus 로고
    • Opening the black box: Low-dimensional dynamics in high-dimensional recurrent neural networks
    • PMID: 23272922
    • Sussillo D, Barak O. 2013. Opening the black box: low-dimensional dynamics in high-dimensional recurrent neural networks. Neural Computation 25:626–649. doi: 10.1162/NECO_a_00409, PMID: 23272922.
    • (2013) Neural Computation , vol.25 , pp. 626-649
    • Sussillo, D.1    Barak, O.2
  • 83
    • 84893503924 scopus 로고    scopus 로고
    • Neural circuits as computational dynamical systems
    • PMID: 24509098
    • Sussillo D. 2014. Neural circuits as computational dynamical systems. Current Opinion in Neurobiology 25:156–163. doi: 10.1016/j.conb.2014.01.008, PMID: 24509098.
    • (2014) Current Opinion in Neurobiology , vol.25 , pp. 156-163
    • Sussillo, D.1
  • 84
    • 84933280082 scopus 로고    scopus 로고
    • A neural network that finds a naturalistic solution for the production of muscle activity
    • PMID: 26075643
    • Sussillo D, Churchland MM, Kaufman MT, Shenoy KV. 2015. A neural network that finds a naturalistic solution for the production of muscle activity. Nature Neuroscience 18:1025–1033. doi: 10.1038/nn.4042, PMID: 26075643.
    • (2015) Nature Neuroscience , vol.18 , pp. 1025-1033
    • Sussillo, D.1    Churchland, M.M.2    Kaufman, M.T.3    Shenoy, K.V.4
  • 86
    • 84898939480 scopus 로고    scopus 로고
    • Policy gradient methods for reinforcement learning with function approximation
    • Sutton RS, Mcallester D, Singh S, Mansour Y. 2000. Policy gradient methods for reinforcement learning with function approximation. Advances in neural information processing systems 12:1057–1063 http://papers.nips.cc/paper/1713-policy-gradient-methods-for-reinforcement-learning-with-function-approximation.pdf.
    • (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
    • Sutton, R.S.1    McAllester, D.2    Singh, S.3    Mansour, Y.4
  • 87
    • 67651147037 scopus 로고    scopus 로고
    • Silencing the critics: Understanding the effects of cocaine sensitization on dorsolateral and ventral striatum in the context of an actor/critic model
    • PMID: 18982111
    • Takahashi Y, Schoenbaum G, Niv Y. 2008. Silencing the critics: understanding the effects of cocaine sensitization on dorsolateral and ventral striatum in the context of an actor/critic model. Frontiers in Neuroscience 2:86–99. doi: 10.3389/neuro.01.014.2008, PMID: 18982111.
    • (2008) Frontiers in Neuroscience , vol.2 , pp. 86-99
    • Takahashi, Y.1    Schoenbaum, G.2    Niv, Y.3
  • 90
    • 77549088095 scopus 로고    scopus 로고
    • Learning to use working memory in partially observable environments through dopaminergic reinforcement
    • Todd MT, Niv Y, Cohen JD. 2008. Learning to use working memory in partially observable environments through dopaminergic reinforcement. Advances in Neural Information Processing Systems. http://papers.nips.cc/paper/3508-learning-to-use-working-memory-in-partially-observable-environments-through-dopaminergic-reinforcement.pdf.
    • (2008) Advances in Neural Information Processing Systems
    • Todd, M.T.1    Niv, Y.2    Cohen, J.D.3
  • 91
    • 78651481149 scopus 로고    scopus 로고
    • Basal ganglia contributions to motor control: A vigorous tutor
    • PMID: 20850966
    • Turner RS, Desmurget M. 2010. Basal ganglia contributions to motor control: a vigorous tutor. Current Opinion in Neurobiology 20:704–716. doi: 10.1016/j.conb.2010.08.022, PMID: 20850966.
    • (2010) Current Opinion in Neurobiology , vol.20 , pp. 704-716
    • Turner, R.S.1    Desmurget, M.2
  • 92
    • 60749100305 scopus 로고    scopus 로고
    • Reinforcement learning in populations of spiking neurons
    • PMID: 19219040
    • Urbanczik R, Senn W. 2009. Reinforcement learning in populations of spiking neurons. Nature Neuroscience 12: 250–252. doi: 10.1038/nn.2264, PMID: 19219040.
    • (2009) Nature Neuroscience , vol.12 , pp. 250-252
    • Urbanczik, R.1    Senn, W.2
  • 93
    • 34547674638 scopus 로고    scopus 로고
    • Orbitofrontal cortex and its contribution to decision-making
    • PMID: 17417936
    • Wallis JD. 2007. Orbitofrontal cortex and its contribution to decision-making. Annual Review of Neuroscience 30:31–56. doi: 10.1146/annurev.neuro.30.051606.094334, PMID: 17417936.
    • (2007) Annual Review of Neuroscience , vol.30 , pp. 31-56
    • Wallis, J.D.1
  • 94
    • 0037028039 scopus 로고    scopus 로고
    • Probabilistic decision making by slow reverberation in cortical circuits
    • PMID: 12467598
    • Wang XJ. 2002. Probabilistic decision making by slow reverberation in cortical circuits. Neuron 36:955–968. doi: 10.1016/S0896-6273(02)01092-9, PMID: 12467598.
    • (2002) Neuron , vol.36 , pp. 955-968
    • Wang, X.J.1
  • 95
    • 53849125053 scopus 로고    scopus 로고
    • Decision making in recurrent neuronal circuits
    • PMID: 18957215
    • Wang XJ. 2008. Decision making in recurrent neuronal circuits. Neuron 60:215–234. doi: 10.1016/j.neuron.2008.09.034, PMID: 18957215.
    • (2008) Neuron , vol.60 , pp. 215-234
    • Wang, X.J.1
  • 96
    • 84939794227 scopus 로고    scopus 로고
    • Confidence estimation as a stochastic process in a neurodynamical system of decision making
    • PMID: 25948870
    • Wei Z, Wang XJ. 2015. Confidence estimation as a stochastic process in a neurodynamical system of decision making. Journal of Neurophysiology 114:99–113. doi: 10.1152/jn.00793.2014, PMID: 25948870.
    • (2015) Journal of Neurophysiology , vol.114 , pp. 99-113
    • Wei, Z.1    Wang, X.J.2
  • 98
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams RJ. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8:229–256. doi: 10.1007/BF00992696.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.J.1
  • 99
    • 32544439341 scopus 로고    scopus 로고
    • A recurrent network mechanism of time integration in perceptual decisions
    • PMID: 16436619
    • Wong KF, Wang XJ. 2006. A recurrent network mechanism of time integration in perceptual decisions. Journal of Neuroscience 26:1314–1328. doi: 10.1523/JNEUROSCI.3733-05.2006, PMID: 16436619.
    • (2006) Journal of Neuroscience , vol.26 , pp. 1314-1328
    • Wong, K.F.1    Wang, X.J.2
  • 101
    • 84902213589 scopus 로고    scopus 로고
    • Performance-optimized hierarchical models predict neural responses in higher visual cortex
    • PMID: 24812127
    • Yamins DL, Hong H, Cadieu CF, Solomon EA, Seibert D, DiCarlo JJ. 2014. Performance-optimized hierarchical models predict neural responses in higher visual cortex. PNAS 111:8619–8624. doi: 10.1073/pnas.1403112111, PMID: 24812127.
    • (2014) PNAS , vol.111 , pp. 8619-8624
    • Yamins, D.L.1    Hong, H.2    Cadieu, C.F.3    Solomon, E.A.4    Seibert, D.5    Dicarlo, J.J.6
  • 103
    • 0023877474 scopus 로고
    • A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons
    • PMID: 3344044
    • Zipser D, Andersen RA. 1988. A back-propagation programmed network that simulates response properties of a subset of posterior parietal neurons. Nature 331:679–684. doi: 10.1038/331679a0, PMID: 3344044
    • (1988) Nature , vol.331 , pp. 679-684
    • Zipser, D.1    Ersen, R.A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.