메뉴 건너뛰기




Volumn 24, Issue 5, 2014, Pages

Code-specific learning rules improve action selection by populations of spiking neurons

Author keywords

Action perturbation; Code specificity; Decision making; Policy gradient; Population coding; Reinforcement learning; Spike timing dependent synaptic plasticity; Spiking neural network

Indexed keywords

DECISION MAKING; NEURAL NETWORKS; NEURONS; POPULATION STATISTICS; REINFORCEMENT LEARNING;

EID: 84901857481     PISSN: 01290657     EISSN: None     Source Type: Journal    
DOI: 10.1142/S0129065714500026     Document Type: Article
Times cited : (35)

References (29)
  • 1
    • 0031012615 scopus 로고    scopus 로고
    • Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs
    • DOI 10.1126/science.275.5297.213
    • H. Markram, J. Lübke, M. Frotscher and B. Sakmann, Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs, Science 275 (1997) 213-215. (Pubitemid 27034762
    • (1997) Science , vol.275 , Issue.5297 , pp. 213-215
    • Markram, H.1    Lubke, J.2    Frotscher, M.3    Sakmann, B.4
  • 2
    • 0032535029 scopus 로고    scopus 로고
    • Synapticmodifications in cultured hippocampal neurons: Dependence on spike timing synaptic strength and postsynaptic cell type
    • G. Bi andM. Poo, Synapticmodifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength and postsynaptic cell type, J. Neurosci. 18(24) (1998) 10464-10472
    • (1998) J. Neurosci , vol.18 , Issue.24 , pp. 10464-10472
    • Bi, G.1    Poo, M.2
  • 3
    • 0028024043 scopus 로고
    • Noise neural codes and cortical organization
    • M. N. Shadlen and W. T. Newsome, Noise, neural codes and cortical organization, Curr. Opin. Neurobiol. 4 (1994) 569-579
    • (1994) Curr. Opin. Neurobiol , vol.4 , pp. 569-579
    • Shadlen, M.N.1    Newsome, W.T.2
  • 4
    • 84958376048 scopus 로고
    • Information encoding in short firing rate epochs by single neurons in the primate temporal visual cortex
    • M. J. Tovee and E. T. Rolls, Information encoding in short firing rate epochs by single neurons in the primate temporal visual cortex, Vis. Cognit. 2(1) (1995) 35-58
    • (1995) Vis. Cognit , vol.2 , Issue.1 , pp. 35-58
    • Tovee, M.J.1    Rolls, E.T.2
  • 5
    • 33746208993 scopus 로고    scopus 로고
    • Biophysical and phenomenological models of multiple spike interactions in spike-Timing dependent plasticity
    • M. Badoual, Q. Zou, A. P. Davison, M. Rudolph, T. Bal, Y. Fregnac and A. Destexhe, Biophysical and phenomenological models of multiple spike interactions in spike-Timing dependent plasticity, Int. J. Neural Syst. 16(2) (2006) 79-97
    • (2006) Int. J. Neural Syst , vol.16 , Issue.2 , pp. 79-97
    • Badoual, M.1    Zou, Q.2    Davison, A.P.3    Rudolph, M.4    Bal, T.5    Fregnac, Y.6    Destexhe, A.7
  • 7
    • 0034329021 scopus 로고    scopus 로고
    • Information processing with population codes
    • A. Pouget, R. S. Zemel and P. Dayan, Information processing with population codes, Nat. Rev. Neurosci. 1(2) (2000) 125-132
    • (2000) Nat. Rev. Neurosci , vol.1 , Issue.2 , pp. 125-132
    • Pouget, A.1    Zemel, R.S.2    Dayan, P.3
  • 8
    • 33745726849 scopus 로고    scopus 로고
    • Neural correlations population coding and computation
    • B. Averbeck, P. E. Latham and A. Pouget, Neural correlations, population coding and computation, Nat. Rev. Neurosci. 7 (2006) 358-366
    • (2006) Nat. Rev. Neurosci , vol.7 , pp. 358-366
    • Averbeck, B.1    Latham, P.E.2    Pouget, A.3
  • 9
    • 60749100305 scopus 로고    scopus 로고
    • Reinforcement learning in populations of spiking neurons
    • R. Urbanczik and W. Senn, Reinforcement learning in populations of spiking neurons, Nat. Neurosci. 12(3) (2009) 250-252
    • (2009) Nat. Neurosci , vol.12 , Issue.3 , pp. 250-252
    • Urbanczik, R.1    Senn, W.2
  • 10
    • 77955988359 scopus 로고    scopus 로고
    • Learning spike-based population codes by reward and population feedback
    • J. Friedrich, R. Urbanczik and W. Senn, Learning spike-based population codes by reward and population feedback, Neural Comput. 22(7) (2010) 1698-1717
    • (2010) Neural Comput , vol.22 , Issue.7 , pp. 1698-1717
    • Friedrich, J.1    Urbanczik, R.2    Senn, W.3
  • 11
    • 84866941777 scopus 로고    scopus 로고
    • Spike-based decision learning of Nash equilibria in two-player games
    • J. Friedrich and W. Senn, Spike-based decision learning of Nash equilibria in two-player games, PLoS. Comput. Biol. 8(9) (2012) e1002691
    • (2012) PLoS. Comput. Biol , vol.8 , Issue.9
    • Friedrich, J.1    Senn, W.2
  • 12
    • 78649588368 scopus 로고    scopus 로고
    • On the probabilistic optimization of spiking neural networks
    • S. Schliebs, N. Kasabov and M. Defoin-Platel, On the probabilistic optimization of spiking neural networks, Int. J. Neural. Syst. 20(6) (2010) 481-500
    • (2010) Int. J. Neural. Syst , vol.20 , Issue.6 , pp. 481-500
    • Schliebs, S.1    Kasabov, N.2    Defoin-Platel, M.3
  • 13
    • 84858740338 scopus 로고    scopus 로고
    • Codespecific policy gradient rules for spiking neurons
    • eds. Y. Bengio D. Schuurmans, J. Lafferty, C. K. I. Williams and A. Culotta
    • H. Sprekeler, G. Hennequin and W. Gerstner, Codespecific policy gradient rules for spiking neurons, in Advances in Neural Information Processing Systems 22, eds. Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I. Williams and A. Culotta (2009), pp. 1741-1749
    • (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1741-1749
    • Sprekeler, H.1    Hennequin, G.2    Gerstner, W.3
  • 14
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • R. J. Williams, Simple statistical gradient-following algorithms for connectionist reinforcement learning, Mach. Learn. 8 (1992) 229-256
    • (1992) Mach. Learn , vol.8 , pp. 229-256
    • Williams, R.J.1
  • 15
    • 33646801243 scopus 로고    scopus 로고
    • Optimal spike-Timing-dependent plasticity for precise action potential firing in supervised learning
    • DOI 10.1162/neco.2006.18.6.1318
    • J. Pfister, T. Toyoizumi, D. Barber and W. Gerstner, Optimal spike-Timing-dependent plasticity for precise action potential firing in supervised learning, Neural Comput. 18(6) (2006) 1318-1348. (Pubitemid 43765446
    • (2006) Neural Computation , vol.18 , Issue.6 , pp. 1318-1348
    • Pfister, J.-P.1    Toyoizumi, T.2    Barber, D.3    Gerstner, W.4
  • 16
    • 34249708388 scopus 로고    scopus 로고
    • Reinforcement learning through modulation of spike-Timing-dependent synaptic plasticity
    • R. V. Florian, Reinforcement learning through modulation of spike-Timing-dependent synaptic plasticity, Neural Comput. 19(6) (2007) 1468-1502
    • (2007) Neural Comput , vol.19 , Issue.6 , pp. 1468-1502
    • Florian, R.V.1
  • 17
    • 33746652644 scopus 로고    scopus 로고
    • Gradient learning in spiking neural networks by dynamic perturbation of conductances
    • I. R. Fiete and H. S. Seung, Gradient learning in spiking neural networks by dynamic perturbation of conductances, Phys. Rev. Lett. 97(4) (2006) 048104
    • (2006) Phys. Rev. Lett , vol.97 , Issue.4 , pp. 048104
    • Fiete, I.R.1    Seung, H.S.2
  • 18
    • 79959855306 scopus 로고    scopus 로고
    • Temporal difference based actor critic learning - convergence and neural implementation
    • eds. D. Koller, D. Schuurmans, Y. Bengio and L. Bottou
    • D. Di Castro, D. Volkinshtein and R. Meir, Temporal difference based actor critic learning-convergence and neural implementation, in Advances in Neural Information Processing Systems 21, eds. D. Koller, D. Schuurmans, Y. Bengio and L. Bottou (2008), pp. 385-392
    • (2008) Advances in Neural Information Processing Systems , vol.21 , pp. 385-392
    • Di Castro, D.1    Volkinshtein, D.2    Meir, R.3
  • 19
    • 84859496500 scopus 로고    scopus 로고
    • Gradient estimation in dendritic reinforcement learning
    • M. Schiess, R. Urbanczik and W. Senn, Gradient estimation in dendritic reinforcement learning, J. Math. Neurosci. 2(1) (2012) 2
    • (2012) J. Math. Neurosci , vol.2 , Issue.1 , pp. 2
    • Schiess, M.1    Urbanczik, R.2    Senn, W.3
  • 20
    • 37649027755 scopus 로고    scopus 로고
    • Learning in neural networks by reinforcement of irregular spiking
    • X. Xie and H. S. Seung, Learning in neural networks by reinforcement of irregular spiking, Phys. Rev. E 69(4) (2004) 041909
    • (2004) Phys. Rev. e , vol.69 , Issue.4 , pp. 041909
    • Xie, X.1    Seung, H.S.2
  • 21
    • 84864362117 scopus 로고    scopus 로고
    • SPAN: Spike pattern association neuron for learning spatio-Temporal spike patterns
    • A. Mohemmed, S. Schliebs, S. Matsuda and N. Kasabov, SPAN: Spike pattern association neuron for learning spatio-Temporal spike patterns, Int. J. Neural Syst. 22(4) (2012) 1250012
    • (2012) Int. J. Neural Syst , vol.22 , Issue.4 , pp. 1250012
    • Mohemmed, A.1    Schliebs, S.2    Matsuda, S.3    Kasabov, N.4
  • 22
    • 0347362917 scopus 로고    scopus 로고
    • Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
    • H. S. Seung, Learning in spiking neural networks by reinforcement of stochastic synaptic transmission, Neuron 40(6) (2003) 1063-1073
    • (2003) Neuron , vol.40 , Issue.6 , pp. 1063-1073
    • Seung, H.S.1
  • 23
    • 33746228228 scopus 로고    scopus 로고
    • The role of the basal ganglia in exploration in A neural model based on reinforcement learning
    • DOI 10.1142/S0129065706000548, PII S0129065706000548
    • D. Sridharan, P. S. Prashanth and V. S. Chakravarthy, The role of the basal ganglia in exploration in a neural model based on reinforcement learning, Int. J. Neural Syst. 16(2) (2006) 111-124. (Pubitemid 44099468
    • (2006) International Journal of Neural Systems , vol.16 , Issue.2 , pp. 111-124
    • Sridharan, D.1    Prashanth, P.S.2    Chakravarthy, V.S.3
  • 24
    • 0019957779 scopus 로고
    • Place navigation impaired in rats with hippocampal lesions
    • DOI 10.1038/297681a0
    • R. G. Morris, P. Garrud, J. N. Rawlins and J. O'Keefe, Place navigation impaired in rats with hippocampal lesions, Nature 297(5868) (1982) 681-683. (Pubitemid 12096189
    • (1982) Nature , vol.297 , Issue.5868 , pp. 681-683
    • Morris, R.G.M.1    Garrud, P.2    Rawlins, J.N.P.3    O'Keefe, J.4
  • 25
    • 74549209037 scopus 로고    scopus 로고
    • Spike-based reinforcement learning in continuous state and action space: When policy gradient methods fail
    • E. Vasilaki, N. Fr'emaux, R. Urbanczik, W. Senn and W. Gerstner, Spike-based reinforcement learning in continuous state and action space: When policy gradient methods fail, PLoS Comput. Biol. 5(12) (2009) e1000586
    • (2009) PLoS Comput. Biol , vol.5 , Issue.12
    • Vasilaki, E.1    Fr'Emaux, N.2    Urbanczik, R.3    Senn, W.4    Gerstner, W.5
  • 26
    • 79959853243 scopus 로고    scopus 로고
    • Spatiotemporal credit assignment in neuronal population learning
    • J. Friedrich, R. Urbanczik and W. Senn, Spatiotemporal credit assignment in neuronal population learning, PLoS Comput. Biol. 7(6) (2011) e1002092
    • (2011) PLoS Comput. Biol , vol.7 , Issue.6
    • Friedrich, J.1    Urbanczik, R.2    Senn, W.3
  • 27
    • 84876888983 scopus 로고    scopus 로고
    • Reinforcement learning using a continuous time actorcritic framework with spiking neurons
    • N. Fr'emaux, H. Sprekeler and W. Gerstner, Reinforcement learning using a continuous time actorcritic framework with spiking neurons, PLoS Comput. Biol. 9(4) (2013) e1003024
    • (2013) PLoS Comput. Biol , vol.9 , Issue.4
    • Fr'Emaux, N.1    Sprekeler, H.2    Gerstner, W.3
  • 28
    • 27144462270 scopus 로고    scopus 로고
    • Learning curves for stochastic gradient descent in linear feedforward networks
    • J. Werfel, X. Xie and H. S. Seung, Learning curves for stochastic gradient descent in linear feedforward networks, Neural Comput. 17(12) (2005) 2699-2718
    • (2005) Neural Comput , vol.17 , Issue.12 , pp. 2699-2718
    • Werfel, J.1    Xie, X.2    Seung, H.S.3
  • 29
    • 77649152514 scopus 로고    scopus 로고
    • Connectivity reflects coding: A model of voltagebased stdp with homeostasis
    • C. Clopath, L. Büsing, E. Vasilaki and W. Gerstner, Connectivity reflects coding: A model of voltagebased STDP with homeostasis, Nat. Neurosci. 13(3) (2010) 344-352.
    • (2010) Nat. Neurosci , vol.13 , Issue.3 , pp. 344-352
    • Clopath, C.1    Büsing, L.2    Vasilaki, E.3    Gerstner, W.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.