SCOPUS Information Retrieval Platform

Neural Computation

Volume 21, Issue 2, 2009, Pages 301-339

A spiking neural network model of an actor-critic learning agent

(3) Potjans, Wiebke a Morrison, Abigail a Diesmann, Markus a,b

a RIKEN BRAIN SCIENCE INSTITUTE (Japan)

b UNIVERSITY OF FREIBURG (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

ACTION POTENTIAL; ALGORITHM; ANIMAL; ARTICLE; ARTIFICIAL NEURAL NETWORK; BIOLOGICAL MODEL; COMPUTER SIMULATION; HUMAN; LEARNING; NERVE CELL; PHYSIOLOGY; REINFORCEMENT; TIME;

ACTION POTENTIALS; ALGORITHMS; ANIMALS; COMPUTER SIMULATION; HUMANS; LEARNING; MODELS, NEUROLOGICAL; NEURAL NETWORKS (COMPUTER); NEURONS; REINFORCEMENT (PSYCHOLOGY); TIME FACTORS;

EID: 67650298948 PISSN: 08997667 EISSN: 1530888X Source Type: Journal
DOI: 10.1162/neco.2008.08-07-593 Document Type: Article

Times cited : (72)

References (76)

1
- 0003888025
- Cambridge: Cambridge University Press
- Amit, D. J. (1989). Modeling brain function. Cambridge: Cambridge University Press.
- (1989) Modeling brain function
- Amit, D.J.¹

2
- 0037427024
- Activity-dependent presynaptic facilitation and Hebbian LTP are both required and interact during classical conditioning in Aplysia
- Antonov, I., Antonova, I., Kandel, E. R., & Hawkins, R. D. (2003). Activity-dependent presynaptic facilitation and Hebbian LTP are both required and interact during classical conditioning in Aplysia. Neuron, 37(1), 135-147.
- (2003) Neuron , vol.37 , Issue.1 , pp. 135-147
- Antonov, I.¹ Antonova, I.² Kandel, E.R.³ Hawkins, R.D.⁴

3
- 33845882205
- Separate neural substrates for skill-learning and performance in the ventral and dorsal striatum
- Atallah, H. E., Lopez-Paniagua, D., Rudy, J. W., & O'Reilly, R. C. (2007). Separate neural substrates for skill-learning and performance in the ventral and dorsal striatum. Nat. Neurosci., 10(1), 126-131.
- (2007) Nat. Neurosci. , vol.10 , Issue.1 , pp. 126-131
- Atallah, H.E.¹ Lopez-Paniagua, D.² Rudy, J.W.³ O'Reilly, R.C.⁴

4
- 34548049545
- Reinforcement learning, spike-time-dependent plasticity, and the BCM rule
- Baras, D., & Meir, R. (2007). Reinforcement learning, spike-time-dependent plasticity, and the BCM rule. Neural Comput., 19, 2245-2279.
- (2007) Neural Comput. , vol.19 , pp. 2245-2279
- Baras, D.¹ Meir, R.²

5
- 0000541213
- Adaptive critic and the basal ganglia
- In J. C. Houk, J. Davis, & D. Beiser (Eds.), Cambridge, MA: MIT Press
- Barto, A. G. (1995). Adaptive critic and the basal ganglia. In J. C. Houk, J. Davis, & D. Beiser (Eds.), Models of information processing in the basal ganglia (pp. 215-232). Cambridge, MA: MIT Press.
- (1995) Models of information processing in the basal ganglia , pp. 215-232
- Barto, A.G.¹

6
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- Barto, A., Sutton, R. S., & Anderson, C. W. (1983). Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans. Sys. M. Cybern., 13, 834-846.
- (1983) IEEE Trans. Sys. M. Cybern. , vol.13 , pp. 834-846
- Barto, A.¹ Sutton, R.S.² Anderson, C.W.³

7
- 0003487482
- Belmont, MA: Athena Scientific
- Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-dynamic programming. Belmont, MA: Athena Scientific.
- (1996) Neuro-dynamic programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

8
- 0032535029
- Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type
- Bi, G.-q., & Poo, M.-m. (1998). Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type. J. Neurosci., 18, 10464-10472.
- (1998) J. Neurosci. , vol.18 , pp. 10464-10472
- Bi, G.-q.¹ Poo, M.-m.²

9
- 0000133548
- Synaptic plasticity in rat hippocampal slice cultures: local "Hebbian" conjunction of pre- and postsynaptic stimulation leads to distributed synaptic enhancement
- Bonhoeffer, T., Staiger, V., & Aertsen, A. (1989). Synaptic plasticity in rat hippocampal slice cultures: local "Hebbian" conjunction of pre- and postsynaptic stimulation leads to distributed synaptic enhancement. Proc. Natl. Acad. Sci. USA, 86(20), 8113-8117.
- (1989) Proc. Natl. Acad. Sci. USA , vol.86 , Issue.20 , pp. 8113-8117
- Bonhoeffer, T.¹ Staiger, V.² Aertsen, A.³

10
- 0029759681
- Spread of synaptic depression mediated by presynaptic cytoplasmic signaling
- Cash, S., Zucker, R., & Poo, M.-m. (1996). Spread of synaptic depression mediated by presynaptic cytoplasmic signaling. Science, 272(5264), 998-1001.
- (1996) Science , vol.272 , Issue.5264 , pp. 998-1001
- Cash, S.¹ Zucker, R.² Poo, M.-m.³

11
- 0000430514
- The convergence of TD(λ) for general λ
- Dayan, P. (1992). The convergence of TD(λ) for general λ. Machine Learning, 8, 341-362.
- (1992) Machine Learning , vol.8 , pp. 341-362
- Dayan, P.¹

12
- 0028388685
- TD(λ) converges with probability 1
- Dayan, P., & Sejnowski, T. (1994). TD(λ) converges with probability 1. Machine Learning, 14, 295-301.
- (1994) Machine Learning , vol.14 , pp. 295-301
- Dayan, P.¹ Sejnowski, T.²

13
- 0034524427
- Complementary roles of basal ganglia and cerebellum in learning and motor control
- Doya, K. (2000a). Complementary roles of basal ganglia and cerebellum in learning and motor control. Curr. Opin. Neurobiol., 10, 732-739.
- (2000) Curr. Opin. Neurobiol. , vol.10 , pp. 732-739
- Doya, K.¹

14
- 0033629916
- Reinforcement learning in continuous time and space
- Doya, K. (2000b). Reinforcement learning in continuous time and space. Neural Comput., 12(1), 219-245.
- (2000) Neural Comput. , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

15
- 0036592023
- Metalearning and neuromodulation
- Doya, K. (2002). Metalearning and neuromodulation. Neural Networks, 15, 495-506.
- (2002) Neural Networks , vol.15 , pp. 495-506
- Doya, K.¹

16
- 37549060355
- Reinforcement learning with modulated spike timing-dependent synaptic plasticity
- Farries, M. A., & Fairhall, A. L. (2007). Reinforcement learning with modulated spike timing-dependent synaptic plasticity. J. Neurophysiol., 98, 3648-3665.
- (2007) J. Neurophysiol. , vol.98 , pp. 3648-3665
- Farries, M.A.¹ Fairhall, A.L.²

17
- 0030795297
- Propagation of activity-dependent synaptic depression in simple neural networks
- Fitzsimonds, R., Song, H.-j., & Poo, M.-m. (1997). Propagation of activity-dependent synaptic depression in simple neural networks. Nature, 388, 439-448.
- (1997) Nature , vol.388 , pp. 439-448
- Fitzsimonds, R.¹ Song, H.-j.² Poo, M.-m.³

18
- 34249708388
- Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity
- Florian, R. V. (2007). Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput., 19, 1468-1502.
- (2007) Neural Comput. , vol.19 , pp. 1468-1502
- Florian, R.V.¹

19
- 0033968832
- A model of hippocampally dependent navigation, using the temporal difference learning rule
- Foster, D. J., Morris, R. G. M., & Dayan, P. (2000). A model of hippocampally dependent navigation, using the temporal difference learning rule. Hippocampus, 10, 1-16.
- (2000) Hippocampus , vol.10 , pp. 1-16
- Foster, D.J.¹ Morris, R.G.M.² Dayan, P.³

20
- 0037187567
- Spike-timing-dependent synaptic modification induced by natural spike trains
- Froemke, R. C., & Dan, Y. (2002). Spike-timing-dependent synaptic modification induced by natural spike trains. Nature, 416(6879), 433-438.
- (2002) Nature , vol.416 , Issue.6879 , pp. 433-438
- Froemke, R.C.¹ Dan, Y.²

21
- 0032213183
- Phenomenological model of visually evoked spike trains in cat geniculate nonlagged X-cells
- Gazeres, N., Borg-Graham, L., & Frégnac, Y. (1998). Phenomenological model of visually evoked spike trains in cat geniculate nonlagged X-cells. Vis. Neurosci., 15, 1157-1174.
- (1998) Vis. Neurosci. , vol.15 , pp. 1157-1174
- Gazeres, N.¹ Borg-Graham, L.² Frégnac, Y.³

22
- 0020401276
- On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex
- Georgopoulos, A., Kalaska, J. F., Caminiti, R., & Massey, J. T. (1982). On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex. J. Neurosci., 2(11), 1527-1537.
- (1982) J. Neurosci. , vol.2 , Issue.11 , pp. 1527-1537
- Georgopoulos, A.¹ Kalaska, J.F.² Caminiti, R.³ Massey, J.T.⁴

23
- 43949092150
- NEST (neural simulation tool)
- Gewaltig, M.-O., & Diesmann, M. (2007). NEST (neural simulation tool). Scholarpedia, 2(4), 1430.
- (2007) Scholarpedia , vol.2 , Issue.4 , pp. 1430
- Gewaltig, M.-O.¹ Diesmann, M.²

24
- 33750590110
- Programmable logic construction kits for hyper real-time neuronal modeling
- Guerrero-Rivera, R., Morrison, A., Diesmann, M., & Pearce, T. C. (2006). Programmable logic construction kits for hyper real-time neuronal modeling. Neural Comput., 18, 2651-2679.
- (2006) Neural Comput. , vol.18 , pp. 2651-2679
- Guerrero-Rivera, R.¹ Morrison, A.² Diesmann, M.³ Pearce, T.C.⁴

25
- 4344560353
- Modeling compositionality by dynamic binding of synfire chains
- Hayon, G., Abeles, M., & Lehmann, D. (2004). Modeling compositionality by dynamic binding of synfire chains. J. Comput. Neurosci., 17, 179-201.
- (2004) J. Comput. Neurosci. , vol.17 , pp. 179-201
- Hayon, G.¹ Abeles, M.² Lehmann, D.³

26
- 0007010202
- Cambridge, MA: MIT Press
- Houk, J. C., Adams, J. L., & Barto, A. G. (1995). A model of how the basal ganglia generate and use neural signals that predict reinforcement. Cambridge, MA: MIT Press.
- (1995) A model of how the basal ganglia generate and use neural signals that predict reinforcement
- Houk, J.C.¹ Adams, J.L.² Barto, A.G.³

27
- 34948906745
- Solving the distal reward problem through linkage of STDP and dopamine signaling
- Izhikevich, E. M. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb. Cortex, 17(10), 2443-2452.
- (2007) Cereb. Cortex , vol.17 , Issue.10 , pp. 2443-2452
- Izhikevich, E.M.¹

28
- 0036592026
- Actor-critic models of the basal ganglia: New anatomical and computational perspectives
- Joel, D., Niv, Y., & Ruppin, E. (2002). Actor-critic models of the basal ganglia: New anatomical and computational perspectives. Neural Networks, 15, 535-547.
- (2002) Neural Networks , vol.15 , pp. 535-547
- Joel, D.¹ Niv, Y.² Ruppin, E.³

29
- 0035957333
- Formation of temporal-feature maps by axonal propagation of synaptic learning
- Kempter, R., Leibold, C., Wagner, H., & van Hemmen, J. (2001). Formation of temporal-feature maps by axonal propagation of synaptic learning. Proc. Natl. Acad. Sci. USA, 98(7), 4166-4171.
- (2001) Proc. Natl. Acad. Sci. USA , vol.98 , Issue.7 , pp. 4166-4171
- Kempter, R.¹ Leibold, C.² Wagner, H.³ van Hemmen, J.⁴

30
- 0042276164
- A drive-reinforcement model of single neuron function
- In J. Denker (Ed.), New York: American Institute of Physics
- Klopf, A. (1986). A drive-reinforcement model of single neuron function. In J. Denker (Ed.), Neural networks for computing: AIP Conference Proceedings (Vol. 151, pp. 265-270). New York: American Institute of Physics.
- (1986) Neural networks for computing: AIP Conference Proceedings , vol.151 , pp. 265-270
- Klopf, A.¹

31
- 0023878618
- A neuronal model of classical conditioning
- Klopf, A. (1988). A neuronal model of classical conditioning. Psychobiology, 16, 85-125.
- (1988) Psychobiology , vol.16 , pp. 85-125
- Klopf, A.¹

32
- 4043069840
- On actor-critic algorithms
- Konda, V., & Tsitsiklis, J. (2003). On actor-critic algorithms. SIAM Journal on Control and Optimization, 42(4), 1143-1166.
- (2003) SIAM Journal on Control and Optimization , vol.42 , Issue.4 , pp. 1143-1166
- Konda, V.¹ Tsitsiklis, J.²

33
- 0042276165
- Differential Hebbian learning
- In J. Denker (Ed.), New York: American Institute of Physics
- Kosko, B. (1986). Differential Hebbian learning. In J. Denker (Ed.), Neural networks for computing: AIP Conference Proceedings (Vol. 151, pp. 277-288). New York: American Institute of Physics.
- (1986) Neural networks for Computing: AIP Conference Proceedings , vol.151 , pp. 277-288
- Kosko, B.¹

34
- 0025697397
- Non-Hebbian synapses in rat visual cortex
- Kossel, A., Bonhoeffer, T., & Bolz, J. (1990). Non-Hebbian synapses in rat visual cortex. NeuroReport, 1(2), 115-118.
- (1990) NeuroReport , vol.1 , Issue.2 , pp. 115-118
- Kossel, A.¹ Bonhoeffer, T.² Bolz, J.³

35
- 0035842180
- Temporal map formation in the barn owl's brain
- Leibold, C., Kempter, R., & van Hemmen, J. (2001). Temporal map formation in the barn owl's brain. Phys. Rev. Lett., 87(24), 248101.
- (2001) Phys. Rev. Lett. , vol.87 , Issue.24 , pp. 248101
- Leibold, C.¹ Kempter, R.² van Hemmen, J.³

36
- 0039484222
- Mapping time
- Leibold, C., & van Hemmen, J. (2002). Mapping time. Biol. Cybern., 87, 428-439.
- (2002) Biol. Cybern. , vol.87 , pp. 428-439
- Leibold, C.¹ van Hemmen, J.²

37
- 33845904391
- Knowing without doing
- Lerchner, A., La Camera, G., & Richmond, B. (2007). Knowing without doing. Nat. Neurosci., 10(1), 15-17.
- (2007) Nat. Neurosci. , vol.10 , Issue.1 , pp. 15-17
- Lerchner, A.¹ La Camera, G.² Richmond, B.³

38
- 0031012615
- Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs
- Markram, H., Lübke, J., Frotscher, M., & Sakmann, B. (1997). Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs. Science, 275, 213-215.
- (1997) Science , vol.275 , pp. 213-215
- Markram, H.¹ Lübke, J.² Frotscher, M.³ Sakmann, B.⁴

39
- 0028972278
- Bee foraging in uncertain environments using predictive Hebbian learning
- Montague, P., Dayan, P., Person, C., & Sejnowski, T. (1995). Bee foraging in uncertain environments using predictive Hebbian learning. Nature, 377, 725-728.
- (1995) Nature , vol.377 , pp. 725-728
- Montague, P.¹ Dayan, P.² Person, C.³ Sejnowski, T.⁴

40
- 0029981543
- A framework for mesencephalic dopamine systems based on predictive Hebbian learning
- Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci., 16(5), 1936-1947.
- (1996) J. Neurosci. , vol.16 , Issue.5 , pp. 1936-1947
- Montague, P.R.¹ Dayan, P.² Sejnowski, T.J.³

41
- 0035979437
- Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
- Morimoto, J., & Doya, K. (2001). Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning. Robotics and Autonomous Systems, 36, 37-51.
- (2001) Robotics and Autonomous Systems , vol.36 , pp. 37-51
- Morimoto, J.¹ Doya, K.²

42
- 33747585633
- Midbrain dopamine neurons encode decisions for future action
- Morris, G., Nevet, A., Arkadir, D., Vaadia, E., & Bergman, H. (2006). Midbrain dopamine neurons encode decisions for future action. Nat. Neurosci., 9(8), 1057-1063.
- (2006) Nat. Neurosci. , vol.9 , Issue.8 , pp. 1057-1063
- Morris, G.¹ Nevet, A.² Arkadir, D.³ Vaadia, E.⁴ Bergman, H.⁵

43
- 43949102027
- Phenomenological models of synaptic plasticity based on spike timing
- Morrison, A., Diesmann, M., & Gerstner, W. (2008). Phenomenological models of synaptic plasticity based on spike timing. Biol. Cybern., 98, 459-478.
- (2008) Biol. Cybern. , vol.98 , pp. 459-478
- Morrison, A.¹ Diesmann, M.² Gerstner, W.³

44
- 36248947984
- Spike-frequency adapting neural assemblies: Beyond mean adaptation and renewal theories
- Muller, E., Buesing, L., Schemmel, J., & Meier, K. (2007). Spike-frequency adapting neural assemblies: Beyond mean adaptation and renewal theories. Neural Comput., 19, 2958-3010.
- (2007) Neural Comput. , vol.19 , pp. 2958-3010
- Muller, E.¹ Buesing, L.² Schemmel, J.³ Meier, K.⁴

45
- 33646399442
- Policy gradient in continuous time
- Munos, R. (2006). Policy gradient in continuous time. Journal of Machine Learning Research, 7, 771-791.
- (2006) Journal of Machine Learning Research , vol.7 , pp. 771-791
- Munos, R.¹

46
- 0036972336
- Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors
- Niv, Y., Joel, D., Meilijson, I., & Ruppin, E. (2002). Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors. Adaptive Behavior, 10(1), 5-24.
- (2002) Adaptive Behavior , vol.10 , Issue.1 , pp. 5-24
- Niv, Y.¹ Joel, D.² Meilijson, I.³ Ruppin, E.⁴

47
- 0037987978
- Temporal difference models and reward-related learning in the human brain
- O'Doherty, J. P., Dayan, P., Friston, K., Critchley, H., & Dolan, R. J. (2003). Temporal difference models and reward-related learning in the human brain. Neuron, 38, 329-337.
- (2003) Neuron , vol.38 , pp. 329-337
- O'Doherty, J.P.¹ Dayan, P.² Friston, K.³ Critchley, H.⁴ Dolan, R.J.⁵

48
- 1942520195
- Dissociable roles of ventral and dorsal striatum in instrumental conditioning
- O'Doherty, J., Dayan, P., Schultz, J., Deichmann, R., Friston, K., & Dolan, R. J. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304, 452-454.
- (2004) Science , vol.304 , pp. 452-454
- O'Doherty, J.¹ Dayan, P.² Schultz, J.³ Deichmann, R.⁴ Friston, K.⁵ Dolan, R.J.⁶

49
- 33748302924
- Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans
- Pessiglione, M., Seymour, B., Flandin, G., Dolan, R., & Frith, C. (2006). Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature, 442, 1042-1045.
- (2006) Nature , vol.442 , pp. 1042-1045
- Pessiglione, M.¹ Seymour, B.² Flandin, G.³ Dolan, R.⁴ Frith, C.⁵

50
- 33748898872
- Triplets of spikes in a model of spike timing-dependent plasticity
- Pfister, J.-P., & Gerstner, W. (2006). Triplets of spikes in a model of spike timing-dependent plasticity. J. Neurosci., 26, 9673-9682.
- (2006) J. Neurosci. , vol.26 , pp. 9673-9682
- Pfister, J.-P.¹ Gerstner, W.²

51
- 38049169348
- Interconnecting VLSI spiking neural networks using isochronous connections
- In Berlin: Springer
- Philipp, S., Grübl, A., Meier, K., & Schemmel, J. (2007). Interconnecting VLSI spiking neural networks using isochronous connections. In Proceedings of IWANN2007 (pp. 471-478). Berlin: Springer.
- (2007) Proceedings of IWANN2007 , pp. 471-478
- Philipp, S.¹ Grübl, A.² Meier, K.³ Schemmel, J.⁴

52
- 0037686661
- Isotropic sequence order learning
- Porr, B., & Wörgötter, F. (2003). Isotropic sequence order learning. Neural Comput., 15, 831-864.
- (2003) Neural Comput. , vol.15 , pp. 831-864
- Porr, B.¹ Wörgötter, F.²

53
- 35549002871
- Learning with relevance: Using a third factor to stabilize Hebbian learning
- Porr, B., & Wörgötter, F. (2007). Learning with relevance: Using a third factor to stabilize Hebbian learning. Neural Comput., 19(10), 2694-2719.
- (2007) Neural Comput. , vol.19 , Issue.10 , pp. 2694-2719
- Porr, B.¹ Wörgötter, F.²

54
- 67650299964
- Reinforcement learning in an actor-critic spiking network model
- Potjans, W., Morrison, A., & Diesmann, M. (2007a). Reinforcement learning in an actor-critic spiking network model. Neuroforum, 8(1).
- (2007) Neuroforum , vol.8 , Issue.1
- Potjans, W.¹ Morrison, A.² Diesmann, M.³

55
- 85036811038
- A spiking neural network model for the actor-critic temporal-difference learning algorithm
- In San Diego, CA: Society for Neuroscience
- Potjans, W., Morrison, A., & Diesmann, M. (2007b). A spiking neural network model for the actor-critic temporal-difference learning algorithm. In Proceedings of the 37th SFN Meeting. San Diego, CA: Society for Neuroscience.
- (2007) Proceedings of the 37th SFN Meeting
- Potjans, W.¹ Morrison, A.² Diesmann, M.³

56
- 0035489925
- Spike-timing-dependent Hebbian plasticity as temporal difference learning
- Rao, R. P. N., & Sejnowski, T. J. (2001). Spike-timing-dependent Hebbian plasticity as temporal difference learning. Neural Comput., 13, 2221-2237.
- (2001) Neural Comput. , vol.13 , pp. 2221-2237
- Rao, R.P.N.¹ Sejnowski, T.J.²

57
- 0036592025
- Dopamine-dependent plasticity of corticostriatal synapses
- Reynolds, J. N., & Wickens, J. R. (2002). Dopamine-dependent plasticity of corticostriatal synapses. Neural Networks, 15, 507-521.
- (2002) Neural Networks , vol.15 , pp. 507-521
- Reynolds, J.N.¹ Wickens, J.R.²

58
- 0032696609
- Computational consequences of temporally asymmetric learning rules: I. Differential Hebbian learning
- Roberts, P. D. (1999). Computational consequences of temporally asymmetric learning rules: I. Differential Hebbian learning. J. Comput. Neurosci., 7, 235-246.
- (1999) J. Comput. Neurosci. , vol.7 , pp. 235-246
- Roberts, P.D.¹

59
- 0037057755
- Getting formal with dopamine and reward
- Schultz, W. (2002). Getting formal with dopamine and reward. Neuron, 36, 241-263.
- (2002) Neuron , vol.36 , pp. 241-263
- Schultz, W.¹

60
- 0030896968
- A neural substrate of prediction and reward
- Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1599.
- (1997) Science , vol.275 , pp. 1593-1599
- Schultz, W.¹ Dayan, P.² Montague, P.R.³

61
- 0028181521
- Locally distributed synaptic potentiation in the hippocampus
- Schuman, E., & Madison, D. (1994). Locally distributed synaptic potentiation in the hippocampus. Science, 263, 532-536.
- (1994) Science , vol.263 , pp. 532-536
- Schuman, E.¹ Madison, D.²

62
- 0347362917
- Learning spiking neural networks by reinforcement of stochastic synaptic transmission
- Seung, H. S. (2003). Learning spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron, 40, 1063-1073.
- (2003) Neuron , vol.40 , pp. 1063-1073
- Seung, H.S.¹

63
- 2942617032
- Temporal difference models describe higher-order learning in humans
- Seymour, B., O'Doherty, J., Dayan, P., Koltzenburg, M., Jones, A., Dolan, R., et al. (2004). Temporal difference models describe higher-order learning in humans. Nature, 429, 664-667.
- (2004) Nature , vol.429 , pp. 664-667
- Seymour, B.¹ O'Doherty, J.² Dayan, P.³ Koltzenburg, M.⁴ Jones, A.⁵ Dolan, R.⁶

64
- 0032930935
- A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task
- Suri, R., & Schultz, W. (1999). A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task. Neuroscience, 91(3), 871-890.
- (1999) Neuroscience , vol.91 , Issue.3 , pp. 871-890
- Suri, R.¹ Schultz, W.²

65
- 0035315989
- Temporal difference model reproduces anticipatory neural activity
- Suri, R. E., & Schultz, W. (2001). Temporal difference model reproduces anticipatory neural activity. Neural Comput., 13, 841-862.
- (2001) Neural Comput. , vol.13 , pp. 841-862
- Suri, R.E.¹ Schultz, W.²

66
- 33847202724
- Learning to predict by methods of temporal difference
- Sutton, R. (1988). Learning to predict by methods of temporal difference. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.¹

67
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

68
- 0034192399
- Selective presynaptic propagation of long-term potentiation in defined neural networks
- Tao, H.-z. W., Zhang, L. I., Bi, G.-q., & Poo, M.-m. (2000). Selective presynaptic propagation of long-term potentiation in defined neural networks. J. Neurosci., 20(9), 3233-3243.
- (2000) J. Neurosci. , vol.20 , Issue.9 , pp. 3233-3243
- Tao, H.-z.¹ Zhang, L.I.² Bi, G.-q.³ Poo, M.-m.⁴

69
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Comput., 6(2), 215-219.
- (1994) Neural Comput. , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

70
- 1642534402
- Modulation of caudate activity by action contingency
- Tricomi, E. M., Delgado, M. R., & Fiez, J. A. (2004). Modulation of caudate activity by action contingency. Neuron, 41, 281-292.
- (2004) Neuron , vol.41 , pp. 281-292
- Tricomi, E.M.¹ Delgado, M.R.² Fiez, J.A.³

71
- 11144349546
- Spike times make sense
- VanRullen, R., Guyonneau, R., & Thorpe, S. J. (2005). Spike times make sense. TINS, 28(1), 1-4.
- (2005) TINS , vol.28 , Issue.1 , pp. 1-4
- VanRullen, R.¹ Guyonneau, R.² Thorpe, S.J.³

72
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams, R. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8, 229-256.
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.¹

73
- 0017524329
- An adaptive optimal controller for discrete-time Markov environments
- Witten, I. H. (1977). An adaptive optimal controller for discrete-time Markov environments. Information and Control, 34, 286-295.
- (1977) Information and Control , vol.34 , pp. 286-295
- Witten, I.H.¹

74
- 13244267004
- Temporal sequence learning, prediction, and control: A review of different models and their relation to biological mechanisms
- Wörgötter, F., & Porr, B. (2005). Temporal sequence learning, prediction, and control: A review of different models and their relation to biological mechanisms. Neural Comput., 17, 245-319.
- (2005) Neural Comput. , vol.17 , pp. 245-319
- Wörgötter, F.¹ Porr, B.²

75
- 37649027755
- Learning in neural networks by reinforcement of irregular spiking
- Xie, X., & Seung, H. S. (2004). Learning in neural networks by reinforcement of irregular spiking. Phys. Rev. E, 69, 041909.
- (2004) Phys. Rev. E , vol.69 , pp. 041909
- Xie, X.¹ Seung, H.S.²

76
- 0032480332
- A critical window for cooperation and competition among developing retinotectal synapses
- Zhang, L. I., Tao, H. W., Holt, C. E., Harris, W. A., & Poo, M.-m. (1998). A critical window for cooperation and competition among developing retinotectal synapses. Nature, 395, 37-44.
- (1998) Nature , vol.395 , pp. 37-44
- Zhang, L.I.¹ Tao, H.W.² Holt, C.E.³ Harris, W.A.⁴ Poo, M.-m.⁵

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.