메뉴 건너뛰기




Volumn 21, Issue 2, 2009, Pages 301-339

A spiking neural network model of an actor-critic learning agent

Author keywords

[No Author keywords available]

Indexed keywords

ACTION POTENTIAL; ALGORITHM; ANIMAL; ARTICLE; ARTIFICIAL NEURAL NETWORK; BIOLOGICAL MODEL; COMPUTER SIMULATION; HUMAN; LEARNING; NERVE CELL; PHYSIOLOGY; REINFORCEMENT; TIME;

EID: 67650298948     PISSN: 08997667     EISSN: 1530888X     Source Type: Journal    
DOI: 10.1162/neco.2008.08-07-593     Document Type: Article
Times cited : (72)

References (76)
  • 1
    • 0003888025 scopus 로고
    • Cambridge: Cambridge University Press
    • Amit, D. J. (1989). Modeling brain function. Cambridge: Cambridge University Press.
    • (1989) Modeling brain function
    • Amit, D.J.1
  • 2
    • 0037427024 scopus 로고    scopus 로고
    • Activity-dependent presynaptic facilitation and Hebbian LTP are both required and interact during classical conditioning in Aplysia
    • Antonov, I., Antonova, I., Kandel, E. R., & Hawkins, R. D. (2003). Activity-dependent presynaptic facilitation and Hebbian LTP are both required and interact during classical conditioning in Aplysia. Neuron, 37(1), 135-147.
    • (2003) Neuron , vol.37 , Issue.1 , pp. 135-147
    • Antonov, I.1    Antonova, I.2    Kandel, E.R.3    Hawkins, R.D.4
  • 3
    • 33845882205 scopus 로고    scopus 로고
    • Separate neural substrates for skill-learning and performance in the ventral and dorsal striatum
    • Attalah, H. E., Lopez-Paniagua, D., Rudy, J. W., & O'Reilly, R. C. (2007). Separate neural substrates for skill-learning and performance in the ventral and dorsal striatum. Nat. Neurosci., 10(1), 126-131.
    • (2007) Nat. Neurosci. , vol.10 , Issue.1 , pp. 126-131
    • Attalah, H.E.1    Lopez-Paniagua, D.2    Rudy, J.W.3    O'Reilly, R.C.4
  • 4
    • 34548049545 scopus 로고    scopus 로고
    • Reinforcement learning, spike-time-dependent plasticity, and the BCM rule
    • Baras, D., & Meir, R. (2007). Reinforcement learning, spike-time-dependent plasticity, and the BCM rule. Neural Comput., 19, 2245-2279.
    • (2007) Neural Comput. , vol.19 , pp. 2245-2279
    • Baras, D.1    Meir, R.2
  • 5
    • 0000541213 scopus 로고
    • Adaptive critic and the basal ganglia
    • In J. C. Houk, J. Davis, & D. Beisser (Eds.), Cambridge, MA: MIT Press
    • Barto, A. G. (1995). Adaptive critic and the basal ganglia. In J. C. Houk, J. Davis, & D. Beisser (Eds.), Models of information processing in the basal ganglia (pp. 215-232). Cambridge, MA: MIT Press.
    • (1995) Models of information processing in the basal ganglia , pp. 215-232
    • Barto, A.G.1
  • 6
    • 0020970738 scopus 로고
    • Neuronlike adaptive elements that can solve difficult learning control problems
    • Barto, A., Sutton, R. S., & Anderson, C. W. (1983). Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Trans. Sys. M. Cybern., 13, 834-846.
    • (1983) IEEE Trans. Sys. M. Cybern. , vol.13 , pp. 834-846
    • Barto, A.1    Sutton, R.S.2    Anderson, C.W.3
  • 8
    • 0032535029 scopus 로고    scopus 로고
    • Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type
    • Bi, G.-q., & Poo, M.-m. (1998). Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type. J. Neurosci., 18, 10464-10472.
    • (1998) J. Neurosci. , vol.18 , pp. 10464-10472
    • Bi, G.-q.1    Poo, M.-m.2
  • 9
    • 0000133548 scopus 로고
    • Synaptic plasticity in rat hippocampal slice cultures: local "Hebbian"conjunction of pre- and postsynaptic stimulation leads to distributed synaptic enhancement
    • Bonhoeffer, T., Staiger, V., & Aertsen, A. (1989). Synaptic plasticity in rat hippocampal slice cultures: local "Hebbian" conjunction of pre- and postsynaptic stimulation leads to distributed synaptic enhancement. Proc. Natl. Acad. Sci. USA, 86(20), 8113-8117.
    • (1989) Proc. Natl. Acad. Sci. USA , vol.86 , Issue.20 , pp. 8113-8117
    • Bonhoeffer, T.1    Staiger, V.2    Aertsen, A.3
  • 10
    • 0029759681 scopus 로고    scopus 로고
    • Spread of synaptic depression mediated by presynaptic cytoplasmic signaling
    • Cash, S., Zucker, R., & Poo, M.-m. (1996). Spread of synaptic depression mediated by presynaptic cytoplasmic signaling. Science, 272(5264), 998-1001.
    • (1996) Science , vol.272 , Issue.5264 , pp. 998-1001
    • Cash, S.1    Zucker, R.2    Poo, M.-m.3
  • 11
    • 0000430514 scopus 로고
    • The convergence of TD(λ) for general λ
    • Dayan, P. (1992). The convergence of TD(λ) for general λ. Machine Learning, 8, 341- 362.
    • (1992) Machine Learning , vol.8 , pp. 341-362
    • Dayan, P.1
  • 12
    • 0028388685 scopus 로고
    • TD(λ) converges with probability 1
    • Dayan, P., & Sejnowski, T. (1994). TD(λ) converges with probability 1. Machine Learning, 14, 295-301.
    • (1994) Machine Learning , vol.14 , pp. 295-301
    • Dayan, P.1    Sejnowski, T.2
  • 13
    • 0034524427 scopus 로고    scopus 로고
    • Complementary roles of basal ganglia and cerebellum in learning and motor control
    • Doya, K. (2000a). Complementary roles of basal ganglia and cerebellum in learning and motor control. Curr. Opin. Neurobiol., 10, 732-739.
    • (2000) Curr. Opin. Neurobiol. , vol.10 , pp. 732-739
    • Doya, K.1
  • 14
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • Doya, K. (2000b). Reinforcement learning in continuous time and space. Neural Comput., 12(1), 219-245.
    • (2000) Neural Comput. , vol.12 , Issue.1 , pp. 219-245
    • Doya, K.1
  • 15
    • 0036592023 scopus 로고    scopus 로고
    • Metalearning and neuromodulation
    • Doya, K. (2002). Metalearning and neuromodulation. Neural Networks, 15, 495-506.
    • (2002) Neural Networks , vol.15 , pp. 495-506
    • Doya, K.1
  • 16
    • 37549060355 scopus 로고    scopus 로고
    • Reinforcement learning with modulated spike timing-dependent synaptic plasticity
    • Farries,M. A., & Fairhall, A. L. (2007). Reinforcement learning with modulated spike timing-dependent synaptic plasticity. J. Neurophysiol., 98, 3648-3665.
    • (2007) J. Neurophysiol. , vol.98 , pp. 3648-3665
    • Farries, M.A.1    Fairhall, A.L.2
  • 17
    • 0030795297 scopus 로고    scopus 로고
    • Propagation of activity-dependent synaptic depression in simple neural networks
    • Fitzsimonds, R., Song, H.-j., & Poo, M.-m. (1997). Propagation of activity-dependent synaptic depression in simple neural networks. Nature, 388, 439-448.
    • (1997) Nature , vol.388 , pp. 439-448
    • Fitzsimonds, R.1    Song, H.-j.2    Poo, M.-m.3
  • 18
    • 34249708388 scopus 로고    scopus 로고
    • Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity
    • Florian, R. V. (2007). Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural Comput., 19, 1468-1502.
    • (2007) Neural Comput , vol.19 , pp. 1468-1502
    • Florian, R.V.1
  • 19
    • 0033968832 scopus 로고    scopus 로고
    • A model of hippocampally dependent navigation, using the temporal difference learning rule
    • Foster, D. J., Morris, R. G. M., & Dayan, P. (2000). A model of hippocampally dependent navigation, using the temporal difference learning rule. Hippocampus, 10, 1-16.
    • (2000) Hippocampus , vol.10 , pp. 1-16
    • Foster, D.J.1    Morris, R.G.M.2    Dayan, P.3
  • 20
    • 0037187567 scopus 로고    scopus 로고
    • Spike-timing-dependent synaptic modification induced by natural spike trains
    • Froemke, R. C., & Dan, Y. (2002). Spike-timing-dependent synaptic modification induced by natural spike trains. Nature, 416(6879), 433-438.
    • (2002) Nature , vol.416 , Issue.6879 , pp. 433-438
    • Froemke, R.C.1    Dan, Y.2
  • 21
    • 0032213183 scopus 로고    scopus 로고
    • Phenomenological model of visually evoked spike trains in cat geniculate nonlagged X-cells
    • Gazeres, N., Borg-Graham, L., & Frégnac, Y. (1998). Phenomenological model of visually evoked spike trains in cat geniculate nonlagged X-cells. Vis. Neurosci., 15, 1157-1174.
    • (1998) Vis. Neurosci. , vol.15 , pp. 1157-1174
    • Gazeres, N.1    Borg-Graham, L.2    Frégnac, Y.3
  • 22
    • 0020401276 scopus 로고
    • On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex
    • Georgopoulos, A., Kalaska, J. F., Caminiti, R., & Massey, J. T. (1982). On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex. J. Neurosci., 11(2), 1527-1537.
    • (1982) J. Neurosci. , vol.11 , Issue.2 , pp. 1527-1537
    • Georgopoulos, A.1    Kalaska, J.F.2    Caminiti, R.3    Massey, J.T.4
  • 23
    • 43949092150 scopus 로고    scopus 로고
    • NEST (neural simulation tool)
    • Gewaltig, M.-O., & Diesmann, M. (2007). NEST (neural simulation tool). Scholarpedia, 2(4), 1430.
    • (2007) Scholarpedia , vol.2 , Issue.4 , pp. 1430
    • Gewaltig, M.-O.1    Diesmann, M.2
  • 24
    • 33750590110 scopus 로고    scopus 로고
    • Programmable logic construction kits for hyper real-time neuronal modeling
    • Guerrero-Rivera, R., Morrison, A., Diesmann, M., & Pearce, T. C. (2006). Programmable logic construction kits for hyper real-time neuronal modeling. Neural Comput., 18, 2651-2679.
    • (2006) Neural Comput. , vol.18 , pp. 2651-2679
    • Guerrero-Rivera, R.1    Morrison, A.2    Diesmann, M.3    Pearce, T.C.4
  • 25
    • 4344560353 scopus 로고    scopus 로고
    • Modeling compositionality by dynamic binding of synfire chains
    • Hayon, G., Abeles, M., & Lehmann, D. (2004). Modeling compositionality by dynamic binding of synfire chains. J. Comput. Neurosci., 17, 179-201.
    • (2004) J. Comput. Neurosci. , vol.17 , pp. 179-201
    • Hayon, G.1    Abeles, M.2    Lehmann, D.3
  • 27
    • 34948906745 scopus 로고    scopus 로고
    • Solving the distal reward problem through linkage of STDP and dopamine signaling
    • Izhikevich, E.M. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cereb. Cortex, 17(10), 2443-2452.
    • (2007) Cereb. Cortex , vol.17 , Issue.10 , pp. 2443-2452
    • Izhikevich, E.M.1
  • 28
    • 0036592026 scopus 로고    scopus 로고
    • Actor-critic models of the basal ganglia: New anatomical and computational perspectives
    • Joel, D., Niv, J., & Ruppin, E. (2002). Actor-critic models of the basal ganglia: New anatomical and computational perspectives. Neural Networks, 15, 535-547.
    • (2002) Neural Networks , vol.15 , pp. 535-547
    • Joel, D.1    Niv, J.2    Ruppin, E.3
  • 29
    • 0035957333 scopus 로고    scopus 로고
    • Formation of temporal-feature maps by axonal propagation of synaptic learning
    • Kempter, R., Leibold, C., Wagner, H., & van Hemmen, J. (2001). Formation of temporal-feature maps by axonal propagation of synaptic learning. Proc. Natl. Acad. Sci. USA, 7(98), 4166-4171.
    • (2001) Proc. Natl. Acad. Sci. USA , vol.7 , Issue.98 , pp. 4166-4171
    • Kempter, R.1    Leibold, C.2    Wagner, H.3    van Hemmen, J.4
  • 30
    • 0042276164 scopus 로고
    • A drive-reinforcementmodel of single neuron function
    • In J. Denker (Ed.), New York: American Institute of Physics
    • Klopf, A. (1986). A drive-reinforcementmodel of single neuron function. In J. Denker (Ed.), Neural networks for computing: AIP Conference Proceedings (Vol. 151, pp. 265-270). New York: American Institute of Physics.
    • (1986) Neural networks for computing: AIP Conference Proceedings , vol.151 , pp. 265-270
    • Klopf, A.1
  • 31
    • 0023878618 scopus 로고
    • A neuronal model of classical conditioning
    • Klopf, A. (1988). A neuronal model of classical conditioning. Psychobiology, 16, 85-125.
    • (1988) Psychobiology , vol.16 , pp. 85-125
    • Klopf, A.1
  • 33
    • 0042276165 scopus 로고
    • DifferentialHebbian learning
    • In J. Denker (Ed.), New York: American Institute of Physics
    • Kosko, B. (1986). DifferentialHebbian learning. In J. Denker (Ed.), Neural networks for Computing: AIP Conference Proceedings (Vol. 151, pp. 277-288). New York: American Institute of Physics.
    • (1986) Neural networks for Computing: AIP Conference Proceedings , vol.151 , pp. 277-288
    • Kosko, B.1
  • 34
    • 0025697397 scopus 로고
    • Non-Hebbian synapses in rat visual cortex
    • Kossel, A., Bonhoeffer, T., & Boltz, J. (1990). Non-Hebbian synapses in rat visual cortex. NeuroReport, 1(2), 115-118.
    • (1990) NeuroReport , vol.1 , Issue.2 , pp. 115-118
    • Kossel, A.1    Bonhoeffer, T.2    Boltz, J.3
  • 35
    • 0035842180 scopus 로고    scopus 로고
    • Temporal map formation in the barn owl's brain
    • Leibold, C., Kempter, R., & van Hemmen, J. (2001). Temporal map formation in the barn owl's brain. Phys. Rev. Lett., 87(24), 248101.
    • (2001) Phys. Rev. Lett. , vol.87 , Issue.24 , pp. 248101
    • Leibold, C.1    Kempter, R.2    van Hemmen, J.3
  • 38
    • 0031012615 scopus 로고    scopus 로고
    • Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs
    • Markram, H., Lübke, J., Frotscher, M., & Sakmann, B. (1997). Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs. Science, 275, 213-215.
    • (1997) Science , vol.275 , pp. 213-215
    • Markram, H.1    Lübke, J.2    Frotscher, M.3    Sakmann, B.4
  • 39
    • 0028972278 scopus 로고
    • Bee foraging in uncertain environments using predictive Hebbian learning
    • Montague, P., Dayan, P., Person, C., & Sejnowski, T. (1995). Bee foraging in uncertain environments using predictive Hebbian learning. Nature, 377, 725-728.
    • (1995) Nature , vol.377 , pp. 725-728
    • Montague, P.1    Dayan, P.2    Person, C.3    Sejnowski, T.4
  • 40
    • 0029981543 scopus 로고    scopus 로고
    • A framework for mesencephalic dopamine systems based on predictive Hebbian learning
    • Montague, P. R., Dayan, P., & Sejowski, T. J. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci., 16(5), 1936-1947.
    • (1996) J. Neurosci. , vol.16 , Issue.5 , pp. 1936-1947
    • Montague, P.R.1    Dayan, P.2    Sejowski, T.J.3
  • 41
    • 0035979437 scopus 로고    scopus 로고
    • Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning
    • Morimoto, J., & Doya, K. (2001). Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning. Robotics and Autonomous Systems, 36, 37-51.
    • (2001) Robotics and Autonomous Systems , vol.36 , pp. 37-51
    • Morimoto, J.1    Doya, K.2
  • 42
    • 33747585633 scopus 로고    scopus 로고
    • Midbrain dopamine neurons encode decisions for future action
    • Morris, G., Nevet, A., Arkadir, D., Vaadia, E., & Bergman, H. (2006). Midbrain dopamine neurons encode decisions for future action. Nat. Neurosci., 9(8), 1057-1063.
    • (2006) Nat. Neurosci. , vol.9 , Issue.8 , pp. 1057-1063
    • Morris, G.1    Nevet, A.2    Arkadir, D.3    Vaadia, E.4    Bergman, H.5
  • 43
    • 43949102027 scopus 로고    scopus 로고
    • Phenomenological models of synaptic plasticity based on spike timing
    • Morrison, A., Diesmann, M., & Gerstner, W. (2008). Phenomenological models of synaptic plasticity based on spike timing. Biol. Cybern., 98, 459-478.
    • (2008) Biol. Cybern. , vol.98 , pp. 459-478
    • Morrison, A.1    Diesmann, M.2    Gerstner, W.3
  • 44
    • 36248947984 scopus 로고    scopus 로고
    • Spike-frequency adapting neural assemblies: Beyond mean adaptation and renewal theories
    • Muller, E., Buesing, L., Schemmel, J., & Meier, K. (2007). Spike-frequency adapting neural assemblies: Beyond mean adaptation and renewal theories. Neural Comput., 19, 2958-3010.
    • (2007) Neural Comput. , vol.19 , pp. 2958-3010
    • Muller, E.1    Buesing, L.2    Schemmel, J.3    Meier, K.4
  • 46
    • 0036972336 scopus 로고    scopus 로고
    • Evolution of reinforcement learning in uncertain environments:Asimple explanation for complex foraging behaviors
    • Niv, Y., Joel,D., Meilijson, I., & Ruppin, E. (2002). Evolution of reinforcement learning in uncertain environments: A simple explanation for complex foraging behaviors. Adaptive Behavior, 10(1), 5-24.
    • (2002) Adaptive Behavior , vol.10 , Issue.1 , pp. 5-24
    • Niv, Y.1    Joel, D.2    Meilijson, I.3    Ruppin, E.4
  • 47
    • 0037987978 scopus 로고    scopus 로고
    • Temporal difference models and reward-related learning in the human brain
    • O'Doherty, J. P., Dayan, P., Friston, K., Critchley, H., & Dolan, R. J. (2003). Temporal difference models and reward-related learning in the human brain. Neuron, 28, 329-337.
    • (2003) Neuron , vol.28 , pp. 329-337
    • O'Doherty, J.P.1    Dayan, P.2    Friston, K.3    Critchley, H.4    Dolan, R.J.5
  • 48
    • 1942520195 scopus 로고    scopus 로고
    • Dissociable roles of ventral and dorsal striatum in instrumental conditioning
    • O'Doherty, J., Dayan, P., Schultz, J., Deichmann, R., Friston, K., & Dolan, R. J. (2004). Dissociable roles of ventral and dorsal striatum in instrumental conditioning. Science, 304, 452-454.
    • (2004) Science , vol.304 , pp. 452-454
    • O'Doherty, J.1    Dayan, P.2    Schultz, J.3    Deichmann, R.4    Friston, K.5    Dolan, R.J.6
  • 49
    • 33748302924 scopus 로고    scopus 로고
    • Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans
    • Pessiglione, M., Seymour, B., Flandin, G., Dolan, R., & Frith, C. (2006). Dopamine-dependent prediction errors underpin reward-seeking behaviour in humans. Nature, 442, 1042-1045.
    • (2006) Nature , vol.442 , pp. 1042-1045
    • Pessiglione, M.1    Seymour, B.2    Flandin, G.3    Dolan, R.4    Frith, C.5
  • 50
    • 33748898872 scopus 로고    scopus 로고
    • Triplets of spikes in a model of spike timing-dependent plasticity
    • Pfister, J.-P., & Gerstner, W. (2006). Triplets of spikes in a model of spike timing-dependent plasticity. J. Neurosci., 26, 9673-9682.
    • (2006) J. Neurosci. , vol.26 , pp. 9673-9682
    • Pfister, J.-P.1    Gerstner, W.2
  • 51
    • 38049169348 scopus 로고    scopus 로고
    • Interconnecting VLSI spiking neural networks using isochronous connections
    • In Berlin: Springer
    • Philipp, S., Grübl, A., Meier, K., & Schemmel, J. (2007). Interconnecting VLSI spiking neural networks using isochronous connections. In Proceedings of IWANN2007 (pp. 471-478). Berlin: Springer.
    • (2007) Proceedings of IWANN2007 , pp. 471-478
    • Philipp, S.1    Grübl, A.2    Meier, K.3    Schemmel, J.4
  • 52
    • 0037686661 scopus 로고    scopus 로고
    • Isotropic sequence order learning
    • Porr, B., & Wörgötter, F. (2003). Isotropic sequence order learning. Neural Comput., 15, 831-864.
    • (2003) Neural Comput. , vol.15 , pp. 831-864
    • Porr, B.1    Wörgötter, F.2
  • 53
    • 35549002871 scopus 로고    scopus 로고
    • Learning with relevance: Using a third factor to stabilize Hebbian learning
    • Porr, B., & Wörgötter, F. (2007). Learning with relevance: Using a third factor to stabilize Hebbian learning. Neural Comput., 19(10), 2694-2719.
    • (2007) Neural Comput , vol.19 , Issue.10 , pp. 2694-2719
    • Porr, B.1    Wörgötter, F.2
  • 54
    • 67650299964 scopus 로고    scopus 로고
    • Reinforcement learning in an actor-critic spiking network model
    • Potjans, W., Morrison, A., & Diesmann, M. (2007a). Reinforcement learning in an actor-critic spiking network model. Neuroforum, 8(1).
    • (2007) Neuroforum , vol.8 , Issue.1
    • Potjans, W.1    Morrison, A.2    Diesmann, M.3
  • 55
    • 85036811038 scopus 로고    scopus 로고
    • A spiking neural networkmodel for the actor-critic temporal-difference learning algorithm
    • In San Diego, CA: Society for Neuroscience
    • Potjans, W., Morrison, A., & Diesmann, M. (2007b). A spiking neural networkmodel for the actor-critic temporal-difference learning algorithm. In Proceedings of the 37th SFN Meeting. San Diego, CA: Society for Neuroscience.
    • (2007) Proceedings of the 37th SFN Meeting
    • Potjans, W.1    Morrison, A.2    Diesmann, M.3
  • 56
    • 0035489925 scopus 로고    scopus 로고
    • Spike-timing-dependent Hebbian plasticity as temporal difference learning
    • Rao, R. P. N., & Sejnowski, T. J. (2001). Spike-timing-dependent Hebbian plasticity as temporal difference learning. Neural Comput., 13, 2221-2237.
    • (2001) Neural Comput. , vol.13 , pp. 2221-2237
    • Rao, R.P.N.1    Sejnowski, T.J.2
  • 57
    • 0036592025 scopus 로고    scopus 로고
    • Dopamine-dependent plasticity of corticostriatal synapses
    • Reynolds, J. N., & Wickens, J. R. (2002). Dopamine-dependent plasticity of corticostriatal synapses. Neural Networks, 15, 507-521.
    • (2002) Neural Networks , vol.15 , pp. 507-521
    • Reynolds, J.N.1    Wickens, J.R.2
  • 58
    • 0032696609 scopus 로고    scopus 로고
    • Computational consequences of temporally asymmetric learning rules: I. Differential Hebbian learning
    • Roberts, P. D. (1999). Computational consequences of temporally asymmetric learning rules: I. Differential Hebbian learning. J. Comput. Neurosci., 7, 235-246.
    • (1999) J. Comput. Neurosci. , vol.7 , pp. 235-246
    • Roberts, P.D.1
  • 59
    • 0037057755 scopus 로고    scopus 로고
    • Getting formal with dopamine and reward
    • Schultz, W. (2002). Getting formal with dopamine and reward. Neuron, 36, 241-263.
    • (2002) Neuron , vol.36 , pp. 241-263
    • Schultz, W.1
  • 60
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate of prediction and reward
    • Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate of prediction and reward. Science, 275, 1593-1599.
    • (1997) Science , vol.275 , pp. 1593-1599
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 61
    • 0028181521 scopus 로고
    • Locally distributed synaptic potentiation on the hippocampus
    • Schuman, E., & Madison, D. (1994). Locally distributed synaptic potentiation on the hippocampus. Science, 263, 532-536.
    • (1994) Science , vol.263 , pp. 532-536
    • Schuman, E.1    Madison, D.2
  • 62
    • 0347362917 scopus 로고    scopus 로고
    • Learning spiking neural networks by reinforcement of stochastic synaptic transmission
    • Seung, H. S. (2003). Learning spiking neural networks by reinforcement of stochastic synaptic transmission. Neuron, 40, 1063-1073.
    • (2003) Neuron , vol.40 , pp. 1063-1073
    • Seung, H.S.1
  • 63
    • 2942617032 scopus 로고    scopus 로고
    • Temporal difference models describe higher-order learning in humans
    • Seymour, B., O'Doherty, J., Dayan, P., Koltzenburg, M., Jones, A., Dolan, R., et al. (2004). Temporal difference models describe higher-order learning in humans. Nature, 429, 664-667.
    • (2004) Nature , vol.429 , pp. 664-667
    • Seymour, B.1    O'Doherty, J.2    Dayan, P.3    Koltzenburg, M.4    Jones, A.5    Dolan, R.6
  • 64
    • 0032930935 scopus 로고    scopus 로고
    • A neural network model with dopamine-like reinforcement signal that learns a spatial delayed reponse task
    • Suri, R., & Schultz, W. (1999). A neural network model with dopamine-like reinforcement signal that learns a spatial delayed reponse task. Neuroscience, 91(3), 871-890.
    • (1999) Neuroscience , vol.91 , Issue.3 , pp. 871-890
    • Suri, R.1    Schultz, W.2
  • 65
    • 0035315989 scopus 로고    scopus 로고
    • Temporal difference model reproduces anticipatory neural activity
    • Suri, R. E., & Schultz, W. (2001). Temporal difference model reproduces anticipatory neural activity. Neural Comput., 13, 841-862.
    • (2001) Neural Comput. , vol.13 , pp. 841-862
    • Suri, R.E.1    Schultz, W.2
  • 66
    • 33847202724 scopus 로고
    • Learning to predict by methods of temporal difference
    • Sutton, R. (1988). Learning to predict by methods of temporal difference. Machine Learning, 3, 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.1
  • 68
    • 0034192399 scopus 로고    scopus 로고
    • Selective presynaptic propagation of long-term potentiation in defined neural networks
    • Tao, H.-z. W., Zhang, L. I., Bi, G.-q., & Poo, M.-m. (2000). Selective presynaptic propagation of long-term potentiation in defined neural networks. J. Neurosci., 20(9), 3233-3243.
    • (2000) J. Neurosci. , vol.20 , Issue.9 , pp. 3233-3243
    • Tao, H.-z.1    Zhang, L.I.2    Bi, G.-q.3    Poo, M.-m.4
  • 69
    • 0000985504 scopus 로고
    • TD-Gammon, a self-teaching backgammon program, achieves master-level play
    • Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Comput., 6(2), 215-219.
    • (1994) Neural Comput. , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.1
  • 70
    • 1642534402 scopus 로고    scopus 로고
    • Modulation of caudate activity by action contingency
    • Tricomi, E. M., Delgado, M. R., & Fiez, J. A. (2004). Modulation of caudate activity by action contingency. Neuron, 41, 281-292.
    • (2004) Neuron , vol.41 , pp. 281-292
    • Tricomi, E.M.1    Delgado, M.R.2    Fiez, J.A.3
  • 71
  • 72
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • Williams, R. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8, 229-256.
    • (1992) Machine Learning , vol.8 , pp. 229-256
    • Williams, R.1
  • 73
    • 0017524329 scopus 로고
    • An adaptive optimal controller for discrete-time Markov environments
    • Witten, I. H. (1977). An adaptive optimal controller for discrete-time Markov environments. Information and Control, 34, 286-295.
    • (1977) Information and Control , vol.34 , pp. 286-295
    • Witten, I.H.1
  • 74
    • 13244267004 scopus 로고    scopus 로고
    • Temporal sequence learning, prediction, and control: A review of different models and their relation to biological mechanisms
    • Wörgötter, F., & Porr, B. (2005). Temporal sequence learning, prediction, and control: A review of different models and their relation to biological mechanisms. Neural Comput., 17, 245-319.
    • (2005) Neural Comput. , vol.17 , pp. 245-319
    • Wörgötter, F.1    Porr, B.2
  • 75
    • 37649027755 scopus 로고    scopus 로고
    • Learning in neural networks by reinforcement of irregular spiking
    • Xie, X., & Seung, H. S. (2004). Learning in neural networks by reinforcement of irregular spiking. Phys. Rev. E, 69, 41909.
    • (2004) Phys. Rev. E , vol.69 , pp. 41909
    • Xie, X.1    Seung, H.S.2
  • 76
    • 0032480332 scopus 로고    scopus 로고
    • A critical window for cooperation and competition among developing retinotectal synapses
    • Zhang, L. I., Tao, H.W., Holt, C. E., Harris, W. A., & Poo, M.-m. (1998). A critical window for cooperation and competition among developing retinotectal synapses. Nature, 395, 37-44.
    • (1998) Nature , vol.395 , pp. 37-44
    • Zhang, L.I.1    Tao, H.W.2    Holt, C.E.3    Harris, W.A.4    Poo, M.-m.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.