메뉴 건너뛰기




Volumn 25, Issue 4, 2013, Pages 940-978

Solving the distal reward problem with rare correlations

Author keywords

[No Author keywords available]

Indexed keywords

ACTION POTENTIAL; BIOLOGICAL MODEL; COMPUTER SIMULATION; LEARNING; LETTER; MEMORY; NERVE CELL; NERVE CELL PLASTICITY; PHYSIOLOGY; REINFORCEMENT; REWARD; SYNAPSE;

EID: 84877812921     PISSN: 08997667     EISSN: 1530888X     Source Type: Journal    
DOI: 10.1162/NECO_a_00419     Document Type: Letter
Times cited : (24)

References (66)
  • 2
    • 0037930510 scopus 로고    scopus 로고
    • An embodied model of learning, plasticity, and reward
    • Alexander, W. H., & Sporns, O. (2002). An embodied model of learning, plasticity, and reward. Adaptive Behavior, 10, 143-159.
    • (2002) Adaptive Behavior , vol.10 , pp. 143-159
    • Alexander, W.H.1    Sporns, O.2
  • 5
    • 40149084151 scopus 로고    scopus 로고
    • Synapse-specific stabilization of plasticity processes: The synaptic tagging and capture hypothesis revisited 10 years later
    • Barco, A., Lopez de Armentia, M., & Alarcon, J. M. (2008). Synapse-specific stabilization of plasticity processes: The synaptic tagging and capture hypothesis revisited 10 years later. Neuroscience and Biobehavioral Reviews, 32, 831-851.
    • (2008) Neuroscience and Biobehavioral Reviews , vol.32 , pp. 831-851
    • Barco, A.1    Lopez de Armentia, M.2    Alarcon, J.M.3
  • 7
    • 0032535029 scopus 로고    scopus 로고
    • Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type
    • Bi, G.-q., & Poo, M.-m. (1998). Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type. Journal of Neuroscience, 18(24), 10464-10472.
    • (1998) Jurnal of Neuroscience , vol.1 , Issue.24 , pp. 10464-10472
    • Bi, G.Q.1    Poo, M.M.2
  • 8
    • 0034928712 scopus 로고    scopus 로고
    • Synaptic modification by correlated activity: Hebb's postulate revisited
    • Bi, G.-q., & Poo, M.-m. (2001). Synaptic modification by correlated activity: Hebb's postulate revisited. Annual Review of Neuroscience, 24, 139-166.
    • (2001) Annual Review of Neuroscience , vol.24 , pp. 139-166
    • Bi, G.-Q.1    Poo, M.-M.2
  • 9
    • 0029019565 scopus 로고
    • In vivo assessment of dopamine uptake in rat medial prefrontal cortex: Comparison with dorsal striatum and nucleus accumbens
    • Cass, W. A., & Gerhardt, G. A. (1995). In vivo assessment of dopamine uptake in rat medial prefrontal cortex: Comparison with dorsal striatum and nucleus accumbens. Journal of Neurochemistry, 65, 201-207.
    • (1995) Journal of Neurochemistry , vol.65 , pp. 201-207
    • Cass, W.A.1    Gerhardt, G.A.2
  • 10
    • 11844273191 scopus 로고    scopus 로고
    • Hebb's synapse and learning rule: A history and commentary.
    • Donald O
    • Cooper, S. J. (2005). Donald O. Hebb's synapse and learning rule: A history and commentary. Neuroscience and Biobehavioral Reviews, 28(8), 851-874.
    • (2005) Neuroscience and Biobehavioral Reviews , vol.28 , Issue.8 , pp. 851-874
    • Cooper, S.J.1
  • 11
    • 84964166431 scopus 로고
    • Pharmacology and nerve-endings
    • Dale, H. H. (1935). Pharmacology and nerve-endings. Proc. R. Soc. Med., 28, 319-332.
    • (1935) Proc. R. Soc. Med. , vol.28 , pp. 319-332
    • Dale, H.H.1
  • 12
    • 11144307494 scopus 로고    scopus 로고
    • Synaptic and spiking dynamics underlying reward reversal in the orbitofrontal cortex
    • Deco, G., & Rolls, E. T. (2005). Synaptic and spiking dynamics underlying reward reversal in the orbitofrontal cortex. Cerebral Cortex, 15, 15-30.
    • (2005) Cerebral Cortex , vol.15 , pp. 15-30
    • Deco, G.1    Rolls, E.T.2
  • 13
    • 37549060355 scopus 로고    scopus 로고
    • Reinforcement learning with modulated spike timing-dependent synaptic plasticity
    • Farries,M. A., & Fairhall, A. L. (2007). Reinforcement learning with modulated spike timing-dependent synaptic plasticity. Journal of Neurophysiology, 98, 3648-3665.
    • (2007) Journal of Neurophysiology , vol.98 , pp. 3648-3665
    • Farries, M.A.1    Fairhall, A.L.2
  • 14
    • 0032523539 scopus 로고    scopus 로고
    • Computational models of neuromodulation
    • Fellous, J.-M., & Linster, C. (1998). Computational models of neuromodulation. Neural Computation, 10, 771-805.
    • (1998) Neural Computation , vol.10 , pp. 771-805
    • Fellous, J.M.1    Linster, C.2
  • 15
    • 34249708388 scopus 로고    scopus 로고
    • Reinforcement learning through modulation of spike-timingdependent synaptic plasticity
    • Florian, R. V. (2007). Reinforcement learning through modulation of spike-timingdependent synaptic plasticity. Neural Computation, 19, 1468-1502.
    • (2007) Neural Computation , vol.19 , pp. 1468-1502
    • Florian, R.V.1
  • 16
    • 33645458694 scopus 로고    scopus 로고
    • Reverse replay of behavioural sequences in hippocampal place cells during the awake state
    • Foster, D. J., & Wilson, M. A. (2006). Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature, 440, 683-680.
    • (2006) Nature , vol.440 , pp. 683-680
    • Foster, D.J.1    Wilson, M.A.2
  • 17
    • 0031024891 scopus 로고    scopus 로고
    • Synaptic tagging and long-term potentiation
    • Frey, U., & Morris, R. G. M. (1997). Synaptic tagging and long-term potentiation. Nature, 385, 533-536.
    • (1997) Nature , vol.385 , pp. 533-536
    • Frey, U.1    Morris, R.G.M.2
  • 18
    • 0028025252 scopus 로고
    • Efflux of dopamine from the synaptic cleft in the nucleus accumbens of the rat brain
    • Garris, P., Ciolkowski, E., Pastore, P., & Wighmann, R. (1994). Efflux of dopamine from the synaptic cleft in the nucleus accumbens of the rat brain. Journal of Neuroscience, 14, 6084-6093.
    • (1994) Journal of Neuroscience , vol.14 , pp. 6084-6093
    • Garris, P.1    Ciolkowski, E.2    Pastore, P.3    Wighmann, R.4
  • 19
    • 0040744515 scopus 로고    scopus 로고
    • Mathematical formulations of Hebbian learning
    • Gerstner, W., & Kistler, M. W. (2002). Mathematical formulations of Hebbian learning. Biological Cybernetics, 87, 404-415.
    • (2002) Biological Cybernetics , vol.87 , pp. 404-415
    • Gerstner, W.1    Kistler, M.W.2
  • 21
    • 0028958196 scopus 로고
    • Neuromodulation and cortical function:Modeling the physiological basis of behavior
    • Hasselmo,M. E. (1995). Neuromodulation and cortical function:Modeling the physiological basis of behavior. Behavioural Brain Research, 67, 1-27.
    • (1995) Behavioural Brain Research , vol.67 , pp. 1-27
    • Hasselmo, M.E.1
  • 22
    • 20444394794 scopus 로고    scopus 로고
    • Expecting the unexpected: Modeling of neuromodulation
    • Hasselmo, M. E. (2005). Expecting the unexpected: Modeling of neuromodulation. Neuron, 46, 426-528.
    • (2005) Neuron , vol.46 , pp. 426-528
    • Hasselmo, M.E.1
  • 24
    • 67650881941 scopus 로고    scopus 로고
    • Learning substrates in the primate prefrontal cortex and striatum: Sustained activity related to successful actions
    • Histed, M. H., Pasupathy, A., & Miller, E. K. (2009). Learning substrates in the primate prefrontal cortex and striatum: Sustained activity related to successful actions. Neuron, 63, 146-148.
    • (2009) Neuron , vol.63 , pp. 146-148
    • Histed, M.H.1    Pasupathy, A.2    Miller, E.K.3
  • 26
    • 33644898137 scopus 로고    scopus 로고
    • Polychonization: Computation with spikes
    • Izhikevich, E. M. (2006). Polychonization: Computation with spikes. Neural Computation, 18, 245-282.
    • (2006) Neural Computation , vol.18 , pp. 245-282
    • Izhikevich, E.M.1
  • 27
    • 34948906745 scopus 로고    scopus 로고
    • Solving the distal reward problem through linkage of STDP and dopamine signaling
    • Izhikevich, E.M. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral Cortex, 17, 2443-2452.
    • (2007) Cerebral Cortex , vol.17 , pp. 2443-2452
    • Izhikevich, E.M.1
  • 28
    • 0013812351 scopus 로고
    • Heterosynaptic facilitation in neurones of the abdominal ganglion of Aplysia depilans
    • Kandel, E. R., & Tauc, L. (1965). Heterosynaptic facilitation in neurones of the abdominal ganglion of Aplysia depilans. Journal of Physiology, 181, 1-27.
    • (1965) Journal of Physiology , vol.181 , pp. 1-27
    • Kandel, E.R.1    Tauc, L.2
  • 29
    • 55449121121 scopus 로고    scopus 로고
    • A learning theory for rewardmodulated spike-timing-dependent plasticity with application to biofeedback
    • Legenstein, R., Pecevski, D., & Maass, W. (2008). A learning theory for rewardmodulated spike-timing-dependent plasticity with application to biofeedback. PLoS Comput. Biol., 4(10), e1000180.
    • (2008) PLoS Comput. Biol. , vol.4 , Issue.10
    • Legenstein, R.1    Pecevski, D.2    Maass, W.3
  • 30
    • 0031012390 scopus 로고    scopus 로고
    • A synaptically controlled, associative signal for Hebbian plasticity in hippocampal neurons
    • Magee, J. C., & Johnston, D. (1997). A synaptically controlled, associative signal for Hebbian plasticity in hippocampal neurons. Science, 275, 209-213.
    • (1997) Science , vol.275 , pp. 209-213
    • Magee, J.C.1    Johnston, D.2
  • 31
    • 0030079531 scopus 로고    scopus 로고
    • Neuralmodulation: Following your own rhythm
    • Marder, E. (1996). Neuralmodulation: Following your own rhythm. Current Biology, 6(2), 119-121.
    • (1996) Current Biology , vol.6 , Issue.2 , pp. 119-121
    • Marder, E.1
  • 32
    • 0036592024 scopus 로고    scopus 로고
    • Cellular, synaptic and network effects of neuromodulation
    • Marder, E., & Thirumalai, V. (2002). Cellular, synaptic and network effects of neuromodulation. Neural Networks, 15, 479-493.
    • (2002) Neural Networks , vol.15 , pp. 479-493
    • Marder, E.1    Thirumalai, V.2
  • 33
    • 0031012615 scopus 로고    scopus 로고
    • Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs
    • Markram, H., Lübke, J., Frotscher, M., & Sakmann, B. (1997). Regulation of synaptic efficacy by coincidence of postsynaptic APs and EPSPs. Science, 275, 213-215.
    • (1997) Science , vol.275 , pp. 213-215
    • Markram, H.1    Lübke, J.2    Frotscher, M.3    Sakmann, B.4
  • 34
    • 0028972278 scopus 로고
    • Bee foraging in uncertain environments using predictive Hebbian learning
    • Montague, P. R., Dayan, P., Person, C., & Sejnowski, T. J. (1995). Bee foraging in uncertain environments using predictive Hebbian learning. Nature, 377, 725-728.
    • (1995) Nature , vol.377 , pp. 725-728
    • Montague, P.R.1    Dayan, P.2    Person, C.3    Sejnowski, T.J.4
  • 35
    • 0035152958 scopus 로고    scopus 로고
    • Abstract reward and punishment representations in the humanorbitofrontal cortex
    • O'Doherty, J. P., Kringelbach, M. L., Rolls, E. T., & Andrews, C. (2001). Abstract reward and punishment representations in the humanorbitofrontal cortex. Nature Neuroscience, 4(1), 95-102.
    • (2001) Nature Neuroscience , vol.4 , Issue.1 , pp. 95-102
    • O'Doherty, J.P.1    Kringelbach, M.L.2    Rolls, E.T.3    Andrews, C.4
  • 36
    • 21544455210 scopus 로고    scopus 로고
    • Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward-learning Network
    • Pan, W.-X., Schmidt, R., Wickens, J. R., & Hyland, B. I. (2005). Dopamine cells respond to predicted events during classical conditioning: Evidence for eligibility traces in the reward-learning Network. Journal of Neuroscience, 25(26), 6235- 6242.
    • (2005) Journal of Neuroscience , vol.5 , Issue.26 , pp. 6235-6242
    • Pan, W.X.1    Schmidt, R.2    Wickens, J.R.3    Hyland, B.I.4
  • 37
    • 78650996104 scopus 로고    scopus 로고
    • Synaptic tagging, evaluation of memories, and the distal reward problem
    • Päpper, M., Kempter, R., & Leibold, C. (2011). Synaptic tagging, evaluation of memories, and the distal reward problem. Learning and Memory, 18, 58-70.
    • (2011) Learning and Memory , vol.18 , pp. 58-70
    • Päpper, M.1    Kempter, R.2    Leibold, C.3
  • 38
  • 41
    • 35549002871 scopus 로고    scopus 로고
    • Learning with relevance: Using a third factor to stabilize Hebbian learning
    • Porr, B., & Wörgötter, F. (2007). Learning with relevance: Using a third factor to stabilize Hebbian learning. Neural Computation, 19(10), 2694-2719.
    • (2007) Neural Computation , vol.19 , Issue.10 , pp. 2694-2719
    • Porr, B.1    Wörgötter, F.2
  • 42
    • 79958078227 scopus 로고    scopus 로고
    • An imperfect dopaminergic error signal can drive temporal-difference learning
    • Potjans,W., Diesmann, M., & Morrison, A. (2011). An imperfect dopaminergic error signal can drive temporal-difference learning. PLoS Computational Biology, 7(5), 1-20.
    • (2011) PLoS Computational Biology , vol.7 , Issue.5 , pp. 1-20
    • Potjans, W.1    Diesmann, M.2    Morrison, A.3
  • 43
    • 67650298948 scopus 로고    scopus 로고
    • A spiking neural network model of an actor-critic learning agent
    • Potjans, W., Morrison, A., & Diesmann, M. (2009). A spiking neural network model of an actor-critic learning agent. Neural Computation, 21(2), 301-339.
    • (2009) Neural Computation , vol.21 , Issue.2 , pp. 301-339
    • Potjans, W.1    Morrison, A.2    Diesmann, M.3
  • 45
    • 78650336345 scopus 로고    scopus 로고
    • Making memories last: The synaptic tagging and capture hypothesis
    • Redondo, R. L., & Morris, R. G. M. (2011). Making memories last: The synaptic tagging and capture hypothesis. Nature Reviews Neuroscience, 12, 17-30.
    • (2011) Nature Reviews Neuroscience , vol.12 , pp. 17-30
    • Redondo, R.L.1    Morris, R.G.M.2
  • 47
    • 39349084640 scopus 로고    scopus 로고
    • Expected value, reward outcome, and temporal difference error representations in a probabilistic decision task
    • Rolls, E T., McCabe, C., & Redoute, J. (2008). Expected value, reward outcome, and temporal difference error representations in a probabilistic decision task. Cerebral Cortex, 18, 652-663.
    • (2008) Cerebral Cortex , vol.18 , pp. 652-663
    • Rolls, E.T.1    McCabe, C.2    Redoute, J.3
  • 48
    • 38149103483 scopus 로고    scopus 로고
    • Order-dependent coincidence detection in cerebellar purkinje neurons at the inositol ttrisphosphate receptor
    • Sarkisov, D. V. P., & Wang, S. S. H. (2008). Order-dependent coincidence detection in cerebellar purkinje neurons at the inositol ttrisphosphate receptor. Journal of Neuroscience, 28(1), 133-142.
    • (2008) Journal of Neuroscience , vol.28 , Issue.1 , pp. 133-142
    • Sarkisov, D.V.P.1    Wang, S.S.H.2
  • 49
    • 0031867046 scopus 로고    scopus 로고
    • Predictive reward signal of dopamine neurons
    • Schultz, W. (1998). Predictive reward signal of dopamine neurons. Journal of Neurophysiology, 80, 1-27.
    • (1998) Journal of Neurophysiology , vol.80 , pp. 1-27
    • Schultz, W.1
  • 50
    • 0037057755 scopus 로고    scopus 로고
    • Getting formal with dopamine and reward
    • Schultz,W. (2002). Getting formal with dopamine and reward. Neuron, 36, 241-263.
    • (2002) Neuron , vol.36 , pp. 241-263
    • Schultz, W.1
  • 51
    • 32444439058 scopus 로고    scopus 로고
    • Behavioural theories and the neurophysiology of reward
    • Schultz,W. (2006). Behavioural theories and the neurophysiology of reward. Annual Review of Psychology, 57, 87-115.
    • (2006) Annual Review of Psychology , vol.57 , pp. 87-115
    • Schultz, W.1
  • 52
    • 0027468102 scopus 로고
    • Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task
    • Schultz, W., Apicella, P., & Ljungberg, T. (1993). Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. Journal of Neuroscience, 13, 900-913.
    • (1993) Journal of Neuroscience , vol.13 , pp. 900-913
    • Schultz, W.1    Apicella, P.2    Ljungberg, T.3
  • 53
    • 0030896968 scopus 로고    scopus 로고
    • A neural substrate for prediction and reward
    • Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate for prediction and reward. Science, 275, 1593-1598.
    • (1997) Science , vol.275 , pp. 1593-1598
    • Schultz, W.1    Dayan, P.2    Montague, P.R.3
  • 56
    • 84865440918 scopus 로고    scopus 로고
    • From modulated Hebbian plasticity to simple behavior learning through noise and weight saturation
    • Soltoggio, A, & Stanley, K. O. (2012). From modulated Hebbian plasticity to simple behavior learning through noise and weight saturation. Neural Networks, 34, 28-41.
    • (2012) Neural Networks , vol.34 , pp. 28-41
    • Soltoggio, A.1    Stanley, K.O.2
  • 57
    • 33750014486 scopus 로고    scopus 로고
    • Learning at the edge of chaos: Temporal coupling of spiking neurons controller for autonomous robotic
    • Palo Alto, CA: AAAI.
    • Soula, H., Alwan, A., & Beslon, G. (2005). Learning at the edge of chaos: Temporal coupling of spiking neurons controller for autonomous robotic. In Proceedings of the AAAI Spring Symposia on Developmental Robotics. Palo Alto, CA: AAAI.
    • (2005) Proceedings of the AAAI Spring Symposia on Developmental Robotics.
    • Soula, H.1    Alwan, A.2    Beslon, G.3
  • 62
    • 0022156246 scopus 로고
    • The basis of superstitious behavior: Chance contingency, stimulus substitution, or appetitive behavior
    • Timberlake, W., & Lucas, G. A. (1985). The basis of superstitious behavior: Chance contingency, stimulus substitution, or appetitive behavior?Journal of Experimental Analysis of Behaviour, 44(3), 279-299.
    • (1985) Journal of Experimental Analysis of Behaviour , vol.44 , Issue.3 , pp. 279-299
    • Timberlake, W.1    Lucas, G.A.2
  • 63
    • 74549209037 scopus 로고    scopus 로고
    • Spikebased reinforcement learning in continuous state and action space: When policy gradient methods fail
    • Vasilaki, E., Frémaux, N., Urbanczik, R., Senn, W., & Gerstner, W. (2009). Spikebased reinforcement learning in continuous state and action space: When policy gradient methods fail. PLoS Computational Biology, 5(12).
    • (2009) PLoS Computational Biology , vol.5 , Issue.12
    • Vasilaki, E.1    Frémaux, N.2    Urbanczik, R.3    Senn, W.4    Gerstner, W.5
  • 64
    • 0033680563 scopus 로고    scopus 로고
    • Coincidence detection in single dendritic spines mediated by calcium release
    • Wang, S. S. H., Denk, W., & Häusser, M. (2000). Coincidence detection in single dendritic spines mediated by calcium release. Nature Neuroscience, 3(12), 1266-1273.
    • (2000) Nature Neuroscience , vol.3 , Issue.12 , pp. 1266-1273
    • Wang, S.S.H.1    Denk, W.2    Häusser, M.3
  • 65
    • 0025241289 scopus 로고
    • nControl of dopamine extracellular concentration in rat striatum by impulse flow and uptake
    • Wighmann, R., & Zimmerman, J. (1990). Control of dopamine extracellular concentration in rat striatum by impulse flow and uptake. Brain Res. Brain Res. Rev., 15(2), 135-144.
    • (1990) Brain Res. Brain Res. Rev. , vol.15 , Issue.2 , pp. 135-144
    • Wighmann, R.1    Zimmerman, J.2
  • 66
    • 2642519680 scopus 로고    scopus 로고
    • Dopamine, learning and motivation
    • Wise, R. A. (2004). Dopamine, learning and motivation. Nature Reviews Neuroscience, 5, 1-12.
    • (2004) Nature Reviews Neuroscience , vol.5 , pp. 1-12
    • Wise, R.A.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.