메뉴 건너뛰기




Volumn , Issue , 2013, Pages 993-1000

Neural combinatorial learning of goal-directed behavior with reservoir critic and reward modulated hebbian plasticity

Author keywords

Correlation learning; Re inforcement learning; Reservoir networks; Temporal memory

Indexed keywords

CLASSICAL CONDITIONING; GOAL-DIRECTED BEHAVIOR; MOBILE ROBOT SYSTEMS; OPERANT CONDITIONING; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; RADIAL BASIS FUNCTION(RBF); RESERVOIR NETWORKS; TEMPORAL MEMORY;

EID: 84893615072     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SMC.2013.174     Document Type: Conference Paper
Times cited : (4)

References (16)
  • 1
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • Doya, K. (2000) Reinforcement Learning In Continuous Time and Space. Neural Computation, 12, 219-245.
    • (2000) Neural Computation , vol.12 , pp. 219-245
    • Doya, K.1
  • 2
    • 84887488068 scopus 로고    scopus 로고
    • Information dynamics based self-adaptive reservoir for delay temporal memory tasks
    • doi: 10.1007/s12530-013-9080-y
    • Dasgupta, S., Wörgötter, F., and Manoonpong, P. (2013) Information Dynamics based Self-adaptive Reservoir for Delay Temporal Memory Tasks. Evolving Systems, doi: 10.1007/s12530-013-9080-y.
    • (2013) Evolving Systems
    • Dasgupta, S.1    Wörgötter, F.2    Manoonpong, P.3
  • 3
    • 84876888983 scopus 로고    scopus 로고
    • Reinforcement learning using a continuous time actor-critic framework with spiking neurons
    • doi:10.1371/journal.pcbi.1003024
    • Frémaux, N., Sprekeler, H., Gerstner, W. (2013), Reinforcement Learning Using a Continuous Time Actor-Critic Framework with Spiking Neurons. PLoS Comput Biol 9(4): e1003024. doi:10.1371/journal.pcbi.1003024.
    • (2013) PLoS Comput Biol , vol.9 , Issue.4
    • Frémaux, N.1    Sprekeler, H.2    Gerstner, W.3
  • 4
    • 1842421269 scopus 로고    scopus 로고
    • Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication
    • Jaeger, H., and Haas, H. (2004): Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication. Science, 304(5667), 78-80.
    • (2004) Science , vol.304 , Issue.5667 , pp. 78-80
    • Jaeger, H.1    Haas, H.2
  • 5
    • 0036834701 scopus 로고    scopus 로고
    • Real-time computing without stable states: A new framework for neural computation based on perturbations
    • Maass, W., Natschlger, T., and Markram, H. (2002), Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural Computation. 14 (11): 253160.
    • (2002) Neural Computation , vol.14 , Issue.11 , pp. 253160
    • Maass, W.1    Natschlger, T.2    Markram, H.3
  • 6
    • 77954098778 scopus 로고    scopus 로고
    • A reward-modulated hebbian learning rule can explain experimentally observed network reorganization in a brain control task
    • Legenstein, R., Chase, S.M., Schwartz, A.B., and Maass, W. (2010) A reward-modulated hebbian learning rule can explain experimentally observed network reorganization in a brain control task. J Neurosci 30:84008410.
    • (2010) J Neurosci , vol.30 , pp. 84008410
    • Legenstein, R.1    Chase, S.M.2    Schwartz, A.B.3    Maass, W.4
  • 7
    • 78650211630 scopus 로고    scopus 로고
    • Extraction of reward-related feature space using correlation-based and reward- based learning methods
    • Sydney, Australia, November 22-25 (ICONIP'10), Part I, LNCS 6443
    • Manoonpong, P., Wörgötter, F., and Morimoto, J. (2010) Extraction of Reward-Related Feature Space Using Correlation-Based and Reward- Based Learning Methods. In Proc. 17th International Conference on Neural Information Processing, Sydney, Australia, November 22-25 (ICONIP'10), Part I, LNCS 6443, pp. 414-421.
    • (2010) Proc. 17th International Conference on Neural Information Processing , pp. 414-421
    • Manoonpong, P.1    Wörgötter, F.2    Morimoto, J.3
  • 8
    • 84880376650 scopus 로고    scopus 로고
    • Combining correlation-based and reward-based learning in neural control for policy improvement
    • doi: 10.1142/S021952591350015X
    • Manoonpong, P., Kolodziejski, C., Wörgötter, F., and Morimoto J. (2013) Combining Correlation-Based and Reward-Based Learning in Neural Control for Policy Improvement. Advances in Complex Systems, doi: 10.1142/S021952591350015X.
    • (2013) Advances in Complex Systems
    • Morimoto, J.1    Manoonpong, P.2    Kolodziejski, C.3    Wörgötter, F.4
  • 12
    • 0004281531 scopus 로고
    • Oxford University Press, Oxford, UK
    • Pavlov, I., Conditioned reflexes (Oxford University Press, Oxford, UK, 1927).
    • (1927) Conditioned Reflexes
    • Pavlov, I.1
  • 13
    • 33646781302 scopus 로고    scopus 로고
    • Strongly improved stability and faster convergence of temporal sequence learning by utilising input correlations only
    • Porr, B., and Wörgötter, F. (2006), Strongly improved stability and faster convergence of temporal sequence learning by utilising input correlations only. Neural computation 18, 1380-1412.
    • (2006) Neural Computation , vol.18 , pp. 1380-1412
    • Porr, B.1    Wörgötter, F.2
  • 15
    • 34247243264 scopus 로고    scopus 로고
    • Synergies between intrinsic and synaptic plasticity mechanisms
    • Triesch, J. (2007), Synergies between Intrinsic and Synaptic Plasticity Mechanisms. Neural Computation 4, 885-909.
    • (2007) Neural Computation , vol.4 , pp. 885-909
    • Triesch, J.1
  • 16
    • 13244267004 scopus 로고    scopus 로고
    • Temporal sequence learning, prediction and control - A review of different models and their relation to biological mechanism
    • 1000
    • Wörgötter, F., and Porr, B. (2004) Temporal sequence learning, prediction and control - A review of different models and their relation to biological mechanism. Neural Computation. 17, 245-319. 1000.
    • (2004) Neural Computation , vol.17 , pp. 245-319
    • Wörgötter, F.1    Porr, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.