SCOPUS Information Retrieval Platform

Proceedings - 2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013

Volume , Issue , 2013, Pages 993-1000

Neural combinatorial learning of goal-directed behavior with reservoir critic and reward modulated Hebbian plasticity

(4) Dasgupta, Sakyasingha a; Wörgötter, Florentin a; Morimoto, Jun b; Manoonpong, Poramate a

a UNIVERSITY OF GÖTTINGEN (Germany)

b ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

Author keywords

Correlation learning; Reinforcement learning; Reservoir networks; Temporal memory

Indexed keywords

CLASSICAL CONDITIONING; GOAL-DIRECTED BEHAVIOR; MOBILE ROBOT SYSTEMS; OPERANT CONDITIONING; PARTIALLY OBSERVABLE MARKOV DECISION PROCESS; RADIAL BASIS FUNCTION (RBF); RESERVOIR NETWORKS; TEMPORAL MEMORY;

CYBERNETICS; RADIAL BASIS FUNCTION NETWORKS;

LEARNING SYSTEMS;

EID: 84893615072 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SMC.2013.174 Document Type: Conference Paper

Times cited : (4)

References (16)

1
- 0033629916
- Reinforcement learning in continuous time and space
- Doya, K. (2000) Reinforcement Learning In Continuous Time and Space. Neural Computation, 12, 219-245.
- (2000) Neural Computation , vol.12 , pp. 219-245
- Doya, K.¹

2
- 84887488068
- Information dynamics based self-adaptive reservoir for delay temporal memory tasks
- doi: 10.1007/s12530-013-9080-y
- Dasgupta, S., Wörgötter, F., and Manoonpong, P. (2013) Information Dynamics based Self-adaptive Reservoir for Delay Temporal Memory Tasks. Evolving Systems, doi: 10.1007/s12530-013-9080-y.
- (2013) Evolving Systems
- Dasgupta, S.¹ Wörgötter, F.² Manoonpong, P.³

3
- 84876888983
- Reinforcement learning using a continuous time actor-critic framework with spiking neurons
- doi:10.1371/journal.pcbi.1003024
- Frémaux, N., Sprekeler, H., Gerstner, W. (2013), Reinforcement Learning Using a Continuous Time Actor-Critic Framework with Spiking Neurons. PLoS Comput Biol 9(4): e1003024. doi:10.1371/journal.pcbi.1003024.
- (2013) PLoS Comput Biol , vol.9 , Issue.4
- Frémaux, N.¹ Sprekeler, H.² Gerstner, W.³

4
- 1842421269
- Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication
- Jaeger, H., and Haas, H. (2004): Harnessing Nonlinearity: Predicting Chaotic Systems and Saving Energy in Wireless Communication. Science, 304(5667), 78-80.
- (2004) Science , vol.304 , Issue.5667 , pp. 78-80
- Jaeger, H.¹ Haas, H.²

5
- 0036834701
- Real-time computing without stable states: A new framework for neural computation based on perturbations
- Maass, W., Natschläger, T., and Markram, H. (2002), Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural Computation. 14 (11): 2531-2560.
- (2002) Neural Computation , vol.14 , Issue.11 , pp. 2531-2560
- Maass, W.¹ Natschläger, T.² Markram, H.³

6
- 77954098778
- A reward-modulated Hebbian learning rule can explain experimentally observed network reorganization in a brain control task
- Legenstein, R., Chase, S.M., Schwartz, A.B., and Maass, W. (2010) A reward-modulated Hebbian learning rule can explain experimentally observed network reorganization in a brain control task. J Neurosci 30: 8400-8410.
- (2010) J Neurosci , vol.30 , pp. 8400-8410
- Legenstein, R.¹ Chase, S.M.² Schwartz, A.B.³ Maass, W.⁴

7
- 78650211630
- Extraction of reward-related feature space using correlation-based and reward-based learning methods
- Sydney, Australia, November 22-25 (ICONIP'10), Part I, LNCS 6443
- Manoonpong, P., Wörgötter, F., and Morimoto, J. (2010) Extraction of Reward-Related Feature Space Using Correlation-Based and Reward-Based Learning Methods. In Proc. 17th International Conference on Neural Information Processing, Sydney, Australia, November 22-25 (ICONIP'10), Part I, LNCS 6443, pp. 414-421.
- (2010) Proc. 17th International Conference on Neural Information Processing , pp. 414-421
- Manoonpong, P.¹ Wörgötter, F.² Morimoto, J.³

8
- 84880376650
- Combining correlation-based and reward-based learning in neural control for policy improvement
- doi: 10.1142/S021952591350015X
- Manoonpong, P., Kolodziejski, C., Wörgötter, F., and Morimoto, J. (2013) Combining Correlation-Based and Reward-Based Learning in Neural Control for Policy Improvement. Advances in Complex Systems, doi: 10.1142/S021952591350015X.
- (2013) Advances in Complex Systems
- Manoonpong, P.¹ Kolodziejski, C.² Wörgötter, F.³ Morimoto, J.⁴

9
- 0032312876
- Reinforcement learning of dynamic motor sequence: Learning to stand up
- IEEE
- Morimoto, J., and Doya, K. (1998) Reinforcement learning of dynamic motor sequence: Learning to stand up. In Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems, Vol. 3. IEEE.
- (1998) Proc. IEEE/RSJ International Conference on Intelligent Robots and Systems , vol.3
- Morimoto, J.¹ Doya, K.²

10
- 85088916195
- Anticipating rewards in continuous time and space with echo state networks and actor-critic design
- Oubbati, M., Kächele, M., Koprinkova-Hristova, P., and Palm, G. (2011), Anticipating Rewards in Continuous Time and Space with Echo State Networks and Actor-Critic Design. In Proc. 19th European Symposium on Artificial Neural Networks (ESANN).
- (2011) Proc. 19th European Symposium on Artificial Neural Networks (ESANN)
- Oubbati, M.¹ Kächele, M.² Koprinkova-Hristova, P.³ Palm, G.⁴

11
- 78751554137
- Adaptive critic design with echo state network
- Koprinkova-Hristova, P., Oubbati, M., and Palm, G. (2010). Adaptive critic design with echo state network. In Proc. IEEE International Conference on Systems, Man, and Cybernetics, 1010-1015.
- (2010) Proc. IEEE International Conference on Systems, Man, and Cybernetics , pp. 1010-1015
- Koprinkova-Hristova, P.¹ Oubbati, M.² Palm, G.³

12
- 0004281531
- Oxford University Press, Oxford, UK
- Pavlov, I., Conditioned reflexes (Oxford University Press, Oxford, UK, 1927).
- (1927) Conditioned Reflexes
- Pavlov, I.¹

13
- 33646781302
- Strongly improved stability and faster convergence of temporal sequence learning by utilising input correlations only
- Porr, B., and Wörgötter, F. (2006), Strongly improved stability and faster convergence of temporal sequence learning by utilising input correlations only. Neural Computation 18, 1380-1412.
- (2006) Neural Computation , vol.18 , pp. 1380-1412
- Porr, B.¹ Wörgötter, F.²

14
- 0003880401
- Appleton-Century-Crofts, New York
- Skinner, B., The Behavior of Organisms: An Experimental Analysis (Appleton-Century-Crofts, New York, 1938).
- (1938) The Behavior of Organisms: An Experimental Analysis
- Skinner, B.¹

15
- 34247243264
- Synergies between intrinsic and synaptic plasticity mechanisms
- Triesch, J. (2007), Synergies between Intrinsic and Synaptic Plasticity Mechanisms. Neural Computation 19(4), 885-909.
- (2007) Neural Computation , vol.19 , Issue.4 , pp. 885-909
- Triesch, J.¹

16
- 13244267004
- Temporal sequence learning, prediction and control - A review of different models and their relation to biological mechanism
- Wörgötter, F., and Porr, B. (2004) Temporal sequence learning, prediction and control - A review of different models and their relation to biological mechanism. Neural Computation. 17, 245-319.
- (2004) Neural Computation , vol.17 , pp. 245-319
- Wörgötter, F.¹ Porr, B.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.