메뉴 건너뛰기




Volumn , Issue , 2010, Pages 1010-1015

Adaptive critic design with echo state network

Author keywords

Adaptive critic designs; Echo state networks; Mobile robot; Reinforcement learning

Indexed keywords

ADAPTIVE CRITIC; ADAPTIVE CRITIC DESIGNS; ECHO STATE NETWORKS; MOBILE ROBOT CONTROL; NOVEL NEURAL NETWORK; ON-LINE APPLICATIONS; ONLINE TRAINING; OPTIMIZATION APPROACH; TRAINING ALGORITHMS;

EID: 78751554137     PISSN: 1062922X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICSMC.2010.5641744     Document Type: Conference Paper
Times cited : (23)

References (18)
  • 1
    • 0020970738 scopus 로고
    • Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems
    • A.G. Barto, R.S. Sutton, C.W. Anderson, Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems, IEEE Trans. on Systems, Man and Cybernetics, vol. 13, No 5, 1983, pp. 834-846.
    • (1983) IEEE Trans. on Systems, Man and Cybernetics , vol.13 , Issue.5 , pp. 834-846
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 2
    • 85012688561 scopus 로고
    • Princeton, NJ: Princeton Univ. Press
    • R.E. Bellman, Dynamic Programming, Princeton, NJ: Princeton Univ. Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 4
    • 33749833931 scopus 로고    scopus 로고
    • Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the "echo state network" approach
    • German National Research Center for Information Technology
    • H. Jaeger, Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the "echo state network" approach, GMD Report 159, German National Research Center for Information Technology, 2002 p.48.
    • (2002) GMD Report 159 , pp. 48
    • Jaeger, H.1
  • 5
    • 78349289898 scopus 로고    scopus 로고
    • Adaptive nonlinear system identification with echo state networks
    • MIT Press, Cambridge, MA
    • H. Jaeger, Adaptive nonlinear system identification with echo state networks, In: Advances in Neural Information Processing Systems 15 (NIPS 2002), MIT Press, Cambridge, MA, 2003, pp.593-600.
    • (2003) Advances in Neural Information Processing Systems 15 (NIPS 2002) , pp. 593-600
    • Jaeger, H.1
  • 6
    • 68649088777 scopus 로고    scopus 로고
    • Reservoir computing approaches to recurrent neural network training
    • M. Lukosevicius, H. Jaeger, Reservoir computing approaches to recurrent neural network training, Computer Science Review, vol.3, 2009, pp.127-149.
    • (2009) Computer Science Review , vol.3 , pp. 127-149
    • Lukosevicius, M.1    Jaeger, H.2
  • 7
    • 78751559967 scopus 로고    scopus 로고
    • Neural techniques in control
    • Neural Networks for Instrumentation, Measurement and Related Industrial Applications, Edited by S. Ablameyko, L. Goras, M. Gori and V. Piuri. IOS Press, Amsterdam
    • A. Pacut, Neural techniques in control, In: Neural Networks for Instrumentation, Measurement and Related Industrial Applications, Edited by S. Ablameyko, L. Goras, M. Gori and V. Piuri. NATO Science Series vol. 185, IOS Press, Amsterdam, 2003, pp.78-118.
    • (2003) NATO Science Series , vol.185 , pp. 78-118
    • Pacut, A.1
  • 10
    • 34547133026 scopus 로고    scopus 로고
    • Training recurrent neurocontrollers for real-time applications
    • D. Prokhorov, Training recurrent neurocontrollers for real-time applications, IEEE Trans. on Neural Networks, vol.18, N04, 2007, pp.1003-1015.
    • (2007) IEEE Trans. on Neural Networks , vol.18 , Issue.4 , pp. 1003-1015
    • Prokhorov, D.1
  • 13
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • J. Si, Y.-T. Wang, On-line learning control by association and reinforcement, IEEE Trans. on Neural Networks, vol.12, No2, 2001, pp.264-276.
    • (2001) IEEE Trans. on Neural Networks , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2
  • 14
    • 33847202724 scopus 로고
    • Learning to predict by methods of temporal differences
    • R.S. Sutton, Learning to predict by methods of temporal differences, Machine Learning, vol.3, 1988, pp.9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 16
    • 0025503558 scopus 로고
    • Backpropagation Through Time: What It Does and How to Do It
    • P.J. Werbos, Backpropagation Through Time: What It Does and How to Do It, Proceedings of the IEEE, vol. 78, No 10, 1990, pp.1550-1560.
    • (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1550-1560
    • Werbos, P.J.1
  • 17
    • 0015667648 scopus 로고
    • Punish/Reward: Learning with a Critic in Adaptive Threshold Systems
    • B. Widrow et al., Punish/Reward: Learning with a Critic in Adaptive Threshold Systems, IEEE Trans. on SMC, vol. 3, No 5, 1973, pp.455-465.
    • (1973) IEEE Trans. on SMC , vol.3 , Issue.5 , pp. 455-465
    • Widrow, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.