-
1
-
-
0020970738
-
Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems
-
A.G. Barto, R.S. Sutton, C.W. Anderson, Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems, IEEE Trans. on Systems, Man and Cybernetics, vol. 13, No 5, 1983, pp. 834-846.
-
(1983)
IEEE Trans. on Systems, Man and Cybernetics
, vol.13
, Issue.5
, pp. 834-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
2
-
-
85012688561
-
-
Princeton, NJ: Princeton Univ. Press
-
R.E. Bellman, Dynamic Programming, Princeton, NJ: Princeton Univ. Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
3
-
-
0003487482
-
-
Athena Scientific, Belmont, MA
-
D.P. Bertsekas, J.N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, Belmont, MA, 1996.
-
(1996)
Neuro-Dynamic Programming
-
-
Bertsekas, D.P.1
Tsitsiklis, J.N.2
-
4
-
-
33749833931
-
Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the "echo state network" approach
-
German National Research Center for Information Technology
-
H. Jaeger, Tutorial on training recurrent neural networks, covering BPPT, RTRL, EKF and the "echo state network" approach, GMD Report 159, German National Research Center for Information Technology, 2002, p.48.
-
(2002)
GMD Report 159
, pp. 48
-
-
Jaeger, H.1
-
5
-
-
78349289898
-
Adaptive nonlinear system identification with echo state networks
-
MIT Press, Cambridge, MA
-
H. Jaeger, Adaptive nonlinear system identification with echo state networks, In: Advances in Neural Information Processing Systems 15 (NIPS 2002), MIT Press, Cambridge, MA, 2003, pp.593-600.
-
(2003)
Advances in Neural Information Processing Systems 15 (NIPS 2002)
, pp. 593-600
-
-
Jaeger, H.1
-
6
-
-
68649088777
-
Reservoir computing approaches to recurrent neural network training
-
M. Lukosevicius, H. Jaeger, Reservoir computing approaches to recurrent neural network training, Computer Science Review, vol.3, 2009, pp.127-149.
-
(2009)
Computer Science Review
, vol.3
, pp. 127-149
-
-
Lukosevicius, M.1
Jaeger, H.2
-
7
-
-
78751559967
-
Neural techniques in control
-
Neural Networks for Instrumentation, Measurement and Related Industrial Applications, Edited by S. Ablameyko, L. Goras, M. Gori and V. Piuri. IOS Press, Amsterdam
-
A. Pacut, Neural techniques in control, In: Neural Networks for Instrumentation, Measurement and Related Industrial Applications, Edited by S. Ablameyko, L. Goras, M. Gori and V. Piuri. NATO Science Series vol. 185, IOS Press, Amsterdam, 2003, pp.78-118.
-
(2003)
NATO Science Series
, vol.185
, pp. 78-118
-
-
Pacut, A.1
-
10
-
-
34547133026
-
Training recurrent neurocontrollers for real-time applications
-
D. Prokhorov, Training recurrent neurocontrollers for real-time applications, IEEE Trans. on Neural Networks, vol.18, No 4, 2007, pp.1003-1015.
-
(2007)
IEEE Trans. on Neural Networks
, vol.18
, Issue.4
, pp. 1003-1015
-
-
Prokhorov, D.1
-
11
-
-
33750112286
-
Echo state networks: Appeal and challenges
-
D. Prokhorov, Echo state networks: appeal and challenges, Proc. of Int. Joint Conf. on Neural Networks (IJCNN), Montreal, Canada, August 2005, pp.1463-1466.
-
Proc. of Int. Joint Conf. on Neural Networks (IJCNN), Montreal, Canada, August 2005
, pp. 1463-1466
-
-
Prokhorov, D.1
-
12
-
-
40649085253
-
Improving reservoirs using intrinsic plasticity
-
B. Schrauwen, M. Wardermann, D. Verstraeten, J.J. Steil, Improving reservoirs using intrinsic plasticity, Neurocomputing, vol. 71, 2008, pp.1159-1171.
-
(2008)
Neurocomputing
, vol.71
, pp. 1159-1171
-
-
Schrauwen, B.1
Wardermann, M.2
Verstraeten, D.3
Steil, J.J.4
-
13
-
-
0035273403
-
On-line learning control by association and reinforcement
-
J. Si, Y.-T. Wang, On-line learning control by association and reinforcement, IEEE Trans. on Neural Networks, vol.12, No 2, 2001, pp.264-276.
-
(2001)
IEEE Trans. on Neural Networks
, vol.12
, Issue.2
, pp. 264-276
-
-
Si, J.1
Wang, Y.-T.2
-
14
-
-
33847202724
-
Learning to predict by methods of temporal differences
-
R.S. Sutton, Learning to predict by methods of temporal differences, Machine Learning, vol.3, 1988, pp.9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
15
-
-
34249833101
-
Q-learning
-
C. Watkins, P. Dayan, Q-learning, Machine Learning, vol. 8, 1992, pp.279-292.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
16
-
-
0025503558
-
Backpropagation Through Time: What It Does and How to Do It
-
P.J. Werbos, Backpropagation Through Time: What It Does and How to Do It, Proceedings of the IEEE, vol. 78, No 10, 1990, pp.1550-1560.
-
(1990)
Proceedings of the IEEE
, vol.78
, Issue.10
, pp. 1550-1560
-
-
Werbos, P.J.1
-
17
-
-
0015667648
-
Punish/Reward: Learning with a Critic in Adaptive Threshold Systems
-
B. Widrow et al., Punish/Reward: Learning with a Critic in Adaptive Threshold Systems, IEEE Trans. on SMC, vol. 3, No 5, 1973, pp.455-465.
-
(1973)
IEEE Trans. on SMC
, vol.3
, Issue.5
, pp. 455-465
-
-
Widrow, B.1