-
1
-
-
0035273403
-
Online learning control by association and reinforcement
-
J. Si, and Y. Wang, "Online learning control by association and reinforcement," IEEE Transactions on Neural Networks, vol. 12, no. 2, pp. 264-276, 2001.
-
(2001)
IEEE Transactions on Neural Networks
, vol.12
, Issue.2
, pp. 264-276
-
-
Si, J.1
Wang, Y.2
-
2
-
-
0344666440
-
Analyzing and enhancing direct NDP designs using a control-theoretic approach
-
L. Yang, J. Si, K. Tsakalis, and A. Rodriguez, "Analyzing and enhancing direct NDP designs using a control-theoretic approach," IEEE International Symposium on Intelligent Control, pp. 529-532, 2003.
-
(2003)
IEEE International Symposium on Intelligent Control
, pp. 529-532
-
-
Yang, L.1
Si, J.2
Tsakalis, K.3
Rodriguez, A.4
-
3
-
-
0020970738
-
Neuron like adaptive elements that can solve difficult learning control problems
-
A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuron like adaptive elements that can solve difficult learning control problems," IEEE Transactions On Systems, Man, and Cybernetics, vol. 13, pp. 834-847, 1983.
-
(1983)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.13
, pp. 834-847
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
4
-
-
0002557583
-
Advanced forecasting methods for global crisis warning and models of intelligence
-
P. Werbos, "Advanced forecasting methods for global crisis warning and models of intelligence," General System Yearbook, vol. 22, pp. 25-38, 1977.
-
(1977)
General System Yearbook
, vol.22
, pp. 25-38
-
-
Werbos, P.1
-
5
-
-
0002011091
-
A menu of design for reinforcement learning over time
-
W. T. Miller III, R. S. Sutton, & P. J. Werbos (Eds.), MIT Press
-
P. Werbos, "A menu of design for reinforcement learning over time," Neural Networks for Control, in W. T. Miller III, R. S. Sutton, & P. J. Werbos (Eds.), MIT Press, pp. 67-95, 1990.
-
(1990)
Neural Networks for Control
, pp. 67-95
-
-
Werbos, P.1
-
6
-
-
0002437599
-
Neuro-control and supervised learning: An overview and valuation
-
D. White, & D. Sofge (Eds.), Van Nostrand Reinhold
-
P. Werbos, "Neuro-control and supervised learning: An overview and valuation," Handbook of Intelligent Control, in D. White, & D. Sofge (Eds.), Van Nostrand Reinhold, pp. 65-89, 1992.
-
(1992)
Handbook of Intelligent Control
, pp. 65-89
-
-
Werbos, P.1
-
7
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
D. White, & D. Sofge (Eds.), Van Nostrand Reinhold
-
P. Werbos, "Approximate dynamic programming for real-time control and neural modeling," Handbook of Intelligent Control, in D. White, & D. Sofge (Eds.), Van Nostrand Reinhold, pp. 493-525, 1992.
-
(1992)
Handbook of Intelligent Control
, pp. 493-525
-
-
Werbos, P.1
-
10
-
-
0028497630
-
Asynchronous stochastic approximation and Q-learning
-
J. N. Tsitsiklis, "Asynchronous Stochastic Approximation and Q-learning," Machine Learning, vol. 16, no. 3, pp. 185-202, 1994.
-
(1994)
Machine Learning
, vol.16
, Issue.3
, pp. 185-202
-
-
Tsitsiklis, J.N.1
-
11
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
12
-
-
0028584964
-
Adaptive linear quadratic control using policy iteration
-
S. J. Bradtke, B. E. Ydstie, and A. G. Barto, "Adaptive linear quadratic control using policy iteration," Proceedings of American Control Converence, pp. 3475-3479, 1994.
-
(1994)
Proceedings of American Control Converence
, pp. 3475-3479
-
-
Bradtke, S.J.1
Ydstie, B.E.2
Barto, A.G.3
|