-
1
-
-
84921399937
-
-
Wiley-IEEE New York
-
Si, J., Barto, A.G., Powell, W.B., Wunsch, D. (eds.): Handbook of Learning and Approximate Dynamic Programming. Wiley-IEEE, New York (2004)
-
(2004)
Handbook of Learning and Approximate Dynamic Programming
-
-
Si, J.1
Barto, A.G.2
Powell, W.B.3
Wunsch, D.4
-
2
-
-
85012688561
-
-
Princeton University Press Princeton
-
Bellman, R.: Dynamic Programming. Princeton University Press, Princeton (1957)
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
5
-
-
33847202724
-
Learning to predict by the methods of temporal difference
-
R.S. Sutton 1988 Learning to predict by the methods of temporal difference Mach. Learn. 3 9 44
-
(1988)
Mach. Learn.
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
6
-
-
0020970738
-
Neuron like adaptive elements that can solve difficult learning control problems
-
A.G. Barto R.S. Sutton C.W. Anderson 1983 Neuron like adaptive elements that can solve difficult learning control problems IEEE Trans. Syst. Man, Cybern. 13 834 847
-
(1983)
IEEE Trans. Syst. Man, Cybern.
, vol.13
, pp. 834-847
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
-
7
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
R.S. Sutton 1996 Reinforcement learning with replacing eligibility traces Mach. Learn. 22 1 123 158 (Pubitemid 126724365)
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 123-158
-
-
Singh, S.P.1
Sutton, R.S.2
-
8
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program achieves master-level play
-
G. Tesauro 1994 TD-Gammon, a self-teaching backgammon program achieves master-level play Neural Comput. 6 215 219
-
(1994)
Neural Comput.
, vol.6
, pp. 215-219
-
-
Tesauro, G.1
-
9
-
-
0002557583
-
Advanced forecasting methods for global crisis warning and models of intelligence
-
P.J. Werbos 1977 Advanced forecasting methods for global crisis warning and models of intelligence Gen. Syst. Yearb. 22 25 38
-
(1977)
Gen. Syst. Yearb.
, vol.22
, pp. 25-38
-
-
Werbos, P.J.1
-
10
-
-
0002011091
-
A menu of design for reinforcement learning over time
-
MIT Cambridge
-
Werbos, P.J.: A menu of design for reinforcement learning over time. In: Miller, W.T., III, Sutton, R.S., Werbos, P.J. (eds.) Neural Networks for Control, ch. 3, pp. 67-95. MIT, Cambridge (1990)
-
(1990)
Neural Networks for Control, Ch. 3
, pp. 67-95
-
-
Werbos, P.J.1
Miller Iii, W.T.2
Sutton, R.S.3
Werbos, P.J.4
-
11
-
-
0002437599
-
Neuro-control and supervised learning: An overview and valuation
-
Van Nostrand New York
-
Werbos, P.J.: Neuro-control and supervised learning: an overview and valuation. In: White, D., Sofge, D. (eds.) Handbook of Intelligent Control, pp. 65-89. Van Nostrand, New York (1992)
-
(1992)
Handbook of Intelligent Control
, pp. 65-89
-
-
Werbos, P.J.1
White, D.2
Sofge, D.3
-
12
-
-
0002031779
-
Approximate dynamic programming for real-time control and neural modeling
-
Van Nostrand New York
-
Werbos, P.J.: Approximate dynamic programming for real-time control and neural modeling. In: White, D., Sofge, D. (eds.) Handbook of Intelligent Control, pp. 493-525. Van Nostrand, New York (1992)
-
(1992)
Handbook of Intelligent Control
, pp. 493-525
-
-
Werbos, P.J.1
White, D.2
Sofge, D.3
-
13
-
-
0029592634
-
Adaptive critic designs: A case study for neurocontrol
-
DOI 10.1016/0893-6080(95)00042-9
-
D.V. Prokhorov R.A. Santiago D.C. Wunsch 1995 Adaptive critic designs: a case study for neurocontrol Neural Netw. 8 9 1367 1372 (Pubitemid 26072896)
-
(1995)
Neural Networks
, vol.8
, Issue.9
, pp. 1367-1372
-
-
Prokhorov, D.V.1
Santiago, R.A.2
Wunsch II, D.C.3
-
15
-
-
0035273403
-
On-line learning control by association and reinforcement
-
DOI 10.1109/72.914523, PII S1045922701014047
-
J. Si Y. Wang 2001 Online learning control by association and reinforcement IEEE Trans. Neural Netw. 12 2 264 276 (Pubitemid 32371483)
-
(2001)
IEEE Transactions on Neural Networks
, vol.12
, Issue.2
, pp. 264-276
-
-
Si, J.1
Wang, Y.-T.2
-
16
-
-
0036157443
-
Apache helicopter stabilization using neural dynamic programming
-
R. Enns J. Si 2002 Apache helicopter stabilization using neural dynamic programming AIAA J. Guid. Control Dyn. 25 1 19 25 (Pubitemid 34109509)
-
(2002)
Journal of Guidance, Control, and Dynamics
, vol.25
, Issue.1
, pp. 19-25
-
-
Enns, R.1
Si, J.2
-
17
-
-
0043026775
-
Helicopter trimming and tracking control using direct neural dynamic programming
-
R. Enns J. Si 2003 Helicopter trimming and tracking control using direct neural dynamic programming IEEE Trans. Neural Netw. 14 4 929 939
-
(2003)
IEEE Trans. Neural Netw.
, vol.14
, Issue.4
, pp. 929-939
-
-
Enns, R.1
Si, J.2
-
18
-
-
0042767744
-
Helicopter flight-control reconfiguration for main rotor actuator failures
-
R. Enns J. Si 2003 Helicopter flight-control reconfiguration for main rotor actuator failures AIAA J. Guid. Control Dyn. 26 4 572 584
-
(2003)
AIAA J. Guid. Control Dyn.
, vol.26
, Issue.4
, pp. 572-584
-
-
Enns, R.1
Si, J.2
-
20
-
-
0031672813
-
Nonlinear optimal control of a triple link inverted pendulum with single control input
-
K.D. Eltohamy C.Y. Kuo 1998 Nonlinear optimal control of a triple link inverted pendulum with single control input Int. J. Control 69 2 239 256
-
(1998)
Int. J. Control
, vol.69
, Issue.2
, pp. 239-256
-
-
Eltohamy, K.D.1
Kuo, C.Y.2
-
21
-
-
0007908166
-
Experiments with reinforcement learning in problems with continuous state and action spaces
-
University of Massachussetts, Amherst
-
Santamaria, J.C., Sutton, R.S., Ram, A.: Experiments with reinforcement learning in problems with continuous state and action spaces. COINS Technical Report 96-88, University of Massachussetts, Amherst (1996)
-
(1996)
COINS Technical Report 96-88
-
-
Santamaria, J.C.1
Sutton, R.S.2
Ram, A.3
|