-
1
-
-
66449130966
-
Adaptive dynamic programming: An introduction
-
F.-Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Computational Intelligence Magazine, pp. 39- 47, 2009.
-
(2009)
IEEE Computational Intelligence Magazine
, pp. 39-47
-
-
Wang, F.-Y.1
Zhang, H.2
Liu, D.3
-
3
-
-
85012688561
-
-
Princeton NJ, USA: Princeton University Press
-
R. E. Bellman, Dynamic Programming. Princeton, NJ, USA: Princeton University Press, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
4
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
7
-
-
0002031779
-
Approximating dynamic programming for real-time control and neural modeling
-
editors White and Sofge, Chapter 13
-
P. J. Werbos, "Approximating dynamic programming for real-time control and neural modeling." Handbook of Intelligent Control, editors White and Sofge, Chapter 13, pp. 493-525, 1992.
-
(1992)
Handbook of Intelligent Control
, pp. 493-525
-
-
Werbos, P.J.1
-
12
-
-
0000255539
-
Fast exact multiplication by the Hessian
-
B. A. Pearlmutter, "Fast exact multiplication by the Hessian," Neural Computation, vol. 6, no. 1, pp. 147-160, 1994.
-
(1994)
Neural Computation
, vol.6
, Issue.1
, pp. 147-160
-
-
Pearlmutter, B.A.1
-
13
-
-
0008011457
-
Neural networks, system identification, and control in the chemical process industries
-
Chapter 10
-
P. J. Werbos, "Neural networks, system identification, and control in the chemical process industries." Handbook of Intelligent Control, editors White and Sofge, Chapter 10, pp. 283-356, 1992.
-
(1992)
Handbook of Intelligent Control, Editors White and Sofge
, pp. 283-356
-
-
Werbos, P.J.1
-
14
-
-
31844443291
-
Inverted autonomous helicopter flight via reinforcement learning
-
MIT Press
-
A. Y. Ng, H. J. Kim, M. I. Jordan, and S. Sastry, "Inverted autonomous helicopter flight via reinforcement learning," in International Symposium on Experimental Robotics. MIT Press, 2004.
-
(2004)
International Symposium on Experimental Robotics
-
-
Ng, A.Y.1
Kim, H.J.2
Jordan, M.I.3
Sastry, S.4
-
15
-
-
33646384929
-
Policy gradient in continuous time
-
R. Munos, "Policy gradient in continuous time," Journal of Machine Learning Research, vol. 7, pp. 413-427, 2006.
-
(2006)
Journal of Machine Learning Research
, vol.7
, pp. 413-427
-
-
Munos, R.1
-
16
-
-
0033629916
-
Reinforcement learning in continuous time and space
-
K. Doya, "Reinforcement learning in continuous time and space," Neural Computation, vol. 12, no. 1, pp. 219-245, 2000.
-
(2000)
Neural Computation
, vol.12
, Issue.1
, pp. 219-245
-
-
Doya, K.1
-
19
-
-
0037561866
-
Dual heuristic programming excitation neurocontrol for generators in a multimachine power system
-
G. K. Venayagamoorthy and D. C. Wunsch, "Dual heuristic programming excitation neurocontrol for generators in a multimachine power system," IEEE Transactions on Industry Applications, vol. 39, pp. 382- 394, 2003.
-
(2003)
IEEE Transactions on Industry Applications
, vol.39
, pp. 382-394
-
-
Venayagamoorthy, G.K.1
Wunsch, D.C.2
|