-
2
-
-
0028584964
-
Adaptive linear quadratic control using policy iteration
-
Baltmore, Myrland, June
-
Bradtke S. J., B. E. Ydestie, A. G. Barto, "Adaptive linear quadratic control using policy iteration", Proceedings of the American Control Conference, pp. 3475-3476, Baltmore, Myrland, June, 1994.
-
(1994)
Proceedings of the American Control Conference
, pp. 3475-3476
-
-
Bradtke, S.J.1
Ydestie, B.E.2
Barto, A.G.3
-
3
-
-
0018011435
-
Kronecker Products and Matrix Calculus in System Theory
-
Brewer J. W., "Kronecker Products and Matrix Calculus in System Theory", IEEE Trans. on Circuit and System, Vol. CAS-25, No. 9, 1978.
-
(1978)
IEEE Trans. on Circuit and System
, vol.CAS-25
, Issue.9
-
-
Brewer, J.W.1
-
5
-
-
0033629916
-
Reinforcement Learning in Continuous Time and Space
-
MIT Press
-
Doya, K., "Reinforcement Learning in Continuous Time and Space," Neural Computation, vol. 12, pp. 219-245, MIT Press, 2000.
-
(2000)
Neural Computation
, vol.12
, pp. 219-245
-
-
Doya, K.1
-
6
-
-
0036060633
-
An Adaptive Critic Global Controller
-
Anchorage, AK
-
Ferrari, S., R. Stengel, "An Adaptive Critic Global Controller," Proceedings of the American Control Conference, pp. 2665-2670, Anchorage, AK, 2002.
-
(2002)
Proceedings of the American Control Conference
, pp. 2665-2670
-
-
Ferrari, S.1
Stengel, R.2
-
7
-
-
84914965022
-
On an Iterative Technique for Riccati Equation Computations
-
February
-
Kleinman D., "On an Iterative Technique for Riccati Equation Computations", IEEE Trans. on Automatic Control, February, 1968.
-
(1968)
IEEE Trans. on Automatic Control
-
-
Kleinman, D.1
-
10
-
-
0034548295
-
Convergence Analysis of Adaptive Critic Based Optimal Control
-
Chicago, IL
-
Liu, X., S. N. Balakrishnan, "Convergence Analysis of Adaptive Critic Based Optimal Control", Proceedings of the American Control Conference, pp. 1929-1933, Chicago, IL, 2000.
-
(2000)
Proceedings of the American Control Conference
, pp. 1929-1933
-
-
Liu, X.1
Balakrishnan, S.N.2
-
11
-
-
0036588686
-
Adaptive Dynamic Programming
-
Murray J. J., C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive Dynamic Programming", IEEE Trans. on Systems, Man and Cybernetics, Vol. 32, No. 2, pp 140-153, 2002.
-
(2002)
IEEE Trans. on Systems, Man and Cybernetics
, vol.32
, Issue.2
, pp. 140-153
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
12
-
-
84921399937
-
-
John Wiley, New Jersey
-
Si J., A. Barto, W. Powel, D. Wunch, Handbook of Learning and Approximate Dynamic Programming, John Wiley, New Jersey, 2004.
-
(2004)
Handbook of Learning and Approximate Dynamic Programming
-
-
Si, J.1
Barto, A.2
Powel, W.3
Wunch, D.4
-
13
-
-
33847202724
-
Learning to predict by the method of temporal differences
-
Sutton, R., "Learning to predict by the method of temporal differences," Machine Learning, 3:9-44, 1988.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.1
-
14
-
-
0004049893
-
-
Ph.D. Thesis, Cambridge University, Cambridge, England
-
Watkins, C., Learning from Delayed Rewards, Ph.D. Thesis, Cambridge University, Cambridge, England, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.1
-
16
-
-
0003544743
-
-
White, D, D. Sofge, Eds, New York: Van Nostrand
-
White, D., D. Sofge, Eds., Handbook of Intelligent Control, Neural, Fuzzy, and, Adaptive Approaches, New York: Van Nostrand, 1992.
-
(1992)
Handbook of Intelligent Control, Neural, Fuzzy, and, Adaptive Approaches
-
-
-
17
-
-
0027148081
-
Robust load-frequency controller design for power systems
-
Wang, Y., R. Zhou, C. Wen, "Robust load-frequency controller design for power systems", IEE Proc.-C, Vol. 140, No. 1, 1993.
-
(1993)
IEE Proc.-C
, vol.140
, Issue.1
-
-
Wang, Y.1
Zhou, R.2
Wen, C.3
|