-
2
-
-
0004102479
-
-
Cambridge, MA: MIT Press
-
Sutton, R. S., and Barto, A. G., Reinforcement learning-an introduction, Cambridge, MA: MIT Press, 1998.
-
(1998)
Reinforcement learning-an introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
6
-
-
0003785722
-
-
Ph. D. Dissertation, Electrical Engineering Dep., Rensselaer Polytech Ins., Troy, New York
-
Beard, R. W., "Improving the Closed-loop Performance of Nonlinear Systems, " Ph. D. Dissertation, Electrical Engineering Dep., Rensselaer Polytech Ins., Troy, New York, 1995.
-
(1995)
Improving the Closed-loop Performance of Nonlinear Systems
-
-
Beard, R.W.1
-
7
-
-
14844340822
-
Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach
-
Abu-Khalaf, M., and Lewis, F. L., "Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach, " Automatica, Vol. 41, 2005, pp. 779, 791.
-
(2005)
Automatica
, vol.41
-
-
Abu-Khalaf, M.1
Lewis, F.L.2
-
8
-
-
0033629916
-
Reinforcement Learning in Continuous-time and Space
-
Doya, K., "Reinforcement Learning in Continuous-time and Space, " Neural Computation, Vol. 12, No. 1, 2000, pp. 219, 245.
-
(2000)
Neural Computation
, vol.12
, Issue.1
-
-
Doya, K.1
-
9
-
-
77950630017
-
Online Actor-critic Algorithm to Solve the Continuous Infinite-time Horizon Optimal Control Problem
-
Vamvoudakis, K., and Lewis, F. L., "Online Actor-critic Algorithm to Solve the Continuous Infinite-time Horizon Optimal Control Problem, " Automatica, Vol. 46, 2010, pp. 878, 888.
-
(2010)
Automatica
, vol.46
-
-
Vamvoudakis, K.1
Lewis, F.L.2
-
10
-
-
0036588686
-
Adaptive Dynamic Programming
-
Murray, J. J., Cox, C. J., Lendaris, G. G., and Saeks, R., "Adaptive Dynamic Programming, " IEEE Trans. Syst., Man, Cybern., Part C: Appl. Rev., Vol. 32, No. 2, 2002, pp. 140, 153.
-
(2002)
IEEE Trans. Syst. Man, Cybern., Part C: Appl. Rev
, vol.32
, Issue.2
-
-
Murray, J.J.1
Cox, C.J.2
Lendaris, G.G.3
Saeks, R.4
-
11
-
-
67349145396
-
Neural Network Approach to Continuous-time Direct Adaptive Optimal Control for Partially Unknown Nonlinear Systems
-
Vrabie, D., and Lewis, F. L., "Neural Network Approach to Continuous-time Direct Adaptive Optimal Control for Partially Unknown Nonlinear Systems, " Neural Netw., Vol. 22, 2009, pp. 237, 246.
-
(2009)
Neural Netw
, vol.22
-
-
Vrabie, D.1
Lewis, F.L.2
-
13
-
-
58349110975
-
Adaptive Optimal Control for Continuous-time Linear Systems Based on Policy Iteration
-
Vrabie, D., Pastravanu, O., Abu-Khalaf, M., and Lewis, F. L., "Adaptive Optimal Control for Continuous-time Linear Systems Based on Policy Iteration, " Automatica, Vol. 45, No. 2, 2009, pp. 477, 484.
-
(2009)
Automatica
, vol.45
, Issue.2
-
-
Vrabie, D.1
Pastravanu, O.2
Abu-Khalaf, M.3
Lewis, F.L.4
-
14
-
-
0031143730
-
An Analysis of Temporal-Difference Learning with Function Approximation
-
Tsitsiklis, J. N., and Van Roy, B., "An Analysis of Temporal-Difference Learning with Function Approximation, " IEEE Trans. Automatic Control, Vol. 42, 1997, pp. 674, 690.
-
(1997)
IEEE Trans. Automatic Control
, vol.42
-
-
Tsitsiklis, J.N.1
Roy, B.V.2
-
15
-
-
71749106087
-
Real-time Reinforcement Learning by Sequential Actor-critics and Experience Replay
-
Wawrzynski, P., "Real-time Reinforcement Learning by Sequential Actor-critics and Experience Replay. " Neural Netw., Vol. 22, 2009, pp. 1484, 1497.
-
(2009)
Neural Netw
, vol.22
-
-
Wawrzynski, P.1
-
16
-
-
56749173285
-
Efficient Experience Reuse in Non-Markovian Environments
-
Control Inf. Technol., Tokyo, Japan
-
Dung, L. T., Komeda, T., and Takagi, M., "Efficient Experience Reuse in Non-Markovian Environments. " Proceeding of the Internatinal Conference Instrum, Control Inf. Technol., Tokyo, Japan, 2008, pp. 3327-3332.
-
(2008)
Proceeding of the Internatinal Conference Instrum
, pp. 3327-3332
-
-
Dung, L.T.1
Komeda, T.2
Takagi, M.3
-
17
-
-
60349130974
-
Batch reinforcement learning in a complex domain
-
Honolulu, HI
-
Kalyanakrishnan, S., and Stone, P., "Batch reinforcement learning in a complex domain. " Proceeding of the 6th Internation Conference on Autoomus Agents and Multi-Agent Systms, Honolulu, HI, pp. 650-657, 2007.
-
(2007)
Proceeding of the 6th Internation Conference on Autoomus Agents and Multi-Agent Systms
, pp. 650-657
-
-
Kalyanakrishnan, S.1
Stone, P.2
-
18
-
-
0000123778
-
Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching
-
Lin, L. J., "Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching. " Machine Learning, Vol. 8, 1992, pp. 293, 321.
-
(1992)
Machine Learning
, vol.8
-
-
Lin, L.J.1
-
19
-
-
84857501996
-
Experience Replay for Real-Time Reinforcement Learning Control
-
Adam, S., Busoniu, L., and Babuska, R., "Experience Replay for Real-Time Reinforcement Learning Control. " IEEE Trans. Syst. Man, Cybern., Part C: Appl. Rev., Vol. 42, 2012, pp. 201, 212.
-
(2012)
IEEE Trans. Syst. Man Cybern., Part C: Appl. Rev
, vol.42
-
-
Adam, S.1
Busoniu, L.2
Babuska, R.3
-
21
-
-
84883670357
-
Concurrent Learning for Convergence in Adaptive Control without
-
Atlanta GA
-
Chowdhary, G. V., and Johnson, E., "Concurrent Learning for Convergence in Adaptive Control without, " IEEE CDC, Atlanta GA, 2010, pp. 3675-3679.
-
(2010)
IEEE CDC
, pp. 3675-3679
-
-
Chowdhary, G.V.1
Johnson, E.2
-
22
-
-
0030392685
-
Constrained optimization and control of nonlinear systems: New results in optimal control
-
Lyshevski, S. E., "Constrained optimization and control of nonlinear systems: New results in optimal control, " Proceeding of the IEEE Conference Decision and Control, 1996, pp. 541-546.
-
(1996)
Proceeding of the IEEE Conference Decision and Control
, pp. 541-546
-
-
Lyshevski, S.E.1
|