메뉴 건너뛰기




Volumn , Issue , 2013, Pages

Adaptive optimal control of partially-unknown constrained-input systems using policy iteration with experience replay

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE OPTIMAL CONTROL; FEEDBACK CONTROL LAW; NEAR-OPTIMAL CONTROL; ONLINE LEARNING ALGORITHMS; OPTIMAL CONTROL PROBLEM; OPTIMAL CONTROL SOLUTION; PERSISTENCE OF EXCITATION; POLICY ITERATION ALGORITHMS;

EID: 84883680649     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (10)

References (23)
  • 6
    • 0003785722 scopus 로고
    • Ph. D. Dissertation, Electrical Engineering Dep., Rensselaer Polytech Ins., Troy, New York
    • Beard, R. W., "Improving the Closed-loop Performance of Nonlinear Systems, " Ph. D. Dissertation, Electrical Engineering Dep., Rensselaer Polytech Ins., Troy, New York, 1995.
    • (1995) Improving the Closed-loop Performance of Nonlinear Systems
    • Beard, R.W.1
  • 7
    • 14844340822 scopus 로고    scopus 로고
    • Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach
    • Abu-Khalaf, M., and Lewis, F. L., "Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach, " Automatica, Vol. 41, 2005, pp. 779, 791.
    • (2005) Automatica , vol.41
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 8
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement Learning in Continuous-time and Space
    • Doya, K., "Reinforcement Learning in Continuous-time and Space, " Neural Computation, Vol. 12, No. 1, 2000, pp. 219, 245.
    • (2000) Neural Computation , vol.12 , Issue.1
    • Doya, K.1
  • 9
    • 77950630017 scopus 로고    scopus 로고
    • Online Actor-critic Algorithm to Solve the Continuous Infinite-time Horizon Optimal Control Problem
    • Vamvoudakis, K., and Lewis, F. L., "Online Actor-critic Algorithm to Solve the Continuous Infinite-time Horizon Optimal Control Problem, " Automatica, Vol. 46, 2010, pp. 878, 888.
    • (2010) Automatica , vol.46
    • Vamvoudakis, K.1    Lewis, F.L.2
  • 11
    • 67349145396 scopus 로고    scopus 로고
    • Neural Network Approach to Continuous-time Direct Adaptive Optimal Control for Partially Unknown Nonlinear Systems
    • Vrabie, D., and Lewis, F. L., "Neural Network Approach to Continuous-time Direct Adaptive Optimal Control for Partially Unknown Nonlinear Systems, " Neural Netw., Vol. 22, 2009, pp. 237, 246.
    • (2009) Neural Netw , vol.22
    • Vrabie, D.1    Lewis, F.L.2
  • 13
    • 58349110975 scopus 로고    scopus 로고
    • Adaptive Optimal Control for Continuous-time Linear Systems Based on Policy Iteration
    • Vrabie, D., Pastravanu, O., Abu-Khalaf, M., and Lewis, F. L., "Adaptive Optimal Control for Continuous-time Linear Systems Based on Policy Iteration, " Automatica, Vol. 45, No. 2, 2009, pp. 477, 484.
    • (2009) Automatica , vol.45 , Issue.2
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.L.4
  • 14
    • 0031143730 scopus 로고    scopus 로고
    • An Analysis of Temporal-Difference Learning with Function Approximation
    • Tsitsiklis, J. N., and Van Roy, B., "An Analysis of Temporal-Difference Learning with Function Approximation, " IEEE Trans. Automatic Control, Vol. 42, 1997, pp. 674, 690.
    • (1997) IEEE Trans. Automatic Control , vol.42
    • Tsitsiklis, J.N.1    Roy, B.V.2
  • 15
    • 71749106087 scopus 로고    scopus 로고
    • Real-time Reinforcement Learning by Sequential Actor-critics and Experience Replay
    • Wawrzynski, P., "Real-time Reinforcement Learning by Sequential Actor-critics and Experience Replay. " Neural Netw., Vol. 22, 2009, pp. 1484, 1497.
    • (2009) Neural Netw , vol.22
    • Wawrzynski, P.1
  • 16
    • 56749173285 scopus 로고    scopus 로고
    • Efficient Experience Reuse in Non-Markovian Environments
    • Control Inf. Technol., Tokyo, Japan
    • Dung, L. T., Komeda, T., and Takagi, M., "Efficient Experience Reuse in Non-Markovian Environments. " Proceeding of the Internatinal Conference Instrum, Control Inf. Technol., Tokyo, Japan, 2008, pp. 3327-3332.
    • (2008) Proceeding of the Internatinal Conference Instrum , pp. 3327-3332
    • Dung, L.T.1    Komeda, T.2    Takagi, M.3
  • 18
    • 0000123778 scopus 로고
    • Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching
    • Lin, L. J., "Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching. " Machine Learning, Vol. 8, 1992, pp. 293, 321.
    • (1992) Machine Learning , vol.8
    • Lin, L.J.1
  • 21
    • 84883670357 scopus 로고    scopus 로고
    • Concurrent Learning for Convergence in Adaptive Control without
    • Atlanta GA
    • Chowdhary, G. V., and Johnson, E., "Concurrent Learning for Convergence in Adaptive Control without, " IEEE CDC, Atlanta GA, 2010, pp. 3675-3679.
    • (2010) IEEE CDC , pp. 3675-3679
    • Chowdhary, G.V.1    Johnson, E.2
  • 22
    • 0030392685 scopus 로고    scopus 로고
    • Constrained optimization and control of nonlinear systems: New results in optimal control
    • Lyshevski, S. E., "Constrained optimization and control of nonlinear systems: New results in optimal control, " Proceeding of the IEEE Conference Decision and Control, 1996, pp. 541-546.
    • (1996) Proceeding of the IEEE Conference Decision and Control , pp. 541-546
    • Lyshevski, S.E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.