메뉴 건너뛰기




Volumn , Issue , 2009, Pages 3180-3187

Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem

Author keywords

[No Author keywords available]

Indexed keywords

ACTOR-CRITIC ALGORITHM; ACTOR-NETWORK; CLOSED-LOOP; CONTINUOUS TIME; DYNAMICAL STABILITY; INFINITE HORIZONS; ON-LINE ADAPTIVE ALGORITHMS; ON-LINE ALGORITHMS; OPTIMAL CONTROL PROBLEM; OPTIMAL CONTROL SOLUTION; OPTIMAL CONTROLLER; OPTIMAL VALUE FUNCTIONS; PERSISTENCE OF EXCITATION; POLICY ITERATION; SIMULATION EXAMPLE; TUNING ALGORITHM;

EID: 70449382072     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IJCNN.2009.5178586     Document Type: Conference Paper
Times cited : (47)

References (21)
  • 1
    • 14844340822 scopus 로고    scopus 로고
    • Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach
    • M. Abu-Khalaf, F. L. Lewis, "Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach", Automatica, vol. 41, no. 5, pp. 779-791, 2005.
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 2
    • 0028733775 scopus 로고
    • Reinforcement Learning in Continuous Time: Advantage Updating
    • Orlando FL
    • L. C. Baird III, "Reinforcement Learning in Continuous Time: Advantage Updating", Proc. Of ICNN, Orlando FL, vol. 4, pp. 2448- 2453, 1994.
    • (1994) Proc. Of ICNN , vol.4 , pp. 2448-2453
    • Baird III, L.C.1
  • 3
    • 0031332446 scopus 로고    scopus 로고
    • Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
    • R. Beard, G. Saridis, J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation", Automatica, vol. 33, no. 12, pp. 2159-2177, 1997.
    • (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
    • Beard, R.1    Saridis, G.2    Wen, J.3
  • 5
    • 0034848079 scopus 로고    scopus 로고
    • Successive Collocation: An Approximation to Optimal Nonlinear Control
    • J. W. Curtis, R. W. Beard, "Successive Collocation: An Approximation to Optimal Nonlinear Control", IEEE Proc. ACC01, vol. 5, pp. 3481-3485, 2001.
    • (2001) IEEE Proc. ACC01 , vol.5 , pp. 3481-3485
    • Curtis, J.W.1    Beard, R.W.2
  • 6
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement Learning In Continuous Time and Space
    • K. Doya, "Reinforcement Learning In Continuous Time and Space", Neural Computation, 12 (1), pp. 219-245, 2000.
    • (2000) Neural Computation , vol.12 , Issue.1 , pp. 219-245
    • Doya, K.1
  • 9
    • 84914965022 scopus 로고
    • On an Iterative Technique for Riccati Equation Computations
    • February
    • D. Kleinman, "On an Iterative Technique for Riccati Equation Computations", IEEE Trans, on Automatic Control, vol. 13, pp. 114- 115, February, 1968.
    • (1968) IEEE Trans, on Automatic Control , vol.13 , pp. 114-115
    • Kleinman, D.1
  • 11
    • 0029304635 scopus 로고
    • Neural Net Controller with Guaranteed Tracking Performance
    • F. L. Lewis, K. Liu, and A. Yesildirek, "Neural Net Controller with Guaranteed Tracking Performance", IEEE Transactions on Neural Networks, vol. 6, no. 3, pp. 703-715, 1995.
    • (1995) IEEE Transactions on Neural Networks , vol.6 , Issue.3 , pp. 703-715
    • Lewis, F.L.1    Liu, K.2    Yesildirek, A.3
  • 17
    • 63049136575 scopus 로고    scopus 로고
    • Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration
    • D. Vrabie, F. Lewis, "Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration", IEEE Proc. CDC08, pp. 73-79, 2008
    • (2008) IEEE Proc. CDC08 , pp. 73-79
    • Vrabie, D.1    Lewis, F.2
  • 18
    • 85060504479 scopus 로고    scopus 로고
    • Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration
    • to appear
    • D. Vrabie, O. Pastravanu, F. Lewis, M. Abu-Khalaf, "Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration", Automatica (to appear)
    • Automatica
    • Vrabie, D.1    Pastravanu, O.2    Lewis, F.3    Abu-Khalaf, M.4
  • 20
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling,
    • ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling, " Handbook of Intelligent Control, ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold, 1992.
    • (1992) Handbook of Intelligent Control
    • Werbos, P.J.1
  • 21
    • 0024888479 scopus 로고
    • Neural networks for control and system identification
    • P. Werbos, "Neural networks for control and system identification", IEEE Proc. CDC89, vol. 1, pp. 260-265, 1989.
    • (1989) IEEE Proc. CDC89 , vol.1 , pp. 260-265
    • Werbos, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.