Volume , Issue , 2013, Pages 6256-6261

Concurrent learning-based approximate optimal regulation

Author keywords

[No Author keywords available]

Indexed keywords

ONLINE SYSTEMS; OPTIMAL CONTROL SYSTEMS;

EID: 84902313433     PISSN: 07431546     EISSN: 25762370     Source Type: Conference Proceeding    
DOI: 10.1109/CDC.2013.6760878     Document Type: Conference Paper
Times cited : (27)

References (23)
  • 1
    • 84871319455
    • A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
    • S. Bhasin, R. Kamalapurkar, M. Johnson, K. Vamvoudakis, F. L. Lewis, and W. Dixon, "A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems," Automatica, vol. 49, no. 1, pp. 89-92, 2013.
    • (2013) Automatica , vol.49 , Issue.1 , pp. 89-92
    • Bhasin, S.1    Kamalapurkar, R.2    Johnson, M.3    Vamvoudakis, K.4    Lewis, F.L.5    Dixon, W.6
  • 2
  • 3
    • 77950630017
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • K. Vamvoudakis and F. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, pp. 878-888, 2010.
    • (2010) Automatica , vol.46 , pp. 878-888
    • Vamvoudakis, K.1    Lewis, F.2
  • 5
    • 67349145396
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • D. Vrabie and F. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, 2009.
    • (2009) Neural Netw. , vol.22 , Issue.3 , pp. 237-246
    • Vrabie, D.1    Lewis, F.2
  • 6
    • 68149180889
    • Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
    • T. Dierks, B. Thumati, and S. Jagannathan, "Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence," Neural Netw., vol. 22, no. 5-6, pp. 851-860, 2009.
    • (2009) Neural Netw. , vol.22 , Issue.5-6 , pp. 851-860
    • Dierks, T.1    Thumati, B.2    Jagannathan, S.3
  • 7
    • 77950853735
    • Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics
    • T. Dierks and S. Jagannathan, "Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics," in Proc. IEEE Conf. Decis. Control, 2009, pp. 6750-6755.
    • (2009) Proc. IEEE Conf. Decis. Control , pp. 6750-6755
    • Dierks, T.1    Jagannathan, S.2
  • 8
    • 83655163786
    • Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method
    • H. Zhang, L. Cui, X. Zhang, and Y. Luo, "Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method," IEEE Trans. Neural Netw., vol. 22, no. 12, pp. 2226-2236, 2011.
    • (2011) IEEE Trans. Neural Netw , vol.22 , Issue.12 , pp. 2226-2236
    • Zhang, H.1    Cui, L.2    Zhang, X.3    Luo, Y.4
  • 9
    • 0033629916
    • Reinforcement learning in continuous time and space
    • K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
    • (2000) Neural Comput. , vol.12 , Issue.1 , pp. 219-245
    • Doya, K.1
  • 10
    • 33846781129
    • Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control," Automatica, vol. 43, pp. 473-481, 2007.
    • (2007) Automatica , vol.43 , pp. 473-481
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 11
    • 49049089962
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst. Man Cybern. Part B Cybern., vol. 38, pp. 943-949, 2008.
    • (2008) IEEE Trans. Syst. Man Cybern. Part B Cybern , vol.38 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 12
    • 33751238181
    • A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
    • R. Padhi, N. Unnikrishnan, X. Wang, and S. Balakrishnan, "A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems," Neural Netw., vol. 19, no. 10, pp. 1648-1660, 2006.
    • (2006) Neural Netw. , vol.19 , Issue.10 , pp. 1648-1660
    • Padhi, R.1    Unnikrishnan, N.2    Wang, X.3    Balakrishnan, S.4
  • 13
    • 77950806766
    • Q-learning and Pontryagin's minimum principle
    • Dec
    • P. Mehta and S. Meyn, "Q-learning and Pontryagin's minimum principle," in Proc. IEEE Conf. Decis. Control, Dec. 2009, pp. 3598-3605.
    • (2009) Proc. IEEE Conf. Decis. Control , pp. 3598-3605
    • Mehta, P.1    Meyn, S.2
  • 16
    • 79952472584
    • Theory and flight-test validation of a concurrent-learning adaptive controller
    • March
    • G. Y. Chowdhary and E. N. Johnson, "Theory and flight-test validation of a concurrent-learning adaptive controller," J. Guid. Contr. Dynam., vol. 34, no. 2, pp. 592-607, March 2011.
    • (2011) J. Guid. Contr. Dynam. , vol.34 , Issue.2 , pp. 592-607
    • Chowdhary, G.Y.1    Johnson, E.N.2
  • 19
    • 0027804823
    • Neural net robot controller with guaranteed tracking performance
    • Chicago, Illinois
    • F. Lewis, K. Liu, and A. Yesildirek, "Neural net robot controller with guaranteed tracking performance," in Proc. IEEE Int. Symp. Intell. Control, Chicago, Illinois, 1993, pp. 225-231.
    • (1993) Proc. IEEE Int. Symp. Intell. Control , pp. 225-231
    • Lewis, F.1    Liu, K.2    Yesildirek, A.3
  • 20
    • 0025399567
    • Identification and control of dynamical systems using neural networks
    • K. Narendra and K. Parthasarathy, "Identification and control of dynamical systems using neural networks," IEEE Trans. Neural Networks, vol. 1, no. 1, pp. 4-27, 1990.
    • (1990) IEEE Trans. Neural Networks , vol.1 , Issue.1 , pp. 4-27
    • Narendra, K.1    Parthasarathy, K.2
  • 21
    • 4043069840
    • On actor-critic algorithms
    • V. Konda and J. Tsitsiklis, "On actor-critic algorithms," SIAM J. Contr. Optim., vol. 42, no. 4, pp. 1143-1166, 2004.
    • (2004) SIAM J. Contr. Optim. , vol.42 , Issue.4 , pp. 1143-1166
    • Konda, V.1    Tsitsiklis, J.2
  • 22
    • 77957777969
    • Optimal control of affine nonlinear continuous-time systems
    • T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems," in Proc. Am. Control Conf., 2010, pp. 1568-1573.
    • (2010) Proc. Am. Control Conf. , pp. 1568-1573
    • Dierks, T.1    Jagannathan, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.