메뉴 건너뛰기




Volumn , Issue , 2011, Pages 242-249

Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems

Author keywords

Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; globalized dual heuristic programming; intelligent control; neural dynamic programming; neural networks; optimal control

Indexed keywords

ADAPTIVE CRITIC DESIGNS; ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATE DYNAMIC PROGRAMMING; DUAL HEURISTIC PROGRAMMING; NEURAL DYNAMIC PROGRAMMING; OPTIMAL CONTROLS;

EID: 80052212355     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2011.5967357     Document Type: Conference Paper
Times cited : (13)

References (36)
  • 1
    • 85012688561 scopus 로고
    • Princeton NJ: Princeton University Press
    • R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton University Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 4
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, ch. 13
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, 1992, ch. 13.
    • (1992) Handbook of Intelligent Control
    • Werbos, P.J.1
  • 5
    • 49049091767 scopus 로고    scopus 로고
    • ADP: The key direction for future research in intelligent control and understanding brain intelligence
    • Aug.
    • P. J. Werbos, "ADP: The key direction for future research in intelligent control and understanding brain intelligence," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 898-900, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 898-900
    • Werbos, P.J.1
  • 6
    • 67349247013 scopus 로고    scopus 로고
    • Intelligence in the brain: A theory of how it works and how to build it
    • Apr.
    • P. J. Werbos, "Intelligence in the brain: a theory of how it works and how to build it," Neural Networks, vol. 22, no. 3, pp. 200-212, Apr. 2009.
    • (2009) Neural Networks , vol.22 , Issue.3 , pp. 200-212
    • Werbos, P.J.1
  • 11
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • DOI 10.1109/72.914523, PII S1045922701014047
    • J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001. (Pubitemid 32371483)
    • (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2
  • 13
  • 14
    • 70349116541 scopus 로고    scopus 로고
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • July
    • F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits and Systems Magazine, vol. 9, no. 3, pp. 32-50, July 2009.
    • (2009) IEEE Circuits and Systems Magazine , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 16
    • 56349120789 scopus 로고    scopus 로고
    • E:-adaptive dynamic programming for discrete-time systems
    • Hong Kong, June
    • D. Liu and N. Jin, "E:-adaptive dynamic programming for discrete-time systems," in Proc. International Joint Conference on Neural Networks, Hong Kong, June 2008, pp. 1417-1424.
    • (2008) Proc. International Joint Conference on Neural Networks , pp. 1417-1424
    • Liu, D.1    Jin, N.2
  • 17
    • 84954411226 scopus 로고    scopus 로고
    • Adaptive critic based neurocontroller for turbogenerators with global dual heuristic programming
    • Singapore, Jan.
    • G. K. Venayagamoorthy, D. C. Wunsch, and R. G. Harley, "Adaptive critic based neurocontroller for turbogenerators with global dual heuristic programming," in Proc. IEEE PES Winter Meet., Singapore, Jan. 2000, vol. 1, pp. 291-294.
    • (2000) Proc. IEEE PES Winter Meet. , vol.1 , pp. 291-294
    • Venayagamoorthy, G.K.1    Wunsch, D.C.2    Harley, R.G.3
  • 18
    • 0036565019 scopus 로고    scopus 로고
    • Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
    • DOI 10.1109/TNN.2002.1000146, PII S1045922702044417
    • G. K. Venayagamoorthy, R. G. Harley, and D. C. Wunsch, "Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator," IEEE Trans. Neural Netw., vol. 13, no. 3, pp. 764-773, May 2002. (Pubitemid 34669664)
    • (2002) IEEE Transactions on Neural Networks , vol.13 , Issue.3 , pp. 764-773
    • Venayagamoorthy, G.K.1    Harley, R.G.2    Wunsch, D.C.3
  • 19
    • 0242337541 scopus 로고    scopus 로고
    • Adaptivecritic- based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks
    • Sept./Oct.
    • J. W. Park, R. G. Harley, and G. K. Venayagamoorthy, " Adaptivecritic- based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks," IEEE Trans. Ind. Appl., vol. 39, no. 5, pp. 1529-1540, Sept./Oct. 2003.
    • (2003) IEEE Trans. Ind. Appl. , vol.39 , Issue.5 , pp. 1529-1540
    • Park, J.W.1    Harley, R.G.2    Venayagamoorthy, G.K.3
  • 20
    • 17644391408 scopus 로고    scopus 로고
    • Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor
    • Apr.
    • G. G. Yen and P. G. DeLima, "Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor," IEEE Trans. Automation Science and Engineering, vol. 2, no. 2, pp. 121-131, Apr. 2005.
    • (2005) IEEE Trans. Automation Science and Engineering , vol.2 , Issue.2 , pp. 121-131
    • Yen, G.G.1    Delima, P.G.2
  • 21
    • 14844340822 scopus 로고    scopus 로고
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • DOI 10.1016/j.automatica.2004.11.034, PII S0005109805000105
    • M. Abu-Khalaf and F. L. Lewis. "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5,779-791, May 2005. (Pubitemid 40352391)
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 22
    • 33846781133 scopus 로고    scopus 로고
    • A neural network solution for fixed-final time optimal control of nonlinear systems
    • DOI 10.1016/j.automatica.2006.09.021, PII S0005109806004250
    • T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "A neural network solution for fixed-final time optimal control of nonlinear systems," Automatica, vol. 43, no. 3, pp. 482-490, Mar. 2007. (Pubitemid 46209051)
    • (2007) Automatica , vol.43 , Issue.3 , pp. 482-490
    • Cheng, T.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 23
    • 0030196717 scopus 로고    scopus 로고
    • Adaptive-critic-based neural networks for aircraft optimal control
    • S. N. Balakrishnan and Y. Biega, "Adaptive-critic based neural networks for aircraft optimal control," Journal of Guidance, Control, and Dynamics, vol. 19, no. 4, pp. 893-898, July-Aug. 1996. (Pubitemid 126539437)
    • (1996) Journal of Guidance, Control, and Dynamics , vol.19 , Issue.4 , pp. 893-898
    • Balakrishnan, S.N.1    Biega, V.2
  • 24
    • 0035427378 scopus 로고    scopus 로고
    • Adaptive-critic based optimal neuro control synthesis for distributed parameter systems
    • DOI 10.1016/S0005-1098(01)00093-0, PII S0005109801000930
    • R. Padhi, S. N. Balakrishnan, and T. Randolph, "Adaptive-critic based optimal neuro control synthesis for distributed parameter systems," Automatica, vol. 37, no. 8, pp. 1223-1234, Aug. 2001. (Pubitemid 32610253)
    • (2001) Automatica , vol.37 , Issue.8 , pp. 1223-1234
    • Padhi, R.1    Balakrishnan, S.N.2    Randolph, T.3
  • 25
    • 0036641793 scopus 로고    scopus 로고
    • State-constrained agile missile control with adaptive-critic-based neural networks
    • DOI 10.1109/TCST.2002.1014669, PII S1063653602053605
    • D. Han and S. N. Balakrishnan, "State-constrained agile missile control with adaptive critic-based neural networks," IEEE Trans. Control Systems Technology, vol. 10, no. 4, pp. 481-489, July 2002. (Pubitemid 34798672)
    • (2002) IEEE Transactions on Control Systems Technology , vol.10 , Issue.4 , pp. 481-489
    • Han, D.1    Balakrishnan, S.N.2
  • 26
    • 33751238181 scopus 로고    scopus 로고
    • A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
    • DOI 10.1016/j.neunet.2006.08.010, PII S0893608006001912
    • R. Padhi, N. Unnikrishnan, X. Wang, and S. N. Balakrishnan, "A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems," Neural Networks, vol. 19, no. 10, pp. 1648-1660, Dec. 2006. (Pubitemid 44793175)
    • (2006) Neural Networks , vol.19 , Issue.10 , pp. 1648-1660
    • Padhi, R.1    Unnikrishnan, N.2    Wang, X.3    Balakrishnan, S.N.4
  • 27
    • 49049111594 scopus 로고    scopus 로고
    • Issues on stability of ADP feedback controllers for dynamic systems
    • Aug.
    • S. N. Balakrishnan, J. Ding, and F. L. Lewis, "Issues on stability of ADP feedback controllers for dynamic systems," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 913-917, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 913-917
    • Balakrishnan, S.N.1    Ding, J.2    Lewis, F.L.3
  • 28
    • 49049089962 scopus 로고    scopus 로고
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Aug.
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 29
    • 49049119493 scopus 로고    scopus 로고
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • Aug.
    • H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 30
    • 70349253929 scopus 로고    scopus 로고
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • Sept.
    • H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sept. 2009.
    • (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3
  • 31
    • 67349145396 scopus 로고    scopus 로고
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • Apr.
    • D. Vrabie and F. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems" Neural Networks, vol. 22, no. 3, pp. 237-246, Apr. 2009.
    • (2009) Neural Networks , vol.22 , Issue.3 , pp. 237-246
    • Vrabie, D.1    Lewis, F.2
  • 32
    • 68149180889 scopus 로고    scopus 로고
    • Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
    • July-Aug.
    • T. Dierks, B. T. Thumati, and J. Sarangapani, "Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence," Neural Networks, vol. 22, no. 5-6, pp. 851-860, July-Aug. 2009.
    • (2009) Neural Networks , vol.22 , Issue.5-6 , pp. 851-860
    • Dierks, T.1    Thumati, B.T.2    Sarangapani, J.3
  • 33
    • 77950630017 scopus 로고    scopus 로고
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • May
    • K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 34
    • 77955423822 scopus 로고    scopus 로고
    • ∞ control design for unknown linear discrete-time systems via Q-learning with LMI
    • Aug.
    • ∞ control design for unknown linear discrete-time systems via Q-learning with LMI," Automatica, vol. 46, no. 8, pp. 1320-1326, Aug. 2010.
    • (2010) Automatica , vol.46 , Issue.8 , pp. 1320-1326
    • Kim, J.H.1    Lewis, F.L.2
  • 36
    • 57749111482 scopus 로고    scopus 로고
    • Neural-network-based state feedback control of a nonlinear discrete-time system in nonstrict feedback form
    • Dec.
    • J. Sarangapani and P. He, "Neural-network-based state feedback control of a nonlinear discrete-time system in nonstrict feedback form," IEEE Trans. Neural Netw., vol. 19, no. 12, pp. 2073-2087, Dec. 2008.
    • (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.12 , pp. 2073-2087
    • Sarangapani, J.1    He, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.