Volume 22, Issue 12 PART 2, 2011, Pages 2226-2236

Data-driven robust approximate optimal tracking control for unknown general nonlinear systems using adaptive dynamic programming method

Author keywords

Adaptive dynamic programming; data driven model; neural networks; optimal tracking control; robust control

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATION ERRORS; CONTROL INPUTS; CONTROL SCHEMES; DATA-DRIVEN; DATA-DRIVEN MODEL; INPUT-OUTPUT DATA; LYAPUNOV APPROACH; MODELING ERRORS; NUMERICAL EXAMPLE; OPTIMAL CONTROLS; OPTIMAL FEEDBACK; OPTIMAL TRACKING; OPTIMAL TRACKING CONTROL; STABILITY ANALYSIS; SYSTEM DYNAMICS; SYSTEM STATE

EID: 83655163786     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/TNN.2011.2168538     Document Type: Article
Times cited: 586

References (41)
  • 2
    • 14844340822
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
    • DOI 10.1016/j.automatica.2004.11.034, PII S0005109805000105
    • M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005. (Pubitemid 40352391)
    • (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 4
    • 0031332446
    • Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
    • Dec
    • R. Beard, G. Saridis, and J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation," Automatica, vol. 33, no. 12, pp. 2159-2177, Dec. 1997.
    • (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
    • Beard, R.1    Saridis, G.2    Wen, J.3
  • 5
    • 49049089962
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Aug
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern., Part B: Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern., Part B: Cybern. , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 6
    • 77950630017
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • May
    • K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 7
    • 78651311269
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • Jan.
    • F.-Y. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound," IEEE Trans. Neural Netw., vol. 22, no. 1, pp. 24-36, Jan. 2011.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.1 , pp. 24-36
    • Wang, F.-Y.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 9
    • 78650805234
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Jan.
    • H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, Jan. 2011.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 10
    • 68149180889
    • Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
    • Jul.-Aug
    • T. Dierks, B. T. Thumati, and S. Jagannathan, "Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence," Neural Netw., vol. 22, nos. 5-6, pp. 851-860, Jul.-Aug. 2009.
    • (2009) Neural Netw. , vol.22 , Issue.5-6 , pp. 851-860
    • Dierks, T.1    Thumati, B.T.2    Jagannathan, S.3
  • 11
    • 70349253929
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • Sep
    • H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
    • (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3
  • 13
    • 66449130966
    • Adaptive dynamic programming: An introduction
    • May
    • F.-Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intell. Mag., vol. 4, no. 2, pp. 39-47, May 2009.
    • (2009) IEEE Comput. Intell. Mag. , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.-Y.1    Zhang, H.2    Liu, D.3
  • 14
    • 70349116541
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • Aug
    • F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, Aug. 2009.
    • (2009) IEEE Circuits Syst. Mag. , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 15
    • 33846781129
    • Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
    • DOI 10.1016/j.automatica.2006.09.019, PII S0005109806004249
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control," Automatica, vol. 43, no. 3, pp. 473-481, Mar. 2007. (Pubitemid 46209050)
    • (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 16
    • 67650567581
    • Data-based optimal control for discrete-time zero-sum games of 2-D systems using adaptive critic designs
    • Jun
    • Q. Wei, H. Zhang, and L. Cui, "Data-based optimal control for discrete-time zero-sum games of 2-D systems using adaptive critic designs," ACTA Autom. Sinica, vol. 35, no. 6, pp. 682-692, Jun. 2009.
    • (2009) ACTA Autom. Sinica , vol.35 , Issue.6 , pp. 682-692
    • Wei, Q.1    Zhang, H.2    Cui, L.3
  • 17
    • 58349110975
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • Feb
    • D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, Feb. 2009.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.L.4
  • 18
    • 67349145396
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • Apr
    • D. Vrabie and F. L. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, Apr. 2009.
    • (2009) Neural Netw. , vol.22 , Issue.3 , pp. 237-246
    • Vrabie, D.1    Lewis, F.L.2
  • 19
    • 0031272107
    • Identification of a multistep-ahead observer and its application to predictive control
    • R. K. Lim and M. Q. Phan, "Identification of a multistep-ahead observer and its application to predictive control," J. Guid. Control Dyn., vol. 20, no. 6, pp. 1200-1206, 1997. (Pubitemid 127561126)
    • (1997) Journal of Guidance, Control, and Dynamics , vol.20 , Issue.6 , pp. 1200-1206
    • Lim, R.K.1    Phan, M.Q.2
  • 20
    • 0031169892
    • The unfalsified control concept and learning
    • PII S0018928697033862
    • M. G. Safonov and T. C. Tsao, "The unfalsified control concept and learning," IEEE Trans. Autom. Control, vol. 42, no. 6, pp. 843-847, Jun. 1997. (Pubitemid 127760880)
    • (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.6 , pp. 843-847
    • Safonov, M.G.1    Tsao, T.-C.2
  • 22
    • 18444379381
    • Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes
    • DOI 10.1016/j.automatica.2005.02.006, PII S0005109805000786
    • J. M. Lee and J. H. Lee, "Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes," Automatica, vol. 41, no. 7, pp. 1281-1288, Jul. 2005. (Pubitemid 40644867)
    • (2005) Automatica , vol.41 , Issue.7 , pp. 1281-1288
    • Lee, J.M.1    Lee, J.H.2
  • 24
    • 79551685808
    • Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
    • Feb
    • F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern., Part B: Cybern., vol. 41, no. 1, pp. 14-25, Feb. 2011.
    • (2011) IEEE Trans. Syst., Man, Cybern., Part B: Cybern. , vol.41 , Issue.1 , pp. 14-25
    • Lewis, F.L.1    Vamvoudakis, K.G.2
  • 25
    • 33845994739
    • Fault tolerant control based on stochastic distributions via MLP neural networks
    • DOI 10.1016/j.neucom.2006.10.030, PII S0925231206002931
    • Y. Zhang, L. Guo, H. Yu, and K. Zhao, "Fault tolerant control based on stochastic distributions via MLP neural networks," Neurocomputing, vol. 70, nos. 4-6, pp. 867-874, Jan. 2007. (Pubitemid 46043985)
    • (2007) Neurocomputing , vol.70 , Issue.4-6 , pp. 867-874
    • Zhang, Y.1    Guo, L.2    Yu, H.3    Zhao, K.4
  • 26
    • 26444571792
    • Fault detection and diagnosis for general stochastic systems using B-spline expansions and nonlinear filters
    • DOI 10.1109/TCSI.2005.851686
    • L. Guo and H. Wang, "Fault detection and diagnosis for general stochastic systems using B-spline expansions and nonlinear filters," IEEE Trans. Circuits Syst. I, vol. 52, no. 8, pp. 1644-1652, Aug. 2005. (Pubitemid 41430881)
    • (2005) IEEE Transactions on Circuits and Systems I: Regular Papers , vol.52 , Issue.8 , pp. 1644-1652
    • Guo, L.1    Wang, H.2
  • 27
    • 77649238242
    • A generalized procedure in designing recurrent neural network identification and control of time-varying-delayed nonlinear dynamic systems
    • Mar.
    • X. Wu, J. Zhang, and Q. Zhu, "A generalized procedure in designing recurrent neural network identification and control of time-varying-delayed nonlinear dynamic systems," Neurocomputing, vol. 73, nos. 7-9, pp. 1376-1383, Mar. 2010.
    • (2010) Neurocomputing , vol.73 , Issue.7-9 , pp. 1376-1383
    • Wu, X.1    Zhang, J.2    Zhu, Q.3
  • 28
    • 33846077026
    • Nonlinear adaptive wavelet control using constructive wavelet networks
    • DOI 10.1109/TNN.2006.886759
    • J. Xu and Y. Tan, "Nonlinear adaptive wavelet control using constructive wavelet networks," IEEE Trans. Neural Netw., vol. 18, no. 1, pp. 115-127, Jan. 2007. (Pubitemid 46062921)
    • (2007) IEEE Transactions on Neural Networks , vol.18 , Issue.1 , pp. 115-127
    • Xu, J.-X.1    Tan, Y.2
  • 29
    • 67650556658
    • On data-driven control theory: The state of the art and perspective
    • Z. Hou and J. Xu, "On data-driven control theory: The state of the art and perspective," ACTA Autom. Sinica, vol. 35, no. 6, pp. 650-667, 2009.
    • (2009) ACTA Autom. Sinica , vol.35 , Issue.6 , pp. 650-667
    • Hou, Z.1    Xu, J.2
  • 30
    • 0034259653
    • Markov data-based LQG control
    • R. E. Skelton and G. Shi, "Markov data-based LQG control," J. Dyn. Syst., Meas., Control, vol. 122, no. 3, pp. 551-559, 2000.
    • (2000) J. Dyn. Syst., Meas., Control , vol.122 , Issue.3 , pp. 551-559
    • Skelton, R.E.1    Shi, G.2
  • 32
    • 0030242079
    • An optimal tracking neurocontroller for nonlinear dynamic systems
    • Sep
    • Y. M. Park, M. S. Choi, and K. W. Lee, "An optimal tracking neurocontroller for nonlinear dynamic systems," IEEE Trans. Neural Netw., vol. 7, no. 5, pp. 1099-1110, Sep. 1996.
    • (1996) IEEE Trans. Neural Netw. , vol.7 , Issue.5 , pp. 1099-1110
    • Park, Y.M.1    Choi, M.S.2    Lee, K.W.3
  • 33
    • 33845863921
    • Optimal output tracking control for nonlinear systems via successive approximation approach
    • DOI 10.1016/j.na.2006.01.021, PII S0362546X06000630
    • G. Tang, Y. Zhao, and B. Zhang, "Optimal output tracking control for nonlinear systems via successive approximation approach," Nonlin. Anal., vol. 66, no. 6, pp. 1365-1377, Mar. 2007. (Pubitemid 46027396)
    • (2007) Nonlinear Analysis, Theory, Methods and Applications , vol.66 , Issue.6 , pp. 1365-1377
    • Tang, G.-Y.1    Zhao, Y.-D.2    Zhang, B.-L.3
  • 34
    • 49049119493
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • Aug
    • H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern., Part B: Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern., Part B: Cybern. , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 35
    • 77950853735
    • Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics
    • Shanghai, Dec
    • T. Dierks and S. Jagannathan, "Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics," in Proc. 48th IEEE Conf. Decis. Control Conf. Chin. Control, Shanghai, Dec. 2009, pp. 6750-6755.
    • (2009) Proc. 48th IEEE Conf. Decis. Control Conf. Chin. Control , pp. 6750-6755
    • Dierks, T.1    Jagannathan, S.2
  • 36
    • 0033284936
    • Synthesis and experimental testing of a nonlinear optimal tracking controller
    • San Diego, CA, Jun
    • T. W. McLain, C. A. Bailey, and R. W. Beard, "Synthesis and experimental testing of a nonlinear optimal tracking controller," in Proc. Amer. Control Conf., vol. 4. San Diego, CA, Jun. 1999, pp. 2847-2851.
    • (1999) Proc. Amer. Control Conf. , vol.4 , pp. 2847-2851
    • McLain, T.W.1    Bailey, C.A.2    Beard, R.W.3
  • 37
    • 34547133970
    • Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks
    • DOI 10.1109/TNN.2007.899229, Neural Networks for Feedback Control Systems
    • V. Yadav, R. Padhi, and S. N. Balakrishnan, "Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 1115-1128, Jul. 2007. (Pubitemid 47098885)
    • (2007) IEEE Transactions on Neural Networks , vol.18 , Issue.4 , pp. 1115-1128
    • Yadav, V.1    Padhi, R.2    Balakrishnan, S.N.3
  • 38
    • 33947583893
    • Stability analysis of nonlinear system identification via delayed neural networks
    • DOI 10.1109/TCSII.2006.886464
    • J. D. J. Rubio and W. Yu, "Stability analysis of nonlinear system identification via delayed neural networks," IEEE Trans. Circuits Syst. II, vol. 54, no. 2, pp. 161-165, Feb. 2007. (Pubitemid 46477682)
    • (2007) IEEE Transactions on Circuits and Systems II: Express Briefs , vol.54 , Issue.2 , pp. 161-165
    • De Jesus Rubio, J.1    Yu, W.2
  • 39
    • 39549084132
    • Neural network adaptive control for a class of nonlinear uncertain dynamical systems with asymptotic stability guarantees
    • DOI 10.1109/TNN.2007.902704
    • T. Hayakawa, W. M. Haddad, and N. Hovakimyan, "Neural network adaptive control for a class of nonlinear uncertain dynamical systems with asymptotic stability guarantees," IEEE Trans. Neural Netw., vol. 19, no. 1, pp. 80-89, Jan. 2008. (Pubitemid 351279227)
    • (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.1 , pp. 80-89
    • Hayakawa, T.1    Haddad, W.M.2    Hovakimyan, N.3
  • 40
    • 0036980830
    • A unification between partial stability and stability theory for time-varying systems
    • Dec
    • V. Chellaboina and W. M. Haddad, "A unification between partial stability and stability theory for time-varying systems," IEEE Control Syst. Mag., vol. 22, no. 6, pp. 66-75, Dec. 2002.
    • (2002) IEEE Control Syst. Mag. , vol.22 , Issue.6 , pp. 66-75
    • Chellaboina, V.1    Haddad, W.M.2
  • 41
    • 0004178386
    • Englewood Cliffs, NJ: Prentice-Hall
    • H. K. Khalil, Nonlinear Systems. Englewood Cliffs, NJ: Prentice-Hall, 2002.
    • (2002) Nonlinear Systems
    • Khalil, H.K.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.