Volume 23, Issue 7, 2012, Pages 1118-1129

Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update

Author keywords

Hamilton Jacobi Bellman; online approximators; online nonlinear optimal control; time based policy update

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATORS; HAMILTON JACOBI BELLMAN EQUATION; HAMILTON-JACOBI-BELLMAN; NONLINEAR DISCRETE-TIME SYSTEMS; ONLINE NONLINEAR OPTIMAL CONTROLS; TIME-BASED POLICY UPDATE; UNIFORMLY ULTIMATELY BOUNDED

EID: 84875270081     PISSN: 2162-237X     EISSN: 2162-2388     Source Type: Journal
DOI: 10.1109/TNNLS.2012.2196708     Document Type: Article
Times cited: 229

References (30)
  • 2
    • 0027556823
    • Control of nonlinear dynamical systems using neural networks: Controllability and stabilization
    • Mar
    • A. U. Levin and K. S. Narendra, "Control of nonlinear dynamical systems using neural networks: Controllability and stabilization," IEEE Trans. Neural Netw., vol. 4, no. 2, pp. 192-206, Mar. 1993.
    • (1993) IEEE Trans. Neural Netw , vol.4 , Issue.2 , pp. 192-206
    • Levin, A.U.1    Narendra, K.S.2
  • 4
    • 0037350867
    • Existence of SDRE stabilizing feedback
    • Mar
    • J. Shamma and J. Cloutier, "Existence of SDRE stabilizing feedback," IEEE Trans. Autom. Control, vol. 48, no. 3, pp. 513-517, Mar. 2003.
    • (2003) IEEE Trans. Autom. Control , vol.48 , Issue.3 , pp. 513-517
    • Shamma, J.1    Cloutier, J.2
  • 5
    • 0022116454
    • An existence theorem for discrete-time infinite-horizon optimal control problems
    • Sep
    • S. Keerthi and E. Gilbert, "An existence theorem for discrete-time infinite-horizon optimal control problems," IEEE Trans. Autom. Control, vol. 30, no. 9, pp. 907-909, Sep. 1985.
    • (1985) IEEE Trans. Autom. Control , vol.30 , Issue.9 , pp. 907-909
    • Keerthi, S.1    Gilbert, E.2
  • 6
    • 0032209114
    • Neural approximations for infinite-horizon optimal control of nonlinear stochastic systems
    • Nov
    • T. Parisini and R. Zoppoli, "Neural approximations for infinite-horizon optimal control of nonlinear stochastic systems," IEEE Trans. Neural Netw., vol. 9, no. 6, pp. 1388-1408, Nov. 1998.
    • (1998) IEEE Trans. Neural Netw , vol.9 , Issue.6 , pp. 1388-1408
    • Parisini, T.1    Zoppoli, R.2
  • 8
    • 39549085591
    • Generalized Hamilton-Jacobi-Bellman formulation based neural network control of affine nonlinear discrete-time systems
    • Jan
    • Z. Chen and S. Jagannathan, "Generalized Hamilton-Jacobi-Bellman formulation based neural network control of affine nonlinear discrete-time systems," IEEE Trans. Neural Netw., vol. 19, no. 1, pp. 90-106, Jan. 2008.
    • (2008) IEEE Trans. Neural Netw , vol.19 , Issue.1 , pp. 90-106
    • Chen, Z.1    Jagannathan, S.2
  • 9
    • 49049089962
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Aug
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern., B, vol. 38, no. 4, pp. 943-949, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern., B , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 10
    • 70349253929
    • Neural network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • Sep
    • H. Zhang, Y. Luo, and D. Liu, "Neural network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
    • (2009) IEEE Trans. Neural Netw , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3
  • 11
    • 34547098844
    • Kernel-based least squares policy iteration for reinforcement learning
    • Jul
    • X. Xu, D. Hu, and X. Lu, "Kernel-based least squares policy iteration for reinforcement learning," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 973-992, Jul. 2007.
    • (2007) IEEE Trans. Neural Netw , vol.18 , Issue.4 , pp. 973-992
    • Xu, X.1    Hu, D.2    Lu, X.3
  • 12
    • 34547095501
    • Least squares solutions of the HJB equation with neural network value-function approximators
    • Jul
    • Y. Tassa and T. Erez, "Least squares solutions of the HJB equation with neural network value-function approximators," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 1031-1041, Jul. 2007.
    • (2007) IEEE Trans. Neural Netw , vol.18 , Issue.4 , pp. 1031-1041
    • Tassa, Y.1    Erez, T.2
  • 13
    • 36348986773
    • Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach
    • Nov
    • T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach," IEEE Trans. Neural Netw., vol. 18, no. 6, pp. 1725-1736, Nov. 2007.
    • (2007) IEEE Trans. Neural Netw , vol.18 , Issue.6 , pp. 1725-1736
    • Cheng, T.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 14
    • 2442499479
    • Trajectory priming with dynamic fuzzy networks in nonlinear optimal control
    • Mar
    • Y. Becerikli, Y. Oysal, and A. F. Konar, "Trajectory priming with dynamic fuzzy networks in nonlinear optimal control," IEEE Trans. Neural Netw., vol. 15, no. 2, pp. 383-394, Mar. 2004.
    • (2004) IEEE Trans. Neural Netw , vol.15 , Issue.2 , pp. 383-394
    • Becerikli, Y.1    Oysal, Y.2    Konar, A.F.3
  • 16
  • 17
    • 0031236002
    • Adaptive critic designs
    • Sep
    • D. V. Prokhorov and D. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
    • (1997) IEEE Trans. Neural Netw , vol.8 , Issue.5 , pp. 997-1007
    • Prokhorov, D.V.1    Wunsch, D.2
  • 18
    • 0035273403
    • Online learning control by association and reinforcement
    • Mar
    • J. Si and Y. T. Wang, "Online learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
    • (2001) IEEE Trans. Neural Netw , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 19
    • 77955513754
    • Approximate robust policy iteration using multilayer perceptron neural networks for discounted infinite-horizon Markov decision processes with uncertain correlated transition matrices
    • Aug
    • B. Li and J. Si, "Approximate robust policy iteration using multilayer perceptron neural networks for discounted infinite-horizon Markov decision processes with uncertain correlated transition matrices," IEEE Trans. Neural Netw., vol. 21, no. 8, pp. 1270-1280, Aug. 2010.
    • (2010) IEEE Trans. Neural Netw , vol.21 , Issue.8 , pp. 1270-1280
    • Li, B.1    Si, J.2
  • 20
    • 58349110975
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
    • (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.L.4
  • 21
    • 77950630017
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
    • (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 22
    • 0025627940
    • Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks
    • K. Hornik, M. Stinchcombe, and H. White, "Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks," Neural Netw., vol. 3, no. 5, pp. 551-560, 1990.
    • (1990) Neural Netw , vol.3 , Issue.5 , pp. 551-560
    • Hornik, K.1    Stinchcombe, M.2    White, H.3
  • 23
    • 23944484578
    • Optimized discrete-time state dependent Riccati equation regulator
    • A. S. Dutka, A. W. Ordys, and M. J. Grimble, "Optimized discrete-time state dependent Riccati equation regulator," in Proc. Amer. Control Conf., vol. 4, 2005, pp. 2293-2298.
    • (2005) Proc. Amer. Control Conf , vol.4 , pp. 2293-2298
    • Dutka, A.S.1    Ordys, A.W.2    Grimble, M.J.3
  • 24
    • 0034291333
    • Dynamic surface control for a class of nonlinear systems
    • Oct
    • D. Swaroop, J. K. Hedrick, P. P. Yip, and J. C. Gerdes, "Dynamic surface control for a class of nonlinear systems," IEEE Trans. Autom. Control, vol. 45, no. 10, pp. 1893-1899, Oct. 2000.
    • (2000) IEEE Trans. Autom. Control , vol.45 , Issue.10 , pp. 1893-1899
    • Swaroop, D.1    Hedrick, J.K.2    Yip, P.P.3    Gerdes, J.C.4
  • 25
    • 0018011435
    • Kronecker products and matrix calculus in system theory
    • Sep
    • J. W. Brewer, "Kronecker products and matrix calculus in system theory," IEEE Trans. Circuits Syst., vol. 25, no. 9, pp. 772-781, Sep. 1978.
    • (1978) IEEE Trans. Circuits Syst , vol.25 , Issue.9 , pp. 772-781
    • Brewer, J.W.1
  • 26
    • 77957772128
    • Optimal control of affine nonlinear discrete-time systems
    • Jun
    • T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear discrete-time systems," in Proc. Medit. Conf. Control Autom., Jun. 2009, pp. 1390-1395.
    • (2009) Proc. Medit. Conf. Control Autom , pp. 1390-1395
    • Dierks, T.1    Jagannathan, S.2
  • 27
    • 0034439915
    • H∞-optimal tracking control techniques for nonlinear underactuated systems
    • Dec
    • G. Toussaint, T. Basar, and F. Bullo, "H∞-optimal tracking control techniques for nonlinear underactuated systems," in Proc. IEEE Decision Control Conf., Dec. 2000, pp. 2078-2083.
    • (2000) Proc. IEEE Decision Control Conf. , pp. 2078-2083
    • Toussaint, G.1    Basar, T.2    Bullo, F.3
  • 28
    • 77957777969
    • Optimal control of affine nonlinear continuous-time systems
    • Jun
    • T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems," in Proc. IEEE Amer. Control Conf., Jun. 2010, pp. 1568-1573.
    • (2010) Proc. IEEE Amer. Control Conf. , pp. 1568-1573
    • Dierks, T.1    Jagannathan, S.2
  • 30
    • 0031143730
    • An analysis of temporal-difference learning with function approximation
    • May
    • J. N. Tsitsiklis and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Trans. Autom. Control, vol. 42, no. 5, pp. 674-690, May 1997.
    • (1997) IEEE Trans. Autom. Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.