SCOPUS 정보 검색 플랫폼

IEEE Transactions on Neural Networks and Learning Systems

Volumn 23, Issue 7, 2012, Pages 1118-1129

Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update

(2) Dierks, Travis a Jagannathan, Sarangapani b

a DRS Sustainment Systems Inc (United States)

b Architectural and Environmental Engineering (United States)

Author keywords

Hamilton Jacobi Bellman; online approximators; online nonlinear optimal control; time based policy update

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATORS; HAMILTON JACOBI BELLMAN EQUATION; HAMILTON-JACOBI-BELLMAN; NONLINEAR DISCRETE-TIME SYSTEMS; ONLINE NONLINEAR OPTIMAL CONTROLS; TIME-BASED POLICY UPDATE; UNIFORMLY ULTIMATELY BOUNDED;

CONTROL; COST FUNCTIONS; DIGITAL CONTROL SYSTEMS; DISCRETE TIME CONTROL SYSTEMS; DYNAMIC PROGRAMMING; HARDWARE;

OPTIMAL CONTROL SYSTEMS;

EID: 84875270081 PISSN: 2162237X EISSN: 21622388 Source Type: Journal
DOI: 10.1109/TNNLS.2012.2196708 Document Type: Article

Times cited : (229)

References (30)

1
- 34047119733
- Boca Raton FL: CRC Press
- S. Jagannathan, Neural Network Control of Nonlinear Discrete-Time Systems. Boca Raton, FL: CRC Press, 2006.
- (2006) Neural Network Control of Nonlinear Discrete-Time Systems
- Jagannathan, S.¹

2
- 0027556823
- Control of nonlinear dynamical systems using neural networks: Controllability and stabilization
- Mar
- A. U. Levin and K. S. Narendra, "Control of nonlinear dynamical systems using neural networks: Controllability and stabilization," IEEE Trans. Neural Netw., vol. 4, no. 2, pp. 192-206, Mar. 1993.
- (1993) IEEE Trans. Neural Netw , vol.4 , Issue.2 , pp. 192-206
- Levin, A.U.¹ Narendra, K.S.²

3
- 0004163205
- 2nd ed. Hoboken, NJ: Wiley
- F. L. Lewis and V. L. Syrmos, Optimal Control, 2nd ed. Hoboken, NJ: Wiley, 1995.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

4
- 0037350867
- Existence of SDRE stabilizing feedback
- Mar
- J. Shamma and J. Cloutier, "Existence of SDRE stabilizing feedback," IEEE Trans. Autom. Control, vol. 48, no. 3, pp. 513-517, Mar. 2003.
- (2003) IEEE Trans. Autom. Control , vol.48 , Issue.3 , pp. 513-517
- Shamma, J.¹ Cloutier, J.²

5
- 0022116454
- An existence theorem for discrete-time infinite-horizon optimal control problems
- Sep
- S. Keerthi and E. Gilbert, "An existence theorem for discrete-time infinite-horizon optimal control problems," IEEE Trans. Autom. Control, vol. 30, no. 9, pp. 907-909, Sep. 1985.
- (1985) IEEE Trans. Autom. Control , vol.30 , Issue.9 , pp. 907-909
- Keerthi, S.¹ Gilbert, E.²

6
- 0032209114
- Neural approximations for infinite-horizon optimal control of nonlinear stochastic systems
- Nov
- T. Parisini and R. Zoppoli, "Neural approximations for infinite-horizon optimal control of nonlinear stochastic systems," IEEE Trans. Neural Netw., vol. 9, no. 6, pp. 1388-1408, Nov. 1998.
- (1998) IEEE Trans. Neural Netw , vol.9 , Issue.6 , pp. 1388-1408
- Parisini, T.¹ Zoppoli, R.²

7
- 0003565783
- 2nd ed. Belmonth MA: Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control, 2nd ed. Belmonth, MA: Athena Scientific, 2000.
- (2000) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

8
- 39549085591
- Generalized Hamilton-Jacobi-Bellman formulation based neural network control of affine nonlinear discretetime systems
- Jan
- Z. Chen and S. Jagannathan, "Generalized Hamilton-Jacobi-Bellman formulation based neural network control of affine nonlinear discretetime systems," IEEE Trans. Neural Netw., vol. 10, no. 1, pp. 90-106, Jan. 2008.
- (2008) IEEE Trans. Neural Netw , vol.10 , Issue.1 , pp. 90-106
- Chen, Z.¹ Jagannathan, S.²

9
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- Aug
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern., B, vol. 38, no. 4, pp. 943-949, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern., B , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

10
- 70349253929
- Neural network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
- Sep
- H. Zhang, Y. Luo, and D. Liu, "Neural network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
- (2009) IEEE Trans. Neural Netw , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.¹ Luo, Y.² Liu, D.³

11
- 34547098844
- Kernel-based least squares policy iteration for reinforcement learning
- Jul
- X. Xu, D. Hu, and X. Lu, "Kernel-based least squares policy iteration for reinforcement learning," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 973-992, Jul. 2007.
- (2007) IEEE Trans. Neural Netw , vol.18 , Issue.4 , pp. 973-992
- Xu, X.¹ Hu, D.² Lu, X.³

12
- 34547095501
- Least squares solutions of the HJB equation with neural network value-function approximators
- Jul
- Y. Tassa and T. Erez, "Least squares solutions of the HJB equation with neural network value-function approximators," IEEE Trans. Neural Netw., vol. 18, no. 4, pp. 1031-1041, Jul. 2007.
- (2007) IEEE Trans. Neural Netw , vol.18 , Issue.4 , pp. 1031-1041
- Tassa, Y.¹ Erez, T.²

13
- 36348986773
- Fixed-final-time-constrained optimal control of nonlinear systems using neural network HJB approach
- Nov
- T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "Fixed-final-time- constrained optimal control of nonlinear systems using neural network HJB approach," IEEE Trans. Neural Netw., vol. 18, no. 6, pp. 1725-1736, Nov. 2007.
- (2007) IEEE Trans. Neural Netw , vol.18 , Issue.6 , pp. 1725-1736
- Cheng, T.¹ Lewis, F.L.² Abu-Khalaf, M.³

14
- 2442499479
- Trajectory priming with dynamic fuzzy networks in nonlinear optimal control
- Mar
- Y. Becerikli, Y. Oysal, and A. F. Konar, "Trajectory priming with dynamic fuzzy networks in nonlinear optimal control," IEEE Trans. Neural Netw., vol. 15, no. 2, pp. 383-394, Mar. 2004.
- (2004) IEEE Trans. Neural Netw , vol.15 , Issue.2 , pp. 383-394
- Becerikli, Y.¹ Oysal, Y.² Konar, A.F.³

15
- 84921399937
- New York: Wiley
- J. Si, A. G. Barto,W. B. Powell, and D. Wunsch, Handbook of Learning and Approximate Dynamics Programming. New York: Wiley, 2004.
- (2004) Handbook of Learning and Approximate Dynamics Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.⁴

16
- 34249047468
- Continuous-time adaptive critics
- May
- T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics," IEEE Trans. Neural Netw., vol. 18, no. 3, pp. 631-647, May 2007.
- (2007) IEEE Trans. Neural Netw , vol.18 , Issue.3 , pp. 631-647
- Hanselmann, T.¹ Noakes, L.² Zaknich, A.³

17
- 0031236002
- Adaptive critic designs
- Sep
- D. V. Prokhorov and D. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
- (1997) IEEE Trans. Neural Netw , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, D.²

18
- 0035273403
- Online learning control by association and reinforcement
- Mar
- J. Si and Y. T. Wang, "Online learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
- (2001) IEEE Trans. Neural Netw , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.T.²

19
- 77955513754
- Approximate robust policy iteration using multilayer perceptron neural networks for discounted infinite-horizon Markov decision processes with uncertain correlate transition matrices
- Aug.
- B. Li and J. Si, "Approximate robust policy iteration using multilayer perceptron neural networks for discounted infinite-horizon Markov decision processes with uncertain correlate transition matrices," IEEE Trans. Neural Netw., vol. 21, no. 8, pp. 1270-1280, Aug. 2010.
- (2010) IEEE Trans. Neural Netw , vol.21 , Issue.8 , pp. 1270-1280
- Li, B.¹ Si, J.²

20
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

21
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

22
- 0025627940
- Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks
- K. Hornik, M. Stinchcombe, and H. White, "Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks," Neural Netw., vol. 3, no. 5, pp. 551-560, 1990.
- (1990) Neural Netw , vol.3 , Issue.5 , pp. 551-560
- Hornik, K.¹ Stinchcombe, M.² White, H.³

23
- 23944484578
- Optimized discretetime state dependent Riccati equation regulator
- A. S. Dutka, A. W. Ordys, and M. J. Grimble, "Optimized discretetime state dependent Riccati equation regulator," in Proc. Amer. Control Conf., vol. 4. 2005, pp. 2293-2298.
- (2005) Proc. Amer. Control Conf , vol.4 , pp. 2293-2298
- Dutka, A.S.¹ Ordys, A.W.² Grimble, M.J.³

24
- 0034291333
- Dynamic surface control for a class of nonlinear systems
- Oct
- D. Swaroop, J. K. Hedrick, P. P. Yip, and J. C. Gerdes, "Dynamic surface control for a class of nonlinear systems," IEEE Trans. Autom. Control, vol. 45, no. 10, pp. 1893-1899, Oct. 2000.
- (2000) IEEE Trans. Autom. Control , vol.45 , Issue.10 , pp. 1893-1899
- Swaroop, D.¹ Hedrick, J.K.² Yip, P.P.³ Gerdes, J.C.⁴

25
- 0018011435
- Kronecker products and matrix calculus in system theory
- Sep
- J. W. Brewer, "Kronecker products and matrix calculus in system theory," IEEE Trans. Circuits Syst., vol. 25, no. 9, pp. 772-781, Sep. 1978.
- (1978) IEEE Trans. Circuits Syst , vol.25 , Issue.9 , pp. 772-781
- Brewer, J.W.¹

26
- 77957772128
- Optimal control of affine nonlinear discrete-time systems
- Jun
- T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear discrete-time systems," in Proc. Medit. Conf. Control Autom., Jun. 2009, pp. 1390-1395.
- (2009) Proc. Medit. Conf. Control Autom , pp. 1390-1395
- Dierks, T.¹ Jagannathan, S.²

27
- 0034439915
- H∞-optimal tracking control techniques for nonlinear underactuated systems
- Dec
- G. Toussaint, T. Basar, and F. Bullo, "H∞-optimal tracking control techniques for nonlinear underactuated systems," in Proc. IEEE Decision Control Conf., Dec. 2000, pp. 2078-2083.
- (2000) Proc. IEEE Decision Control Conf. , pp. 2078-2083
- Toussaint, G.¹ Basar, T.² Bullo, F.³

28
- 77957777969
- Optimal control of affine nonlinear continuous-time systems
- Jun
- T. Dierks and S. Jagannathan, "Optimal control of affine nonlinear continuous-time systems," in Proc. IEEE Amer. Control Conf., Jun. 2010, pp. 1568-1573.
- (2010) Proc. IEEE Amer. Control Conf. , pp. 1568-1573
- Dierks, T.¹ Jagannathan, S.²

29
- 0003785722
- Ph.D. dissertation Electr. Eng. Dept., Rensselaer Polytechnic Inst., Troy, NY
- R. W. Beard, "Improving the closed-loop performance of nonlinear systems," Ph.D. dissertation, Electr. Eng. Dept., Rensselaer Polytechnic Inst., Troy, NY, 1995.
- (1995) Improving the Closed-loop Performance of Nonlinear Systems
- Beard, R.W.¹

30
- 0031143730
- An analysis of temporal-difference learning with function approximation
- May
- J. N. Tsitsiklis and B. Van Roy, "An analysis of temporal-difference learning with function approximation," IEEE Trans. Autom. Control, vol. 42, no. 5, pp. 674-690, May 1997.
- (1997) IEEE Trans. Autom. Control , vol.42 , Issue.5 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.