SCOPUS Information Search Platform

IEEE SSCI 2011: Symposium Series on Computational Intelligence - ADPRL 2011: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning

2011, Pages 242-249

Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems

(3) Liu, Derong (a); Wang, Ding (a); Zhao, Dongbin (a)

(a) INSTITUTE OF AUTOMATION (China)

Author keywords

Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming; globalized dual heuristic programming; intelligent control; neural dynamic programming; neural networks; optimal control

Indexed keywords

ADAPTIVE CRITIC DESIGNS; ADAPTIVE DYNAMIC PROGRAMMING; APPROXIMATE DYNAMIC PROGRAMMING; DUAL HEURISTIC PROGRAMMING; NEURAL DYNAMIC PROGRAMMING; OPTIMAL CONTROLS;

ADAPTIVE CONTROL SYSTEMS; ALGORITHMS; CELLULAR RADIO SYSTEMS; CONVERGENCE OF NUMERICAL METHODS; COST FUNCTIONS; DIGITAL CONTROL SYSTEMS; DISCRETE TIME CONTROL SYSTEMS; HEURISTIC PROGRAMMING; INTELLIGENT CONTROL; NEURAL NETWORKS; OPTIMIZATION; REINFORCEMENT LEARNING;

DYNAMIC PROGRAMMING;

EID: 80052212355 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2011.5967357 Document Type: Conference Paper

Times cited : (13)

References (36)

1
- 85012688561
- Princeton NJ: Princeton University Press
- R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton University Press, 1957.
- (1957) Dynamic Programming
- Bellman, R.E.¹

2
- 85056061207
- Boca Raton FL: CRC Press
- J. Sarangapani, Neural Network Control of Nonlinear Discrete-time Systems. Boca Raton, FL: CRC Press, 2006.
- (2006) Neural Network Control of Nonlinear Discrete-time Systems
- Sarangapani, J.¹

3
- 84892309675
- London: Springer-Verlag
- W. Yu, Recent Advances in Intelligent Control Systems. London: Springer-Verlag, 2009.
- (2009) Recent Advances in Intelligent Control Systems
- Yu, W.¹

4
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, ch. 13
- P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, 1992, ch. 13.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

5
- 49049091767
- ADP: The key direction for future research in intelligent control and understanding brain intelligence
- Aug.
- P. J. Werbos, "ADP: The key direction for future research in intelligent control and understanding brain intelligence," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 898-900, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 898-900
- Werbos, P.J.¹

6
- 67349247013
- Intelligence in the brain: A theory of how it works and how to build it
- Apr.
- P. J. Werbos, "Intelligence in the brain: a theory of how it works and how to build it," Neural Networks, vol. 22, no. 3, pp. 200-212, Apr. 2009.
- (2009) Neural Networks , vol.22 , Issue.3 , pp. 200-212
- Werbos, P.J.¹

7
- 34249833101
- Q-learning
- C. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

8
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

9
- 0031236002
- Adaptive critic designs
- PII S1045922797052430
- D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sept. 1997. (Pubitemid 127763331)
- (1997) IEEE Transactions on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch II, D.C.²

10
- 0036588686
- Adaptive dynamic programming
- DOI 10.1109/TSMCC.2002.801727
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 32, no. 2, pp. 140-153, May 2002. (Pubitemid 35289398)
- (2002) IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

11
- 0035273403
- On-line learning control by association and reinforcement
- DOI 10.1109/72.914523, PII S1045922701014047
- J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001. (Pubitemid 32371483)
- (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.-T.²

12
- 84921399937
- New York: IEEE Press/Wiley
- J. Si, A. G. Barto, W. B. Powell, and D. C. Wunsch, Eds., Handbook of Learning and Approximate Dynamic Programming. New York: IEEE Press/Wiley, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.C.⁴

13
- 66449130966
- Adaptive dynamic programming: An introduction
- May
- F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: an introduction," IEEE Computational Intelligence Magazine, vol. 4, no. 2, pp. 39-47, May 2009.
- (2009) IEEE Computational Intelligence Magazine , vol.4 , Issue.2 , pp. 39-47
- Wang, F.Y.¹ Zhang, H.² Liu, D.³

14
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- July
- F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits and Systems Magazine, vol. 9, no. 3, pp. 32-50, July 2009.
- (2009) IEEE Circuits and Systems Magazine , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

15
- 0034863083
- Action-dependent adaptive critic designs
- D. Liu, X. Xiong, and Y. Zhang, "Action-dependent adaptive critic designs," in Proc. International Joint Conference on Neural Networks, Washington, DC, July 2001, vol. 2, pp. 990-995. (Pubitemid 32805299)
- (2001) Proceedings of the International Joint Conference on Neural Networks , vol.2 , pp. 990-995
- Liu, D.¹ Xiong, X.² Zhang, Y.³

16
- 56349120789
- ε-adaptive dynamic programming for discrete-time systems
- Hong Kong, June
- D. Liu and N. Jin, "ε-adaptive dynamic programming for discrete-time systems," in Proc. International Joint Conference on Neural Networks, Hong Kong, June 2008, pp. 1417-1424.
- (2008) Proc. International Joint Conference on Neural Networks , pp. 1417-1424
- Liu, D.¹ Jin, N.²

17
- 84954411226
- Adaptive critic based neurocontroller for turbogenerators with global dual heuristic programming
- Singapore, Jan.
- G. K. Venayagamoorthy, D. C. Wunsch, and R. G. Harley, "Adaptive critic based neurocontroller for turbogenerators with global dual heuristic programming," in Proc. IEEE PES Winter Meet., Singapore, Jan. 2000, vol. 1, pp. 291-294.
- (2000) Proc. IEEE PES Winter Meet. , vol.1 , pp. 291-294
- Venayagamoorthy, G.K.¹ Wunsch, D.C.² Harley, R.G.³

18
- 0036565019
- Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator
- DOI 10.1109/TNN.2002.1000146, PII S1045922702044417
- G. K. Venayagamoorthy, R. G. Harley, and D. C. Wunsch, "Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator," IEEE Trans. Neural Netw., vol. 13, no. 3, pp. 764-773, May 2002. (Pubitemid 34669664)
- (2002) IEEE Transactions on Neural Networks , vol.13 , Issue.3 , pp. 764-773
- Venayagamoorthy, G.K.¹ Harley, R.G.² Wunsch, D.C.³

19
- 0242337541
- Adaptive-critic-based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks
- Sept./Oct.
- J. W. Park, R. G. Harley, and G. K. Venayagamoorthy, "Adaptive-critic-based optimal neurocontrol for synchronous generators in a power system using MLP/RBF neural networks," IEEE Trans. Ind. Appl., vol. 39, no. 5, pp. 1529-1540, Sept./Oct. 2003.
- (2003) IEEE Trans. Ind. Appl. , vol.39 , Issue.5 , pp. 1529-1540
- Park, J.W.¹ Harley, R.G.² Venayagamoorthy, G.K.³

20
- 17644391408
- Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor
- Apr.
- G. G. Yen and P. G. DeLima, "Improving the performance of globalized dual heuristic programming for fault tolerant control through an online learning supervisor," IEEE Trans. Automation Science and Engineering, vol. 2, no. 2, pp. 121-131, Apr. 2005.
- (2005) IEEE Trans. Automation Science and Engineering , vol.2 , Issue.2 , pp. 121-131
- Yen, G.G.¹ Delima, P.G.²

21
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- DOI 10.1016/j.automatica.2004.11.034, PII S0005109805000105
- M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, May 2005. (Pubitemid 40352391)
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

22
- 33846781133
- A neural network solution for fixed-final time optimal control of nonlinear systems
- DOI 10.1016/j.automatica.2006.09.021, PII S0005109806004250
- T. Cheng, F. L. Lewis, and M. Abu-Khalaf, "A neural network solution for fixed-final time optimal control of nonlinear systems," Automatica, vol. 43, no. 3, pp. 482-490, Mar. 2007. (Pubitemid 46209051)
- (2007) Automatica , vol.43 , Issue.3 , pp. 482-490
- Cheng, T.¹ Lewis, F.L.² Abu-Khalaf, M.³

23
- 0030196717
- Adaptive-critic-based neural networks for aircraft optimal control
- S. N. Balakrishnan and V. Biega, "Adaptive-critic-based neural networks for aircraft optimal control," Journal of Guidance, Control, and Dynamics, vol. 19, no. 4, pp. 893-898, July-Aug. 1996. (Pubitemid 126539437)
- (1996) Journal of Guidance, Control, and Dynamics , vol.19 , Issue.4 , pp. 893-898
- Balakrishnan, S.N.¹ Biega, V.²

24
- 0035427378
- Adaptive-critic based optimal neuro control synthesis for distributed parameter systems
- DOI 10.1016/S0005-1098(01)00093-0, PII S0005109801000930
- R. Padhi, S. N. Balakrishnan, and T. Randolph, "Adaptive-critic based optimal neuro control synthesis for distributed parameter systems," Automatica, vol. 37, no. 8, pp. 1223-1234, Aug. 2001. (Pubitemid 32610253)
- (2001) Automatica , vol.37 , Issue.8 , pp. 1223-1234
- Padhi, R.¹ Balakrishnan, S.N.² Randolph, T.³

25
- 0036641793
- State-constrained agile missile control with adaptive-critic-based neural networks
- DOI 10.1109/TCST.2002.1014669, PII S1063653602053605
- D. Han and S. N. Balakrishnan, "State-constrained agile missile control with adaptive critic-based neural networks," IEEE Trans. Control Systems Technology, vol. 10, no. 4, pp. 481-489, July 2002. (Pubitemid 34798672)
- (2002) IEEE Transactions on Control Systems Technology , vol.10 , Issue.4 , pp. 481-489
- Han, D.¹ Balakrishnan, S.N.²

26
- 33751238181
- A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems
- DOI 10.1016/j.neunet.2006.08.010, PII S0893608006001912
- R. Padhi, N. Unnikrishnan, X. Wang, and S. N. Balakrishnan, "A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems," Neural Networks, vol. 19, no. 10, pp. 1648-1660, Dec. 2006. (Pubitemid 44793175)
- (2006) Neural Networks , vol.19 , Issue.10 , pp. 1648-1660
- Padhi, R.¹ Unnikrishnan, N.² Wang, X.³ Balakrishnan, S.N.⁴

27
- 49049111594
- Issues on stability of ADP feedback controllers for dynamic systems
- Aug.
- S. N. Balakrishnan, J. Ding, and F. L. Lewis, "Issues on stability of ADP feedback controllers for dynamic systems," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 913-917, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 913-917
- Balakrishnan, S.N.¹ Ding, J.² Lewis, F.L.³

28
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- Aug.
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

29
- 49049119493
- A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
- Aug.
- H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 937-942
- Zhang, H.¹ Wei, Q.² Luo, Y.³

30
- 70349253929
- Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
- Sept.
- H. Zhang, Y. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sept. 2009.
- (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.¹ Luo, Y.² Liu, D.³

31
- 67349145396
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- Apr.
- D. Vrabie and F. Lewis, "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Networks, vol. 22, no. 3, pp. 237-246, Apr. 2009.
- (2009) Neural Networks , vol.22 , Issue.3 , pp. 237-246
- Vrabie, D.¹ Lewis, F.²

32
- 68149180889
- Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
- July-Aug.
- T. Dierks, B. T. Thumati, and J. Sarangapani, "Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence," Neural Networks, vol. 22, no. 5-6, pp. 851-860, July-Aug. 2009.
- (2009) Neural Networks , vol.22 , Issue.5-6 , pp. 851-860
- Dierks, T.¹ Thumati, B.T.² Sarangapani, J.³

33
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- May
- K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, May 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

34
- 77955423822
- Model-free H∞ control design for unknown linear discrete-time systems via Q-learning with LMI
- Aug.
- J. H. Kim and F. L. Lewis, "Model-free H∞ control design for unknown linear discrete-time systems via Q-learning with LMI," Automatica, vol. 46, no. 8, pp. 1320-1326, Aug. 2010.
- (2010) Automatica , vol.46 , Issue.8 , pp. 1320-1326
- Kim, J.H.¹ Lewis, F.L.²

35
- 77950853735
- Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics
- Shanghai, P. R. China, Dec.
- T. Dierks and J. Sarangapani, "Optimal tracking control of affine nonlinear discrete-time systems with unknown internal dynamics," in Joint 48th IEEE Conference on Decision and Control and 28th Chinese Control Conference, Shanghai, P. R. China, Dec. 2009, pp. 6750-6755.
- (2009) Joint 48th IEEE Conference on Decision and Control and 28th Chinese Control Conference , pp. 6750-6755
- Dierks, T.¹ Sarangapani, J.²

36
- 57749111482
- Neural-network-based state feedback control of a nonlinear discrete-time system in nonstrict feedback form
- Dec.
- J. Sarangapani and P. He, "Neural-network-based state feedback control of a nonlinear discrete-time system in nonstrict feedback form," IEEE Trans. Neural Netw., vol. 19, no. 12, pp. 2073-2087, Dec. 2008.
- (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.12 , pp. 2073-2087
- Sarangapani, J.¹ He, P.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.