SCOPUS Information Search Platform

Volume 17, Issue 11, 2013, Pages 2101-2108

Heuristic dynamic programming with internal goal representation

a The University of Rhode Island (United States)

Author keywords

Adaptive dynamic programming (ADP); Goal representation heuristic dynamic programming (GrHDP); Maze navigation path planning; Reinforcement learning (RL)

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; CONVERGENT SPEED; HEURISTIC DYNAMIC PROGRAMMING; NAVIGATION PROBLEM; REINFORCEMENT LEARNING APPROACH; SIMULATION ENVIRONMENT; STRUCTURE-BASED; SUM OF SQUARE ERRORS;

DYNAMIC PROGRAMMING; NAVIGATION; REINFORCEMENT LEARNING;

PROBLEM SOLVING;

EID: 84885936244 PISSN: 14327643 EISSN: 14337479 Source Type: Journal
DOI: 10.1007/s00500-013-1112-9 Document Type: Article

Times cited: 26

References (41)

1
- 84872295913
- Learning and control in virtual reality for machine intelligence
- Fang X, He H, Ni Z, Tang Y (2012) Learning and control in virtual reality for machine intelligence. In: International conference intelligent control and information processing (ICICIP'12), IEEE, Dalian, China, pp 63-67.
- (2012) In: International conference intelligent control and information processing (ICICIP'12), IEEE, Dalian, China , pp. 63-67
- Fang, X.¹ He, H.² Ni, Z.³ Tang, Y.⁴

2
- 79960115021
- Adaptive learning and control for mimo system based on adaptive dynamic programming
- Fu J, He H, Zhou X (2011) Adaptive learning and control for mimo system based on adaptive dynamic programming. IEEE Trans Neural Netw 22(7): 1133-1148.
- (2011) IEEE Trans Neural Netw , vol.22 , Issue.7 , pp. 1133-1148
- Fu, J.¹ He, H.² Zhou, X.³

3
- 79957876523
- An adaptive dynamic programming approach for closely-coupled mimo system control
- Fu J, He H, Liu Q, Ni Z (2011) An adaptive dynamic programming approach for closely-coupled mimo system control. In: Int symp neural networks (ISNN'11), pp 1-10.
- (2011) In: Int symp neural networks (ISNN'11) , pp. 1-10
- Fu, J.¹ He, H.² Liu, Q.³ Ni, Z.⁴

4
- 80052225230
- Adaptive dynamic programming with balanced weights seeking strategy
- Fu J, He H, Ni Z (2011) Adaptive dynamic programming with balanced weights seeking strategy. In: IEEE symposium on adaptive dynamic programming and reinforcement learning (ADPRL), IEEE symposium series on computational intelligence (SSCI), France.
- (2011) In: IEEE symposium on adaptive dynamic programming and reinforcement learning (ADPRL), IEEE symposium series on computational intelligence (SSCI), France
- Fu, J.¹ He, H.² Ni, Z.³

5
- 34047138362
- Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
- He P, Jagannathan S (2007) Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints. IEEE Trans Syst Man Cybern Part B-Cybern 37(2): 425-436.
- (2007) IEEE Trans Syst Man Cybern Part B-Cybern , vol.37 , Issue.2 , pp. 425-436
- He, P.¹ Jagannathan, S.²

6
- 84891585216
- New York: Wiley
- He H (2011) Self-adaptive systems for machine intelligence. Wiley, New York.
- (2011) Self-Adaptive Systems for Machine Intelligence
- He, H.¹

7
- 82655173881
- A three-network architecture for on-line learning and optimization based on adaptive dynamic programming
- He H, Ni Z, Fu J (2012) A three-network architecture for on-line learning and optimization based on adaptive dynamic programming. Neurocomputing 78(1): 3-13.
- (2012) Neurocomputing , vol.78 , Issue.1 , pp. 3-13
- He, H.¹ Ni, Z.² Fu, J.³

8
- 84885924144
- Hoboken: Wiley-IEEE Press
- He H, Ni Z, Zhao D (2012) Reinforcement learning and approximate dynamic programming for feedback control, ch. learning and optimization in hierarchical adaptive critic design. Wiley-IEEE Press, Hoboken.
- (2012) Reinforcement Learning and Approximate Dynamic Programming for Feedback Control, Ch. Learning and Optimization in Hierarchical Adaptive Critic Design
- He, H.¹ Ni, Z.² Zhao, D.³

9
- 84885914192
- Actor-critic design for on-line learning and optimization for machine intelligence
- He H, Ni Z, Prokhorov DV (2011) Actor-critic design for on-line learning and optimization for machine intelligence. In: International conference on cognitive and neural systems (ICCNS'11), Boston.
- (2011) In: International conference on cognitive and neural systems (ICCNS'11), Boston
- He, H.¹ Ni, Z.² Prokhorov, D.V.³

10
- 84872330793
- Data-driven learning and control with multiple critic networks
- He H, Ni Z, Zhao D (2012) Data-driven learning and control with multiple critic networks. In: The 10th world congress on intelligent control and automation (WCICA'12), pp 523-527.
- (2012) In: The 10th world congress on intelligent control and automation (WCICA'12) , pp. 523-527
- He, H.¹ Ni, Z.² Zhao, D.³

11
- 49149131955
- Beyond feedforward models trained by backpropagation: a practical training tool for a more efficient universal approximator
- Ilin R, Kozma R, Werbos P (2008) Beyond feedforward models trained by backpropagation: a practical training tool for a more efficient universal approximator. Neural Netw IEEE Trans 19(6): 929-937.
- (2008) Neural Netw IEEE Trans , vol.19 , Issue.6 , pp. 929-937
- Ilin, R.¹ Kozma, R.² Werbos, P.³

12
- 34548237758
- Cellular SRN trained by extended Kalman filter shows promise for ADP
- Ilin R, Kozma R, Werbos P (2006) Cellular SRN trained by extended Kalman filter shows promise for ADP. In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), IEEE, pp 506-510.
- (2006) In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), IEEE , pp. 506-510
- Ilin, R.¹ Kozma, R.² Werbos, P.³

13
- 34548725198
- Efficient learning in cellular simultaneous recurrent neural networks-the case of maze navigation problem
- Ilin R, Kozma R, Werbos P (2007) Efficient learning in cellular simultaneous recurrent neural networks-the case of maze navigation problem. In: IEEE international symposium on approximate dynamic programming and reinforcement learning (ADPRL), IEEE, pp 324-329.
- (2007) In: IEEE international symposium on approximate dynamic programming and reinforcement learning (ADPRL), IEEE , pp. 324-329
- Ilin, R.¹ Kozma, R.² Werbos, P.³

14
- 84891584860
- Hoboken: Wiley-IEEE Press
- Lewis F, Liu D (eds) (2013) Reinforcement learning and approximate dynamic programming for feedback control. Wiley-IEEE Press, Hoboken.
- (2013) Reinforcement Learning and Approximate Dynamic Programming for Feedback Control
- Lewis, F.¹ Liu, D.²

15
- 84862811991
- A boundedness result for the direct heuristic dynamic programming
- Liu F, Sun J, Si J, Guo W, Mei S (2012) A boundedness result for the direct heuristic dynamic programming. Neural Netw 32: 229-235.
- (2012) Neural Netw , vol.32 , pp. 229-235
- Liu, F.¹ Sun, J.² Si, J.³ Guo, W.⁴ Mei, S.⁵

16
- 84863467146
- Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
- Liu D, Wang D, Zhao D, Wei Q, Jin N (2012) Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming. IEEE Trans Autom Sci Eng 9(3): 628-634.
- (2012) IEEE Trans Autom Sci Eng , vol.9 , Issue.3 , pp. 628-634
- Liu, D.¹ Wang, D.² Zhao, D.³ Wei, Q.⁴ Jin, N.⁵

17
- 84881555023
- Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
- Liu D, Wei Q (2013) Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems. IEEE Trans Cybern 43(2): 779-789.
- (2013) IEEE Trans Cybern , vol.43 , Issue.2 , pp. 779-789
- Liu, D.¹ Wei, Q.²

18
- 0004255908
- New York: McGraw-Hill, Inc
- Mitchell TM (1997) Machine learning. McGraw-Hill, Inc, New York.
- (1997) Machine Learning
- Mitchell, T.M.¹

19
- 84876149222
- Adaptive learning in tracking control based on the dual critic network design
- Ni Z, He H, Wen J (2013) Adaptive learning in tracking control based on the dual critic network design. IEEE Trans Neural Netw Learn Syst 24(6): 913-928.
- (2013) IEEE Trans Neural Netw Learn Syst , vol.24 , Issue.6 , pp. 913-928
- Ni, Z.¹ He, H.² Wen, J.³

20
- 84891525568
- Real-time tracking control on adaptive critic design with uniformly ultimately bounded condition
- IEEE symposium series on computational intelligence (SSCI), USA
- Ni Z, Fang X, He H, Zhao D, Xu X (2013) Real-time tracking control on adaptive critic design with uniformly ultimately bounded condition. In: IEEE symposium on adaptive dynamic programming and reinforcement learning (ADPRL'13). IEEE symposium series on computational intelligence (SSCI), USA.
- (2013) In: IEEE symposium on adaptive dynamic programming and reinforcement learning (ADPRL'13)
- Ni, Z.¹ Fang, X.² He, H.³ Zhao, D.⁴ Xu, X.⁵

21
- 80054754525
- An online actor-critic learning approach with Levenberg-Marquardt algorithm
- Ni Z, He H, Prokhorov DV, Fu J (2011) An online actor-critic learning approach with Levenberg-Marquardt algorithm. In: The 2011 international joint conference on neural networks (IJCNN), IEEE, pp 2333-2340.
- (2011) In: The 2011 international joint conference on neural networks (IJCNN), IEEE , pp. 2333-2340
- Ni, Z.¹ He, H.² Prokhorov, D.V.³ Fu, J.⁴

22
- 84885934212
- Adaptive learning with goal generator network based on heuristic dynamic programming
- Ni Z, He H, Prokhorov DV (2012) Adaptive learning with goal generator network based on heuristic dynamic programming. In: International conference on cognitive and neural systems (ICCNS'12), Boston.
- (2012) In: International conference on cognitive and neural systems (ICCNS'12), Boston
- Ni, Z.¹ He, H.² Prokhorov, D.V.³

23
- 84887990637
- Goal representation heuristic dynamic programming on maze navigation
- Ni Z, He H, Wen J, Xu X (2013) Goal representation heuristic dynamic programming on maze navigation. IEEE Trans Neural Netw Learn Syst (to be published).
- (2013) IEEE Trans Neural Netw Learn Syst (to be published)
- Ni, Z.¹ He, H.² Wen, J.³ Xu, X.⁴

24
- 84865079504
- Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming
- Ni Z, He H, Zhao D, Prokhorov D (2012) Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming. In: The 2012 international joint conference on neural networks (IJCNN), IEEE, pp 1-8.
- (2012) In: The 2012 international joint conference on neural networks (IJCNN), IEEE , pp. 1-8
- Ni, Z.¹ He, H.² Zhao, D.³ Prokhorov, D.⁴

25
- 0005942467
- Neural network design for J function approximation in dynamic programming
- Pang X, Werbos PJ (1996) Neural network design for J function approximation in dynamic programming. In: Mathematical modelling and scientific computing. http://arxiv.org/pdf/adap-org/9806001.pdf
- (1996) In: Mathematical modelling and scientific computing
- Pang, X.¹ Werbos, P.J.²

26
- 0003448868
- PhD dissertation
- Prokhorov DV (1997) Adaptive critic designs and their applications. PhD dissertation.
- (1997) Adaptive critic designs and their applications
- Prokhorov, D.V.¹

27
- 0029592634
- Adaptive critic designs: a case study for neurocontrol
- Prokhorov DV, Santiago RA, Wunsch DC II (1995) Adaptive critic designs: a case study for neurocontrol. Neural Netw 8(9): 1367-1372.
- (1995) Neural Netw , vol.8 , Issue.9 , pp. 1367-1372
- Prokhorov, D.V.¹ Santiago, R.A.² Wunsch II, D.C.³

28
- 0031236002
- Adaptive critic designs
- Prokhorov D, Wunsch D (1997) Adaptive critic designs. IEEE Trans Neural Netw 8(5): 997-1007.
- (1997) IEEE Trans Neural Netw , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.¹ Wunsch, D.²

29
- 0035273403
- Online learning control by association and reinforcement
- Si J, Wang Y-T (2001) Online learning control by association and reinforcement. IEEE Trans Neural Netw 12(2): 264-276.
- (2001) IEEE Trans Neural Netw , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.-T.²

30
- 84921399937
- New York: Wiley
- Si J, Barto AG, Powell WB, Wunsch DC (eds) (2004) Handbook of learning and approximate dynamic programming. Wiley, New York.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.C.⁴

31
- 0004102479
- MIT Press, Cambridge
- Sutton R, Barto A (1998) Reinforcement learning: an introduction. MIT Press, Cambridge.
- (1998) Reinforcement learning: An introduction
- Sutton, R.¹ Barto, A.²

32
- 84864489666
- Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
- Wang D, Liu D, Wei Q, Zhao D, Jin N (2012) Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming. Automatica 48: 1825-1832.
- (2012) Automatica , vol.48 , pp. 1825-1832
- Wang, D.¹ Liu, D.² Wei, Q.³ Zhao, D.⁴ Jin, N.⁵

33
- 0025229247
- Consistency of HDP applied to a simple reinforcement learning problem
- Werbos PJ (1990) Consistency of HDP applied to a simple reinforcement learning problem. Neural Netw 3(2): 179-189.
- (1990) Neural Netw , vol.3 , Issue.2 , pp. 179-189
- Werbos, P.J.¹

34
- 84887994897
- van Nostrand Reinhold, New York
- Werbos PJ (1992) Handbook of intelligent control, ch. Approximate dynamic programming for real-time control and neural modeling. van Nostrand Reinhold, New York.
- (1992) Handbook of intelligent control, ch. Approximate dynamic programming for real-time control and neural modeling
- Werbos, P.J.¹

35
- 49049091767
- ADP: the key direction for future research in intelligent control and understanding brain intelligence
- Werbos PJ (2008) ADP: the key direction for future research in intelligent control and understanding brain intelligence. IEEE Trans Syst Man Cybern Part B-Cybern 38(4): 898-900.
- (2008) IEEE Trans Syst Man Cybern Part B-Cybern , vol.38 , Issue.4 , pp. 898-900
- Werbos, P.J.¹

36
- 67349247013
- Intelligence in the brain: a theory of how it works and how to build it
- Werbos PJ (2009) Intelligence in the brain: a theory of how it works and how to build it. Neural Netw 22(3): 200-212.
- (2009) Neural Netw , vol.22 , Issue.3 , pp. 200-212
- Werbos, P.J.¹

37
- 84876119970
- Hoboken: Wiley-IEEE Press
- Werbos P (2013) Reinforcement learning and approximate dynamic programming for feedback control, ch. reinforcement learning and approximate dynamic programming (RLADP)-foundations, common misconceptions and challenges ahead. Wiley-IEEE Press, Hoboken.
- (2013) Reinforcement Learning and Approximate Dynamic Programming for Feedback Control, Ch. Reinforcement Learning and Approximate Dynamic Programming (RLADP)-Foundations, Common Misconceptions and Challenges Ahead
- Werbos, P.¹

38
- 0030421566
- Generalized maze navigation: SRN critics solve what feedforward or Hebbian nets cannot
- Werbos P, Pang X (1996) Generalized maze navigation: SRN critics solve what feedforward or Hebbian nets cannot. In: Systems, man, and cybernetics, 1996, IEEE international conference on, vol 3, pp 1764-1769.
- (1996) In: Systems, man, and cybernetics, 1996, IEEE international conference on , vol.3 , pp. 1764-1769
- Werbos, P.¹ Pang, X.²

39
- 34548771972
- Two novel on-policy reinforcement learning algorithms based on TD(λ)-methods
- Wiering M, van Hasselt H (2007) Two novel on-policy reinforcement learning algorithms based on TD(λ)-methods. In: IEEE international symposium on approximate dynamic programming and reinforcement learning (ADPRL), IEEE, pp 280-287.
- (2007) In: IEEE international symposium on approximate dynamic programming and reinforcement learning (ADPRL), IEEE , pp. 280-287
- Wiering, M.¹ van Hasselt, H.²

40
- 0033698503
- The cellular simultaneous recurrent network adaptive critic design for the generalized maze problem has a simple closed-form solution
- Wunsch D (2000) The cellular simultaneous recurrent network adaptive critic design for the generalized maze problem has a simple closed-form solution. In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), IEEE, vol 3, pp 79-82.
- (2000) In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), IEEE , vol.3 , pp. 79-82
- Wunsch, D.¹

41
- 70349615619
- Direct heuristic dynamic programming for nonlinear tracking control with filtered tracking error
- Yang L, Si J, Tsakalis KS, Rodriguez AA (2009) Direct heuristic dynamic programming for nonlinear tracking control with filtered tracking error. IEEE Trans Syst Man Cybern Part B-Cybern 39(6): 1617-1622.
- (2009) IEEE Trans Syst Man Cybern Part B-Cybern , vol.39 , Issue.6 , pp. 1617-1622
- Yang, L.¹ Si, J.² Tsakalis, K.S.³ Rodriguez, A.A.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.