메뉴 건너뛰기




Volumn 17, Issue 11, 2013, Pages 2101-2108

Heuristic dynamic programming with internal goal representation

Author keywords

Adaptive dynamic programming (ADP); Goal representation heuristic dynamic programming (GrHDP); Maze navigation path planning; Reinforcement learning (RL)

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; CONVERGENT SPEED; HEURISTIC DYNAMIC PROGRAMMING; NAVIGATION PROBLEM; REINFORCEMENT LEARNING APPROACH; SIMULATION ENVIRONMENT; STRUCTURE-BASED; SUM OF SQUARE ERRORS;

EID: 84885936244     PISSN: 14327643     EISSN: 14337479     Source Type: Journal    
DOI: 10.1007/s00500-013-1112-9     Document Type: Article
Times cited : (26)

References (41)
  • 2
    • 79960115021 scopus 로고    scopus 로고
    • Adaptive learning and control for mimo system based on adaptive dynamic programming
    • Fu J, He H, Zhou X (2011) Adaptive learning and control for mimo system based on adaptive dynamic programming. IEEE Trans Neural Netw 22(7): 1133-1148.
    • (2011) IEEE Trans Neural Netw , vol.22 , Issue.7 , pp. 1133-1148
    • Fu, J.1    He, H.2    Zhou, X.3
  • 3
    • 79957876523 scopus 로고    scopus 로고
    • An adaptive dynamic programming approach for closely-coupled mimo system control
    • Fu J, He H, Liu Q, Ni Z (2011) An adaptive dynamic programming approach for closely-coupled mimo system control. In: Int symp neural networks (ISNN'11), pp 1-10.
    • (2011) In: Int symp neural networks (ISNN'11) , pp. 1-10
    • Fu, J.1    He, H.2    Liu, Q.3    Ni, Z.4
  • 5
    • 34047138362 scopus 로고    scopus 로고
    • Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input contraints
    • He P, Jagannathan S (2007) Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input contraints. IEEE Trans Syst Man Cybern Part B-Cybern 37(2): 425-436.
    • (2007) IEEE Trans Syst Man Cybern Part B-Cybern , vol.37 , Issue.2 , pp. 425-436
    • He, P.1    Jagannathan, S.2
  • 7
    • 82655173881 scopus 로고    scopus 로고
    • A three-network architecture for on-line learning and optimization based on adaptive dynamic programming
    • He H, Ni Z, Fu J (2012) A three-network architecture for on-line learning and optimization based on adaptive dynamic programming. Neurocomputing 78(1): 3-13.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 3-13
    • He, H.1    Ni, Z.2    Fu, J.3
  • 11
    • 49149131955 scopus 로고    scopus 로고
    • Beyond feedforward models trained by backpropagation: a practical training tool for a more efficient universal approximator
    • Ilin R, Kozma R, Werbos P (2008) Beyond feedforward models trained by backpropagation: a practical training tool for a more efficient universal approximator. Neural Netw IEEE Trans 19(6): 929-937.
    • (2008) Neural Netw IEEE Trans , vol.19 , Issue.6 , pp. 929-937
    • Ilin, R.1    Kozma, R.2    Werbos, P.3
  • 15
    • 84862811991 scopus 로고    scopus 로고
    • A boundedness result for the direct heuristic dynamic programming
    • Liu F, Sun J, Si J, Guo W, Mei S (2012) A boundedness result for the direct heuristic dynamic programming. Neural Netw 32: 229-235.
    • (2012) Neural Netw , vol.32 , pp. 229-235
    • Liu, F.1    Sun, J.2    Si, J.3    Guo, W.4    Mei, S.5
  • 16
    • 84863467146 scopus 로고    scopus 로고
    • Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
    • Liu D, Wang D, Zhao D, Wei Q, Jin N (2012) Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming. IEEE Trans Autom Sci Eng 9(3): 628-634.
    • (2012) IEEE Trans Autom Sci Eng , vol.9 , Issue.3 , pp. 628-634
    • Liu, D.1    Wang, D.2    Zhao, D.3    Wei, Q.4    Jin, N.5
  • 17
    • 84881555023 scopus 로고    scopus 로고
    • Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems
    • Liu D, Wei Q (2013) Finite-approximation-error-based optimal control approach for discrete-time nonlinear systems. IEEE Trans Cybern 43(2): 779-789.
    • (2013) IEEE Trans Cybern , vol.43 , Issue.2 , pp. 779-789
    • Liu, D.1    Wei, Q.2
  • 19
    • 84876149222 scopus 로고    scopus 로고
    • Adaptive learning in tracking control based on the dual critic network design
    • Ni Z, He H, Wen J (2013) Adaptive learning in tracking control based on the dual critic network design. IEEE Trans Neural Netw Learn Syst 6(24): 913-928.
    • (2013) IEEE Trans Neural Netw Learn Syst , vol.6 , Issue.24 , pp. 913-928
    • Ni, Z.1    He, H.2    Wen, J.3
  • 25
    • 0005942467 scopus 로고    scopus 로고
    • Neural network design for j function approximation in dynamic programming
    • Pang X, Werbos PJ (1996) Neural network design for j function approximation in dynamic programming. In: Mathematical modelling and scientific computing. http://arxiv. org/pdf/adap-org/9806001. pdf.
    • (1996) In: Mathematical modelling and scientific computing
    • Pang, X.1    Werbos, P.J.2
  • 27
    • 0029592634 scopus 로고
    • Adaptive critic designs: a case study for neurocontrol
    • Prokhorov DV, Santiago RA, Wunsch DC II (1995) Adaptive critic designs: a case study for neurocontrol. Neural Netw 8(9): 1367-1372.
    • (1995) Neural Netw , vol.8 , Issue.9 , pp. 1367-1372
    • Prokhorov, D.V.1    Santiago, R.A.2    Wunsch II, D.C.3
  • 29
    • 0035273403 scopus 로고    scopus 로고
    • Online learning control by association and reinforcement
    • Si J, Wang Y-T (2001) Online learning control by association and reinforcement. IEEE Trans Neural Netw 12(2): 264-276.
    • (2001) IEEE Trans Neural Netw , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2
  • 32
    • 84864489666 scopus 로고    scopus 로고
    • Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    • Wang D, Liu D, Wei Q, Zhao D, Jin N (2012) Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming. Automatica 48: 1825-1832.
    • (2012) Automatica , vol.48 , pp. 1825-1832
    • Wang, D.1    Liu, D.2    Wei, Q.3    Zhao, D.4    Jin, N.5
  • 33
    • 0025229247 scopus 로고
    • Consistency of HDP applied to a simple reinforcement learning problem
    • Werbos PJ (1990) Consistency of HDP applied to a simple reinforcement learning problem. Neural Netw 3(2): 179-189.
    • (1990) Neural Netw , vol.3 , Issue.2 , pp. 179-189
    • Werbos, P.J.1
  • 35
    • 49049091767 scopus 로고    scopus 로고
    • Adp: the key direction for future research in intelligent control and understanding brain intelligence
    • Werbos PJ (2008) Adp: the key direction for future research in intelligent control and understanding brain intelligence. IEEE Trans Syst Man Cybern Part B-Cybern 38(4): 898-900.
    • (2008) IEEE Trans Syst Man Cybern Part B-Cybern , vol.38 , Issue.4 , pp. 898-900
    • Werbos, P.J.1
  • 36
    • 67349247013 scopus 로고    scopus 로고
    • Intelligence in the brain: a theory of how it works and how to build it
    • Werbos PJ (2009) Intelligence in the brain: a theory of how it works and how to build it. Neural Netw 22(3): 200-212.
    • (2009) Neural Netw , vol.22 , Issue.3 , pp. 200-212
    • Werbos, P.J.1
  • 40
    • 0033698503 scopus 로고    scopus 로고
    • The cellular simultaneous recurrent network adaptive critic design for the generalized maze problem has a simple closed-form solution
    • Wunsch D (2000) The cellular simultaneous recurrent network adaptive critic design for the generalized maze problem has a simple closed-form solution. In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), IEEE, vol 3, pp 79-82.
    • (2000) In: Proceedings of the IEEE international joint conference on neural networks (IJCNN), IEEE , vol.3 , pp. 79-82
    • Wunsch, D.1
  • 41
    • 70349615619 scopus 로고    scopus 로고
    • Direct heuristic dynamic programming for nonlinear tracking conrol with filtered tracking error
    • Yang L, Si J, Tsakalis KS, Rodriguez AA (2009) Direct heuristic dynamic programming for nonlinear tracking conrol with filtered tracking error. IEEE Trans Syst Man Cybern Part B-Cybern 39(6): 1617-1622.
    • (2009) IEEE Trans Syst Man Cybern Part B-Cybern , vol.39 , Issue.6 , pp. 1617-1622
    • Yang, L.1    Si, J.2    Tsakalis, K.S.3    Rodriguez, A.A.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.