Volume , Issue , 2012, Pages

Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE CRITIC DESIGNS; COMPARATIVE STUDIES; CONTROL SYSTEM PERFORMANCE; CRITIC NETWORK; HEURISTIC DYNAMIC PROGRAMMING; HIERARCHICAL STRUCTURES; REINFORCEMENT LEARNING CONTROL

EID: 84865079504     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IJCNN.2012.6252524     Document Type: Conference Paper
Times cited : (21)

References (25)
  • 2
    • 67349247013
    • Intelligence in the brain: A theory of how it works and how to build it
    • P. J. Werbos, "Intelligence in the brain: A theory of how it works and how to build it," Neural Networks, pp. 200-212, 2009.
    • (2009) Neural Networks , pp. 200-212
    • Werbos, P.J.1
  • 5
    • 78651311269
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • F. Y. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound," IEEE Transactions on Neural Networks, vol. 22, no. 1, pp. 24-36, 2011.
    • (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.1 , pp. 24-36
    • Wang, F.Y.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 6
    • 79960115021
    • Adaptive learning and control for MIMO system based on adaptive dynamic programming
    • J. Fu, H. He, and X. Zhou, "Adaptive learning and control for MIMO system based on adaptive dynamic programming," IEEE Transactions on Neural Networks, vol. 22, no. 7, pp. 1133-1148, 2011.
    • (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.7 , pp. 1133-1148
    • Fu, J.1    He, H.2    Zhou, X.3
  • 7
    • 51749084880
    • DHP-based wide-area coordinating control of a power system with a large wind farm and multiple FACTS devices
    • W. Qiao, G. Venayagamoorthy, and R. Harley, "DHP-based wide-area coordinating control of a power system with a large wind farm and multiple FACTS devices," in Proc. IEEE Int. Conf. Neural Netw., pp. 2093-2098, 2007.
    • (2007) Proc. IEEE Int. Conf. Neural Netw. , pp. 2093-2098
    • Qiao, W.1    Venayagamoorthy, G.2    Harley, R.3
  • 8
    • 49049116711
    • Comparison of adaptive critics and classical approaches based wide area controllers for a power system
    • S. Ray, G. K. Venayagamoorthy, B. Chaudhuri, and R. Majumder, "Comparison of adaptive critics and classical approaches based wide area controllers for a power system," IEEE Trans. on Syst. Man, Cybern., Part B, vol. 38, no. 4, pp. 1002-1007, 2008.
    • (2008) IEEE Trans. on Syst. Man, Cybern., Part B , vol.38 , Issue.4 , pp. 1002-1007
    • Ray, S.1    Venayagamoorthy, G.K.2    Chaudhuri, B.3    Majumder, R.4
  • 9
    • 82655173881
    • A three-network architecture for on-line learning and optimization based on adaptive dynamic programming
    • H. He, Z. Ni, and J. Fu, "A three-network architecture for on-line learning and optimization based on adaptive dynamic programming," Neurocomputing, vol. 78, no. 1, pp. 3-13, 2012.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 3-13
    • He, H.1    Ni, Z.2    Fu, J.3
  • 10
    • 66449130966
    • Adaptive dynamic programming: An introduction
    • F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intel. Mag., vol. 4, no. 2, pp. 39-47, 2009.
    • (2009) IEEE Comput. Intel. Mag. , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.Y.1    Zhang, H.2    Liu, D.3
  • 11
    • 70349116541
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Sys. Mag., vol. 9, no. 3, pp. 32-50, 2009.
    • (2009) IEEE Circuits Sys. Mag. , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 13
    • 0033750123
    • Neurocontroller alternatives for fuzzy ball-and-beam systems with nonuniform nonlinear friction
    • P. H. Eaton, D. V. Prokhorov, and D. C. Wunsch II, "Neurocontroller alternatives for fuzzy ball-and-beam systems with nonuniform nonlinear friction," IEEE Trans. Neural Netw., vol. 11, no. 2, pp. 423-435, 2000.
    • (2000) IEEE Trans. Neural Netw. , vol.11 , Issue.2 , pp. 423-435
    • Eaton, P.H.1    Prokhorov, D.V.2    Wunsch II, D.C.3
  • 14
    • 0035273403
    • On-line learning control by association and reinforcement
    • J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. on Neural Netw., vol. 12, no. 2, pp. 264-276, 2001.
    • (2001) IEEE Trans. on Neural Netw. , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 17
    • 49049119493
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm
    • H. G. Zhang, Q. L. Wei, and Y. H. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Transactions on Systems, Man, and Cybernetics, Part B, vol. 38, no. 4, pp. 937-942, 2008.
    • (2008) IEEE Transactions on Systems, Man, and Cybernetics, Part B , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.G.1    Wei, Q.L.2    Luo, Y.H.3
  • 18
    • 70349253929
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • H. G. Zhang, Y. H. Luo, and D. Liu, "Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Transactions on Neural Networks, vol. 20, no. 9, pp. 1490-1503, 2009.
    • (2009) IEEE Transactions on Neural Networks , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.G.1    Luo, Y.H.2    Liu, D.3
  • 19
    • 78650805234
    • An iterative approximate dynamic programming method to solve for a class of nonlinear zero-sum differential games
    • H. G. Zhang, Q. L. Wei, and D. Liu, "An iterative approximate dynamic programming method to solve for a class of nonlinear zero-sum differential games," Automatica, vol. 47, no. 1, pp. 207-214, 2011.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.G.1    Wei, Q.L.2    Liu, D.3
  • 20
    • 0037561866
    • Dual heuristic programming excitation neurocontrol for generators in a multimachine power system
    • G. K. Venayagamoorthy, R. G. Harley, and D. C. Wunsch, "Dual heuristic programming excitation neurocontrol for generators in a multimachine power system," IEEE Trans. on Industry Applications, vol. 39, no. 2, pp. 382-394, 2003.
    • (2003) IEEE Trans. on Industry Applications , vol.39 , Issue.2 , pp. 382-394
    • Venayagamoorthy, G.K.1    Harley, R.G.2    Wunsch, D.C.3
  • 22
    • 0029592634
    • Adaptive critic designs: A case study for neurocontrol
    • D. V. Prokhorov, R. A. Santiago, and D. C. Wunsch, "Adaptive critic designs: A case study for neurocontrol," Neural Networks Letter, vol. 8, no. 9, pp. 1367-1372, 1995.
    • (1995) Neural Networks Letter , vol.8 , Issue.9 , pp. 1367-1372
    • Prokhorov, D.V.1    Santiago, R.A.2    Wunsch, D.C.3
  • 25
    • 48949116211
    • Stability and almost disturbance decoupling analysis of nonlinear system subject to feedback linearization and feedforward neural network controller
    • T. L. Chien, C. C. Chen, Y. C. Huang, and W.-J. Lin, "Stability and almost disturbance decoupling analysis of nonlinear system subject to feedback linearization and feedforward neural network controller," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1220-1230, 2008.
    • (2008) IEEE Trans. Neural Netw. , vol.19 , Issue.7 , pp. 1220-1230
    • Chien, T.L.1    Chen, C.C.2    Huang, Y.C.3    Lin, W.-J.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.