메뉴 건너뛰기




Volumn , Issue , 2012, Pages 523-527

Data-driven learning and control with multiple critic networks

Author keywords

adaptive dynamic programming (ADP); external reinforcement signal; goal representation; hierarchical structure; internal reinforcement signal; multiple critic networks

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; GOAL REPRESENTATION; HIERARCHICAL STRUCTURES; MULTIPLE CRITIC; REINFORCEMENT SIGNAL;

EID: 84872330793     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/WCICA.2012.6357935     Document Type: Conference Paper
Times cited : (5)

References (19)
  • 1
    • 82655173881 scopus 로고    scopus 로고
    • A three-network architecture for on-line learning and optimization based on adaptive dynamic programming
    • H. He, Z. Ni, and J. Fu, "A three-network architecture for on-line learning and optimization based on adaptive dynamic programming," Neurocomputing, vol. 78, no. 1, pp. 3-13, 2012.
    • (2012) Neurocomputing , vol.78 , Issue.1 , pp. 3-13
    • He, H.1    Ni, Z.2    Fu, J.3
  • 3
    • 67349247013 scopus 로고    scopus 로고
    • Intelligence in the brain: A theory of how it works and how to build it
    • P. J. Werbos, "Intelligence in the brain: A theory of how it works and how to build it," Neural Networks, pp. 200-212, 2009.
    • (2009) Neural Networks , pp. 200-212
    • Werbos, P.J.1
  • 5
    • 78651311269 scopus 로고    scopus 로고
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • F. Y. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound," IEEE Transactions on Neural Networks, vol. 22, no. 1, pp. 24-36, 2011.
    • (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.1 , pp. 24-36
    • Wang, F.Y.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 6
    • 79960115021 scopus 로고    scopus 로고
    • Adaptive learning and control for mimo system based on adaptive dynamic programming
    • J. Fu, H. He, and X. Zhou, "Adaptive learning and control for mimo system based on adaptive dynamic programming," IEEE Transactions on Neural Networks, vol. 22, no. 7, pp. 1133-1148, 2011.
    • (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.7 , pp. 1133-1148
    • Fu, J.1    He, H.2    Zhou, X.3
  • 7
    • 49049116711 scopus 로고    scopus 로고
    • Comparison of adaptive critics and classical approaches based wide area controllers for a power system
    • S. Ray, G. K. Venayagamoorthy, B. Chaudhuri, and R. Majumder, "Comparison of adaptive critics and classical approaches based wide area controllers for a power system," IEEE Trans. on Syst. Man, Cybern., Part B, vol. 38, no. 4, pp. 1002-1007, 2008.
    • (2008) IEEE Trans. on Syst. Man, Cybern., Part B , vol.38 , Issue.4 , pp. 1002-1007
    • Ray, S.1    Venayagamoorthy, G.K.2    Chaudhuri, B.3    Majumder, R.4
  • 10
    • 0033750123 scopus 로고    scopus 로고
    • Neurocontroller alternatives for fuzzy ball-and-beam systems with nonuniform nonlinear friction
    • P. H. Eaton, and D. V. Prokhorov, and D. C. Wunsch II., "Neurocontroller alternatives for fuzzy ball-and-beam systems with nonuniform nonlinear friction," IEEE Trans. Neural Netw., vol. 11, no. 2, pp. 423-435, 2000.
    • (2000) IEEE Trans. Neural Netw. , vol.11 , Issue.2 , pp. 423-435
    • Eaton, P.H.1    Prokhorov, D.V.2    Wunsch, I.I.D.C.3
  • 11
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. on Neural Netw., vol. 12, no. 2, pp. 264-276, 2001.
    • (2001) IEEE Trans. on Neural Netw. , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 14
    • 49049119493 scopus 로고    scopus 로고
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy hdp iteration algorithm
    • H. G. Zhang, Q. L. Wei, and Y. H. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy hdp iteration algorithm," IEEE Transactions on System, Man and Cybernetics, Part B, vol. 38, no. 4, pp. 937-942, 2008.
    • (2008) IEEE Transactions on System, Man and Cybernetics, Part B , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.G.1    Wei, Q.L.2    Luo, Y.H.3
  • 15
    • 70349253929 scopus 로고    scopus 로고
    • Neural-network-based nearoptimal control for a class of discrete-time affine nonlinear systems with control constraints
    • H. G. Zhang, Y. H. Luo, and D. Liu, "Neural-network-based nearoptimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Transactions on Neural Networks, vol. 20, no. 9, pp. 1490-1503, 2009.
    • (2009) IEEE Transactions on Neural Networks , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.G.1    Luo, Y.H.2    Liu, D.3
  • 16
    • 78650805234 scopus 로고    scopus 로고
    • An iterative approximate dynamic programming method to solve for a class of nonlinear zerosum differential games
    • H. G. Zhang, Q. L. Wei, and D. Liu, "An iterative approximate dynamic programming method to solve for a class of nonlinear zerosum differential games," Automatica, vol. 47, no. 1, pp. 207-214, 2011.
    • (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.G.1    Wei, Q.L.2    Liu, D.3
  • 17
    • 0025503558 scopus 로고
    • Backpropagation through time: What it does and how to do it
    • P. J. Werbos, "Backpropagation through time: What it does and how to do it," in Proc. IEEE, vol. 78, pp. 1550-1560, 1990.
    • (1990) Proc. IEEE , vol.78 , pp. 1550-1560
    • Werbos, P.J.1
  • 19
    • 84865079504 scopus 로고    scopus 로고
    • Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming
    • press
    • Z. Ni, H. He, D. Zhao, and D. V. Prokhorov, "Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming," Proc. Int. Joint Conf. Neural Networks (IJCNN), 2012 (in press).
    • (2012) Proc. Int. Joint Conf. Neural Networks (IJCNN)
    • Ni, Z.1    He, H.2    Zhao, D.3    Prokhorov, D.V.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.