메뉴 건너뛰기




Volumn 78, Issue 1, 2012, Pages 3-13

A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

Author keywords

Actor critic design; Adaptive dynamic programming; Goal representation; Multi state optimization; Online learning and control; Reinforcement learning; Three network architecture

Indexed keywords

ACTION NETWORK; ACTOR CRITIC; ADAPTIVE DYNAMIC PROGRAMMING; CONTROL PERFORMANCE; CRITIC NETWORK; DESIGN FRAMEWORKS; DETAILED DESIGN; EFFECTIVE LEARNING; GOAL REPRESENTATION; INVERTED PENDULUM; MULTI STATE; ONLINE LEARNING; REFERENCE NETWORK; REINFORCEMENT SIGNAL;

EID: 82655173881     PISSN: 09252312     EISSN: 18728286     Source Type: Journal    
DOI: 10.1016/j.neucom.2011.05.031     Document Type: Article
Times cited : (216)

References (31)
  • 1
    • 67349247013 scopus 로고    scopus 로고
    • Intelligence in the brain: a theory of how it works and how to build it
    • Werbos P.J. Intelligence in the brain: a theory of how it works and how to build it. Neural Netw. 2009, 200-212.
    • (2009) Neural Netw. , pp. 200-212
    • Werbos, P.J.1
  • 8
    • 70449429571 scopus 로고    scopus 로고
    • Adaptive dynamic programming for discrete-time systems with infinite horizon and epsilon-error bound in the performance cost
    • Liu D., Jin N. Adaptive dynamic programming for discrete-time systems with infinite horizon and epsilon-error bound in the performance cost. Proceedings of the IEEE International Conference on Neural Networks 2009.
    • (2009) Proceedings of the IEEE International Conference on Neural Networks
    • Liu, D.1    Jin, N.2
  • 9
    • 66449130966 scopus 로고    scopus 로고
    • Adaptive dynamic programming: an introduction
    • Wang F.Y., Zhang H., Liu D. Adaptive dynamic programming: an introduction. IEEE Comput. Intel. Mag. 2009, 4(2):39-47.
    • (2009) IEEE Comput. Intel. Mag. , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.Y.1    Zhang, H.2    Liu, D.3
  • 11
    • 49049111594 scopus 로고    scopus 로고
    • Issues on stability of ADP feedback controllers for dynamical systems
    • Special Issue on ADP/RL invited survey paper
    • Balakrishnan S.N., Ding J., Lewis F.L. Issues on stability of ADP feedback controllers for dynamical systems. IEEE Trans. Syst. Man Cybern., Part B 2008, 38(4):913-917. Special Issue on ADP/RL invited survey paper.
    • (2008) IEEE Trans. Syst. Man Cybern., Part B , vol.38 , Issue.4 , pp. 913-917
    • Balakrishnan, S.N.1    Ding, J.2    Lewis, F.L.3
  • 12
    • 33847648898 scopus 로고    scopus 로고
    • Adaptive critic designs for discrete-time zero-sum games with application to h-infinity control
    • Al-Tamimi A., Abu-Khalaf M., Lewis F.L. Adaptive critic designs for discrete-time zero-sum games with application to h-infinity control. IEEE Trans. Syst. Man Cybern. Part B 2007, 37(1):240-247.
    • (2007) IEEE Trans. Syst. Man Cybern. Part B , vol.37 , Issue.1 , pp. 240-247
    • Al-Tamimi, A.1    Abu-Khalaf, M.2    Lewis, F.L.3
  • 14
    • 49049116711 scopus 로고    scopus 로고
    • Comparison of adaptive critics and classical approaches based wide area controllers for a power system
    • Ray S., Venayagamoorthy G.K., Chaudhuri B., Majumder R. Comparison of adaptive critics and classical approaches based wide area controllers for a power system. IEEE Trans. Syst. Man Cybern. Part B 2008, 38(4):1002-1007.
    • (2008) IEEE Trans. Syst. Man Cybern. Part B , vol.38 , Issue.4 , pp. 1002-1007
    • Ray, S.1    Venayagamoorthy, G.K.2    Chaudhuri, B.3    Majumder, R.4
  • 15
    • 70349253929 scopus 로고    scopus 로고
    • Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
    • Zhang H.G., Luo Y.H., Liu D. Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Trans. Neural Netw. 2009, 20(9):1490-1503.
    • (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.G.1    Luo, Y.H.2    Liu, D.3
  • 16
    • 78651311269 scopus 로고    scopus 로고
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • Wang F.Y., Jin N., Liu D., Wei Q. Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound. IEEE Trans. Neural Netw. 2011, 22(1):24-36.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.1 , pp. 24-36
    • Wang, F.Y.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 17
    • 26844483839 scopus 로고    scopus 로고
    • A self-learning call admission control scheme for CDMA cellular networks
    • Liu D., Zhang Y., Zhang H.G. A self-learning call admission control scheme for CDMA cellular networks. IEEE Trans. Neural Netw. 2005, 16(5):1219-1228.
    • (2005) IEEE Trans. Neural Netw. , vol.16 , Issue.5 , pp. 1219-1228
    • Liu, D.1    Zhang, Y.2    Zhang, H.G.3
  • 18
    • 79960115021 scopus 로고    scopus 로고
    • Adaptive learning and control for MIMO system based on adaptive dynamic programming
    • He H., Fu J., Zhou X. Adaptive learning and control for MIMO system based on adaptive dynamic programming. IEEE Trans. Neural Netw. 2011, 22(7):1133-1148.
    • (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.7 , pp. 1133-1148
    • He, H.1    Fu, J.2    Zhou, X.3
  • 19
    • 85012688561 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Bellman R.E. Dynamic Programming 1957, Princeton University Press, Princeton, NJ.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 20
    • 0025503558 scopus 로고
    • Backpropagation through time: what it does and how to do it
    • Werbos P.J. Backpropagation through time: what it does and how to do it. Proc/ IEEE 1990, vol. 78:1550-1560.
    • (1990) Proc/ IEEE , vol.78 , pp. 1550-1560
    • Werbos, P.J.1
  • 21
    • 0004146423 scopus 로고
    • Backpropagation: basics and new developments
    • MIT Press, Cambridge, MA
    • Werbos P.J. Backpropagation: basics and new developments. The Handbook of Brain Theory and Neural Networks 1995, 134-139. MIT Press, Cambridge, MA.
    • (1995) The Handbook of Brain Theory and Neural Networks , pp. 134-139
    • Werbos, P.J.1
  • 23
    • 0002437599 scopus 로고
    • Neuralcontrol and supervised learning
    • Van Nostrand, New York
    • Werbos P.J. Neuralcontrol and supervised learning. Handbook of Intelligent Control 1992, Van Nostrand, New York.
    • (1992) Handbook of Intelligent Control
    • Werbos, P.J.1
  • 24
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • Si J., Wang Y.T. On-line learning control by association and reinforcement. IEEE Trans. Neural Netw. 2001, 12(2):264-276.
    • (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 26
    • 0001773535 scopus 로고
    • Applications of advances in nonlinear sensitivity analysis
    • Werbos P.J. Applications of advances in nonlinear sensitivity analysis. System Modeling and Optimization 1981.
    • (1981) System Modeling and Optimization
    • Werbos, P.J.1
  • 27
    • 84855328773 scopus 로고    scopus 로고
    • Stable adaptive control using new critic designs," [online], available:
    • P.J. Werbos, Stable adaptive control using new critic designs," [online], available: 2008. http://arxiv.orgasadap-org/9810001.
    • (2008)
    • Werbos, P.J.1
  • 30
    • 0031672813 scopus 로고    scopus 로고
    • Nonlinear optimal control of a triple link inverted pendulum with single control input
    • Eltohamy K.D., Kuo C.-Y. Nonlinear optimal control of a triple link inverted pendulum with single control input. Int. J. Contr. 1998, 69(2):239-256.
    • (1998) Int. J. Contr. , vol.69 , Issue.2 , pp. 239-256
    • Eltohamy, K.D.1    Kuo, C.-Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.