SCOPUS Information Retrieval Platform

Volume 78, Issue 1, 2012, Pages 3-13

A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

a University of Rhode Island (United States)

b Wuhan University of Technology (China)

Author keywords

Actor critic design; Adaptive dynamic programming; Goal representation; Multi state optimization; Online learning and control; Reinforcement learning; Three network architecture

Indexed keywords

ACTION NETWORK; ACTOR CRITIC; ADAPTIVE DYNAMIC PROGRAMMING; CONTROL PERFORMANCE; CRITIC NETWORK; DESIGN FRAMEWORKS; DETAILED DESIGN; EFFECTIVE LEARNING; GOAL REPRESENTATION; INVERTED PENDULUM; MULTI STATE; ONLINE LEARNING; REFERENCE NETWORK; REINFORCEMENT SIGNAL;

BENCHMARKING; DESIGN; DYNAMIC PROGRAMMING; E-LEARNING; LEARNING ALGORITHMS; OPTIMIZATION; REINFORCEMENT; REINFORCEMENT LEARNING;

NETWORK ARCHITECTURE;

ADAPTATION; ADAPTIVE DYNAMIC PROGRAMMING; ALGORITHM; ARTICLE; ARTIFICIAL NEURAL NETWORK; AUTOMATION; LEARNING; MACHINE LEARNING; ONLINE SYSTEM; PRIORITY JOURNAL; PROCESS DESIGN; PROCESS OPTIMIZATION; SIMULATION;

EID: 82655173881 PISSN: 09252312 EISSN: 18728286 Source Type: Journal
DOI: 10.1016/j.neucom.2011.05.031 Document Type: Article

Times cited: 216

References (31)

1
- 67349247013
- Intelligence in the brain: a theory of how it works and how to build it
- Werbos P.J. Intelligence in the brain: a theory of how it works and how to build it. Neural Netw. 2009, 200-212.
- (2009) Neural Netw. , pp. 200-212
- Werbos, P.J.¹

2
- 84891585216
- Wiley
- He H. Self-Adaptive Systems for Machine Intelligence 2011, Wiley.
- (2011) Self-Adaptive Systems for Machine Intelligence
- He, H.¹

3
- 34548766755
- Using ADP to understand and replicate brain intelligence: the next level design
- Werbos P.J. Using ADP to understand and replicate brain intelligence: the next level design. IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning 2007, 209-216.
- (2007) IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning , pp. 209-216
- Werbos, P.J.¹

4
- 84921399937
- IEEE Press
- Si J., Barto A.G., Powell W.B., Wunsch D.C. Handbook of Learning and Approximate Dynamic Programming 2004, IEEE Press.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.C.⁴

5
- 0031236002
- Adaptive critic designs
- Prokhorov D.V., Wunsch D.C. Adaptive critic designs. IEEE Trans. Neural Netw. 1997, 8(5):997-1007.
- (1997) IEEE Trans. Neural Netw. , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, D.C.²

6
- 0003544743
- Van Nostrand, New York
- White D.A., Sofge D.A. Handbook of Intelligent Control 1992, Van Nostrand, New York.
- (1992) Handbook of Intelligent Control
- White, D.A.¹ Sofge, D.A.²

7
- 47349092417
- Wiley-Interscience
- Powell W.B. Approximate Dynamic Programming: Solving the Curses of Dimensionality 2007, Wiley-Interscience.
- (2007) Approximate Dynamic Programming: Solving the Curses of Dimensionality
- Powell, W.B.¹

8
- 70449429571
- Adaptive dynamic programming for discrete-time systems with infinite horizon and epsilon-error bound in the performance cost
- Liu D., Jin N. Adaptive dynamic programming for discrete-time systems with infinite horizon and epsilon-error bound in the performance cost. Proceedings of the IEEE International Conference on Neural Networks 2009.
- (2009) Proceedings of the IEEE International Conference on Neural Networks
- Liu, D.¹ Jin, N.²

9
- 66449130966
- Adaptive dynamic programming: an introduction
- Wang F.Y., Zhang H., Liu D. Adaptive dynamic programming: an introduction. IEEE Comput. Intell. Mag. 2009, 4(2):39-47.
- (2009) IEEE Comput. Intell. Mag. , vol.4 , Issue.2 , pp. 39-47
- Wang, F.Y.¹ Zhang, H.² Liu, D.³

10
- 70449382072
- Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem
- Vamvoudakis K., Lewis F.L. Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem. Proceedings of the IEEE International Conference on Neural Networks 2009.
- (2009) Proceedings of the IEEE International Conference on Neural Networks
- Vamvoudakis, K.¹ Lewis, F.L.²

11
- 49049111594
- Issues on stability of ADP feedback controllers for dynamical systems
- Special Issue on ADP/RL invited survey paper
- Balakrishnan S.N., Ding J., Lewis F.L. Issues on stability of ADP feedback controllers for dynamical systems. IEEE Trans. Syst. Man Cybern., Part B 2008, 38(4):913-917. Special Issue on ADP/RL invited survey paper.
- (2008) IEEE Trans. Syst. Man Cybern., Part B , vol.38 , Issue.4 , pp. 913-917
- Balakrishnan, S.N.¹ Ding, J.² Lewis, F.L.³

12
- 33847648898
- Adaptive critic designs for discrete-time zero-sum games with application to H-infinity control
- Al-Tamimi A., Abu-Khalaf M., Lewis F.L. Adaptive critic designs for discrete-time zero-sum games with application to H-infinity control. IEEE Trans. Syst. Man Cybern. Part B 2007, 37(1):240-247.
- (2007) IEEE Trans. Syst. Man Cybern. Part B , vol.37 , Issue.1 , pp. 240-247
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

13
- 84867564780
- Application of approximate dynamic programming in power system control
- IEEE Press
- Venayagamoorthy G.K., Harley R.G. Application of approximate dynamic programming in power system control. Handbook of Learning and Approximate Dynamic Programming 2004, 479-515. IEEE Press.
- (2004) Handbook of Learning and Approximate Dynamic Programming , pp. 479-515
- Venayagamoorthy, G.K.¹ Harley, R.G.²

14
- 49049116711
- Comparison of adaptive critics and classical approaches based wide area controllers for a power system
- Ray S., Venayagamoorthy G.K., Chaudhuri B., Majumder R. Comparison of adaptive critics and classical approaches based wide area controllers for a power system. IEEE Trans. Syst. Man Cybern. Part B 2008, 38(4):1002-1007.
- (2008) IEEE Trans. Syst. Man Cybern. Part B , vol.38 , Issue.4 , pp. 1002-1007
- Ray, S.¹ Venayagamoorthy, G.K.² Chaudhuri, B.³ Majumder, R.⁴

15
- 70349253929
- Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints
- Zhang H.G., Luo Y.H., Liu D. Neural-network-based near-optimal control for a class of discrete-time affine nonlinear systems with control constraints. IEEE Trans. Neural Netw. 2009, 20(9):1490-1503.
- (2009) IEEE Trans. Neural Netw. , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.G.¹ Luo, Y.H.² Liu, D.³

16
- 78651311269
- Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
- Wang F.Y., Jin N., Liu D., Wei Q. Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound. IEEE Trans. Neural Netw. 2011, 22(1):24-36.
- (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.1 , pp. 24-36
- Wang, F.Y.¹ Jin, N.² Liu, D.³ Wei, Q.⁴

17
- 26844483839
- A self-learning call admission control scheme for CDMA cellular networks
- Liu D., Zhang Y., Zhang H.G. A self-learning call admission control scheme for CDMA cellular networks. IEEE Trans. Neural Netw. 2005, 16(5):1219-1228.
- (2005) IEEE Trans. Neural Netw. , vol.16 , Issue.5 , pp. 1219-1228
- Liu, D.¹ Zhang, Y.² Zhang, H.G.³

18
- 79960115021
- Adaptive learning and control for MIMO system based on adaptive dynamic programming
- He H., Fu J., Zhou X. Adaptive learning and control for MIMO system based on adaptive dynamic programming. IEEE Trans. Neural Netw. 2011, 22(7):1133-1148.
- (2011) IEEE Trans. Neural Netw. , vol.22 , Issue.7 , pp. 1133-1148
- He, H.¹ Fu, J.² Zhou, X.³

19
- 85012688561
- Princeton University Press, Princeton, NJ
- Bellman R.E. Dynamic Programming 1957, Princeton University Press, Princeton, NJ.
- (1957) Dynamic Programming
- Bellman, R.E.¹

20
- 0025503558
- Backpropagation through time: what it does and how to do it
- Werbos P.J. Backpropagation through time: what it does and how to do it. Proc. IEEE 1990, vol. 78:1550-1560.
- (1990) Proc. IEEE , vol.78 , pp. 1550-1560
- Werbos, P.J.¹

21
- 0004146423
- Backpropagation: basics and new developments
- MIT Press, Cambridge, MA
- Werbos P.J. Backpropagation: basics and new developments. The Handbook of Brain Theory and Neural Networks 1995, 134-139. MIT Press, Cambridge, MA.
- (1995) The Handbook of Brain Theory and Neural Networks , pp. 134-139
- Werbos, P.J.¹

22
- 85032189594
- Model-based adaptive critic designs
- IEEE Press
- Ferrari S., Stengel R.F. Model-based adaptive critic designs. Handbook of Learning and Approximate Dynamic Programming 2004, IEEE Press.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Ferrari, S.¹ Stengel, R.F.²

23
- 0002437599
- Neurocontrol and supervised learning
- Van Nostrand, New York
- Werbos P.J. Neurocontrol and supervised learning. Handbook of Intelligent Control 1992, Van Nostrand, New York.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

24
- 0035273403
- On-line learning control by association and reinforcement
- Si J., Wang Y.T. On-line learning control by association and reinforcement. IEEE Trans. Neural Netw. 2001, 12(2):264-276.
- (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.T.²

25
- 0004102479
- MIT Press, Cambridge, MA
- Sutton R.S., Barto A.G. Reinforcement Learning: An Introduction 1998, MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

26
- 0001773535
- Applications of advances in nonlinear sensitivity analysis
- Werbos P.J. Applications of advances in nonlinear sensitivity analysis. System Modeling and Optimization 1981.
- (1981) System Modeling and Optimization
- Werbos, P.J.¹

27
- 84855328773
- Stable adaptive control using new critic designs
- Werbos P.J. Stable adaptive control using new critic designs. 2008. [Online]. Available: http://arxiv.org/abs/adap-org/9810001.
- (2008)
- Werbos, P.J.¹

28
- 84886519458
- Helicopter flight control using direct neural dynamic programming
- IEEE Press
- Enns R., Si J. Helicopter flight control using direct neural dynamic programming. Handbook of Learning and Approximate Dynamic Programming 2004, 535-559. IEEE Press.
- (2004) Handbook of Learning and Approximate Dynamic Programming , pp. 535-559
- Enns, R.¹ Si, J.²

29
- 84893246005
- Direct neural dynamic programming
- IEEE Press
- Si J., Liu D. Direct neural dynamic programming. Handbook of Learning and Approximate Dynamic Programming 2004, 125-151. IEEE Press.
- (2004) Handbook of Learning and Approximate Dynamic Programming , pp. 125-151
- Si, J.¹ Liu, D.²

30
- 0031672813
- Nonlinear optimal control of a triple link inverted pendulum with single control input
- Eltohamy K.D., Kuo C.-Y. Nonlinear optimal control of a triple link inverted pendulum with single control input. Int. J. Contr. 1998, 69(2):239-256.
- (1998) Int. J. Contr. , vol.69 , Issue.2 , pp. 239-256
- Eltohamy, K.D.¹ Kuo, C.-Y.²

31
- 80054754525
- An online actor-critic learning approach with Levenberg-Marquardt algorithm
- Ni Z., He H., Prokhorov D.V., Fu J. An online actor-critic learning approach with Levenberg-Marquardt algorithm. Proceedings of the International Joint Conference on Neural Networks (IJCNN'11) 2011.
- (2011) Proceedings of the International Joint Conference on Neural Networks (IJCNN'11)
- Ni, Z.¹ He, H.² Prokhorov, D.V.³ Fu, J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.