SCOPUS 정보 검색 플랫폼

Proceedings of the World Congress on Intelligent Control and Automation (WCICA)

Volumn , Issue , 2012, Pages 523-527

Data-driven learning and control with multiple critic networks

(3) He, Haibo a Ni, Zhen a Zhao, Dongbin b

a University of Rhode Island (United States)

Author keywords

adaptive dynamic programming (ADP); external reinforcement signal; goal representation; hierarchical structure; internal reinforcement signal; multiple critic networks

Indexed keywords

ADAPTIVE DYNAMIC PROGRAMMING; GOAL REPRESENTATION; HIERARCHICAL STRUCTURES; MULTIPLE CRITIC; REINFORCEMENT SIGNAL;

BENCHMARKING; DYNAMIC PROGRAMMING; INTELLIGENT CONTROL; REINFORCEMENT;

E-LEARNING;

EID: 84872330793 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/WCICA.2012.6357935 Document Type: Conference Paper

Times cited : (5)

References (19)

1
- 82655173881
- A three-network architecture for on-line learning and optimization based on adaptive dynamic programming
- H. He, Z. Ni, and J. Fu, "A three-network architecture for on-line learning and optimization based on adaptive dynamic programming," Neurocomputing, vol. 78, no. 1, pp. 3-13, 2012.
- (2012) Neurocomputing , vol.78 , Issue.1 , pp. 3-13
- He, H.¹ Ni, Z.² Fu, J.³

2
- 34548766755
- Using ADP to understand and replicate brain intelligence: The next level design
- P. J. Werbos, "Using ADP to understand and replicate brain intelligence: the next level design," in IEEE Int. Symposium on Approximate Dynamic Programming and Reinforcement Learning, pp. 209-216, 2007.
- (2007) IEEE Int. Symposium on Approximate Dynamic Programming and Reinforcement Learning , pp. 209-216
- Werbos, P.J.¹

3
- 67349247013
- Intelligence in the brain: A theory of how it works and how to build it
- P. J. Werbos, "Intelligence in the brain: A theory of how it works and how to build it," Neural Networks, pp. 200-212, 2009.
- (2009) Neural Networks , pp. 200-212
- Werbos, P.J.¹

4
- 84921399937
- IEEE Press
- J. Si, A. G. Barto, W. B. Powell, and D. C. Wunsch, Handbook of Learning and Approximate Dynamic Programming. IEEE Press, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.C.⁴

5
- 78651311269
- Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
- F. Y. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound," IEEE Transactions on Neural Networks, vol. 22, no. 1, pp. 24-36, 2011.
- (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.1 , pp. 24-36
- Wang, F.Y.¹ Jin, N.² Liu, D.³ Wei, Q.⁴

6
- 79960115021
- Adaptive learning and control for mimo system based on adaptive dynamic programming
- J. Fu, H. He, and X. Zhou, "Adaptive learning and control for mimo system based on adaptive dynamic programming," IEEE Transactions on Neural Networks, vol. 22, no. 7, pp. 1133-1148, 2011.
- (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.7 , pp. 1133-1148
- Fu, J.¹ He, H.² Zhou, X.³

7
- 49049116711
- Comparison of adaptive critics and classical approaches based wide area controllers for a power system
- S. Ray, G. K. Venayagamoorthy, B. Chaudhuri, and R. Majumder, "Comparison of adaptive critics and classical approaches based wide area controllers for a power system," IEEE Trans. on Syst. Man, Cybern., Part B, vol. 38, no. 4, pp. 1002-1007, 2008.
- (2008) IEEE Trans. on Syst. Man, Cybern., Part B , vol.38 , Issue.4 , pp. 1002-1007
- Ray, S.¹ Venayagamoorthy, G.K.² Chaudhuri, B.³ Majumder, R.⁴

8
- 0031236002
- Adaptive critic designs
- D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. on Neural Netw., vol. 8, no. 5, pp. 997-1007, 1997.
- (1997) IEEE Trans. on Neural Netw. , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch, D.C.²

9
- 0002437599
- New York: Van Nostrand
- P. J. Werbos, Handbook of Intelligent Control, ch. Neuralcontrol and Supervised Learning: an Overview and Evaluation. New York: Van Nostrand, 1992.
- (1992) Handbook of Intelligent Control, ch. Neuralcontrol and Supervised Learning: An Overview and Evaluation
- Werbos, P.J.¹

10
- 0033750123
- Neurocontroller alternatives for fuzzy ball-and-beam systems with nonuniform nonlinear friction
- P. H. Eaton, and D. V. Prokhorov, and D. C. Wunsch II., "Neurocontroller alternatives for fuzzy ball-and-beam systems with nonuniform nonlinear friction," IEEE Trans. Neural Netw., vol. 11, no. 2, pp. 423-435, 2000.
- (2000) IEEE Trans. Neural Netw. , vol.11 , Issue.2 , pp. 423-435
- Eaton, P.H.¹ Prokhorov, D.V.² Wunsch, I.I.D.C.³

11
- 0035273403
- On-line learning control by association and reinforcement
- J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. on Neural Netw., vol. 12, no. 2, pp. 264-276, 2001.
- (2001) IEEE Trans. on Neural Netw. , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.T.²

12
- 84893246005
- IEEE Press
- J. Si, L. Yang, and D. Liu, Handbook of Learning and Approximate Dynamic Programming, ch. Direct Neural Dynamic Programming, pp. 125-151. IEEE Press, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming, Ch. Direct Neural Dynamic Programming , pp. 125-151
- Si, J.¹ Yang, L.² Liu, D.³

13
- 70349615619
- Direct heuristic dynamic programming for nonlinear tracking conrol with filtered tracking error
- L. Yang, J. Si, K. S. Tsakalis, and A. A. Rodriguez, "Direct heuristic dynamic programming for nonlinear tracking conrol with filtered tracking error," IEEE Transactions on Systems Man and Cybernetics Part B-Cybernetics, vol. 39, no. 6, pp. 1617-1622, 2009.
- (2009) IEEE Transactions on Systems Man and Cybernetics Part B-Cybernetics , vol.39 , Issue.6 , pp. 1617-1622
- Yang, L.¹ Si, J.² Tsakalis, K.S.³ Rodriguez, A.A.⁴

14
- 49049119493
- A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy hdp iteration algorithm
- H. G. Zhang, Q. L. Wei, and Y. H. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy hdp iteration algorithm," IEEE Transactions on System, Man and Cybernetics, Part B, vol. 38, no. 4, pp. 937-942, 2008.
- (2008) IEEE Transactions on System, Man and Cybernetics, Part B , vol.38 , Issue.4 , pp. 937-942
- Zhang, H.G.¹ Wei, Q.L.² Luo, Y.H.³

15
- 70349253929
- Neural-network-based nearoptimal control for a class of discrete-time affine nonlinear systems with control constraints
- H. G. Zhang, Y. H. Luo, and D. Liu, "Neural-network-based nearoptimal control for a class of discrete-time affine nonlinear systems with control constraints," IEEE Transactions on Neural Networks, vol. 20, no. 9, pp. 1490-1503, 2009.
- (2009) IEEE Transactions on Neural Networks , vol.20 , Issue.9 , pp. 1490-1503
- Zhang, H.G.¹ Luo, Y.H.² Liu, D.³

16
- 78650805234
- An iterative approximate dynamic programming method to solve for a class of nonlinear zerosum differential games
- H. G. Zhang, Q. L. Wei, and D. Liu, "An iterative approximate dynamic programming method to solve for a class of nonlinear zerosum differential games," Automatica, vol. 47, no. 1, pp. 207-214, 2011.
- (2011) Automatica , vol.47 , Issue.1 , pp. 207-214
- Zhang, H.G.¹ Wei, Q.L.² Liu, D.³

17
- 0025503558
- Backpropagation through time: What it does and how to do it
- P. J. Werbos, "Backpropagation through time: What it does and how to do it," in Proc. IEEE, vol. 78, pp. 1550-1560, 1990.
- (1990) Proc. IEEE , vol.78 , pp. 1550-1560
- Werbos, P.J.¹

18
- 0003809577
- Wiley-Interscience
- P. J. Werbos, The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting. Wiley-Interscience, 1994.
- (1994) The Roots of Backpropagation: From Ordered Derivatives to Neural Networks and Political Forecasting
- Werbos, P.J.¹

19
- 84865079504
- Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming
- press
- Z. Ni, H. He, D. Zhao, and D. V. Prokhorov, "Reinforcement learning control based on multi-goal representation using hierarchical heuristic dynamic programming," Proc. Int. Joint Conf. Neural Networks (IJCNN), 2012 (in press).
- (2012) Proc. Int. Joint Conf. Neural Networks (IJCNN)
- Ni, Z.¹ He, H.² Zhao, D.³ Prokhorov, D.V.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.