SCOPUS Information Search Platform

Volume , Issue , 2004, Pages 374-379

Understand direct NDP with linear quadratic regulation

Author keywords

[No Author keywords available]

Indexed keywords

CLOSED LOOP CONTROL SYSTEMS; COMPUTER SIMULATION; HEURISTIC METHODS; INTELLIGENT CONTROL; LEARNING SYSTEMS; MATRIX ALGEBRA;

CLOSED LOOP PROPERTIES; COST FUNCTIONS; LINEAR QUADRATIC REGULATORS (LQR) DESIGNS; NEURAL DYNAMIC PROGRAMMING (NDP);

DYNAMIC PROGRAMMING;

EID: 20344405190 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (1)

References (14)

1
- 0035273403
- Online learning control by association and reinforcement
- J. Si, and Y. Wang, "Online learning control by association and reinforcement," IEEE Transactions on Neural Networks, vol. 12, no. 2, pp. 264-276, 2001.
- (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.²

2
- 0344666440
- Analyzing and enhancing direct NDP designs using a control-theoretic approach
- L. Yang, J. Si, K. Tsakalis, and A. Rodriguez, "Analyzing and enhancing direct NDP designs using a control-theoretic approach," IEEE International Symposium on Intelligent Control, pp. 529-532, 2003.
- (2003) IEEE International Symposium on Intelligent Control , pp. 529-532
- Yang, L.¹ Si, J.² Tsakalis, K.³ Rodriguez, A.⁴

3
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuronlike adaptive elements that can solve difficult learning control problems," IEEE Transactions on Systems, Man, and Cybernetics, vol. 13, pp. 834-847, 1983.
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.13 , pp. 834-847
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

4
- 0002557583
- Advanced forecasting methods for global crisis warning and models of intelligence
- P. Werbos, "Advanced forecasting methods for global crisis warning and models of intelligence," General Systems Yearbook, vol. 22, pp. 25-38, 1977.
- (1977) General Systems Yearbook , vol.22 , pp. 25-38
- Werbos, P.¹

8
- 85012688561
- Princeton University Press
- R. Bellman, Dynamic Programming, Princeton University Press, 1957.
- (1957) Dynamic Programming
- Bellman, R.¹

9
- 0004049893
- Ph.D. Dissertation, University of Cambridge, England
- C. J. C. H. Watkins, Learning from Delayed Rewards, Ph.D. Dissertation, University of Cambridge, England, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

10
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J. N. Tsitsiklis, "Asynchronous Stochastic Approximation and Q-learning," Machine Learning, vol. 16, no. 3, pp. 185-202, 1994.
- (1994) Machine Learning , vol.16 , Issue.3 , pp. 185-202
- Tsitsiklis, J.N.¹

11
- 33847202724
- Learning to predict by the methods of temporal differences
- R. S. Sutton, "Learning to predict by the methods of temporal differences," Machine Learning, vol. 3, pp. 9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

12
- 0028584964
- Adaptive linear quadratic control using policy iteration
- S. J. Bradtke, B. E. Ydstie, and A. G. Barto, "Adaptive linear quadratic control using policy iteration," Proceedings of American Control Conference, pp. 3475-3479, 1994.
- (1994) Proceedings of American Control Conference , pp. 3475-3479
- Bradtke, S.J.¹ Ydstie, B.E.² Barto, A.G.³

13
- 0003754075
- Ph.D. Dissertation, Linköping University, Sweden
- T. Landelius, Reinforcement Learning and Distributed Local Model Synthesis, Ph.D. Dissertation, Linköping University, Sweden, 1997.
- (1997) Reinforcement Learning and Distributed Local Model Synthesis
- Landelius, T.¹

14
- 0003585352
- Prentice Hall
- K. Zhou, J. C. Doyle, and K. Glover, Robust and Optimal Control, Prentice Hall, 1996.
- (1996) Robust and Optimal Control
- Zhou, K.¹ Doyle, J.C.² Glover, K.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.