SCOPUS Information Retrieval Platform

Journal of Intelligent and Robotic Systems: Theory and Applications

Volume 55, Issue 2-3, 2009, Pages 177-201

Performance evaluation of direct heuristic dynamic programming using control-theoretic measures

Authors (4): Yang, Lei (a); Si, Jennie (a); Tsakalis, Konstantinos S. (a); Rodriguez, Armando A. (a)

(a) Arizona State University (United States)

Author keywords

Approximate dynamic programming (ADP); Direct heuristic dynamic programming (direct HDP); Linear quadratic regulator (LQR); On line learning control; Sensitivity and complementary sensitivity

Indexed keywords

APPROXIMATE DYNAMIC PROGRAMMING (ADP); DIRECT HEURISTIC DYNAMIC PROGRAMMING (DIRECT HDP); LINEAR QUADRATIC REGULATOR (LQR); ON-LINE LEARNING CONTROL; SENSITIVITY AND COMPLEMENTARY SENSITIVITY;

CONVERGENCE OF NUMERICAL METHODS; DYNAMIC PROGRAMMING; DYNAMICAL SYSTEMS; EDUCATION; ERROR STATISTICS; HEURISTIC METHODS; HEURISTIC PROGRAMMING; LEARNING ALGORITHMS; REINFORCEMENT; REINFORCEMENT LEARNING; SENSITIVITY ANALYSIS; SYSTEMS ENGINEERING;

CONTROLLERS;

EID: 67349172656
PISSN: 09210296
EISSN: 15730409
Source Type: Journal
DOI: 10.1007/s10846-008-9307-5
Document Type: Article

Times cited: 15

References (21)

1
- 84921399937
- Wiley-IEEE New York
- Si, J., Barto, A.G., Powell, W.B., Wunsch, D. (eds.): Handbook of Learning and Approximate Dynamic Programming. Wiley-IEEE, New York (2004)
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.⁴

2
- 85012688561
- Princeton University Press Princeton
- Bellman, R.: Dynamic Programming. Princeton University Press, Princeton (1957)
- (1957) Dynamic Programming
- Bellman, R.¹

3
- 0003487482
- Athena Scientific Belmont
- Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 85102627959
- Wiley New York
- Puterman, M.L.: Markov Decision Processes-Discrete Stochastic Dynamic Programming. Wiley, New York (1994)
- (1994) Markov Decision Processes-Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

5
- 33847202724
- Learning to predict by the methods of temporal differences
- R.S. Sutton 1988 Learning to predict by the methods of temporal differences Mach. Learn. 3 9 44
- (1988) Mach. Learn. , vol.3 , pp. 9-44
- Sutton, R.S.¹

6
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- A.G. Barto R.S. Sutton C.W. Anderson 1983 Neuronlike adaptive elements that can solve difficult learning control problems IEEE Trans. Syst. Man, Cybern. 13 834 847
- (1983) IEEE Trans. Syst. Man, Cybern. , vol.13 , pp. 834-847
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

7
- 0029753630
- Reinforcement learning with replacing eligibility traces
- S.P. Singh R.S. Sutton 1996 Reinforcement learning with replacing eligibility traces Mach. Learn. 22 1 123 158 (Pubitemid 126724365)
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

8
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- G. Tesauro 1994 TD-Gammon, a self-teaching backgammon program, achieves master-level play Neural Comput. 6 215 219
- (1994) Neural Comput. , vol.6 , pp. 215-219
- Tesauro, G.¹

9
- 0002557583
- Advanced forecasting methods for global crisis warning and models of intelligence
- P.J. Werbos 1977 Advanced forecasting methods for global crisis warning and models of intelligence Gen. Syst. Yearb. 22 25 38
- (1977) Gen. Syst. Yearb. , vol.22 , pp. 25-38
- Werbos, P.J.¹

10
- 0002011091
- A menu of designs for reinforcement learning over time
- MIT Cambridge
- Werbos, P.J.: A menu of designs for reinforcement learning over time. In: Miller, W.T., III, Sutton, R.S., Werbos, P.J. (eds.) Neural Networks for Control, ch. 3, pp. 67-95. MIT, Cambridge (1990)
- (1990) Neural Networks for Control, Ch. 3 , pp. 67-95
- Werbos, P.J.¹ Miller III, W.T.² Sutton, R.S.³ Werbos, P.J.⁴

11
- 0002437599
- Neuro-control and supervised learning: An overview and evaluation
- Van Nostrand New York
- Werbos, P.J.: Neuro-control and supervised learning: an overview and evaluation. In: White, D., Sofge, D. (eds.) Handbook of Intelligent Control, pp. 65-89. Van Nostrand, New York (1992)
- (1992) Handbook of Intelligent Control , pp. 65-89
- Werbos, P.J.¹ White, D.² Sofge, D.³

12
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- Van Nostrand New York
- Werbos, P.J.: Approximate dynamic programming for real-time control and neural modeling. In: White, D., Sofge, D. (eds.) Handbook of Intelligent Control, pp. 493-525. Van Nostrand, New York (1992)
- (1992) Handbook of Intelligent Control , pp. 493-525
- Werbos, P.J.¹ White, D.² Sofge, D.³

13
- 0029592634
- Adaptive critic designs: A case study for neurocontrol
- DOI 10.1016/0893-6080(95)00042-9
- D.V. Prokhorov R.A. Santiago D.C. Wunsch 1995 Adaptive critic designs: a case study for neurocontrol Neural Netw. 8 9 1367 1372 (Pubitemid 26072896)
- (1995) Neural Networks , vol.8 , Issue.9 , pp. 1367-1372
- Prokhorov, D.V.¹ Santiago, R.A.² Wunsch II, D.C.³

14
- 0031236002
- Adaptive critic designs
- PII S1045922797052430
- D.V. Prokhorov D.C. Wunsch 1997 Adaptive critic designs IEEE Trans. Neural Netw. 8 5 997 1007 (Pubitemid 127763331)
- (1997) IEEE Transactions on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch II, D.C.²

15
- 0035273403
- On-line learning control by association and reinforcement
- DOI 10.1109/72.914523, PII S1045922701014047
- J. Si Y. Wang 2001 Online learning control by association and reinforcement IEEE Trans. Neural Netw. 12 2 264 276 (Pubitemid 32371483)
- (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
- Si, J.¹ Wang, Y.-T.²

16
- 0036157443
- Apache helicopter stabilization using neural dynamic programming
- R. Enns J. Si 2002 Apache helicopter stabilization using neural dynamic programming AIAA J. Guid. Control Dyn. 25 1 19 25 (Pubitemid 34109509)
- (2002) Journal of Guidance, Control, and Dynamics , vol.25 , Issue.1 , pp. 19-25
- Enns, R.¹ Si, J.²

17
- 0043026775
- Helicopter trimming and tracking control using direct neural dynamic programming
- R. Enns J. Si 2003 Helicopter trimming and tracking control using direct neural dynamic programming IEEE Trans. Neural Netw. 14 4 929 939
- (2003) IEEE Trans. Neural Netw. , vol.14 , Issue.4 , pp. 929-939
- Enns, R.¹ Si, J.²

18
- 0042767744
- Helicopter flight-control reconfiguration for main rotor actuator failures
- R. Enns J. Si 2003 Helicopter flight-control reconfiguration for main rotor actuator failures AIAA J. Guid. Control Dyn. 26 4 572 584
- (2003) AIAA J. Guid. Control Dyn. , vol.26 , Issue.4 , pp. 572-584
- Enns, R.¹ Si, J.²

19
- 0003585352
- Prentice Hall Englewood Cliffs
- Zhou, K., Doyle, J.C., Glover, K.: Robust and Optimal Control. Prentice Hall, Englewood Cliffs (1996)
- (1996) Robust and Optimal Control
- Zhou, K.¹ Doyle, J.C.² Glover, K.³

20
- 0031672813
- Nonlinear optimal control of a triple link inverted pendulum with single control input
- K.D. Eltohamy C.Y. Kuo 1998 Nonlinear optimal control of a triple link inverted pendulum with single control input Int. J. Control 69 2 239 256
- (1998) Int. J. Control , vol.69 , Issue.2 , pp. 239-256
- Eltohamy, K.D.¹ Kuo, C.Y.²

21
- 0007908166
- Experiments with reinforcement learning in problems with continuous state and action spaces
- University of Massachusetts, Amherst
- Santamaria, J.C., Sutton, R.S., Ram, A.: Experiments with reinforcement learning in problems with continuous state and action spaces. COINS Technical Report 96-88, University of Massachusetts, Amherst (1996)
- (1996) COINS Technical Report 96-88
- Santamaria, J.C.¹ Sutton, R.S.² Ram, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.