메뉴 건너뛰기




Volumn 55, Issue 2-3, 2009, Pages 177-201

Performance evaluation of direct heuristic dynamic programming using control-theoretic measures

Author keywords

Approximate dynamic programming (ADP); Direct heuristic dynamic programming (direct HDP); Linear quadratic regulator (LQR); On line learning control; Sensitivity and complementary sensitivity

Indexed keywords

APPROXIMATE DYNAMIC PROGRAMMING (ADP); DIRECT HEURISTIC DYNAMIC PROGRAMMING (DIRECT HDP); LINEAR QUADRATIC REGULATOR (LQR); ON-LINE LEARNING CONTROL; SENSITIVITY AND COMPLEMENTARY SENSITIVITY;

EID: 67349172656     PISSN: 09210296     EISSN: 15730409     Source Type: Journal    
DOI: 10.1007/s10846-008-9307-5     Document Type: Article
Times cited : (15)

References (21)
  • 2
    • 85012688561 scopus 로고
    • Princeton University Press Princeton
    • Bellman, R.: Dynamic Programming. Princeton University Press, Princeton (1957)
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 5
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal difference
    • R.S. Sutton 1988 Learning to predict by the methods of temporal difference Mach. Learn. 3 9 44
    • (1988) Mach. Learn. , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 6
    • 0020970738 scopus 로고
    • Neuron like adaptive elements that can solve difficult learning control problems
    • A.G. Barto R.S. Sutton C.W. Anderson 1983 Neuron like adaptive elements that can solve difficult learning control problems IEEE Trans. Syst. Man, Cybern. 13 834 847
    • (1983) IEEE Trans. Syst. Man, Cybern. , vol.13 , pp. 834-847
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 7
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • R.S. Sutton 1996 Reinforcement learning with replacing eligibility traces Mach. Learn. 22 1 123 158 (Pubitemid 126724365)
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 8
    • 0000985504 scopus 로고
    • TD-Gammon, a self-teaching backgammon program achieves master-level play
    • G. Tesauro 1994 TD-Gammon, a self-teaching backgammon program achieves master-level play Neural Comput. 6 215 219
    • (1994) Neural Comput. , vol.6 , pp. 215-219
    • Tesauro, G.1
  • 9
    • 0002557583 scopus 로고
    • Advanced forecasting methods for global crisis warning and models of intelligence
    • P.J. Werbos 1977 Advanced forecasting methods for global crisis warning and models of intelligence Gen. Syst. Yearb. 22 25 38
    • (1977) Gen. Syst. Yearb. , vol.22 , pp. 25-38
    • Werbos, P.J.1
  • 11
    • 0002437599 scopus 로고
    • Neuro-control and supervised learning: An overview and valuation
    • Van Nostrand New York
    • Werbos, P.J.: Neuro-control and supervised learning: an overview and valuation. In: White, D., Sofge, D. (eds.) Handbook of Intelligent Control, pp. 65-89. Van Nostrand, New York (1992)
    • (1992) Handbook of Intelligent Control , pp. 65-89
    • Werbos, P.J.1    White, D.2    Sofge, D.3
  • 12
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • Van Nostrand New York
    • Werbos, P.J.: Approximate dynamic programming for real-time control and neural modeling. In: White, D., Sofge, D. (eds.) Handbook of Intelligent Control, pp. 493-525. Van Nostrand, New York (1992)
    • (1992) Handbook of Intelligent Control , pp. 493-525
    • Werbos, P.J.1    White, D.2    Sofge, D.3
  • 13
    • 0029592634 scopus 로고
    • Adaptive critic designs: A case study for neurocontrol
    • DOI 10.1016/0893-6080(95)00042-9
    • D.V. Prokhorov R.A. Santiago D.C. Wunsch 1995 Adaptive critic designs: a case study for neurocontrol Neural Netw. 8 9 1367 1372 (Pubitemid 26072896)
    • (1995) Neural Networks , vol.8 , Issue.9 , pp. 1367-1372
    • Prokhorov, D.V.1    Santiago, R.A.2    Wunsch II, D.C.3
  • 15
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • DOI 10.1109/72.914523, PII S1045922701014047
    • J. Si Y. Wang 2001 Online learning control by association and reinforcement IEEE Trans. Neural Netw. 12 2 264 276 (Pubitemid 32371483)
    • (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.-T.2
  • 16
    • 0036157443 scopus 로고    scopus 로고
    • Apache helicopter stabilization using neural dynamic programming
    • R. Enns J. Si 2002 Apache helicopter stabilization using neural dynamic programming AIAA J. Guid. Control Dyn. 25 1 19 25 (Pubitemid 34109509)
    • (2002) Journal of Guidance, Control, and Dynamics , vol.25 , Issue.1 , pp. 19-25
    • Enns, R.1    Si, J.2
  • 17
    • 0043026775 scopus 로고    scopus 로고
    • Helicopter trimming and tracking control using direct neural dynamic programming
    • R. Enns J. Si 2003 Helicopter trimming and tracking control using direct neural dynamic programming IEEE Trans. Neural Netw. 14 4 929 939
    • (2003) IEEE Trans. Neural Netw. , vol.14 , Issue.4 , pp. 929-939
    • Enns, R.1    Si, J.2
  • 18
    • 0042767744 scopus 로고    scopus 로고
    • Helicopter flight-control reconfiguration for main rotor actuator failures
    • R. Enns J. Si 2003 Helicopter flight-control reconfiguration for main rotor actuator failures AIAA J. Guid. Control Dyn. 26 4 572 584
    • (2003) AIAA J. Guid. Control Dyn. , vol.26 , Issue.4 , pp. 572-584
    • Enns, R.1    Si, J.2
  • 20
    • 0031672813 scopus 로고    scopus 로고
    • Nonlinear optimal control of a triple link inverted pendulum with single control input
    • K.D. Eltohamy C.Y. Kuo 1998 Nonlinear optimal control of a triple link inverted pendulum with single control input Int. J. Control 69 2 239 256
    • (1998) Int. J. Control , vol.69 , Issue.2 , pp. 239-256
    • Eltohamy, K.D.1    Kuo, C.Y.2
  • 21
    • 0007908166 scopus 로고    scopus 로고
    • Experiments with reinforcement learning in problems with continuous state and action spaces
    • University of Massachussetts, Amherst
    • Santamaria, J.C., Sutton, R.S., Ram, A.: Experiments with reinforcement learning in problems with continuous state and action spaces. COINS Technical Report 96-88, University of Massachussetts, Amherst (1996)
    • (1996) COINS Technical Report 96-88
    • Santamaria, J.C.1    Sutton, R.S.2    Ram, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.