SCOPUS 정보 검색 플랫폼

Proceedings of the International Joint Conference on Neural Networks

Volumn , Issue , 2009, Pages 3224-3231

Generalized Policy Iteration for continuous-time systems

(2) Vrabie, Draguna a Lewis, Frank L a

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE CONTROLLERS; APPROXIMATE DYNAMIC PROGRAMMING; CONTINUOUS-TIME FORMULATION; CT SYSTEM; INTERNAL DYNAMICS; ITERATIVE PROCESS; OPTIMAL CONTROL PROBLEM; OPTIMAL CONTROL SOLUTION; POLICY EVALUATION; POLICY ITERATION; SIMULATION RESULT; VALUE FUNCTIONS; VALUE ITERATION;

ADAPTIVE CONTROL SYSTEMS; ALGORITHMS; DYNAMIC PROGRAMMING; NEURAL NETWORKS; NONLINEAR CONTROL SYSTEMS; OPTIMAL CONTROL SYSTEMS; SYSTEMS ENGINEERING;

CONTINUOUS TIME SYSTEMS;

EID: 70449448940 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IJCNN.2009.5178964 Document Type: Conference Paper

Times cited : (26)

References (25)

1
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- M Abu-Khalaf, F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach", Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

2
- 33845759425
- Policy Iterations and the Hamilton-Jacobi-Isaacs equation for H-infmity state-feedback control with input saturation
- December
- M. Abu-Khalaf, F. L. Lewis, Huang, J., "Policy Iterations and the Hamilton-Jacobi-Isaacs equation for H-infmity state-feedback control with input saturation, " IEEE Transactions on Automatic Control, pp. 1989-1995, December, 2006.
- (2006) IEEE Transactions on Automatic Control , pp. 1989-1995
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

3
- 33846781129
- Model-Free Q-Learning Designs for Discrete-Time Zero-Sum Games with Application to H-Infinity Control
- A. Al-Tamimi, F. L. Lewis, M. Abu-Khalaf, "Model-Free Q-Learning Designs for Discrete-Time Zero-Sum Games with Application to H-Infinity Control", Automatica, Vol. 43, pp. 473-481, 2007.
- (2007) Automatica , vol.43 , pp. 473-481
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

4
- 33847648898
- A. Al-Tamimi, M. Abu-Khalaf, F. L. Lewis, Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to H-infinity Control, IEEE Trans. on Sys., Man, and Cyb -B, 37, No. l, February, 2007.
- A. Al-Tamimi, M. Abu-Khalaf, F. L. Lewis, "Adaptive Critic Designs for Discrete-Time Zero-Sum Games With Application to H-infinity Control", IEEE Trans. on Sys., Man, and Cyb -B, Vol. 37, No. l, February, 2007.

5
- 0028584964
- Adaptive linear quadratic control using policy iteration,
- Baltmore, Maryland, June
- S. J. Bradtke, B. E. Ydestie, A. G. Barto, "Adaptive linear quadratic control using policy iteration, " Proceedings of the American Control Conference, pp. 3475-3476, Baltmore, Maryland, June, 1994
- (1994) Proceedings of the American Control Conference , pp. 3475-3476
- Bradtke, S.J.¹ Ydestie, B.E.² Barto, A.G.³

6
- 0031332446
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- R. Beard, G. Saridis, J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation", Automatica, vol. 33, no. 12, pp. 2159-2177, 1997.
- (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
- Beard, R.¹ Saridis, G.² Wen, J.³

7
- 0003487482
- Athena Scientific, MA
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

8
- 0033629916
- Reinforcement Learning In Continuous Time and Space
- K. Doya, "Reinforcement Learning In Continuous Time and Space", Neural Computation, 12 (1), pp. 219-245, 2000.
- (2000) Neural Computation , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

9
- 34249047468
- Continuous-time adaptive critics
- T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics", IEEE Transactions on Neural Networks, 18 (3), 631-647, 2007.
- (2007) IEEE Transactions on Neural Networks , vol.18 , Issue.3 , pp. 631-647
- Hanselmann, T.¹ Noakes, L.² Zaknich, A.³

10
- 0025627940
- Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks
- K. Hornik, M. Stinchcombe, H. White, "Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks", Neural Networks, 3, pp. 551-560, 1990.
- (1990) Neural Networks , vol.3 , pp. 551-560
- Hornik, K.¹ Stinchcombe, M.² White, H.³

11
- 0002526302
- Construction of Suboptimal Control Sequences
- R. J. Leake, Ruey-Wen Liu, "Construction of Suboptimal Control Sequences", J. SIAM Control, 5 (1), 1967.
- (1967) J. SIAM Control , vol.5 , Issue.1
- Leake, R.J.¹ Wen Liu, R.²

12
- 0004163205
- John Wiley
- F. L. Lewis, V. L. Syrmos, Optimal Control, John Wiley, 1995.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

13
- 84914965022
- On an iterative technique for Riccati equation computations
- February
- D. Kleinman, "On an iterative technique for Riccati equation computations", IEEE Trans. on Automatic Control, vol. 13, pp. 114-115, February, 1968.
- (1968) IEEE Trans. on Automatic Control , vol.13 , pp. 114-115
- Kleinman, D.¹

14
- 0003754075
- PhD Dissertation, Linkoping University, Sweden
- T. Landelius, Reinforcement Learning and Distributed Local Model Synthesis, PhD Dissertation, Linkoping University, Sweden, 1997.
- (1997) Reinforcement Learning and Distributed Local Model Synthesis
- Landelius, T.¹

15
- 0036588686
- Adaptive dynamic programming
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming", IEEE Trans. on Systems, Man and Cybernetics, vol. 32, no. 2, pp 140-153, 2002.
- (2002) IEEE Trans. on Systems, Man and Cybernetics , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

16
- 0031236002
- Adaptive critic designs
- D. Prokhorov, D. Wunsch, "Adaptive critic designs, " IEEE Trans. on Neural Networks, vol. 8, no 5, pp. 997-1007, 1997.
- (1997) IEEE Trans. on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.¹ Wunsch, D.²

17
- 84921399937
- John Wiley, New Jersey
- J. Si, A. Barto, W. Powell, D. Wunsch, Handbook of Learning and Approximate Dynamic Programming, John Wiley, New Jersey, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.² Powell, W.³ Wunsch, D.⁴

18
- 0004044108
- Willey, 2nd Edition
- nd Edition, 2003.
- (2003) Aircraft Control and Simulation
- Stevens, B.L.¹ Lewis, F.L.²

19
- 0004102479
- MIT Press, Cambridge, Massachusetts
- R. S. Sutton, A. G. Barto, Reinforcement Learning mdash; An Introduction, MIT Press, Cambridge, Massachusetts, 1998.
- (1998) Reinforcement Learning mdash; An Introduction
- Sutton, R.S.¹ Barto, A.G.²

20
- 0042466434
- On the convergence of optimistic policy iteration
- J. N. Tsitsiklis, "On the convergence of optimistic policy iteration", Journal of Machine Learning Research, 3, pp. 59-72, 2002.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 59-72
- Tsitsiklis, J.N.¹

21
- 63049136575
- Adaptive optimal control algorithm for continuous-time nonlinear systems based on policy iteration
- IEEE
- D. Vrabie, F. Lewis, "Adaptive optimal control algorithm for continuous-time nonlinear systems based on policy iteration", IEEE Proc. CDC'08, IEEE, 2008.
- (2008) IEEE Proc. CDC'08
- Vrabie, D.¹ Lewis, F.²

22
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- to be published, doi:10.1016/j.automatica.2008.08.017
- D. Vrabie, O. Pastravanu, F. Lewis, M. Abu-Khalaf, "Adaptive optimal control for continuous-time linear systems based on policy iteration", Automatica (to be published), doi:10.1016/j.automatica.2008.08.017.
- Automatica
- Vrabie, D.¹ Pastravanu, O.² Lewis, F.³ Abu-Khalaf, M.⁴

23
- 0003529238
- Ph. D. Thesis
- P. J. Werbos, Beyond Regression: New Tools for Prediction and Analysis in the Behavior Sciences, Ph. D. Thesis, 1974.
- (1974) Beyond Regression: New Tools for Prediction and Analysis in the Behavior Sciences
- Werbos, P.J.¹

24
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling,
- ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold
- P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling, " Handbook of Intelligent Control, ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

25
- 0024888479
- Neural networks for control and system identification
- IEEE
- P. Werbos, "Neural networks for control and system identification", IEEE Proc. CDC'89, IEEE, 1989.
- (1989) IEEE Proc. CDC'89
- Werbos, P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.