SCOPUS 정보 검색 플랫폼

Proceedings of the International Joint Conference on Neural Networks

Volumn , Issue , 2009, Pages 3180-3187

Online actor critic algorithm to solve the continuous-time infinite horizon optimal control problem

(2) Vamvoudakis, Kyriakos G a Lewis, Frank L a

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACTOR-CRITIC ALGORITHM; ACTOR-NETWORK; CLOSED-LOOP; CONTINUOUS TIME; DYNAMICAL STABILITY; INFINITE HORIZONS; ON-LINE ADAPTIVE ALGORITHMS; ON-LINE ALGORITHMS; OPTIMAL CONTROL PROBLEM; OPTIMAL CONTROL SOLUTION; OPTIMAL CONTROLLER; OPTIMAL VALUE FUNCTIONS; PERSISTENCE OF EXCITATION; POLICY ITERATION; SIMULATION EXAMPLE; TUNING ALGORITHM;

ADAPTIVE ALGORITHMS; LEARNING ALGORITHMS; NEURAL NETWORKS; NONLINEAR SYSTEMS; ONLINE SYSTEMS; OPTIMAL CONTROL SYSTEMS; OPTIMIZATION; SYSTEM STABILITY; TUNING;

CONTINUOUS TIME SYSTEMS;

EID: 70449382072 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IJCNN.2009.5178586 Document Type: Conference Paper

Times cited : (47)

References (21)

1
- 14844340822
- Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach
- M. Abu-Khalaf, F. L. Lewis, "Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach", Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

2
- 0028733775
- Reinforcement Learning in Continuous Time: Advantage Updating
- Orlando FL
- L. C. Baird III, "Reinforcement Learning in Continuous Time: Advantage Updating", Proc. Of ICNN, Orlando FL, vol. 4, pp. 2448- 2453, 1994.
- (1994) Proc. Of ICNN , vol.4 , pp. 2448-2453
- Baird III, L.C.¹

3
- 0031332446
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- R. Beard, G. Saridis, J. Wen, "Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation", Automatica, vol. 33, no. 12, pp. 2159-2177, 1997.
- (1997) Automatica , vol.33 , Issue.12 , pp. 2159-2177
- Beard, R.¹ Saridis, G.² Wen, J.³

4
- 0003487482
- Athena Scientific, MA
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 0034848079
- Successive Collocation: An Approximation to Optimal Nonlinear Control
- J. W. Curtis, R. W. Beard, "Successive Collocation: An Approximation to Optimal Nonlinear Control", IEEE Proc. ACC01, vol. 5, pp. 3481-3485, 2001.
- (2001) IEEE Proc. ACC01 , vol.5 , pp. 3481-3485
- Curtis, J.W.¹ Beard, R.W.²

6
- 0033629916
- Reinforcement Learning In Continuous Time and Space
- K. Doya, "Reinforcement Learning In Continuous Time and Space", Neural Computation, 12 (1), pp. 219-245, 2000.
- (2000) Neural Computation , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

7
- 34249047468
- Continuous-Time Adaptive Critics
- T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-Time Adaptive Critics", IEEE Transactions on Neural Networks, 18 (3), pp. 631-647, 2007.
- (2007) IEEE Transactions on Neural Networks , vol.18 , Issue.3 , pp. 631-647
- Hanselmann, T.¹ Noakes, L.² Zaknich, A.³

8
- 0003644124
- MIT Press, Cambridge, Massachusetts
- R. A. Howard, Dynamic Programming and Markov Processes, MIT Press, Cambridge, Massachusetts, 1960.
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

9
- 84914965022
- On an Iterative Technique for Riccati Equation Computations
- February
- D. Kleinman, "On an Iterative Technique for Riccati Equation Computations", IEEE Trans, on Automatic Control, vol. 13, pp. 114- 115, February, 1968.
- (1968) IEEE Trans, on Automatic Control , vol.13 , pp. 114-115
- Kleinman, D.¹

10
- 0004025786
- Taylor & Francis
- F. L. Lewis, S. Jagannathan, A. Yesildirek, Neural Network Control of Robot Manipulators and Nonlinear Systems, Taylor & Francis 1999.
- (1999) Neural Network Control of Robot Manipulators and Nonlinear Systems
- Lewis, F.L.¹ Jagannathan, S.² Yesildirek, A.³

11
- 0029304635
- Neural Net Controller with Guaranteed Tracking Performance
- F. L. Lewis, K. Liu, and A. Yesildirek, "Neural Net Controller with Guaranteed Tracking Performance", IEEE Transactions on Neural Networks, vol. 6, no. 3, pp. 703-715, 1995.
- (1995) IEEE Transactions on Neural Networks , vol.6 , Issue.3 , pp. 703-715
- Lewis, F.L.¹ Liu, K.² Yesildirek, A.³

12
- 0004163205
- John Wiley
- F. L. Lewis, V. L. Syrmos, Optimal Control, John Wiley, 1995.
- (1995) Optimal Control
- Lewis, F.L.¹ Syrmos, V.L.²

13
- 0036588686
- Adaptive Dynamic Programming
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive Dynamic Programming", IEEE Trans, on Systems, Man and Cybernetics, vol. 32, no. 2, pp 140-153, 2002.
- (2002) IEEE Trans, on Systems, Man and Cybernetics , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

14
- 0031236002
- Adaptive critic designs
- D. Prokhorov, D. Wunsch, "Adaptive critic designs, " IEEE Trans, on Neural Networks, vol. 8, no 5, pp. 997-1007, 1997.
- (1997) IEEE Trans, on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.¹ Wunsch, D.²

15
- 84921399937
- John Wiley, New Jersey
- J. Si, A. Barto, W. Powel, D. Wunch, Handbook of Learning and Approximate Dynamic Programming, John Wiley, New Jersey, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.² Powel, W.³ Wunch, D.⁴

16
- 0004102479
- MIT Press, Cambridge, Massachusetts
- R. S. Sutton, A. G. Barto, Reinforcement Learning- An Introduction, MIT Press, Cambridge, Massachusetts, 1998.
- (1998) Reinforcement Learning- An Introduction
- Sutton, R.S.¹ Barto, A.G.²

17
- 63049136575
- Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration
- D. Vrabie, F. Lewis, "Adaptive Optimal Control Algorithm for Continuous-Time Nonlinear Systems Based on Policy Iteration", IEEE Proc. CDC08, pp. 73-79, 2008
- (2008) IEEE Proc. CDC08 , pp. 73-79
- Vrabie, D.¹ Lewis, F.²

18
- 85060504479
- Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration
- to appear
- D. Vrabie, O. Pastravanu, F. Lewis, M. Abu-Khalaf, "Adaptive Optimal Control for Continuous-Time Linear Systems Based on Policy Iteration", Automatica (to appear)
- Automatica
- Vrabie, D.¹ Pastravanu, O.² Lewis, F.³ Abu-Khalaf, M.⁴

19
- 0343839296
- Beyond Regression
- Ph. D. Thesis
- P. J. Werbos, Beyond Regression: New Tools for Prediction and Analysis in the Behavior Sciences, Ph. D. Thesis, 1974.
- (1974) New Tools for Prediction and Analysis in the Behavior Sciences
- Werbos, P.J.¹

20
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling,
- ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold
- P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling, " Handbook of Intelligent Control, ed. D. A. White and D. A. Sofge, New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

21
- 0024888479
- Neural networks for control and system identification
- P. Werbos, "Neural networks for control and system identification", IEEE Proc. CDC89, vol. 1, pp. 260-265, 1989.
- (1989) IEEE Proc. CDC89 , vol.1 , pp. 260-265
- Werbos, P.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.