SCOPUS 정보 검색 플랫폼

AIAA Guidance, Navigation, and Control (GNC) Conference

Volumn , Issue , 2013, Pages

Adaptive optimal control of partially-unknown constrained-input systems using policy iteration with experience replay

(5) Modares, Hamidreza a Lewis, Frank L b Naghibi Sistani, Mohammad Bagher a Chowdhary, Girish c,e Yucelen, Tansel d

a FERDOWSI UNIVERSITY OF MASHHAD (Iran)

b UNIVERSITY OF TEXAS AT ARLINGTON (United States)

c MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

d University of Missouri Rolla (United States)

e Oklahoma State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE OPTIMAL CONTROL; FEEDBACK CONTROL LAW; NEAR-OPTIMAL CONTROL; ONLINE LEARNING ALGORITHMS; OPTIMAL CONTROL PROBLEM; OPTIMAL CONTROL SOLUTION; PERSISTENCE OF EXCITATION; POLICY ITERATION ALGORITHMS;

ALGORITHMS; CONTROL; CONTROL THEORY; ITERATIVE METHODS; NEURAL NETWORKS;

OPTIMAL CONTROL SYSTEMS;

EID: 84883680649 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (10)

References (23)

1
- 0004163205
- nd ed., Wiley
- nd ed., Wiley, 2012.
- (2012) Optimal Control
- Lewis, F.L.¹ Vrabie, D.² Syrmos, V.³

2
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., and Barto, A. G., Reinforcement learning-an introduction, Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement learning-an introduction
- Sutton, R.S.¹ Barto, A.G.²

3
- 47349092417
- Wiley-Interscience
- Powell, W. B., Approximate Dynamic Programming: solving the curses of dimensionality, Wiley-Interscience, 2007.
- (2007) Approximate Dynamic Programming: Solving the curses of dimensionality
- Powell, W.B.¹

4
- 0003565783
- nd ed., MA: Athena Scientific
- nd ed., MA: Athena Scientific, 2012.
- (2012) Dynamic Programming and Optimal Control: Approximate Dynamic Programming
- Bertsekas, D.P.¹

5
- 0003644124
- Cambridge, MA: MIT Press
- Howard, R. A., Dynamic programming and markov processes. Cambridge, MA: MIT Press, 1960.
- (1960) Dynamic programming and markov processes
- Howard, R.A.¹

6
- 0003785722
- Ph. D. Dissertation, Electrical Engineering Dep., Rensselaer Polytech Ins., Troy, New York
- Beard, R. W., "Improving the Closed-loop Performance of Nonlinear Systems, " Ph. D. Dissertation, Electrical Engineering Dep., Rensselaer Polytech Ins., Troy, New York, 1995.
- (1995) Improving the Closed-loop Performance of Nonlinear Systems
- Beard, R.W.¹

7
- 14844340822
- Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach
- Abu-Khalaf, M., and Lewis, F. L., "Nearly Optimal Control Laws for Nonlinear Systems with Saturating Actuators Using a Neural Network HJB Approach, " Automatica, Vol. 41, 2005, pp. 779, 791.
- (2005) Automatica , vol.41
- Abu-Khalaf, M.¹ Lewis, F.L.²

8
- 0033629916
- Reinforcement Learning in Continuous-time and Space
- Doya, K., "Reinforcement Learning in Continuous-time and Space, " Neural Computation, Vol. 12, No. 1, 2000, pp. 219, 245.
- (2000) Neural Computation , vol.12 , Issue.1
- Doya, K.¹

9
- 77950630017
- Online Actor-critic Algorithm to Solve the Continuous Infinite-time Horizon Optimal Control Problem
- Vamvoudakis, K., and Lewis, F. L., "Online Actor-critic Algorithm to Solve the Continuous Infinite-time Horizon Optimal Control Problem, " Automatica, Vol. 46, 2010, pp. 878, 888.
- (2010) Automatica , vol.46
- Vamvoudakis, K.¹ Lewis, F.L.²

10
- 0036588686
- Adaptive Dynamic Programming
- Murray, J. J., Cox, C. J., Lendaris, G. G., and Saeks, R., "Adaptive Dynamic Programming, " IEEE Trans. Syst., Man, Cybern., Part C: Appl. Rev., Vol. 32, No. 2, 2002, pp. 140, 153.
- (2002) IEEE Trans. Syst. Man, Cybern., Part C: Appl. Rev , vol.32 , Issue.2
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

11
- 67349145396
- Neural Network Approach to Continuous-time Direct Adaptive Optimal Control for Partially Unknown Nonlinear Systems
- Vrabie, D., and Lewis, F. L., "Neural Network Approach to Continuous-time Direct Adaptive Optimal Control for Partially Unknown Nonlinear Systems, " Neural Netw., Vol. 22, 2009, pp. 237, 246.
- (2009) Neural Netw , vol.22
- Vrabie, D.¹ Lewis, F.L.²

12
- 84881367645
- Ph. D. Dissertation, Florida Univ
- Bhasin, S., "Reinforcement Learning and Optimal Control Methods for Uncertain Nonlinear Systems, " Ph. D. Dissertation, Florida Univ, 2011.
- (2011) Reinforcement Learning and Optimal Control Methods for Uncertain Nonlinear Systems
- Bhasin, S.¹

13
- 58349110975
- Adaptive Optimal Control for Continuous-time Linear Systems Based on Policy Iteration
- Vrabie, D., Pastravanu, O., Abu-Khalaf, M., and Lewis, F. L., "Adaptive Optimal Control for Continuous-time Linear Systems Based on Policy Iteration, " Automatica, Vol. 45, No. 2, 2009, pp. 477, 484.
- (2009) Automatica , vol.45 , Issue.2
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

14
- 0031143730
- An Analysis of Temporal-Difference Learning with Function Approximation
- Tsitsiklis, J. N., and Van Roy, B., "An Analysis of Temporal-Difference Learning with Function Approximation, " IEEE Trans. Automatic Control, Vol. 42, 1997, pp. 674, 690.
- (1997) IEEE Trans. Automatic Control , vol.42
- Tsitsiklis, J.N.¹ Roy, B.V.²

15
- 71749106087
- Real-time Reinforcement Learning by Sequential Actor-critics and Experience Replay
- Wawrzynski, P., "Real-time Reinforcement Learning by Sequential Actor-critics and Experience Replay. " Neural Netw., Vol. 22, 2009, pp. 1484, 1497.
- (2009) Neural Netw , vol.22
- Wawrzynski, P.¹

16
- 56749173285
- Efficient Experience Reuse in Non-Markovian Environments
- Control Inf. Technol., Tokyo, Japan
- Dung, L. T., Komeda, T., and Takagi, M., "Efficient Experience Reuse in Non-Markovian Environments. " Proceeding of the Internatinal Conference Instrum, Control Inf. Technol., Tokyo, Japan, 2008, pp. 3327-3332.
- (2008) Proceeding of the Internatinal Conference Instrum , pp. 3327-3332
- Dung, L.T.¹ Komeda, T.² Takagi, M.³

17
- 60349130974
- Batch reinforcement learning in a complex domain
- Honolulu, HI
- Kalyanakrishnan, S., and Stone, P., "Batch reinforcement learning in a complex domain. " Proceeding of the 6th Internation Conference on Autoomus Agents and Multi-Agent Systms, Honolulu, HI, pp. 650-657, 2007.
- (2007) Proceeding of the 6th Internation Conference on Autoomus Agents and Multi-Agent Systms , pp. 650-657
- Kalyanakrishnan, S.¹ Stone, P.²

18
- 0000123778
- Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching
- Lin, L. J., "Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching. " Machine Learning, Vol. 8, 1992, pp. 293, 321.
- (1992) Machine Learning , vol.8
- Lin, L.J.¹

19
- 84857501996
- Experience Replay for Real-Time Reinforcement Learning Control
- Adam, S., Busoniu, L., and Babuska, R., "Experience Replay for Real-Time Reinforcement Learning Control. " IEEE Trans. Syst. Man, Cybern., Part C: Appl. Rev., Vol. 42, 2012, pp. 201, 212.
- (2012) IEEE Trans. Syst. Man Cybern., Part C: Appl. Rev , vol.42
- Adam, S.¹ Busoniu, L.² Babuska, R.³

20
- 79953141961
- PhD. Dissertation, Georgia institute of technology
- Chowdhary, G. V., "Concurrent learning for convergence in adaptive control without persistency of excitation, " PhD. Dissertation, Georgia institute of technology, 2010.
- (2010) Concurrent learning for convergence in adaptive control without persistency of excitation
- Chowdhary, G.V.¹

21
- 84883670357
- Concurrent Learning for Convergence in Adaptive Control without
- Atlanta GA
- Chowdhary, G. V., and Johnson, E., "Concurrent Learning for Convergence in Adaptive Control without, " IEEE CDC, Atlanta GA, 2010, pp. 3675-3679.
- (2010) IEEE CDC , pp. 3675-3679
- Chowdhary, G.V.¹ Johnson, E.²

22
- 0030392685
- Constrained optimization and control of nonlinear systems: New results in optimal control
- Lyshevski, S. E., "Constrained optimization and control of nonlinear systems: New results in optimal control, " Proceeding of the IEEE Conference Decision and Control, 1996, pp. 541-546.
- (1996) Proceeding of the IEEE Conference Decision and Control , pp. 541-546
- Lyshevski, S.E.¹

23
- 0003917259
- New York: Academic Press
- Finlayson, B. A., The method of weighted residuals and variational principles. New York: Academic Press, 1990.
- (1990) The method of weighted residuals and variational principles
- Finlayson, B.A.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.