SCOPUS 정보 검색 플랫폼

Proceedings of the International Joint Conference on Neural Networks

Volumn , Issue , 2012, Pages

Integral reinforcement learning with explorations for continuous-time nonlinear systems

(3) Lee, Jae Young a Park, Jin Bae a Choi, Yoon Ho b

a Yonsei University (South Korea)

b Kyonggi University (South Korea)

Author keywords

[No Author keywords available]

Indexed keywords

ACTOR CRITIC; CONTINUOUS TIME; CONTINUOUS TIME NONLINEAR SYSTEMS; CONTROL INPUTS; INPUT-AFFINE; LEAST SQUARE; PARAMETERIZATIONS; PERSISTENTLY EXCITING; SIMULATION EXAMPLE; TIME VARYING SIGNAL;

CONTINUOUS TIME SYSTEMS; NEURAL NETWORKS; REINFORCEMENT LEARNING;

NONLINEAR SYSTEMS;

EID: 84865092901 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IJCNN.2012.6252508 Document Type: Conference Paper

Times cited : (18)

References (22)

1
- 0004102479
- MIT Press, Cambridge, Massachussetts
- R. S. Sutton and A. G. Barto, Reinforcement Learning-An Introduction, MIT Press, Cambridge, Massachussetts, 1998.
- (1998) Reinforcement Learning-An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 84921399937
- Wiley-IEEE Press
- J. Si, A. G. Barto, W. B. Powell, and D. Wunsch, Handbook of Learning and Approximate Dynamic Programming, Wiley-IEEE Press, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.G.² Powell, W.B.³ Wunsch, D.⁴

3
- 66449130966
- Adaptive dynamic programming: An introduction
- F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: an introduction," IEEE Computational Magazine, vol. 4, no. 2, pp. 39-47, 2009.
- (2009) IEEE Computational Magazine , vol.4 , Issue.2 , pp. 39-47
- Wang, F.Y.¹ Zhang, H.² Liu, D.³

4
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits and Systems Magazine, vol. 9, no. 3, pp. 32-50, 2009.
- (2009) IEEE Circuits and Systems Magazine , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

5
- 0031236002
- Adaptive critic designs
- D. V. Prokhorov and D. C. Wunsch II, "Adaptive critic designs," IEEE Trans. Neural Networks, vol. 8, no. 5, pp. 997-1007, 1997.
- (1997) IEEE Trans. Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.V.¹ Wunsch II, D.C.²

6
- 0028584964
- Adaptive linear quadratic control using policy iteration
- Baltimore, Maryland
- S. J. Bradtke and B. E. Ydstie, "Adaptive linear quadratic control using policy iteration," Proc. American Control Conference, Baltimore, Maryland, pp. 3475-3479, 1994.
- (1994) Proc. American Control Conference , pp. 3475-3479
- Bradtke, S.J.¹ Ydstie, B.E.²

7
- 33847648898
- Adaptive critic designs for discrete-time zero-sum games with application to H1 control
- A. Al-Tamimi, M. Abu-Khalaf, and F. L. Lewis, "Adaptive critic designs for discrete-time zero-sum games with application to H1 control," IEEE Trans. Syst., Man, Cybern.-Part B, vol. 37, no. 1, pp. 240-247, 2007.
- (2007) IEEE Trans. Syst., Man, Cybern.-Part B , vol.37 , Issue.1 , pp. 240-247
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

8
- 33846781129
- Model-free Q-learning designs for discrete-time zero-sum games with application to H1 control
- A. Al-Tamimi, M. Abu-Khalaf, and F. L. Lewis, "Model-free Q-learning designs for discrete-time zero-sum games with application to H1 control," Automatica, vol. 43, no. 3, 473-481, 2007.
- (2007) Automatica , vol.43 , Issue.3 , pp. 473-481
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

9
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof," IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics, vol. 38, no. 4, 2008.
- (2008) IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics , vol.38 , Issue.4
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

10
- 0033629916
- Reinforcement learning in continuous-time and space
- K. Doya, "Reinforcement learning in continuous-time and space," Neural Computation 12, pp. 219-245, 2000.
- (2000) Neural Computation , vol.12 , pp. 219-245
- Doya, K.¹

11
- 34249047468
- Continuous-time adaptive critics
- T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics," IEEE Trans. Neural Network, vol. 18, no. 3, pp. 631-647, 2007.
- (2007) IEEE Trans. Neural Network , vol.18 , Issue.3 , pp. 631-647
- Hanselmann, T.¹ Noakes, L.² Zaknich, A.³

12
- 0036588686
- Adaptive dynamic programming
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive Dynamic Programming," IEEE Trans. Systems, Mans and Cybernetics- PART B: Cybernetics, vol. 32, no. 2, pp. 140-153, 2002.
- (2002) IEEE Trans. Systems, Mans and Cybernetics- PART B: Cybernetics , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

13
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

14
- 14844340822
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Automatica, vol. 41, no. 5, pp. 779-791, 2005.
- (2005) Automatica , vol.41 , Issue.5 , pp. 779-791
- Abu-Khalaf, M.¹ Lewis, F.L.²

15
- 67349145396
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- D. Vrable and F. L. Lewis "Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems," Neural Networks, vol. 22, no. 3, pp. 237-246, 2009.
- (2009) Neural Networks , vol.22 , Issue.3 , pp. 237-246
- Vrable, D.¹ Lewis, F.L.²

16
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- 878-888
- K. G. Vamvoudakis, F. L. Lewis "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, pp. 878-888 vol. 46, no. 5, pp. 878-888, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

17
- 79953151751
- A model-free robust policy iteration algorithm for optimal control of nonlinear systems
- Atlanta, GA
- S. Bhasin, M. Johnson, W. E. Dixon, "A model-free robust policy iteration algorithm for optimal control of nonlinear systems," 49th IEEE Conf. Decision and Control, Atlanta, GA, pp. 3060-3065, 2010.
- (2010) 49th IEEE Conf. Decision and Control , pp. 3060-3065
- Bhasin, S.¹ Johnson, M.² Dixon, W.E.³

18
- 78751528766
- Policy-iteration-based adaptive optimal control for uncertain continuous-time linear systems with excitation signals
- Ilsan, Kyonggi-Do, South Korea, Oct.
- J. Y. Lee, J. B. Park, and Y. H. Choi, "Policy-iteration-based adaptive optimal control for uncertain continuous-time linear systems with excitation signals," in Proc. Int'l Conf. on Control, Automation, and Systems (ICCAS), Ilsan, Kyonggi-Do, South Korea, pp. 646-651, Oct. 2010.
- (2010) Proc. Int'l Conf. on Control, Automation, and Systems (ICCAS) , pp. 646-651
- Lee, J.Y.¹ Park, J.B.² Choi, Y.H.³

19
- 84867400046
- Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
- accepted for publication
- J. Y. Lee, J. B. Park, and Y. H. Choi, "Integral Q-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems," Automatica, accepted for publication, 2012.
- (2012) Automatica
- Lee, J.Y.¹ Park, J.B.² Choi, Y.H.³

20
- 0018441647
- An approximation theory of optimal control for trainable manipulators
- G. N. Saridis and C. G. Lee, "An approximation theory of optimal control for trainable manipulators," IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics, vol. 9, no. 3, 1979.
- (1979) IEEE Trans. Systems, Man, and Cybernetics-PART B: Cybernetics , vol.9 , Issue.3
- Saridis, G.N.¹ Lee, C.G.²

21
- 14544289894
- A note on persistency of excitation
- J. C. Willems, P. Rapisarda, I. Markovsky, and Bart L. M. Moor, "A note on persistency of excitation," Systems & Control Letters, vol. 54, no. 4, pp. 325-329, 2005.
- (2005) Systems & Control Letters , vol.54 , Issue.4 , pp. 325-329
- Willems, J.C.¹ Rapisarda, P.² Markovsky, I.³ Moor, B.L.M.⁴

22
- 62949149213
- Constrained nonlinear optimal control: A converse HJB approach
- Pasadena, CA 91125
- V. Nevistic and J. A. Primbs, "Constrained nonlinear optimal control: a converse HJB approach," Technical report CIT-CDS 96-021, California Institute of Technology, Pasadena, CA 91125, 1996
- (1996) Technical Report CIT-CDS 96-021, California Institute of Technology
- Nevistic, V.¹ Primbs, J.A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.