SCOPUS 정보 검색 플랫폼

IEEE Control Systems

Volumn 32, Issue 6, 2012, Pages 76-105

Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers

(3) Lewis, Frank L a Vrabie, Draguna b Vamvoudakis, Kyriakos G c

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

b UNITED TECHNOLOGIES RESEARCH CENTER (United States)

c University of California (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE CONTROLLERS; CONTINUOUS-TIME DYNAMICAL SYSTEMS; FEEDBACK CONTROLLER; INDIRECT ADAPTIVE CONTROLLERS; OPTIMAL CONTROL POLICY; OPTIMAL CONTROLLER; PERFORMANCE FUNCTIONS; SYSTEM IDENTIFICATION TECHNIQUES;

FEEDBACK CONTROL; IDENTIFICATION (CONTROL SYSTEMS); NONLINEAR CONTROL SYSTEMS; NONLINEAR EQUATIONS; ONLINE SYSTEMS; OPTIMAL CONTROL SYSTEMS; OPTIMIZATION; PHILOSOPHICAL ASPECTS; REINFORCEMENT LEARNING; RICCATI EQUATIONS;

ADAPTIVE CONTROL SYSTEMS;

EID: 84883537695 PISSN: 1066033X EISSN: None Source Type: Journal
DOI: 10.1109/MCS.2012.2214134 Document Type: Review

Times cited : (955)

References (60)

1
- 0004255876
- Reading, MA: Addison-Wesley
- K. J. Astrom and B. Wittenmark, Adaptive Control. Reading, MA: Addison-Wesley, 1995.
- (1995) Adaptive Control
- Astrom, K.J.¹ Wittenmark, B.²

2
- 41849112337
- Philadelphia, PA: SIAM Press
- P. Ioannou and B. Fidan, Adaptive Control Tutorial. Philadelphia, PA: SIAM Press, 2006.
- (2006) Adaptive Control Tutorial
- Ioannou, P.¹ Fidan, B.²

3
- 0004163205
- 3rd ed. New York: Wiley
- F. L. Lewis, D. Vrabie, and V. Syrmos, Optimal Control, 3rd ed. New York: Wiley, 2012.
- (2012) Optimal Control
- Lewis, F.L.¹ Vrabie, D.² Syrmos, V.³

4
- 0031213212
- Optimal design of adaptive tracking controllers for non-linear systems
- PII S0005109897000721
- Z.-H. Li and M. Krstic, "Optimal design of adaptive tracking controllers for nonlinear systems," Automatica, vol. 33, no. 8, pp. 1459-1473, 1997. (Pubitemid 127392279)
- (1997) Automatica , vol.33 , Issue.8 , pp. 1459-1473
- Li, Z.-H.¹ Krstic, M.²

5
- 0004102479
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

6
- 47349092417
- Hoboken, NJ: Wiley
- W. B. Powell, Approximate Dynamic Programming. Hoboken, NJ: Wiley, 2007.
- (2007) Approximate Dynamic Programming
- Powell, W.B.¹

7
- 0002011091
- A menu of designs for reinforcement learning over time
- W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press
- P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1991, pp. 67-95.
- (1991) Neural Networks for Control , pp. 67-95
- Werbos, P.J.¹

8
- 49049110053
- Special issue on adaptive dynamic programming and reinforcement learning for feedback control
- Aug.
- F. L. Lewis, G. Lendaris, and D. Liu, "Special issue on adaptive dynamic programming and reinforcement learning for feedback control," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 896-897, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B , vol.38 , Issue.4 , pp. 896-897
- Lewis, F.L.¹ Lendaris, G.² Liu, D.³

9
- 84889784415
- Berlin, Germany: Springer-Verlag
- X. Cao, Stochastic Learning and Optimization. Berlin, Germany: Springer-Verlag, 2007.
- (2007) Stochastic Learning and Optimization
- Cao, X.¹

10
- 77956759998
- Reinforcement learning control and pattern recognition systems
- J. M. Mendel and K. S. Fu, Eds. New York: Academic
- J. M. Mendel and R. W. MacLaren, "Reinforcement learning control and pattern recognition systems," in Adaptive, Learning, and Pattern Recognition Systems: Theory and Applications, J. M. Mendel and K. S. Fu, Eds. New York: Academic, 1970, pp. 287-318.
- (1970) Adaptive, Learning, and Pattern Recognition Systems: Theory and Applications , pp. 287-318
- Mendel, J.M.¹ MacLaren, R.W.²

11
- 77955814101
- Boca Raton, FL: CRC Press
- L. Busoniu, R. Babuska, B. De Schutter, and D. Ernst, Reinforcement Learning and Dynamic Programming Using Function Approximators. Boca Raton, FL: CRC Press, 2009.
- (2009) Reinforcement Learning and Dynamic Programming Using Function Approximators
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³ Ernst, D.⁴

12
- 1842684992
- Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioural ecology
- DOI 10.1016/j.conb.2004.03.017, PII S0959438804000492
- W. Schultz, "Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioral ecology," Current Opinion Neurobiol., vol. 14, no. 2, pp. 139-147, 2004. (Pubitemid 38479929)
- (2004) Current Opinion in Neurobiology , vol.14 , Issue.2 , pp. 139-147
- Schultz, W.¹

13
- 0035422340
- Neural mechanisms for learning and control
- Aug.
- K. Doya, H. Kimura, and M. Kawato, "Neural mechanisms for learning and control," IEEE Control Syst. Mag., vol. 21, no. 4, pp. 42-54, Aug. 2000.
- (2000) IEEE Control Syst. Mag. , vol.21 , Issue.4 , pp. 42-54
- Doya, K.¹ Kimura, H.² Kawato, M.³

14
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold
- P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, 1992.
- (1992) Handbook of Intelligent Control
- Werbos, P.J.¹

15
- 67349145396
- Neural network approach to continuoustime direct adaptive optimal control for partially-unknown nonlinear systems
- Apr.
- D. Vrabie and F. L. Lewis, "Neural network approach to continuoustime direct adaptive optimal control for partially-unknown nonlinear systems," Neural Netw., vol. 22, no. 3, pp. 237-246, Apr. 2009.
- (2009) Neural Netw. , vol.22 , Issue.3 , pp. 237-246
- Vrabie, D.¹ Lewis, F.L.²

16
- 0020970738
- Neuron-like adaptive elements that can solve difficult learning control problems
- Sep./Oct.
- A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuron-like adaptive elements that can solve difficult learning control problems," IEEE Trans. Syst., Man, Cybern., vol. SMC-13, no. 5, pp. 834-846, Sep./Oct. 1983.
- (1983) IEEE Trans. Syst., Man, Cybern. , vol.SMC-13 , Issue.5 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

17
- 0003487482
- Belmont, MA: Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Belmont, MA: Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

18
- 0003787146
- Princeton, NJ: Princeton Univ. Press
- R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
- (1957) Dynamic Programming
- Bellman, R.E.¹

19
- 0024888479
- Neural networks for control and system identification
- P. J. Werbos, "Neural networks for control and system identification," in Proc. IEEE Conf. Decision Control, Tampa, FL, 1989, pp. 260-265.
- Proc. IEEE Conf. Decision Control, Tampa, FL, 1989 , pp. 260-265
- Werbos, P.J.¹

20
- 0031236002
- Adaptive critic designs
- Sep.
- D. Prokhorov and D. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
- (1997) IEEE Trans. Neural Netw. , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.¹ Wunsch, D.²

21
- 84921399937
- Piscataway, NJ: IEEE Press
- J. Si, A. Barto, W. Powell, and D. Wunsch, Handbook of Learning and Approximate Dynamic Programming. Piscataway, NJ: IEEE Press, 2004.
- (2004) Handbook of Learning and Approximate Dynamic Programming
- Si, J.¹ Barto, A.² Powell, W.³ Wunsch, D.⁴

22
- 49049111594
- Issues on stability of ADP feedback controllers for dynamical systems
- Aug.
- S. N. Balakrishnan, J. Ding, and F. L. Lewis, "Issues on stability of ADP feedback controllers for dynamical systems," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 913-917, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B , vol.38 , Issue.4 , pp. 913-917
- Balakrishnan, S.N.¹ Ding, J.² Lewis, F.L.³

23
- 66449130966
- Adaptive dynamic programming: An introduction
- May
- F. Y. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput, Intell, Mag., vol. 4, no. 2, pp. 39-47, May 2009.
- (2009) IEEE Comput, Intell, Mag. , vol.4 , Issue.2 , pp. 39-47
- Wang, F.Y.¹ Zhang, H.² Liu, D.³

24
- 70349116541
- Reinforcement learning and adaptive dynamic programming for feedback control
- F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, 2009.
- (2009) IEEE Circuits Syst. Mag. , vol.9 , Issue.3 , pp. 32-50
- Lewis, F.L.¹ Vrabie, D.²

25
- 0036641793
- State-constrained agile missile control with adaptive-critic-based neural networks
- DOI 10.1109/TCST.2002.1014669, PII S1063653602053605
- D. Han and S. N. Balakrishnan, "State-constrained agile missile control with adaptive-critic-based neural networks," IEEE Trans. Control Syst. Technol., vol. 10, no. 4, pp. 481-489, Jul. 2002. (Pubitemid 34798672)
- (2002) IEEE Transactions on Control Systems Technology , vol.10 , Issue.4 , pp. 481-489
- Han, D.¹ Balakrishnan, S.N.²

26
- 84871899254
- New York: Springer-Verlag
- D. Prokhorov, Computational Intelligence in Automotive Applications. New York: Springer-Verlag, 2008.
- (2008) Computational Intelligence in Automotive Applications
- Prokhorov, D.¹

27
- 0036060633
- An adaptive critic global controller
- S. Ferrari and R. F. Stengel, "An adaptive critic global controller," in Proc. American Control Conf., May 2002, pp. 2665-2670.
- Proc. American Control Conf., May 2002 , pp. 2665-2670
- Ferrari, S.¹ Stengel, R.F.²

28
- 0029592634
- Adaptive critic designs: A case study for neurocontrol
- DOI 10.1016/0893-6080(95)00042-9
- D. Prokhorov, R. A. Santiago, and D. C. Wunsch, II, "Adaptive critic designs: A case study for neurocontrol," Neural Netw., vol. 8, no. 9, pp. 1367-1372, 1995. (Pubitemid 26072896)
- (1995) Neural Networks , vol.8 , Issue.9 , pp. 1367-1372
- Prokhorov, D.V.¹ Santiago, R.A.² Wunsch II, D.C.³

29
- 0036588686
- Adaptive dynamic programming
- J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, "Adaptive dynamic programming," IEEE Trans. Syst., Man Cybern. C, vol. 32, no. 2, pp. 140-153, 2002.
- (2002) IEEE Trans. Syst., Man Cybern. C , vol.32 , Issue.2 , pp. 140-153
- Murray, J.J.¹ Cox, C.J.² Lendaris, G.G.³ Saeks, R.⁴

30
- 0042767744
- Helicopter flight control reconfiguration for main rotor actuator failures
- R. Enns and J. Si, "Helicopter flight control reconfiguration for main rotor actuator failures," AIAA J. Guidance, Control, Dynamics, vol. 26, no. 4, pp. 572-584, 2003.
- (2003) AIAA J. Guidance, Control, Dynamics , vol.26 , Issue.4 , pp. 572-584
- Enns, R.¹ Si, J.²

31
- 49049106959
- Direct heuristic dynamic programming method for power system stability enhancement
- C. Lu, J. Si, and X. Xie, "Direct heuristic dynamic programming method for power system stability enhancement," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 1008-1013, 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B , vol.38 , Issue.4 , pp. 1008-1013
- Lu, C.¹ Si, J.² Xie, X.³

32
- 0033685661
- Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle
- G. G. Lendaris, L. Schultz, and T. Shannon, "Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle," in Proc. Int. Conf. Neural Networks, 2000, pp. 73-78.
- Proc. Int. Conf. Neural Networks, 2000 , pp. 73-78
- Lendaris, G.G.¹ Schultz, L.² Shannon, T.³

33
- 0034548295
- Convergence analysis of adaptive critic based optimal control
- X. Liu and S. N. Balakrishnan, "Convergence analysis of adaptive critic based optimal control," in Proc. American Control Conf., June 2000, pp. 1929-1933.
- Proc. American Control Conf., June 2000 , pp. 1929-1933
- Liu, X.¹ Balakrishnan, S.N.²

34
- 49049089962
- Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
- Aug.
- A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern. B, vol. 38, no. 4, pp. 943-949, Aug. 2008.
- (2008) IEEE Trans. Syst., Man, Cybern. B , vol.38 , Issue.4 , pp. 943-949
- Al-Tamimi, A.¹ Lewis, F.L.² Abu-Khalaf, M.³

35
- 0003668973
- London, U.K.: John Murray
- C. Darwin, On the Origin of Species by Means of Natural Selection. London, U.K.: John Murray, 1859.
- (1859) On the Origin of Species by Means of Natural Selection
- Darwin, C.¹

36
- 0003881809
- New York: Wiley
- D. G. Luenberger, Introduction to Dynamic Systems. New York: Wiley, 1979.
- (1979) Introduction to Dynamic Systems
- Luenberger, D.G.¹

37
- 58349110975
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- D. Vrabie, O. Pastravanu, M. Abu-Khalaf, and F. L. Lewis, "Adaptive optimal control for continuous-time linear systems based on policy iteration," Automatica, vol. 45, no. 2, pp. 477-484, 2009.
- (2009) Automatica , vol.45 , Issue.2 , pp. 477-484
- Vrabie, D.¹ Pastravanu, O.² Abu-Khalaf, M.³ Lewis, F.L.⁴

38
- 0003663467
- New York: McGraw-Hill
- A. Papoulis, Probability Random Variables and Stochastic Processes. New York: McGraw-Hill, 2002.
- (2002) Probability Random Variables and Stochastic Processes
- Papoulis, A.¹

39
- 0022738693
- Decentralized learning in finite Markov chains
- June
- R. M. Wheeler and K. S. Narendra, "Decentralized learning in finite Markov chains," IEEE Trans. Autom. Control, vol. 31, no. 6, pp. 519-526, June 1986.
- (1986) IEEE Trans. Autom. Control , vol.31 , Issue.6 , pp. 519-526
- Wheeler, R.M.¹ Narendra, K.S.²

40
- 77950806766
- Q-learning and Pontryagin's minimum principle
- P. Mehta and S. Meyn, "Q-learning and Pontryagin's minimum principle," in Proc. IEEE Conf. Decision Control, Dec. 2009, pp. 3598-3605.
- Proc. IEEE Conf. Decision Control, Dec. 2009 , pp. 3598-3605
- Mehta, P.¹ Meyn, S.²

41
- 67650505616
- Algorithm and stability of ATC receding horizon control
- H. Zhang, J. Huang, and F. L. Lewis, "Algorithm and stability of ATC receding horizon control," in Proc. IEEE Symp. Adaptive Dynamic Programming Reinforcement, Nashville, TN, Mar. 2009, pp. 28-35.
- Proc. IEEE Symp. Adaptive Dynamic Programming Reinforcement, Nashville, TN, Mar. 2009 , pp. 28-35
- Zhang, H.¹ Huang, J.² Lewis, F.L.³

42
- 0004049893
- Ph.D. dissertation, Cambridge University, Cambridge, U.K.
- C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Cambridge University, Cambridge, U.K., 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

43
- 34249833101
- Q-learning
- C. J. C. H. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3-4, pp. 279-292, 1992.
- (1992) Mach. Learn. , vol.8 , Issue.3-4 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

44
- 0015109409
- An iterative technique for the computation of the steady state gains for the discrete optimal regulator
- Aug.
- G. A. Hewer, "An iterative technique for the computation of the steady state gains for the discrete optimal regulator," IEEE Trans Autom. Control, vol. 16, no. 4, pp. 382-384, Aug. 1971.
- (1971) IEEE Trans Autom. Control , vol.16 , Issue.4 , pp. 382-384
- Hewer, G.A.¹

45
- 0004074291
- London, U.K.: Oxford Univ. Press
- P. Lancaster and L. Rodman, Algebraic Riccati Equations. London, U.K.: Oxford Univ. Press, 1995.
- (1995) Algebraic Riccati Equations
- Lancaster, P.¹ Rodman, L.²

46
- 0003536131
- London, U.K.: Springer-Verlag
- K. L. Moore, Iterative Learning Control for Deterministic Systems. London, U.K.: Springer-Verlag, 1993.
- (1993) Iterative Learning Control for Deterministic Systems
- Moore, K.L.¹

47
- 0026883666
- L2-gain analysis of nonlinear systems and nonlinear state feedback H? Control
- A. J. Van, "L2-gain analysis of nonlinear systems and nonlinear state feedback H? control," IEEE Trans. Autom. Control, vol. 37, no. 6, pp. 770-784, 1992.
- (1992) IEEE Trans. Autom. Control , vol.37 , Issue.6 , pp. 770-784
- Van, A.J.¹

48
- 0003473124
- Englewood Cliffs, NJ: Prentice-Hall
- L. Ljung, System Identification: Theory for the User. Englewood Cliffs, NJ: Prentice-Hall, 1999.
- (1999) System Identification: Theory for the User
- Ljung, L.¹

49
- 0028584964
- Adaptive linear quadratic control using policy iteration
- S. Bradtke, B. Ydstie, and A. Barto, "Adaptive linear quadratic control using policy iteration," in Proc. Amer. Control Conf., Baltimore, MD, 1994, pp. 3475-3479.
- Proc. Amer. Control Conf., Baltimore, MD, 1994 , pp. 3475-3479
- Bradtke, S.¹ Ydstie, B.² Barto, A.³

50
- 79551685808
- Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
- Feb.
- F. L. Lewis and K. G. Vamvoudakis, "Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data," IEEE Trans. Syst., Man, Cybern. B, vol. 41, no. 1, pp. 14-25, Feb. 2011.
- (2011) IEEE Trans. Syst., Man, Cybern. B , vol.41 , Issue.1 , pp. 14-25
- Lewis, F.L.¹ Vamvoudakis, K.G.²

51
- 33845759425
- Policy iterations on the Hamilton-Jacobi-Isaacs equation for H state feedback control with input saturation
- DOI 10.1109/TAC.2006.884959
- M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Policy iterations on the Hamilton-Jacobi-Isaacs equation for state feedback control with input saturation," IEEE Trans. Autom. Control, vol. 51, no. 12, pp. 1989-1995, Dec. 2006. (Pubitemid 46002295)
- (2006) IEEE Transactions on Automatic Control , vol.51 , Issue.12 , pp. 1989-1995
- Abu-Khalaf, M.¹ Lewis, F.L.² Huang, J.³

52
- 0028733775
- Reinforcement learning in continuous time: Advantage updating
- L. C. Baird, "Reinforcement learning in continuous time: Advantage updating," in Proc. Int. Conf. Neural Networks, Orlando, FL, June1994, pp. 2448-2453.
- Proc. Int. Conf. Neural Networks, Orlando, FL, June1994 , pp. 2448-2453
- Baird, L.C.¹

53
- 0033629916
- Reinforcement learning in continuous time and space
- K. Doya, "Reinforcement learning in continuous time and space," Neural Comput., vol. 12, no. 1, pp. 219-245, 2000.
- (2000) Neural Comput. , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

54
- 34249047468
- Continuous-time adaptive critics
- May
- T. Hanselmann, L. Noakes, and A. Zaknich, "Continuous-time adaptive critics," IEEE Trans. Neural Netw., vol. 18, no. 3, pp. 631-647, May 2007.
- (2007) IEEE Trans. Neural Netw. , vol.18 , Issue.3 , pp. 631-647
- Hanselmann, T.¹ Noakes, L.² Zaknich, A.³

55
- 84914965022
- On an iterative technique for Riccati equation computations
- Feb.
- D. L. Kleinman, "On an iterative technique for Riccati equation computations," IEEE Trans. Autom. Control, vol. AC-13, no. 1, pp. 114-115, Feb. 1968.
- (1968) IEEE Trans. Autom. Control , vol.AC-13 , Issue.1 , pp. 114-115
- Kleinman, D.L.¹

56
- 62949149213
- Dept. Control Dynamical Systems, California Institute of Technology, Pasadena, CA, Tech. Rep. 96-021
- V. Nevistic and J. Primbs, "Constrained nonlinear optimal control: A converse HJB approach," Dept. Control Dynamical Systems, California Institute of Technology, Pasadena, CA, Tech. Rep. 96-021, 1996.
- (1996) Constrained Nonlinear Optimal Control: A Converse HJB Approach
- Nevistic, V.¹ Primbs, J.²

57
- 77950630017
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- K. G. Vamvoudakis and F. L. Lewis, "Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem," Automatica, vol. 46, no. 5, pp. 878-888, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 878-888
- Vamvoudakis, K.G.¹ Lewis, F.L.²

58
- 79960443754
- Adaptive dynamic programming for online solution of a zero-sum differential game
- D. Vrabie and F. L. Lewis, "Adaptive dynamic programming for online solution of a zero-sum differential game," J. Control Theory: Its Appl., vol. 9, no. 3, pp. 353-360, 2011.
- (2011) J. Control Theory: Its Appl. , vol.9 , Issue.3 , pp. 353-360
- Vrabie, D.¹ Lewis, F.L.²

59
- 79960897012
- Multi-player non-zero sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations
- K. G. Vamvoudakis and F. Lewis, "Multi-player non-zero sum games: Online adaptive learning solution of coupled Hamilton-Jacobi equations," Automatica, vol. 47, no. 8, pp. 556-569, 2011.
- (2011) Automatica , vol.47 , Issue.8 , pp. 556-569
- Vamvoudakis, K.G.¹ Lewis, F.²

60
- 77955423822
- Model-free H-infinity control design for unknown linear discrete-time systems via Q-learning with LMI
- Aug.
- J. H. Kim and F. L. Lewis, "Model-free H-infinity control design for unknown linear discrete-time systems via Q-learning with LMI," Automatica, vol. 46, no. 8, pp. 1320-1326, Aug. 2010.
- (2010) Automatica , vol.46 , Issue.8 , pp. 1320-1326
- Kim, J.H.¹ Lewis, F.L.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.