SCOPUS Information Search Platform

Transactions of the Institute of Measurement & Control

Volume 30, Issue 3-4, 2008, Pages 207-223

Biologically inspired scheme for continuous-time approximate dynamic programming

(3) Vrabie, Draguna a; Lewis, Frank a; Abu Khalaf, Murad b

a UNIVERSITY OF TEXAS AT ARLINGTON (United States)

b MATHWORKS INC (United States)

Author keywords

Actor Critic structures; approximate dynamic programming; linear quadratic regulation; policy iteration

Indexed keywords

BIOMIMETICS; CONTINUOUS TIME SYSTEMS; CONTROL SYSTEM ANALYSIS; CONTROLLERS; DYNAMIC PROGRAMMING; ITERATIVE METHODS; LEARNING SYSTEMS; LINEAR SYSTEMS; MAN MACHINE SYSTEMS; NONLINEAR SYSTEMS; REINFORCEMENT LEARNING; RICCATI EQUATIONS; ROBUST CONTROL; STATE FEEDBACK; STATE SPACE METHODS; SYSTEM THEORY;

ALGEBRAIC RICCATI EQUATIONS; APPROXIMATE DYNAMIC PROGRAMMING; BIOLOGICALLY INSPIRED; LINEAR QUADRATIC REGULATIONS; MODELLING AND ANALYSIS; OPTIMAL STATE FEEDBACK; POLICY ITERATION; REINFORCEMENT LEARNING TECHNIQUES;

ADAPTIVE CONTROL SYSTEMS;

EID: 47949083966 PISSN: 01423312 EISSN: None Source Type: Journal
DOI: 10.1177/0142331207088188 Document Type: Article

Times cited : (13)

References (29)

1
- 33847648898
- Adaptive critic designs for discrete-time zero-sum games with application to H-Infinity control
- Al-Tamimi, A., Abu-Khalaf, M. and Lewis, F.L. 2007a: Adaptive critic designs for discrete-time zero-sum games with application to H-Infinity control. IEEE Transactions on Systems, Man, and Cybernetics-Part B 37, 240-47.
- (2007) IEEE Transactions on Systems, Man, and Cybernetics-Part B , vol.37 , pp. 240-247
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

2
- 33846781129
- Model-free Q-learning designs for discrete-time zero-sum games with application to H-Infinity control
- Al-Tamimi, A., Abu-Khalaf, M. and Lewis, F.L. 2007b: Model-free Q-learning designs for discrete-time zero-sum games with application to H-Infinity control. Automatica 43, 473-82.
- (2007) Automatica , vol.43 , pp. 473-482
- Al-Tamimi, A.¹ Abu-Khalaf, M.² Lewis, F.L.³

3
- 0028733775
- Reinforcement learning in continuous-time: advantage updating
- Proceedings of the International Conference on Neural Networks, Orlando, FL, June
- Baird, L. 1994: Reinforcement learning in continuous-time: advantage updating. Proceedings of the International Conference on Neural Networks, Orlando, FL, June.
- (1994)
- Baird, L.¹

4
- 0020970738
- Neuronlike elements that can solve difficult learning control problems
- Barto, A.G., Sutton, R.S. and Anderson, C.W. 1983: Neuronlike elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics SMC-13, 835-46.
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.SMC-13 , pp. 835-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

5
- 0003487482
- Athena Scientific.
- Bertsekas, D. and Tsitsiklis, J. 1996: Neuro-dynamic programming. Athena Scientific.
- (1996) Neuro-dynamic programming
- Bertsekas, D.¹ Tsitsiklis, J.²

6
- 84980552700
- Dynamic programming and suboptimal control: a survey from ADP to MPC
- Proceedings of CDC'05.
- Bertsekas, D.P. 2005: Dynamic programming and suboptimal control: a survey from ADP to MPC. Proceedings of CDC'05.
- (2005)
- Bertsekas, D.P.¹

7
- 0028584964
- Adaptive linear quadratic control using policy iteration
- Proceedings of the American Control Conference, Baltimore, MD, June, 3475-76
- Bradtke, S.J., Ydstie, B.E. and Barto, A.G. 1994: Adaptive linear quadratic control using policy iteration. Proceedings of the American Control Conference, Baltimore, MD, June, 3475-76.
- (1994)
- Bradtke, S.J.¹ Ydstie, B.E.² Barto, A.G.³

8
- 0018011435
- Kronecker products and matrix calculus in system theory
- Brewer, J.W. 1978: Kronecker products and matrix calculus in system theory. IEEE Transactions on Circuits and Systems CAS-25, 772-81.
- (1978) IEEE Transactions on Circuits and Systems , vol.CAS-25 , pp. 772-781
- Brewer, J.W.¹

9
- 85156231814
- Temporal difference learning in continuous-time and space
- Doya, K. 1996: Temporal difference learning in continuous-time and space. Advances in Neural Information Processing Systems 8, 1073-79.
- (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1073-1079
- Doya, K.¹

10
- 0033629916
- Reinforcement learning in continuous-time and space
- Doya, K. 2000: Reinforcement learning in continuous-time and space. Neural Computation 12, 219-45.
- (2000) Neural Computation , vol.12 , pp. 219-245
- Doya, K.¹

11
- 0036060633
- An adaptive critic global controller
- Proceedings of the American Control Conference, Anchorage, AK, 2665-70
- Ferrari, S. and Stengel, R. 2002: An adaptive critic global controller. Proceedings of the American Control Conference, Anchorage, AK, 2665-70.
- (2002)
- Ferrari, S.¹ Stengel, R.²

12
- 0039058777
- A 40-Hz auditory potential recorded from the human scalp
- Galambos, R., Makeig, S. and Talmachoff, J. 1981: A 40-Hz auditory potential recorded from the human scalp. Proceedings of the National Academy of Sciences, 78, 2643-47.
- (1981) Proceedings of the National Academy of Sciences , vol.78 , pp. 2643-2647
- Galambos, R.¹ Makeig, S.² Talmachoff, J.³

13
- 0003644124
- MIT Press.
- Howard, R.A. 1960: Dynamic programming and Markov processes. MIT Press.
- (1960) Dynamic programming and Markov processes
- Howard, R.A.¹

14
- 84914965022
- On an iterative technique for Riccati equation computations
- Kleinman, D. 1968: On an iterative technique for Riccati equation computations. IEEE Transactions on Automatic Control 13, 114-15.
- (1968) IEEE Transactions on Automatic Control , vol.13 , pp. 114-115
- Kleinman, D.¹

15
- 0003754075
- PhD dissertation, Linköping University.
- Landelius, T. 1997: Reinforcement learning and distributed local model synthesis. PhD dissertation, Linköping University.
- (1997) Reinforcement learning and distributed local model synthesis
- Landelius, T.¹

16
- 47949130058
- Optimality in biological and artificial networks?
- Levine, D.S. and Elsberry, W.R., editors. 1997: Optimality in biological and artificial networks? Lawrence Erlbaum Associates.
- (1997) Lawrence Erlbaum Associates
- Levine, D.S.¹ Elsberry, W.R.²

17
- 0344375746
- Lawrence Erlbaum Associates.
- Levine, D.S., Brown, V.R. and Shirey, V.T., editors. 2000: Oscillations in neural systems. Lawrence Erlbaum Associates.
- (2000) Oscillations in neural systems
- Levine, D.S.¹ Brown, V.R.² Shirey, V.T.³

18
- 0034849306
- Adaptive critic based neuro-observer
- Proceedings of the American Control Conference, Arlington, VA, 1616-21
- Liu, X. and Balakrishnan, S.N. 2001: Adaptive critic based neuro-observer. Proceedings of the American Control Conference, Arlington, VA, 1616-21.
- (2001)
- Liu, X.¹ Balakrishnan, S.N.²

19
- 0022633118
- A neural cocktail party processor
- Malsburg, C. von der and Schneider, W. 1986: A neural cocktail party processor. Biological Cybernetics 54, 29-40.
- (1986) Biological Cybernetics , vol.54 , pp. 29-40
- von der Malsburg, C.¹ Schneider, W.²

20
- 0036588686
- Adaptive dynamic programming
- Murray, J., Cox, C., Lendaris, G. and Saeks, R. 2002: Adaptive dynamic programming. IEEE Transactions on Systems, Man, and Cybernetics 32, 140-53.
- (2002) IEEE Transactions on Systems, Man, and Cybernetics , vol.32 , pp. 140-153
- Murray, J.¹ Cox, C.² Lendaris, G.³ Saeks, R.⁴

21
- 0031236002
- Adaptive critic designs
- Prokhorov, D. and Wunsch, D. 1997: Adaptive critic designs. IEEE Transactions on Neural Networks 8, 997-1007.
- (1997) IEEE Transactions on Neural Networks , vol.8 , pp. 997-1007
- Prokhorov, D.¹ Wunsch, D.²

22
- 0028969330
- Visual feature integration and the temporal correlation hypothesis
- Singer, W. and Gray, C.M. 1995: Visual feature integration and the temporal correlation hypothesis. Annual Review of Neuroscience 18, 555-86.
- (1995) Annual Review of Neuroscience , vol.18 , pp. 555-586
- Singer, W.¹ Gray, C.M.²

23
- 47949105735
- The interplay of intrinsic and synaptic membrane currents in delta, theta and 40-Hz oscillations
- In Levine, D.S., Brown, V.R. & Shirey, V.T., editors. Lawrence Erlbaum Associates.
- Soltesz, I. 2000: The interplay of intrinsic and synaptic membrane currents in delta, theta and 40-Hz oscillations. In Levine, D.S., Brown, V.R. & Shirey, V.T., editors. Oscillations in neural systems. Lawrence Erlbaum Associates.
- (2000) Oscillations in neural systems
- Soltesz, I.¹

24
- 33847202724
- Learning to predict by the method of temporal differences
- Sutton, R.S. 1988: Learning to predict by the method of temporal differences. Machine Learning 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

25
- 0004102479
- MIT Press.
- Sutton, R.S. and Barto, A.G. 1998: Reinforcement learning: an introduction. MIT Press.
- (1998) Reinforcement learning: an introduction
- Sutton, R.S.¹ Barto, A.G.²

26
- 34548721141
- Continuous-time ADP for linear systems with partially unknown dynamics
- Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, 247-53
- Vrabie, D., Abu-Khalaf, M., Lewis, F.L. and Wang, Youyi. 2007: Continuous-time ADP for linear systems with partially unknown dynamics. Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, 247-53.
- (2007)
- Vrabie, D.¹ Abu-Khalaf, M.² Lewis, F.L.³ Wang, Y.⁴

27
- 0004049893
- Ph.D. thesis, Cambridge University.
- Watkins, C. 1989: Learning from delayed rewards. Ph.D. thesis, Cambridge University.
- (1989) Learning from delayed rewards
- Watkins, C.¹

28
- 0002031779
- Approximate dynamic programming for real-time control and neural modeling
- In White, D.A. and Sofge, D.A., editors. Van Nostrand.
- Werbos, P.J. 1992: Approximate dynamic programming for real-time control and neural modeling. In White, D.A. and Sofge, D.A., editors. Handbook of intelligent control. Van Nostrand.
- (1992) Handbook of intelligent control
- Werbos, P.J.¹

29
- 47949095751
- Optimization: a foundation for understanding consciousness
- In Levine, D.S. and Elsberry, W.R., editors. Lawrence Erlbaum Associates.
- Werbos, P.J. 1997: Optimization: a foundation for understanding consciousness. In Levine, D.S. and Elsberry, W.R., editors, Optimality in biological and artificial networks? Lawrence Erlbaum Associates.
- (1997) Optimality in biological and artificial networks?
- Werbos, P.J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.