SCOPUS 정보 검색 플랫폼

Automatica

Volumn 46, Issue 5, 2010, Pages 804-814

Approximate dynamic programming with a fuzzy parameterization

(4) Buşoniu, Lucian a Ernst, Damien c De Schutter, Bart a,b Babuška, Robert a

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

b DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

c UNIVERSITY OF LIÈGE (Belgium)

Author keywords

Approximate dynamic programming; Convergence analysis; Fuzzy approximation; Value iteration

Indexed keywords

ACTION SPACES; APPROXIMATE DYNAMIC PROGRAMMING; APPROXIMATION ACCURACY; APPROXIMATORS; ASYNCHRONOUS ALGORITHMS; CONTROL ACTIONS; CONVERGENCE ANALYSIS; DETERMINISTIC PROCESS; DISCRETE SETS; DISCRETIZATIONS; FINITE NUMBER; FUZZY APPROXIMATION; FUZZY PARTITION; ITERATION ALGORITHMS; NON-LINEAR OPTIMAL CONTROL; OPTIMAL SOLUTIONS; PROCESS STATE; REWARD FUNCTION; STATE SPACE; SUBOPTIMALITY; TWO-LINK MANIPULATOR; VALUE ITERATION;

ALGORITHMS; ASYMPTOTIC ANALYSIS; DYNAMIC POSITIONING; MANIPULATORS; OPTIMAL SYSTEMS; OPTIMIZATION; PERCOLATION (SOLID STATE); PROCESS CONTROL;

DYNAMIC PROGRAMMING;

EID: 77950867376 PISSN: 00051098 EISSN: None Source Type: Journal
DOI: 10.1016/j.automatica.2010.02.006 Document Type: Article

Times cited : (54)

References (26)

1
- 74049127928
- Fitted Q-iteration in continuous action-space MDPs
- Platt J.C., Koller D., Singer Y., and Roweis S.T. (Eds), MIT Press
- Antos A., Munos R., and Szepesvári Cs. Fitted Q-iteration in continuous action-space MDPs. In: Platt J.C., Koller D., Singer Y., and Roweis S.T. (Eds). Advances in neural information processing systems: Vol. 20 (2008), MIT Press 9-16
- (2008) Advances in neural information processing systems: Vol. 20 , pp. 9-16
- Antos, A.¹ Munos, R.² Szepesvári, Cs.³

2
- 0041877717
- A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
- Berenji H.R., and Vengerov D. A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters. IEEE Transactions on Fuzzy Systems 11 4 (2003) 478-485
- (2003) IEEE Transactions on Fuzzy Systems , vol.11 , Issue.4 , pp. 478-485
- Berenji, H.R.¹ Vengerov, D.²

3
- 0003565783
- Athena Scientific
- Bertsekas D.P. Dynamic programming and optimal control, Vol. 2. 3rd ed. (2007), Athena Scientific
- (2007) Dynamic programming and optimal control, Vol. 2. 3rd ed.
- Bertsekas, D.P.¹

4
- 0003487482
- Athena Scientific
- Bertsekas D.P., and Tsitsiklis J.N. Neuro-dynamic programming (1996), Athena Scientific
- (1996) Neuro-dynamic programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

5
- 0003449524
- Prentice Hall
- Brown M., and Harris C. Neurofuzzy adaptive modeling and control (1994), Prentice Hall
- (1994) Neurofuzzy adaptive modeling and control
- Brown, M.¹ Harris, C.²

6
- 34547223380
- Buşoniu, L., De Schutter, B., & Babuška, R. (2006). Decentralized reinforcement learning control of a robotic manipulator. In Proceedings 9th international conference of control, automation, robotics, and vision, ICARCV-06 (pp. 1347-1352). Singapore, 5-8 December.
- Buşoniu, L., De Schutter, B., & Babuška, R. (2006). Decentralized reinforcement learning control of a robotic manipulator. In Proceedings 9th international conference of control, automation, robotics, and vision, ICARCV-06 (pp. 1347-1352). Singapore, 5-8 December.

7
- 50249166041
- Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2007). Fuzzy approximation for convergent model-based reinforcement learning. In Proceedings 2007 IEEE international conference on fuzzy systems, FUZZ-IEEE-07 (pp. 968-973). London, UK, 23-26 July.
- Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2007). Fuzzy approximation for convergent model-based reinforcement learning. In Proceedings 2007 IEEE international conference on fuzzy systems, FUZZ-IEEE-07 (pp. 968-973). London, UK, 23-26 July.

8
- 55249099118
- Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2008a). Consistency of fuzzy model-based reinforcement learning. In Proceedings 2008 IEEE international conference on fuzzy systems, FUZZ-IEEE-08 (pp. 518-524). Hong Kong, 1-6 June.
- Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2008a). Consistency of fuzzy model-based reinforcement learning. In Proceedings 2008 IEEE international conference on fuzzy systems, FUZZ-IEEE-08 (pp. 518-524). Hong Kong, 1-6 June.

9
- 49949101369
- Continuous-state reinforcement learning with fuzzy approximation
- Adaptive agents and multi-agent systems III. Tuyls I.K., Nowé A., Guessoum Z., and Kudenko D. (Eds), Springer
- Buşoniu L., Ernst D., De Schutter B., and Babuška R. Continuous-state reinforcement learning with fuzzy approximation. In: Tuyls I.K., Nowé A., Guessoum Z., and Kudenko D. (Eds). Adaptive agents and multi-agent systems III. Lecture notes in computer science Vol. 4865 (2008), Springer 27-43
- (2008) Lecture notes in computer science , vol.4865 , pp. 27-43
- Buşoniu, L.¹ Ernst, D.² De Schutter, B.³ Babuška, R.⁴

10
- 0026206780
- An optimal one-way multigrid algorithm for discrete-time stochastic control
- Chow C.-S., and Tsitsiklis J.N. An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Transactions on Automatic Control 36 8 (1991) 898-914
- (1991) IEEE Transactions on Automatic Control , vol.36 , Issue.8 , pp. 898-914
- Chow, C.-S.¹ Tsitsiklis, J.N.²

11
- 21844465127
- Tree-based batch mode reinforcement learning
- Ernst D., Geurts P., and Wehenkel L. Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6 (2005) 503-556
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

12
- 70449644892
- Farahmand, A. M., Ghavamzadeh, M., Szepesvári, Cs., & Mannor, S. (2009). Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems. In Proceedings 2009 American control conference, ACC-09(pp. 725-730). St. Louis, US, 10-12 June.
- Farahmand, A. M., Ghavamzadeh, M., Szepesvári, Cs., & Mannor, S. (2009). Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems. In Proceedings 2009 American control conference, ACC-09(pp. 725-730). St. Louis, US, 10-12 June.

13
- 33845529505
- Reinforcement learning: An overview
- Aachen, Germany, 14-15 September
- Glorennec, P. Y. (2000). Reinforcement learning: An overview. In Proceedings European symposium on intelligent techniques, ESIT-00 (pp. 17-35). Aachen, Germany, 14-15 September.
- (2000) Proceedings European symposium on intelligent techniques, ESIT-00 , pp. 17-35
- Glorennec, P.Y.¹

14
- 77950858524
- Gordon, G. (1995). Stable function approximation in dynamic programming. In Proceedings 12th international conference on machine learning, ICML-95(pp. 261-268). Tahoe City, US, 9-12 July.
- Gordon, G. (1995). Stable function approximation in dynamic programming. In Proceedings 12th international conference on machine learning, ICML-95(pp. 261-268). Tahoe City, US, 9-12 July.

15
- 0030377615
- Horiuchi, T., Fujino, A., Katai, O., & Sawaragi, T. (1996). Fuzzy interpolation-based Q-learning with continuous states and actions. In Proceedings 5th IEEE international conference on fuzzy systems, FUZZ-IEEE-96 (pp. 594-600). New Orleans, US, 8-11 September.
- Horiuchi, T., Fujino, A., Katai, O., & Sawaragi, T. (1996). Fuzzy interpolation-based Q-learning with continuous states and actions. In Proceedings 5th IEEE international conference on fuzzy systems, FUZZ-IEEE-96 (pp. 594-600). New Orleans, US, 8-11 September.

16
- 0003560988
- Springer
- Istratescu V.I. Fixed point theory: An introduction (2002), Springer
- (2002) Fixed point theory: An introduction
- Istratescu, V.I.¹

17
- 0032140718
- Fuzzy inference system learning by reinforcement methods
- Jouffe L. Fuzzy inference system learning by reinforcement methods. IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews 28 3 (1998) 338-355
- (1998) IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews , vol.28 , Issue.3 , pp. 338-355
- Jouffe, L.¹

18
- 0004204349
- Wiley
- Kruse R., Gebhardt J.E., and Klowon F. Foundations of fuzzy systems (1994), Wiley
- (1994) Foundations of fuzzy systems
- Kruse, R.¹ Gebhardt, J.E.² Klowon, F.³

19
- 4644323293
- Least-squares policy iteration
- Lagoudakis M.G., and Parr R. Least-squares policy iteration. Journal of Machine Learning Research 4 (2003) 1107-1149
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

20
- 0041876271
- A reinforcement learning adaptive fuzzy controller for robots
- Lin C.-K. A reinforcement learning adaptive fuzzy controller for robots. Fuzzy Sets and Systems 137 3 (2003) 339-352
- (2003) Fuzzy Sets and Systems , vol.137 , Issue.3 , pp. 339-352
- Lin, C.-K.¹

21
- 0036832953
- Variable-resolution discretization in optimal control
- Munos R., and Moore A. Variable-resolution discretization in optimal control. Machine Learning 49 2-3 (2002) 291-323
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 291-323
- Munos, R.¹ Moore, A.²

22
- 44649189852
- Finite time bounds for fitted value iteration
- Munos R., and Szepesvári Cs. Finite time bounds for fitted value iteration. Journal of Machine Learning Research 9 (2008) 815-857
- (2008) Journal of Machine Learning Research , vol.9 , pp. 815-857
- Munos, R.¹ Szepesvári, Cs.²

23
- 0008872081
- Analysis of a numerical dynamic programming algorithm applied to economic models
- Santos M.S., and Vigo-Aguiar J. Analysis of a numerical dynamic programming algorithm applied to economic models. Econometrica 66 2 (1998) 409-426
- (1998) Econometrica , vol.66 , Issue.2 , pp. 409-426
- Santos, M.S.¹ Vigo-Aguiar, J.²

24
- 0004102479
- MIT Press
- Sutton R.S., and Barto A.G. Reinforcement learning: An introduction (1998), MIT Press
- (1998) Reinforcement learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

25
- 14344263882
- Szepesvári, Cs., & Smart, W. D. (2004). Interpolation-based Q-learning. In Proceedings 21st international conference on machine learning, ICML-04(pp. 791-798). Bannf, Canada, 4-8 July.
- Szepesvári, Cs., & Smart, W. D. (2004). Interpolation-based Q-learning. In Proceedings 21st international conference on machine learning, ICML-04(pp. 791-798). Bannf, Canada, 4-8 July.

26
- 0029752470
- Feature-based methods for large scale dynamic programming
- Tsitsiklis J.N., and Van Roy B. Feature-based methods for large scale dynamic programming. Machine Learning 22 1-3 (1996) 59-94
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.