SCOPUS 정보 검색 플랫폼

IEEE International Conference on Fuzzy Systems

Volumn , Issue , 2008, Pages 518-524

Consistency of fuzzy model-based reinforcement learning

(4) Buşoniu, Lucian a Ernst, Damien c,d De Schutter, Bart a,b Babuška, Robert a

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

b DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

c University of Liège ^*

d UNIVERSITY OF LIÈGE (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

FUZZY LOGIC; FUZZY SYSTEMS; LEARNING ALGORITHMS; LEARNING SYSTEMS; POLYNOMIAL APPROXIMATION; PROBABILITY DENSITY FUNCTION; REINFORCEMENT; REINFORCEMENT LEARNING; SOLUTIONS;

ACTION SPACES; APPROXIMATE ALGORITHMS; APPROXIMATION ACCURACIES; CONTROL ACTIONS; DISCRETE SETS; DISCRETIZATION; EXPERIMENTAL STUDIES; FUZZY MODELS; FUZZY PARTITIONS; ITERATION ALGORITHMS; LEARNING CONTROLS; MODEL-BASED; OPTIMAL SOLUTIONS; PROCESS STATES; REWARD FUNCTIONS; STATE SPACES;

APPROXIMATION ALGORITHMS;

EID: 55249099118 PISSN: 10987584 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/FUZZY.2008.4630417 Document Type: Conference Paper

Times cited : (7)

References (23)

1
- 0003565783
- 3rd ed. Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control. 3rd ed. Athena Scientific, 2007, vol. 2.
- (2007) Dynamic Programming and Optimal Control , vol.2
- Bertsekas, D.P.¹

2
- 50249089930
- Reinforcement
- MIT Press
- R. S. Sutton and A. G. Barlo, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Learning: An Introduction
- Sutton, R.S.¹ Barlo, A.G.²

3
- 50249166041
- Fuzzy approximation for convergent model-based reinforcement learning
- London, UK, 23-26 July
- L. Buşoniu, D. Ernst, B. De Schutter. and R. Babuska, "Fuzzy approximation for convergent model-based reinforcement learning," in Proceedings 2007 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-07). London, UK, 23-26 July 2007. pp. 968-973.
- (2007) Proceedings 2007 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-07) , pp. 968-973
- Buşoniu, L.¹ Ernst, D.² De Schutter, B.³ Babuska, R.⁴

4
- 49949101369
- _, Continuous-state reinforcement learning with fuzzy approximation, in Adaptive Agents and Multi-Agent Systems III, ser. Lecture Notes in Computer Science, K. Tuyls. A. Nowé, Z. Guessoum, and D. Kudenko. Eds. Springer. 2008, 4865, pp. 27-43.
- _, "Continuous-state reinforcement learning with fuzzy approximation," in Adaptive Agents and Multi-Agent Systems III, ser. Lecture Notes in Computer Science, K. Tuyls. A. Nowé, Z. Guessoum, and D. Kudenko. Eds. Springer. 2008, vol. 4865, pp. 27-43.

5
- 33845529505
- Reinforcement learning: An overview
- Aachen, Germany. 14-15 September
- P. Y. Glorennee, "Reinforcement learning: An overview," in Proceedings European Symposium on Intelligent Techniques (ESIT-00). Aachen, Germany. 14-15 September 2000. pp. 17-35.
- (2000) Proceedings European Symposium on Intelligent Techniques (ESIT-00) , pp. 17-35
- Glorennee, P.Y.¹

6
- 0030377615
- Fuzzy interpolation-based Q-learning with continuous states and actions
- New Orleans, US, 8-11 September
- T. Horiuchi, A. Fujinci, O. Katai, and T. Sawaragi, "Fuzzy interpolation-based Q-learning with continuous states and actions," in Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96), New Orleans, US, 8-11 September 1996. pp. 594-600.
- (1996) Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96) , pp. 594-600
- Horiuchi, T.¹ Fujinci, A.² Katai, O.³ Sawaragi, T.⁴

7
- 0032140718
- Fuzzy inference system learning by reinforcement methods
- L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews, vol. 28, no. 3, pp. 338-355, 1998.
- (1998) IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews , vol.28 , Issue.3 , pp. 338-355
- Jouffe, L.¹

8
- 0026923465
- Learning and tuning fuzzy logic controllers through reinforcements
- H. R. Berenji and P. Khedkar, "Learning and tuning fuzzy logic controllers through reinforcements." IEEE Transactions on Neural Networks, vol. 3, no. 5. pp. 724-740, 1992.
- (1992) IEEE Transactions on Neural Networks , vol.3 , Issue.5 , pp. 724-740
- Berenji, H.R.¹ Khedkar, P.²

9
- 0041877717
- A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
- H. R. Berenji and D. Vengerov, "A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters." IEEE Transactions on Fuzzy Systems, vol. 11, no. 4, pp. 478-485, 2003.
- (2003) IEEE Transactions on Fuzzy Systems , vol.11 , Issue.4 , pp. 478-485
- Berenji, H.R.¹ Vengerov, D.²

10
- 24644466803
- A fuzzy reinforcement learning approach to power control in wireless transmitters
- D. Vengerov, N. Bambos, and H. R. Berenji, "A fuzzy reinforcement learning approach to power control in wireless transmitters." IEEE Transactions on Systems, Man. and Cybernetics-Part B: Cybernetics. vol. 35, no. 4, pp. 768-778, 2005.
- (2005) IEEE Transactions on Systems, Man. and Cybernetics-Part B: Cybernetics , vol.35 , Issue.4 , pp. 768-778
- Vengerov, D.¹ Bambos, N.² Berenji, H.R.³

11
- 0041876271
- A reinforcement learning adaptive fuzzy controller for robots
- C.-K. Lin. "A reinforcement learning adaptive fuzzy controller for robots," Fuzzy Sets and Systems, vol. 137, no. 3, pp. 339-352, 2003.
- (2003) Fuzzy Sets and Systems , vol.137 , Issue.3 , pp. 339-352
- Lin, C.-K.¹

12
- 0026206780
- An optimal one-way multigrid algorithm for discrete-time stochastic control
- 12
- [ 12] C.-S. Chow and J. Tsitsiklis, "An optimal one-way multigrid algorithm for discrete-time stochastic control," IEEE Transactions on Automatic Control, vol. 36, no. 8, pp. 898-914, 1991.
- (1991) IEEE Transactions on Automatic Control , vol.36 , Issue.8 , pp. 898-914
- Chow, C.-S.¹ Tsitsiklis, J.²

13
- 84880694195
- Stable function approximation in dynamic programming
- Tahoe City, US, 9-12 July
- G. Gordon, "Stable function approximation in dynamic programming," in Proceedings Twelfth International Conference on Machine Learning (ICML-95), Tahoe City, US, 9-12 July 1995, pp. 261-268.
- (1995) Proceedings Twelfth International Conference on Machine Learning (ICML-95) , pp. 261-268
- Gordon, G.¹

14
- 0029752470
- Feature-based methods for large scale dynamic programming
- J. N. Tsitsiklis and B. Van Roy. "Feature-based methods for large scale dynamic programming," Machine Learning, vol. 22, no. 1-3, pp. 59-94, 1996.
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

15
- 0008872081
- Analysis of a numerical dynamic programming algorithm applied to economic models
- M. S. Santos and J. Vigo-Aguiar, "Analysis of a numerical dynamic programming algorithm applied to economic models." Econometrica, vol. 66, no. 2, pp. 409-426, 1998.
- (1998) Econometrica , vol.66 , Issue.2 , pp. 409-426
- Santos, M.S.¹ Vigo-Aguiar, J.²

16
- 31844456754
- Finite time bounds for sampling based fitted value iteration
- Bonn, Germany, 7-11 August
- C. Szepesvári and R. Munos, "Finite time bounds for sampling based fitted value iteration," in Proceedings Twenty-Second International Conference on Machine Learning (ICML-05). Bonn, Germany, 7-11 August 2005, pp. 880-887.
- (2005) Proceedings Twenty-Second International Conference on Machine Learning (ICML-05) , pp. 880-887
- Szepesvári, C.¹ Munos, R.²

17
- 22944460232
- Convergence and divergence in standard and averaging reinforcement learning
- Pisa. Italy, 20-24 September
- M. Wiering, "Convergence and divergence in standard and averaging reinforcement learning," in Proceedings 15th European Conference on Machine Learning (ECML'04), Pisa. Italy, 20-24 September 2004, pp. 477-488.
- (2004) Proceedings 15th European Conference on Machine Learning (ECML'04) , pp. 477-488
- Wiering, M.¹

18
- 0036832956
- Kernel-based reinforcement learning
- D. Ormoneit and S. Sen. "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

19
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, vol. 6, pp. 503-556, 2005.
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

20
- 14344263882
- Interpolation-based Q-learning
- Bannf, Canada, 4-8 July
- C. Szepesvári and W. D. Smart, "Interpolation-based Q-learning," in Proceedings Twenty-First International Conference on Machine Learning (ICML-04), Bannf, Canada, 4-8 July 2004, pp. 791-798.
- (2004) Proceedings Twenty-First International Conference on Machine Learning (ICML-04) , pp. 791-798
- Szepesvári, C.¹ Smart, W.D.²

21
- 85153965130
- Reinforcement learning with soft state aggregation
- G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds
- S. P. Singh. T. Jaakkola, and M. I. Jordan, "Reinforcement learning with soft state aggregation." in Advances in Neural Information Processing Systems 7, G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds., 1995, pp. 361-368.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 361-368
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

22
- 1442288723
- Near optimal closed-loop control. Application to electric power systems,
- Ph.D. dissertation, University of Liège, Belgium. March
- D. Ernst. "Near optimal closed-loop control. Application to electric power systems," Ph.D. dissertation, University of Liège, Belgium. March 2003.
- (2003)
- Ernst, D.¹

23
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- Bled, Slovenia, 27-30 June
- A. Y. Ng, D. Harada, and S. Russell, "Policy invariance under reward transformations: Theory and application to reward shaping," in Proceedings Sixteenth International Conference on Machine Learning (ICML'99), Bled, Slovenia, 27-30 June 1999, pp. 278-287.
- (1999) Proceedings Sixteenth International Conference on Machine Learning (ICML'99) , pp. 278-287
- Ng, A.Y.¹ Harada, D.² Russell, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.