SCOPUS 정보 검색 플랫폼

IEEE International Conference on Fuzzy Systems

Volumn , Issue , 2007, Pages

Fuzzy approximation for convergent model-based reinforcement learning

(4) Buşoniu, Lucian a Ernst, Damien b De Schutter, Bart a Babuška, Robert a

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

b SUPELEC Campus de Rennes (France)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION ALGORITHMS; CONTROL THEORY; EDUCATION; FUZZY CONTROL; FUZZY LOGIC; FUZZY SYSTEMS; HEURISTIC PROGRAMMING; ITERATIVE METHODS; LEARNING SYSTEMS; POLYNOMIAL APPROXIMATION; REINFORCEMENT; REINFORCEMENT LEARNING;

APPROXIMATE SOLUTIONS; CONSISTENCY PROPERTIES; CONTROL ACTIONS; CONVERGENCE RESULTS; DISCRETE VALUES; FUZZY APPROXIMATIONS; FUZZY REPRESENTATIONS; HEURISTIC SOLUTIONS; INTERNATIONAL CONFERENCES; LEARNING CONTROL; MODEL-BASED; ORIGINAL ALGORITHMS; Q VALUES; SIMULATION EXAMPLE;

LEARNING ALGORITHMS;

EID: 50249166041 PISSN: 10987584 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/FUZZY.2007.4295497 Document Type: Conference Paper

Times cited : (11)

References (24)

1
- 50249089930
- Reinforcement
- Cambridge, US: MIT Press
- R. S. Sutton and A. G. B arto, Reinforcement Learning: An Introduction. Cambridge, US: MIT Press, 1998.
- (1998) Learning: An Introduction
- Sutton, R.S.¹ arto, A.G.B.²

2
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Liftman, and A. W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Liftman, M.L.² Moore, A.W.³

3
- 0003565783
- 2nd ed. Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control, 2nd ed. Athena Scientific, 2001, vol. 2.
- (2001) Dynamic Programming and Optimal Control , vol.2
- Bertsekas, D.P.¹

4
- 0029752470
- Feature-based methods for large scale dynamic programming
- J. N. Tsitsiklis and B. Van Roy, "Feature-based methods for large scale dynamic programming," Machine Learning, vol. 22, no. 1-3, pp. 59-94, 1996.
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

5
- 84880694195
- Stable function approximation in dynamic programming
- Tahoe City, US, 9-12 July
- G. Gordon, "Stable function approximation in dynamic programming," in Proceedings Twelfth International Conference on Machine Learning (ICML-95), Tahoe City, US, 9-12 July 1995, pp. 261-268.
- (1995) Proceedings Twelfth International Conference on Machine Learning (ICML-95) , pp. 261-268
- Gordon, G.¹

6
- 0242580448
- Variable-resolution discretization in optimal control
- R. Munos and A. Moore, "Variable-resolution discretization in optimal control," Machine Learning, vol. 1, pp. 1-31, 2001.
- (2001) Machine Learning , vol.1 , pp. 1-31
- Munos, R.¹ Moore, A.²

7
- 0036832956
- Kernel-based reinforcement learning
- D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, pp. 161-178, 2002.
- (2002) Machine Learning , vol.49 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

8
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, vol. 6, pp. 503-556, 2005.
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

9
- 4644323293
- Least-squares policy iteration
- M. G. Lagoudakis and R. Parr, "Least-squares policy iteration," Journal of Machine Learning Research, vol. 4, pp. 1107-1149, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

10
- 14344263882
- Interpolation-based Q-learning
- Bannf, Canada, July 4-8
- C. Szepesvári and W. D. Smart, "Interpolation-based Q-learning," in Proceedings 21st International Conference on Machine Learning (ICML-04), Bannf, Canada, July 4-8 2004.
- (2004) Proceedings 21st International Conference on Machine Learning (ICML-04)
- Szepesvári, C.¹ Smart, W.D.²

11
- 84898958374
- Gradient descent for general reinforcement learning
- Denver, US, 30 November, 5 December
- L. Baird and A. Moore, "Gradient descent for general reinforcement learning," in Advances in Neural Information Processing Systems 11 (NLPS-98), Denver, US, 30 November - 5 December 1998, pp. 968-974.
- (1998) Advances in Neural Information Processing Systems 11 (NLPS-98) , pp. 968-974
- Baird, L.¹ Moore, A.²

12
- 85153965130
- Reinforcement learning with soft state aggregation
- Denver, Colorado, USA
- S. P. Singh, T. Jaakkola, and M. I. Jordan, "Reinforcement learning with soft state aggregation," in Advances in Neural Information Processing Systems 7, Denver, Colorado, USA, 1994, pp. 361-368.
- (1994) Advances in Neural Information Processing Systems 7 , pp. 361-368
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

13
- 1442288723
- Near optimal closed-loop control. Application to electric power systems,
- Ph.D. dissertation, University of Liège, Belgium, March
- D. Ernst, "Near optimal closed-loop control. Application to electric power systems," Ph.D. dissertation, University of Liège, Belgium, March 2003.
- (2003)
- Ernst, D.¹

14
- 85153940465
- Generalization in reinforcement learning: Safely approximating the value function
- Denver, Colorado, US
- J. Boyan and A. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems 7 (NIPS-94), Denver, Colorado, US, 1994, pp. 369-376.
- (1994) Advances in Neural Information Processing Systems 7 (NIPS-94) , pp. 369-376
- Boyan, J.¹ Moore, A.²

15
- 34249833101
- Q-leaming
- C. J. C. H. Watkins and P. Dayan, "Q-leaming," Macnine Learning, vol. 8, pp. 279-292, 1992.
- (1992) Macnine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

16
- 33845529505
- Reinforcement learning: An overview
- Aachen, Germany, 14-15 September
- P. Y. Glorennec, "Reinforcement learning: An overview," in Proceedings European Symposium on Intelligent Techniques (ESIT-00), Aachen, Germany, 14-15 September 2000, pp. 17-35.
- (2000) Proceedings European Symposium on Intelligent Techniques (ESIT-00) , pp. 17-35
- Glorennec, P.Y.¹

17
- 0030377615
- Fuzzy interpolation-based Q-Ieaming with continuous states and actions
- New Orleans, US, 8-11 September
- T. Horiuchi, A. Fujino, O. Katai, and T. Sawaragi, "Fuzzy interpolation-based Q-Ieaming with continuous states and actions," in Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96), New Orleans, US, 8-11 September 1996, pp. 594-600.
- (1996) Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96) , pp. 594-600
- Horiuchi, T.¹ Fujino, A.² Katai, O.³ Sawaragi, T.⁴

18
- 0032140718
- Fuzzy inference system learning by reinforcement methods
- L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews, vol. 28, no. 3, pp. 338-355, 1998.
- (1998) IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews , vol.28 , Issue.3 , pp. 338-355
- Jouffe, L.¹

19
- 0026923465
- Learning and tuning fuzzy logic controllers through reinforcements
- H. R. Berenji and P. Khedkar, "Learning and tuning fuzzy logic controllers through reinforcements," IEEE Transactions on Neural Networks, vol. 3, no. 5, pp. 724-740, 1992.
- (1992) IEEE Transactions on Neural Networks , vol.3 , Issue.5 , pp. 724-740
- Berenji, H.R.¹ Khedkar, P.²

20
- 0041877717
- A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
- H. R. Berenji and D. Vengerov, "A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters," IEEE Transactions on Fuzzy Systems, vol. 11, no. 4, pp. 478-485, 2003.
- (2003) IEEE Transactions on Fuzzy Systems , vol.11 , Issue.4 , pp. 478-485
- Berenji, H.R.¹ Vengerov, D.²

21
- 24644466803
- A fuzzy reinforcement learning approach to power control in wireless transmitters
- D. Vengerov, N. Bambos, and H. R. Berenji, "A fuzzy reinforcement learning approach to power control in wireless transmitters," IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics, vol. 35, no. 4, pp. 768-778, 2005.
- (2005) IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics , vol.35 , Issue.4 , pp. 768-778
- Vengerov, D.¹ Bambos, N.² Berenji, H.R.³

22
- 0001537801
- Evolutionary learning, reinforcement learning, and fuzzy mies for knowledge acquisition in agent-based systems
- A. Bonarini, "Evolutionary learning, reinforcement learning, and fuzzy mies for knowledge acquisition in agent-based systems," Proceedings of the IEEE, vol. 89, no. 9. pp. 1334-1346, 2001.
- (2001) Proceedings of the IEEE , vol.89 , Issue.9 , pp. 1334-1346
- Bonarini, A.¹

23
- 0041876271
- A reinforcement learning adaptive fuzzy controller for robots
- C.-K. Lin, "A reinforcement learning adaptive fuzzy controller for robots," Fuzzy Sets and Systems, vol. 137, pp. 339-352, 2003.
- (2003) Fuzzy Sets and Systems , vol.137 , pp. 339-352
- Lin, C.-K.¹

24
- 50249132307
- Closed-loop learning of visual control policies,
- Ph.D. dissertation, University of Liège, Belgium, December
- S. Jodogne, "Closed-loop learning of visual control policies," Ph.D. dissertation, University of Liège, Belgium, December 2006.
- (2006)
- Jodogne, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.