SCOPUS Information Search Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 4865 LNAI, 2008, Pages 27-43

Continuous-state reinforcement learning with fuzzy approximation

(4) Buşoniu, Lucian (a); Ernst, Damien (b); De Schutter, Bart (a); Babuška, Robert (a)

(a) Delft University of Technology (Netherlands)

(b) SUPELEC Campus de Rennes (France)

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE SYSTEMS; AGENTS; ALGORITHMS; APPROXIMATION ALGORITHMS; CONTROL THEORY; EDUCATION; FUZZY CONTROL; INTELLIGENT AGENTS; ITERATIVE METHODS; LEARNING ALGORITHMS; LEARNING SYSTEMS; POLYNOMIAL APPROXIMATION; REINFORCEMENT; REINFORCEMENT LEARNING;

ADAPTIVE AGENTS; DISCRETE SETS; ENVIRONMENT STATES; EUROPEAN; FUZZY APPROXIMATIONS; FUZZY REPRESENTATIONS; LEARNING AGENTS; LEARNING PARADIGMS; MODEL-BASED; MODEL-FREE; MULTI-AGENT LEARNING; Q VALUES; Q-LEARNING; SIMULATION EXAMPLE;

MULTI AGENT SYSTEMS;

EID: 49949101369 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-77949-0_3 Document Type: Conference Paper

Times cited: 17

References (21)

1
- 0004102479
- MIT Press, Cambridge
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

2
- 0003565783
- 2nd edn, Athena Scientific
- Bertsekas, D.P.: Dynamic Programming and Optimal Control, 2nd edn., vol. 2. Athena Scientific (2001)
- (2001) Dynamic Programming and Optimal Control , vol.2
- Bertsekas, D.P.¹

3
- 34249833101
- Q-learning
- Watkins, C.J.C.H., Dayan, P.: Q-learning. Machine Learning 8, 279-292 (1992)
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

4
- 33845529505
- Reinforcement learning: An overview
- Aachen, Germany, September 14-15
- Glorennec, P.Y.: Reinforcement learning: An overview. In: ESIT 2000. Proceedings European Symposium on Intelligent Techniques, Aachen, Germany, September 14-15, 2000, pp. 17-35 (2000)
- (2000) ESIT 2000. Proceedings European Symposium on Intelligent Techniques , pp. 17-35
- Glorennec, P.Y.¹

5
- 0030377615
- Fuzzy interpolation-based Q-learning with continuous states and actions
- New Orleans, US, September 8-11
- Horiuchi, T., Fujino, A., Katai, O., Sawaragi, T.: Fuzzy interpolation-based Q-learning with continuous states and actions. In: FUZZ-IEEE 1996. Proceedings 5th IEEE International Conference on Fuzzy Systems, New Orleans, US, September 8-11, 1996, pp. 594-600 (1996)
- (1996) FUZZ-IEEE 1996. Proceedings 5th IEEE International Conference on Fuzzy Systems , pp. 594-600
- Horiuchi, T.¹ Fujino, A.² Katai, O.³ Sawaragi, T.⁴

6
- 0032140718
- Fuzzy inference system learning by reinforcement methods
- Jouffe, L.: Fuzzy inference system learning by reinforcement methods. IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews 28(3), 338-355 (1998)
- (1998) IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews , vol.28 , Issue.3 , pp. 338-355
- Jouffe, L.¹

7
- 0026923465
- Learning and tuning fuzzy logic controllers through reinforcements
- Berenji, H.R., Khedkar, P.: Learning and tuning fuzzy logic controllers through reinforcements. IEEE Transactions on Neural Networks 3(5), 724-740 (1992)
- (1992) IEEE Transactions on Neural Networks , vol.3 , Issue.5 , pp. 724-740
- Berenji, H.R.¹ Khedkar, P.²

8
- 0041877717
- A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
- Berenji, H.R., Vengerov, D.: A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters. IEEE Transactions on Fuzzy Systems 11(4), 478-485 (2003)
- (2003) IEEE Transactions on Fuzzy Systems , vol.11 , Issue.4 , pp. 478-485
- Berenji, H.R.¹ Vengerov, D.²

9
- 24644466803
- A fuzzy reinforcement learning approach to power control in wireless transmitters
- Vengerov, D., Bambos, N., Berenji, H.R.: A fuzzy reinforcement learning approach to power control in wireless transmitters. IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics 35(4), 768-778 (2005)
- (2005) IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics , vol.35 , Issue.4 , pp. 768-778
- Vengerov, D.¹ Bambos, N.² Berenji, H.R.³

10
- 0041876271
- A reinforcement learning adaptive fuzzy controller for robots
- Lin, C.K.: A reinforcement learning adaptive fuzzy controller for robots. Fuzzy Sets and Systems 137, 339-352 (2003)
- (2003) Fuzzy Sets and Systems , vol.137 , pp. 339-352
- Lin, C.K.¹

11
- 0029752470
- Feature-based methods for large scale dynamic programming
- Tsitsiklis, J.N., Van Roy, B.: Feature-based methods for large scale dynamic programming. Machine Learning 22(1-3), 59-94 (1996)
- (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

12
- 31844456754
- Finite time bounds for sampling based fitted value iteration
- Bonn, Germany, August 7-11
- Szepesvári, C., Munos, R.: Finite time bounds for sampling based fitted value iteration. In: ICML 2005. Proceedings Twenty-Second International Conference on Machine Learning, Bonn, Germany, August 7-11, 2005, pp. 880-887 (2005)
- (2005) ICML 2005. Proceedings Twenty-Second International Conference on Machine Learning , pp. 880-887
- Szepesvári, C.¹ Munos, R.²

13
- 84880694195
- Stable function approximation in dynamic programming
- Tahoe City, US, July 9-12
- Gordon, G.: Stable function approximation in dynamic programming. In: ICML 1995. Proceedings Twelfth International Conference on Machine Learning, Tahoe City, US, July 9-12, 1995, pp. 261-268 (1995)
- (1995) ICML 1995. Proceedings Twelfth International Conference on Machine Learning , pp. 261-268
- Gordon, G.¹

14
- 22944460232
- Wiering, M.: Convergence and divergence in standard and averaging reinforcement learning. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 477-488. Springer, Heidelberg (2004)

15
- 0036832956
- Kernel-based reinforcement learning
- Ormoneit, D., Sen, S.: Kernel-based reinforcement learning. Machine Learning 49(2-3), 161-178 (2002)
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

16
- 21844465127
- Tree-based batch mode reinforcement learning
- Ernst, D., Geurts, P., Wehenkel, L.: Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6, 503-556 (2005)
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

17
- 14344263882
- Interpolation-based Q-learning
- Banff, Canada, July 4-8
- Szepesvári, C., Smart, W.D.: Interpolation-based Q-learning. In: ICML 2004. Proceedings Twenty-First International Conference on Machine Learning, Banff, Canada, July 4-8, 2004 (2004)
- (2004) ICML 2004. Proceedings Twenty-First International Conference on Machine Learning
- Szepesvári, C.¹ Smart, W.D.²

18
- 85153965130
- Reinforcement learning with soft state aggregation
- Denver, US
- Singh, S.P., Jaakkola, T., Jordan, M.I.: Reinforcement learning with soft state aggregation. In: NIPS 1994. Advances in Neural Information Processing Systems 7, Denver, US, pp. 361-368 (1994)
- (1994) NIPS 1994. Advances in Neural Information Processing Systems , vol.7 , pp. 361-368
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

19
- 1442288723
- Near Optimal Closed-loop Control
- PhD thesis, University of Liège, Belgium, March
- Ernst, D.: Near Optimal Closed-loop Control. Application to Electric Power Systems. PhD thesis, University of Liège, Belgium (March 2003)
- (2003) Near Optimal Closed-loop Control. Application to Electric Power Systems
- Ernst, D.¹

20
- 0036832953
- Variable-resolution discretization in optimal control
- Munos, R., Moore, A.: Variable-resolution discretization in optimal control. Machine Learning 49(2-3), 291-323 (2002)
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 291-323
- Munos, R.¹ Moore, A.²

21
- 26944466214
- Sherstov, A., Stone, P.: Function approximation via tile coding: Automating parameter choice. In: Zucker, J.-D., Saitta, L. (eds.) SARA 2005. LNCS (LNAI), vol. 3607, pp. 194-205. Springer, Heidelberg (2005)

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.