-
4
-
-
33845529505
-
Reinforcement learning: An overview
-
Aachen, Germany, September 14-15
-
Glorennec, P.Y.: Reinforcement learning: An overview. In: ESIT 2000. Proceedings European Symposium on Intelligent Techniques, Aachen, Germany, September 14-15, 2000, pp. 17-35 (2000)
-
(2000)
ESIT 2000. Proceedings European Symposium on Intelligent Techniques
, pp. 17-35
-
-
Glorennec, P.Y.1
-
5
-
-
0030377615
-
Fuzzy interpolation-based Q-learning with continuous states and actions
-
New Orleans, US, September 8-11
-
Horiuchi, T., Fujino, A., Katai, O., Sawaragi, T.: Fuzzy interpolation-based Q-learning with continuous states and actions. In: FUZZ-IEEE 1996. Proceedings 5th IEEE International Conference on Fuzzy Systems, New Orleans, US, September 8-11, 1996, pp. 594-600 (1996)
-
(1996)
FUZZ-IEEE 1996. Proceedings 5th IEEE International Conference on Fuzzy Systems
, pp. 594-600
-
-
Horiuchi, T.1
Fujino, A.2
Katai, O.3
Sawaragi, T.4
-
6
-
-
0032140718
-
Fuzzy inference system learning by reinforcement methods
-
Jouffe, L.: Fuzzy inference system learning by reinforcement methods. IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews 28(3), 338-355 (1998)
-
(1998)
IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews
, vol.28
, Issue.3
, pp. 338-355
-
-
Jouffe, L.1
-
7
-
-
0026923465
-
Learning and tuning fuzzy logic controllers through reinforcements
-
Berenji, H.R., Khedkar, P.: Learning and tuning fuzzy logic controllers through reinforcements. IEEE Transactions on Neural Networks 3(5), 724-740 (1992)
-
(1992)
IEEE Transactions on Neural Networks
, vol.3
, Issue.5
, pp. 724-740
-
-
Berenji, H.R.1
Khedkar, P.2
-
8
-
-
0041877717
-
A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
-
Berenji, H.R., Vengerov, D.: A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters. IEEE Transactions on Fuzzy Systems 11(4), 478-485 (2003)
-
(2003)
IEEE Transactions on Fuzzy Systems
, vol.11
, Issue.4
, pp. 478-485
-
-
Berenji, H.R.1
Vengerov, D.2
-
9
-
-
24644466803
-
A fuzzy reinforcement learning approach to power control in wireless transmitters
-
Vengerov, D., Bambos, N., Berenji, H.R.: A fuzzy reinforcement learning approach to power control in wireless transmitters. IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics 35(4), 768-778 (2005)
-
(2005)
IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics
, vol.35
, Issue.4
, pp. 768-778
-
-
Vengerov, D.1
Bambos, N.2
Berenji, H.R.3
-
10
-
-
0041876271
-
A reinforcement learning adaptive fuzzy controller for robots
-
Lin, C.K.: A reinforcement learning adaptive fuzzy controller for robots. Fuzzy Sets and Systems 137, 339-352 (2003)
-
(2003)
Fuzzy Sets and Systems
, vol.137
, pp. 339-352
-
-
Lin, C.K.1
-
11
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
Tsitsiklis, J.N., Van Roy, B.: Feature-based methods for large scale dynamic programming. Machine Learning 22(1-3), 59-94 (1996)
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
12
-
-
31844456754
-
Finite time bounds for sampling based fitted value iteration
-
Bonn, Germany, August 7-11
-
Szepesvári, C., Munos, R.: Finite time bounds for sampling based fitted value iteration. In: ICML 2005. Proceedings Twenty-Second International Conference on Machine Learning, Bonn, Germany, August 7-11, 2005, pp. 880-887 (2005)
-
(2005)
ICML 2005. Proceedings Twenty-Second International Conference on Machine Learning
, pp. 880-887
-
-
Szepesvári, C.1
Munos, R.2
-
13
-
-
84880694195
-
Stable function approximation in dynamic programming
-
Tahoe City, US, July 9-12
-
Gordon, G.: Stable function approximation in dynamic programming. In: ICML 1995. Proceedings Twelfth International Conference on Machine Learning, Tahoe City, US, July 9-12, 1995, pp. 261-268 (1995)
-
(1995)
ICML 1995. Proceedings Twelfth International Conference on Machine Learning
, pp. 261-268
-
-
Gordon, G.1
-
14
-
-
22944460232
-
-
Wiering, M.: Convergence and divergence in standard and averaging reinforcement learning. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 477-488. Springer, Heidelberg (2004)
-
-
-
-
15
-
-
0036832956
-
Kernel-based reinforcement learning
-
Ormoneit, D., Sen, S.: Kernel-based reinforcement learning. Machine Learning 49(2-3), 161-178 (2002)
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
16
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
Ernst, D., Geurts, P., Wehenkel, L.: Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6, 503-556 (2005)
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
17
-
-
14344263882
-
Interpolation-based Q-learning
-
Banff, Canada, July 4-8
-
Szepesvári, C., Smart, W.D.: Interpolation-based Q-learning. In: ICML 2004. Proceedings Twenty-First International Conference on Machine Learning, Banff, Canada, July 4-8, 2004 (2004)
-
(2004)
ICML 2004. Proceedings Twenty-First International Conference on Machine Learning
-
-
Szepesvári, C.1
Smart, W.D.2
-
18
-
-
85153965130
-
Reinforcement learning with soft state aggregation
-
Denver, US
-
Singh, S.P., Jaakkola, T., Jordan, M.I.: Reinforcement learning with soft state aggregation. In: NIPS 1994. Advances in Neural Information Processing Systems 7, Denver, US, pp. 361-368 (1994)
-
(1994)
NIPS 1994. Advances in Neural Information Processing Systems
, vol.7
, pp. 361-368
-
-
Singh, S.P.1
Jaakkola, T.2
Jordan, M.I.3
-
19
-
-
1442288723
-
Near Optimal Closed-loop Control
-
PhD thesis, University of Liège, Belgium, March
-
Ernst, D.: Near Optimal Closed-loop Control. Application to Electric Power Systems. PhD thesis, University of Liège, Belgium (March 2003)
-
(2003)
Application to Electric Power Systems
-
-
Ernst, D.1
-
20
-
-
0036832953
-
Variable-resolution discretization in optimal control
-
Munos, R., Moore, A.: Variable-resolution discretization in optimal control. Machine Learning 49(2-3), 291-323 (2002)
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 291-323
-
-
Munos, R.1
Moore, A.2
-
21
-
-
26944466214
-
-
Sherstov, A., Stone, P.: Function approximation via tile coding: Automating parameter choice. In: Zucker, J.-D., Saitta, L. (eds.) SARA 2005. LNCS (LNAI), vol. 3607, pp. 194-205. Springer, Heidelberg (2005)
-
-
-