-
3
-
-
50249166041
-
Fuzzy approximation for convergent model-based reinforcement learning
-
London, UK, 23-26 July
-
L. Buşoniu, D. Ernst, B. De Schutter, and R. Babuska, "Fuzzy approximation for convergent model-based reinforcement learning," in Proceedings 2007 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-07), London, UK, 23-26 July 2007, pp. 968-973.
-
(2007)
Proceedings 2007 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-07)
, pp. 968-973
-
-
Buşoniu, L.1
Ernst, D.2
De Schutter, B.3
Babuska, R.4
-
4
-
-
49949101369
-
-
Continuous-state reinforcement learning with fuzzy approximation
-
_, "Continuous-state reinforcement learning with fuzzy approximation," in Adaptive Agents and Multi-Agent Systems III, ser. Lecture Notes in Computer Science, K. Tuyls, A. Nowé, Z. Guessoum, and D. Kudenko, Eds. Springer, 2008, vol. 4865, pp. 27-43.
-
-
-
-
6
-
-
0030377615
-
Fuzzy interpolation-based Q-learning with continuous states and actions
-
New Orleans, US, 8-11 September
-
T. Horiuchi, A. Fujino, O. Katai, and T. Sawaragi, "Fuzzy interpolation-based Q-learning with continuous states and actions," in Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96), New Orleans, US, 8-11 September 1996, pp. 594-600.
-
(1996)
Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96)
, pp. 594-600
-
-
Horiuchi, T.1
Fujino, A.2
Katai, O.3
Sawaragi, T.4
-
7
-
-
0032140718
-
Fuzzy inference system learning by reinforcement methods
-
L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews, vol. 28, no. 3, pp. 338-355, 1998.
-
(1998)
IEEE Transactions on Systems, Man, and Cybernetics - Part C: Applications and Reviews
, vol.28
, Issue.3
, pp. 338-355
-
-
Jouffe, L.1
-
8
-
-
0026923465
-
Learning and tuning fuzzy logic controllers through reinforcements
-
H. R. Berenji and P. Khedkar, "Learning and tuning fuzzy logic controllers through reinforcements," IEEE Transactions on Neural Networks, vol. 3, no. 5, pp. 724-740, 1992.
-
(1992)
IEEE Transactions on Neural Networks
, vol.3
, Issue.5
, pp. 724-740
-
-
Berenji, H.R.1
Khedkar, P.2
-
9
-
-
0041877717
-
A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
-
H. R. Berenji and D. Vengerov, "A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters," IEEE Transactions on Fuzzy Systems, vol. 11, no. 4, pp. 478-485, 2003.
-
(2003)
IEEE Transactions on Fuzzy Systems
, vol.11
, Issue.4
, pp. 478-485
-
-
Berenji, H.R.1
Vengerov, D.2
-
10
-
-
24644466803
-
A fuzzy reinforcement learning approach to power control in wireless transmitters
-
D. Vengerov, N. Bambos, and H. R. Berenji, "A fuzzy reinforcement learning approach to power control in wireless transmitters," IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, vol. 35, no. 4, pp. 768-778, 2005.
-
(2005)
IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics
, vol.35
, Issue.4
, pp. 768-778
-
-
Vengerov, D.1
Bambos, N.2
Berenji, H.R.3
-
11
-
-
0041876271
-
A reinforcement learning adaptive fuzzy controller for robots
-
C.-K. Lin, "A reinforcement learning adaptive fuzzy controller for robots," Fuzzy Sets and Systems, vol. 137, no. 3, pp. 339-352, 2003.
-
(2003)
Fuzzy Sets and Systems
, vol.137
, Issue.3
, pp. 339-352
-
-
Lin, C.-K.1
-
12
-
-
0026206780
-
An optimal one-way multigrid algorithm for discrete-time stochastic control
-
C.-S. Chow and J. Tsitsiklis, "An optimal one-way multigrid algorithm for discrete-time stochastic control," IEEE Transactions on Automatic Control, vol. 36, no. 8, pp. 898-914, 1991.
-
(1991)
IEEE Transactions on Automatic Control
, vol.36
, Issue.8
, pp. 898-914
-
-
Chow, C.-S.1
Tsitsiklis, J.2
-
13
-
-
84880694195
-
Stable function approximation in dynamic programming
-
Tahoe City, US, 9-12 July
-
G. Gordon, "Stable function approximation in dynamic programming," in Proceedings Twelfth International Conference on Machine Learning (ICML-95), Tahoe City, US, 9-12 July 1995, pp. 261-268.
-
(1995)
Proceedings Twelfth International Conference on Machine Learning (ICML-95)
, pp. 261-268
-
-
Gordon, G.1
-
14
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
J. N. Tsitsiklis and B. Van Roy, "Feature-based methods for large scale dynamic programming," Machine Learning, vol. 22, no. 1-3, pp. 59-94, 1996.
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
15
-
-
0008872081
-
Analysis of a numerical dynamic programming algorithm applied to economic models
-
M. S. Santos and J. Vigo-Aguiar, "Analysis of a numerical dynamic programming algorithm applied to economic models," Econometrica, vol. 66, no. 2, pp. 409-426, 1998.
-
(1998)
Econometrica
, vol.66
, Issue.2
, pp. 409-426
-
-
Santos, M.S.1
Vigo-Aguiar, J.2
-
16
-
-
31844456754
-
Finite time bounds for sampling based fitted value iteration
-
Bonn, Germany, 7-11 August
-
C. Szepesvári and R. Munos, "Finite time bounds for sampling based fitted value iteration," in Proceedings Twenty-Second International Conference on Machine Learning (ICML-05), Bonn, Germany, 7-11 August 2005, pp. 880-887.
-
(2005)
Proceedings Twenty-Second International Conference on Machine Learning (ICML-05)
, pp. 880-887
-
-
Szepesvári, C.1
Munos, R.2
-
17
-
-
22944460232
-
Convergence and divergence in standard and averaging reinforcement learning
-
Pisa, Italy, 20-24 September
-
M. Wiering, "Convergence and divergence in standard and averaging reinforcement learning," in Proceedings 15th European Conference on Machine Learning (ECML'04), Pisa, Italy, 20-24 September 2004, pp. 477-488.
-
(2004)
Proceedings 15th European Conference on Machine Learning (ECML'04)
, pp. 477-488
-
-
Wiering, M.1
-
18
-
-
0036832956
-
Kernel-based reinforcement learning
-
D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
19
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, vol. 6, pp. 503-556, 2005.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
20
-
-
14344263882
-
Interpolation-based Q-learning
-
Banff, Canada, 4-8 July
-
C. Szepesvári and W. D. Smart, "Interpolation-based Q-learning," in Proceedings Twenty-First International Conference on Machine Learning (ICML-04), Banff, Canada, 4-8 July 2004, pp. 791-798.
-
(2004)
Proceedings Twenty-First International Conference on Machine Learning (ICML-04)
, pp. 791-798
-
-
Szepesvári, C.1
Smart, W.D.2
-
21
-
-
85153965130
-
Reinforcement learning with soft state aggregation
-
G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds
-
S. P. Singh, T. Jaakkola, and M. I. Jordan, "Reinforcement learning with soft state aggregation," in Advances in Neural Information Processing Systems 7, G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds., 1995, pp. 361-368.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 361-368
-
-
Singh, S.P.1
Jaakkola, T.2
Jordan, M.I.3
-
22
-
-
1442288723
-
Near optimal closed-loop control. Application to electric power systems
-
Ph.D. dissertation, University of Liège, Belgium, March
-
D. Ernst, "Near optimal closed-loop control. Application to electric power systems," Ph.D. dissertation, University of Liège, Belgium, March 2003.
-
(2003)
-
-
Ernst, D.1
-
23
-
-
0141596576
-
Policy invariance under reward transformations: Theory and application to reward shaping
-
Bled, Slovenia, 27-30 June
-
A. Y. Ng, D. Harada, and S. Russell, "Policy invariance under reward transformations: Theory and application to reward shaping," in Proceedings Sixteenth International Conference on Machine Learning (ICML'99), Bled, Slovenia, 27-30 June 1999, pp. 278-287.
-
(1999)
Proceedings Sixteenth International Conference on Machine Learning (ICML'99)
, pp. 278-287
-
-
Ng, A.Y.1
Harada, D.2
Russell, S.3