-
2
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Liftman, and A. W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Liftman, M.L.2
Moore, A.W.3
-
4
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
J. N. Tsitsiklis and B. Van Roy, "Feature-based methods for large scale dynamic programming," Machine Learning, vol. 22, no. 1-3, pp. 59-94, 1996.
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
-
5
-
-
84880694195
-
Stable function approximation in dynamic programming
-
Tahoe City, US, 9-12 July
-
G. Gordon, "Stable function approximation in dynamic programming," in Proceedings Twelfth International Conference on Machine Learning (ICML-95), Tahoe City, US, 9-12 July 1995, pp. 261-268.
-
(1995)
Proceedings Twelfth International Conference on Machine Learning (ICML-95)
, pp. 261-268
-
-
Gordon, G.1
-
6
-
-
0242580448
-
Variable-resolution discretization in optimal control
-
R. Munos and A. Moore, "Variable-resolution discretization in optimal control," Machine Learning, vol. 1, pp. 1-31, 2001.
-
(2001)
Machine Learning
, vol.1
, pp. 1-31
-
-
Munos, R.1
Moore, A.2
-
7
-
-
0036832956
-
Kernel-based reinforcement learning
-
D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, pp. 161-178, 2002.
-
(2002)
Machine Learning
, vol.49
, pp. 161-178
-
-
Ormoneit, D.1
Sen, S.2
-
8
-
-
21844465127
-
Tree-based batch mode reinforcement learning
-
D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, vol. 6, pp. 503-556, 2005.
-
(2005)
Journal of Machine Learning Research
, vol.6
, pp. 503-556
-
-
Ernst, D.1
Geurts, P.2
Wehenkel, L.3
-
11
-
-
84898958374
-
Gradient descent for general reinforcement learning
-
Denver, US, 30 November, 5 December
-
L. Baird and A. Moore, "Gradient descent for general reinforcement learning," in Advances in Neural Information Processing Systems 11 (NLPS-98), Denver, US, 30 November - 5 December 1998, pp. 968-974.
-
(1998)
Advances in Neural Information Processing Systems 11 (NLPS-98)
, pp. 968-974
-
-
Baird, L.1
Moore, A.2
-
12
-
-
85153965130
-
Reinforcement learning with soft state aggregation
-
Denver, Colorado, USA
-
S. P. Singh, T. Jaakkola, and M. I. Jordan, "Reinforcement learning with soft state aggregation," in Advances in Neural Information Processing Systems 7, Denver, Colorado, USA, 1994, pp. 361-368.
-
(1994)
Advances in Neural Information Processing Systems 7
, pp. 361-368
-
-
Singh, S.P.1
Jaakkola, T.2
Jordan, M.I.3
-
13
-
-
1442288723
-
Near optimal closed-loop control. Application to electric power systems,
-
Ph.D. dissertation, University of Liège, Belgium, March
-
D. Ernst, "Near optimal closed-loop control. Application to electric power systems," Ph.D. dissertation, University of Liège, Belgium, March 2003.
-
(2003)
-
-
Ernst, D.1
-
14
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
Denver, Colorado, US
-
J. Boyan and A. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems 7 (NIPS-94), Denver, Colorado, US, 1994, pp. 369-376.
-
(1994)
Advances in Neural Information Processing Systems 7 (NIPS-94)
, pp. 369-376
-
-
Boyan, J.1
Moore, A.2
-
16
-
-
33845529505
-
Reinforcement learning: An overview
-
Aachen, Germany, 14-15 September
-
P. Y. Glorennec, "Reinforcement learning: An overview," in Proceedings European Symposium on Intelligent Techniques (ESIT-00), Aachen, Germany, 14-15 September 2000, pp. 17-35.
-
(2000)
Proceedings European Symposium on Intelligent Techniques (ESIT-00)
, pp. 17-35
-
-
Glorennec, P.Y.1
-
17
-
-
0030377615
-
Fuzzy interpolation-based Q-Ieaming with continuous states and actions
-
New Orleans, US, 8-11 September
-
T. Horiuchi, A. Fujino, O. Katai, and T. Sawaragi, "Fuzzy interpolation-based Q-Ieaming with continuous states and actions," in Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96), New Orleans, US, 8-11 September 1996, pp. 594-600.
-
(1996)
Proceedings 5th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE-96)
, pp. 594-600
-
-
Horiuchi, T.1
Fujino, A.2
Katai, O.3
Sawaragi, T.4
-
18
-
-
0032140718
-
Fuzzy inference system learning by reinforcement methods
-
L. Jouffe, "Fuzzy inference system learning by reinforcement methods," IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews, vol. 28, no. 3, pp. 338-355, 1998.
-
(1998)
IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews
, vol.28
, Issue.3
, pp. 338-355
-
-
Jouffe, L.1
-
19
-
-
0026923465
-
Learning and tuning fuzzy logic controllers through reinforcements
-
H. R. Berenji and P. Khedkar, "Learning and tuning fuzzy logic controllers through reinforcements," IEEE Transactions on Neural Networks, vol. 3, no. 5, pp. 724-740, 1992.
-
(1992)
IEEE Transactions on Neural Networks
, vol.3
, Issue.5
, pp. 724-740
-
-
Berenji, H.R.1
Khedkar, P.2
-
20
-
-
0041877717
-
A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
-
H. R. Berenji and D. Vengerov, "A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters," IEEE Transactions on Fuzzy Systems, vol. 11, no. 4, pp. 478-485, 2003.
-
(2003)
IEEE Transactions on Fuzzy Systems
, vol.11
, Issue.4
, pp. 478-485
-
-
Berenji, H.R.1
Vengerov, D.2
-
21
-
-
24644466803
-
A fuzzy reinforcement learning approach to power control in wireless transmitters
-
D. Vengerov, N. Bambos, and H. R. Berenji, "A fuzzy reinforcement learning approach to power control in wireless transmitters," IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics, vol. 35, no. 4, pp. 768-778, 2005.
-
(2005)
IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics
, vol.35
, Issue.4
, pp. 768-778
-
-
Vengerov, D.1
Bambos, N.2
Berenji, H.R.3
-
22
-
-
0001537801
-
Evolutionary learning, reinforcement learning, and fuzzy mies for knowledge acquisition in agent-based systems
-
A. Bonarini, "Evolutionary learning, reinforcement learning, and fuzzy mies for knowledge acquisition in agent-based systems," Proceedings of the IEEE, vol. 89, no. 9. pp. 1334-1346, 2001.
-
(2001)
Proceedings of the IEEE
, vol.89
, Issue.9
, pp. 1334-1346
-
-
Bonarini, A.1
-
23
-
-
0041876271
-
A reinforcement learning adaptive fuzzy controller for robots
-
C.-K. Lin, "A reinforcement learning adaptive fuzzy controller for robots," Fuzzy Sets and Systems, vol. 137, pp. 339-352, 2003.
-
(2003)
Fuzzy Sets and Systems
, vol.137
, pp. 339-352
-
-
Lin, C.-K.1
-
24
-
-
50249132307
-
Closed-loop learning of visual control policies,
-
Ph.D. dissertation, University of Liège, Belgium, December
-
S. Jodogne, "Closed-loop learning of visual control policies," Ph.D. dissertation, University of Liège, Belgium, December 2006.
-
(2006)
-
-
Jodogne, S.1
|