-
1
-
-
0141988716
-
Recent advances in hierarchical reinforcement learning
-
Apr.
-
A. G. Barto, S. Mahadevan, "Recent advances in hierarchical reinforcement learning," Discrete Event Dynamic Systems: Theory and Applications, vol.13, pp.41-77, Apr. 2003.
-
(2003)
Discrete Event Dynamic Systems: Theory and Applications
, vol.13
, pp. 41-77
-
-
Barto, A.G.1
Mahadevan, S.2
-
2
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Jan.
-
R.S. Sutton, D. Precup, S.P. Singh, "Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning," Artificial Intelligence, vol.112, pp.181-211, Jan. 1999.
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.P.3
-
4
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
T.G. Dietterich, "Hierarchical reinforcement learning with the MAXQ value function decomposition," Journal of Artificial Intelligence Research, vol.13, pp.227-303, 2000.
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
5
-
-
0004782095
-
Learning hierarchical control structures for multiple tasks and changing environments
-
Zurich, Switzerland
-
B.L. Digney, "Learning hierarchical control structures for multiple tasks and changing environments," in Proc. of the 15th International Conference on Simulation of Adaptive Behavior, Zurich, Switzerland, 1998. pp.321-330.
-
(1998)
Proc. of the 15th International Conference on Simulation of Adaptive Behavior
, pp. 321-330
-
-
Digney, B.L.1
-
6
-
-
0013465187
-
Autonomous discovery of subgoals in reinforcement learning using diverse density
-
San Francisco: Morgan Kaufmann
-
A. McGovern, A. Barto, "Autonomous discovery of subgoals in reinforcement learning using diverse density," in Proc. of the 8th International Conference on Machine Learning, San Francisco: Morgan Kaufmann, 2001. pp.361-368.
-
(2001)
Proc. of the 8th International Conference on Machine Learning
, pp. 361-368
-
-
McGovern, A.1
Barto, A.2
-
7
-
-
84945250000
-
Q-cut: Dynamic discovery of sub-goals in reinforcement learning
-
Springer
-
I. Menache, S. Mannor, N. Shimkin, "Q-cut: Dynamic discovery of sub-goals in reinforcement learning," in Volume 2430 of Lecture Notes in Computer Science, Springer, 2002. pp.295-306.
-
(2002)
Lecture Notes in Computer Science
, vol.2430
, pp. 295-306
-
-
Menache, I.1
Mannor, S.2
Shimkin, N.3
-
9
-
-
0015956495
-
Towards a network theory of the immune system
-
Jan.
-
N. K. Jerne, "Towards a network theory of the immune system," Annales d'Immunologie, vol. 125C, pp.373-389, Jan. 1974.
-
(1974)
Annales d'Immunologie
, vol.125 C
, pp. 373-389
-
-
Jerne, N.K.1
-
10
-
-
84950235798
-
An evolutionary immune network for data clustering
-
Rio de Janeiro
-
L.N. de Castro, F. N. Von Zuben, "An evolutionary immune network for data clustering," in Proc. of the IEEE Brazilian Symposium on Artificial Neural Networks, vol.1, Rio de Janeiro, 2000. pp.84-89.
-
(2000)
Proc. of the IEEE Brazilian Symposium on Artificial Neural Networks
, vol.1
, pp. 84-89
-
-
De Castro, L.N.1
Von Zuben, F.N.2
-
11
-
-
34249833101
-
Q-learning
-
Mar.
-
C. Watkins, P. Dayan, "Q-learning," Machine Learning, vol. 8, pp.279-292, Mar. 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
12
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
Apr.
-
L.-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning, vol. 8, pp.293-321, Apr. 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.-J.1