-
2
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
San Francisco, CA. Morgan Kaufmann
-
Crites, R. H., & Barto, A. G. (1995). Improving elevator performance using reinforcement learning. In Advances in Neural Information Processing Systems, Vol. 8, pp. 1017-1023 San Francisco, CA. Morgan Kaufmann.
-
(1995)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1017-1023
-
-
Crites, R.H.1
Barto, A.G.2
-
3
-
-
0001234682
-
Feudal reinforcement learning
-
Morgan Kaufmann, San Francisco, CA
-
Dayan, P., & Hinton, G. (1993). Feudal reinforcement learning. In Advances in Neural Information Processing Systems, 5, pp. 271-278. Morgan Kaufmann, San Francisco, CA.
-
(1993)
Advances in Neural Information Processing Systems
, vol.5
, pp. 271-278
-
-
Dayan, P.1
Hinton, G.2
-
4
-
-
0006424007
-
-
Tech. rep. CS-95-10 Department of Computer Science, Brown University, Providence, Rhode Island
-
Dean, T., & Lin, S.-H. (1995). Decomposition techniques for planning in stochastic domains. Tech. rep. CS-95-10, Department of Computer Science, Brown University, Providence, Rhode Island.
-
(1995)
Decomposition Techniques for Planning in Stochastic Domains
-
-
Dean, T.1
Lin, S.-H.2
-
5
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research. To appear.
-
(2000)
Journal of Artificial Intelligence Research
-
-
Dietterich, T.G.1
-
7
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less time
-
Moore, A. W., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103.
-
(1993)
Machine Learning
, vol.13
, pp. 103
-
-
Moore, A.W.1
Atkeson, C.G.2
-
9
-
-
84898956770
-
Reinforcement learning with hierarchies of machines
-
Cambridge, MA. MIT Press
-
Parr, R., & Russell, S. (1998). Reinforcement learning with hierarchies of machines. In Advances in Neural Information Processing Systems, Vol. 10, pp. 1043-1049 Cambridge, MA. MIT Press.
-
(1998)
Advances in Neural Information Processing Systems
, vol.10
, pp. 1043-1049
-
-
Parr, R.1
Russell, S.2
-
10
-
-
0346087506
-
-
Tech. rep., University of Colorado, Department of Computer Science, Boulder, CO. To appear in Machine Learning
-
Singh, S., Jaakkola, T., Littman, M. L., & Szepesvári, C. (1998). Convergence results for single-step on-policy reinforcement-learning algorithms. Tech. rep., University of Colorado, Department of Computer Science, Boulder, CO. To appear in Machine Learning.
-
(1998)
Convergence Results for Single-step On-policy Reinforcement-learning Algorithms
-
-
Singh, S.1
Jaakkola, T.2
Littman, M.L.3
Szepesvári, C.4
-
11
-
-
0001027894
-
Transfer of learning by composing solutions of elemental sequential tasks
-
Singh, S. P. (1992). Transfer of learning by composing solutions of elemental sequential tasks. Machine Learning, 8, 323.
-
(1992)
Machine Learning
, vol.8
, pp. 323
-
-
Singh, S.P.1
-
13
-
-
0003899594
-
-
Tech. rep., University of Massachusetts, Department of Computer and Information Sciences, Amherst, MA. To appear in Artificial Intelligence
-
Sutton, R. S., Precup, D., & Singh, S. (1998). Between MDPs and Semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales. Tech. rep., University of Massachusetts, Department of Computer and Information Sciences, Amherst, MA. To appear in Artificial Intelligence.
-
(1998)
Between MDPs and Semi-MDPs: Learning, Planning, and Representing Knowledge at Multiple Temporal Scales
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
-
14
-
-
0029276036
-
Temporal difference learning and TD-Gammon
-
Tesauro, G. (1995). Temporal difference learning and TD-Gammon. Communications of the ACM, 28 (3), 58-68.
-
(1995)
Communications of the ACM
, vol.28
, Issue.3
, pp. 58-68
-
-
Tesauro, G.1
-
15
-
-
84918834208
-
A reinforcement learning approach to job-shop scheduling
-
Morgan Kaufmann, San Francisco, CA
-
Zhang, W., & Dietterich, T. G. (1995). A reinforcement learning approach to job-shop scheduling. In 1995 International Joint Conference on Artificial Intelligence, pp. 1114-1120. Morgan Kaufmann, San Francisco, CA.
-
(1995)
1995 International Joint Conference on Artificial Intelligence
, pp. 1114-1120
-
-
Zhang, W.1
Dietterich, T.G.2
|