-
1
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
Barto, A., Bradtke, S., & Singh, S. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
-
(1995)
Artificial Intelligence
, vol.72
, pp. 81-138
-
-
Barto, A.1
Bradtke, S.2
Singh, S.3
-
2
-
-
33749244036
-
Reusing old policies to accelerate learning on new MDPs
-
UM-CS-1999-026, Department of Computer Science, University of Massachusetts at Amherst
-
Bernstein, D. (1999). Reusing old policies to accelerate learning on new MDPs (Technical Report UM-CS-1999-026). Department of Computer Science, University of Massachusetts at Amherst.
-
(1999)
Technical Report
-
-
Bernstein, D.1
-
4
-
-
0002479021
-
Exploring unknown environments with real-time search or reinforcement learning
-
Koenig, S. (1999). Exploring unknown environments with real-time search or reinforcement learning. Advances in Neural Information Processing Systems (NIPS) 12 (pp. 1003-1009).
-
(1999)
Advances in Neural Information Processing Systems (NIPS)
, vol.12
, pp. 1003-1009
-
-
Koenig, S.1
-
5
-
-
0029751419
-
The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
-
Koenig, S., & Simmons, R. (1996). The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms. Machine Learning, 22, 227-250.
-
(1996)
Machine Learning
, vol.22
, pp. 227-250
-
-
Koenig, S.1
Simmons, R.2
-
7
-
-
0025400088
-
Real-time heuristic search
-
Korf, R. (1990). Real-time heuristic search. Artificial Intelligence, 42, 189-211.
-
(1990)
Artificial Intelligence
, vol.42
, pp. 189-211
-
-
Korf, R.1
-
9
-
-
0030647149
-
Reinforcement learning in the multi-robot domain
-
Matarić, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4, 73-83.
-
(1997)
Autonomous Robots
, vol.4
, pp. 73-83
-
-
Matarić, M.1
-
10
-
-
0027684215
-
Prioritized sweeping: Reinforcement learning with less data and less time
-
Moore, A., & Atkeson, C. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103-130.
-
(1993)
Machine Learning
, vol.13
, pp. 103-130
-
-
Moore, A.1
Atkeson, C.2
-
17
-
-
84974678409
-
Layered learning
-
Barcelona, Spain: Springer, Berlin
-
Stone, P., & Veloso, M. (2000). Layered learning. Proceedings of the 11th European Conference on Machine Learning (pp. 369-381). Barcelona, Spain: Springer, Berlin.
-
(2000)
Proceedings of the 11th European Conference on Machine Learning
, pp. 369-381
-
-
Stone, P.1
Veloso, M.2
-
20
-
-
0003411271
-
Efficient exploration in reinforcement learning
-
CS-92-102, Carnegie Mellon University
-
Thrun, S. (1992). Efficient exploration in reinforcement learning (Technical Report CS-92-102). Carnegie Mellon University.
-
(1992)
Technical Report
-
-
Thrun, S.1
-
23
-
-
0035951444
-
Autonomous mental development by robots and animals
-
Weng, J., McClelland, J., Pentland, A., Sporns, O., Stockman, I., Sur, M., & Thelen, E. (2001). Autonomous mental development by robots and animals. Science, 291, 599-600.
-
(2001)
Science
, vol.291
, pp. 599-600
-
-
Weng, J.1
McClelland, J.2
Pentland, A.3
Sporns, O.4
Stockman, I.5
Sur, M.6
Thelen, E.7
-
24
-
-
27344453198
-
Potential-based shaping and Q-value initialization are equivalent
-
Wiewiora, E. (2003). Potential-based shaping and Q-value initialization are equivalent. Journal of Artificial Intelligence Research, 19, 205-208.
-
(2003)
Journal of Artificial Intelligence Research
, vol.19
, pp. 205-208
-
-
Wiewiora, E.1