-
2
-
-
0003275980
-
Adaptive computation and machine learning
-
MIT Press, Cambridge, MA
-
R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, Adaptive Computation and Machine Learning, MIT Press, Cambridge, MA, 1998.
-
(1998)
Reinforcement Learning: An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
3
-
-
0029679044
-
Reinforcement learning: A survey
-
L.P. Kaelbling, M.L. Littman, and A.W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research 4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
4
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
G. Tesauro, D.S. Touretzky, and T. Leen, eds., MIT Press
-
J.A. Boyan and A.W. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems, G. Tesauro, D.S. Touretzky, and T. Leen, eds., 7, pp. 369-376, MIT Press, 1995.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 369-376
-
-
Boyan, J.A.1
Moore, A.W.2
-
5
-
-
0003989207
-
-
PhD thesis, School of Computer Science, Carnegie Mellon University, June. Also available as technical report CMU-CS-99-143
-
G.J. Gordon, Approximate Solutions to Markov Decision Processes. PhD thesis, School of Computer Science, Carnegie Mellon University, June 1999. Also available as technical report CMU-CS-99-143.
-
(1999)
Approximate Solutions to Markov Decision Processes
-
-
Gordon, G.J.1
-
6
-
-
0031074521
-
Locally weighted learning
-
C.G. Atkeson, A.W. Moore, and S. Schaal, "Locally weighted learning," Artificial Intelligence Review 11, pp. 11-73, 1997.
-
(1997)
Artificial Intelligence Review
, vol.11
, pp. 11-73
-
-
Atkeson, C.G.1
Moore, A.W.2
Schaal, S.3
-
7
-
-
84918743166
-
Influential observations in linear regression
-
March
-
R.D. Cook, "Influential observations in linear regression," Journal of the American Statistical Association 74, pp. 169-174, March 1979.
-
(1979)
Journal of the American Statistical Association
, vol.74
, pp. 169-174
-
-
Cook, R.D.1
-
12
-
-
0001259406
-
Robot learning by nonparametric regression
-
V. Graefe, ed
-
S. Schaal and C.G. Atkeson, "Robot learning by nonparametric regression," in Proceedings of Intelligent Robots and Systems 1994 (IROS '94), V. Graefe, ed., pp. 137-154, 1995.
-
(1995)
Proceedings of Intelligent Robots and Systems 1994 (IROS '94)
, pp. 137-154
-
-
Schaal, S.1
Atkeson, C.G.2
-
13
-
-
0028740409
-
Learning by watching: Extracting reusable task knowledge from visual observation of human performance
-
December
-
Y. Kuniyoshi, M. Inaba, and H. Inoue, "Learning by watching: Extracting reusable task knowledge from visual observation of human performance," IEEE Transactions on Robotics and Automation 10, pp. 799-822, December 1994.
-
(1994)
IEEE Transactions on Robotics and Automation
, vol.10
, pp. 799-822
-
-
Kuniyoshi, Y.1
Inaba, M.2
Inoue, H.3
-
14
-
-
0031287713
-
Transfer of elementary skills via human-robot interaction
-
M. Kaiser, "Transfer of elementary skills via human-robot interaction," Adaptive Behavior 5(3/4), pp. 249-280, 1997.
-
(1997)
Adaptive Behavior
, vol.5
, Issue.3-4
, pp. 249-280
-
-
Kaiser, M.1
-
15
-
-
0001847657
-
Imitative learning mechanisms in robots and humans
-
V. Klingspor, ed., (Bari, Italy), July
-
J. Demiris and G. Hayes, "Imitative learning mechanisms in robots and humans," in Proceedings of the 5th European Workshop on Learning Robots, V. Klingspor, ed., (Bari, Italy), July 1996.
-
(1996)
Proceedings of the 5th European Workshop on Learning Robots
-
-
Demiris, J.1
Hayes, G.2
-
17
-
-
84976813028
-
Learning to coordinate behaviors
-
AAAI Press, Menlo Park, CA
-
P. Maes and R.A. Brooks, "Learning to coordinate behaviors," in Proceedings of the Eighth National Conference on Artificial Intelligence (AAAI '90), pp. 796-802, AAAI Press, (Menlo Park, CA), 1990.
-
(1990)
Proceedings of the Eighth National Conference on Artificial Intelligence (AAAI '90)
, pp. 796-802
-
-
Maes, P.1
Brooks, R.A.2
-
18
-
-
0026880130
-
Automatic programming of behavior-based robots using reinforcement learning
-
June
-
S. Mahadevan and J. Connell, "Automatic programming of behavior-based robots using reinforcement learning," Artificial Intelligence 55, pp. 311-365, June 1992.
-
(1992)
Artificial Intelligence
, vol.55
, pp. 311-365
-
-
Mahadevan, S.1
Connell, J.2
-
19
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
L.-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning 8, pp. 293-321, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.-J.1
-
20
-
-
0030149709
-
Purposive behavior acquisition for a real robot by vision-based reinforcement learning
-
M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda, "Purposive behavior acquisition for a real robot by vision-based reinforcement learning," Machine Learning 23, pp. 279-303, 1996.
-
(1996)
Machine Learning
, vol.23
, pp. 279-303
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
21
-
-
0029753630
-
Reinforcement learning with replacing eligibility traces
-
S.P. Singh and R.S. Sutton, "Reinforcement learning with replacing eligibility traces," Machine Learning 22, pp. 123-158, 1996.
-
(1996)
Machine Learning
, vol.22
, pp. 123-158
-
-
Singh, S.P.1
Sutton, R.S.2
-
22
-
-
0028739953
-
Robot shaping: Developing autonomous agents through learning
-
M. Dorigo and M. Colombetti, "Robot shaping: Developing autonomous agents through learning," Artificial Intelligence 71(2), pp. 321-370, 1994.
-
(1994)
Artificial Intelligence
, vol.71
, Issue.2
, pp. 321-370
-
-
Dorigo, M.1
Colombetti, M.2