-
2
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
-
(1995)
Artificial Intelligence
, vol.72
, pp. 81-138
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
3
-
-
0003602259
-
-
Tech. Rep. COINS-89-95. Amherst: Department of Computer and Information Science, University of Massachusetts
-
Barto, A. G., Sutton, R. S., & Watkins, C. J. C. H. (1989). Learning and sequential decision making (Tech. Rep. COINS-89-95) Amherst: Department of Computer and Information Science, University of Massachusetts.
-
(1989)
Learning and Sequential Decision Making
-
-
Barto, A.G.1
Sutton, R.S.2
Watkins, C.J.C.H.3
-
4
-
-
0000030514
-
From Tom Thumb to the Dockers: Some experiments with foraging robots
-
J.-A. Meyer, H. L. Roitblat, & S. W. Wilson (Eds.). Cambridge, MA: MIT Press
-
Drogoul, A., & Ferber, J. (1993). From Tom Thumb to the Dockers: Some experiments with foraging robots. In J.-A. Meyer, H. L. Roitblat, & S. W. Wilson (Eds.), From animals to animats II: Proceedings of the Second International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press.
-
(1993)
From Animals to Animats II: Proceedings of the Second International Conference on Simulation of Adaptive Behavior
-
-
Drogoul, A.1
Ferber, J.2
-
6
-
-
84977005393
-
Collective robotics: From social insects to robots
-
Kube, C. R., & Zhang, H. (1993). Collective robotics: From social insects to robots. Adaptive Behavior, 2, 189-218.
-
(1993)
Adaptive Behavior
, vol.2
, pp. 189-218
-
-
Kube, C.R.1
Zhang, H.2
-
7
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
Lin, L.-J. (1992). Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8, 293-321.
-
(1992)
Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.-J.1
-
8
-
-
0026880130
-
Automatic programming of behavior-based robots using reinforcement learning
-
Mahadevan, S., & Connell, J. (1992). Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55, 311-365.
-
(1992)
Artificial Intelligence
, vol.55
, pp. 311-365
-
-
Mahadevan, S.1
Connell, J.2
-
11
-
-
0001201710
-
Learning to behave socially
-
D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.). Cambridge, MA: MIT Press
-
Matarić, M. J. (1994b). Learning to behave socially. In D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.), From animals to animats III: Third International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press.
-
(1994)
From Animals to Animats III: Third International Conference on Simulation of Adaptive Behavior
-
-
Matarić, M.J.1
-
12
-
-
0010862056
-
Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot
-
D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.). Cambridge, MA: MIT Press
-
Millán, J. del R. (1994). Learning efficient reactive behavioral sequences from basic reflexes in a goal-directed autonomous robot. In D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.), From animals to animats III: Third International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press.
-
(1994)
From Animals to Animats III: Third International Conference on Simulation of Adaptive Behavior
-
-
Millán, J.D.R.1
-
13
-
-
0000714373
-
A reinforcement connectionist approach to robot path finding in non-maze-like environments
-
Millán, J. del R., & Torras, C. (1992). A reinforcement connectionist approach to robot path finding in non-maze-like environments. Machine Learning, 8, 363-395.
-
(1992)
Machine Learning
, vol.8
, pp. 363-395
-
-
Millán, J.D.R.1
Torras, C.2
-
14
-
-
0001187959
-
Explanation-based neural networks learning for robot control
-
C. L. Giles, S. J. Hanson, & J. D. Cowan (Eds.). San Mateo, CA: Morgan Kaufmann
-
Mitchell, T. M., & Thrun, S.B. (1993). Explanation-based neural networks learning for robot control. In C. L. Giles, S. J. Hanson, & J. D. Cowan (Eds.), Advances in neural information processing systems 5. San Mateo, CA: Morgan Kaufmann.
-
(1993)
Advances in Neural Information Processing Systems
, vol.5
-
-
Mitchell, T.M.1
Thrun, S.B.2
-
16
-
-
85033767014
-
Collective choice of strategic type
-
J.-A. Meyer, H. L. Roitblat, & S. W. Wilson (Eds.). Cambridge, MA: MIT Press
-
Numaoka, C., & Takeuchi, A. (1993). Collective choice of strategic type. In J.-A. Meyer, H. L. Roitblat, & S. W. Wilson (Eds.), From animals to animats II: Proceedings of the Second International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press.
-
(1993)
From Animals to Animats II: Proceedings of the Second International Conference on Simulation of Adaptive Behavior
-
-
Numaoka, C.1
Takeuchi, A.2
-
17
-
-
0011267427
-
Obstacle avoidance through reinforcement learning
-
J. E. Moody, S. J. Hanson, & R. P. Lippmann (Eds.). San Mateo, CA: Morgan Kaufmann
-
Prescott, T. J., & Mayhew, J. E. W. (1992). Obstacle avoidance through reinforcement learning. In J. E. Moody, S. J. Hanson, & R. P. Lippmann (Eds.), Advances in Neural Information Processing Systems 4. San Mateo, CA: Morgan Kaufmann.
-
(1992)
Advances in Neural Information Processing Systems
, vol.4
-
-
Prescott, T.J.1
Mayhew, J.E.W.2
-
18
-
-
0001024813
-
A case study in the behavior-oriented design of autonomous agents
-
D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.). Cambridge MA: MIT Press
-
Steels, L. (1994). A case study in the behavior-oriented design of autonomous agents. In D. Cliff, P. Husbands, J.-A. Meyer, & S. W. Wilson (Eds.), From animals to animats III: Third International Conference on Simulation of Adaptive Behavior. Cambridge MA: MIT Press.
-
(1994)
From Animals to Animats III: Third International Conference on Simulation of Adaptive Behavior
-
-
Steels, L.1
-
19
-
-
0003617454
-
-
Unpublished doctoral thesis, Department of Computer and Information Science, University of Massachusetts, Amherst
-
Sutton, R. S. (1984). Temporal credit assignment in reinforcement learning. Unpublished doctoral thesis, Department of Computer and Information Science, University of Massachusetts, Amherst.
-
(1984)
Temporal Credit Assignment in Reinforcement Learning
-
-
Sutton, R.S.1
-
20
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
San Mateo, CA: Morgan Kaufmann
-
Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proceedings of the Seventh International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann.
-
(1990)
Proceedings of the Seventh International Conference on Machine Learning
-
-
Sutton, R.S.1
-
21
-
-
85152198941
-
Multi-agent reinforcement learning: Independent vs. cooperative agents
-
San Mateo, CA: Morgan Kaufmann
-
Tan, M. (1993). Multi-agent reinforcement learning: Independent vs. cooperative agents. Proceedings of the Tenth International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann.
-
(1993)
Proceedings of the Tenth International Conference on Machine Learning
-
-
Tan, M.1
-
23
-
-
0001875923
-
An adaptive communication protocol for cooperating mobile robots
-
J.-A. Meyer, H. L. Roitblat, & S. W. Wilson (Eds.). Cambridge, MA: MIT Press
-
Yanco, H., & Stein, L. A. (1993). An adaptive communication protocol for cooperating mobile robots. In J.-A. Meyer, H. L. Roitblat, & S. W. Wilson (Eds.), From animals to animats II: Proceedings of the Second International Conference on Simulation of Adaptive Behavior. Cambridge, MA: MIT Press.
-
(1993)
From Animals to Animats II: Proceedings of the Second International Conference on Simulation of Adaptive Behavior
-
-
Yanco, H.1
Stein, L.A.2
|