-
4
-
-
0002201501
-
Learning and sequential decision making
-
In M. Gabriel and J.W. Moore (Eds.), Cambridge, MA: MIT Press.
-
Barto, A.G., Sutton, R.S. and Watkins, C.J.C.H. (1990). Learning and sequential decision making. In M. Gabriel and J.W. Moore (Eds.), Learning and computational neuroscience: foundations of adaptive networks (pp. 539-602). Cambridge, MA: MIT Press.
-
(1990)
Learning and Computational Neuroscience: Foundations of Adaptive Networks
, pp. 539-602
-
-
Barto, A.G.1
Sutton, R.S.2
Watkins, C.J.C.H.3
-
5
-
-
0022688781
-
A robust layered control system for a mobile robot
-
Brooks R.A. A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation. 2:1986;14-23.
-
(1986)
IEEE Journal of Robotics and Automation
, vol.2
, pp. 14-23
-
-
Brooks, R.A.1
-
6
-
-
0007512578
-
Truncating temporal differences: On the efficient implementation of TD(l) for reinforcement learning
-
Cichosz P. Truncating temporal differences: on the efficient implementation of TD(l) for reinforcement learning. Journal of Artificial Intelligence Research. 2:1995;287-318.
-
(1995)
Journal of Artificial Intelligence Research
, vol.2
, pp. 287-318
-
-
Cichosz, P.1
-
9
-
-
44049116478
-
Forward models: Supervised learning with a distal teacher
-
Jordan M.I., Rumelhart D.E. Forward models: supervised learning with a distal teacher. Cognitive Science. 16:1992;307-354.
-
(1992)
Cognitive Science
, vol.16
, pp. 307-354
-
-
Jordan, M.I.1
Rumelhart, D.E.2
-
10
-
-
0022674420
-
Real time obstacle avoidance for manipulators and mobile robots
-
Khatib O. Real time obstacle avoidance for manipulators and mobile robots. The International Journal of Robotics Research. 5:1986;90-98.
-
(1986)
The International Journal of Robotics Research
, vol.5
, pp. 90-98
-
-
Khatib, O.1
-
12
-
-
0023365547
-
A simple motion-planning algorithm for general robot manipulators
-
Lozano-Pérez T. A simple motion-planning algorithm for general robot manipulators. IEEE Journal of Robotics and Automation. 3:1987;224-238.
-
(1987)
IEEE Journal of Robotics and Automation
, vol.3
, pp. 224-238
-
-
Lozano-Pérez, T.1
-
13
-
-
0029386560
-
Reinforcement learning of goal-directed obstacle-avoiding reaction strategies in an autonomous mobile robot
-
Millán J. del R. Reinforcement learning of goal-directed obstacle-avoiding reaction strategies in an autonomous mobile robot. Robotics and Autonomous Systems. 15:1995;275-299.
-
(1995)
Robotics and Autonomous Systems
, vol.15
, pp. 275-299
-
-
Millán, J.1
Del, R.2
-
15
-
-
0000714373
-
A reinforcement connectionist approach to robot path finding in non-maze-like environments
-
Millán J. del R., Torras C. A reinforcement connectionist approach to robot path finding in non-maze-like environments. Machine Learning. 8:1992;363-395.
-
(1992)
Machine Learning
, vol.8
, pp. 363-395
-
-
Millán, J.1
Del, R.2
Torras, C.3
-
16
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton R.S. Learning to predict by the methods of temporal differences. Machine Learning. 3:1988;9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
18
-
-
0015475961
-
The mathematics of coordinated control of prosthetic arms and manipulators
-
Whitney D. The mathematics of coordinated control of prosthetic arms and manipulators. ASME Journal of Dynamics Systems, Mathematics, and Control. 94:1972;303-309.
-
(1972)
ASME Journal of Dynamics Systems, Mathematics, and Control
, vol.94
, pp. 303-309
-
-
Whitney, D.1
-
19
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning. 8:1992;229-256.
-
(1992)
Machine Learning
, vol.8
, pp. 229-256
-
-
Williams, R.J.1
|