SCOPUS 정보 검색 플랫폼

Volumn 11, Issue 2, 1998, Pages 359-376

Learning reaching strategies through reinforcement for a sensor-based manipulator

(2) Martín, Pedro a Millán, José Del R b

Author keywords

Differential inverse kinematics; Multilink manipulators; Neural networks; Reaching strategies; Reactive systems; Reinforcement learning

Indexed keywords

CALCULATIONS; COLLISION AVOIDANCE; INVERSE KINEMATICS; MANIPULATORS; NEURAL NETWORKS; ROBOTIC ARMS; SENSORS;

DIFFERENTIAL INVERSE KINEMATICS; MANIPULATOR FORWARD KINEMATICS; MULTILINK MANIPULATORS; MULTILINK ROBOT ARM; REINFORCEMENT LEARNING;

LEARNING SYSTEMS;

ARTICLE; AVOIDANCE BEHAVIOR; FEEDBACK SYSTEM; LEARNING; NERVE CELL NETWORK; NONHUMAN; PRIORITY JOURNAL; REACTION ANALYSIS; REINFORCEMENT; ROBOTICS; SIMULATION; TASK PERFORMANCE;

EID: 0032029373 PISSN: 08936080 EISSN: None Source Type: Journal
DOI: 10.1016/S0893-6080(97)00137-8 Document Type: Article

Times cited : (6)

References (19)

1
- 0028744877
- Path planning with neural subgoal search
- Baginski, B., & Eldracher, M. (1994). Path planning with neural subgoal search. In Proceedings of the IEEE World Congress on Computational Intelligence (pp. 2732-2736).
- (1994) In Proceedings of the IEEE World Congress on Computational Intelligence , pp. 2732-2736
- Baginski, B.¹ Eldracher, M.²

2
- 0026369737
- Robot motion planning: A distributed representation approach
- Barraquand J., Latombe J.C. Robot motion planning: a distributed representation approach. The International Journal of Robotics Research. 10:1991;628-649.
- (1991) The International Journal of Robotics Research , vol.10 , pp. 628-649
- Barraquand, J.¹ Latombe, J.C.²

3
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- Barto A.G., Sutton R.S., Anderson C.W. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man and Cybernetics. 13:1983;834-846.
- (1983) IEEE Transactions on Systems, Man and Cybernetics , vol.13 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

4
- 0002201501
- Learning and sequential decision making
- In M. Gabriel and J.W. Moore (Eds.), Cambridge, MA: MIT Press.
- Barto, A.G., Sutton, R.S. and Watkins, C.J.C.H. (1990). Learning and sequential decision making. In M. Gabriel and J.W. Moore (Eds.), Learning and computational neuroscience: foundations of adaptive networks (pp. 539-602). Cambridge, MA: MIT Press.
- (1990) Learning and Computational Neuroscience: Foundations of Adaptive Networks , pp. 539-602
- Barto, A.G.¹ Sutton, R.S.² Watkins, C.J.C.H.³

5
- 0022688781
- A robust layered control system for a mobile robot
- Brooks R.A. A robust layered control system for a mobile robot. IEEE Journal of Robotics and Automation. 2:1986;14-23.
- (1986) IEEE Journal of Robotics and Automation , vol.2 , pp. 14-23
- Brooks, R.A.¹

6
- 0007512578
- Truncating temporal differences: On the efficient implementation of TD(l) for reinforcement learning
- Cichosz P. Truncating temporal differences: on the efficient implementation of TD(l) for reinforcement learning. Journal of Artificial Intelligence Research. 2:1995;287-318.
- (1995) Journal of Artificial Intelligence Research , vol.2 , pp. 287-318
- Cichosz, P.¹

7
- 0344864889
- Fast and efficient reinforcement learning with truncated temporal differences
- San Francisco: Morgan Kaufman.
- Cichosz, P. and Mulawka, J.J. (1995). Fast and efficient reinforcement learning with truncated temporal differences. In Proceedings of the 12th International Conference on Machine Learning (pp. 99-107). San Francisco: Morgan Kaufman.
- (1995) In Proceedings of the 12th International Conference on Machine Learning , pp. 99-107
- Cichosz, P.¹ Mulawka, J.J.²

8
- 0026913154
- Gross motion planning - A survey
- Hwang Y.K., Ahuja N. Gross motion planning - a survey. ACM Computing Surveys. 24:1992;219-291.
- (1992) ACM Computing Surveys , vol.24 , pp. 219-291
- Hwang, Y.K.¹ Ahuja, N.²

9
- 44049116478
- Forward models: Supervised learning with a distal teacher
- Jordan M.I., Rumelhart D.E. Forward models: supervised learning with a distal teacher. Cognitive Science. 16:1992;307-354.
- (1992) Cognitive Science , vol.16 , pp. 307-354
- Jordan, M.I.¹ Rumelhart, D.E.²

10
- 0022674420
- Real time obstacle avoidance for manipulators and mobile robots
- Khatib O. Real time obstacle avoidance for manipulators and mobile robots. The International Journal of Robotics Research. 5:1986;90-98.
- (1986) The International Journal of Robotics Research , vol.5 , pp. 90-98
- Khatib, O.¹

11
- 0025475651
- Inversion of neural networks by gradient descent
- Kindermann J., Linden A. Inversion of neural networks by gradient descent. Journal of Parallel Computing. 14:1992;277-286.
- (1992) Journal of Parallel Computing , vol.14 , pp. 277-286
- Kindermann, J.¹ Linden, A.²

12
- 0023365547
- A simple motion-planning algorithm for general robot manipulators
- Lozano-Pérez T. A simple motion-planning algorithm for general robot manipulators. IEEE Journal of Robotics and Automation. 3:1987;224-238.
- (1987) IEEE Journal of Robotics and Automation , vol.3 , pp. 224-238
- Lozano-Pérez, T.¹

13
- 0029386560
- Reinforcement learning of goal-directed obstacle-avoiding reaction strategies in an autonomous mobile robot
- Millán J. del R. Reinforcement learning of goal-directed obstacle-avoiding reaction strategies in an autonomous mobile robot. Robotics and Autonomous Systems. 15:1995;275-299.
- (1995) Robotics and Autonomous Systems , vol.15 , pp. 275-299
- Millán, J.¹ Del, R.²

14
- 0030171602
- Rapid, safe and incremental learning of navigation strategies
- Millán J. del R. Rapid, safe and incremental learning of navigation strategies. IEEE Transactions on Systems, Man and Cybernetics - Part B. 26:1996;408-420.
- (1996) IEEE Transactions on Systems, Man and Cybernetics - Part B , vol.26 , pp. 408-420
- Millán, J.¹ Del, R.²

15
- 0000714373
- A reinforcement connectionist approach to robot path finding in non-maze-like environments
- Millán J. del R., Torras C. A reinforcement connectionist approach to robot path finding in non-maze-like environments. Machine Learning. 8:1992;363-395.
- (1992) Machine Learning , vol.8 , pp. 363-395
- Millán, J.¹ Del, R.² Torras, C.³

16
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton R.S. Learning to predict by the methods of temporal differences. Machine Learning. 3:1988;9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

17
- 85152550360
- A modular Q-learning architecture for manipulator task decomposition
- San Francisco: Morgan Kaufman.
- Tham, C.K. and Prager, R.W. (1994). A modular Q-learning architecture for manipulator task decomposition. In Proceedings of the 11th International Conference on Machine Learning (pp. 309-317). San Francisco: Morgan Kaufman.
- (1994) In Proceedings of the 11th International Conference on Machine Learning , pp. 309-317
- Tham, C.K.¹ Prager, R.W.²

18
- 0015475961
- The mathematics of coordinated control of prosthetic arms and manipulators
- Whitney D. The mathematics of coordinated control of prosthetic arms and manipulators. ASME Journal of Dynamics Systems, Mathematics, and Control. 94:1972;303-309.
- (1972) ASME Journal of Dynamics Systems, Mathematics, and Control , vol.94 , pp. 303-309
- Whitney, D.¹

19
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams R.J. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning. 8:1992;229-256.
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.