SCOPUS Information Search Platform

Proceedings of SPIE - The International Society for Optical Engineering

Volume 4573, 2001, Pages 92-103

Reinforcement learning for robot control

(2) Smart, William D. (a); Kaelbling, Leslie Pack (a)

(a) WASHINGTON UNIVERSITY (United States)

Author keywords

Learning by demonstration; Learning control; Machine learning; Mobile robots; Reinforcement learning

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; CONTROL SYSTEM ANALYSIS; MOBILE ROBOTS; PROBLEM SOLVING; REINFORCEMENT LEARNING; ROBOT LEARNING

EID: 0035763997 PISSN: 0277786X EISSN: None Source Type: Journal
DOI: 10.1117/12.457434 Document Type: Article

Times cited: 10

References (22)

1
- 34249833101
- Q-learning
- C.J.C.H. Watkins and P. Dayan, "Q-learning," Machine Learning 8, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

2
- 0003275980
- Adaptive computation and machine learning
- MIT Press, Cambridge, MA
- R.S. Sutton and A.G. Barto, Reinforcement Learning: An Introduction, Adaptive Computation and Machine Learning, MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

3
- 0029679044
- Reinforcement learning: A survey
- L.P. Kaelbling, M.L. Littman, and A.W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research 4, pp. 237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

4
- 85153940465
- Generalization in reinforcement learning: Safely approximating the value function
- G. Tesauro, D.S. Touretzky, and T. Leen, eds., MIT Press
- J.A. Boyan and A.W. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems, G. Tesauro, D.S. Touretzky, and T. Leen, eds., 7, pp. 369-376, MIT Press, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 369-376
- Boyan, J.A.¹ Moore, A.W.²

5
- 0003989207
- PhD thesis, School of Computer Science, Carnegie Mellon University, June. Also available as technical report CMU-CS-99-143
- G.J. Gordon, Approximate Solutions to Markov Decision Processes. PhD thesis, School of Computer Science, Carnegie Mellon University, June 1999. Also available as technical report CMU-CS-99-143.
- (1999) Approximate Solutions to Markov Decision Processes
- Gordon, G.J.¹

6
- 0031074521
- Locally weighted learning
- C.G. Atkeson, A.W. Moore, and S. Schaal, "Locally weighted learning," Artificial Intelligence Review 11, pp. 11-73, 1997.
- (1997) Artificial Intelligence Review , vol.11 , pp. 11-73
- Atkeson, C.G.¹ Moore, A.W.² Schaal, S.³

7
- 84918743166
- Influential observations in linear regression
- March
- R.D. Cook, "Influential observations in linear regression," Journal of the American Statistical Association 74, pp. 169-174, March 1979.
- (1979) Journal of the American Statistical Association , vol.74 , pp. 169-174
- Cook, R.D.¹

8
- 0004090962
- PhD thesis, Department of Computer Science, Brown University, May
- W.D. Smart, Making Reinforcement Learning Work on Real Robots. PhD thesis, Department of Computer Science, Brown University, May 2002.
- (2002) Making Reinforcement Learning Work on Real Robots
- Smart, W.D.¹

9
- 0001898381
- Practical reinforcement learning in continuous spaces
- W.D. Smart and L.P. Kaelbling, "Practical reinforcement learning in continuous spaces," in Proceedings of the Seventeenth International Conference on Machine Learning (ICML-2000), pp. 903-910, 2000.
- (2000) Proceedings of the Seventeenth International Conference on Machine Learning (ICML-2000) , pp. 903-910
- Smart, W.D.¹ Kaelbling, L.P.²

10
- 33645627992
- Machine learning for robots: A comparison of different paradigms
- S. Mahadevan, "Machine learning for robots: A comparison of different paradigms," in Proceedings of the Workshop on Towards Real Autonomy, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '96), 1996.
- (1996) Proceedings of the Workshop on Towards Real Autonomy, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS '96)
- Mahadevan, S.¹

11
- 0003896797
- Kluwer Academic Publishers, Boston, MA
- D.A. Pomerleau, Neural Network Perception for Mobile Robot Guidance, Kluwer Academic Publishers, Boston, MA, 1993.
- (1993) Neural Network Perception for Mobile Robot Guidance
- Pomerleau, D.A.¹

12
- 0001259406
- Robot learning by nonparametric regression
- V. Graefe, ed
- S. Schaal and C.G. Atkeson, "Robot learning by nonparametric regression," in Proceedings of Intelligent Robots and Systems 1994 (IROS '94), V. Graefe, ed., pp. 137-154, 1995.
- (1995) Proceedings of Intelligent Robots and Systems 1994 (IROS '94) , pp. 137-154
- Schaal, S.¹ Atkeson, C.G.²

13
- 0028740409
- Learning by watching: Extracting reusable task knowledge from visual observation of human performance
- December
- Y. Kuniyoshi, M. Inaba, and H. Inoue, "Learning by watching: Extracting reusable task knowledge from visual observation of human performance," IEEE Transactions on Robotics and Automation 10, pp. 799-822, December 1994.
- (1994) IEEE Transactions on Robotics and Automation , vol.10 , pp. 799-822
- Kuniyoshi, Y.¹ Inaba, M.² Inoue, H.³

14
- 0031287713
- Transfer of elementary skills via human-robot interaction
- M. Kaiser, "Transfer of elementary skills via human-robot interaction," Adaptive Behavior 5(3/4), pp. 249-280, 1997.
- (1997) Adaptive Behavior , vol.5 , Issue.3-4 , pp. 249-280
- Kaiser, M.¹

15
- 0001847657
- Imitative learning mechanisms in robots and humans
- V. Klingspor, ed., (Bari, Italy), July
- J. Demiris and G. Hayes, "Imitative learning mechanisms in robots and humans," in Proceedings of the 5th European Workshop on Learning Robots, V. Klingspor, ed., (Bari, Italy), July 1996.
- (1996) Proceedings of the 5th European Workshop on Learning Robots
- Demiris, J.¹ Hayes, G.²

16
- 0002734328
- Robot see, robot do: An overview of robot imitation
- P. Bakker and Y. Kuniyoshi, "Robot see, robot do: An overview of robot imitation," in Proceedings of the AISB96 Workshop on Learning in Robots and Animals, pp. 3-11, 1996.
- (1996) Proceedings of the AISB96 Workshop on Learning in Robots and Animals , pp. 3-11
- Bakker, P.¹ Kuniyoshi, Y.²

17
- 84976813028
- Learning to coordinate behaviors
- AAAI Press, Menlo Park, CA
- P. Maes and R.A. Brooks, "Learning to coordinate behaviors," in Proceedings of the Eighth National Conference on Artificial Intelligence (AAAI '90), pp. 796-802, AAAI Press, (Menlo Park, CA), 1990.
- (1990) Proceedings of the Eighth National Conference on Artificial Intelligence (AAAI '90) , pp. 796-802
- Maes, P.¹ Brooks, R.A.²

18
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- June
- S. Mahadevan and J. Connell, "Automatic programming of behavior-based robots using reinforcement learning," Artificial Intelligence 55, pp. 311-365, June 1992.
- (1992) Artificial Intelligence , vol.55 , pp. 311-365
- Mahadevan, S.¹ Connell, J.²

19
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L.-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning 8, pp. 293-321, 1992.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.-J.¹

20
- 0030149709
- Purposive behavior acquisition for a real robot by vision-based reinforcement learning
- M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda, "Purposive behavior acquisition for a real robot by vision-based reinforcement learning," Machine Learning 23, pp. 279-303, 1996.
- (1996) Machine Learning , vol.23 , pp. 279-303
- Asada, M.¹ Noda, S.² Tawaratsumida, S.³ Hosoda, K.⁴

21
- 0029753630
- Reinforcement learning with replacing eligibility traces
- S.P. Singh and R.S. Sutton, "Reinforcement learning with replacing eligibility traces," Machine Learning 22, pp. 123-158, 1996.
- (1996) Machine Learning , vol.22 , pp. 123-158
- Singh, S.P.¹ Sutton, R.S.²

22
- 0028739953
- Robot shaping: Developing autonomous agents through learning
- M. Dorigo and M. Colombetti, "Robot shaping: Developing autonomous agents through learning," Artificial Intelligence 71(2), pp. 321-370, 1994.
- (1994) Artificial Intelligence , vol.71 , Issue.2 , pp. 321-370
- Dorigo, M.¹ Colombetti, M.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.