메뉴 건너뛰기




Volumn 4573, Issue , 2001, Pages 92-103

Reinforcement learning for robot control

Author keywords

Learning by demonstration; Learning control; Machine learning; Mobile robots; Reinforcement learning

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; CONTROL SYSTEM ANALYSIS; MOBILE ROBOTS; PROBLEM SOLVING;

EID: 0035763997     PISSN: 0277786X     EISSN: None     Source Type: Journal    
DOI: 10.1117/12.457434     Document Type: Article
Times cited : (10)

References (22)
  • 4
    • 85153940465 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • G. Tesauro, D.S. Touretzky, and T. Leen, eds., MIT Press
    • J.A. Boyan and A.W. Moore, "Generalization in reinforcement learning: Safely approximating the value function," in Advances in Neural Information Processing Systems, G. Tesauro, D.S. Touretzky, and T. Leen, eds., 7, pp. 369-376, MIT Press, 1995.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 369-376
    • Boyan, J.A.1    Moore, A.W.2
  • 5
    • 0003989207 scopus 로고    scopus 로고
    • PhD thesis, School of Computer Science, Carnegie Mellon University, June. Also available as technical report CMU-CS-99-143
    • G.J. Gordon, Approximate Solutions to Markov Decision Processes. PhD thesis, School of Computer Science, Carnegie Mellon University, June 1999. Also available as technical report CMU-CS-99-143.
    • (1999) Approximate Solutions to Markov Decision Processes
    • Gordon, G.J.1
  • 13
    • 0028740409 scopus 로고
    • Learning by watching: Extracting reusable task knowledge from visual observation of human performance
    • December
    • Y. Kuniyoshi, M. Inaba, and H. Inoue, "Learning by watching: Extracting reusable task knowledge from visual observation of human performance," IEEE Transactions on Robotics and Automation 10, pp. 799-822, December 1994.
    • (1994) IEEE Transactions on Robotics and Automation , vol.10 , pp. 799-822
    • Kuniyoshi, Y.1    Inaba, M.2    Inoue, H.3
  • 14
    • 0031287713 scopus 로고    scopus 로고
    • Transfer of elementary skills via human-robot interaction
    • M. Kaiser, "Transfer of elementary skills via human-robot interaction," Adaptive Behavior 5(3/4), pp. 249-280, 1997.
    • (1997) Adaptive Behavior , vol.5 , Issue.3-4 , pp. 249-280
    • Kaiser, M.1
  • 18
    • 0026880130 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • June
    • S. Mahadevan and J. Connell, "Automatic programming of behavior-based robots using reinforcement learning," Machine Learning 55, pp. 311-365, June 1992.
    • (1992) Machine Learning , vol.55 , pp. 311-365
    • Mahadevan, S.1    Connell, J.2
  • 19
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • L.-J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning 8, pp. 293-321, 1992.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.-J.1
  • 20
    • 0030149709 scopus 로고    scopus 로고
    • Purposive behavior acquisition for a real robot by vision-based reinforcement learning
    • M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda, "Purposive behavior acquisition for a real robot by vision-based reinforcement learning," Machine Learning 23, pp. 279-303, 1996.
    • (1996) Machine Learning , vol.23 , pp. 279-303
    • Asada, M.1    Noda, S.2    Tawaratsumida, S.3    Hosoda, K.4
  • 21
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • S.P. Singh and R.S. Sutton, "Reinforcement learning with replacing eligibility traces," Machine Learning 22, pp. 123-158, 1996.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 22
    • 0028739953 scopus 로고
    • Robot shaping: Developing autonomous agents through learning
    • M. Dorigo and M. Colombetti, "Robot shaping: Developing autonomous agents through learning," Artificial Intelligence 71(2), pp. 321-370, 1994.
    • (1994) Artificial Intelligence , vol.71 , Issue.2 , pp. 321-370
    • Dorigo, M.1    Colombetti, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.