메뉴 건너뛰기




Volumn , Issue , 2010, Pages

Constructing skill trees for reinforcement learning agents from demonstration trajectories

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTING; CHAINS; DEMONSTRATIONS; FORESTRY; MANIPULATORS; REINFORCEMENT LEARNING; TREES (MATHEMATICS);

EID: 85162033542     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (63)

References (26)
  • 1
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Special Issue on Reinforcement Learning
    • A.G. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Dynamic Systems, 13:41-77, 2003. Special Issue on Reinforcement Learning.
    • (2003) Discrete Event Dynamic Systems , vol.13 , pp. 41-77
    • Barto, A.G.1    Mahadevan, S.2
  • 3
    • 80055032021 scopus 로고    scopus 로고
    • Skill discovery in continuous reinforcement learning domains using skill chaining
    • G.D. Konidaris and A.G. Barto. Skill discovery in continuous reinforcement learning domains using skill chaining. In Advances in Neural Information Processing Systems 22, pages 1015-1023, 2009.
    • (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1015-1023
    • Konidaris, G.D.1    Barto, A.G.2
  • 6
    • 0031343489 scopus 로고    scopus 로고
    • A feedback control structure for on-line learning tasks
    • M. Huber and R.A. Grupen. A feedback control structure for on-line learning tasks. Robotics and Autonomous Systems, 22(3-4):303-315, 1997.
    • (1997) Robotics and Autonomous Systems , vol.22 , Issue.3-4 , pp. 303-315
    • Huber, M.1    Grupen, R.A.2
  • 7
    • 84979715630 scopus 로고    scopus 로고
    • Supervised actor-critic reinforcement learning
    • J. Si, A.G. Barto, A. Powell, and D. Wunsch, editors. John Wiley & Sons, Inc., New York
    • M. Rosenstein and A.G. Barto. Supervised actor-critic reinforcement learning. In J. Si, A.G. Barto, A. Powell, and D. Wunsch, editors, Learning and Approximate Dynamic Programming: Scaling up the Real World, pages 359-380. John Wiley & Sons, Inc., New York, 2004.
    • (2004) Learning and Approximate Dynamic Programming: Scaling Up the Real World , pp. 359-380
    • Rosenstein, M.1    Barto, A.G.2
  • 9
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • R.S. Sutton, D. Precup, and S.P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3
  • 13
    • 77956435931 scopus 로고    scopus 로고
    • Value function approximation in reinforcement learning using the Fourier basis
    • University of Massachusetts Amherst, June
    • G.D. Konidaris and S. Osentoski. Value function approximation in reinforcement learning using the Fourier basis. Technical Report UM-CS-2008-19, Department of Computer Science, University of Massachusetts Amherst, June 2008.
    • (2008) Technical Report UM-CS-2008-19, Department of Computer Science
    • Konidaris, G.D.1    Osentoski, S.2
  • 17
    • 77957761338 scopus 로고    scopus 로고
    • LQR-Trees: Feedback motion planning on sparse randomized trees
    • R. Tedrake. LQR-Trees: Feedback motion planning on sparse randomized trees. In Proceedings of Robotics: Science and Systems, pages 18-24, 2009.
    • (2009) Proceedings of Robotics: Science and Systems , pp. 18-24
    • Tedrake, R.1
  • 20
    • 17144391260 scopus 로고    scopus 로고
    • Performance-derived behavior vocabularies: Data-driven acquisition of skills from motion
    • O.C. Jenkins and M. Matarić. Performance-derived behavior vocabularies: data-driven acquisition of skills from motion. International Journal of Humanoid Robotics, 1(2):237-288, 2004.
    • (2004) International Journal of Humanoid Robotics , vol.1 , Issue.2 , pp. 237-288
    • Jenkins, O.C.1    Matarić, M.2
  • 23
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor-critic
    • J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71(7-9):1180-1190, 2008.
    • (2008) Neurocomputing , vol.71 , Issue.7-9 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.