메뉴 건너뛰기




Volumn , Issue , 2009, Pages 1015-1023

Skill discovery in continuous reinforcement learning domains using skill chaining

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS DOMAIN; CONTINUOUS REINFORCEMENT; PERFORMANCE BENEFITS; REINFORCEMENT LEARNINGS;

EID: 80055032021     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (290)

References (26)
  • 1
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Special Issue on Reinforcement Learning
    • A.G. Barto and S. Mahadevan. Recent advances in hierarchical reinforcement learning. Discrete Event Systems, 13:41-77, 2003. Special Issue on Reinforcement Learning.
    • (2003) Discrete Event Systems , vol.13 , pp. 41-77
    • Barto, A.G.1    Mahadevan, S.2
  • 2
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • R.S. Sutton, D. Precup, and S.P. Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2):181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3
  • 23
    • 77956435931 scopus 로고    scopus 로고
    • Value function approximation in reinforcement learning using the fourier basis
    • University of Massachusetts Amherst, June
    • G.D. Konidaris and S. Osentoski. Value function approximation in reinforcement learning using the Fourier basis. Technical Report UM-CS-2008-19, Department of Computer Science, University of Massachusetts Amherst, June 2008.
    • (2008) Technical Report UM-cs-2008-19, Department of Computer Science
    • Konidaris, G.D.1    Osentoski, S.2
  • 24
    • 70349322784 scopus 로고    scopus 로고
    • Learning representation and control in Markov decision processes: New frontiers
    • S. Mahadevan. Learning representation and control in Markov Decision Processes: New frontiers. Foundations and Trends in Machine Learning, 1(4):403-565, 2009.
    • (2009) Foundations and Trends in Machine Learning , vol.1 , Issue.4 , pp. 403-565
    • Mahadevan, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.