메뉴 건너뛰기




Volumn 2, Issue , 2012, Pages 1063-1070

Compositional planning using optimal option models

Author keywords

[No Author keywords available]

Indexed keywords

BELLMAN EQUATIONS; CLASSICAL PLANNING; DYNAMIC PROGRAMMING ALGORITHM; FUNDAMENTAL OPERATIONS; GENERALISATION; LEVELS OF ABSTRACTION; MACRO-OPERATORS; MODEL LEARNING; OPTION MODELS; PRIMITIVE ACTIONS; SUBGOALS; TEMPORAL ABSTRACTION; VALUE FUNCTIONS;

EID: 84867135062     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (41)

References (14)
  • 1
    • 0001119106 scopus 로고
    • On representations of problems of reasoning about actions
    • Amarel, S. On representations of problems of reasoning about actions. Machine Intelligence, 3:131-171, 1968.
    • (1968) Machine Intelligence , vol.3 , pp. 131-171
    • Amarel, S.1
  • 4
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.1
  • 8
    • 0002982589 scopus 로고
    • Chunking in SOAR: The anatomy of a general learning mechanism
    • Laird, J., Rosenbloom, P., and Newell, A. Chunking in SOAR: The anatomy of a general learning mechanism. Machine Learning, 1(1):1146, 1986.
    • (1986) Machine Learning , vol.1 , Issue.1 , pp. 1146
    • Laird, J.1    Rosenbloom, P.2    Newell, A.3
  • 14
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R., Precup, D., and Singh, S. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2): 181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.