메뉴 건너뛰기




Volumn 5506 LNCS, Issue PART 1, 2009, Pages 829-834

Automatic discovery of subgoals in reinforcement learning using strongly connected components

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATIC DETECTION; AUTOMATIC DISCOVERY; DISCOVERY ALGORITHM; GRAPH PARTITIONING; GRAPH THEORETIC APPROACH; HIERARCHICAL STRUCTURES; LINEAR TIME; PARAMETER-TUNING; REAL-WORLD PROBLEM; STRONGLY CONNECTED COMPONENT; SUBGOALS; TIME COMPLEXITY;

EID: 70349120325     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-02490-0_101     Document Type: Conference Paper
Times cited : (21)

References (9)
  • 6
    • 84945250000 scopus 로고    scopus 로고
    • Q-cut - Dynamic discovery of sub-goals in reinforcement learning
    • Elomaa, T., Mannila, H., Toivonen, H. (eds.) ECML 2002. Springer, Heidelberg
    • Menache, I., Mannor, S., Shimkin, N.: Q-cut - dynamic discovery of sub-goals in reinforcement learning. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) ECML 2002. LNCS, vol.2430, pp. 295-306. Springer, Heidelberg (2002)
    • (2002) LNCS , vol.2430 , pp. 295-306
    • Menache, I.1    Mannor, S.2    Shimkin, N.3
  • 9
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R.S., Precup, D., Singh, S.P.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Journal of Artificial Intelligence 112, 181-211 (1999)
    • (1999) Journal of Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.