메뉴 건너뛰기




Volumn , Issue , 2005, Pages 817-824

Identifying useful subgoals in reinforcement learning by local graph partitioning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTATION THEORY; COSTS; GRAPH THEORY;

EID: 31844447221     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (231)

References (18)
  • 2
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 7
    • 0000123778 scopus 로고
    • Self-Improving reactive agents based on reinforcement learning, planning and teaching
    • Lin, L. (1992). Self-Improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8, 293-321.
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.1
  • 16
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D., & Singh, S. P. (1999). Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3
  • 18
    • 0027697605 scopus 로고
    • An optimal graph theoretic approach to data clustering: Theory and its application to image segmentation
    • Wu, Z., & Leahy, R. (1993). An optimal graph theoretic approach to data clustering: theory and its application to image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15, 1101-1113.
    • (1993) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.15 , pp. 1101-1113
    • Wu, Z.1    Leahy, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.