메뉴 건너뛰기




Volumn , Issue , 2005, Pages 553-560

Proto-value functions: Developmental reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ORTHONORMAL BASIS; PROTOVALUE FUNCTIONS; REPRESENTATION POLICY ITERATION;

EID: 31844433360     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1102351.1102421     Document Type: Conference Paper
Times cited : (109)

References (14)
  • 10
    • 14344264466 scopus 로고    scopus 로고
    • Q-cut: Dynamic discovery of sub-goals in reinforcement learning
    • Menache, I., Mannor, S., & Shimkin, N. (2002). Q-cut: Dynamic discovery of sub-goals in reinforcement learning, ECML.
    • (2002) ECML
    • Menache, I.1    Mannor, S.2    Shimkin, N.3
  • 13
    • 31844439372 scopus 로고    scopus 로고
    • Local graph partitioning as a basis for generating temporally extended actions in reinforcement learning
    • Simsek, O., Wolfe, A., & Barto, A. (2005). Local graph partitioning as a basis for generating temporally extended actions in reinforcement learning. International Conference on Machine Learning.
    • (2005) International Conference on Machine Learning
    • Simsek, O.1    Wolfe, A.2    Barto, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.