Volume 4, 2014, Pages 1350-1358

Time-regularized interrupting options

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING SYSTEMS;

EID: 84919807958     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 33

References (15)
  • 4
    • 84862001711
    • Transfer in reinforcement learning via shared features
    • 13, June
    • Konidaris, George, Scheidwasser, Ilya, and Barto, Andrew. Transfer in reinforcement learning via shared features. J. Mach. Learn. Res., 13:1333-1371, June 2012.
    • (2012) J. Mach. Learn. Res. , pp. 1333-1371
    • Konidaris, G.1    Scheidwasser, I.2    Barto, A.3
  • 7
    • 84945250000
    • Q-cut: Dynamic discovery of sub-goals in reinforcement learning
    • Springer
    • Menache, Ishai, Mannor, Shie, and Shimkin, Nahum. Q-cut: Dynamic discovery of sub-goals in reinforcement learning. In Machine Learning: ECML 2002, pp. 295-306. Springer, 2002.
    • (2002) Machine Learning: ECML 2002 , pp. 295-306
    • Menache, I.1    Mannor, S.2    Shimkin, N.3
  • 12
    • 27544506565
    • Reinforcement learning for RoboCup soccer keepaway
    • Stone, Peter, Sutton, Richard S, and Kuhlmann, Gregory. Reinforcement learning for RoboCup soccer keepaway. Adaptive Behavior, 13(3):165-188, 2005.
    • (2005) Adaptive Behavior , vol.13 , Issue.3 , pp. 165-188
    • Stone, P.1    Sutton, R.S.2    Kuhlmann, G.3
  • 14
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • August
    • Sutton, Richard S, Precup, Doina, and Singh, Satinder. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1):181-211, August 1999.
    • (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.