



Volume , Issue , 2012, Pages

Scaling life-long off-policy learning

Author keywords

[No Author keywords available]

Indexed keywords

LIFE LONG LEARNING; LIFE-TIMES; MULTI-STEP PREDICTION; PHYSICAL ROBOTS; REAL TIME; TRAINING DATA

EID: 84872849054     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/DevLrn.2012.6400860     Document Type: Conference Paper
Times cited : (23)

References (18)
  • 4
    • 78049390740
    • Policy search for motor primitives in robotics
    • Kober, J., Peters, J. (2011). Policy search for motor primitives in robotics. Machine Learning 84:171-203.
    • (2011) Machine Learning , vol.84 , pp. 171-203
    • Kober, J.1    Peters, J.2
  • 7
    • 84866006400
    • Multi-timescale Nexting in a Reinforcement Learning Robot
    • Proceedings of the 12th International Conference on Simulation of Adaptive Behavior
    • Modayil, J., White, A., Sutton, R. S. (2012). Multi-timescale Nexting in a Reinforcement Learning Robot. In Proceedings of the 12th International Conference on Simulation of Adaptive Behavior, LNAI 7426, 299-309.
    • (2012) LNAI , vol.7426 , pp. 299-309
    • Modayil, J.1    White, A.2    Sutton, R.S.3
  • 13
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D., Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112:181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.