Volume , Issue , 2005, Pages 1313-1320

Temporal abstraction in temporal-difference networks

Author keywords

[No Author keywords available]

Indexed keywords

ELIGIBILITY TRACES; FUNCTION APPROXIMATION; PRIMARY CONTRIBUTION; SEQUENCE OF ACTIONS; TEMPORAL ABSTRACTION; TIME INTERVAL; TIME STEP

EID: 84864067372     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 30

References (10)
  • 1
    • 0034198996
    • Observable operator models for discrete stochastic time series
    • MIT Press
    • Jaeger, H. (2000). Observable operator models for discrete stochastic time series. Neural Computation, 12(6):1371-1398. MIT Press.
    • (2000) Neural Computation , vol.12 , Issue.6 , pp. 1371-1398
    • Jaeger, H.1
  • 2
    • 84898982129
    • Predictive representations of state
    • T. G. Dietterich, S. Becker and Z. Ghahramani (eds.). MIT Press
    • Littman, M., Sutton, R. S., & Singh, S. (2002). Predictive representations of state. In T. G. Dietterich, S. Becker and Z. Ghahramani (eds.), Advances in Neural Information Processing Systems 14, pp. 1555-1561. MIT Press.
    • (2002) Advances in Neural Information Processing Systems , vol.14 , pp. 1555-1561
    • Littman, M.1    Sutton, R.S.2    Singh, S.3
  • 3
    • 4644328593
    • Off-policy temporal-difference learning with function approximation
    • C. E. Brodley, A. P. Danyluk (eds.). San Francisco, CA: Morgan Kaufmann
    • Precup, D., Sutton, R. S., & Dasgupta, S. (2001). Off-policy temporal-difference learning with function approximation. In C. E. Brodley, A. P. Danyluk (eds.), Proceedings of the Eighteenth International Conference on Machine Learning, pp. 417-424. San Francisco, CA: Morgan Kaufmann.
    • (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 417-424
    • Precup, D.1    Sutton, R.S.2    Dasgupta, S.3
  • 8
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, pp. 181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.