메뉴 건너뛰기




Volumn , Issue , 2005, Pages

Temporal-difference networks

Author keywords

[No Author keywords available]

Indexed keywords

MONTE CARLO METHODS;

EID: 84899003536     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (53)

References (12)
  • 1
    • 0036832950 scopus 로고    scopus 로고
    • Technical update: Least-squares temporal difference learning
    • Boyan, J. A. (2000). Technical update: Least-squares temporal difference learning. Machine Learning 49:233-246.
    • (2000) Machine Learning , vol.49 , pp. 233-246
    • Boyan, J.A.1
  • 2
    • 0001771345 scopus 로고    scopus 로고
    • Linear least-squares algorithms for temporal difference learning
    • Bradtke, S. J. and Barto, A. G. (1996). Linear least-squares algorithms for temporal difference learning. Machine Learning 22(1-3):33-57.
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 33-57
    • Bradtke, S.J.1    Barto, A.G.2
  • 3
    • 0001158047 scopus 로고
    • Improving generalization for temporal difference learning: The successor representation
    • Dayan, P. (1993). Improving generalization for temporal difference learning: The successor representation. Neural Computation 5(4):613-624.
    • (1993) Neural Computation , vol.5 , Issue.4 , pp. 613-624
    • Dayan, P.1
  • 10
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning 3:9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 11
    • 0001059972 scopus 로고
    • TD models: Modeling the world at a mixture of time scales
    • A. Prieditis and S. Russell (eds.), Morgan Kaufmann, San Francisco
    • Sutton, R. S. (1995). TD models: Modeling the world at a mixture of time scales. In A. Prieditis and S. Russell (eds.), Proceedings of the Twelfth International Conference on Machine Learning, pp. 531.539. Morgan Kaufmann, San Francisco.
    • (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 531539
    • Sutton, R.S.1
  • 12
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D. and Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112:181-121.
    • (1999) Artificial Intelligence , vol.112 , pp. 121-181
    • Sutton, R.S.1    Precup, D.2    Singh, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.