Volume , Issue , 2007, Pages 441-448

ILSTD: Eligibility traces and convergence analysis

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL ADVANTAGES; CONVERGENCE ANALYSIS; ELIGIBILITY TRACES; FEATURE VECTORS; FULL COSTS; INCREMENTAL METHOD; LEAST SQUARE; LINEAR FUNCTIONS; NEW RESULTS; POLICY EVALUATION; TEMPORAL DIFFERENCE LEARNING;

EID: 56449115872     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (28)

References (11)
  • 3
    • 0036832950
    • Technical update: Least-squares temporal difference learning
    • Justin A. Boyan. Technical update: Least-squares temporal difference learning. Machine Learning, 49:233-246, 2002.
    • (2002) Machine Learning , vol.49 , pp. 233-246
    • Boyan, J.A.1
  • 4
    • 0001771345
    • Linear least-squares algorithms for temporal difference learning
    • S. Bradtke and A. Barto. Linear least-squares algorithms for temporal difference learning. Machine Learning, 22:33-57, 1996.
    • (1996) Machine Learning , vol.22 , pp. 33-57
    • Bradtke, S.1    Barto, A.2
  • 6
    • 84864051496
    • The University of Alberta reinforcement learning library
    • RL Library. The University of Alberta reinforcement learning library. http://rlai.cs.ualberta.ca/RLR/environment.html, 2006.
    • (2006)
  • 9
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Richard S. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 10
    • 85156221438
    • Generalization in reinforcement learning: Successful examples using sparse coarse coding
    • The MIT Press
    • Richard S. Sutton. Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Advances in Neural Information Processing Systems 8, pages 1038-1044. The MIT Press, 1996.
    • (1996) Advances in Neural Information Processing Systems , vol.8 , pp. 1038-1044
    • Sutton, R.S.1
  • 11
    • 0031143730
    • An analysis of temporal-difference learning with function approximation
    • John N. Tsitsiklis and Benjamin Van Roy. An analysis of temporal-difference learning with function approximation. IEEE Transactions on Automatic Control, 42(5):674-690, 1997.
    • (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.