메뉴 건너뛰기




Volumn 22, Issue 1-3, 1996, Pages 95-121

On the worst-case analysis of temporal-difference learning algorithms

Author keywords

Machine learning; On line learning; Temporal difference learning; Worst case analysis

Indexed keywords


EID: 0013419177     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1007/BF00114725     Document Type: Article
Times cited : (32)

References (8)
  • 2
    • 0000430514 scopus 로고
    • The convergence of TD(λ) for general λ
    • Peter Dayan (1992). The convergence of TD(λ) for general λ. Machine Learning, 8(3/4):341-362.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 341-362
    • Dayan, P.1
  • 3
    • 0028388685 scopus 로고
    • TD(λ) converges with probability I
    • Peter Dayan & Terrence J. Sejnowski (1994) TD(λ) converges with probability I. Machine Learning, 14(3):295-301.
    • (1994) Machine Learning , vol.14 , Issue.3 , pp. 295-301
    • Dayan, P.1    Sejnowski, T.J.2
  • 5
    • 4243385070 scopus 로고
    • On the convergence of stochastic iterative dynamic programming algorithms
    • MIT Computational Cognitive Science
    • Tommi Jaakkola, Michael I. Jordan. & Satinder P. Singh. (1993). On the convergence of stochastic iterative dynamic programming algorithms Technical Report 9307, MIT Computational Cognitive Science.
    • (1993) Technical Report , vol.9307
    • Jaakkola, T.1    Jordan, M.I.2    Singh, S.P.3
  • 6
    • 0003698024 scopus 로고
    • Additive versus exponentiated gradient updates for learning linear functions
    • University of California Santa Cruz, Computer Research Laboratory
    • Jyrki Kivinen & Manfred K. Warmuth (1994) Additive versus exponentiated gradient updates for learning linear functions Technical Report UCSC-CRL-94-16, University of California Santa Cruz, Computer Research Laboratory.
    • (1994) Technical Report UCSC-CRL-94-16
    • Kivinen, J.1    Warmuth, M.K.2
  • 7
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Richard S. Sutton. (1988). Learning to predict by the methods of temporal differences Machine Learning, 3:9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.