메뉴 건너뛰기




Volumn , Issue , 2008, Pages 560-567

A worst-case comparison between temporal difference and residual gradient with linear function approximation

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; MACHINE LEARNING; MARKOV PROCESSES; FUNCTIONS; INTERNET; LEARNING SYSTEMS; PROBABILITY DENSITY FUNCTION; ROBOT LEARNING;

EID: 56449125197     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1390156.1390227     Document Type: Conference Paper
Times cited : (22)

References (15)
  • 4
    • 0030145382 scopus 로고    scopus 로고
    • Worst-case quadratic loss bounds for prediction using linear functions and gradient descent
    • Cesa-Bianchi, N., Long, P. M., & Warmuth, M. (1996). Worst-case quadratic loss bounds for prediction using linear functions and gradient descent. IEEE Transactions on Neural Networks, 7, 604-619.
    • (1996) IEEE Transactions on Neural Networks , vol.7 , pp. 604-619
    • Cesa-Bianchi, N.1    Long, P.M.2    Warmuth, M.3
  • 6
    • 0008815681 scopus 로고    scopus 로고
    • Exponentiated gradient versus gradient descent for linear predictors
    • Kivinen, J., & Warmuth, M. K. (1997). Exponentiated gradient versus gradient descent for linear predictors. Information and Computation, 132, 1-63.
    • (1997) Information and Computation , vol.132 , pp. 1-63
    • Kivinen, J.1    Warmuth, M.K.2
  • 11
    • 0013419177 scopus 로고    scopus 로고
    • On the worst-case analysis of temporal-difference learning algorithms
    • Schapire, R. E., & Warmuth, M. K. (1996). On the worst-case analysis of temporal-difference learning algorithms. Machine Learning, 22, 95-122.
    • (1996) Machine Learning , vol.22 , pp. 95-122
    • Schapire, R.E.1    Warmuth, M.K.2
  • 13
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 15
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal-difference learning with function approximation
    • Tsitsiklis, J. N., & Van Roy, B. (1997). An analysis of temporal-difference learning with function approximation. IEEE Transactions on Automatic Control, 42, 674-690.
    • (1997) IEEE Transactions on Automatic Control , vol.42 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.