메뉴 건너뛰기




Volumn , Issue , 2002, Pages

Batch value function approximation via support vectors

Author keywords

[No Author keywords available]

Indexed keywords

GRADIENT METHODS;

EID: 84899029004     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (29)

References (6)
  • 2
    • 24044537711 scopus 로고    scopus 로고
    • Learning instanceindependent value functions to enhance local search
    • Moll, R., Barto, A. G., Perkins, T. J., & Sutton, R. S. (1999). Learning instanceindependent value functions to enhance local search. NIPS-II, 1017-1023.
    • (1999) NIPS-II , pp. 1017-1023
    • Moll, R.1    Barto, A.G.2    Perkins, T.J.3    Sutton, R.S.4
  • 3
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-Gammon
    • Tesauro, G. (1995). Temporal difference learning and TD-Gammon. CACM, 28(3), 58-68.
    • (1995) CACM , vol.28 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 4
    • 0040264113 scopus 로고
    • Learning a preference predicate
    • Utgoff, P. E., & Saxena, S. (1987). Learning a preference predicate. In ICML-87, 115-121.
    • (1987) ICML-87 , pp. 115-121
    • Utgoff, P.E.1    Saxena, S.2
  • 6
    • 84918834208 scopus 로고
    • A reinforcement learning approach to jobshop scheduling
    • Zhang, W., & Dietterich, T. G. (1995). A reinforcement learning approach to jobshop scheduling. In IJCAI95, 1114-1120.
    • (1995) IJCAI95 , pp. 1114-1120
    • Zhang, W.1    Dietterich, T.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.