Volume , Issue , 1999, Pages 996-1002

Finite-sample convergence rates for Q-learning and indirect algorithms

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ITERATIVE METHODS; OPTIMIZATION

EID: 84899026236     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 255

References (4)
  • 1
    • 0000439891
    • On the convergence of stochastic iterative dynamic programming algorithms
    • Jaakkola, T., Jordan, M. I., and Singh, S. On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation, 6(6), 1185-1201, 1994.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.