메뉴 건너뛰기




Volumn , Issue , 2011, Pages

Speedy Q-learning

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS;

EID: 85162416897     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (131)

References (20)
  • 10
    • 0000439891 scopus 로고
    • On the convergence of stochastic iterative dynamic programming
    • T. Jaakkola, M. I. Jordan, and S. Singh. On the convergence of stochastic iterative dynamic programming. Neural Computation, 6(6):1185-1201, 1994.
    • (1994) Neural Computation , vol.6 , Issue.6 , pp. 1185-1201
    • Jaakkola, T.1    Jordan, M.I.2    Singh, S.3
  • 12
    • 84899026236 scopus 로고    scopus 로고
    • Finite-sample convergence rates for Q-learning and indirect algorithms
    • MIT Press
    • M. Kearns and S. Singh. Finite-sample convergence rates for Q-learning and indirect algorithms. In Advances in Neural Information Processing Systems 12, pages 996-1002. MIT Press, 1999.
    • (1999) Advances in Neural Information Processing Systems , vol.12 , pp. 996-1002
    • Kearns, M.1    Singh, S.2
  • 14
    • 0000955979 scopus 로고    scopus 로고
    • Incremental multi-step q-learning
    • J. Peng and R. J. Williams. Incremental multi-step Q-learning. Machine Learning, 22(1-3):283-290, 1996.
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 283-290
    • Peng, J.1    Williams, R.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.