Volume 16, Issue 2, 1999, Pages

Convergence of discretization procedure in Q-learning

Author keywords

Continuous state and control spaces; Discretization; Dynamic programming; Markovian decision problems; Q Learning


EID: 77449144618     PISSN: 10008152     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (7)

References (5)
  • 2
    • 0016515669
    • Convergence of discretization procedure in dynamic programming
    • Bertsekas D P. Convergence of discretization procedure in dynamic programming. IEEE Trans. Automat. Contr., 1975, AC-20:415-419.
    • (1975) IEEE Trans. Automat. Contr., vol. AC-20, pp. 415-419
    • Bertsekas, D.P.1
  • 3
    • 0017997986
    • Approximations of dynamic programs I
    • Whitt W. Approximations of dynamic programs I. Mathematics of Operations Research, 1978, 3:231-243.
    • (1978) Mathematics of Operations Research, vol. 3, pp. 231-243
    • Whitt, W.1
  • 4
    • 0026206780
    • An optimal one-way multigrid algorithm for discrete-time stochastic control
    • Chow C S & Tsitsiklis J N. An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Trans. Automat. Contr., 1991, AC-36:898-914.
    • (1991) IEEE Trans. Automat. Contr., vol. AC-36, pp. 898-914
    • Chow, C.S.1    Tsitsiklis, J.N.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.