SCOPUS Information Search Platform

Volume 16, Issue 2, 1999, Pages

Convergence of discretization procedure in Q-learning

Author keywords

Continuous state and control spaces; Discretization; Dynamic programming; Markovian decision problems; Q Learning

Indexed keywords

EID: 77449144618 PISSN: 10008152 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited: 7

References (5)

1
- 0004049893
- Ph.D. Dissertation. King's College, UK
- Watkins C J C H. Learning from delayed rewards. Ph.D. Dissertation, King's College, UK, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.

2
- 0016515669
- Convergence of discretization procedure in dynamic programming
- Bertsekas D P. Convergence of discretization procedure in dynamic programming. IEEE Trans. Automat. Contr., 1975, AC-20:415-419.
- (1975) IEEE Trans. Automat. Contr., vol. AC-20, pp. 415-419
- Bertsekas, D.P.

3
- 0017997986
- Approximations of dynamic programs I
- Whitt W. Approximations of dynamic programs I. Mathematics of Operations Research, 1978, 3:231-243.
- (1978) Mathematics of Operations Research, vol. 3, pp. 231-243
- Whitt, W.

4
- 0026206780
- An optimal one-way multigrid algorithm for discrete-time stochastic control
- Chow C S & Tsitsiklis J N. An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Trans. Automat. Contr., 1991, AC-36:898-914.
- (1991) IEEE Trans. Automat. Contr., vol. AC-36, pp. 898-914
- Chow, C.S.; Tsitsiklis, J.N.

5
- 0003923091
- Academic Press
- Bertsekas D P. Stochastic optimal control: the discrete time case. Academic Press, 1978.
- (1978) Stochastic Optimal Control: The Discrete Time Case
- Bertsekas, D.P.

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.