메뉴 건너뛰기




Volumn 137, Issue 2, 2008, Pages 435-451

Q-learning algorithms with random truncation bounds and applications to effective parallel computing

Author keywords

Convergence; Q learning; Rate of convergence; Recursive algorithms

Indexed keywords

SIGNAL PROCESSING;

EID: 42449139197     PISSN: 00223239     EISSN: 15732878     Source Type: Journal    
DOI: 10.1007/s10957-007-9331-9     Document Type: Article
Times cited : (3)

References (12)
  • 2
    • 0038476095 scopus 로고    scopus 로고
    • Optimal remapping in dynamic bulk synchronous computations via a stochastic control approach
    • Yin, G., Xu, C., Wang, L.Y.: Optimal remapping in dynamic bulk synchronous computations via a stochastic control approach. IEEE Trans. Parallel Distrib. Syst. 14, 51-62 (2003)
    • (2003) IEEE Trans. Parallel Distrib. Syst. , vol.14 , pp. 51-62
    • Yin, G.1    Xu, C.2    Wang, L.Y.3
  • 4
    • 0028497630 scopus 로고
    • Asynchronous stochastic approximation and Q-learning
    • Tsitsiklis, J.N.: Asynchronous stochastic approximation and Q-learning. Mach. Learn. 16, 185-202 (1994)
    • (1994) Mach. Learn. , vol.16 , pp. 185-202
    • Tsitsiklis, J.N.1
  • 6
    • 18344407260 scopus 로고
    • Asymptotic properties of distributed and communicating stochastic approximation algorithms
    • Kushner, H.J., Yin, G.: Asymptotic properties of distributed and communicating stochastic approximation algorithms. SIAM J. Control Optim. 25, 1266-1290 (1987)
    • (1987) SIAM J. Control Optim. , vol.25 , pp. 1266-1290
    • Kushner, H.J.1    Yin, G.2
  • 7
    • 0000040028 scopus 로고
    • Stochastic approximation algorithms for parallel and distributed processing
    • Kushner, H.J., Yin, G.: Stochastic approximation algorithms for parallel and distributed processing. Stochastics 22, 219-250 (1987)
    • (1987) Stochastics , vol.22 , pp. 219-250
    • Kushner, H.J.1    Yin, G.2
  • 8
    • 0011595015 scopus 로고
    • Stochastic approximation procedure with randomly varying truncations
    • Chen, H.F., Zhu, Y.M.: Stochastic approximation procedure with randomly varying truncations. Sci. Sin. (Ser. A) 29, 914-926 (1986)
    • (1986) Sci. Sin. (Ser. A) , vol.29 , pp. 914-926
    • Chen, H.F.1    Zhu, Y.M.2
  • 9
    • 42449118951 scopus 로고
    • On w.p.1 convergence of a parallel stochastic approximation algorithm
    • Yin, G., Zhu, Y.M.: On w.p.1 convergence of a parallel stochastic approximation algorithm. Probab. Eng. Inf. Sci. 3, 55-75 (1989)
    • (1989) Probab. Eng. Inf. Sci. , vol.3 , pp. 55-75
    • Yin, G.1    Zhu, Y.M.2
  • 11
    • 0141862136 scopus 로고    scopus 로고
    • Asymptotic properties of sign algorithms for adaptive filtering
    • Chen, H.-F., Yin, G.: Asymptotic properties of sign algorithms for adaptive filtering. IEEE Trans. Autom. Control 48, 1545-1556 (2003)
    • (2003) IEEE Trans. Autom. Control , vol.48 , pp. 1545-1556
    • Chen, H.-F.1    Yin, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.