Volume 2, 2004, Pages 1643-1648

An adaptation of particle swarm optimization for Markov decision processes

Author keywords

Markov decision process; Particle swarm optimization; Reinforcement learning

Indexed keywords

MARKOV DECISION PROCESS; MULTIDIMENSIONAL SEARCHING; PARTICLE SWARM OPTIMIZATION; REINFORCEMENT LEARNING

EID: 15744369589     PISSN: 1062922X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICSMC.2004.1399867     Document Type: Conference Paper
Times cited: 6

References (15)
  • 3
    • 1542319250
    • An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems
    • H. S. Chang, M. Fu, and S. I. Marcus, "An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems," in Proc. of the 42nd IEEE Conf. Decision and Control, 2003.
    • (2003) Proc. of the 42nd IEEE Conf. Decision and Control
    • Chang, H.S.; Fu, M.; Marcus, S.I.
  • 5
    • 3543128853
    • Parallel rollout for on-line solution of partially observable Markov decision processes
    • H. S. Chang, R. Givan, and E. K. P. Chong, "Parallel rollout for on-line solution of partially observable Markov decision processes," Discrete Event Dynamic Systems: Theory and Applications, Vol. 14, No. 3, pp. 309-341, 2004.
    • (2004) Discrete Event Dynamic Systems: Theory and Applications, vol. 14, no. 3, pp. 309-341
    • Chang, H.S.; Givan, R.; Chong, E.K.P.
  • 13
    • 0033901602
    • Convergence results for single-step on-policy reinforcement learning algorithms
    • S. Singh, T. Jaakkola, M. Littman, and C. Szepesvari, "Convergence results for single-step on-policy reinforcement learning algorithms," Machine Learning, Vol. 38, pp. 287-308, 2000.
    • (2000) Machine Learning, vol. 38, pp. 287-308
    • Singh, S.; Jaakkola, T.; Littman, M.; Szepesvari, C.
  • 15
    • 0028497630
    • Asynchronous stochastic approximation and Q-learning
    • J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learning, Vol. 16, pp. 185-202, 1994.
    • (1994) Machine Learning, vol. 16, pp. 185-202
    • Tsitsiklis, J.N.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.