



Volume , Issue , 2011, Pages 2173-2180

Swarm reinforcement learning methods for problems with continuous state-action space

Author keywords

particle swarm optimization; reinforcement learning; swarm intelligence

Indexed keywords

ACTOR-CRITIC METHODS; BIPED ROBOT; CONTINUOUS STATE-ACTION SPACES; INDIVIDUAL LEARNING; MULTIPLE SET; NUMERICAL EXPERIMENTS; OPTIMAL POLICIES; Q-LEARNING METHOD; REINFORCEMENT LEARNING METHOD; SWARM INTELLIGENCE; SWARM REINFORCEMENT LEARNING;

EID: 83755225300     PISSN: 1062922X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICSMC.2011.6083999     Document Type: Conference Paper
Times cited : (7)

References (13)
  • 3
    • 34250753459
    • Reinforcement learning through interaction among multiple agents
    • CD-ROM
    • H. Iima and Y. Kuroe, "Reinforcement learning through interaction among multiple agents", SICE-ICASE International Joint Conference 2006 CD-ROM, pp.2457-2462, 2006.
    • (2006) SICE-ICASE International Joint Conference 2006 , pp. 2457-2462
    • Iima, H.1    Kuroe, Y.2
  • 8
    • 0008336447
    • An analysis of actor/critic algorithms using eligibility traces: Reinforcement learning with imperfect value function
    • H. Kimura and S. Kobayashi, "An analysis of actor/critic algorithms using eligibility traces: Reinforcement learning with imperfect value function", 15th International Conference on Machine Learning, pp.278-286, 1998.
    • (1998) 15th International Conference on Machine Learning , pp. 278-286
    • Kimura, H.1    Kobayashi, S.2
  • 9
    • 0033629916
    • Reinforcement learning in continuous time and space
    • K. Doya, "Reinforcement learning in continuous time and space", Neural Computation, Vol.12, pp.219-245, 2000.
    • (2000) Neural Computation , vol.12 , pp. 219-245
    • Doya, K.1
  • 10
    • 83755205139
    • Reinforcement learning for rhythmic movements using a neural oscillator network
    • in Japanese
    • Y. Nakamura, M. Sato, and S. Ishii, "Reinforcement learning for rhythmic movements using a neural oscillator network", IEICE Transactions on Information and Systems, Vol.J87-D-II, No.3, pp.893-902, 2004 (in Japanese).
    • (2004) IEICE Transactions on Information and Systems , vol.J87-D-II , Issue.3 , pp. 893-902
    • Nakamura, Y.1    Sato, M.2    Ishii, S.3
  • 13
    • 0026045478
    • Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment
    • G. Taga, Y. Yamaguchi, and H. Shimizu, "Self-organized control of bipedal locomotion by neural oscillators in unpredictable environment", Biological Cybernetics, Vol.65, pp.147-159, 1991.
    • (1991) Biological Cybernetics , vol.65 , pp. 147-159
    • Taga, G.1    Yamaguchi, Y.2    Shimizu, H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.