Volume 1, 2002, Pages 151-156

Step size adaptation in evolution strategies using reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING

EID: 36348992992     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/CEC.2002.1006225     Document Type: Conference Paper
Times cited: 39

References (11)
  • 7
    • 0029276036
    • Temporal difference learning and TD-Gammon
    • Tesauro, G., "Temporal difference learning and TD-Gammon," Communications of the ACM, 38(3), pp.58-68, 1995.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.
  • 10
    • 20444380868
    • Convergence results for single-step on-policy reinforcement-learning algorithms
    • Singh, S., Jaakkola, T., Littman, M.L., Szepesvári, C., "Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms," Machine Learning, 1999.
    • (1999) Machine Learning
    • Singh, S.    Jaakkola, T.    Littman, M.L.    Szepesvári, C.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.