Volume , Issue , 2000, Pages 461-462

A multiagent variant of Dyna-Q

Author keywords

[No Author keywords available]

Indexed keywords

MULTIPLE AGENTS; SINGLE-AGENT;

EID: 84962092260     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICMAS.2000.858525     Document Type: Conference Paper
Times cited: 4

References (4)
  • 1
    • 0012929784
    • Dyna, an integrated architecture for learning, planning, and reacting
    • R. Sutton. Dyna, an integrated architecture for learning, planning, and reacting. SIGART Bulletin, 2:160-163, 1991.
    • (1991) SIGART Bulletin , vol.2 , pp. 160-163
    • Sutton, R.
  • 3
    • 0004049893
    • PhD thesis, King's College, Cambridge University
    • C. Watkins. Learning from Delayed Rewards. PhD thesis, King's College, Cambridge University, 1989.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.