Volume , Issue , 2002, Pages 326-331

Reinforcement learning of coordination in cooperative multi-agent systems

Author keywords

[No Author keywords available]

Indexed keywords

COSTS; GAME THEORY; HEURISTIC METHODS; LEARNING SYSTEMS; PROBABILITY; PROBLEM SOLVING;

EID: 0036932299     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 173

References (11)
  • 6
    • 0032359707
    • Individual learning of coordination knowledge
    • Sen, S., and Sekaran, M. 1998. Individual learning of coordination knowledge. JETAI 10(3):333-356.
    • (1998) JETAI , vol.10 , Issue.3 , pp. 333-356
    • Sen, S.1    Sekaran, M.2
  • 8
    • 0033901602
    • Convergence results for single-step on-policy reinforcement-learning algorithms
    • Singh, S.; Jaakkola, T.; Littman, M. L.; and Szepesvári, C. 2000. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning Journal 38(3):287-308.
    • (2000) Machine Learning Journal , vol.38 , Issue.3 , pp. 287-308
    • Singh, S.1    Jaakkola, T.2    Littman, M.L.3    Szepesvári, C.4
  • 10
    • 0004049893
    • Ph.D. Dissertation, Cambridge University, Cambridge, England
    • Watkins, C. J. C. H. 1989. Learning from Delayed Rewards. Ph.D. Dissertation, Cambridge University, Cambridge, England.
    • (1989) Learning from Delayed Rewards
    • Watkins, C.J.C.H.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.