



Volume 2, 2002, Pages 1296-1301

The necessity of average rewards in cooperative multirobot learning

Author keywords

[No Author keywords available]

Indexed keywords

DECENTRALIZED CONTROL; DYNAMICS; LEARNING ALGORITHMS; MONTE CARLO METHODS; OPTIMAL CONTROL SYSTEMS

EID: 0036057598     PISSN: 10504729     EISSN: None     Source Type: Journal    
DOI: 10.1109/ROBOT.2002.1014721     Document Type: Article
Times cited: 27

References (11)
  • 1
    • 0003849946
    • Interaction and intelligent behavior
    • Ph.D. thesis, MIT EECS
    • (1994)
    • Mataric, M.J.
  • 2
    • 0003577275
    • Heterogeneous multi-robot cooperation
    • Ph.D. thesis, MIT EECS
    • (1994)
    • Parker, L.E.
  • 5
    • 0004049893
    • Learning from delayed rewards
    • Ph.D. thesis, King's College, Cambridge, UK
    • (1989)
    • Watkins, C.J.C.H.
  • 6
    • 0003404197
    • Behavioral diversity in learning robot teams
    • Ph.D. thesis, Dept. of Computer Science, Georgia Tech
    • (1998)
    • Balch, T.
  • 8
    • 0005549669
    • Taxonomies of multirobot task and reward
    • Technical Report, Robotics Institute, CMU
    • (1998)
    • Balch, T.
  • 11
    • 0004013540
    • Introduction to robotics
    • Addison-Wesley, chapter 9
    • (1991), pp. 483-543
    • McKerrow, P.J.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.