메뉴 건너뛰기




Volumn 2, Issue , 2003, Pages 2727-2734

Learning to role-switch in multi-robot systems

Author keywords

Multi robot systems; Q learning; Role switching

Indexed keywords

FINITE AUTOMATA; INTELLIGENT AGENTS; OBJECT RECOGNITION; ROBOT LEARNING; STATE SPACE METHODS;

EID: 0344014098     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (43)

References (17)
  • 1
    • 0011962595 scopus 로고    scopus 로고
    • Probabilistic planning for behavior-based robots
    • Atrash, A. and Koenig, S., "Probabilistic Planning for Behavior-based Robots", (2001). Proc. FLAIRS-01, Key West, FL., pp.531-535.
    • (2001) Proc. FLAIRS-01, Key West, FL , pp. 531-535
    • Atrash, A.1    Koenig, S.2
  • 3
    • 0003404197 scopus 로고    scopus 로고
    • Behavioral diversity in learning robot teams
    • Ph.D. Dissertation, College of Computing, Georgia Tech.
    • Balch, T. (1998). "Behavioral Diversity in Learning Robot Teams", Ph.D. Dissertation, College of Computing, Georgia Tech.
    • (1998)
    • Balch, T.1
  • 4
    • 23044520797 scopus 로고    scopus 로고
    • Big red: The cornell small league robot soccer team
    • D'Andrea, R. et al. (2000) "Big Red: The Cornell Small League Robot Soccer Team", in Lecture Note in Computer Science, v. 1856, p 657-660
    • (2000) Lecture Note in Computer Science , vol.1856 , pp. 657-660
    • D'Andrea, R.1
  • 5
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, Thomas G. {2000}. "Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition", Journal of Artificial Intelligence Research, 13, pp. 227-303
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 10
    • 0032051328 scopus 로고    scopus 로고
    • Evaluating the usability of robot programming toolsets
    • MacKenzie, D. and Arkin, R., "Evaluating the Usability of Robot Programming Toolsets", (1998). International Journal of Robotics Research, Vol. 4, No. 7, pp.381-401
    • (1998) International Journal of Robotics Research , vol.4 , Issue.7 , pp. 381-401
    • MacKenzie, D.1    Arkin, R.2
  • 11
    • 85158146654 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • Mahadevan, S. and Connell, J., (1991). "Automatic Programming of Behavior-Based Robots Using Reinforcement Learning", Proc. AAAI-91, pp. 768-73.
    • (1991) Proc. AAAI-91 , pp. 768-773
    • Mahadevan, S.1    Connell, J.2
  • 14
    • 84957895797 scopus 로고
    • Reward functions for accelerated learning
    • William W. Cohen and Haym Hirsh, eds., Morgan Kaufmann Publishers, San Francisco, CA
    • Mataric′, Maja J. "Reward Functions for Accelerated Learning" in Machine Learning: Proceedings of the Eleventh International Conference, William W. Cohen and Haym Hirsh, eds., Morgan Kaufmann Publishers, San Francisco, CA, 1994, 181-189.
    • (1994) Machine Learning: Proceedings of the Eleventh International Conference , pp. 181-189
    • Mataric, M.J.1
  • 17
    • 0004049893 scopus 로고
    • Learning from delayed rewards
    • Ph.D. Thesis, King's College, Cambridge, UK
    • Watkins, C. (1989). "Learning from Delayed Rewards", Ph.D. Thesis, King's College, Cambridge, UK.
    • (1989)
    • Watkins, C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.