Volume , Issue , 2007, Pages 64-69

Hysteretic Q-Learning: An algorithm for decentralized reinforcement learning in cooperative multi-agent teams

Author keywords

[No Author keywords available]

Indexed keywords

BOOLEAN FUNCTIONS; DAMPING; DISTRIBUTED PARAMETER CONTROL SYSTEMS; EDUCATION; HYSTERESIS; INTELLIGENT AGENTS; INTELLIGENT ROBOTS; INTELLIGENT SYSTEMS; REINFORCEMENT; REINFORCEMENT LEARNING; ROBOTICS; ROBOTS; STIFFNESS;

EID: 51349117828     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IROS.2007.4399095     Document Type: Conference Paper
Times cited: 250

References (18)
  • 2
    • 0034205975
    • Multiagent systems: A survey from a machine learning perspective
    • P. Stone and M. M. Veloso, "Multiagent systems: A survey from a machine learning perspective," Autonomous Robots, vol. 8, no. 3, pp. 345-383, 2000.
    • (2000) Autonomous Robots , vol.8 , Issue.3 , pp. 345-383
    • Stone, P.; Veloso, M.M.
  • 4
  • 5
    • 33744789733
    • Fuzzy reinforcement learning for embedded soccer agents in a multi-agent context
    • A. M. Tehrani, M. S. Kamel, and A. M. Khamis, "Fuzzy reinforcement learning for embedded soccer agents in a multi-agent context," Int. J. Robot. Autom., vol. 21, no. 2, pp. 110-119, 2006.
    • (2006) Int. J. Robot. Autom , vol.21 , Issue.2 , pp. 110-119
    • Tehrani, A.M.; Kamel, M.S.; Khamis, A.M.
  • 6
    • 34547223380
    • Decentralized reinforcement learning control of a robotic manipulator
    • Singapore, Dec
    • L. Busoniu, R. Babuska, and B. D. Schutter, "Decentralized reinforcement learning control of a robotic manipulator," in Proc. of the 9th ICARCV, Singapore, Dec. 2006, pp. 1347-1352.
    • (2006) Proc. of the 9th ICARCV , pp. 1347-1352
    • Busoniu, L.; Babuska, R.; Schutter, B.D.
  • 7
    • 0010220982
    • Planning, learning and coordination in multiagent decision processes
    • C. Boutilier, "Planning, learning and coordination in multiagent decision processes," in Theoretical Aspects of Rationality and Knowledge, 1996, pp. 195-201.
    • (1996) Theoretical Aspects of Rationality and Knowledge , pp. 195-201
    • Boutilier, C.
  • 9
    • 34249833101
    • Technical note: Q-learning
    • C. Watkins and P. Dayan, "Technical note: Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
    • (1992) Machine Learning , vol.8 , pp. 279-292
    • Watkins, C.; Dayan, P.
  • 10
    • 85152198941
    • Multiagent reinforcement learning: Independent vs. cooperative agents
    • M. Tan, "Multiagent reinforcement learning: Independent vs. cooperative agents," in 10th International Conference on Machine Learning, 1993, pp. 330-337.
    • (1993) 10th International Conference on Machine Learning , pp. 330-337
    • Tan, M.
  • 12
    • 0012286079
    • An algorithm for distributed reinforcement learning in cooperative multi-agent systems
    • Morgan Kaufmann, San Francisco, CA
    • M. Lauer and M. Riedmiller, "An algorithm for distributed reinforcement learning in cooperative multi-agent systems," in Proc. 17th ICML. Morgan Kaufmann, San Francisco, CA, 2000, pp. 535-542.
    • (2000) Proc. 17th ICML , pp. 535-542
    • Lauer, M.; Riedmiller, M.
  • 13
    • 4544251885
    • Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems
    • S. Kapetanakis and D. Kudenko, "Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems," in Proc. of AAMAS '04, 2004, pp. 1258-1259.
    • (2004) Proc. of AAMAS '04 , pp. 1258-1259
    • Kapetanakis, S.; Kudenko, D.
  • 14
    • 0032208335
    • Elevator group control using multiple reinforcement learning agents
    • R. H. Crites and A. G. Barto, "Elevator group control using multiple reinforcement learning agents," Machine Learning, vol. 33, no. 2-3, pp. 235-262, 1998.
    • (1998) Machine Learning , vol.33 , Issue.2-3 , pp. 235-262
    • Crites, R.H.; Barto, A.G.
  • 15
    • 34250651573
    • Multi-robot box-pushing: Single-agent q-learning vs. team q-learning
    • Y. Wang and C. W. de Silva, "Multi-robot box-pushing: Single-agent q-learning vs. team q-learning," in Proc. of IROS, 2006, pp. 3694-3699.
    • (2006) Proc. of IROS , pp. 3694-3699
    • Wang, Y.; de Silva, C.W.
  • 16
    • 28444436227
    • Adaptive organization of generalized behavioral concepts for autonomous robots: Schema-based modular reinforcement learning
    • June
    • T. Taniguchi and T. Sawaragi, "Adaptive organization of generalized behavioral concepts for autonomous robots: schema-based modular reinforcement learning," in Proc. of Computational Intelligence in Robotics and Automation, June 2005, pp. 601-606.
    • (2005) Proc. of Computational Intelligence in Robotics and Automation , pp. 601-606
    • Taniguchi, T.; Sawaragi, T.
  • 17
    • 34250672679
    • Improving reinforcement learning speed for robot control
    • L. Matignon, G. J. Laurent, and N. LeFort-Piat, "Improving reinforcement learning speed for robot control," in Proc. of IROS, 2006, pp. 3172-3177.
    • (2006) Proc. of IROS , pp. 3172-3177
    • Matignon, L.; Laurent, G.J.; LeFort-Piat, N.
  • 18
    • 51349111777
    • On optimal cooperation of knowledge sources - an experimental investigation
    • M. Benda, V. Jagannathan, and R. Dodhiawala, "On optimal cooperation of knowledge sources - an experimental investigation." Boeing Advanced Technology Center, Boeing Computing Services, Seattle, Washington, Tech. Rep. BCS-G2010-280, 1986.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.