-
2
-
-
0034205975
-
Multiagent systems: A survey from a machine learning perspective
-
P. Stone and M. M. Veloso, "Multiagent systems: A survey from a machine learning perspective," Autonomous Robots, vol. 8, no. 3, pp. 345-383, 2000.
-
(2000)
Autonomous Robots
, vol.8
, Issue.3
, pp. 345-383
-
-
Stone, P.1
Veloso, M.M.2
-
4
-
-
34547192059
-
Multi-agent reinforcement learning: A survey
-
December
-
L. Busoniu, R. Babuska, and B. D. Schutter, "Multi-agent reinforcement learning: A survey," in Proc. of the 9th ICARCV, December 2006, pp. 527-532.
-
(2006)
Proc. of the 9th ICARCV
, pp. 527-532
-
-
Busoniu, L.1
Babuska, R.2
Schutter, B.D.3
-
5
-
-
33744789733
-
Fuzzy reinforcement learning for embedded soccer agents in a multi-agent context
-
A. M. Tehrani, M. S. Kamel, and A. M. Khamis, "Fuzzy reinforcement learning for embedded soccer agents in a multi-agent context," Int. J. Robot. Autom., vol. 21, no. 2, pp. 110-119, 2006.
-
(2006)
Int. J. Robot. Autom
, vol.21
, Issue.2
, pp. 110-119
-
-
Tehrani, A.M.1
Kamel, M.S.2
Khamis, A.M.3
-
6
-
-
34547223380
-
Decentralized reinforcement learning control of a robotic manipulator
-
Singapore, Dec
-
L. Busoniu, R. Babuska, and B. D. Schutter, "Decentralized reinforcement learning control of a robotic manipulator," in Proc. of the 9th ICARCV, Singapore, Dec. 2006, pp. 1347-1352.
-
(2006)
Proc. of the 9th ICARCV
, pp. 1347-1352
-
-
Busoniu, L.1
Babuska, R.2
Schutter, B.D.3
-
7
-
-
0010220982
-
Planning, learning and coordination in multiagent decision processes
-
C. Boutilier, "Planning, learning and coordination in multiagent decision processes," in Theoretical Aspects of Rationality and Knowledge, 1996, pp. 195-201.
-
(1996)
Theoretical Aspects of Rationality and Knowledge
, pp. 195-201
-
-
Boutilier, C.1
-
9
-
-
34249833101
-
Technical note: Q-learning
-
C. Watkins and P. Dayan, "Technical note: Q-learning," Machine Learning, vol. 8, pp. 279-292, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
10
-
-
85152198941
-
Multiagent reinforcement learning: Independent vs. cooperative agents
-
M. Tan, "Multiagent reinforcement learning: Independent vs. cooperative agents," in 10th International Conference on Machine Learning, 1993, p. 330 337.
-
(1993)
10th International Conference on Machine Learning
, pp. 330-337
-
-
Tan, M.1
-
12
-
-
0012286079
-
An algorithm for distributed reinforcement learning in cooperative multi-agent systems
-
Morgan Kaufmann, San Francisco, CA
-
M. Lauer and M. Riedmiller, "An algorithm for distributed reinforcement learning in cooperative multi-agent systems," in Proc. 17th ICML. Morgan Kaufmann, San Francisco, CA, 2000, pp. 535-542.
-
(2000)
Proc. 17th ICML
, pp. 535-542
-
-
Lauer, M.1
Riedmiller, M.2
-
13
-
-
4544251885
-
Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems
-
S. Kapetanakis and D. Kudenko, "Reinforcement learning of coordination in heterogeneous cooperative multi-agent systems," in Proc. of AAMAS '04, 2004, pp. 1258-1259.
-
(2004)
Proc. of AAMAS '04
, pp. 1258-1259
-
-
Kapetanakis, S.1
Kudenko, D.2
-
14
-
-
0032208335
-
Elevator group control using multiple reinforcement learning agents
-
R. H. Crites and A. G. Barto, "Elevator group control using multiple reinforcement learning agents," Machine Learning, vol. 33, no. 2-3, pp. 235-262, 1998.
-
(1998)
Machine Learning
, vol.33
, Issue.2-3
, pp. 235-262
-
-
Crites, R.H.1
Barto, A.G.2
-
15
-
-
34250651573
-
Multi-robot box-pushing: Single-agent q-learning vs. team q-learning
-
Y. Wang and C. W. de Silva, "Multi-robot box-pushing: Single-agent q-learning vs. team q-learning," in Proc. of IROS, 2006, pp. 3694-3699.
-
(2006)
Proc. of IROS
, pp. 3694-3699
-
-
Wang, Y.1
de Silva, C.W.2
-
16
-
-
28444436227
-
Adaptive organization of generalized behavioral concepts for autonomous robots: Schema-based modular reinforcement learning
-
June
-
T. Taniguchi and T. Sawaragi, "Adaptive organization of generalized behavioral concepts for autonomous robots: schema-based modular reinforcement learning," in Proc. of Computational Intelligence in Robotics and Automation, June 2005, pp. 601-606.
-
(2005)
Proc. of Computational Intelligence in Robotics and Automation
, pp. 601-606
-
-
Taniguchi, T.1
Sawaragi, T.2
-
17
-
-
34250672679
-
Improving reinforcement learning speed for robot control
-
L.Matignon, G. J. Laurent, and N. LeFort-Piat, "Improving reinforcement learning speed for robot control," in Proc. of IROS, 2006, pp. 3172-3177.
-
(2006)
Proc. of IROS
, pp. 3172-3177
-
-
Matignon, L.1
Laurent, G.J.2
LeFort-Piat, N.3
-
18
-
-
51349111777
-
-
M. Benda, V. Jagannathan, and R. Dodhiawala, On optimal cooperation of knowledge sources - an experimental investigation. Boeing Advanced Technology Center, Boeing Computing Services, Seattle, Washington, Tech. Rep. BCS-G2010-280, 1986.
-
M. Benda, V. Jagannathan, and R. Dodhiawala, "On optimal cooperation of knowledge sources - an experimental investigation." Boeing Advanced Technology Center, Boeing Computing Services, Seattle, Washington, Tech. Rep. BCS-G2010-280, 1986.
-
-
-
|