-
1
-
-
40949147745
-
A comprehensive survey of multiagent reinforcement learning
-
DOI 10.1109/TSMCC.2007.913919
-
Busoniu L, Babuska R, De Schutter B,. A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on System Man Cybernetics: Part C 2008; 38 (2): 156-172. (Pubitemid 351404112)
-
(2008)
IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews
, vol.38
, Issue.2
, pp. 156-172
-
-
Busoniu, L.1
Babuska, R.2
De Schutter, B.3
-
3
-
-
33746826183
-
Multiagent reinforcement learning for multi-robot systems: A survey
-
Yang E, Gu D,. Multiagent reinforcement learning for multi-robot systems: a survey. Technical Report CSM-404, Department of Computer Science, Univervisty of Essex, Colchester, UK, 2004.
-
(2004)
Technical Report CSM-404, Department of Computer Science, Univervisty of Essex, Colchester, UK
-
-
Yang, E.1
Gu, D.2
-
4
-
-
84865781568
-
Self-organization for coordinating decentralized reinforcement learning
-
In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
-
Zhang C, Lesser V, Abdallah S,. Self-organization for coordinating decentralized reinforcement learning. In Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2010; 739-746.
-
(2010)
Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems
, pp. 739-746
-
-
Zhang, C.1
Lesser, V.2
Abdallah, S.3
-
5
-
-
27344449757
-
Decentralized control of cooperative systems: Categorization and complexity analysis
-
Goldman CV, Zilberstein S,. Decentralized control of cooperative systems: categorization and complexity analysis. Journal of Artificial Intelligence Research 2004; 22: 143-174. (Pubitemid 41525885)
-
(2004)
Journal of Artificial Intelligence Research
, vol.22
, pp. 143-174
-
-
Goldman, C.V.1
Zilberstein, S.2
-
6
-
-
34247560904
-
A hybrid reinforcement learning approach to autonomic resource allocation
-
1662383, Proceedings - 3rd International Conference on Autonomic Computing, ICAC 2006
-
Tesauro G, Jong NK, Das R, Bennani MN, A hybrid reinforcement learning approach to autonomic resource allocation. In 2006 IEEE International Conference on Autonomic Computing. IEEE Press: New York, 2006; 65-73. (Pubitemid 46666907)
-
(2006)
Proceedings - 3rd International Conference on Autonomic Computing, ICAC 2006
, vol.2006
, pp. 65-73
-
-
Tesauro, G.1
Jong, N.K.2
Das, R.3
Bennani, M.N.4
-
7
-
-
28844499442
-
Resource allocation in the Grid with learning agents
-
DOI 10.1007/s10723-005-9003-7
-
Galstyan A, Czajkowski K, Lerman K,. Resource allocation in the grid with learning agents. Journal of Grid Computing 2005; 3 (1): 91-100. (Pubitemid 41762487)
-
(2005)
Journal of Grid Computing
, vol.3
, Issue.1-2
, pp. 91-100
-
-
Galstyan, A.1
Czajkowski, K.2
Lerman, K.3
-
8
-
-
0036274424
-
Pricing in agent economies using multi-agent Q-learning
-
DOI 10.1023/A:1015504423309
-
Tesauro G, Kephart JO,. Pricing in agent economies using multi-agent Q-learning. Autonomous Agent and Multi-Agent Systems 2002; 5 (3): 289-304. (Pubitemid 37113883)
-
(2002)
Autonomous Agents and Multi-Agent Systems
, vol.5
, Issue.3
, pp. 289-304
-
-
Tesauro, G.1
Kephart, J.O.2
-
9
-
-
34249045960
-
Perspectives on multiagent learning
-
DOI 10.1016/j.artint.2007.02.004, PII S0004370207000525, Foundations of Multi-Agent Learning
-
Sandholm T,. Perspectives on multiagent learning. Journal of Artificial Intelligence 2007; 171 (7): 382-391. (Pubitemid 46802424)
-
(2007)
Artificial Intelligence
, vol.171
, Issue.7
, pp. 382-391
-
-
Sandholm, T.1
-
10
-
-
34249024789
-
Multiagent learning is not the answer. It is the question
-
DOI 10.1016/j.artint.2006.12.005, PII S0004370207000021, Foundations of Multi-Agent Learning
-
Stone P,. Multiagent learning is not the answer. It is the question. Journal of Artificial Intelligence 2007; 171 (7): 402-405. (Pubitemid 46802413)
-
(2007)
Artificial Intelligence
, vol.171
, Issue.7
, pp. 402-405
-
-
Stone, P.1
-
11
-
-
34147161536
-
If multi-agent learning is the answer,what is the question?
-
Shoham Y, Powers B, Grenager T,. If multi-agent learning is the answer,what is the question?. Journal of Artificial Intelligence 2007; 171 (7): 365-377.
-
(2007)
Journal of Artificial Intelligence
, vol.171
, Issue.7
, pp. 365-377
-
-
Shoham, Y.1
Powers, B.2
Grenager, T.3
-
13
-
-
79955976414
-
Decentralized MDPs with sparse interactions
-
Melo FS, Veloso M,. Decentralized MDPs with sparse interactions. Artifcial Intelligence 2011; 175: 1757-1789.
-
(2011)
Artifcial Intelligence
, vol.175
, pp. 1757-1789
-
-
Melo, F.S.1
Veloso, M.2
-
14
-
-
40949099898
-
Utile coordination: Learning interdependencies among cooperative agents
-
In. IEEE Press: New York
-
Kok JR, Hoen P, Bakker B, Vlassis N,. Utile coordination: learning interdependencies among cooperative agents. In Proceedings of Symposium on Computational Intelligence and Games. IEEE Press: New York, 2005; 29-36.
-
(2005)
Proceedings of Symposium on Computational Intelligence and Games
, pp. 29-36
-
-
Kok, J.R.1
Hoen, P.2
Bakker, B.3
Vlassis, N.4
-
16
-
-
84867671358
-
Learning multi-agent state space representations
-
In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
-
De H Y M, Vrancx P, Nowé A,. Learning multi-agent state space representations. In Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2010; 715-722.
-
(2010)
Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems
, pp. 715-722
-
-
De Y, M.H.1
Vrancx, P.2
Nowé, A.3
-
17
-
-
84873855111
-
Learning what to observe in multi-agent systems
-
In. University of Twente Publisher: Enschede, the Netherlands
-
De Hauwere YM, Vrancx P, Nowé A,. Learning what to observe in multi-agent systems. In Proceedings of 20th Belgian-Netherlands Conference on Artificial Intelligence. University of Twente Publisher: Enschede, the Netherlands, 2009; 83-90.
-
(2009)
Proceedings of 20th Belgian-Netherlands Conference on Artificial Intelligence
, pp. 83-90
-
-
De Hauwere, Y.M.1
Vrancx, P.2
Nowé, A.3
-
18
-
-
26944461811
-
Sparse tabular multiagent Q-learning
-
Kok JR, Vlassis N,. Sparse tabular multiagent Q-learning. Annual Machine Learning Conference of Belgium and the Netherlands, Brussels, Belgium, 2004; 65-71.
-
(2004)
Annual Machine Learning Conference of Belgium and the Netherlands, Brussels, Belgium
, pp. 65-71
-
-
Kok, J.R.1
Vlassis, N.2
-
22
-
-
0001395498
-
Distributed value functions
-
In. Morgan Kaufmann Publishers: San Mateo, CA
-
Schneider J, Wong WK, Moore A, Riedmiller M,. Distributed value functions. In Proceedings of the 16th International Conference on Machine Learning. Morgan Kaufmann Publishers: San Mateo, CA, 1999; 371-378.
-
(1999)
Proceedings of the 16th International Conference on Machine Learning
, pp. 371-378
-
-
Schneider, J.1
Wong, W.K.2
Moore, A.3
Riedmiller, M.4
-
25
-
-
84899992307
-
Interaction-driven Markov games for decentralized multiagent planning under uncertainty
-
In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
-
Spaan M, Melo FS,. Interaction-driven Markov games for decentralized multiagent planning under uncertainty. In Proceedings of 7th International Conference on Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2008; 525-532.
-
(2008)
Proceedings of 7th International Conference on Autonomous Agents and Multiagent Systems
, pp. 525-532
-
-
Spaan, M.1
Melo, F.S.2
-
26
-
-
84899840405
-
Learning of coordination: Exploiting sparse interactions in multiagent systems
-
In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
-
Melo FS, Veloso M,. Learning of coordination: exploiting sparse interactions in multiagent systems. In Proceedings of 8th International Conference on Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2009; 772-780.
-
(2009)
Proceedings of 8th International Conference on Autonomous Agents and Multiagent Systems
, pp. 772-780
-
-
Melo, F.S.1
Veloso, M.2
-
30
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
Bernstein DS, Givan R, Immerman N, Zilberstein S,. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research 2002; 27 (4): 819-840.
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.S.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
33
-
-
51649127552
-
Formal models and algorithms for decentralized decision making under uncertainty
-
Seuken S, Zilberstein S,. Formal models and algorithms for decentralized decision making under uncertainty. Autonomous Agents and Multi-Agent Systems 2008; 17 (2): 190-250.
-
(2008)
Autonomous Agents and Multi-Agent Systems
, vol.17
, Issue.2
, pp. 190-250
-
-
Seuken, S.1
Zilberstein, S.2
-
34
-
-
0036355306
-
Multiagent teamwork: Analyzing the optimality and complexity of key theories and models
-
Pynadath DV, Tambe M,. Multiagent teamwork: analyzing the optimality and complexity of key theories and models. In Proceedings of the First International Joint Conference on Autonomous Agents and Multiagent Systems: Part 2. ACM Press: New York, 2002; 873-880. (Pubitemid 34975283)
-
(2002)
Proceedings of the International Conference on Autonomous Agents
, Issue.1
, pp. 873-880
-
-
Pynadath, D.V.1
Tambe, M.2
-
36
-
-
84880690163
-
Sequential optimality and coordination in multiagent systems
-
In. Morgan Kaufmann Publishers: San Mateo, CA
-
Boutilier C,. Sequential optimality and coordination in multiagent systems. In International Joint Conference on Artificial Intelligence. Morgan Kaufmann Publishers: San Mateo, CA, 1999; 478-485.
-
(1999)
International Joint Conference on Artificial Intelligence
, pp. 478-485
-
-
Boutilier, C.1
-
37
-
-
1142293055
-
Transition-independent decentralized Markov decision processes
-
In. ACM Press: New York
-
Becker R, Zilberstein S, Lesser V, Goldman CV,. Transition-independent decentralized Markov decision processes. In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems. ACM Press: New York, 2003; 41-48.
-
(2003)
Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems
, pp. 41-48
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.V.4
-
38
-
-
27344432831
-
Solving transition independent decentralized Markov decision processes
-
Becker R, Zilberstein S, Lesser V, Goldman CV,. Solving transition independent decentralized Markov decision processes. Journal of Artificial Intelligence Research 2004; 22: 423-455. (Pubitemid 41525892)
-
(2004)
Journal of Artificial Intelligence Research
, vol.22
, pp. 423-455
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.V.4
-
39
-
-
57749106245
-
Interaction structure and dimensionality in decentralized problem solving
-
In. ACM Press: New York
-
Allen M, Petrik M, Zilberstein S,. Interaction structure and dimensionality in decentralized problem solving. In Conference on Artificial Intelligence (AAAI). ACM Press: New York, 2008; 1440-1441.
-
(2008)
Conference on Artificial Intelligence (AAAI)
, pp. 1440-1441
-
-
Allen, M.1
Petrik, M.2
Zilberstein, S.3
-
40
-
-
0031630561
-
The dynamics of reinforcement learning in cooperative multiagent systems
-
In. AAAI Press: Menlo Park, California
-
Claus C, Boutilier C,. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of National of Conference on Artificial Intelligence. AAAI Press: Menlo Park, California, 1998; 746-752.
-
(1998)
Proceedings of National of Conference on Artificial Intelligence
, pp. 746-752
-
-
Claus, C.1
Boutilier, C.2
-
41
-
-
0002109085
-
IMulti-agent reinforcement learning: Independent vs. Cooperative agents
-
In. Morgan Kaufmann Publishers: San Mateo, CA
-
Tan M,. IMulti-agent reinforcement learning: independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning. Morgan Kaufmann Publishers: San Mateo, CA, 1993; 1440-1441.
-
(1993)
Proceedings of the Tenth International Conference on Machine Learning
, pp. 1440-1441
-
-
Tan, M.1
-
42
-
-
0028555752
-
Learning to coordinate without sharing information
-
In. John Wiley & Sons, Inc.: Hoboken, New Jersey
-
Sen S, Sekaran M, Hale J, Learning to coordinate without sharing information. In Proceedings of the National Conference on Artificial Intelligence. John Wiley & Sons, Inc.: Hoboken, New Jersey, 1994; 426-426.
-
(1994)
Proceedings of the National Conference on Artificial Intelligence
, pp. 426-426
-
-
Sen, S.1
Sekaran, M.2
Hale, J.3
|