-
1
-
-
27344432831
-
Solving transition independent decentralized Markov decision processes
-
R. Becker, S. Zilberstein, V. Lesser, and C. Goldman. Solving transition independent decentralized Markov decision processes. J. Artificial Intelligence Research, 22:423-455, 2004.
-
(2004)
J. Artificial Intelligence Research
, vol.22
, pp. 423-455
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.4
-
3
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
D. Bernstein, S. Zilberstein, and N. Immerman. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4):819-840, 2002.
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.1
Zilberstein, S.2
Immerman, N.3
-
7
-
-
4544325183
-
Approximate solutions for partially observable stochastic games with common payoffs
-
R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Approximate solutions for partially observable stochastic games with common payoffs. In Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems, volume 1, pages 136-143, 2004.
-
(2004)
Proc. Int. Joint Conf. Autonomous Agents and Multi Agent Systems
, vol.1
, pp. 136-143
-
-
Emery-Montemerlo, R.1
Gordon, G.2
Schneider, J.3
Thrun, S.4
-
9
-
-
4644369748
-
Nash Q-learning for general sum stochastic games
-
J. Hu and M. Wellman. Nash Q-learning for general sum stochastic games. J. Machine Learning Research, 4:1039-1069, 2003.
-
(2003)
J. Machine Learning Research
, vol.4
, pp. 1039-1069
-
-
Hu, J.1
Wellman, M.2
-
10
-
-
40949099898
-
Utile coordination: Learning interdependencies among cooperative agents
-
J. Kok, P. Hoen, B. Bakker, and N. Vlassis. Utile coordination: Learning interdependencies among cooperative agents. In Proc. Symp. on Computational Intelligence and Games, pages 29-36, 2005.
-
(2005)
Proc. Symp. on Computational Intelligence and Games
, pp. 29-36
-
-
Kok, J.1
Hoen, P.2
Bakker, B.3
Vlassis, N.4
-
11
-
-
0000619048
-
Extensive games and the problem of information
-
H. Kuhn. Extensive games and the problem of information. Annals of Mathematics Studies, 28:193-216, 1953.
-
(1953)
Annals of Mathematics Studies
, vol.28
, pp. 193-216
-
-
Kuhn, H.1
-
12
-
-
0031632806
-
Solving very large weakly coupled Markov decision processes
-
N. Meuleau, M. Hauskrecht, K. Kim, L. Peshkin, L. Kaelbling, T. Dean, and C. Boutilier. Solving very large weakly coupled Markov decision processes. In Proc. Nat. Conf. Artificial Intelligence, pages 165-172, 1998.
-
(1998)
Proc. Nat. Conf. Artificial Intelligence
, pp. 165-172
-
-
Meuleau, N.1
Hauskrecht, M.2
Kim, K.3
Peshkin, L.4
Kaelbling, L.5
Dean, T.6
Boutilier, C.7
-
13
-
-
29344437834
-
Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. in Proc
-
R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In Proc. Nat. Conf. Artificial Intelligence, pages 133-139, 2005.
-
(2005)
Nat. Conf. Artificial Intelligence
, pp. 133-139
-
-
Nair, R.1
Varakantham, P.2
Tambe, M.3
Yokoo, M.4
-
17
-
-
0010276944
-
Implicit imitation in multiagent reinforcement learning
-
B. Price and C. Boutilier. Implicit imitation in multiagent reinforcement learning. In Proc. Int. Conf. Machine Learning, pages 325-334, 1999.
-
(1999)
Proc. Int. Conf. Machine Learning
, pp. 325-334
-
-
Price, B.1
Boutilier, C.2
-
18
-
-
1142292938
-
The communicative multiagent team decision problem: Analyzing teamwork theories and models
-
D. Pynadath and M. Tambe. The communicative multiagent team decision problem: Analyzing teamwork theories and models. J. Artificial Intelligence Research, 16:389-423, 2002.
-
(2002)
J. Artificial Intelligence Research
, vol.16
, pp. 389-423
-
-
Pynadath, D.1
Tambe, M.2
-
19
-
-
84903438513
-
Decentralized communication strategies for coordinated multi-agent policies
-
M. Roth, R. Simmons, and M. Veloso. Decentralized communication strategies for coordinated multi-agent policies. In Multi-Robot Systems: From Swarms to Intelligent Automata, volume III, pages 93-106, 2005.
-
(2005)
Multi-Robot Systems: From Swarms to Intelligent Automata
, vol.3
, pp. 93-106
-
-
Roth, M.1
Simmons, R.2
Veloso, M.3
-
21
-
-
84899022377
-
How to dynamically merge Markov decision processes
-
M. Jordan, M. Kearns, and S. Solla, editors
-
S. Singh and D. Cohn. How to dynamically merge Markov decision processes. In M. Jordan, M. Kearns, and S. Solla, editors, Adv. Neural Information Processing Systems 10, pages 1057-1063, 1998.
-
(1998)
Adv. Neural Information Processing Systems
, vol.10
, pp. 1057-1063
-
-
Singh, S.1
Cohn, D.2
|