-
1
-
-
0031632806
-
Solving very large weakly coupled Markov decision processes
-
N. Meuleau, M. Hauskrecht, K. Kim, L. Peshkin, L. Kaelbling, T. Dean, C. Boutilier, Solving very large weakly coupled Markov decision processes, in: Proc. 15th AAAI Conf. Artificial Intelligence, 1998, pp. 165-172.
-
(1998)
Proc. 15th AAAI Conf. Artificial Intelligence
, pp. 165-172
-
-
Meuleau, N.1
Hauskrecht, M.2
Kim, K.3
Peshkin, L.4
Kaelbling, L.5
Dean, T.6
Boutilier, C.7
-
4
-
-
0032645144
-
Team-partitioned, opaque-transition reinforcement learning
-
P. Stone, M. Veloso, Team-partitioned, opaque-transition reinforcement learning, in: Proc. RoboCup-98, 1998, pp. 206-212.
-
(1998)
Proc. RoboCup-98
, pp. 206-212
-
-
Stone, P.1
Veloso, M.2
-
7
-
-
40949099898
-
Utile coordination: Learning interdependencies among cooperative agents
-
J. Kok, P. Hoen, B. Bakker, N. Vlassis, Utile coordination: learning interdependencies among cooperative agents, in: IEEE Symp. Computational Intelligence and Games, 2005, pp. 61-68.
-
(2005)
IEEE Symp. Computational Intelligence and Games
, pp. 61-68
-
-
Kok, J.1
Hoen, P.2
Bakker, B.3
Vlassis, N.4
-
8
-
-
60349107649
-
Exploiting factored representations for decentralized execution in multiagent teams
-
M. Roth, R. Simmons, M. Veloso, Exploiting factored representations for decentralized execution in multiagent teams, in: Proc. 6th Int. Conf. Autonomous Agents and Multiagent Systems, 2007, pp. 469-475.
-
(2007)
Proc. 6th Int. Conf. Autonomous Agents and Multiagent Systems
, pp. 469-475
-
-
Roth, M.1
Simmons, R.2
Veloso, M.3
-
9
-
-
0141591857
-
Graphical models for game theory
-
M. Kearns, M. Littman, S. Singh, Graphical models for game theory, in: Proc. 17th Conf. Uncertainty in Artificial Intelligence, 2001, pp. 253-260.
-
(2001)
Proc. 17th Conf. Uncertainty in Artificial Intelligence
, pp. 253-260
-
-
Kearns, M.1
Littman, M.2
Singh, S.3
-
10
-
-
79958100489
-
Action-graph games
-
TR-2008-13, Univ. British Columbia
-
A. Xin Jiang, K. Leyton-Brown, N. Bhat, Action-graph games, Tech. rep. TR-2008-13, Univ. British Columbia, 2008.
-
(2008)
Tech. Rep
-
-
Xin Jiang, A.1
Leyton-Brown, K.2
Bhat, N.3
-
12
-
-
27344449757
-
Decentralized control of cooperative systems: Categorization and complexity analysis
-
C. Goldman, and S. Zilberstein Decentralized control of cooperative systems: categorization and complexity analysis Journal of Artificial Intelligence Research 22 2004 143 174 (Pubitemid 41525885)
-
(2004)
Journal of Artificial Intelligence Research
, vol.22
, pp. 143-174
-
-
Goldman, C.V.1
Zilberstein, S.2
-
13
-
-
78650588227
-
Exploiting coordination locales in distributed POMDPs via social model shaping
-
P. Varakantham, J. Kwak, M. Taylor, J. Marecki, P. Scerri, M. Tambe, Exploiting coordination locales in distributed POMDPs via social model shaping, in: Proc. 19th Int. Conf. Automated Planning and Scheduling, 2009, pp. 313-320.
-
(2009)
Proc. 19th Int. Conf. Automated Planning and Scheduling
, pp. 313-320
-
-
Varakantham, P.1
Kwak, J.2
Taylor, M.3
Marecki, J.4
Scerri, P.5
Tambe, M.6
-
15
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
PII S000437029800023X
-
L. Kaelbling, M. Littman, and A. Cassandra Planning and acting in partially observable stochastic domains Artificial Intelligence 101 1998 99 134 (Pubitemid 128387390)
-
(1998)
Artificial Intelligence
, vol.101
, Issue.1-2
, pp. 99-134
-
-
Kaelbling, L.P.1
Littman, M.L.2
Cassandra, A.R.3
-
16
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
D. Bernstein, R. Givan, N. Immerman, and S. Zilberstein The complexity of decentralized control of Markov decision processes Mathematics of Operations Research 27 4 2002 819 840
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
19
-
-
0015658957
-
The optimal control of partially observable Markov processes over a finite horizon
-
R. Smallwood, and E. Sondik The optimal control of partially observable Markov processes over a finite horizon Operations Research 21 5 1973 1071 1088
-
(1973)
Operations Research
, vol.21
, Issue.5
, pp. 1071-1088
-
-
Smallwood, R.1
Sondik, E.2
-
20
-
-
0032596468
-
On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems
-
O. Madani, S. Hanks, A. Condon, On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems, in: Proc. 16th AAAI Conf. Artificial Intelligence, 1999, pp. 541-548.
-
(1999)
Proc. 16th AAAI Conf. Artificial Intelligence
, pp. 541-548
-
-
Madani, O.1
Hanks, S.2
Condon, A.3
-
22
-
-
33646244605
-
A (revised) survey of approximate methods for solving partially observable Markov decision processes
-
National ICT Australia
-
D. Aberdeen, A (revised) survey of approximate methods for solving partially observable Markov decision processes, Tech. rep., National ICT Australia, 2003.
-
(2003)
Tech. Rep.
-
-
Aberdeen, D.1
-
23
-
-
85138579181
-
Learning policies for partially observable environments: Scaling up
-
M. Littman, A. Cassandra, L. Kaelbling, Learning policies for partially observable environments: scaling up, in: Proc. 12th Int. Conf. Machine Learning, 1995, pp. 362-370.
-
(1995)
Proc. 12th Int. Conf. Machine Learning
, pp. 362-370
-
-
Littman, M.1
Cassandra, A.2
Kaelbling, L.3
-
26
-
-
51649127552
-
Formal models and algorithms for decentralized decision-making under uncertainty
-
S. Seuken, and S. Zilberstein Formal models and algorithms for decentralized decision-making under uncertainty Journal of Autonomous Agents and Multiagent Systems 17 2 2008 190 250
-
(2008)
Journal of Autonomous Agents and Multiagent Systems
, vol.17
, Issue.2
, pp. 190-250
-
-
Seuken, S.1
Zilberstein, S.2
-
29
-
-
0008084202
-
Optimal policies for partially observable Markov decision processes
-
Dept. Computer Sciences, Brown Univ.
-
A. Cassandra, Optimal policies for partially observable Markov decision processes, Tech. rep. CS-94-14, Dept. Computer Sciences, Brown Univ., 1994.
-
(1994)
Tech. Rep. CS-94-14
-
-
Cassandra, A.1
-
32
-
-
0038637209
-
Multi-agent reinforcement learning: Independent vs. cooperative agents
-
Morgan Kaufman
-
M. Tan Multi-agent reinforcement learning: independent vs. cooperative agents Readings in Agents 1997 Morgan Kaufman 487 494
-
(1997)
Readings in Agents
, pp. 487-494
-
-
Tan, M.1
-
33
-
-
33744514808
-
Generalised weakened fictitious play
-
DOI 10.1016/j.geb.2005.08.005, PII S089982560500103X
-
D. Leslie, and E. Collins Generalised weakened fictitious play Games and Economic Behavior 56 2006 285 298 (Pubitemid 43812068)
-
(2006)
Games and Economic Behavior
, vol.56
, Issue.2
, pp. 285-298
-
-
Leslie, D.S.1
Collins, E.J.2
-
36
-
-
0010220982
-
Planning, learning and coordination in multiagent decision processes
-
C. Boutilier, Planning, learning and coordination in multiagent decision processes, in: Theoretical Aspects of Rationality and Knowledge, 1996, pp. 195-210.
-
(1996)
Theoretical Aspects of Rationality and Knowledge
, pp. 195-210
-
-
Boutilier, C.1
-
37
-
-
1142292938
-
The communicative multiagent team decision problem: Analyzing teamwork theories and models
-
D. Pynadath, and M. Tambe The communicative multiagent team decision problem: analyzing teamworktheories and models Journal of Artificial Intelligence Research 16 2002 389 423 (Pubitemid 43057178)
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 389-423
-
-
Pynadath, D.V.1
Tambe, M.2
-
39
-
-
4544325183
-
Approximate solutions for partially observable stochastic games with common payoffs
-
R. Emery-Montemerlo, G. Gordon, J. Schneider, S. Thrun, Approximate solutions for partially observable stochastic games with common payoffs, in: Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems, 2004, pp. 136-143.
-
(2004)
Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems
, pp. 136-143
-
-
Emery-Montemerlo, R.1
Gordon, G.2
Schneider, J.3
Thrun, S.4
-
41
-
-
1142293055
-
Transition-independent decentralized Markov decision processes
-
R. Becker, S. Zilberstein, V. Lesser, C. Goldman, Transition-independent decentralized Markov decision processes, in: Proc. 2nd Int. Conf. Autonomous Agents and Multiagent Systems, 2003, pp. 41-48.
-
(2003)
Proc. 2nd Int. Conf. Autonomous Agents and Multiagent Systems
, pp. 41-48
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.4
-
43
-
-
0036923118
-
Context-specific multiagent coordination and planning with factored MDPs
-
C. Guestrin, S. Venkataraman, D. Koller, Context-specific multiagent coordination and planning with factored MDPs, in: Proc. 18th AAAI Conf. Artificial Intelligence, 2002, pp. 253-259.
-
(2002)
Proc. 18th AAAI Conf. Artificial Intelligence
, pp. 253-259
-
-
Guestrin, C.1
Venkataraman, S.2
Koller, D.3
-
46
-
-
33750710723
-
A polynomial-time algorithm for action-graph games
-
Proceedings of the 21st National Conference on Artificial Intelligence and the 18th Innovative Applications of Artificial Intelligence Conference, AAAI-06/IAAI-06
-
A. Xin Jiang, K. Leyton-Brown, A polynomial-time algorithm for action-graph games, in: Proc. 21st AAAI Conf. Artificial Intelligence, 2006, pp. 679-684. (Pubitemid 44705361)
-
(2006)
Proceedings of the National Conference on Artificial Intelligence
, vol.1
, pp. 679-684
-
-
Jiang, A.X.1
Leyton-Brown, K.2
-
47
-
-
36349034015
-
Computing pure nash equilibria in symmetric action graph games
-
AAAI-07/IAAI-07 Proceedings: 22nd AAAI Conference on Artificial Intelligence and the 19th Innovative Applications of Artificial Intelligence Conference
-
A. Xin Jiang, K. Leyton-Brown, Computing pure Nash equilibria in symmetric action-graph games, in: Proc. 22nd AAAI Conf. Artificial Intelligence, 2007, pp. 79-85. (Pubitemid 350149556)
-
(2007)
Proceedings of the National Conference on Artificial Intelligence
, vol.1
, pp. 79-85
-
-
Jiang, A.X.1
Leyton-Brown, K.2
-
48
-
-
4544301377
-
Decentralized Markov decision processes with event-driven interactions
-
R. Becker, V. Lesser, S. Zilberstein, Decentralized Markov decision processes with event-driven interactions, in: Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems, 2004, pp. 302-309.
-
(2004)
Proc. 3rd Int. Conf. Autonomous Agents and Multiagent Systems
, pp. 302-309
-
-
Becker, R.1
Lesser, V.2
Zilberstein, S.3
-
49
-
-
33846298515
-
Analyzing myopic approaches for multi-agent communication
-
DOI 10.1109/IAT.2005.44, 1565602, Proceedings - 2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'05
-
R. Becker, V. Lesser, S. Zilberstein, Analyzing myopic approaches for multiagent communication, in: Proc. IEEE Int. Conf. Intelligent Agent Technology, 2005, pp. 550-557. (Pubitemid 46116612)
-
(2005)
Proceedings - 2005 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT'05
, vol.2005
, pp. 550-557
-
-
Becker, R.1
Lesser, V.2
Zilberstein, S.3
-
50
-
-
59449098657
-
Analyzing myopic approaches for multiagent communications
-
R. Becker, A. Carlin, V. Lesser, and S. Zilberstein Analyzing myopic approaches for multiagent communications Computational Intelligence 25 1 2009 31 50
-
(2009)
Computational Intelligence
, vol.25
, Issue.1
, pp. 31-50
-
-
Becker, R.1
Carlin, A.2
Lesser, V.3
Zilberstein, S.4
|