-
2
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov decision processes. Math. of OR, 27(4): 819-840, 2002.
-
(2002)
Math. of OR
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.S.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
3
-
-
0346942368
-
Decision-theoretic planning: Structural assumptions and computational leverage
-
C. Boutilier, T. Dean, and S. Hanks. Decision-theoretic planning: Structural assumptions and computational leverage. JAIR, 11: 1-94, 1999.
-
(1999)
JAIR
, vol.11
, pp. 1-94
-
-
Boutilier, C.1
Dean, T.2
Hanks, S.3
-
4
-
-
0002436850
-
Tractable inference for complex stochastic processes
-
X. Boyen and D. Koller. Tractable inference for complex stochastic processes. In UAI, 1998.
-
(1998)
UAI
-
-
Boyen, X.1
Koller, D.2
-
5
-
-
4544325183
-
Approximate solutions for partially observable stochastic games with common payoffs
-
R. Emery-Montemerlo, G. Gordon, J. Schneider, and S. Thrun. Approximate solutions for partially observable stochastic games with common payoffs. In AAMAS, 2004.
-
(2004)
AAMAS
-
-
Emery-Montemerlo, R.1
Gordon, G.2
Schneider, J.3
Thrun, S.4
-
7
-
-
84899946303
-
Decentralised coordination of low-power embedded devices using the max-sum algorithm
-
A. Farinelli, A. Rogers, A. Petcu, and N. R. Jennings. Decentralised coordination of low-power embedded devices using the max-sum algorithm. In AAMAS, 2008.
-
(2008)
AAMAS
-
-
Farinelli, A.1
Rogers, A.2
Petcu, A.3
Jennings, N.R.4
-
8
-
-
4544318426
-
Efficient solution algorithms for factored MDPs
-
C. Guestrin, D. Koller, R. Parr, and S. Venkataraman. Efficient solution algorithms for factored MDPs. JAIR, 19: 399-468, 2003.
-
(2003)
JAIR
, vol.19
, pp. 399-468
-
-
Guestrin, C.1
Koller, D.2
Parr, R.3
Venkataraman, S.4
-
10
-
-
33748543203
-
Collaborative multiagent reinforcement learning by payoff propagation
-
J. R. Kok and N. Vlassis. Collaborative multiagent reinforcement learning by payoff propagation. JMLR, 7: 1789-1828, 2006.
-
(2006)
JMLR
, vol.7
, pp. 1789-1828
-
-
Kok, J.R.1
Vlassis, N.2
-
11
-
-
84880688552
-
Computing factored value functions for policies in structured MDPs
-
D. Koller and R. Parr. Computing factored value functions for policies in structured MDPs. In IJCAI, 1999.
-
(1999)
IJCAI
-
-
Koller, D.1
Parr, R.2
-
12
-
-
84899828955
-
Constraint-based dynamic programming for decentralized POMDPs with structured interactions
-
A. Kumar and S. Zilberstein. Constraint-based dynamic programming for decentralized POMDPs with structured interactions. In AAMAS, 2009.
-
(2009)
AAMAS
-
-
Kumar, A.1
Zilberstein, S.2
-
13
-
-
84868288428
-
Scalable multiagent planning using probabilistic inference
-
A. Kumar, S. Zilberstein, and M. Toussaint. Scalable multiagent planning using probabilistic inference. In IJCAI, 2011.
-
(2011)
IJCAI
-
-
Kumar, A.1
Zilberstein, S.2
Toussaint, M.3
-
14
-
-
84899969517
-
Not all agents are equal: Scaling up distributed POMDPs for agent networks
-
J. Marecki, T. Gupta, P. Varakantham, M. Tambe, and M. Yokoo. Not all agents are equal: scaling up distributed POMDPs for agent networks. In AAMAS, 2008.
-
(2008)
AAMAS
-
-
Marecki, J.1
Gupta, T.2
Varakantham, P.3
Tambe, M.4
Yokoo, M.5
-
15
-
-
0007788905
-
The factored frontier algorithm for approximate inference in DBNs
-
K. P. Murphy and Y. Weiss. The factored frontier algorithm for approximate inference in DBNs. In UAI, 2001.
-
(2001)
UAI
-
-
Murphy, K.P.1
Weiss, Y.2
-
16
-
-
34247214638
-
Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs
-
R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In AAAI, 2005.
-
(2005)
AAAI
-
-
Nair, R.1
Varakantham, P.2
Tambe, M.3
Yokoo, M.4
-
18
-
-
84868288325
-
Decentralized POMDPs
-
M. Wiering and M. van Otterlo, editors. Springer Berlin Heidelberg
-
F. A. Oliehoek. Decentralized POMDPs. In M. Wiering and M. van Otterlo, editors, Reinforcement Learning: State of the Art. Springer Berlin Heidelberg, 2012.
-
(2012)
Reinforcement Learning: State of the Art
-
-
Oliehoek, F.A.1
-
19
-
-
57349184659
-
The cross-entropy method for policy search in decentralized POMDPs
-
F. A. Oliehoek, J. F. Kooi, and N. Vlassis. The cross-entropy method for policy search in decentralized POMDPs. Informatica, 32: 341-357, 2008.
-
(2008)
Informatica
, vol.32
, pp. 341-357
-
-
Oliehoek, F.A.1
Kooi, J.F.2
Vlassis, N.3
-
20
-
-
52249098423
-
Optimal and approximate Q-value functions for decentralized POMDPs
-
F. A. Oliehoek, M. T. J. Spaan, and N. Vlassis. Optimal and approximate Q-value functions for decentralized POMDPs. JAIR, 32: 289-353, 2008.
-
(2008)
JAIR
, vol.32
, pp. 289-353
-
-
Oliehoek, F.A.1
Spaan, M.T.J.2
Vlassis, N.3
-
22
-
-
84885985853
-
Exploiting structure in cooperative Bayesian games
-
F. A. Oliehoek, S. Whiteson, and M. T. J. Spaan. Exploiting structure in cooperative Bayesian games. In UAI, 2012.
-
(2012)
UAI
-
-
Oliehoek, F.A.1
Whiteson, S.2
Spaan, M.T.J.3
-
23
-
-
84860644195
-
Efficient planning for factored infinite-horizon DEC-POMDPs
-
J. Pajarinen and J. Peltonen. Efficient planning for factored infinite-horizon DEC-POMDPs. In IJCAI, 2011.
-
(2011)
IJCAI
-
-
Pajarinen, J.1
Peltonen, J.2
-
24
-
-
84881044687
-
The complexity of multiagent systems: The price of silence
-
Z. Rabinovich, C. V. Goldman, and J. S. Rosenschein. The complexity of multiagent systems: the price of silence. In AAMAS, 2003.
-
(2003)
AAMAS
-
-
Rabinovich, Z.1
Goldman, C.V.2
Rosenschein, J.S.3
-
25
-
-
78650949545
-
Bounded approximate decentralised coordination via the max-sum algorithm
-
A. Rogers, A. Farinelli, R. Stranders, and N. Jennings. Bounded approximate decentralised coordination via the max-sum algorithm. Artif. Intell., 175(2): 730-759, 2011.
-
(2011)
Artif. Intell.
, vol.175
, Issue.2
, pp. 730-759
-
-
Rogers, A.1
Farinelli, A.2
Stranders, R.3
Jennings, N.4
-
26
-
-
51649127552
-
Formal models and algorithms for decentralized decision making under uncertainty
-
S. Seuken and S. Zilberstein. Formal models and algorithms for decentralized decision making under uncertainty. Autonomous Agents and Multi-Agent Systems, 17(2): 190-250, 2008.
-
(2008)
Autonomous Agents and Multi-Agent Systems
, vol.17
, Issue.2
, pp. 190-250
-
-
Seuken, S.1
Zilberstein, S.2
-
27
-
-
84868299292
-
Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion
-
M. T. J. Spaan, F. A. Oliehoek, and C. Amato. Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion. In IJCAI, 2011.
-
(2011)
IJCAI
-
-
Spaan, M.T.J.1
Oliehoek, F.A.2
Amato, C.3
-
29
-
-
68949157375
-
Transfer learning for reinforcement learning domains: A survey
-
M. E. Taylor and P. Stone. Transfer learning for reinforcement learning domains: A survey. JMLR, 10: 1633-1685, 2009.
-
(2009)
JMLR
, vol.10
, pp. 1633-1685
-
-
Taylor, M.E.1
Stone, P.2
-
30
-
-
78650588227
-
Exploiting coordination locales in distributed POMDPs via social model shaping
-
P. Varakantham, J. Kwak, M. E. Taylor, J. Marecki, P. Scerri, and M. Tambe. Exploiting coordination locales in distributed POMDPs via social model shaping. In ICAPS, 2009.
-
(2009)
ICAPS
-
-
Varakantham, P.1
Kwak, J.2
Taylor, M.E.3
Marecki, J.4
Scerri, P.5
Tambe, M.6
-
31
-
-
78650622568
-
Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies
-
P. Varakantham, J. Marecki, Y. Yabu, M. Tambe, and M. Yokoo. Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies. In AAMAS, 2007.
-
(2007)
AAMAS
-
-
Varakantham, P.1
Marecki, J.2
Yabu, Y.3
Tambe, M.4
Yokoo, M.5
-
32
-
-
84899454751
-
Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents
-
P. Velagapudi, P. Varakantham, P. Scerri, and K. Sycara. Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents. In AAMAS, 2011.
-
(2011)
AAMAS
-
-
Velagapudi, P.1
Varakantham, P.2
Scerri, P.3
Sycara, K.4
-
33
-
-
78650593547
-
Influence-based policy abstraction for weakly-coupled Dec-POMDPs
-
S. J. Witwicki and E. H. Durfee. Influence-based policy abstraction for weakly-coupled Dec-POMDPs. In ICAPS, 2010.
-
(2010)
ICAPS
-
-
Witwicki, S.J.1
Durfee, E.H.2
-
34
-
-
80053153738
-
Rollout sampling policy iteration for decentralized POMDPs
-
F. Wu, S. Zilberstein, and X. Chen. Rollout sampling policy iteration for decentralized POMDPs. In UAI, 2010.
-
(2010)
UAI
-
-
Wu, F.1
Zilberstein, S.2
Chen, X.3