-
5
-
-
0019263725
-
Time-varying feedback laws for decentralized control
-
Anderson, B., & Moore, J. (1980). Time-varying feedback laws for decentralized control. Nineteenth IEEE Conference on Decision and Control including the Symposium on Adaptive Processes, 19(1), 519-524.
-
(1980)
Nineteenth IEEE Conference on Decision and Control including the Symposium on Adaptive Processes
, vol.19
, Issue.1
, pp. 519-524
-
-
Anderson, B.1
Moore, J.2
-
6
-
-
27344432831
-
Solving transition independent decentralized Markov decision processes
-
Becker, R., Zilberstein, S., Lesser, V., & Goldman, C. (2004). Solving transition independent decentralized Markov decision processes. Journal of Artificial Intelligence Research, 22, 423-455.
-
(2004)
Journal of Artificial Intelligence Research
, vol.22
, pp. 423-455
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.4
-
7
-
-
85012688561
-
-
Princeton University Press, Princeton, New- Jersey
-
Bellman, R. (1957). Dynamic programming. Princeton University Press, Princeton, New- Jersey.
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
8
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
Bernstein, D., Givan, R., Immerman, N., & Zilberstein, S. (2002). The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4), 819-840.
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
9
-
-
84880740944
-
Bounded policy iteration for decentralized POMDPs
-
Bernstein, D. S., Hansen, E. A., & Zilberstein, S. (2005). Bounded policy iteration for decentralized POMDPs. In Proc. of the Nineteenth Int. Joint Conf. on Artificial Intelligence (IJCAI), pp. 1287-1292.
-
(2005)
Proc. of the Nineteenth Int. Joint Conf. on Artificial Intelligence (IJCAI)
, pp. 1287-1292
-
-
Bernstein, D.S.1
Hansen, E.A.2
Zilberstein, S.3
-
12
-
-
34548099216
-
Shaping multi-agent systems with gradient reinforcement learning
-
Buffet, O., Dutech, A., & Charpillet, F. (2007). Shaping multi-agent systems with gradient reinforcement learning. Autonomous Agent and Multi-Agent System Journal (AAMASJ), 15(2), 197-220.
-
(2007)
Autonomous Agent and Multi-Agent System Journal (AAMASJ)
, vol.15
, Issue.2
, pp. 197-220
-
-
Buffet, O.1
Dutech, A.2
Charpillet, F.3
-
14
-
-
0036040313
-
A heuristic approach for solving decentralized-POMDP: Assessment on the pursuit problem
-
Chadès, I., Scherrer, B., & Charpillet, F. (2002). A heuristic approach for solving decentralized-POMDP: assessment on the pursuit problem. In Proc. of the 2002 ACM Symposium on Applied Computing, pp. 57-62.
-
(2002)
Proc. of the 2002 ACM Symposium on Applied Computing
, pp. 57-62
-
-
Chadès, I.1
Scherrer, B.2
Charpillet, F.3
-
15
-
-
34547469909
-
Valid inequalities for mixed integer linear programs
-
Cornuéjols, G. (2008). Valid inequalities for mixed integer linear programs. Mathematical Programming B, 112, 3-44.
-
(2008)
Mathematical Programming B
, vol.112
, pp. 3-44
-
-
Cornuéjols, G.1
-
16
-
-
0008644215
-
On the significance of solving linear programming problems with some integer variables
-
Dantzig, G. B. (1960). On the significance of solving linear programming problems with some integer variables. Econometrica, 28(1), 30-44.
-
(1960)
Econometrica
, vol.28
, Issue.1
, pp. 30-44
-
-
Dantzig, G.B.1
-
17
-
-
0006464452
-
A probabilistic production and inventory problem
-
d'Epenoux, F. (1963). A probabilistic production and inventory problem. Management Science, 10(1), 98-108.
-
(1963)
Management Science
, vol.10
, Issue.1
, pp. 98-108
-
-
D'Epenoux, F.1
-
22
-
-
0038009937
-
A global newton method to compute Nash equilibria
-
Govindan, S., & Wilson, R. (2001). A global newton method to compute Nash equilibria. Journal of Economic Theory, 110, 65-86.
-
(2001)
Journal of Economic Theory
, vol.110
, pp. 65-86
-
-
Govindan, S.1
Wilson, R.2
-
26
-
-
0032073263
-
Planning and acting in partially observable stochastic domains
-
Kaelbling, L., Littman, M., & Cassandra, A. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134.
-
(1998)
Artificial Intelligence
, vol.101
, pp. 99-134
-
-
Kaelbling, L.1
Littman, M.2
Cassandra, A.3
-
27
-
-
0027964134
-
Fast algorithms for finding randomized strategies in game trees
-
Koller, D., Megiddo, N., & von Stengel, B. (1994). Fast algorithms for finding randomized strategies in game trees. In Proceedings of the 26th ACM Symposium on Theory of Computing (STOC '94), pp. 750-759.
-
(1994)
Proceedings of the 26th ACM Symposium on Theory of Computing (STOC '94)
, pp. 750-759
-
-
Koller, D.1
Megiddo, N.2
Von Stengel, B.3
-
28
-
-
0030535025
-
Finding mixed strategies with small supports in extensive form games
-
Koller, D., & Megiddo, N. (1996). Finding mixed strategies with small supports in extensive form games. International Journal of Game Theory, 25(1), 73-92.
-
(1996)
International Journal of Game Theory
, vol.25
, Issue.1
, pp. 73-92
-
-
Koller, D.1
Megiddo, N.2
-
29
-
-
0001644591
-
Bimatrix equilibrium points and mathematical programming
-
Lemke, C. (1965). Bimatrix Equilibrium Points and Mathematical Programming. Management Science, 11(7), 681-689.
-
(1965)
Management Science
, vol.11
, Issue.7
, pp. 681-689
-
-
Lemke, C.1
-
30
-
-
0003488911
-
-
Addison-Wesley Publishing Company, Reading, Massachussetts
-
Luenberger, D. (1984). Linear and Nonlinear Programming. Addison-Wesley Publishing Company, Reading, Massachussetts.
-
(1984)
Linear and Nonlinear Programming
-
-
Luenberger, D.1
-
32
-
-
84880823326
-
Taming decentralized POMDPs: towards efficient policy computation for multiagent setting
-
Nair, R., Tambe, M., Yokoo, M., Pynadath, D., & Marsella, S. (2003). Taming decentralized POMDPs: towards efficient policy computation for multiagent setting. In Proc. of Int. Joint Conference on Artificial Intelligence, IJCAI'03.
-
(2003)
Proc. of Int. Joint Conference on Artificial Intelligence, IJCAI'03
-
-
Nair, R.1
Tambe, M.2
Yokoo, M.3
Pynadath, D.4
Marsella, S.5
-
33
-
-
52249098423
-
Optimal and approximate Q-value functions for decentralized POMDPs
-
Oliehoek, F., Spaan, M., & Vlassis, N. (2008). Optimal and approximate Q-value functions for decentralized POMDPs. Journal of Artificial Intelligence Research (JAIR), 32, 289-353.
-
(2008)
Journal of Artificial Intelligence Research (JAIR)
, vol.32
, pp. 289-353
-
-
Oliehoek, F.1
Spaan, M.2
Vlassis, N.3
-
34
-
-
84899811776
-
Lossless clustering of histories in decentralized POMDPs
-
Oliehoek, F., Whiteson, S., & Spaan, M. (2009). Lossless clustering of histories in decentralized POMDPs. In Proc. of The International Joint Conference on Autonomous Agents and Multi Agent Systems, pp. 577-584.
-
(2009)
Proc. of The International Joint Conference on Autonomous Agents and Multi Agent Systems
, pp. 577-584
-
-
Oliehoek, F.1
Whiteson, S.2
Spaan, M.3
-
37
-
-
0000977910
-
The Complexity Of Markov Decision Processes
-
Papadimitriou, C. H., & Tsitsiklis, J. (1987). The Complexity Of Markov Decision Processes. Mathematics of Operations Research, 12 (3), 441-450.
-
(1987)
Mathematics of Operations Research
, vol.12
, Issue.3
, pp. 441-450
-
-
Papadimitriou, C.H.1
Tsitsiklis, J.2
-
38
-
-
0036267866
-
Game theory and decision theory in multi-agent systems
-
Parsons, S., & Wooldridge, M. (2002). Game theory and decision theory in multi-agent systems. Autonomous Agents and Multi-Agent Systems (JAAMAS), 5(3), 243-254.
-
(2002)
Autonomous Agents and Multi-Agent Systems (JAAMAS)
, vol.5
, Issue.3
, pp. 243-254
-
-
Parsons, S.1
Wooldridge, M.2
-
42
-
-
1142292938
-
The communicative multiagent team decision problem: Analyzing teamwork theories and models
-
Pynadath, D., & Tambe, M. (2002). The Communicative Multiagent Team Decision Problem: Analyzing Teamwork Theories And Models. Journal of Artificial Intelligence Research, 16, 389-423. (Pubitemid 43057178)
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 389-423
-
-
Pynadath, D.V.1
Tambe, M.2
-
43
-
-
0038246079
-
The application of linear programming to team decision problems
-
Radner, R. (1959). The application of linear programming to team decision problems. Management Science, 5, 143-150.
-
(1959)
Management Science
, vol.5
, pp. 143-150
-
-
Radner, R.1
-
45
-
-
0000685151
-
-
chap. Distributed rational decision making, The MIT Press. Ed. by G. Weiss
-
Sandholm, T. (1999). Multiagent systems, chap. Distributed rational decision making, pp. 201-258. The MIT Press. Ed. by G. Weiss.
-
(1999)
Multiagent Systems
, pp. 201-258
-
-
Sandholm, T.1
-
50
-
-
1942452236
-
Learning predictive state representations
-
Singh, S., Littman, M., Jong, N., Pardoe, D., & Stone, P. (2003). Learning predictive state representations. In Proc. of the Twentieth Int. Conf. of Machine Learning (ICML'03).
-
(2003)
Proc. of the Twentieth Int. Conf. of Machine Learning (ICML'03)
-
-
Singh, S.1
Littman, M.2
Jong, N.3
Pardoe, D.4
Stone, P.5
-
52
-
-
80053226937
-
MAA*: A heuristic search algorithm for solving decentralized POMDPs
-
Szer, D., Charpillet, F., & Zilberstein, S. (2005). MAA*: A heuristic search algorithm for solving decentralized POMDPs. In Proc. of the Twenty-First Conf. on Uncertainty in Artificial Intelligence (UAI'05), pp. 576- 583.
-
(2005)
Proc. of the Twenty-First Conf. on Uncertainty in Artificial Intelligence (UAI'05)
, pp. 576-583
-
-
Szer, D.1
Charpillet, F.2
Zilberstein, S.3
-
53
-
-
4544322319
-
Interac-DEC-MDP : Towards the use of interactions in DEC-MDP
-
Thomas, V., Bourjot, C., & Chevrier, V. (2004). Interac-DEC-MDP : Towards the use of interactions in DEC-MDP. In Proc. of the Third Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (AAMAS'04), New York, USA, pp. 1450-1451.
-
(2004)
Proc. of the Third Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems (AAMAS'04), New York, USA
, pp. 1450-1451
-
-
Thomas, V.1
Bourjot, C.2
Chevrier, V.3
-
55
-
-
67649370955
-
-
chap. 45-"Computing equilibria for two-person games", North-Holland, Amsterdam
-
von Stengel, B. (2002). Handbook of Game Theory, Vol.3, chap. 45-"Computing equilibria for two-person games", pp. 1723-1759. North-Holland, Amsterdam.
-
(2002)
Handbook of Game Theory
, vol.3
, pp. 1723-1759
-
-
Von Stengel, B.1
-
56
-
-
34247270255
-
Mixed-integer linear programming for transitionindependent decentralized MDPs, New York, NY, USA. ACM
-
Wu, J., & Durfee, E. H. (2006). Mixed-integer linear programming for transitionindependent decentralized MDPs. In Proc. of the fifth Int. Joint Conf. on Autonomous Agents and Multiagent Systems (AAMAS'06), pp. 1058-1060 New York, NY, USA. ACM.
-
(2006)
Proc. of the fifth Int. Joint Conf. on Autonomous Agents and Multiagent Systems (AAMAS'06)
, pp. 1058-1060
-
-
Wu, J.1
Durfee, E.H.2
-
57
-
-
84962090726
-
Communication in multi-agent Markov decision processes
-
Boston, MA
-
Xuan, P., Lesser, V., & Zilberstein, S. (2000). Communication in multi-agent Markov decision processes. In Proc. of ICMAS Workshop on Game Theoretic and Decision Theoretics Agents Boston, MA.
-
(2000)
Proc. of ICMAS Workshop on Game Theoretic and Decision Theoretics Agents
-
-
Xuan, P.1
Lesser, V.2
Zilberstein, S.3
|