[1] Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In AAAI'98, 746-752. AAAI Press.
[2] Guestrin, C.; Koller, D.; and Parr, R. 2001. Multiagent planning with factored MDPs. In NIPS-14, 1523-1530.
[3] Guestrin, C.; Lagoudakis, M. G.; and Parr, R. 2002. Coordinated reinforcement learning. In ICML '02: Proceedings of the Nineteenth International Conference on Machine Learning, 227-234. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc.
[4] Kok, J. R., and Vlassis, N. 2006. Collaborative multiagent reinforcement learning by payoff propagation. Journal of Machine Learning Research 7:1789-1828.
[5] Kumar, A., and Zilberstein, S. 2009. Constraint-based dynamic programming for decentralized POMDPs with structured interactions. In AAMAS.
[7] Marecki, J.; Gupta, T.; Varakantham, P.; Tambe, M.; and Yokoo, M. 2008. Not all agents are equal: Scaling up distributed POMDPs for agent networks. In AAMAS, 485-492.
[9] Stranders, R.; Farinelli, A.; Rogers, A.; and Jennings, N. R. 2009. Decentralised coordination of mobile sensors using the max-sum algorithm. In IJCAI, 299-304.
[10] Tasaki, M.; Yabu, Y.; Iwanari, Y.; Yokoo, M.; Tambe, M.; Marecki, J.; and Varakantham, P. 2008. Introducing communication in Dis-POMDPs with locality of interaction. In Proceedings of the 2008 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, volume 2, 169-175.
[11] Varakantham, P.; Tambe, M.; and Yokoo, M. 2005. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In AAAI, 133-139.
[12] Zhang, C.; Abdallah, S.; and Lesser, V. 2009. Integrating organizational control into multi-agent learning. In AAMAS'09.
[13] Zhang, C.; Lesser, V.; and Abdallah, S. 2010. Self-organization for coordinating decentralized reinforcement learning. In AAMAS'10.