-
1
-
-
33644800166
-
Preprocessing techniques for accelerating the DCOP algorithm ADOPT
-
Utrecht, The Netherlands
-
S. Muhammad Ali, S. Koenig, and M. Tambe. Preprocessing techniques for accelerating the DCOP algorithm ADOPT. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), pages 1041-1048, Utrecht, The Netherlands, 2005.
-
(2005)
Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS)
, pp. 1041-1048
-
-
Ali, S.M.1
Koenig, S.2
Tambe, M.3
-
2
-
-
0036817725
-
Editorial: Advances in multi-robot systems
-
T. Aral, E. Pagello, and L. E. Parker. Editorial: Advances in multi-robot systems. IEEE Transactions on Robotics and Automation, 18(5):665-661, 2002.
-
(2002)
IEEE Transactions on Robotics and Automation
, vol.18
, Issue.5
, pp. 665-1661
-
-
Aral, T.1
Pagello, E.2
Parker, L.E.3
-
3
-
-
1142293055
-
Transition-independent decentralized Markov decision processes
-
Melbourne, Australia
-
R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman. Transition-independent decentralized Markov decision processes. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), Melbourne, Australia, 2003.
-
(2003)
Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS)
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.V.4
-
8
-
-
0000719863
-
Packet routing in dynamically changing networks: A reinforcement learning approach
-
Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors. Morgan Kaufmann Publishers, Inc.
-
J. A. Boyan and M. L. Littman. Packet routing in dynamically changing networks: A reinforcement learning approach. In Jack D. Cowan, Gerald Tesauro, and Joshua Alspector, editors, Advances in Neural Information Processing Systems (NIPS) 6, pages 671-678. Morgan Kaufmann Publishers, Inc., 1994.
-
(1994)
Advances in Neural Information Processing Systems (NIPS)
, vol.6
, pp. 671-678
-
-
Boyan, J.A.1
Littman, M.L.2
-
9
-
-
0033692328
-
Collaborative multi-robot exploration
-
W. Burgard, M. Moors, D. Fox, R. Simmons, and S. Thrun. Collaborative multi-robot exploration. In Proceedings of the IEEE International Conference on Robotics and Automation, 2000.
-
(2000)
Proceedings of the IEEE International Conference on Robotics and Automation
-
-
Burgard, W.1
Moors, M.2
Fox, D.3
Simmons, R.4
Thrun, S.5
-
16
-
-
0035395660
-
Scaling up agent coordination strategies
-
July
-
E. H. Durfee. Scaling up agent coordination strategies. IEEE Computer, 34(7):39-46, July 2001.
-
(2001)
IEEE Computer
, vol.34
, Issue.7
, pp. 39-46
-
-
Durfee, E.H.1
-
17
-
-
31144432283
-
Cooperative information sharing to improve distributed learning in multi-agent systems
-
P. S. Dutta, N. R. Jennings, and L. Moreau. Cooperative information sharing to improve distributed learning in multi-agent systems. Journal of Artificial Intelligence Research, 24:407-463, 2005.
-
(2005)
Journal of Artificial Intelligence Research
, vol.24
, pp. 407-463
-
-
Dutta, P.S.1
Jennings, N.R.2
Moreau, L.3
-
18
-
-
27344449757
-
Decentralized control of cooperative systems: Categorization and complexity analysis
-
November
-
C. Goldman and S. Zilberstein. Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of Artificial Intelligence Research, 22:143-174, November 2004.
-
(2004)
Journal of Artificial Intelligence Research
, vol.22
, pp. 143-174
-
-
Goldman, C.1
Zilberstein, S.2
-
19
-
-
1142293050
-
Optimizing information exchange in cooperative multi-agent systems
-
New York, NY, USA. ACM Press
-
C. V. Goldman and S. Zilberstein. Optimizing information exchange in cooperative multi-agent systems. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), pages 137-144, New York, NY, USA, 2003. ACM Press.
-
(2003)
Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS)
, pp. 137-144
-
-
Goldman, C.V.1
Zilberstein, S.2
-
22
-
-
4544236179
-
Coordinated reinforcement learning
-
Sydney, Australia, July
-
C. Guestrin, M. Lagoudakis, and R. Parr. Coordinated reinforcement learning. In International Conference on Machine Learning (ICML), Sydney, Australia, July 2002b.
-
(2002)
International Conference on Machine Learning (ICML)
-
-
Guestrin, C.1
Lagoudakis, M.2
Parr, R.3
-
23
-
-
0036923118
-
Context-specific multiagent coordination and planning with factored MDPs
-
Edmonton, Canada, July
-
C. Guestrin, S. Venkataraman, and D. Koller. Context-specific multiagent coordination and planning with factored MDPs. In Proceedings of the National Conference on Artificial Intelligence (AAAI), Edmonton, Canada, July 2002c.
-
(2002)
Proceedings of the National Conference on Artificial Intelligence (AAAI)
-
-
Guestrin, C.1
Venkataraman, S.2
Koller, D.3
-
25
-
-
0004491880
-
RoboCup: The robot world cup initiative
-
H. Kitano, M. Asada, Y. Kuniyoshi, I. Noda, and E. Osawa. RoboCup: The robot world cup initiative. In Proceedings of the IJCAI-95 Workshop on Entertainment and AI/AHfe, 1995.
-
(1995)
Proceedings of the IJCAI-95 Workshop on Entertainment and AI/AHfe
-
-
Kitano, H.1
Asada, M.2
Kuniyoshi, Y.3
Noda, I.4
Osawa, E.5
-
26
-
-
14344250637
-
Sparse cooperative Q-learning
-
RUSS Greiner and Dale Schuurmans, editors, Banff, Canada, July. ACM
-
J. R. Kok and N. Vlassis. Sparse cooperative Q-learning. In RUSS Greiner and Dale Schuurmans, editors, Proceedings of the International Conference on Machine Learning, pages 481-488, Banff, Canada, July 2004. ACM.
-
(2004)
Proceedings of the International Conference on Machine Learning
, pp. 481-488
-
-
Kok, J.R.1
Vlassis, N.2
-
27
-
-
33748562008
-
Using the max-plus algorithm for multiagent decision making in coordination graphs
-
Osaka, Japan, July
-
J. R. Kok and N. Vlassis. Using the max-plus algorithm for multiagent decision making in coordination graphs. In RoboCup-2005: Robot Soccer World Cup IX, Osaka, Japan, July 2005.
-
(2005)
RoboCup-2005: Robot Soccer World Cup IX
-
-
Kok, J.R.1
Vlassis, N.2
-
28
-
-
12244304892
-
Non-communicative multi-robot coordination in dynamic environments
-
February
-
J. R. Kok, M. T. J. Spaan, and N. Vlassis. Non-communicative multi-robot coordination in dynamic environments. Robotics and Autonomous Systems, 50(2-3):99-114, February 2005.
-
(2005)
Robotics and Autonomous Systems
, vol.50
, Issue.2-3
, pp. 99-114
-
-
Kok, J.R.1
Spaan, M.T.J.2
Vlassis, N.3
-
33
-
-
33746360402
-
Distributed optimization in adaptive networks
-
S. Thrun, L. Saul, and B. Schölkopf, editors. MIT Press, Cambridge, MA
-
C. C. Moallemi and B. Van Roy. Distributed optimization in adaptive networks. In S. Thrun, L. Saul, and B. Schölkopf, editors, Advances in Neural Information Processing Systems (NIPS) 16. MIT Press, Cambridge, MA, 2004.
-
(2004)
Advances in Neural Information Processing Systems (NIPS)
, vol.16
-
-
Moallemi, C.C.1
Van Roy, B.2
-
34
-
-
10044277219
-
ADOPT: Asynchronous distributed constraint optimization with quality guarantees
-
P. Jay Modi, W-M. Shen, M. Tambe, and M. Yokoo. ADOPT: Asynchronous distributed constraint optimization with quality guarantees. Artificial Intelligence, 161(1-2): 149-180, 2005.
-
(2005)
Artificial Intelligence
, vol.161
, Issue.1-2
, pp. 149-180
-
-
Modi, P.J.1
Shen, W.-M.2
Tambe, M.3
Yokoo, M.4
-
37
-
-
0036573011
-
Distributed algorithms for multi-robot observation of multiple moving targets
-
L. E. Parker. Distributed algorithms for multi-robot observation of multiple moving targets. Autonomous Robots, 12(3):231-255, 2002.
-
(2002)
Autonomous Robots
, vol.12
, Issue.3
, pp. 231-255
-
-
Parker, L.E.1
-
39
-
-
0012646255
-
Learning to cooperate via policy search
-
Morgan Kaufmann Publishers
-
L. Peshkin, K.-E. Kim, N. Meuleau, and L. P. Kaelbling. Learning to cooperate via policy search. In Proceedings of Uncertainty in Artificial Intelligence (UAI), pages 489-496. Morgan Kaufmann Publishers, 2000.
-
(2000)
Proceedings of Uncertainty in Artificial Intelligence (UAI)
, pp. 489-496
-
-
Peshkin, L.1
Kim, K.-E.2
Meuleau, N.3
Kaelbling, L.P.4
-
41
-
-
1142292938
-
The communicative multiagent team decision problem: Analyzing teamwork theories and models
-
D. V. Pynadath and M. Tambe. The communicative multiagent team decision problem: Analyzing teamwork theories and models. Journal of Artificial Intelligence Research, 16:389-423, 2002.
-
(2002)
Journal of Artificial Intelligence Research
, vol.16
, pp. 389-423
-
-
Pynadath, D.V.1
Tambe, M.2
-
42
-
-
0001395498
-
Distributed value functions
-
Bled, Slovenia
-
J. Schneider, W.-K. Wong, A. Moore, and M. Riedmiller. Distributed value functions. In International Conference on Machine Learning (ICML), Bled, Slovenia, 1999.
-
(1999)
International Conference on Machine Learning (ICML)
-
-
Schneider, J.1
Wong, W.-K.2
Moore, A.3
Riedmiller, M.4
-
45
-
-
27544506565
-
Reinforcement learning for RoboCup-soccer keepaway
-
P. Stone, R. S. Sutton, and G. Kuhlmann. Reinforcement learning for RoboCup-soccer keepaway. Adaptive Behavior, 13(3): 165-188, 2005.
-
(2005)
Adaptive Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
47
-
-
0032096675
-
Multiagent systems
-
K. Sycara. Multiagent systems. AI Magazine, 19(2):79-92, 1998.
-
(1998)
AI Magazine
, vol.19
, Issue.2
, pp. 79-92
-
-
Sycara, K.1
-
48
-
-
85152198941
-
Multi-agent reinforcement learning: Independent vs. cooperative agents
-
Amherst, MA
-
M. Tan. Multi-agent reinforcement learning: Independent vs. cooperative agents. In International Conference on Machine Learning (ICML), Amherst, MA, 1993.
-
(1993)
International Conference on Machine Learning (ICML)
-
-
Tan, M.1
-
49
-
-
0029276036
-
Temporal difference learning and TD-Gammon
-
March
-
G. Tesauro. Temporal difference learning and TD-Gammon. Communications offne ACM, 38(3), March 1995.
-
(1995)
Communications Offne ACM
, vol.38
, Issue.3
-
-
Tesauro, G.1
-
51
-
-
15744395091
-
Anytime algorithms for multiagent decision making using coordination graphs
-
The Hague, The Netherlands, October
-
N, Vlassis, R. Elhorst, and J. R. Kok. Anytime algorithms for multiagent decision making using coordination graphs. In Proceedings of the International Conference on Systems, Man, and Cybernetics (SMC), The Hague, The Netherlands, October 2004.
-
(2004)
Proceedings of the International Conference on Systems, Man, and Cybernetics (SMC)
-
-
Vlassis, N.1
Elhorst, R.2
Kok, J.R.3
-
52
-
-
3943084089
-
Tree consistency and bounds on the performance of the max-product algorithm and its generalizations
-
April
-
M. J. Wainwright, T. S. Jaakkola, and A. S. Willsky. Tree consistency and bounds on the performance of the max-product algorithm and its generalizations. Statistics and Computing, 14: 143-166, April 2004.
-
(2004)
Statistics and Computing
, vol.14
, pp. 143-166
-
-
Wainwright, M.J.1
Jaakkola, T.S.2
Willsky, A.S.3
-
53
-
-
34249833101
-
Technical note: Q-learning
-
C. Watkins and P. Dayan. Technical note: Q-learning. Machine Learning, 8(3-4):279-292, 1992.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
55
-
-
0141695638
-
Understanding belief propagation and its generalizations
-
chapter 8. Morgan Kaufmann Publishers Inc., January
-
J. S. Yedidia, W. T. Freeman, and Y. Weiss. Understanding belief propagation and its generalizations. In Exploring Artificial Intelligence in the New Millennium, chapter 8, pages 239-269. Morgan Kaufmann Publishers Inc., January 2003.
-
(2003)
Exploring Artificial Intelligence in the New Millennium
, pp. 239-269
-
-
Yedidia, J.S.1
Freeman, W.T.2
Weiss, Y.3
-
56
-
-
1142305807
-
Distributed constraint optimization as a formal model of partially adversarial cooperation
-
University of Michigan, Ann Arbor, MI 48109
-
M. Yokoo and E. H. Durfee. Distributed constraint optimization as a formal model of partially adversarial cooperation. Technical Report CSE-TR-101-91, University of Michigan, Ann Arbor, MI 48109, 1991.
-
(1991)
Technical Report
, vol.CSE-TR-101-91
-
-
Yokoo, M.1
Durfee, E.H.2
|