-
1
-
-
0036565297
-
Coalition agents experiment: Multiagent cooperation in international coalitions
-
Allsopp, D. N., Beautement, P., Bradshaw, J. M., Durfee, E. H., Kirton, M., Knoblock, C. A., Suri, N., Tate, A., & Thompson, C. W. (2002). Coalition agents experiment: Multiagent cooperation in international coalitions. IEEE Intelligent Systems, 17(3), 26-35.
-
(2002)
IEEE Intelligent Systems
, vol.17
, Issue.3
, pp. 26-35
-
-
Allsopp, D.N.1
Beautement, P.2
Bradshaw, J.M.3
Durfee, E.H.4
Kirton, M.5
Knoblock, C.A.6
Suri, N.7
Tate, A.8
Thompson, C.W.9
-
2
-
-
0004142943
-
Direct gradient-based reinforcement learning: I. Gradient estimation algorithms
-
Research School of Information Science and Engineering, Australian National University
-
Baxter, J., & Bartlett, P. L. (1999). Direct gradient-based reinforcement learning: I. Gradient estimation algorithms. Tech. rep., Research School of Information Science and Engineering, Australian National University.
-
(1999)
Tech. Rep.
-
-
Baxter, J.1
Bartlett, P.L.2
-
3
-
-
1142293055
-
Transition-independent decentralized markov decision processes
-
Melbourne. ACM Press, New York
-
Becker, R., Zilberstein, S., Lesser, V., & Goldman, C. V. (2003). Transition-independent decentralized markov decision processes. In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2003), pp. 41-48, Melbourne. ACM Press, New York.
-
(2003)
Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2003)
, pp. 41-48
-
-
Becker, R.1
Zilberstein, S.2
Lesser, V.3
Goldman, C.V.4
-
4
-
-
0036874366
-
The complexity of decentralized control of markov decision processes
-
Bernstein, D., Givan, R., Immerman, N., & Zilberstein, S. (2002). The complexity of decentralized control of markov decision processes. Mathematics of Operations Research, 27(4), 819-840.
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
6
-
-
0000719863
-
Packet routing in dynamically changing networks: A reinforcement learning approach
-
Cowan, J. D., Tesauro, G., & Alspector, J. (Eds.)
-
Boyan, J. A., & Littman, M. L. (1993). Packet routing in dynamically changing networks: A reinforcement learning approach. In Cowan, J. D., Tesauro, G., & Alspector, J. (Eds.), Advances in Neural Information Processing Systems, Vol. 6, pp. 671-678.
-
(1993)
Advances in Neural Information Processing Systems
, vol.6
, pp. 671-678
-
-
Boyan, J.A.1
Littman, M.L.2
-
7
-
-
0002037135
-
Computational organisation theory
-
Weiss, G. (Ed.), The MIT Press, Cambridge, MA
-
Carley, K. M., & Gasser, L. (1999). Computational organisation theory. In Weiss, G. (Ed.), Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence, pp. 299-330. The MIT Press, Cambridge, MA.
-
(1999)
Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence
, pp. 299-330
-
-
Carley, K.M.1
Gasser, L.2
-
8
-
-
0000873984
-
AntNet: Distributed stigmergetic control for communications networks
-
Caro, G. D., & Dorigo, M. (1998a). AntNet: Distributed stigmergetic control for communications networks. Journal of Artificial Intelligence Research, 9, 317-365.
-
(1998)
Journal of Artificial Intelligence Research
, vol.9
, pp. 317-365
-
-
Caro, G.D.1
Dorigo, M.2
-
10
-
-
18544399451
-
Industrial applications of distributed AI
-
Chaib-draa, B. (1995). Industrial applications of distributed AI. Communications of the ACM, 38(11), 49-53.
-
(1995)
Communications of the ACM
, vol.38
, Issue.11
, pp. 49-53
-
-
Chaib-Draa, B.1
-
11
-
-
0000644584
-
Archon: A distributed artificial intelligence system for industrial applications
-
O'Hare, G. M. P., & Jennings, N. R. (Eds.), Wiley
-
Cockburn, D., & Jennings, N. R. (1996). Archon: A distributed artificial intelligence system for industrial applications. In O'Hare, G. M. P., & Jennings, N. R. (Eds.), Foundations of Distributed Artificial Intelligence, pp. 319-344. Wiley.
-
(1996)
Foundations of Distributed Artificial Intelligence
, pp. 319-344
-
-
Cockburn, D.1
Jennings, N.R.2
-
12
-
-
0001064543
-
Teamwork
-
Special Issue on Cognitive Science and Artificial Intelligence
-
Cohen, P. R., & Levesque, H. J. (1991). Teamwork. Nous, 55(4), 487-512. Special Issue on Cognitive Science and Artificial Intelligence.
-
(1991)
Nous
, vol.55
, Issue.4
, pp. 487-512
-
-
Cohen, P.R.1
Levesque, H.J.2
-
13
-
-
0004116989
-
-
chap. 24: Single Source Shortest Paths. MIT Press
-
Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2001). Introduction to algorithms (2nd edition)., chap. 24: Single Source Shortest Paths. MIT Press.
-
(2001)
Introduction to Algorithms (2nd Edition)
-
-
Cormen, T.H.1
Leiserson, C.E.2
Rivest, R.L.3
Stein, C.4
-
14
-
-
0001700825
-
TAEMS: A framework for environment centered analysis and design of coordination mechanisms
-
O'Hare, G., &: Jennings, N. (Eds.), chap. 16. Wiley Inter-Science
-
Decker, K. (1995a). TAEMS: A framework for environment centered analysis and design of coordination mechanisms. In O'Hare, G., &: Jennings, N. (Eds.), Foundations of Distributed Artificial Intelligence, chap. 16. Wiley Inter-Science.
-
(1995)
Foundations of Distributed Artificial Intelligence
-
-
Decker, K.1
-
17
-
-
84948987049
-
-
Holonic and Multi-Agent Systems for Manufacturing, chap. Multiagent-based process planning and scheduling in context of supply chains, Springer-Verlag, Heidelberg
-
Denkena, B., Zwich, M., & Woelk, P. (2004). Holonic and Multi-Agent Systems for Manufacturing, Vol. 2744/2004 of Lecture Notes in Computer Science, chap. Multiagent-based process planning and scheduling in context of supply chains, pp. 100-109. Springer-Verlag, Heidelberg.
-
(2004)
Lecture Notes in Computer Science
, vol.2744
, Issue.2004
, pp. 100-109
-
-
Denkena, B.1
Zwich, M.2
Woelk, P.3
-
18
-
-
0026225005
-
Partial global planning: A coordination framework for distributed hypothesis formation
-
Durfee, E. H., & Lesser, V. (1991). Partial global planning: A coordination framework for distributed hypothesis formation. IEEE Transactions on Systems, Man, and Cybernetics, 21(5), 1167-1183.
-
(1991)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.21
, Issue.5
, pp. 1167-1183
-
-
Durfee, E.H.1
Lesser, V.2
-
19
-
-
31144452335
-
Cooperative information sharing to improve distributed learning
-
Dutta, P. S., Dasmahapatra, S., Gunn, S. R., Jennings, N. R., & Moreau, L. (2004). Cooperative information sharing to improve distributed learning. In AAMAS-04 workshop on Learning and Evolution in Agent-Based Systems, pp. 18-23.
-
(2004)
AAMAS-04 Workshop on Learning and Evolution in Agent-Based Systems
, pp. 18-23
-
-
Dutta, P.S.1
Dasmahapatra, S.2
Gunn, S.R.3
Jennings, N.R.4
Moreau, L.5
-
21
-
-
26844556873
-
Finding interaction partners using cognition-based decision strategies
-
Dutta, P. S., Moreau, L., & Jennings, N. R. (2003). Finding interaction partners using cognition-based decision strategies. In Working notes of the IJCAI-2003 workshop on Cognitive Modeling of Agents and Multi-Agent Interactions, pp. 46-55.
-
(2003)
Working Notes of the IJCAI-2003 Workshop on Cognitive Modeling of Agents and Multi-Agent Interactions
, pp. 46-55
-
-
Dutta, P.S.1
Moreau, L.2
Jennings, N.R.3
-
22
-
-
2442557843
-
Forming stable partnerships
-
Dutta, P. S., & Sen, S. (2003). Forming stable partnerships. Cognitive Systems Research, 4(3), 211-221.
-
(2003)
Cognitive Systems Research
, vol.4
, Issue.3
, pp. 211-221
-
-
Dutta, P.S.1
Sen, S.2
-
23
-
-
1442265466
-
Power systems stability control : Reinforcement learning framework
-
Ernst, D., Glavic, M., & Wehenkel, L. (2004). Power systems stability control : Reinforcement learning framework. IEEE Transactions on Power Systems, 19(1), 427-435.
-
(2004)
IEEE Transactions on Power Systems
, vol.19
, Issue.1
, pp. 427-435
-
-
Ernst, D.1
Glavic, M.2
Wehenkel, L.3
-
25
-
-
0030263767
-
Collaborative plans for complex group action
-
Grosz, B., & Kraus, S. (1996). Collaborative plans for complex group action. Artificial Intelligence, 86(2), 269-357.
-
(1996)
Artificial Intelligence
, vol.86
, Issue.2
, pp. 269-357
-
-
Grosz, B.1
Kraus, S.2
-
28
-
-
84974223844
-
Commitments and conventions: The foundation of coordination in multi-agent systems
-
Jennings, N. R. (1993). Commitments and conventions: The foundation of coordination in multi-agent systems. The Knowledge Engineering Review, 8(3), 223-250.
-
(1993)
The Knowledge Engineering Review
, vol.8
, Issue.3
, pp. 223-250
-
-
Jennings, N.R.1
-
29
-
-
0029326031
-
Controlling cooperative problem solving in industrial multi-agent systems using joint intentions
-
Jennings, N. R. (1995). Controlling cooperative problem solving in industrial multi-agent systems using joint intentions. Artificial Intelligence, 75(2), 195-240.
-
(1995)
Artificial Intelligence
, vol.75
, Issue.2
, pp. 195-240
-
-
Jennings, N.R.1
-
30
-
-
0037701292
-
Agent-based control systems
-
Jennings, N. R., & Bussmann, S. (2003). Agent-based control systems. IEEE Control Systems Magazine, 23(3), 61-74.
-
(2003)
IEEE Control Systems Magazine
, vol.23
, Issue.3
, pp. 61-74
-
-
Jennings, N.R.1
Bussmann, S.2
-
31
-
-
0012496995
-
Adept: An agent-based approach to business process management
-
Jennings, N. R., Norman, T. J., & Faratin, P. (1998). Adept: An agent-based approach to business process management. ACM SIGMOD, 27(4), 32-39.
-
(1998)
ACM SIGMOD
, vol.27
, Issue.4
, pp. 32-39
-
-
Jennings, N.R.1
Norman, T.J.2
Faratin, P.3
-
32
-
-
31144461730
-
-
O'Reilly Wireless Devcenter
-
Krag, T., &: Buettrich, S. (2004). Wireless Mesh Networking. O'Reilly Wireless Devcenter. http://www.oreillynet.com/pub/a/wireless/2004/01/22/wirelessmesh.html.
-
(2004)
Wireless Mesh Networking
-
-
Krag, T.1
Buettrich, S.2
-
34
-
-
31144442004
-
Distributed interpretation: A model and experiment
-
Bond, A. H., & Gasser, L. (Eds.), Morgan Kaufmann, San Mateo, CA
-
Lesser, V. R., &: Erman, L. D. (1988). Distributed interpretation: A model and experiment. In Bond, A. H., & Gasser, L. (Eds.), Readings in Distributed Artificial Intelligence, pp. 120-139. Morgan Kaufmann, San Mateo, CA.
-
(1988)
Readings in Distributed Artificial Intelligence
, pp. 120-139
-
-
Lesser, V.R.1
Erman, L.D.2
-
35
-
-
0001963197
-
Self-improving factory simulation using continuous-time average reward reinforcement learning
-
Morgan Kaufmann
-
Mahadevan, S., Marchalleck, N., Das, T., & Gosavi, A. (1997). Self-improving factory simulation using continuous-time average reward reinforcement learning. In Proceedings of the Fourteenth International Machine Learning Conference, pp. 202-210. Morgan Kaufmann.
-
(1997)
Proceedings of the Fourteenth International Machine Learning Conference
, pp. 202-210
-
-
Mahadevan, S.1
Marchalleck, N.2
Das, T.3
Gosavi, A.4
-
36
-
-
1142293032
-
Cooperative negotiation for soft real-time distributed resource allocation
-
Melbourne. ACM Press
-
Mailler, R., Lesser, V., & Horling, B. (2003). Cooperative negotiation for soft real-time distributed resource allocation. In Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003), pp. 576-583, Melbourne. ACM Press.
-
(2003)
Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003)
, pp. 576-583
-
-
Mailler, R.1
Lesser, V.2
Horling, B.3
-
39
-
-
0030168518
-
Hidden state and reinforcement learning with instance-based state identification
-
Cybernetics 26
-
McCallum, R. A. (1996). Hidden state and reinforcement learning with instance-based state identification. IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics 26), 464-473.
-
(1996)
IEEE Transactions on Systems, Man and Cybernetics, Part B
, pp. 464-473
-
-
McCallum, R.A.1
-
41
-
-
0004255908
-
-
chap. 13: Reinforcement Learning. McGraw-Hill
-
Mitchell, T. M. (1997). Machine Learning, chap. 13: Reinforcement Learning. McGraw-Hill.
-
(1997)
Machine Learning
-
-
Mitchell, T.M.1
-
42
-
-
0342855202
-
The aaria agent architecture: From manufacturing requirements to agent-based system design
-
Parunak, H. V. D., Baker, A. D., & Clark, S. J. (2001). The aaria agent architecture: From manufacturing requirements to agent-based system design. Integrated Computer-Aided Engineering, 8(1), 45-58.
-
(2001)
Integrated Computer-Aided Engineering
, vol.8
, Issue.1
, pp. 45-58
-
-
Parunak, H.V.D.1
Baker, A.D.2
Clark, S.J.3
-
44
-
-
0042172300
-
Multifractal cross-traffic estimation
-
Ribeiro, V., Coates, M., Riedi, R., Sarvotham, S., Hendricks, B., & Baraniuk, R. (2000). Multifractal cross-traffic estimation. In ITC Conference on IP Traffic, Modeling and Management.
-
(2000)
ITC Conference on IP Traffic, Modeling and Management
-
-
Ribeiro, V.1
Coates, M.2
Riedi, R.3
Sarvotham, S.4
Hendricks, B.5
Baraniuk, R.6
-
47
-
-
1242265508
-
Minimizing communication cost in a distributed Bayesian network using a decentralized MDP
-
ACM Press
-
Shen, J., Lesser, V., & Carver, N. (2003). Minimizing communication cost in a distributed Bayesian network using a decentralized MDP. In Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003), pp. 678-685. ACM Press.
-
(2003)
Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003)
, pp. 678-685
-
-
Shen, J.1
Lesser, V.2
Carver, N.3
-
48
-
-
85153965130
-
Reinforcement learning with soft state aggregation
-
Tesauro, G., Touretzky, D., & Leen, T. (Eds.), The MIT Press
-
Singh, S. P., Jaakkola, T., & Jordan, M. I. (1995). Reinforcement learning with soft state aggregation. In Tesauro, G., Touretzky, D., & Leen, T. (Eds.), Advances in Neural Information Processing Systems, Vol. 7, pp. 361-368. The MIT Press.
-
(1995)
Advances in Neural Information Processing Systems
, vol.7
, pp. 361-368
-
-
Singh, S.P.1
Jaakkola, T.2
Jordan, M.I.3
-
49
-
-
18144424551
-
TPOT-RL applied to network routing
-
Stone, P. (2000). TPOT-RL applied to network routing. In Proceedings of ICML 2000, pp. 935-942.
-
(2000)
Proceedings of ICML 2000
, pp. 935-942
-
-
Stone, P.1
-
51
-
-
84898939480
-
Policy-gradient methods for reinforcement learning with function approximation
-
Sutton, R., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy-gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems, 12, 1057-1063.
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 1057-1063
-
-
Sutton, R.1
McAllester, D.2
Singh, S.3
Mansour, Y.4
-
52
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
54
-
-
0347090472
-
Auction-based effective bandwidth allocation mechanism
-
Takahashi, E., & Tanaka, Y. (2003). Auction-based effective bandwidth allocation mechanism. Telecommunications Systems, 24(2), 323-338.
-
(2003)
Telecommunications Systems
, vol.24
, Issue.2
, pp. 323-338
-
-
Takahashi, E.1
Tanaka, Y.2
-
56
-
-
0004141908
-
-
chap. 5: The Network Layer. Prentice Hall PTR
-
th edition)., chap. 5: The Network Layer. Prentice Hall PTR.
-
(2003)
th Edition)
-
-
Tanenbaum, A.S.1
-
57
-
-
0036832958
-
Reinforcement learning for call admission control and routing under quality of service constraints in multimedia networks
-
Tong, H. (2002). Reinforcement learning for call admission control and routing under quality of service constraints in multimedia networks. Machine Learning, 49(2), 111-139.
-
(2002)
Machine Learning
, vol.49
, Issue.2
, pp. 111-139
-
-
Tong, H.1
-
58
-
-
0030735427
-
Distributed detection with multiple sensors: Part i - Fundamentals
-
Viswanathan, R., & Varshney, P. K. (1997). Distributed detection with multiple sensors: Part I - fundamentals. In Proceedings of the IEEE, Vol. 85-1, pp. 54-63.
-
(1997)
Proceedings of the IEEE
, vol.85
, Issue.1
, pp. 54-63
-
-
Viswanathan, R.1
Varshney, P.K.2
-
59
-
-
9544239260
-
Decentralized supply chain formation: A market protocol and competitive equilibrium analysis
-
Walsh, W. E., & Wellman, M. P. (2003). Decentralized supply chain formation: A market protocol and competitive equilibrium analysis. Journal of Artificial Intelligence Research, 19, 513-567.
-
(2003)
Journal of Artificial Intelligence Research
, vol.19
, pp. 513-567
-
-
Walsh, W.E.1
Wellman, M.P.2
-
60
-
-
0004049893
-
-
Ph.D. thesis, Psychology Department, University of Cambridge
-
Watkins, C. J. C. H. (1989). Learning from delayed rewards. Ph.D. thesis, Psychology Department, University of Cambridge.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
-
63
-
-
0000337576
-
Simple statistical gradient-following algorithms for connectionist reinforcement learning
-
Williams, R. J. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3), 229-256.
-
(1992)
Machine Learning
, vol.8
, Issue.3
, pp. 229-256
-
-
Williams, R.J.1
-
64
-
-
0030284198
-
A probabilistic framework for cooperative multi-agent distributed interpretation and optimization of communication
-
Xiang, Y. (1996). A probabilistic framework for cooperative multi-agent distributed interpretation and optimization of communication. Artificial Intelligence, 87(1-2), 295-342.
-
(1996)
Artificial Intelligence
, vol.87
, Issue.1-2
, pp. 295-342
-
-
Xiang, Y.1
-
65
-
-
0034827257
-
Communication decisions in multi-agent cooperation: Model and experiments
-
Montreal
-
Xuan, P., Lesser, V., & Zilberstein, S. (2001). Communication decisions in multi-agent cooperation: model and experiments. In Proceedings of the Fifth International Conference on Autonomous Agents (Agents-01), pp. 616-623, Montreal.
-
(2001)
Proceedings of the Fifth International Conference on Autonomous Agents (Agents-01)
, pp. 616-623
-
-
Xuan, P.1
Lesser, V.2
Zilberstein, S.3
|