SCOPUS Information Search Platform

Journal of Artificial Intelligence Research

Volume 24, 2005, Pages 407-463

Cooperative information sharing to improve distributed learning in multi-agent systems

(3) Dutta, Partha S. (a); Jennings, Nicholas R. (a); Moreau, Luc (a)

(a) University of Southampton (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

BENCHMARKING; HEURISTIC METHODS; INFORMATION ANALYSIS; LEARNING ALGORITHMS; MULTI AGENT SYSTEMS; ROUTERS;

BENCHMARK STRATEGIES; DISTRIBUTED LEARNING; INFORMATION-SHARING PROTOCOLS; STATE INFORMATION;

LEARNING SYSTEMS;

EID: 31144432283 PISSN: 10769757 EISSN: 10769757 Source Type: Journal
DOI: 10.1613/jair.1735 Document Type: Article

Times cited : (23)

References (65)

1
- 0036565297
- Coalition agents experiment: Multiagent cooperation in international coalitions
- Allsopp, D. N., Beautement, P., Bradshaw, J. M., Durfee, E. H., Kirton, M., Knoblock, C. A., Suri, N., Tate, A., & Thompson, C. W. (2002). Coalition agents experiment: Multiagent cooperation in international coalitions. IEEE Intelligent Systems, 17(3), 26-35.
- (2002) IEEE Intelligent Systems , vol.17 , Issue.3 , pp. 26-35
- Allsopp, D.N.¹ Beautement, P.² Bradshaw, J.M.³ Durfee, E.H.⁴ Kirton, M.⁵ Knoblock, C.A.⁶ Suri, N.⁷ Tate, A.⁸ Thompson, C.W.⁹

2
- 0004142943
- Direct gradient-based reinforcement learning: I. Gradient estimation algorithms
- Research School of Information Science and Engineering, Australian National University
- Baxter, J., & Bartlett, P. L. (1999). Direct gradient-based reinforcement learning: I. Gradient estimation algorithms. Tech. rep., Research School of Information Science and Engineering, Australian National University.
- (1999) Tech. Rep.
- Baxter, J.¹ Bartlett, P.L.²

3
- 1142293055
- Transition-independent decentralized Markov decision processes
- Melbourne. ACM Press, New York
- Becker, R., Zilberstein, S., Lesser, V., & Goldman, C. V. (2003). Transition-independent decentralized Markov decision processes. In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2003), pp. 41-48, Melbourne. ACM Press, New York.
- (2003) Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2003) , pp. 41-48
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

4
- 0036874366
- The complexity of decentralized control of Markov decision processes
- Bernstein, D., Givan, R., Immerman, N., & Zilberstein, S. (2002). The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research, 27(4), 819-840.
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

5
- 84880690163
- Sequential optimality and coordination in multiagent systems
- Stockholm
- Boutilier, C. (1999). Sequential optimality and coordination in multiagent systems. In Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99), pp. 478-485, Stockholm.
- (1999) Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (IJCAI-99) , pp. 478-485
- Boutilier, C.¹

6
- 0000719863
- Packet routing in dynamically changing networks: A reinforcement learning approach
- Cowan, J. D., Tesauro, G., & Alspector, J. (Eds.)
- Boyan, J. A., & Littman, M. L. (1993). Packet routing in dynamically changing networks: A reinforcement learning approach. In Cowan, J. D., Tesauro, G., & Alspector, J. (Eds.), Advances in Neural Information Processing Systems, Vol. 6, pp. 671-678.
- (1993) Advances in Neural Information Processing Systems , vol.6 , pp. 671-678
- Boyan, J.A.¹ Littman, M.L.²

7
- 0002037135
- Computational organisation theory
- Weiss, G. (Ed.), The MIT Press, Cambridge, MA
- Carley, K. M., & Gasser, L. (1999). Computational organisation theory. In Weiss, G. (Ed.), Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence, pp. 299-330. The MIT Press, Cambridge, MA.
- (1999) Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence , pp. 299-330
- Carley, K.M.¹ Gasser, L.²

8
- 0000873984
- AntNet: Distributed stigmergetic control for communications networks
- Caro, G. D., & Dorigo, M. (1998a). AntNet: Distributed stigmergetic control for communications networks. Journal of Artificial Intelligence Research, 9, 317-365.
- (1998) Journal of Artificial Intelligence Research , vol.9 , pp. 317-365
- Caro, G.D.¹ Dorigo, M.²

9
- 0003854897
- Two ant colony algorithms for best-effort routing in datagram networks
- IASTED/ACTA Press
- Caro, G. D., & Dorigo, M. (1998b). Two ant colony algorithms for best-effort routing in datagram networks. In Proceedings of the Tenth IASTED International Conference on Parallel and Distributed Computing and Systems, pp. 541-546. IASTED/ACTA Press.
- (1998) Proceedings of the Tenth IASTED International Conference on Parallel and Distributed Computing and Systems , pp. 541-546
- Caro, G.D.¹ Dorigo, M.²

10
- 18544399451
- Industrial applications of distributed AI
- Chaib-draa, B. (1995). Industrial applications of distributed AI. Communications of the ACM, 38(11), 49-53.
- (1995) Communications of the ACM , vol.38 , Issue.11 , pp. 49-53
- Chaib-Draa, B.¹

11
- 0000644584
- Archon: A distributed artificial intelligence system for industrial applications
- O'Hare, G. M. P., & Jennings, N. R. (Eds.), Wiley
- Cockburn, D., & Jennings, N. R. (1996). Archon: A distributed artificial intelligence system for industrial applications. In O'Hare, G. M. P., & Jennings, N. R. (Eds.), Foundations of Distributed Artificial Intelligence, pp. 319-344. Wiley.
- (1996) Foundations of Distributed Artificial Intelligence , pp. 319-344
- Cockburn, D.¹ Jennings, N.R.²

12
- 0001064543
- Teamwork
- Special Issue on Cognitive Science and Artificial Intelligence
- Cohen, P. R., & Levesque, H. J. (1991). Teamwork. Nous, 55(4), 487-512. Special Issue on Cognitive Science and Artificial Intelligence.
- (1991) Nous , vol.55 , Issue.4 , pp. 487-512
- Cohen, P.R.¹ Levesque, H.J.²

13
- 0004116989
- chap. 24: Single Source Shortest Paths. MIT Press
- Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2001). Introduction to algorithms (2nd edition)., chap. 24: Single Source Shortest Paths. MIT Press.
- (2001) Introduction to Algorithms (2nd Edition)
- Cormen, T.H.¹ Leiserson, C.E.² Rivest, R.L.³ Stein, C.⁴

14
- 0001700825
- TAEMS: A framework for environment centered analysis and design of coordination mechanisms
- O'Hare, G., & Jennings, N. (Eds.), chap. 16. Wiley Inter-Science
- Decker, K. (1995a). TAEMS: A framework for environment centered analysis and design of coordination mechanisms. In O'Hare, G., & Jennings, N. (Eds.), Foundations of Distributed Artificial Intelligence, chap. 16. Wiley Inter-Science.
- (1995) Foundations of Distributed Artificial Intelligence
- Decker, K.¹

15
- 0003994274
- Ph.D. thesis, University of Massachusetts, Amherst, Massachusetts
- Decker, K. S. (1995b). Environment centered analysis and design of coordination mechanisms. Ph.D. thesis, University of Massachusetts, Amherst, Massachusetts.
- (1995) Environment Centered Analysis and Design of Coordination Mechanisms
- Decker, K.S.¹

16
- 0001860486
- Designing a family of coordination algorithms
- San Francisco
- Decker, K. S., & Lesser, V. R. (1995). Designing a family of coordination algorithms. In Proceedings of the First International Conference on Multi-agent Systems, pp. 73-80, San Francisco.
- (1995) Proceedings of the First International Conference on Multi-agent Systems , pp. 73-80
- Decker, K.S.¹ Lesser, V.R.²

17
- 84948987049
- Holonic and Multi-Agent Systems for Manufacturing, chap. Multiagent-based process planning and scheduling in context of supply chains, Springer-Verlag, Heidelberg
- Denkena, B., Zwich, M., & Woelk, P. (2004). Holonic and Multi-Agent Systems for Manufacturing, Vol. 2744/2004 of Lecture Notes in Computer Science, chap. Multiagent-based process planning and scheduling in context of supply chains, pp. 100-109. Springer-Verlag, Heidelberg.
- (2004) Lecture Notes in Computer Science , vol.2744 , Issue.2004 , pp. 100-109
- Denkena, B.¹ Zwich, M.² Woelk, P.³

18
- 0026225005
- Partial global planning: A coordination framework for distributed hypothesis formation
- Durfee, E. H., & Lesser, V. (1991). Partial global planning: A coordination framework for distributed hypothesis formation. IEEE Transactions on Systems, Man, and Cybernetics, 21(5), 1167-1183.
- (1991) IEEE Transactions on Systems, Man, and Cybernetics , vol.21 , Issue.5 , pp. 1167-1183
- Durfee, E.H.¹ Lesser, V.²

19
- 31144452335
- Cooperative information sharing to improve distributed learning
- Dutta, P. S., Dasmahapatra, S., Gunn, S. R., Jennings, N. R., & Moreau, L. (2004). Cooperative information sharing to improve distributed learning. In AAMAS-04 workshop on Learning and Evolution in Agent-Based Systems, pp. 18-23.
- (2004) AAMAS-04 Workshop on Learning and Evolution in Agent-Based Systems , pp. 18-23
- Dutta, P.S.¹ Dasmahapatra, S.² Gunn, S.R.³ Jennings, N.R.⁴ Moreau, L.⁵

20
- 33644814145
- Sharing information for Q-learning-based network bandwidth estimation and network failure detection (poster)
- to appear
- Dutta, P. S., Jennings, N. R., & Moreau, L. (2005). Sharing information for Q-learning-based network bandwidth estimation and network failure detection (poster). In Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multiagent Systems (to appear).
- (2005) Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multiagent Systems
- Dutta, P.S.¹ Jennings, N.R.² Moreau, L.³

21
- 26844556873
- Finding interaction partners using cognition-based decision strategies
- Dutta, P. S., Moreau, L., & Jennings, N. R. (2003). Finding interaction partners using cognition-based decision strategies. In Working notes of the IJCAI-2003 workshop on Cognitive Modeling of Agents and Multi-Agent Interactions, pp. 46-55.
- (2003) Working Notes of the IJCAI-2003 Workshop on Cognitive Modeling of Agents and Multi-Agent Interactions , pp. 46-55
- Dutta, P.S.¹ Moreau, L.² Jennings, N.R.³

22
- 2442557843
- Forming stable partnerships
- Dutta, P. S., & Sen, S. (2003). Forming stable partnerships. Cognitive Systems Research, 4(3), 211-221.
- (2003) Cognitive Systems Research , vol.4 , Issue.3 , pp. 211-221
- Dutta, P.S.¹ Sen, S.²

23
- 1442265466
- Power systems stability control: Reinforcement learning framework
- Ernst, D., Glavic, M., & Wehenkel, L. (2004). Power systems stability control: Reinforcement learning framework. IEEE Transactions on Power Systems, 19(1), 427-435.
- (2004) IEEE Transactions on Power Systems , vol.19 , Issue.1 , pp. 427-435
- Ernst, D.¹ Glavic, M.² Wehenkel, L.³

24
- 0041648459
- Kluwer Academic Publishers
- Feinberg, E., & Schwartz, A. (2001). Handbook of Markov Decision Processes: Models and Applications. Kluwer Academic Publishers.
- (2001) Handbook of Markov Decision Processes: Models and Applications
- Feinberg, E.¹ Schwartz, A.²

25
- 0030263767
- Collaborative plans for complex group action
- Grosz, B., & Kraus, S. (1996). Collaborative plans for complex group action. Artificial Intelligence, 86(2), 269-357.
- (1996) Artificial Intelligence , vol.86 , Issue.2 , pp. 269-357
- Grosz, B.¹ Kraus, S.²

26
- 0003919801
- Network Working Group, IETF.
- Hedrick, C. (1988). Routing Information Protocol (RFC 1058). Network Working Group, IETF. http://www.ietf.org/rfc/rfc1058.txt.
- (1988) Routing Information Protocol (RFC 1058)
- Hedrick, C.¹

27
- 0043174465
- Pathload: A measurement tool for end-to-end available bandwidth
- Jain, M., & Dovrolis, C. (2002). Pathload: A measurement tool for end-to-end available bandwidth. In Proceedings of the Passive and Active Measurements (PAM) Workshop, pp. 14-25.
- (2002) Proceedings of the Passive and Active Measurements (PAM) Workshop , pp. 14-25
- Jain, M.¹ Dovrolis, C.²

28
- 84974223844
- Commitments and conventions: The foundation of coordination in multi-agent systems
- Jennings, N. R. (1993). Commitments and conventions: The foundation of coordination in multi-agent systems. The Knowledge Engineering Review, 8(3), 223-250.
- (1993) The Knowledge Engineering Review , vol.8 , Issue.3 , pp. 223-250
- Jennings, N.R.¹

29
- 0029326031
- Controlling cooperative problem solving in industrial multi-agent systems using joint intentions
- Jennings, N. R. (1995). Controlling cooperative problem solving in industrial multi-agent systems using joint intentions. Artificial Intelligence, 75(2), 195-240.
- (1995) Artificial Intelligence , vol.75 , Issue.2 , pp. 195-240
- Jennings, N.R.¹

30
- 0037701292
- Agent-based control systems
- Jennings, N. R., & Bussmann, S. (2003). Agent-based control systems. IEEE Control Systems Magazine, 23(3), 61-74.
- (2003) IEEE Control Systems Magazine , vol.23 , Issue.3 , pp. 61-74
- Jennings, N.R.¹ Bussmann, S.²

31
- 0012496995
- Adept: An agent-based approach to business process management
- Jennings, N. R., Norman, T. J., & Faratin, P. (1998). Adept: An agent-based approach to business process management. ACM SIGMOD, 27(4), 32-39.
- (1998) ACM SIGMOD , vol.27 , Issue.4 , pp. 32-39
- Jennings, N.R.¹ Norman, T.J.² Faratin, P.³

32
- 31144461730
- O'Reilly Wireless Devcenter
- Krag, T., & Buettrich, S. (2004). Wireless Mesh Networking. O'Reilly Wireless Devcenter. http://www.oreillynet.com/pub/a/wireless/2004/01/22/wirelessmesh.html.
- (2004) Wireless Mesh Networking
- Krag, T.¹ Buettrich, S.²

33
- 0032674479
- Measuring bandwidth
- Lai, K., & Baker, M. (1999). Measuring bandwidth. In Proceedings of the IEEE INFOCOM (1), pp. 235-245.
- (1999) Proceedings of the IEEE INFOCOM , Issue.1 , pp. 235-245
- Lai, K.¹ Baker, M.²

34
- 31144442004
- Distributed interpretation: A model and experiment
- Bond, A. H., & Gasser, L. (Eds.), Morgan Kaufmann, San Mateo, CA
- Lesser, V. R., & Erman, L. D. (1988). Distributed interpretation: A model and experiment. In Bond, A. H., & Gasser, L. (Eds.), Readings in Distributed Artificial Intelligence, pp. 120-139. Morgan Kaufmann, San Mateo, CA.
- (1988) Readings in Distributed Artificial Intelligence , pp. 120-139
- Lesser, V.R.¹ Erman, L.D.²

35
- 0001963197
- Self-improving factory simulation using continuous-time average reward reinforcement learning
- Morgan Kaufmann
- Mahadevan, S., Marchalleck, N., Das, T., & Gosavi, A. (1997). Self-improving factory simulation using continuous-time average reward reinforcement learning. In Proceedings of the Fourteenth International Machine Learning Conference, pp. 202-210. Morgan Kaufmann.
- (1997) Proceedings of the Fourteenth International Machine Learning Conference , pp. 202-210
- Mahadevan, S.¹ Marchalleck, N.² Das, T.³ Gosavi, A.⁴

36
- 1142293032
- Cooperative negotiation for soft real-time distributed resource allocation
- Melbourne. ACM Press
- Mailler, R., Lesser, V., & Horling, B. (2003). Cooperative negotiation for soft real-time distributed resource allocation. In Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003), pp. 576-583, Melbourne. ACM Press.
- (2003) Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003) , pp. 576-583
- Mailler, R.¹ Lesser, V.² Horling, B.³

37
- 0024122396
- Managing the flow of intelligent parts
- Maley, J. (1988). Managing the flow of intelligent parts. Robotics and Computer-Integrated Manufacturing, 4(3/4), 525-530.
- (1988) Robotics and Computer-Integrated Manufacturing , vol.4 , Issue.3-4 , pp. 525-530
- Maley, J.¹

38
- 84967417109
- John Wiley & Sons
- March, J. G., & Simon, H. A. (1958). Organizations. John Wiley & Sons.
- (1958) Organizations
- March, J.G.¹ Simon, H.A.²

39
- 0030168518
- Hidden state and reinforcement learning with instance-based state identification
- Cybernetics 26
- McCallum, R. A. (1996). Hidden state and reinforcement learning with instance-based state identification. IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics 26), 464-473.
- (1996) IEEE Transactions on Systems, Man and Cybernetics, Part B , pp. 464-473
- McCallum, R.A.¹

40
- 0003234432
- Cooperating mobile agents for dynamic network routing
- Springer-Verlag. ISBN 3-540-65578-6
- Minar, N., Kramer, K. H., & Maes, P. (1999). Cooperating mobile agents for dynamic network routing. In Proceedings of the Software Agents for Future Communications Systems. Springer-Verlag. ISBN 3-540-65578-6.
- (1999) Proceedings of the Software Agents for Future Communications Systems
- Minar, N.¹ Kramer, K.H.² Maes, P.³

41
- 0004255908
- chap. 13: Reinforcement Learning. McGraw-Hill
- Mitchell, T. M. (1997). Machine Learning, chap. 13: Reinforcement Learning. McGraw-Hill.
- (1997) Machine Learning
- Mitchell, T.M.¹

42
- 0342855202
- The AARIA agent architecture: From manufacturing requirements to agent-based system design
- Parunak, H. V. D., Baker, A. D., & Clark, S. J. (2001). The AARIA agent architecture: From manufacturing requirements to agent-based system design. Integrated Computer-Aided Engineering, 8(1), 45-58.
- (2001) Integrated Computer-Aided Engineering , vol.8 , Issue.1 , pp. 45-58
- Parunak, H.V.D.¹ Baker, A.D.² Clark, S.J.³

43
- 0036082856
- Reinforcement learning for adaptive routing
- Peshkin, L., & Savova, V. (2002). Reinforcement learning for adaptive routing. In Proceedings of the International Joint Conference on Neural Networks (IJCNN).
- (2002) Proceedings of the International Joint Conference on Neural Networks (IJCNN)
- Peshkin, L.¹ Savova, V.²

44
- 0042172300
- Multifractal cross-traffic estimation
- Ribeiro, V., Coates, M., Riedi, R., Sarvotham, S., Hendricks, B., & Baraniuk, R. (2000). Multifractal cross-traffic estimation. In ITC Conference on IP Traffic, Modeling and Management.
- (2000) ITC Conference on IP Traffic, Modeling and Management
- Ribeiro, V.¹ Coates, M.² Riedi, R.³ Sarvotham, S.⁴ Hendricks, B.⁵ Baraniuk, R.⁶

45
- 0030687849
- COLLAGEN: When agents collaborate with people
- Rich, C., & Sidner, C. L. (1997). COLLAGEN: When agents collaborate with people. In Proceedings of the First International Conference on Autonomous Agents (Agents '97), pp. 284-291.
- (1997) Proceedings of the First International Conference on Autonomous Agents (Agents '97) , pp. 284-291
- Rich, C.¹ Sidner, C.L.²

46
- 0003584577
- chap. 17: Making Complex Decisions. Prentice Hall
- Russell, S. J., & Norvig, P. (2002). Artificial Intelligence: A Modern Approach (2nd edition)., chap. 17: Making Complex Decisions. Prentice Hall.
- (2002) Artificial Intelligence: A Modern Approach (2nd Edition)
- Russell, S.J.¹ Norvig, P.²

47
- 1242265508
- Minimizing communication cost in a distributed Bayesian network using a decentralized MDP
- ACM Press
- Shen, J., Lesser, V., & Carver, N. (2003). Minimizing communication cost in a distributed Bayesian network using a decentralized MDP. In Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003), pp. 678-685. ACM Press.
- (2003) Proceedings of the Second International Joint Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2003) , pp. 678-685
- Shen, J.¹ Lesser, V.² Carver, N.³

48
- 85153965130
- Reinforcement learning with soft state aggregation
- Tesauro, G., Touretzky, D., & Leen, T. (Eds.), The MIT Press
- Singh, S. P., Jaakkola, T., & Jordan, M. I. (1995). Reinforcement learning with soft state aggregation. In Tesauro, G., Touretzky, D., & Leen, T. (Eds.), Advances in Neural Information Processing Systems, Vol. 7, pp. 361-368. The MIT Press.
- (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 361-368
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

49
- 18144424551
- TPOT-RL applied to network routing
- Stone, P. (2000). TPOT-RL applied to network routing. In Proceedings of ICML 2000, pp. 935-942.
- (2000) Proceedings of ICML 2000 , pp. 935-942
- Stone, P.¹

50
- 0032645144
- Team partitioned, opaque transition reinforcement learning
- Stone, P., & Veloso, M. (1999). Team partitioned, opaque transition reinforcement learning. In Proceedings of the Third Annual Conference on Autonomous Agents, pp. 206-212.
- (1999) Proceedings of the Third Annual Conference on Autonomous Agents , pp. 206-212
- Stone, P.¹ Veloso, M.²

51
- 84898939480
- Policy-gradient methods for reinforcement learning with function approximation
- Sutton, R., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy-gradient methods for reinforcement learning with function approximation. Advances in Neural Information Processing Systems, 12, 1057-1063.
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

52
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R. S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

53
- 0004102479
- MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning). MIT Press.
- (1998) Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning)
- Sutton, R.S.¹ Barto, A.G.²

54
- 0347090472
- Auction-based effective bandwidth allocation mechanism
- Takahashi, E., & Tanaka, Y. (2003). Auction-based effective bandwidth allocation mechanism. Telecommunications Systems, 24(2), 323-338.
- (2003) Telecommunications Systems , vol.24 , Issue.2 , pp. 323-338
- Takahashi, E.¹ Tanaka, Y.²

55
- 0001936250
- Towards flexible teamwork
- Tambe, M. (1997). Towards flexible teamwork. Journal of Artificial Intelligence Research, 7, 83-124.
- (1997) Journal of Artificial Intelligence Research , vol.7 , pp. 83-124
- Tambe, M.¹

56
- 0004141908
- chap. 5: The Network Layer. Prentice Hall PTR
- th edition)., chap. 5: The Network Layer. Prentice Hall PTR.
- (2003) th Edition)
- Tanenbaum, A.S.¹

57
- 0036832958
- Reinforcement learning for call admission control and routing under quality of service constraints in multimedia networks
- Tong, H. (2002). Reinforcement learning for call admission control and routing under quality of service constraints in multimedia networks. Machine Learning, 49(2), 111-139.
- (2002) Machine Learning , vol.49 , Issue.2 , pp. 111-139
- Tong, H.¹

58
- 0030735427
- Distributed detection with multiple sensors: Part I - Fundamentals
- Viswanathan, R., & Varshney, P. K. (1997). Distributed detection with multiple sensors: Part I - fundamentals. In Proceedings of the IEEE, Vol. 85-1, pp. 54-63.
- (1997) Proceedings of the IEEE , vol.85 , Issue.1 , pp. 54-63
- Viswanathan, R.¹ Varshney, P.K.²

59
- 9544239260
- Decentralized supply chain formation: A market protocol and competitive equilibrium analysis
- Walsh, W. E., & Wellman, M. P. (2003). Decentralized supply chain formation: A market protocol and competitive equilibrium analysis. Journal of Artificial Intelligence Research, 19, 513-567.
- (2003) Journal of Artificial Intelligence Research , vol.19 , pp. 513-567
- Walsh, W.E.¹ Wellman, M.P.²

60
- 0004049893
- Ph.D. thesis, Psychology Department, University of Cambridge
- Watkins, C. J. C. H. (1989). Learning from delayed rewards. Ph.D. thesis, Psychology Department, University of Cambridge.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

61
- 34249833101
- Technical note: Q-learning
- Watkins, C. J. C. H., & Dayan, P. (1992). Technical note: Q-learning. Machine Learning, 8, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

62
- 0002278965
- Adaptive switching circuits
- Widrow, B., & Hoff, M. E. (1960). Adaptive switching circuits. In WESCON Convention Record Part IV, pp. 96-104.
- (1960) WESCON Convention Record Part IV , pp. 96-104
- Widrow, B.¹ Hoff, M.E.²

63
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams, R. J. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3), 229-256.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 229-256
- Williams, R.J.¹

64
- 0030284198
- A probabilistic framework for cooperative multi-agent distributed interpretation and optimization of communication
- Xiang, Y. (1996). A probabilistic framework for cooperative multi-agent distributed interpretation and optimization of communication. Artificial Intelligence, 87(1-2), 295-342.
- (1996) Artificial Intelligence , vol.87 , Issue.1-2 , pp. 295-342
- Xiang, Y.¹

65
- 0034827257
- Communication decisions in multi-agent cooperation: Model and experiments
- Montreal
- Xuan, P., Lesser, V., & Zilberstein, S. (2001). Communication decisions in multi-agent cooperation: model and experiments. In Proceedings of the Fifth International Conference on Autonomous Agents (Agents-01), pp. 616-623, Montreal.
- (2001) Proceedings of the Fifth International Conference on Autonomous Agents (Agents-01) , pp. 616-623
- Xuan, P.¹ Lesser, V.² Zilberstein, S.³

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.