SCOPUS 정보 검색 플랫폼

Concurrency and Computation: Practice and Experience

Volumn 26, Issue 1, 2014, Pages 51-70

Coordinated learning by exploiting sparse interaction in multiagent systems

(3) Yu, Chao a Zhang, Minjie a Ren, Fenghui a

a UNIVERSITY OF WOLLONGONG (Australia)

Author keywords

coordination; multiagent learning; reinforcement learning; sparse interaction

Indexed keywords

LEARNING SYSTEMS; MULTI AGENT SYSTEMS; OBSERVABILITY; REINFORCEMENT LEARNING;

COORDINATED BEHAVIOR; COORDINATED LEARNING; COORDINATION; DISTRIBUTED LEARNING; DYNAMIC ENVIRONMENTS; MULTI-AGENT LEARNING; SPARSE INTERACTION; STATISTICAL INFORMATION;

AUTONOMOUS AGENTS;

EID: 84890565590 PISSN: 15320626 EISSN: 15320634 Source Type: Journal
DOI: 10.1002/cpe.2947 Document Type: Article

Times cited : (5)

References (42)

1
- 40949147745
- A comprehensive survey of multiagent reinforcement learning
- DOI 10.1109/TSMCC.2007.913919
- Busoniu L, Babuska R, De Schutter B,. A comprehensive survey of multiagent reinforcement learning. IEEE Transactions on System Man Cybernetics: Part C 2008; 38 (2): 156-172. (Pubitemid 351404112)
- (2008) IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews , vol.38 , Issue.2 , pp. 156-172
- Busoniu, L.¹ Babuska, R.² De Schutter, B.³

2
- 77950988223
- Learning complementary multiagent behaviors: A case study
- In. Springer: Berlin/Heidelberg
- Kalyanakrishnan S, Stone P, Learning complementary multiagent behaviors: a case study. In Proceedings of the 13th RoboCup International Symposium. Springer: Berlin/Heidelberg, 2010; 153-165.
- (2010) Proceedings of the 13th RoboCup International Symposium , pp. 153-165
- Kalyanakrishnan, S.¹ Stone, P.²

3
- 33746826183
- Multiagent reinforcement learning for multi-robot systems: A survey
- Yang E, Gu D,. Multiagent reinforcement learning for multi-robot systems: a survey. Technical Report CSM-404, Department of Computer Science, Univervisty of Essex, Colchester, UK, 2004.
- (2004) Technical Report CSM-404, Department of Computer Science, Univervisty of Essex, Colchester, UK
- Yang, E.¹ Gu, D.²

4
- 84865781568
- Self-organization for coordinating decentralized reinforcement learning
- In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
- Zhang C, Lesser V, Abdallah S,. Self-organization for coordinating decentralized reinforcement learning. In Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2010; 739-746.
- (2010) Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems , pp. 739-746
- Zhang, C.¹ Lesser, V.² Abdallah, S.³

5
- 27344449757
- Decentralized control of cooperative systems: Categorization and complexity analysis
- Goldman CV, Zilberstein S,. Decentralized control of cooperative systems: categorization and complexity analysis. Journal of Artificial Intelligence Research 2004; 22: 143-174. (Pubitemid 41525885)
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 143-174
- Goldman, C.V.¹ Zilberstein, S.²

6
- 34247560904
- A hybrid reinforcement learning approach to autonomic resource allocation
- 1662383, Proceedings - 3rd International Conference on Autonomic Computing, ICAC 2006
- Tesauro G, Jong NK, Das R, Bennani MN, A hybrid reinforcement learning approach to autonomic resource allocation. In 2006 IEEE International Conference on Autonomic Computing. IEEE Press: New York, 2006; 65-73. (Pubitemid 46666907)
- (2006) Proceedings - 3rd International Conference on Autonomic Computing, ICAC 2006 , vol.2006 , pp. 65-73
- Tesauro, G.¹ Jong, N.K.² Das, R.³ Bennani, M.N.⁴

7
- 28844499442
- Resource allocation in the Grid with learning agents
- DOI 10.1007/s10723-005-9003-7
- Galstyan A, Czajkowski K, Lerman K,. Resource allocation in the grid with learning agents. Journal of Grid Computing 2005; 3 (1): 91-100. (Pubitemid 41762487)
- (2005) Journal of Grid Computing , vol.3 , Issue.1-2 , pp. 91-100
- Galstyan, A.¹ Czajkowski, K.² Lerman, K.³

8
- 0036274424
- Pricing in agent economies using multi-agent Q-learning
- DOI 10.1023/A:1015504423309
- Tesauro G, Kephart JO,. Pricing in agent economies using multi-agent Q-learning. Autonomous Agent and Multi-Agent Systems 2002; 5 (3): 289-304. (Pubitemid 37113883)
- (2002) Autonomous Agents and Multi-Agent Systems , vol.5 , Issue.3 , pp. 289-304
- Tesauro, G.¹ Kephart, J.O.²

9
- 34249045960
- Perspectives on multiagent learning
- DOI 10.1016/j.artint.2007.02.004, PII S0004370207000525, Foundations of Multi-Agent Learning
- Sandholm T,. Perspectives on multiagent learning. Journal of Artificial Intelligence 2007; 171 (7): 382-391. (Pubitemid 46802424)
- (2007) Artificial Intelligence , vol.171 , Issue.7 , pp. 382-391
- Sandholm, T.¹

10
- 34249024789
- Multiagent learning is not the answer. It is the question
- DOI 10.1016/j.artint.2006.12.005, PII S0004370207000021, Foundations of Multi-Agent Learning
- Stone P,. Multiagent learning is not the answer. It is the question. Journal of Artificial Intelligence 2007; 171 (7): 402-405. (Pubitemid 46802413)
- (2007) Artificial Intelligence , vol.171 , Issue.7 , pp. 402-405
- Stone, P.¹

11
- 34147161536
- If multi-agent learning is the answer,what is the question?
- Shoham Y, Powers B, Grenager T,. If multi-agent learning is the answer,what is the question?. Journal of Artificial Intelligence 2007; 171 (7): 365-377.
- (2007) Journal of Artificial Intelligence , vol.171 , Issue.7 , pp. 365-377
- Shoham, Y.¹ Powers, B.² Grenager, T.³

12
- 38549135277
- An overview of cooperative and competitive multiagent learning
- Tuyls K,. An overview of cooperative and competitive multiagent learning. First International Workshop on Learning and Adaptation in MAS(LAMAS), Utrecht, The Netherlands, 2005; 1-46.
- (2005) First International Workshop on Learning and Adaptation in MAS(LAMAS), Utrecht, the Netherlands , pp. 1-46
- Tuyls, K.¹

13
- 79955976414
- Decentralized MDPs with sparse interactions
- Melo FS, Veloso M,. Decentralized MDPs with sparse interactions. Artifcial Intelligence 2011; 175: 1757-1789.
- (2011) Artifcial Intelligence , vol.175 , pp. 1757-1789
- Melo, F.S.¹ Veloso, M.²

14
- 40949099898
- Utile coordination: Learning interdependencies among cooperative agents
- In. IEEE Press: New York
- Kok JR, Hoen P, Bakker B, Vlassis N,. Utile coordination: learning interdependencies among cooperative agents. In Proceedings of Symposium on Computational Intelligence and Games. IEEE Press: New York, 2005; 29-36.
- (2005) Proceedings of Symposium on Computational Intelligence and Games , pp. 29-36
- Kok, J.R.¹ Hoen, P.² Bakker, B.³ Vlassis, N.⁴

15
- 47149086135
- Sparse cooperative Q-learning
- In. ACM Press: New York
- Kok JR, Vlassis N,. Sparse cooperative Q-learning. In Proceedings of 21st International Conference on Machine Learning. ACM Press: New York, 2004; 61-68.
- (2004) Proceedings of 21st International Conference on Machine Learning , pp. 61-68
- Kok, J.R.¹ Vlassis, N.²

16
- 84867671358
- Learning multi-agent state space representations
- In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
- De H Y M, Vrancx P, Nowé A,. Learning multi-agent state space representations. In Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2010; 715-722.
- (2010) Proceedings of 9th International Conference of Autonomous Agents and Multiagent Systems , pp. 715-722
- De Y, M.H.¹ Vrancx, P.² Nowé, A.³

17
- 84873855111
- Learning what to observe in multi-agent systems
- In. University of Twente Publisher: Enschede, the Netherlands
- De Hauwere YM, Vrancx P, Nowé A,. Learning what to observe in multi-agent systems. In Proceedings of 20th Belgian-Netherlands Conference on Artificial Intelligence. University of Twente Publisher: Enschede, the Netherlands, 2009; 83-90.
- (2009) Proceedings of 20th Belgian-Netherlands Conference on Artificial Intelligence , pp. 83-90
- De Hauwere, Y.M.¹ Vrancx, P.² Nowé, A.³

18
- 26944461811
- Sparse tabular multiagent Q-learning
- Kok JR, Vlassis N,. Sparse tabular multiagent Q-learning. Annual Machine Learning Conference of Belgium and the Netherlands, Brussels, Belgium, 2004; 65-71.
- (2004) Annual Machine Learning Conference of Belgium and the Netherlands, Brussels, Belgium , pp. 65-71
- Kok, J.R.¹ Vlassis, N.²

19
- 33846942607
- Hierarchical multi-agent reinforcement learning
- Ghavamzadeh M, Mahadevan S, Makar R,. Hierarchical multi-agent reinforcement learning. Autonomous Agents and Multi-Agent Systems 2006; 13 (2): 197-229.
- (2006) Autonomous Agents and Multi-Agent Systems , vol.13 , Issue.2 , pp. 197-229
- Ghavamzadeh, M.¹ Mahadevan, S.² Makar, R.³

20
- 36949027865
- Hierarchical average reward reinforcement learning
- Ghavamzadeh M, Mahadevan S,. Hierarchical average reward reinforcement learning. The Journal of Machine Learning Research 2007; 8: 2629-2669. (Pubitemid 350241862)
- (2007) Journal of Machine Learning Research , vol.8 , pp. 2629-2669
- Ghavamzadeh, M.¹ Mahadevan, S.²

21
- 4544236179
- Coordinated reinforcement learning
- In. Morgan Kaufmann Publishers: San Mateo, CA
- Guestrin C, Lagoudakis M,. Coordinated reinforcement learning. In Proceedings of 19th International Conference on Machine Learning. Morgan Kaufmann Publishers: San Mateo, CA, 2002; 227-234.
- (2002) Proceedings of 19th International Conference on Machine Learning , pp. 227-234
- Guestrin, C.¹ Lagoudakis, M.²

22
- 0001395498
- Distributed value functions
- In. Morgan Kaufmann Publishers: San Mateo, CA
- Schneider J, Wong WK, Moore A, Riedmiller M,. Distributed value functions. In Proceedings of the 16th International Conference on Machine Learning. Morgan Kaufmann Publishers: San Mateo, CA, 1999; 371-378.
- (1999) Proceedings of the 16th International Conference on Machine Learning , pp. 371-378
- Schneider, J.¹ Wong, W.K.² Moore, A.³ Riedmiller, M.⁴

23
- 80055062322
- Coordinated multi-agent reinforcement learning in networked distributed pomdps
- In. AAAI Press: Menlo Park, California
- Zhang C, Lesser V,. Coordinated multi-agent reinforcement learning in networked distributed pomdps. In Proceedings of the 25th National Conference on Argificial Intelligence (AAAI). AAAI Press: Menlo Park, California, 2011; 764-770.
- (2011) Proceedings of the 25th National Conference on Argificial Intelligence (AAAI) , pp. 764-770
- Zhang, C.¹ Lesser, V.²

24
- 14344256227
- PhD thesis, Computer Science Department, Stanford University, August
- Guestrin C,. Planning under uncertainty in complex structured environments. PhD thesis, Computer Science Department, Stanford University, August 2003.
- (2003) Planning under Uncertainty in Complex Structured Environments
- Guestrin, C.¹

25
- 84899992307
- Interaction-driven Markov games for decentralized multiagent planning under uncertainty
- In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
- Spaan M, Melo FS,. Interaction-driven Markov games for decentralized multiagent planning under uncertainty. In Proceedings of 7th International Conference on Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2008; 525-532.
- (2008) Proceedings of 7th International Conference on Autonomous Agents and Multiagent Systems , pp. 525-532
- Spaan, M.¹ Melo, F.S.²

26
- 84899840405
- Learning of coordination: Exploiting sparse interactions in multiagent systems
- In. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC
- Melo FS, Veloso M,. Learning of coordination: exploiting sparse interactions in multiagent systems. In Proceedings of 8th International Conference on Autonomous Agents and Multiagent Systems. International Foundation for Autonomous Agents and Multiagent Systems: Richland, SC, 2009; 772-780.
- (2009) Proceedings of 8th International Conference on Autonomous Agents and Multiagent Systems , pp. 772-780
- Melo, F.S.¹ Veloso, M.²

27
- 0003998452
- John Wiley & Sons, Inc.: Hoboken, New Jersey
- Puterman ML,. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc.: Hoboken, New Jersey, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

28
- 0004102479
- MIT Press: Cambridge
- Sutton RS, Barto AG,. Reinforcement Learning: An Introduction. MIT Press: Cambridge, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

29
- 34249833101
- Q-learning
- Watkins CJCH, Dayan P,. Q-learning. Machine Learning 1992; 8 (3): 279-292.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

30
- 0036874366
- The complexity of decentralized control of Markov decision processes
- Bernstein DS, Givan R, Immerman N, Zilberstein S,. The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research 2002; 27 (4): 819-840.
- (2002) Mathematics of Operations Research , vol.27 , Issue.4 , pp. 819-840
- Bernstein, D.S.¹ Givan, R.² Immerman, N.³ Zilberstein, S.⁴

31
- 79958114832
- Complexity of decentralized control: Special cases
- Allen M, Zilberstein S,. Complexity of decentralized control: special cases. Advanced Neural Information Processing Systems 2009; 22: 19-27.
- (2009) Advanced Neural Information Processing Systems , vol.22 , pp. 19-27
- Allen, M.¹ Zilberstein, S.²

32
- 60349107649
- Exploiting factored representations for decentralized execution in multiagent teams
- In. ACM Press: New York
- Roth M, Simmons R, Veloso M,. Exploiting factored representations for decentralized execution in multiagent teams. In Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems. ACM Press: New York, 2007; 469-475.
- (2007) Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 469-475
- Roth, M.¹ Simmons, R.² Veloso, M.³

33
- 51649127552
- Formal models and algorithms for decentralized decision making under uncertainty
- Seuken S, Zilberstein S,. Formal models and algorithms for decentralized decision making under uncertainty. Autonomous Agents and Multi-Agent Systems 2008; 17 (2): 190-250.
- (2008) Autonomous Agents and Multi-Agent Systems , vol.17 , Issue.2 , pp. 190-250
- Seuken, S.¹ Zilberstein, S.²

34
- 0036355306
- Multiagent teamwork: Analyzing the optimality and complexity of key theories and models
- Pynadath DV, Tambe M,. Multiagent teamwork: analyzing the optimality and complexity of key theories and models. In Proceedings of the First International Joint Conference on Autonomous Agents and Multiagent Systems: Part 2. ACM Press: New York, 2002; 873-880. (Pubitemid 34975283)
- (2002) Proceedings of the International Conference on Autonomous Agents , Issue.1 , pp. 873-880
- Pynadath, D.V.¹ Tambe, M.²

35
- 0002500351
- Planning, learning and coordination in multiagent decision processes
- In. Morgan Kaufmann Publishers: San Mateo, CA
- Boutilier C,. Planning, learning and coordination in multiagent decision processes. In Proceedings of 6th Conference on Theoretical Aspects of Rationality and Knowledge. Morgan Kaufmann Publishers: San Mateo, CA, 1996; 195-210.
- (1996) Proceedings of 6th Conference on Theoretical Aspects of Rationality and Knowledge , pp. 195-210
- Boutilier, C.¹

36
- 84880690163
- Sequential optimality and coordination in multiagent systems
- In. Morgan Kaufmann Publishers: San Mateo, CA
- Boutilier C,. Sequential optimality and coordination in multiagent systems. In International Joint Conference on Artificial Intelligence. Morgan Kaufmann Publishers: San Mateo, CA, 1999; 478-485.
- (1999) International Joint Conference on Artificial Intelligence , pp. 478-485
- Boutilier, C.¹

37
- 1142293055
- Transition-independent decentralized Markov decision processes
- In. ACM Press: New York
- Becker R, Zilberstein S, Lesser V, Goldman CV,. Transition-independent decentralized Markov decision processes. In Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems. ACM Press: New York, 2003; 41-48.
- (2003) Proceedings of the Second International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 41-48
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

38
- 27344432831
- Solving transition independent decentralized Markov decision processes
- Becker R, Zilberstein S, Lesser V, Goldman CV,. Solving transition independent decentralized Markov decision processes. Journal of Artificial Intelligence Research 2004; 22: 423-455. (Pubitemid 41525892)
- (2004) Journal of Artificial Intelligence Research , vol.22 , pp. 423-455
- Becker, R.¹ Zilberstein, S.² Lesser, V.³ Goldman, C.V.⁴

39
- 57749106245
- Interaction structure and dimensionality in decentralized problem solving
- In. ACM Press: New York
- Allen M, Petrik M, Zilberstein S,. Interaction structure and dimensionality in decentralized problem solving. In Conference on Artificial Intelligence (AAAI). ACM Press: New York, 2008; 1440-1441.
- (2008) Conference on Artificial Intelligence (AAAI) , pp. 1440-1441
- Allen, M.¹ Petrik, M.² Zilberstein, S.³

40
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- In. AAAI Press: Menlo Park, California
- Claus C, Boutilier C,. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of National of Conference on Artificial Intelligence. AAAI Press: Menlo Park, California, 1998; 746-752.
- (1998) Proceedings of National of Conference on Artificial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

41
- 0002109085
- IMulti-agent reinforcement learning: Independent vs. Cooperative agents
- In. Morgan Kaufmann Publishers: San Mateo, CA
- Tan M,. IMulti-agent reinforcement learning: independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning. Morgan Kaufmann Publishers: San Mateo, CA, 1993; 1440-1441.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 1440-1441
- Tan, M.¹

42
- 0028555752
- Learning to coordinate without sharing information
- In. John Wiley & Sons, Inc.: Hoboken, New Jersey
- Sen S, Sekaran M, Hale J, Learning to coordinate without sharing information. In Proceedings of the National Conference on Artificial Intelligence. John Wiley & Sons, Inc.: Hoboken, New Jersey, 1994; 426-426.
- (1994) Proceedings of the National Conference on Artificial Intelligence , pp. 426-426
- Sen, S.¹ Sekaran, M.² Hale, J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.