SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2002, Pages 326-331

Reinforcement learning of coordination in cooperative multi-agent systems

Author keywords

[No Author keywords available]

Indexed keywords

COSTS; GAME THEORY; HEURISTIC METHODS; LEARNING SYSTEMS; PROBABILITY; PROBLEM SOLVING;

REINFORCEMENT LEARNING;

MULTI AGENT SYSTEMS;

EID: 0036932299 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (173)

References (11)

1
- 84880690163
- Sequential optimality and coordination in multiagent systems
- Boutilier, C. 1999. Sequential optimality and coordination in multiagent systems. In Proceedings of the Sixteenth International Joint Conference on Articial Intelligence (IJCAI-99), 478-485.
- (1999) Proceedings of the Sixteenth International Joint Conference on Articial Intelligence (IJCAI-99) , pp. 478-485
- Boutilier, C.¹

2
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus, C., and Boutilier, C. 1998. The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the Fifteenth National Conference on Articial Intelligence, 746-752.
- (1998) Proceedings of the Fifteenth National Conference on Articial Intelligence , pp. 746-752
- Claus, C.¹ Boutilier, C.²

3
- 0004247096
- Cambridge, MA: MIT Press
- Fudenberg, D., and Levine, D. K. 1998. The Theory of Learning in Games. Cambridge, MA: MIT Press.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

4
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P.; Littman, M.; and Moore, A. W. 1996. Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4.
- (1996) Journal of Artificial Intelligence Research , vol.4
- Kaelbling, L.P.¹ Littman, M.² Moore, A.W.³

5
- 0012286079
- An algorithm for distributed reinforcement learning in cooperative multi-agent systems
- Lauer, M., and Riedmiller, M. 2000. An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In Proceedings of the Seventeenth International Conference in Machine Learning.
- (2000) Proceedings of the Seventeenth International Conference in Machine Learning
- Lauer, M.¹ Riedmiller, M.²

6
- 0032359707
- Individual learning of coordination knowledge
- Sen, S., and Sekaran, M. 1998. Individual learning of coordination knowledge. JETAI 10(3):333-356.
- (1998) JETAI , vol.10 , Issue.3 , pp. 333-356
- Sen, S.¹ Sekaran, M.²

7
- 0028555752
- Learning to coordinate without sharing information
- Sen, S.; Sekaran, M.; and Hale, J. 1994. Learning to coordinate without sharing information. In Proceedings of the Twelfth National Conference on Artificial Intelligence, 426-431.
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

8
- 0033901602
- Convergence results for single-step on-policy reinforcement-learning algorithms
- Singh, S.; Jaakkola, T.; Littman, M. L.; and Szpesvari, C. 2000. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning Journal 38(3):287-308.
- (2000) Machine Learning Journal , vol.38 , Issue.3 , pp. 287-308
- Singh, S.¹ Jaakkola, T.² Littman, M.L.³ Szpesvari, C.⁴

9
- 85152198941
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- Tan, M. 1993. Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the Tenth International Conference on Machine Learning, 330-337.
- (1993) Proceedings of the Tenth International Conference on Machine Learning , pp. 330-337
- Tan, M.¹

10
- 0004049893
- Ph.D. Dissertation, Cambridge University, Cambridge, England
- Watkins, C. J. C. H. 1989. Learning from Delayed Rewards. Ph.D. Dissertation, Cambridge University, Cambridge, England.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.