SCOPUS Information Retrieval Platform

Volume , Issue , 2009, Pages 83-90

Learning what to observe in multi-agent systems

Author keywords

[No Author keywords available]

Indexed keywords

MULTI AGENT SYSTEM (MAS); MULTI-AGENT REINFORCEMENT LEARNING; MULTIPLE AGENTS; OPTIMAL POLICIES; SINGLE-AGENT; STATE INFORMATION; STATE SPACE;

ARTIFICIAL INTELLIGENCE;

MULTI AGENT SYSTEMS;

EID: 84873855111 PISSN: 15687805 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (10)

References (16)

2
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
- D. Chapman and L.P. Kaelbling. Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proceedings of the 12th International Joint Conference on Artificial Intelligence, pages 726-731, 1991.
- (1991) Proceedings of the 12th International Joint Conference on Artificial Intelligence , pp. 726-731
- Chapman, D.¹ Kaelbling, L.P.²

3
- 57849155854
- Using generalized learning automata for state space aggregation in MAS
- Y.M. De Hauwere, P. Vrancx, and A. Nowé. Using generalized learning automata for state space aggregation in MAS. Lecture Notes in Computer Science, Knowledge-Based Intelligent Information and Engineering Systems (KES 2008), 5177:182-193, 2008.
- (2008) Knowledge-Based Intelligent Information and Engineering Systems (KES 2008) , vol.5177 , pp. 182-193
- De Hauwere, Y.M.¹ Vrancx, P.² Nowé, A.³

5
- 4644369748
- Nash Q-learning for general-sum stochastic games
- J. Hu and M.P. Wellman. Nash Q-learning for general-sum stochastic games. Journal of Machine Learning Research, 4:1039-1069, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1039-1069
- Hu, J.¹ Wellman, M.P.²

6
- 40949099898
- Utile coordination: Learning interdependencies among cooperative agents
- J.R. Kok, P.J. 't Hoen, B. Bakker, and N. Vlassis. Utile coordination: Learning interdependencies among cooperative agents. In Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG05), pages 29-36, 2005.
- (2005) Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG05) , pp. 29-36
- Kok, J.R.¹ 't Hoen, P.J.² Bakker, B.³ Vlassis, N.⁴

8
- 47149086135
- Sparse tabular multiagent Q-learning
- J.R. Kok and N. Vlassis. Sparse tabular multiagent Q-learning. In Proceedings of the 13th Benelux Conference on Machine Learning, 2004.
- (2004) Proceedings of the 13th Benelux Conference on Machine Learning
- Kok, J.R.¹ Vlassis, N.²

9
- 84899840405
- Learning of coordination: Exploiting sparse interactions in multiagent systems
- F.S. Melo and M. Veloso. Learning of coordination: Exploiting sparse interactions in multiagent systems. In Proceedings of the 8th International Conference on Autonomous Agents and Multi-Agent Systems, 2009.
- (2009) Proceedings of the 8th International Conference on Autonomous Agents and Multi-Agent Systems
- Melo, F.S.¹ Veloso, M.²

10
- 0028555752
- Learning to coordinate without sharing information
- S. Sen, I. Sen, M. Sekaran, and J. Hale. Learning to coordinate without sharing information. In Proceedings of the Twelfth National Conference on Artificial Intelligence, pages 426-431, 1994.
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence , pp. 426-431
- Sen, S.¹ Sen, I.² Sekaran, M.³ Hale, J.⁴

11
- 0004102479
- Reinforcement Learning: An Introduction
- R.S. Sutton and A.G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

13
- 84873851838
- Networks of Learning Automata: Techniques for Online Stochastic Optimization
- M.A.L. Thathachar and P.S. Sastry. Networks of Learning Automata: Techniques for Online Stochastic Optimization. Kluwer Academic Publishers, 2004.
- (2004) Networks of Learning Automata: Techniques for Online Stochastic Optimization
- Thathachar, M.A.L.¹ Sastry, P.S.²

14
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J.N. Tsitsiklis. Asynchronous stochastic approximation and Q-learning. Machine Learning, 16(3):185-202, 1994.
- (1994) Machine Learning , vol.16 , Issue.3 , pp. 185-202
- Tsitsiklis, J.N.¹

15
- 0004049893
- Learning from Delayed Rewards
- C. Watkins. Learning from Delayed Rewards. PhD thesis, University of Cambridge, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

16
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- R.J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8(3):229-256, 1992.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 229-256
- Williams, R.J.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.