SCOPUS Information Retrieval Platform

Volume 1, 2005, Pages 41-46

Efficient no-regret multiagent learning

Author keywords

[No Author keywords available]

Indexed keywords

GAME MATRICES; POLYNOMIAL BOUNDS; STOCHASTIC SAMPLING;

MATRIX ALGEBRA; MULTI AGENT SYSTEMS; OPTIMIZATION; POLYNOMIALS; STATISTICAL METHODS;

LEARNING SYSTEMS;

EID: 29344441750 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (23)

References (20)

3
- 0036531878
- Multiagent learning using a variable learning rate
- Bowling, M., and Veloso, M. 2002. Multiagent learning using a variable learning rate. Artificial Intelligence 136:215-250.
- (2002) Artificial Intelligence , vol.136 , pp. 215-250
- Bowling, M.¹ Veloso, M.²

4
- 84899027977
- Convergence and no-regret in multiagent learning
- Bowling, M. 2005. Convergence and no-regret in multiagent learning. In Proceedings of NIPS 2004/5.
- (2005) Proceedings of NIPS 2004/5
- Bowling, M.¹

6
- 1942421183
- AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
- Conitzer, V., and Sandholm, T. 2003. AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. In Proceedings of the 20th International Conference on Machine Learning.
- (2003) Proceedings of the 20th International Conference on Machine Learning
- Conitzer, V.¹ Sandholm, T.²

8
- 0002267135
- Adaptive game playing using multiplicative weights
- Freund, Y., and Schapire, R. E. 1999. Adaptive game playing using multiplicative weights. Games and Economic Behavior 29:79-103.
- (1999) Games and Economic Behavior , vol.29 , pp. 79-103
- Freund, Y.¹ Schapire, R.E.²

9
- 0000668347
- Consistency and cautious fictitious play
- Fudenberg, D., and Levine, D. 1995. Consistency and cautious fictitious play. Journal of Economic Dynamics and Control 19:1065-1089.
- (1995) Journal of Economic Dynamics and Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.²

10
- 9444291149
- Correlated Q-learning
- Greenwald, A., and Hall, K. 2002. Correlated Q-learning. In Proceedings of the AAAI Symposium on Collaborative Learning Agents.
- (2002) Proceedings of the AAAI Symposium on Collaborative Learning Agents
- Greenwald, A.¹ Hall, K.²

12
- 9444236608
- On no-regret learning, fictitious play, and Nash equilibrium
- Jafari, A.; Greenwald, A.; Gondek, D.; and Ercal, G. 2001. On no-regret learning, fictitious play, and Nash equilibrium. In Proceedings of the Eighteenth International Conference on Machine Learning, 226-233.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning , pp. 226-233
- Jafari, A.¹ Greenwald, A.² Gondek, D.³ Ercal, G.⁴

13
- 35148838877
- The weighted majority algorithm
- Littlestone, N., and Warmuth, M. 1994. The weighted majority algorithm. Information and Computation 108:212-261.
- (1994) Information and Computation , vol.108 , pp. 212-261
- Littlestone, N.¹ Warmuth, M.²

15
- 0242466944
- Friend-or-foe Q-learning in general-sum games
- Littman, M. L. 2001. Friend-or-foe Q-learning in general-sum games. In Proceedings of the Eighteenth International Conference on Machine Learning.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning
- Littman, M.L.¹

16
- 84898936075
- New criteria and a new algorithm for learning in multi-agent systems
- Powers, R., and Shoham, Y. 2005. New criteria and a new algorithm for learning in multi-agent systems. In Proceedings of NIPS 2004/5.
- (2005) Proceedings of NIPS 2004/5
- Powers, R.¹ Shoham, Y.²

17
- 0001644761
- Nash convergence of gradient dynamics in general-sum games
- Singh, S.; Kearns, M.; and Mansour, Y. 2000. Nash convergence of gradient dynamics in general-sum games. In Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence, 541-548.
- (2000) Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence , pp. 541-548
- Singh, S.¹ Kearns, M.² Mansour, Y.³

18
- 0004102479
- Reinforcement Learning: An Introduction
- Sutton, R., and Barto, A. G. 1998. Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.G.²

19
- 67649405225
- Reinforcement learning to play an optimal Nash equilibrium in team Markov games
- Wang, X., and Sandholm, T. 2002. Reinforcement learning to play an optimal Nash equilibrium in team Markov games. In Advances in Neural Information Processing Systems 15, NIPS.
- (2002) Advances in Neural Information Processing Systems 15, NIPS
- Wang, X.¹ Sandholm, T.²

20
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- Zinkevich, M. 2003. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th International Conference on Machine Learning.
- (2003) Proceedings of the 20th International Conference on Machine Learning
- Zinkevich, M.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.