-
2
-
-
34247189601
-
-
Banerjee, B., Peng, J.: Rvσ(t): a unifying approach to performance and convergence in online multiagent learning. In: AAMAS 2006: Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems, pp. 798-800. ACM Press, New York (2006)
-
Banerjee, B., Peng, J.: Rvσ(t): a unifying approach to performance and convergence in online multiagent learning. In: AAMAS 2006: Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems, pp. 798-800. ACM Press, New York (2006)
-
-
-
-
3
-
-
84899027977
-
Convergence and no-regret in multiagent learning
-
MIT Press, Cambridge
-
Bowling, M.: Convergence and no-regret in multiagent learning. In: Neural Information Processing Systems, vol. 17. MIT Press, Cambridge (2005)
-
(2005)
Neural Information Processing Systems
, vol.17
-
-
Bowling, M.1
-
4
-
-
36348967415
-
Convergence of gradient dynamics with a variable learning rate
-
Morgan Kaufmann, San Francisco
-
Bowling, M., Veloso, M.: Convergence of gradient dynamics with a variable learning rate. In: Proc. 18th International Conf. on Machine Learning, pp. 27-34. Morgan Kaufmann, San Francisco (2001)
-
(2001)
Proc. 18th International Conf. on Machine Learning
, pp. 27-34
-
-
Bowling, M.1
Veloso, M.2
-
5
-
-
84880865940
-
Rational and convergent learning in stochastic games
-
Bowling, M.H., Veloso, M.M.: Rational and convergent learning in stochastic games. In: IJCAI, pp. 1021-1026 (2001)
-
(2001)
IJCAI
, pp. 1021-1026
-
-
Bowling, M.H.1
Veloso, M.M.2
-
6
-
-
0041965975
-
R-max - a general polynomial time algorithm for near-optimal reinforcement learning
-
Brafman, R.I., Tennenholtz, M.: R-max - a general polynomial time algorithm for near-optimal reinforcement learning. J. Mach. Learn. Res. 3, 213-231 (2003)
-
(2003)
J. Mach. Learn. Res
, vol.3
, pp. 213-231
-
-
Brafman, R.I.1
Tennenholtz, M.2
-
7
-
-
0031630561
-
The dynamics of reinforcement learning in cooperative multiagent systems
-
Claus, O., Boutilier, C.: The dynamics of reinforcement learning in cooperative multiagent systems. In: AAAI/IAAI, pp. 746-752 (1998)
-
(1998)
AAAI/IAAI
, pp. 746-752
-
-
Claus, O.1
Boutilier, C.2
-
10
-
-
56049126679
-
-
Greenwald, A, Jafari, A, Ercal, G, Gondek, D, On no-regret learning, fictitious play, and nash equilibrium
-
Greenwald, A., Jafari, A., Ercal, G., Gondek, D.: On no-regret learning, fictitious play, and nash equilibrium
-
-
-
-
11
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
New Brunswick, NJ, pp, Morgan Kaufmann, San Francisco
-
Littman, M.L.: Markov games as a framework for multi-agent reinforcement learning. In: Proceedings of the 11th International Conference on Machine Learning (ML 1994), New Brunswick, NJ, pp. 157-163. Morgan Kaufmann, San Francisco (1994)
-
(1994)
Proceedings of the 11th International Conference on Machine Learning (ML
, pp. 157-163
-
-
Littman, M.L.1
-
13
-
-
33745609272
-
Learning against opponents with bounded memory
-
Powers, R., Shoham, Y.: Learning against opponents with bounded memory. In: IJCAI, pp. 817-822 (2005)
-
(2005)
IJCAI
, pp. 817-822
-
-
Powers, R.1
Shoham, Y.2
-
14
-
-
0001644761
-
-
Singh, S., Kearns, M., Mansour, Y.: Nash convergence of gradient dynamics in general-sum games, pp. 541-548
-
Nash convergence of gradient dynamics in general-sum games
, pp. 541-548
-
-
Singh, S.1
Kearns, M.2
Mansour, Y.3
-
15
-
-
56049126463
-
-
Stone, P., Littman, M.L.: Implicit negotiation in repeated games. In: Meyer, J.-J.C., Tambe, M. (eds.) ATAL 2001. LNCS (LNAI), 2333, pp. 96-105. Springer, Heidelberg (2002)
-
Stone, P., Littman, M.L.: Implicit negotiation in repeated games. In: Meyer, J.-J.C., Tambe, M. (eds.) ATAL 2001. LNCS (LNAI), vol. 2333, pp. 96-105. Springer, Heidelberg (2002)
-
-
-
-
16
-
-
34247179640
-
Learning against multiple opponents
-
ACM Press, New York
-
Vu, T., Powers, R., Shoham, Y.: Learning against multiple opponents. In: A AMAS 2006: Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems, pp. 752-759. ACM Press, New York (2006)
-
(2006)
A AMAS 2006: Proceedings of the fifth international joint conference on Autonomous agents and multiagent systems
, pp. 752-759
-
-
Vu, T.1
Powers, R.2
Shoham, Y.3
|