-
1
-
-
0042496192
-
Gambling in a rigged casino: The adversarial multi-armed bandit problem
-
NeuroCOLT2 Technical Report Series
-
Auer, P.; Cesa-Bianchi, N.; Freund, Y.; and Schapire, R. E. 1998. Gambling in a rigged casino: The adversarial multi-armed bandit problem. Technical Report NC2-TR-1998-025, NeuroCOLT2 Technical Report Series.
-
(1998)
Technical Report
, vol.NC2-TR-1998-025
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
3
-
-
0036531878
-
Multiagent learning using a variable learning rate
-
Bowling, M., and Veloso, M. 2002. Multiagent learning using a variable learning rate. Artificial Intelligence 136:215-250.
-
(2002)
Artificial Intelligence
, vol.136
, pp. 215-250
-
-
Bowling, M.1
Veloso, M.2
-
4
-
-
84899027977
-
Convergence and no-regret in multiagent learning
-
Bowling, M. 2005. Convergence and no-regret in multiagent learning. In Proceedings of NIPS 2004/5.
-
(2005)
Proceedings of NIPS 2004/5
-
-
Bowling, M.1
-
6
-
-
1942421183
-
AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents
-
Conitzer, V., and Sandholm, T. 2003. AWESOME: A general multiagent learning algorithm that converges in self-play and learns a best response against stationary opponents. In Proceedings of the 20th International Conference on Machine Learning.
-
(2003)
Proceedings of the 20th International Conference on Machine Learning
-
-
Conitzer, V.1
Sandholm, T.2
-
8
-
-
0002267135
-
Adaptive game playing using multiplicative weights
-
Freund, Y., and Schapire. R. E. 1999. Adaptive game playing using multiplicative weights. Games and Economic Behavior 29:79 -103.
-
(1999)
Games and Economic Behavior
, vol.29
, pp. 79-103
-
-
Freund, Y.1
Schapire, R.E.2
-
11
-
-
0000929496
-
Multiagent reinforcement learning: Theoretical framework and an algorithm
-
San Francisco. CA: Morgan Kaufmann
-
Hu, J., and Wellman, M. P. 1998. Multiagent reinforcement learning: Theoretical framework and an algorithm. In Proc. of the 15th Int. Conf. on Machine Learning (ML'98), 242-250. San Francisco. CA: Morgan Kaufmann.
-
(1998)
Proc. of the 15th Int. Conf. on Machine Learning (ML'98)
, pp. 242-250
-
-
Hu, J.1
Wellman, M.P.2
-
12
-
-
9444236608
-
On no-regret learning, fictitious play, and nash equilibrium
-
Jafari, A.; Greenwald, A.; Gondek, D.; and Ercal, G. 2001. On no-regret learning, fictitious play, and nash equilibrium. In Proceedings of the Eighteenth International Conference on Machine Learning, 226-223.
-
(2001)
Proceedings of the Eighteenth International Conference on Machine Learning
, pp. 226-1223
-
-
Jafari, A.1
Greenwald, A.2
Gondek, D.3
Ercal, G.4
-
14
-
-
85149834820
-
Markov games as a framework for multiagent reinforcement learning
-
San Mateo, CA: Morgan Kaufmann
-
Littman, M. L. 1994. Markov games as a framework for multiagent reinforcement learning. In Proc. of the 11th Int. Conf. on Machine Learning, 157-163. San Mateo, CA: Morgan Kaufmann.
-
(1994)
Proc. of the 11th Int. Conf. on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
16
-
-
84898936075
-
New criteria and a new algorithm for learning in multi-agent systems
-
Powers. R., and Shoham, Y. 2005. New criteria and a new algorithm for learning in multi-agent systems. In Proceedings of NIPS 2004/5.
-
(2005)
Proceedings of NIPS 2004/5
-
-
Powers, R.1
Shoham, Y.2
|