-
1
-
-
0029513526
-
Gambling in a rigged casino: The adversarial multi-arm bandit problem
-
Auer, P., Cesa-Bianchi, N., Freund, Y., & Schapire, R. E. (1995). Gambling in a rigged casino: The adversarial multi-arm bandit problem. FOCS (pp. 322-331).
-
(1995)
FOCS
, pp. 322-331
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
3
-
-
0036531878
-
Multiagent learning using a variable learning rate
-
Bowling, M., & Veloso, M. (2002). Multiagent learning using a variable learning rate. Artificial Intelligence, 136, 215-250.
-
(2002)
Artificial Intelligence
, vol.136
, pp. 215-250
-
-
Bowling, M.1
Veloso, M.2
-
4
-
-
0034247018
-
A near optimal polynomial time algorithm for learning in certain classes of stochastic games
-
Brafman, R., & Tennenholtz, M. (2000). A near optimal polynomial time algorithm for learning in certain classes of stochastic games. Artificial Intelligence, 121, 31-47.
-
(2000)
Artificial Intelligence
, vol.121
, pp. 31-47
-
-
Brafman, R.1
Tennenholtz, M.2
-
6
-
-
0031140246
-
How to use expert advice
-
Cesa-Bianchi, N., Freund, Y., Haussler, D., Helmbold, D. P., Schapire, R. E., & Warmuth, M. K. (1997). How to use expert advice. Journal of the ACM, 44, 427-485.
-
(1997)
Journal of the ACM
, vol.44
, pp. 427-485
-
-
Cesa-Bianchi, N.1
Freund, Y.2
Haussler, D.3
Helmbold, D.P.4
Schapire, R.E.5
Warmuth, M.K.6
-
8
-
-
0002095886
-
A randomization rule for selecting forecasts
-
Foster, D. P., & Vohra, R. V. (1993). A randomization rule for selecting forecasts. Operations Research, 41, 704-709.
-
(1993)
Operations Research
, vol.41
, pp. 704-709
-
-
Foster, D.P.1
Vohra, R.V.2
-
9
-
-
0002267135
-
Adaptive game playing using multiplicative weights
-
Freund, Y., & Schapire, R. E. (1999). Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29, 79-103.
-
(1999)
Games and Economic Behavior
, vol.29
, pp. 79-103
-
-
Freund, Y.1
Schapire, R.E.2
-
12
-
-
0001976283
-
Approximation to Bayes risk in repeated play
-
Princeton University Press
-
Hannan, J. (1957). Approximation to Bayes risk in repeated play. vol. Ill of Contributions to the Theory of Games, 97-139. Princeton University Press.
-
(1957)
Contributions to the Theory of Games
, vol.3
, pp. 97-139
-
-
Hannan, J.1
-
13
-
-
0000929496
-
Multiagent reinforcement learning: Theoretical framework and an algorithm
-
Hu, J., & Wellman, M. P. (1998). Multiagent reinforcement learning: Theoretical framework and an algorithm. International Conference on Machine Learning (pp. 242-250).
-
(1998)
International Conference on Machine Learning
, pp. 242-250
-
-
Hu, J.1
Wellman, M.P.2
-
14
-
-
1142305713
-
Learning to play games in extensive form by valuation
-
Jehiel, P., & Samet, D. (2001). Learning to play games in extensive form by valuation. NAJ Economics, v3nl.
-
(2001)
NAJ Economics
, vol.3
, Issue.1
-
-
Jehiel, P.1
Samet, D.2
-
15
-
-
0000221289
-
Rational learning leads to Nash equilibrium
-
Kalai, E., & Lehrer, E. (1993). Rational learning leads to Nash equilibrium. Econometrica, 61, 1019-1045.
-
(1993)
Econometrica
, vol.61
, pp. 1019-1045
-
-
Kalai, E.1
Lehrer, E.2
-
18
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. International Conference on Machine Learning (pp. 157-163).
-
(1994)
International Conference on Machine Learning
, pp. 157-163
-
-
Littman, M.L.1
-
19
-
-
0004145762
-
-
New York: John Wiley and Sons. Dover republication 1989
-
Luce, R. D., & Raiffa, H. (1957). Games and decisions. New York: John Wiley and Sons. Dover republication 1989.
-
(1957)
Games and Decisions
-
-
Luce, R.D.1
Raiffa, H.2
-
20
-
-
0038675791
-
On repeated games with incomplete information played by non-Baycsian players
-
Megiddo, N. (1980). On repeated games with incomplete information played by non-Baycsian players. International Journal of Game Theory, 9, 157-167.
-
(1980)
International Journal of Game Theory
, vol.9
, pp. 157-167
-
-
Megiddo, N.1
-
22
-
-
0034836562
-
Algorithms, games and the Internet
-
Papadimitriou, C. (2001). Algorithms, games and the Internet. STOC (pp. 749-753).
-
(2001)
STOC
, pp. 749-753
-
-
Papadimitriou, C.1
-
23
-
-
0001644761
-
Nash convergence of gradient dynamics in general-sum games
-
Stanford, CA
-
Singh, S., Kearns, M., & Mansour, Y. (2000). Nash convergence of gradient dynamics in general-sum games. Proceedings of the Uncertainty in Artificial Intelligence Conference (UAI) (pp. 541-548). Stanford, CA.
-
(2000)
Proceedings of the Uncertainty in Artificial Intelligence Conference (UAI)
, pp. 541-548
-
-
Singh, S.1
Kearns, M.2
Mansour, Y.3
|