-
3
-
-
0029513526
-
Gambling in a rigged casino: The adversarial multi-armed bandit problem
-
IEEE Computer Society Press, Los Alamitos
-
Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: Gambling in a rigged casino: the adversarial multi-armed bandit problem. In: Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pp. 322-331. IEEE Computer Society Press, Los Alamitos (1995)
-
(1995)
Proceedings of the 36th Annual Symposium on Foundations of Computer Science
, pp. 322-331
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
4
-
-
79955835579
-
Multiple Tree for Partially Observable Monte-Carlo Tree Search
-
Di Chio, C., Cagnoni, S., Cotta, C., Ebner, M., Ekárt, A., Esparcia-Alcázar, A.I., Merelo, J.J., Neri, F., Preuss, M., Richter, H., Togelius, J., Yannakakis, G.N. (eds.) EvoApplications 2011, Part I. Springer, Heidelberg
-
Auger, D.: Multiple Tree for Partially Observable Monte-Carlo Tree Search. In: Di Chio, C., Cagnoni, S., Cotta, C., Ebner, M., Ekárt, A., Esparcia-Alcázar, A.I., Merelo, J.J., Neri, F., Preuss, M., Richter, H., Togelius, J., Yannakakis, G.N. (eds.) EvoApplications 2011, Part I. LNCS, vol. 6624, pp. 53-62. Springer, Heidelberg (2011)
-
(2011)
LNCS
, vol.6624
, pp. 53-62
-
-
Auger, D.1
-
5
-
-
0006630130
-
A sublinear-time randomized approximation algorithm for matrix games
-
Grigoriadis, M.D., Khachiyan, L.G.: A sublinear-time randomized approximation algorithm for matrix games. Operations Research Letters 18(2), 53-58 (1995)
-
(1995)
Operations Research Letters
, vol.18
, Issue.2
, pp. 53-58
-
-
Grigoriadis, M.D.1
Khachiyan, L.G.2
-
6
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6(1), 4-22 (1985)
-
(1985)
Advances in Applied Mathematics
, vol.6
, Issue.1
, pp. 4-22
-
-
Lai, T.1
Robbins, H.2
-
7
-
-
79953227801
-
Monte Carlo Sampling for Regret Minimization in Extensive Games
-
Lanctot, M., Waugh, K., Zinkevich, M., Bowling, M.: Monte Carlo Sampling for Regret Minimization in Extensive Games. Advances in Neural Information Processing Systems 22, 1078-1086 (2009)
-
(2009)
Advances in Neural Information Processing Systems
, vol.22
, pp. 1078-1086
-
-
Lanctot, M.1
Waugh, K.2
Zinkevich, M.3
Bowling, M.4
-
8
-
-
79959754797
-
MCRNR: Fast Computing of Restricted Nash Responses by Means of Sampling
-
Ponsen, M., Lanctot, M., de Jong, S.: MCRNR: Fast Computing of Restricted Nash Responses by Means of Sampling. In: Proceedings of Interactive Decision Theory and Game Theory Workshop, AAAI 2010 (2010)
-
(2010)
Proceedings of Interactive Decision Theory and Game Theory Workshop, AAAI 2010
-
-
Ponsen, M.1
Lanctot, M.2
De Jong, S.3
-
9
-
-
79955805743
-
Upper Confidence Trees with Short Term Partial Information
-
Di Chio, C., Cagnoni, S., Cotta, C., Ebner, M., Ekárt, A., Esparcia-Alcázar, A.I., Merelo, J.J., Neri, F., Preuss, M., Richter, H., Togelius, J., Yannakakis, G.N. (eds.) EvoApplications 2011, Part I. Springer, Heidelberg
-
Teytaud, O., Flory, S.: Upper Confidence Trees with Short Term Partial Information. In: Di Chio, C., Cagnoni, S., Cotta, C., Ebner, M., Ekárt, A., Esparcia-Alcázar, A.I., Merelo, J.J., Neri, F., Preuss, M., Richter, H., Togelius, J., Yannakakis, G.N. (eds.) EvoApplications 2011, Part I. LNCS, vol. 6624, pp. 153-162. Springer, Heidelberg (2011)
-
(2011)
LNCS
, vol.6624
, pp. 153-162
-
-
Teytaud, O.1
Flory, S.2
|