-
2
-
-
0029513526
-
Gambling in a rigged casino: The adversarial multi-armed bandit problem
-
IEEE Computer Society Press, Los Alamitos
-
Auer, P., Cesa-Bianchi, N., Freund, Y., Schapire, R.E.: Gambling in a rigged casino: the adversarial multi-armed bandit problem. In: Proceedings of the 36th Annual Symposium on Foundations of Computer Science, pp. 322-331. IEEE Computer Society Press, Los Alamitos (1995)
-
(1995)
Proceedings of the 36th Annual Symposium on Foundations of Computer Science
, pp. 322-331
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
3
-
-
77956556555
-
Multi-agent learning experiments on repeated matrix games
-
Bouzy, B., Métivier, M.: Multi-agent learning experiments on repeated matrix games. In: ICML, pp. 119-126 (2010)
-
(2010)
ICML
, pp. 119-126
-
-
Bouzy, B.1
Métivier, M.2
-
4
-
-
38049037928
-
Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search
-
Ciancarini, P., van den Herik, H.J. (eds.) CG 2006. Springer, Heidelberg
-
Coulom, R.: Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search. In: Ciancarini, P., van den Herik, H.J. (eds.) CG 2006. LNCS, vol. 4630, pp. 72-83. Springer, Heidelberg (2007)
-
(2007)
LNCS
, vol.4630
, pp. 72-83
-
-
Coulom, R.1
-
5
-
-
0006630130
-
A sublinear-time randomized approximation algorithm for matrix games
-
Grigoriadis, M.D., Khachiyan, L.G.: A sublinear-time randomized approximation algorithm for matrix games. Operations Research Letters 18(2), 53-58 (1995)
-
(1995)
Operations Research Letters
, vol.18
, Issue.2
, pp. 53-58
-
-
Grigoriadis, M.D.1
Khachiyan, L.G.2
-
6
-
-
85129098689
-
-
AK Peters, Wellesley
-
Hearn, R.A., Demaine, E.: Games, Puzzles, and Computation. AK Peters, Wellesley (2009)
-
(2009)
Games, Puzzles, and Computation
-
-
Hearn, R.A.1
Demaine, E.2
-
7
-
-
33750293964
-
Bandit based monte-carlo planning
-
Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. Springer, Heidelberg
-
Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282-293. Springer, Heidelberg (2006)
-
(2006)
LNCS (LNAI)
, vol.4212
, pp. 282-293
-
-
Kocsis, L.1
Szepesvári, C.2
-
8
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985)
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.1
Robbins, H.2
-
9
-
-
71249086073
-
The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments
-
Lee, C.-S., Wang, M.-H., Chaslot, G., Hoock, J.-B., Rimmel, A., Teytaud, O., Tsai, S.-R., Hsu, S.-C., Hong, T.-P.: The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments. IEEE Transactions on Computational Intelligence and AI in games (2009)
-
(2009)
IEEE Transactions on Computational Intelligence and AI in Games
-
-
Lee, C.-S.1
Wang, M.-H.2
Chaslot, G.3
Hoock, J.-B.4
Rimmel, A.5
Teytaud, O.6
Tsai, S.-R.7
Hsu, S.-C.8
Hong, T.-P.9
-
10
-
-
0037840849
-
On the undecidability of probabilistic planning and related stochastic optimization problems
-
Madani, O., Hanks, S., Condon, A.: On the undecidability of probabilistic planning and related stochastic optimization problems. Artif. Intell. 147(1-2), 5-34 (2003)
-
(2003)
Artif. Intell.
, vol.147
, Issue.1-2
, pp. 5-34
-
-
Madani, O.1
Hanks, S.2
Condon, A.3
-
11
-
-
0001205548
-
Complexity of finite-horizon markov decision process problems
-
Mundhenk, M., Goldsmith, J., Lusena, C., Allender, E.: Complexity of finite-horizon markov decision process problems. J. ACM 47(4), 681-720 (2000)
-
(2000)
J. ACM
, vol.47
, Issue.4
, pp. 681-720
-
-
Mundhenk, M.1
Goldsmith, J.2
Lusena, C.3
Allender, E.4
|