-
1
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite time analysis of the multiarmed bandit problem. Machine Learning 47(2/3), 235-256 (2002) (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
2
-
-
85012688561
-
-
Princeton Univ. Press, Princeton
-
Bellman, R.: Dynamic Programming. Princeton Univ. Press, Princeton (1957)
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
3
-
-
84886995576
-
On the huge benefit of quasi-random mutations for multimodal optimization with application to grid-based tuning of neurocontrollers
-
Chaslot, G., Hoock, J.-B., Teytaud, F., Teytaud, O.: On the huge benefit of quasi-random mutations for multimodal optimization with application to grid-based tuning of neurocontrollers. In: ESANN, Bruges, Belgium (2009)
-
ESANN, Bruges, Belgium (2009)
-
-
Chaslot, G.1
Hoock, J.-B.2
Teytaud, F.3
Teytaud, O.4
-
4
-
-
84874005312
-
Monte-Carlo Strategies for Computer Go
-
Schobbens, P.-Y., Vanhoof, W., Schwanen, G. (eds.)
-
Chaslot, G., Saito, J.-T., Bouzy, B., Uiterwijk, J.W.H.M., van den Herik, H.J.: Monte-Carlo Strategies for Computer Go. In: Schobbens, P.-Y., Vanhoof, W., Schwanen, G. (eds.) Proceedings of the 18th BeNeLux Conference on Artificial Intelligence, Namur, Belgium, pp. 83-91 (2006)
-
(2006)
Proceedings of the 18th BeNeLux Conference on Artificial Intelligence, Namur, Belgium
, pp. 83-91
-
-
Chaslot, G.1
Saito, J.-T.2
Bouzy, B.3
Uiterwijk, J.W.H.M.4
Van Den Herik, H.J.5
-
5
-
-
67650687540
-
Progressive strategies for monte-carlo tree search
-
Wang, P., et al. (eds.) World Scientific Publishing Co. Pte. Ltd., Singapore
-
Chaslot, G., Winands, M., Uiterwijk, J., van den Herik, H., Bouzy, B.: Progressive strategies for monte-carlo tree search. In: Wang, P., et al. (eds.) Proceedings of the 10th Joint Conference on Information Sciences (JCIS 2007), pp. 655-661. World Scientific Publishing Co. Pte. Ltd., Singapore (2007)
-
(2007)
Proceedings of the 10th Joint Conference on Information Sciences (JCIS 2007)
, pp. 655-661
-
-
Chaslot, G.1
Winands, M.2
Uiterwijk, J.3
Van Den Herik, H.4
Bouzy, B.5
-
6
-
-
34547971839
-
Efficient selectivity and backup operators in monte-carlo tree search
-
Ciancarini, P., van den Herik, H.J. (eds.)
-
Coulom, R.: Efficient selectivity and backup operators in monte-carlo tree search. In: Ciancarini, P., van den Herik, H.J. (eds.) Proceedings of the 5th International Conference on Computers and Games, Turin, Italy (2006)
-
Proceedings of the 5th International Conference on Computers and Games, Turin, Italy (2006)
-
-
Coulom, R.1
-
7
-
-
71149107214
-
Bandit-Based Optimization on Graphs with Application to Library Performance Tuning
-
De Mesmay, F., Rimmel, A., Voronenko, Y., Püschel, M.: Bandit-Based Optimization on Graphs with Application to Library Performance Tuning. In: ICML, Montréal, Canada (2009)
-
ICML, Montréal, Canada (2009)
-
-
De Mesmay, F.1
Rimmel, A.2
Voronenko, Y.3
Püschel, M.4
-
9
-
-
33750293964
-
Bandit-based monte-carlo planning
-
Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. Springer, Heidelberg
-
Kocsis, L., Szepesvari, C.: Bandit-based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282-293. Springer, Heidelberg (2006)
-
(2006)
LNCS (LNAI)
, vol.4212
, pp. 282-293
-
-
Kocsis, L.1
Szepesvari, C.2
-
10
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985)
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.1
Robbins, H.2
-
11
-
-
71249086073
-
The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments
-
Lee, C.-S., Wang, M.-H., Chaslot, G., Hoock, J.-B., Rimmel, A., Teytaud, O., Tsai, S.-R., Hsu, S.-C., Hong, T.-P.: The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments. IEEE Transactions on Computational Intelligence and AI in games (2009)
-
(2009)
IEEE Transactions on Computational Intelligence and AI in Games
-
-
Lee, C.-S.1
Wang, M.-H.2
Chaslot, G.3
Hoock, J.-B.4
Rimmel, A.5
Teytaud, O.6
Tsai, S.-R.7
Hsu, S.-C.8
Hong, T.-P.9
-
13
-
-
84867854715
-
Optimal robust expensive optimization is tractable
-
ACM, New York G.: Mathematics of Computing/G.1: NUMERICAL ANALYSIS/G.1.6: Optimization, I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.8: Problem Solving, Control Methods, and Search
-
Rolet, P., Sebag, M., Teytaud, O.: Optimal robust expensive optimization is tractable. In: Gecco 2009, Montréal Canada, 8 p. ACM, New York (2009); G.: Mathematics of Computing/G.1: NUMERICAL ANALYSIS/G.1.6: Optimization, I.: Computing Methodologies/I.2: ARTIFICIAL INTELLIGENCE/I.2.8: Problem Solving, Control Methods, and Search
-
(2009)
Gecco 2009, Montréal Canada
-
-
Rolet, P.1
Sebag, M.2
Teytaud, O.3
-
15
-
-
84863381440
-
Algorithms for infinitely many-armed bandits
-
Wang, Y., Audibert, J.-Y., Munos, R.: Algorithms for infinitely many-armed bandits. In: Advances in Neural Information Processing Systems, vol. 21 (2008)
-
(2008)
Advances in Neural Information Processing Systems
, vol.21
-
-
Wang, Y.1
Audibert, J.-Y.2
Munos, R.3
-
16
-
-
34547981323
-
Modifications of UCT and sequence-like simulations for Monte-Carlo Go
-
Wang, Y., Gelly, S.: Modifications of UCT and sequence-like simulations for Monte-Carlo Go. In: IEEE Symposium on Computational Intelligence and Games, Honolulu, Hawaii, pp. 175-182 (2007)
-
(2007)
IEEE Symposium on Computational Intelligence and Games, Honolulu, Hawaii
, pp. 175-182
-
-
Wang, Y.1
Gelly, S.2
|