-
1
-
-
0345224411
-
The continuum-armed bandit problem
-
Agrawal R., « The Continuum-Armed Bandit Problem», SIAM J. Control Optim., vol. 33, no 6, p. 1926-1951, 1995.
-
(1995)
SIAM J. Control Optim.
, vol.33
, Issue.6
, pp. 1926-1951
-
-
Agrawal, R.1
-
2
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Auer P., Cesa-Bianchi N., Fischer P., « Finite-time analysis of the multiarmed bandit problem», Machine Learning, vol. 47, no 2/3, p. 235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
3
-
-
0000768035
-
Denumerable-armed bandits
-
September, available at
-
Banks J. S., Sundaram R. K., « Denumerable-Armed Bandits », Econometrica, vol. 60, no 5, p. 1071-96, September, 1992. available at http://ideas.repec.org/a/ecm/emetrp/v60y1992i5p1071-96.html.
-
(1992)
Econometrica
, vol.60
, Issue.5
, pp. 1071-1096
-
-
Banks, J.S.1
Sundaram, R.K.2
-
5
-
-
80053655011
-
Bayesian generation and integration of k-nearest-neighbor patterns for 19×19 Go
-
G. Kendall and Simon Lucas, editors, Colchester, UK
-
Bouzy B., Chaslot G., « Bayesian generation and integration of k-nearest-neighbor patterns for 19×19 Go », G. Kendall and Simon Lucas, editors, IEEE 2005 Symposium on Computational Intelligence in Games, Colchester, UK, p. 176-181, 2005.
-
(2005)
IEEE 2005 Symposium on Computational Intelligence in Games
, pp. 176-181
-
-
Bouzy, B.1
Chaslot, G.2
-
6
-
-
84902513084
-
Monte Carlo go developments
-
E. A. Heinz, H. J. van den Herik, H. Iida eds, Kluwer Academic Publishers
-
Bouzy B., Helmstetter B., « Monte Carlo go developments », in E. A. Heinz, H. J. van den Herik, H. Iida (eds), 10th Advances in Computer Games, Kluwer Academic Publishers, p. 159-174, 2003.
-
(2003)
10th Advances in Computer Games
, pp. 159-174
-
-
Bouzy, B.1
Helmstetter, B.2
-
8
-
-
84874005312
-
Monte-Carlo strategies for computer go
-
P.-Y. Schobbens, W. Vanhoof, G. Schwanen eds, Namur, Belgium
-
Chaslot G., Saito J.-T., Bouzy B., Uiterwijk J. W. H. M., van den Herik H. J., « Monte-Carlo Strategies for Computer Go », in P.-Y. Schobbens, W. Vanhoof, G. Schwanen (eds), Proceedings of the 18th BeNeLux Conference on Artificial Intelligence, Namur, Belgium, p. 83-91, 2006.
-
(2006)
Proceedings of the 18th BeNeLux Conference on Artificial Intelligence
, pp. 83-91
-
-
Chaslot, G.1
Saito, J.-T.2
Bouzy, B.3
Uiterwijk, J.W.H.M.4
Van Den Herik, H.J.5
-
9
-
-
67650687540
-
Progressive strategies for Monte-Carlo tree search
-
P. Wang et al. eds, World Scientific Publishing Co. Pte. Ltd.
-
Chaslot G., Winands M., Uiterwijk J., van den Herik H., Bouzy B., « Progressive Strategies for Monte-Carlo Tree Search », in P. Wang et al. (eds), Proceedings of the 10th Joint Conference on Information Sciences (JCIS 2007), World Scientific Publishing Co. Pte. Ltd., p. 655-661, 2007.
-
(2007)
Proceedings of the 10th Joint Conference on Information Sciences (JCIS 2007)
, pp. 655-661
-
-
Chaslot, G.1
Winands, M.2
Uiterwijk, J.3
Van Den Herik, H.4
Bouzy, B.5
-
11
-
-
77954359576
-
-
P. Ciancarini and H. J. van den Herik, editors, Proceedings of the 5th International Conference on Computers and Games, Turin, Italy
-
Coulom R., « Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search », In P. Ciancarini and H. J. van den Herik, editors, Proceedings of the 5th International Conference on Computers and Games, Turin, Italy, 2006.
-
(2006)
Efficient Selectivity and Backup Operators in Monte-Carlo Tree Search
-
-
Coulom, R.1
-
12
-
-
70349287633
-
Computing elo ratings of move patterns in the game of go
-
Amsterdam, The Netherlands
-
Coulom R., « Computing Elo Ratings of Move Patterns in the Game of Go », Computer Games Workshop, Amsterdam, The Netherlands, 2007.
-
(2007)
Computer Games Workshop
-
-
Coulom, R.1
-
13
-
-
33244456637
-
Robbing the bandit: Less regret in online geometric optimization against an adaptive adversary
-
ACM Press, New York, NY, USA
-
Dani V., Hayes T. P., « Robbing the bandit: less regret in online geometric optimization against an adaptive adversary », SODA '06: Proceedings of the seventeenth annual ACM-SIAM symposium on Discrete algorithm, ACM Press, New York, NY, USA, p. 937-943, 2006.
-
(2006)
SODA '06: Proceedings of the Seventeenth Annual ACM-SIAM Symposium on Discrete Algorithm
, pp. 937-943
-
-
Dani, V.1
Hayes, T.P.2
-
14
-
-
34547990649
-
Combining online and offline knowledge in UCT
-
ACM Press, New York, NY, USA
-
Gelly S., Silver D., « Combining online and offline knowledge in UCT », ICML '07: Proceedings of the 24th international conference on Machine learning, ACM Press, New York, NY, USA, p. 273-280, 2007.
-
(2007)
ICML '07: Proceedings of the 24th International Conference on Machine Learning
, pp. 273-280
-
-
Gelly, S.1
Silver, D.2
-
15
-
-
33750293964
-
Bandit-based Monte-Carlo planning
-
Kocsis L., Szepesvari C., « Bandit-based Monte-Carlo Planning », ECML'06, p. 282-293, 2006.
-
(2006)
ECML'06
, pp. 282-293
-
-
Kocsis, L.1
Szepesvari, C.2
-
16
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai T., Robbins H., « Asymptotically efficient adaptive allocation rules », Advances in applied mathematics, vol. 6, p. 4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.1
Robbins, H.2
-
17
-
-
84886997003
-
SVM and pattern-enriched common fate graphs for the game of Go
-
Ralaivola L., Wu L., Baldi P., « SVM and pattern-enriched common fate graphs for the game of Go », Proceedings of ESANN 2005, p. 485-490, 2005.
-
(2005)
Proceedings of ESANN 2005
, pp. 485-490
-
-
Ralaivola, L.1
Wu, L.2
Baldi, P.3
-
18
-
-
34547981323
-
Modifications of UCT and sequence-like simulations for Monte-Carlo go
-
Honolulu, Hawaii
-
Wang Y., Gelly S., « Modifications of UCT and sequence-like simulations for Monte-Carlo Go », IEEE Symposium on Computational Intelligence and Games, Honolulu, Hawaii, p. 175-182, 2007.
-
(2007)
IEEE Symposium on Computational Intelligence and Games
, pp. 175-182
-
-
Wang, Y.1
Gelly, S.2
|