-
1
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
DOI 10.1023/A:1013689704352, Computational Learning Theory
-
P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002 (Pubitemid 34126111)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
2
-
-
14644444172
-
An adaptive sampling algorithm for solving Markov decision processes
-
DOI 10.1287/opre.1040.0145
-
H. S. Chang, M. C. Fu, J. Hu, and S. I. Marcus. An adaptive sampling algorithm for solving Markov decision processes. Operations Research, 53(1):126-139, 2005 (Pubitemid 40320220)
-
(2005)
Operations Research
, vol.53
, Issue.1
, pp. 126-139
-
-
Chang, H.S.1
Fu, M.C.2
Hu, J.3
Marcus, S.I.4
-
3
-
-
78751700612
-
Monte Carlo tree search techniques in the game of Kriegspiel
-
P. Ciancarini and G. P. Favini. Monte Carlo tree search techniques in the game of Kriegspiel. In IJCAI-09, 2009
-
(2009)
IJCAI-09
-
-
Ciancarini, P.1
Favini, G.P.2
-
5
-
-
38049037928
-
Efficient selection and backup operators in Monte-Carlo tree search
-
5th Intl. Conf. on Computer and Games, Turin, Italy, May
-
R. Coulom. Efficient selection and backup operators in Monte-Carlo tree search. In 5th Intl. Conf. on Computer and Games, vol. 4360 of LNCS, pp. 72-83, Turin, Italy, May 2006
-
(2006)
LNCS
, vol.4360
, pp. 72-83
-
-
Coulom, R.1
-
6
-
-
57749181518
-
Simulation-based approach to general game playing
-
AAAI Press. ISBN 978-1-57735-368-3
-
H. Finnsson and Y. Björnsson. Simulation-based approach to general game playing. In AAAI-08, pp. 259-264. AAAI Press, 2008. ISBN 978-1-57735-368-3
-
(2008)
AAAI-08
, pp. 259-264
-
-
Finnsson, H.1
Björnsson, Y.2
-
7
-
-
34547990649
-
Combining online and online knowledge in UCT
-
Corvallis, OR, June
-
S. Gelly and D. Silver. Combining online and online knowledge in UCT. In 24th ICML, pp. 273-280, Corvallis, OR, June 2007
-
(2007)
24th ICML
, pp. 273-280
-
-
Gelly, S.1
Silver, D.2
-
8
-
-
57749091602
-
Achieving master level play in 9×9 computer Go
-
Chicago, IL, July
-
S. Gelly and D. Silver. Achieving master level play in 9×9 computer Go. In 23rd AAAI, pp. 1537-1540, Chicago, IL, July 2008
-
(2008)
23rd AAAI
, pp. 1537-1540
-
-
Gelly, S.1
Silver, D.2
-
9
-
-
84880657398
-
GIB: Steps toward an expert-level bridge-playing program
-
M. L. Ginsberg. GIB: Steps toward an expert-level bridge-playing program. In IJCAI-99, pp. 584-589, 1999
-
(1999)
IJCAI-99
, pp. 584-589
-
-
Ginsberg, M.L.1
-
10
-
-
33750293964
-
Bandit based Monte-Carlo planning
-
Machine Learning: ECML 2006 - 17th European Conference on Machine Learning, Proceedings
-
L. Kocsis and C. Szepesvári. Bandit based Monte-Carlo planning. In 17th ECML, vol. 4212 of LNCS, pp. 282-293, Berlin, Germany, Sept. 2006 (Pubitemid 44618839)
-
(2006)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
, vol.LNAI4212
, pp. 282-293
-
-
Kocsis, L.1
Szepesvari, C.2
-
11
-
-
0020717461
-
Pathology on game trees revisited, and an alternative to minimaxing
-
D. S. Nau. Pathology on game trees revisited, and an alternative to minimaxing. Artif. Intell., 21(1-2):221-244, 1983
-
(1983)
Artif. Intell.
, vol.21
, Issue.1-2
, pp. 221-244
-
-
Nau, D.S.1
-
12
-
-
0020787874
-
On the nature of pathology in game searching
-
J. Pearl. On the nature of pathology in game searching. Artif. Intell., 20(4):427-453, 1983
-
(1983)
Artif. Intell.
, vol.20
, Issue.4
, pp. 427-453
-
-
Pearl, J.1
-
13
-
-
78650622420
-
On adversarial search spaces and sampling-based planning
-
Toronto, Canada, May
-
R. Ramanujan, A. Sabharwal, and B. Selman. On adversarial search spaces and sampling-based planning. In 20th ICAPS, pp. 242-245, Toronto, Canada, May 2010
-
(2010)
20th ICAPS
, pp. 242-245
-
-
Ramanujan, R.1
Sabharwal, A.2
Selman, B.3
-
14
-
-
0036146034
-
World-championship-caliber Scrabble
-
DOI 10.1016/S0004-3702(01)00166-7, PII S0004370201001667
-
B. Sheppard. World-championship-caliber Scrabble. Artif. Intell., 134(1-2):241-275, 2002 (Pubitemid 34086585)
-
(2002)
Artificial Intelligence
, vol.134
, Issue.1-2
, pp. 241-275
-
-
Sheppard, B.1
|