-
1
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Auer, P.; Cesa-Bianchi, N.; and Fischer, P. 2002. Finite-time analysis of the multiarmed bandit problem. Mach. Learn. 47:235-256.
-
(2002)
Mach. Learn
, vol.47
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
2
-
-
84868274640
-
Pachi: State of the art open source Go program
-
Braudiš, P., and Loup Gailly, J. 2011. Pachi: State of the art open source Go program. In ACG 13.
-
(2011)
ACG
, vol.13
-
-
Braudiš, P.1
Loup Gailly, J.2
-
3
-
-
79952624396
-
Pure exploration in finitely-armed and continuous-armed bandits
-
Bubeck, S.; Munos, R.; and Stoltz, G. 2011. Pure exploration in finitely-armed and continuous-armed bandits. Theor. Comput. Sci. 412(19):1832-1852.
-
(2011)
Theor. Comput. Sci
, vol.412
, Issue.19
, pp. 1832-1852
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
-
4
-
-
77953733536
-
-
Technical report, University of Alberta, Dept. of Computing Science, TR09-08
-
Enzenberger, M., and Müller, M. 2009. Fuego - An Open-source Framework for Board Games and Go Engine Based on Monte-Carlo Tree Search. Technical report, University of Alberta, Dept. of Computing Science, TR09-08.
-
(2009)
Fuego - An Open-source Framework for Board Games and Go Engine Based on Monte-Carlo Tree Search
-
-
Enzenberger, M.1
Müller, M.2
-
5
-
-
85167430664
-
High-quality policies for the Canadian traveler's problem
-
Eyerich, P.; Keller, T.; and Helmert, M. 2010. High-quality policies for the Canadian traveler's problem. In In Proc. AAAI 2010, 51-58.
-
(2010)
Proc. AAAI 2010
, pp. 51-58
-
-
Eyerich, P.1
Keller, T.2
Helmert, M.3
-
6
-
-
84868288849
-
Exploration exploitation in Go: UCT for Monte-Carlo Go
-
Gelly, S., and Wang, Y. 2006. Exploration exploitation in Go: UCT for Monte-Carlo Go. Computer.
-
(2006)
Computer
-
-
Gelly, S.1
Wang, Y.2
-
7
-
-
84868291906
-
-
Technical Report UCB/EECS-2011-119, EECS Department, University of California, Berkeley
-
Hay, N., and Russell, S. J. 2011. Metareasoning for Monte Carlo Tree Search. Technical Report UCB/EECS-2011-119, EECS Department, University of California, Berkeley.
-
(2011)
Metareasoning for Monte Carlo Tree Search
-
-
Hay, N.1
Russell, S. J.2
-
9
-
-
33750293964
-
Bandit based Monte-Carlo planning
-
Kocsis, L., and Szepesvári, C. 2006. Bandit based Monte-Carlo planning. In ECML, 282-293.
-
(2006)
ECML
, pp. 282-293
-
-
Kocsis, L.1
Szepesvári, C.2
-
12
-
-
84859009761
-
Semimyopic measurement selection for optimization under uncertainty
-
Tolpin, D., and Shimony, S. E. 2012. Semimyopic measurement selection for optimization under uncertainty. IEEE Transactions on Systems, Man, and Cybernetics, Part B 42(2):565-579.
-
(2012)
IEEE Transactions on Systems, Man, and Cybernetics, Part B
, vol.42
, Issue.2
, pp. 565-579
-
-
Tolpin, D.1
Shimony, S. E.2
-
13
-
-
33646406807
-
Multi-armed bandit algorithms and empirical evaluation
-
Vermorel, J., and Mohri, M. 2005. Multi-armed bandit algorithms and empirical evaluation. In ECML, 437-448.
-
(2005)
ECML
, pp. 437-448
-
-
Vermorel, J.1
Mohri, M.2
|