-
1
-
-
84864970677
-
Best arm identification in multi-armed bandits
-
J. Audibert, S. Bubeck, et al. Best arm identification in multi-armed bandits. In COLT, 2010.
-
(2010)
COLT
-
-
Audibert, J.1
Bubeck, S.2
-
3
-
-
0002426110
-
A single-sample multiple decision procedure for ranking means of normal populations with known variances
-
R. Bechhofer. A single-sample multiple decision procedure for ranking means of normal populations with known variances. The Annals of Mathematical Statistics, 25(1):16-39, 1954.
-
(1954)
The Annals of Mathematical Statistics
, vol.25
, Issue.1
, pp. 16-39
-
-
Bechhofer, R.1
-
5
-
-
84868274640
-
Pachi: State of the art open source go program
-
P. Braudiš and J. Loup Gailly. Pachi: State of the art open source Go program. In ACG 13, 2011.
-
(2011)
ACG
, vol.13
-
-
Braudis, P.1
Loup Gailly, J.2
-
6
-
-
79952624396
-
Pure exploration in finitely-armed and continuous-armed bandits
-
S. Bubeck, R. Munos, and G. Stoltz. Pure exploration in finitely-armed and continuous-armed bandits. Theor. Comput. Sci., 412(19):1832-1852, 2011.
-
(2011)
Theor. Comput. Sci.
, vol.412
, Issue.19
, pp. 1832-1852
-
-
Bubeck, S.1
Munos, R.2
Stoltz, G.3
-
8
-
-
84880655104
-
An analysis of time-dependent planning
-
T. Dean and M. Boddy. An analysis of time-dependent planning. In AAAI-88, pages 49-54, 1988.
-
(1988)
AAAI-88
, pp. 49-54
-
-
Dean, T.1
Boddy, M.2
-
9
-
-
0346986495
-
-
Technical Report CMU-CS-88-124, Computer Science Department, Carnegie-Mellon University, Pittsburgh, PA
-
J. Doyle. Artificial intelligence and rational self-government. Technical Report CMU-CS-88-124, Computer Science Department, Carnegie-Mellon University, Pittsburgh, PA, 1988.
-
(1988)
Artificial Intelligence and Rational Self-Government
-
-
Doyle, J.1
-
11
-
-
78651309095
-
Paradoxes in learning and the marginal value of information
-
P. Frazier and W. Powell. Paradoxes in learning and the marginal value of information. Decision Analysis, 2010.
-
(2010)
Decision Analysis
-
-
Frazier, P.1
Powell, W.2
-
12
-
-
79956202655
-
Monte-carlo tree search and rapid action value estimation in computer go
-
S. Gelly and D. Silver. Monte-carlo tree search and rapid action value estimation in computer go. Artificial Intelligence, 2011.
-
(2011)
Artificial Intelligence
-
-
Gelly, S.1
Silver, D.2
-
13
-
-
84868288849
-
Exploration exploitation in go: Uct for monte-carlo go
-
S. Gelly and Y. Wang. Exploration exploitation in Go: UCT for Monte-Carlo Go. Computer, 2006.
-
(2006)
Computer
-
-
Gelly, S.1
Wang, Y.2
-
16
-
-
0342354937
-
A five-year plan for automatic chess
-
In E. Dale and D. Michie, editors, Oliver and Boyd
-
I. J. Good. A five-year plan for automatic chess. In E. Dale and D. Michie, editors, Machine Intelligence 2, pages 89-118. Oliver and Boyd, 1968.
-
(1968)
Machine Intelligence
, vol.2
, pp. 89-118
-
-
Good, I.J.1
-
18
-
-
84947403595
-
Probability inequalities for sums of bounded random variables
-
ISSN 01621459
-
W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13-30, 1963. ISSN 01621459.
-
(1963)
Journal of the American Statistical Association
, vol.58
, Issue.301
, pp. 13-30
-
-
Hoeffding, W.1
-
20
-
-
0008091002
-
-
Elsevier
-
Also in L. Kanal, T. Levitt, and J. Lemmer, eds., Uncertainty in Artificial Intelligence 3, Elsevier, 1988, pp. 301-324.
-
(1988)
Uncertainty in Artificial Intelligence 3
, pp. 301-324
-
-
Kanal, L.1
Levitt, T.2
Lemmer, J.3
-
22
-
-
34547975806
-
Bandit based monte-carlo planning
-
L. Kocsis and C. Szepesvari. Bandit based monte-carlo planning. ECML, 2006.
-
(2006)
ECML
-
-
Kocsis, L.1
Szepesvari, C.2
-
24
-
-
0040235875
-
The economic value of analysis and computation
-
J. E. Matheson. The economic value of analysis and computation. Systems Science and Cybernetics, 4:325-332, 1968.
-
(1968)
Systems Science and Cybernetics
, vol.4
, pp. 325-332
-
-
Matheson, J.E.1
-
25
-
-
84898061133
-
Empirical bernstein bounds and sample-variance penalization
-
A. Maurer and M. Pontil. Empirical Bernstein bounds and sample-variance penalization. In COLT, 2009.
-
(2009)
COLT
-
-
Maurer, A.1
Pontil, M.2
-
30
-
-
0346271770
-
Discrete-event simulation optimization using ranking, selection, and multiple comparison procedures: A survey
-
J. R. Swisher, S. H. Jacobson, and E. Yucesan. Discrete-event simulation optimization using ranking, selection, and multiple comparison procedures: A survey. ACM Transactions on Modeling and Computer Simulation, 2003.
-
(2003)
ACM Transactions on Modeling and Computer Simulation
-
-
Swisher, J.R.1
Jacobson, S.H.2
Yucesan, E.3
-
31
-
-
85167395709
-
MCTS based on simple regret
-
AAAI Press To appear
-
D. Tolpin and S. E. Shimony. MCTS based on simple regret. In AAAI. AAAI Press, 2012a. To appear.
-
(2012)
AAAI
-
-
Tolpin, D.1
Shimony, S.E.2
-
32
-
-
84859009761
-
Semimyopic measurement selection for optimization under uncertainty
-
D. Tolpin and S. E. Shimony. Semimyopic measurement selection for optimization under uncertainty. IEEE Transactions on Systems, Man, and Cybernetics, Part B, 42(2):565-579, 2012b.
-
(2012)
IEEE Transactions on Systems, Man, and Cybernetics, Part B
, vol.42
, Issue.2
, pp. 565-579
-
-
Tolpin, D.1
Shimony, S.E.2
-
33
-
-
0000090155
-
Sequential tests of statistical hypotheses
-
A. Wald. Sequential tests of statistical hypotheses. Annals of Mathematical Statistics, 16:117-186, 1945.
-
(1945)
Annals of Mathematical Statistics
, vol.16
, pp. 117-186
-
-
Wald, A.1