-
1
-
-
0036568025
-
Finite time analysis of the multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite time analysis of the multiarmed bandit problem. Machine Learning, 47(2-3):235-256, 2002.
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
2
-
-
0037709910
-
The nonstochastic multiarmed bandit problem
-
P. Auer, N. Cesa-Bianchi, Y. Freund, and R.E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32:48-77, 2002.
-
(2002)
SIAM Journal on Computing
, vol.32
, pp. 48-77
-
-
Auer, P.1
Cesa-Bianchi, N.2
Freund, Y.3
Schapire, R.E.4
-
3
-
-
0003915098
-
-
Technical report 91-57, Computer Science Department, University of Massachusetts
-
A.G. Barto, S.J. Bradtke, and S.P. Singh. Real-time learning and control using asynchronous dynamic programming. Technical report 91-57, Computer Science Department, University of Massachusetts, 1991.
-
(1991)
Real-time Learning and Control Using Asynchronous Dynamic Programming
-
-
Barto, A.G.1
Bradtke, S.J.2
Singh, S.P.3
-
4
-
-
0036149710
-
The challenge of poker
-
D. Billings, A. Davidson, J. Schaeffer, and D. Szafron. The challenge of poker. Artificial Intelligence, 134:201-240, 2002.
-
(2002)
Artificial Intelligence
, vol.134
, pp. 201-240
-
-
Billings, D.1
Davidson, A.2
Schaeffer, J.3
Szafron, D.4
-
5
-
-
24944458186
-
Monte Carlo Go developments
-
H.J. van den Herik, H. Iida, and E.A. Heinz, editors
-
B. Bouzy and B. Helmstetter. Monte Carlo Go developments. In H.J. van den Herik, H. Iida, and E.A. Heinz, editors, Advances in Computer Games 10, pages 159-174, 2004.
-
(2004)
Advances in Computer Games
, vol.10
, pp. 159-174
-
-
Bouzy, B.1
Helmstetter, B.2
-
6
-
-
14644444172
-
An adaptive sampling algorithm for solving Markov decision processes
-
H.S. Chang, M. Fu, J. Hu, and S.I. Marcus. An adaptive sampling algorithm for solving Markov decision processes. Operations Research, 53(1):126-139, 2005.
-
(2005)
Operations Research
, vol.53
, Issue.1
, pp. 126-139
-
-
Chang, H.S.1
Fu, M.2
Hu, J.3
Marcus, S.I.4
-
7
-
-
80053628578
-
Monte Carlo planning in RTS games
-
M. Chung, M. Buro, and J. Schaeffer. Monte Carlo planning in RTS games. In CIG 2005, Colchester, UK, 2005.
-
(2005)
CIG 2005, Colchester, UK
-
-
Chung, M.1
Buro, M.2
Schaeffer, J.3
-
8
-
-
84880649215
-
A sparse sampling algorithm for nearoptimal planning in large Markovian decisi on processes
-
M. Kearns, Y. Mansour, and A.Y. Ng. A sparse sampling algorithm for nearoptimal planning in large Markovian decisi on processes. In Proceedings of IJCAI'99, pages 1324-1331, 1999.
-
(1999)
Proceedings of IJCAI'99
, pp. 1324-1331
-
-
Kearns, M.1
Mansour, Y.2
Ng, A.Y.3
-
9
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
T.L. Lai and H. Robbins. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 6:4-22, 1985.
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.L.1
Robbins, H.2
-
10
-
-
85017333287
-
On-line search for solving Markov decision processes via heuristic sampling
-
R.L. de Mántaras and L. Saitta, editors
-
L. Péret and F. Garcia. On-line search for solving Markov decision processes via heuristic sampling. In R.L. de Mántaras and L. Saitta, editors, ECAI, pages 530-534, 2004.
-
(2004)
ECAI
, pp. 530-534
-
-
Péret, L.1
Garcia, F.2
-
11
-
-
0036146034
-
World-championship-caliber Scrabble
-
B. Sheppard. World-championship-caliber Scrabble. Artificial Intelligence, 134(1-2):241-275, 2002.
-
(2002)
Artificial Intelligence
, vol.134
, Issue.1-2
, pp. 241-275
-
-
Sheppard, B.1
-
12
-
-
0028604197
-
An analysis of forward pruning
-
S.J.J. Smith and D.S. Nau. An analysis of forward pruning. In AAAI, pages 1386-1391, 1994.
-
(1994)
AAAI
, pp. 1386-1391
-
-
Smith, S.J.J.1
Nau, D.S.2
-
13
-
-
84898992015
-
On-line policy improvement using Monte-Carlo search
-
M.C. Mozer, M.I. Jordan, and T. Petsche, editors
-
G. Tesauro and G.R. Galperin. On-line policy improvement using Monte-Carlo search. In M.C. Mozer, M.I. Jordan, and T. Petsche, editors, NIPS 9, pages 1068-1074, 1997.
-
(1997)
NIPS
, vol.9
, pp. 1068-1074
-
-
Tesauro, G.1
Galperin, G.R.2
|