-
1
-
-
33750293964
-
Bandit-based monte-carlo planning
-
Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. Springer, Heidelberg
-
Kocsis, L., Szepesvari, C.: Bandit-based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol.4212, pp. 282-293. Springer, Heidelberg (2006)
-
(2006)
LNCS (LNAI)
, vol.4212
, pp. 282-293
-
-
Kocsis, L.1
Szepesvari, C.2
-
4
-
-
71149107214
-
Bandit-based optimization on graphs with application to library performance tuning
-
Montŕeal Canada
-
De Mesmay, F., Rimmel, A., Voronenko, Y., Püschel, M.: Bandit-Based Optimization on Graphs with Application to Library Performance Tuning. In: International Conference on Machine Learning, Montŕeal Canada (2009)
-
(2009)
International Conference on Machine Learning
-
-
De Mesmay, F.1
Rimmel, A.2
Voronenko, Y.3
Püschel, M.4
-
6
-
-
77952702283
-
Adding expert knowledge and exploration in Monte-Carlo Tree Search
-
Springer, Heidelberg
-
Chaslot, G., Fiter, C., Hoock, J.B., Rimmel, A., Teytaud, O.: Adding expert knowledge and exploration in Monte-Carlo Tree Search. In: Advances in Computer Games, Pamplona Espagne. Springer, Heidelberg (2009)
-
(2009)
Advances in Computer Games Pamplona Espagne
-
-
Chaslot, G.1
Fiter, C.2
Hoock, J.B.3
Rimmel, A.4
Teytaud, O.5
-
8
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R.S.: Learning to predict by the methods of temporal differences. Machine Learning, 9-44 (1988)
-
(1988)
Machine Learning
, pp. 9-44
-
-
Sutton, R.S.1
-
9
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47(2/3), 235-256 (2002)
-
(2002)
Machine Learning
, vol.47
, Issue.2-3
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
10
-
-
0002899547
-
Asymptotically efficient adaptive allocation rules
-
Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985)
-
(1985)
Advances in Applied Mathematics
, vol.6
, pp. 4-22
-
-
Lai, T.1
Robbins, H.2
-
11
-
-
26944466214
-
Function approximation via tile coding: Automating parameter choice
-
Zucker, J.-D., Saitta, L. (eds.) SARA 2005. Springer, Heidelberg
-
Sherstov, E.A., Stone, P.: Function approximation via tile coding: Automating parameter choice. In: Zucker, J.-D., Saitta, L. (eds.) SARA 2005. LNCS (LNAI), vol.3607, pp. 194-205. Springer, Heidelberg (2005)
-
(2005)
LNCS (LNAI)
, vol.3607
, pp. 194-205
-
-
Sherstov, E.A.1
Stone, P.2
-
12
-
-
78951480078
-
Creating an Upper-Confidence-Tree program for Havannah
-
Pamplona Espagne
-
Teytaud, F., Teytaud, O.: Creating an Upper-Confidence-Tree program for Havannah. In: Advances in Computer Games 12, Pamplona Espagne (2009)
-
(2009)
Advances in Computer Games
, vol.12
-
-
Teytaud, F.1
Teytaud, O.2
|