-
2
-
-
17444409624
-
A tutorial on the cross-entropy method
-
Boer, P. de, Kroese, D., Mannor, S., and Rubinstein, R. (2004). A tutorial on the cross-entropy method. Annals of Operations Research, Vol.1, No. 134, pp. 19-67.
-
(2004)
Annals of Operations Research
, vol.1
, Issue.134
, pp. 19-67
-
-
De Boer, P.1
Kroese, D.2
Mannor, S.3
Rubinstein, R.4
-
3
-
-
60849092905
-
Cross-entropy for monte-carlo tree search
-
Chaslot, G., Winands, M., Szita, I., and Herik, H. J. van den (2008). Cross-Entropy for Monte-Carlo Tree Search. 1CGA Journal, Vol.31, No.3, pp. 145-157.
-
(2008)
1CGA Journal
, vol.31
, Issue.3
, pp. 145-157
-
-
Chaslot, G.1
Winands, M.2
Szita, I.3
Van Den Herik, H.J.4
-
4
-
-
35248818685
-
Tetris is hard, even to approximate
-
Demaine, E. D., Hohenberger, S., and Liben-Nowell, D. (2003). Tetris is hard, even to approximate. Proc. 9th International Computing and Combinatorics Conference (COCOON 2003), pp. 351-363.
-
(2003)
Proc. 9th International Computing and Combinatorics Conference (COCOON 2003)
, pp. 351-363
-
-
Demaine, E.D.1
Hohenberger, S.2
Liben-Nowell, D.3
-
8
-
-
0035377566
-
Completely derandomized self-adaptation in evolution strategies
-
Hansen, N. and Ostermeier, A. (2001). Completely Derandomized Self-Adaptation in Evolution Strategies. Evolutionary Computation, Vol.9, No.2, pp. 159-195.
-
(2001)
Evolutionary Computation
, vol.9
, Issue.2
, pp. 159-195
-
-
Hansen, N.1
Ostermeier, A.2
-
10
-
-
35048819671
-
Least-squares methods in reinforcement learning for control
-
Springer-Verlag, London, UK
-
Lagoudakis, M. G., Parr, R., and Littman, M. L. (2002). Least-squares methods in reinforcement learning for control. SETN '02: Proceedings of the Second Hellenic Conference on AI, pp. 249-260, Springer-Verlag, London, UK.
-
(2002)
SETN '02: Proceedings of the Second Hellenic Conference on AI
, pp. 249-260
-
-
Lagoudakis, M.G.1
Parr, R.2
Littman, M.L.3
-
11
-
-
84876627521
-
-
Xtris readme
-
Llima, R. E. (2005). Xtris readme. http://www.iagora.com/~espel/xtris/ README.
-
(2005)
-
-
Llima, R.E.1
-
12
-
-
33845323186
-
On the numeric stability of gaussian processes regression for relational reinforcement learning
-
Ramon, J. and Driessens, K. (2004). On the numeric stability of gaussian processes regression for relational reinforcement learning. ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14.
-
(2004)
ICML-2004 Workshop on Relational Reinforcement Learning
, pp. 10-14
-
-
Ramon, J.1
Driessens, K.2
-
13
-
-
33845344721
-
Learning tetris using the noisy cross-entropy method
-
Szita, I. and Lörincz, A. (2006). Learning Tetris Using the Noisy Cross-Entropy Method. Neural Computation, Vol.18, No.12, pp. 2936-2941.
-
(2006)
Neural Computation
, vol.18
, Issue.12
, pp. 2936-2941
-
-
Szita, I.1
Lörincz, A.2
-
14
-
-
70350140182
-
Building controllers for tetris
-
Thiery, C. and Scherrer, B. (2009). Building Controllers for Tetris. ICGA Journal, Vol.32, No.1, pp. 3-11.
-
(2009)
ICGA Journal
, vol.32
, Issue.1
, pp. 3-11
-
-
Thiery, C.1
Scherrer, B.2
-
15
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
Tsitsiklis, J. N. and Roy, B. van (1996). Feature-Based Methods for Large Scale Dynamic Programming. Machine Learning, Vol.22, pp. 59-94.
-
(1996)
Machine Learning
, vol.22
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
|