-
2
-
-
33845339117
-
Evolving a heuristic function for the game of Tetris
-
T. Scheffer (Ed.). Berlin
-
Böhm, N., Kókai, G., & Mandl, S. (2004). Evolving a heuristic function for the game of Tetris. In T. Scheffer (Ed.), Proc. Lernen, Wissensentdeckung und Adaptivität LWA-2004 (pp. 118-122). Berlin.
-
(2004)
Proc. Lernen, Wissensentdeckung und Adaptivität LWA-2004
, pp. 118-122
-
-
Böhm, N.1
Kókai, G.2
Mandl, S.3
-
3
-
-
17444409624
-
A tutorial on the cross-entropy method
-
de Boer, P., Kroese, D., Mannor, S., & Rubinstein, R. (2004). A tutorial on the cross-entropy method. Annals of Operations Research, 134(1), 19-67.
-
(2004)
Annals of Operations Research
, vol.134
, Issue.1
, pp. 19-67
-
-
De Boer, P.1
Kroese, D.2
Mannor, S.3
Rubinstein, R.4
-
4
-
-
35248818685
-
Tetris is hard, even to approximate
-
Berlin: Springer
-
Demaine, E. D., Hohenberger, S., & Liben-Nowell, D. (2003). Tetris is hard, even to approximate. In Proc. 9th International Computing and Combinatorics Conference (COCOON 2003) (pp. 351-363). Berlin: Springer.
-
(2003)
Proc. 9th International Computing and Combinatorics Conference (COCOON 2003)
, pp. 351-363
-
-
Demaine, E.D.1
Hohenberger, S.2
Liben-Nowell, D.3
-
5
-
-
84863891863
-
-
Fahey, C. P. (2003). Tetris AI. Available online at http://www.colinfahey.com
-
(2003)
Tetris AI
-
-
Fahey, C.P.1
-
7
-
-
33646243319
-
A natural policy gradient
-
T. G. Dietterich, S. Becker, & Z. Ghahramani (Eds.). Cambridge, MA: MIT Press
-
Kakade, S. (2001). A natural policy gradient. In T. G. Dietterich, S. Becker, & Z. Ghahramani (Eds.), Advances in neural information processing systems, 14 (pp. 1531-1538). Cambridge, MA: MIT Press.
-
(2001)
Advances in Neural Information Processing Systems
, vol.14
, pp. 1531-1538
-
-
Kakade, S.1
-
9
-
-
1942516890
-
The cross-entropy method for fast policy search
-
Menlo Park, CA: AAAI Press
-
Mannor, S., Rubinstein, R. Y., & Gat, Y. (2003). The cross-entropy method for fast policy search. In Proc. International Conf. on Machine Learning (ICML 2003) (pp. 512-519). Menlo Park, CA: AAAI Press.
-
(2003)
Proc. International Conf. on Machine Learning (ICML 2003)
, pp. 512-519
-
-
Mannor, S.1
Rubinstein, R.Y.2
Gat, Y.3
-
10
-
-
17444414191
-
Basis function adaptation in temporal difference reinforcement learning
-
Menache, I., Mannor, S., & Shimkin, N. (2005). Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134(1), 215-238.
-
(2005)
Annals of Operations Research
, vol.134
, Issue.1
, pp. 215-238
-
-
Menache, I.1
Mannor, S.2
Shimkin, N.3
-
11
-
-
33845323186
-
On the numeric stability of Gaussian processes regression for relational reinforcement learning
-
N.p.: Omnipress
-
Ramon, J., & Driessens, K. (2004). On the numeric stability of Gaussian processes regression for relational reinforcement learning. In ICML-2004 Workshop on Relational Reinforcement Learning (pp. 10-14). N.p.: Omnipress.
-
(2004)
ICML-2004 Workshop on Relational Reinforcement Learning
, pp. 10-14
-
-
Ramon, J.1
Driessens, K.2