-
1
-
-
0003787146
-
-
Princeton University Press, Princeton, NJ
-
Bellman, R. E. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
-
3
-
-
0004302610
-
-
John Wiley & Sons, New York, N.Y., 3rd edition
-
Billingsley, P. (1995). Probability and measure. John Wiley & Sons, New York, N.Y., 3rd edition.
-
(1995)
Probability and Measure
-
-
Billingsley, P.1
-
4
-
-
17444409624
-
A tutorial on the cross-entropy method
-
Boer, P. de, Kroese, D., Mannor, S., and Rubinstein, R. (2004). A tutorial on the cross-entropy method. Annals of Operations Research, Vol.1, No. 134, pp. 19-67.
-
(2004)
Annals of Operations Research
, vol.1
, Issue.134
, pp. 19-67
-
-
De Boer, P.1
Kroese, D.2
Mannor, S.3
Rubinstein, R.4
-
6
-
-
0442320716
-
How to lose at tetris
-
Burgiel, H. (1997). How to Lose at Tetris. Mathematical Gazette, Vol.81, pp. 194-200.
-
(1997)
Mathematical Gazette
, vol.81
, pp. 194-200
-
-
Burgiel, H.1
-
7
-
-
79959391963
-
-
Technical report, Computer Science department of Rhodes University
-
Carr, D. (2005). Applying reinforcement learning to Tetris. Technical report, Computer Science department of Rhodes University.
-
(2005)
Applying Reinforcement Learning to Tetris
-
-
Carr, D.1
-
8
-
-
35248818685
-
Tetris is hard, even to approximate
-
Demaine, E. D., Hohenberger, S., and Liben-Nowell, D. (2003). Tetris is hard, even to approximate. Proc. 9th International Computing and Combinatorics Conference (COCOON 2003), pp. 351-363.
-
(2003)
Proc. 9th International Computing and Combinatorics Conference (COCOON 2003)
, pp. 351-363
-
-
Demaine, E.D.1
Hohenberger, S.2
Liben-Nowell, D.3
-
12
-
-
35048819671
-
Least-squares methods in reinforcement learning for control
-
Springer-Verlag, London, UK
-
Lagoudakis, M. G., Parr, R., and Littman, M. L. (2002). Least-squares methods in reinforcement learning for control. SETN '02: Proceedings of the Second Hellenic Conference on AI, pp. 249-260, Springer-Verlag, London, UK.
-
(2002)
SETN '02: Proceedings of the Second Hellenic Conference on AI
, pp. 249-260
-
-
Lagoudakis, M.G.1
Parr, R.2
Littman, M.L.3
-
13
-
-
84876663644
-
-
Xtris readme.
-
Llima, R. E. (2005). Xtris readme. http://www.iagora.com/~espel/xtris/ README.
-
(2005)
-
-
Llima, R.E.1
-
15
-
-
33845323186
-
On the numeric stability of gaussian processes regression for relational reinforcement learning
-
Ramon, J. and Driessens, K. (2004). On the numeric stability of gaussian processes regression for relational reinforcement learning. ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14.
-
(2004)
ICML-2004 Workshop on Relational Reinforcement Learning
, pp. 10-14
-
-
Ramon, J.1
Driessens, K.2
-
17
-
-
33845344721
-
Learning tetris using the noisy cross-entropy method
-
Szita, I. and Loyincz, A. (2006). Learning Tetris Using the Noisy Cross-Entropy Method. Neural Computation, Vol.18, No.12, pp. 2936-2941.
-
(2006)
Neural Computation
, vol.18
, Issue.12
, pp. 2936-2941
-
-
Szita, I.1
Loyincz, A.2
-
18
-
-
70350158274
-
Construction d'un joueur artificiel pour tetris
-
Thiery, C. and Scherrer, B. (2009). Construction d'un joueur artificiel pour Tetris. Revue d'Intelligence Artifi-cielle, Vol.23, pp. 387-407.
-
(2009)
Revue d'Intelligence Artifi-cielle
, vol.23
, pp. 387-407
-
-
Thiery, C.1
Scherrer, B.2
-
19
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
Tsitsiklis, J. N. and Roy, B. van (1996). Feature-Based Methods for Large Scale Dynamic Programming. Machine Learning, Vol.22, pp. 59-94.
-
(1996)
Machine Learning
, vol.22
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
|