SCOPUS Information Retrieval Platform

Volume 32, Issue 1, 2009, Pages 23-33

Improvements on learning tetris with cross entropy

LORIA (France)

Author keywords

[No Author keywords available]

Indexed keywords

REINFORCEMENT LEARNING;

CROSS ENTROPY; CROSS-ENTROPY METHOD; DOMAIN REINFORCEMENT; SETS OF FEATURES;

CONTROLLERS;

EID: 70350150649 PISSN: 13896911 EISSN: None Source Type: Journal
DOI: 10.3233/ICG-2009-32104 Document Type: Article

Times cited: 38

References (15)

1
- 0004211236
- Athena Scientific
- Bertsekas, D. and Tsitsiklis, J. (1996). Neuro-Dynamic Programming. Athena Scientific.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

2
- 17444409624
- A tutorial on the cross-entropy method
- Boer, P. de, Kroese, D., Mannor, S., and Rubinstein, R. (2004). A tutorial on the cross-entropy method. Annals of Operations Research, Vol.134, No.1, pp. 19-67.
- (2004) Annals of Operations Research , vol.134 , Issue.1 , pp. 19-67
- De Boer, P.¹ Kroese, D.² Mannor, S.³ Rubinstein, R.⁴

3
- 60849092905
- Cross-entropy for monte-carlo tree search
- Chaslot, G., Winands, M., Szita, I., and Herik, H. J. van den (2008). Cross-Entropy for Monte-Carlo Tree Search. ICGA Journal, Vol.31, No.3, pp. 145-157.
- (2008) ICGA Journal , vol.31 , Issue.3 , pp. 145-157
- Chaslot, G.¹ Winands, M.² Szita, I.³ Van Den Herik, H.J.⁴

4
- 35248818685
- Tetris is hard, even to approximate
- Demaine, E. D., Hohenberger, S., and Liben-Nowell, D. (2003). Tetris is hard, even to approximate. Proc. 9th International Computing and Combinatorics Conference (COCOON 2003), pp. 351-363.
- (2003) Proc. 9th International Computing and Combinatorics Conference (COCOON 2003) , pp. 351-363
- Demaine, E.D.¹ Hohenberger, S.² Liben-Nowell, D.³

5
- 70350136349
- Fahey, C. P. (2003). Tetris AI, Computer plays Tetris. http://colinfahey.com/tetris/tetris.html.
- (2003) Tetris AI, Computer Plays Tetris
- Fahey, C.P.¹

6
- 33748427607
- Springer-Verlag, Heidelberg, Germany
- Farias, V. and Roy, B. van (2006). Tetris: A study of randomized constraint sampling. Springer-Verlag, Heidelberg, Germany.
- (2006) Tetris: A Study of Randomized Constraint Sampling
- Farias, V.¹ Van Roy, B.²

7
- 70350172883
- Technical Report RR-6358, INRIA
- Girgin, S. and Preux, P. (2007). Feature Discovery in Reinforcement Learning using Genetic Programming. Technical Report RR-6358, INRIA. http://hal.inria.fr/inria-00187997/fr/.
- (2007) Feature Discovery in Reinforcement Learning Using Genetic Programming
- Girgin, S.¹ Preux, P.²

8
- 0035377566
- Completely derandomized self-adaptation in evolution strategies
- Hansen, N. and Ostermeier, A. (2001). Completely Derandomized Self-Adaptation in Evolution Strategies. Evolutionary Computation, Vol.9, No.2, pp. 159-195.
- (2001) Evolutionary Computation , vol.9 , Issue.2 , pp. 159-195
- Hansen, N.¹ Ostermeier, A.²

9
- 84898930479
- A natural policy gradient
- Kakade, S. (2001). A natural policy gradient. Advances in Neural Information Processing Systems (NIPS 14), pp. 1531-1538.
- (2001) Advances in Neural Information Processing Systems (NIPS 14) , pp. 1531-1538
- Kakade, S.¹

11
- 84876627521
- Xtris readme
- Llima, R. E. (2005). Xtris readme. http://www.iagora.com/~espel/xtris/README.
- (2005)
- Llima, R.E.¹

12
- 33845323186
- On the numeric stability of gaussian processes regression for relational reinforcement learning
- Ramon, J. and Driessens, K. (2004). On the numeric stability of gaussian processes regression for relational reinforcement learning. ICML-2004 Workshop on Relational Reinforcement Learning, pp. 10-14.
- (2004) ICML-2004 Workshop on Relational Reinforcement Learning , pp. 10-14
- Ramon, J.¹ Driessens, K.²

13
- 33845344721
- Learning tetris using the noisy cross-entropy method
- Szita, I. and Lörincz, A. (2006). Learning Tetris Using the Noisy Cross-Entropy Method. Neural Computation, Vol.18, No.12, pp. 2936-2941.
- (2006) Neural Computation , vol.18 , Issue.12 , pp. 2936-2941
- Szita, I.¹ Lörincz, A.²

14
- 70350140182
- Building controllers for tetris
- Thiery, C. and Scherrer, B. (2009). Building Controllers for Tetris. ICGA Journal, Vol.32, No.1, pp. 3-11.
- (2009) ICGA Journal , vol.32 , Issue.1 , pp. 3-11
- Thiery, C.¹ Scherrer, B.²

15
- 0029752470
- Feature-based methods for large scale dynamic programming
- Tsitsiklis, J. N. and Roy, B. van (1996). Feature-Based Methods for Large Scale Dynamic Programming. Machine Learning, Vol.22, pp. 59-94.
- (1996) Machine Learning , vol.22 , pp. 59-94
- Tsitsiklis, J.N.¹ Van Roy, B.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.