SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Neural Computation

Volumn 18, Issue 12, 2006, Pages 2936-2941

Learning tetris using the noisy cross-entropy method

(2) Szita, István a Lorincz, András a

a EÖTVÖS LORÁND UNIVERSITY (Hungary)

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHM; ARTICLE; COMPUTER SIMULATION; ENTROPY; HUMAN; LEARNING; REINFORCEMENT; STATISTICAL MODEL;

ALGORITHMS; COMPUTER SIMULATION; ENTROPY; HUMANS; LEARNING; MODELS, STATISTICAL; REINFORCEMENT (PSYCHOLOGY);

EID: 33845344721 PISSN: 08997667 EISSN: 1530888X Source Type: Journal
DOI: 10.1162/neco.2006.18.12.2936 Document Type: Article

Times cited : (213)

References (11)

1
- 0003487482
- Nashua, NH: Athena Scientific
- Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-Dynamic Programming. Nashua, NH: Athena Scientific.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

2
- 33845339117
- Evolving a heuristic function for the game of Tetris
- T. Scheffer (Ed.). Berlin
- Bohm, N., Kokai, G., & Mandl, S. (2004). Evolving a heuristic function for the game of Tetris. In T. Scheffer (Ed.), Proc. Lernen, Wissensentdeckung und Adaptivität LWA-2004 (pp. 118-122). Berlin.
- (2004) Proc. Lernen, Wissensentdeckung und Adaptivität LWA-2004 , pp. 118-122
- Bohm, N.¹ Kokai, G.² Mandl, S.³

3
- 17444409624
- A tutorial on the cross-entropy method
- de Boer, P., Kroese, D., Mannor, S., & Rubinstein, R. (2004). A tutorial on the cross-entropy method. Annals of Operations Research, 134(1), 19-67.
- (2004) Annals of Operations Research , vol.134 , Issue.1 , pp. 19-67
- De Boer, P.¹ Kroese, D.² Mannor, S.³ Rubinstein, R.⁴

4
- 35248818685
- Tetris is hard, even to approximate
- Berlin: Springer
- Demaine, E. D., Hohenberger, S., & Liben-Nowell, D. (2003). Tetris is hard, even to approximate. In Proc. 9th International Computing and Combinatorics Conference (COCOON 2003) (pp. 351-363). Berlin: Springer.
- (2003) Proc. 9th International Computing and Combinatorics Conference (COCOON 2003) , pp. 351-363
- Demaine, E.D.¹ Hohenberger, S.² Liben-Nowell, D.³

5
- 84863891863
- Fahey, C. P. (2003). Tetris AI. Available online at http://www. colinfahey.com
- (2003) Tetris AI
- Fahey, C.P.¹

6
- 33748427607
- Tetris: A study of randomized constraint sampling
- G. Calafiore & F. Dabbene (Eds.). Berlin: Springer-Verlag
- Parias, V. F., & van Roy, B. (2006). Tetris: A study of randomized constraint sampling. In G. Calafiore & F. Dabbene (Eds.), Probabilistic and randomized methods for design under uncertainty. Berlin: Springer-Verlag.
- (2006) Probabilistic and Randomized Methods for Design under Uncertainty
- Parias, V.F.¹ Van Roy, B.²

7
- 33646243319
- A natural policy gradient
- T. G. Dietterich, S. Backer, & Z. Ghahramani (Eds.). Cambridge, MA: MIT Press
- Kakade, S. (2001). A natural policy gradient. In T. G. Dietterich, S. Backer, & Z. Ghahramani (Eds.), Advances in neural information 'processing systems, 14 (pp. 1531-1538). Cambridge, MA: MIT Press.
- (2001) Advances in Neural Information 'Processing Systems , vol.14 , pp. 1531-1538
- Kakade, S.¹

8
- 35048819671
- Least-squares methods in reinforcement learning for control
- Berlin: Springer-Verlag
- Lagoudakis, M. G., Parr, R., & Littman, M. L. (2002). Least-squares methods in reinforcement learning for control. In SEIN '02: Proceedings of the Second Hellenic Conference on AI (pp. 249-260). Berlin: Springer-Verlag.
- (2002) SEIN '02: Proceedings of the Second Hellenic Conference on AI , pp. 249-260
- Lagoudakis, M.G.¹ Parr, R.² Littman, M.L.³

9
- 1942516890
- The cross-entropy method for fast policy search
- Menlo Park, CA: AAAI Press
- Mannor, S., Rubinstein, R. Y., & Gat, Y. (2003). The cross-entropy method for fast policy search. In Proc. International Conf. on Machine Learning (ICML 2003), (pp. 512-519). Menlo Park, CA: AAAI Press.
- (2003) Proc. International Conf. on Machine Learning (ICML 2003) , pp. 512-519
- Mannor, S.¹ Rubinstein, R.Y.² Gat, Y.³

10
- 17444414191
- Basis function adaption in temporal difference reinforcement learning
- Menache, I., Marmor, S., & Shimkin, N. (2005). Basis function adaption in temporal difference reinforcement learning. Annals of Operations Research, 134(1), 215-238.
- (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 215-238
- Menache, I.¹ Marmor, S.² Shimkin, N.³

11
- 33845323186
- On the numeric stability of gaussian processes regression for relational reinforcement learning
- N.p.: Omni press
- Ramon, J., & Driessens, K. (2004). On the numeric stability of gaussian processes regression for relational reinforcement learning. In ICML-2004 Workshop on Relational Reinforcement Learning (pp. 10-14). N.p.: Omni press.
- (2004) ICML-2004 Workshop on Relational Reinforcement Learning , pp. 10-14
- Ramon, J.¹ Driessens, K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.