메뉴 건너뛰기




Volumn 18, Issue 12, 2006, Pages 2936-2941

Learning tetris using the noisy cross-entropy method

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHM; ARTICLE; COMPUTER SIMULATION; ENTROPY; HUMAN; LEARNING; REINFORCEMENT; STATISTICAL MODEL;

EID: 33845344721     PISSN: 08997667     EISSN: 1530888X     Source Type: Journal    
DOI: 10.1162/neco.2006.18.12.2936     Document Type: Article
Times cited : (205)

References (11)
  • 5
    • 84863891863 scopus 로고    scopus 로고
    • Fahey, C. P. (2003). Tetris AI. Available online at http://www. colinfahey.com
    • (2003) Tetris AI
    • Fahey, C.P.1
  • 7
    • 33646243319 scopus 로고    scopus 로고
    • A natural policy gradient
    • T. G. Dietterich, S. Backer, & Z. Ghahramani (Eds.). Cambridge, MA: MIT Press
    • Kakade, S. (2001). A natural policy gradient. In T. G. Dietterich, S. Backer, & Z. Ghahramani (Eds.), Advances in neural information 'processing systems, 14 (pp. 1531-1538). Cambridge, MA: MIT Press.
    • (2001) Advances in Neural Information 'Processing Systems , vol.14 , pp. 1531-1538
    • Kakade, S.1
  • 10
    • 17444414191 scopus 로고    scopus 로고
    • Basis function adaption in temporal difference reinforcement learning
    • Menache, I., Marmor, S., & Shimkin, N. (2005). Basis function adaption in temporal difference reinforcement learning. Annals of Operations Research, 134(1), 215-238.
    • (2005) Annals of Operations Research , vol.134 , Issue.1 , pp. 215-238
    • Menache, I.1    Marmor, S.2    Shimkin, N.3
  • 11
    • 33845323186 scopus 로고    scopus 로고
    • On the numeric stability of gaussian processes regression for relational reinforcement learning
    • N.p.: Omni press
    • Ramon, J., & Driessens, K. (2004). On the numeric stability of gaussian processes regression for relational reinforcement learning. In ICML-2004 Workshop on Relational Reinforcement Learning (pp. 10-14). N.p.: Omni press.
    • (2004) ICML-2004 Workshop on Relational Reinforcement Learning , pp. 10-14
    • Ramon, J.1    Driessens, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.