메뉴 건너뛰기




Volumn 4, Issue January, 2014, Pages 3338-3346

Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; BENCHMARKING; COMPUTER AIDED INSTRUCTION; INFORMATION SCIENCE;

EID: 84937779024     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (323)

References (23)
  • 11
    • 0036832951 scopus 로고    scopus 로고
    • A sparse sampling algorithm for near-optimal planning in large Markov decision processes
    • M. Kearns, Y. Mansour, and A. Y. Ng. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning, 49(2-3): 193-208, 2002.
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 193-208
    • Kearns, M.1    Mansour, Y.2    Ng, A.Y.3
  • 15
    • 80052874098 scopus 로고    scopus 로고
    • Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis
    • Q. V. Le, W. Y. Zou, S. Y. Yeung, and A. Y. Ng. Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In IEEE Conference on Computer Vision and Pattern Recognition, pages 3361-3368, 2011.
    • (2011) IEEE Conference on Computer Vision and Pattern Recognition , pp. 3361-3368
    • Le, Q.V.1    Zou, W.Y.2    Yeung, S.Y.3    Ng, A.Y.4
  • 16
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11): 2278-2324, 1998.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • LeCun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 18
    • 84863380535 scopus 로고    scopus 로고
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • H. Lee, P. Pham, Y. Largman, and A. Y. Ng. Unsupervised feature learning for audio classification using convolutional deep belief networks. In Advances in Neural Information Processing Systems, pages 1096-1104, 2009.
    • (2009) Advances in Neural Information Processing Systems , pp. 1096-1104
    • Lee, H.1    Pham, P.2    Largman, Y.3    Ng, A.Y.4
  • 23
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-gammon
    • G. Tesauro. Temporal difference learning and TD-gammon. Communications of the ACM, 38(3): 58-68, 1995.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.