SCOPUS 정보 검색 플랫폼

Advances in Neural Information Processing Systems

Volumn 4, Issue January, 2014, Pages 3338-3346

Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning

(5) Guo, Xiaoxiao a Singh, Satinder a Lee, Honglak a Lewis, Richard a Wang, Xiaoshi a

a UNIVERSITY OF MICHIGAN (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; BENCHMARKING; COMPUTER AIDED INSTRUCTION; INFORMATION SCIENCE;

COMBINING MODEL; DEEP LEARNING; HUMAN PLAYERS; LEARNING ENVIRONMENTS; MONTE-CARLO TREE SEARCHES; ORDERS OF MAGNITUDE; POLICY SELECTION; TRAINING DATA;

REINFORCEMENT LEARNING;

EID: 84937779024 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (323)

References (23)

1
- 84879976780
- The arcade learning environment: An evaluation platform for general agents
- M. G. Bellemare, Y. Naddaf, J. Veness, and M. Bowling. The arcade learning environment: an evaluation platform for general agents. Journal of Artificial Intelligence Research, 47(1): 253-279, 2013.
- (2013) Journal of Artificial Intelligence Research , vol.47 , Issue.1 , pp. 253-279
- Bellemare, M.G.¹ Naddaf, Y.² Veness, J.³ Bowling, M.⁴

2
- 84868289914
- Investigating contingency awareness using atari 2600 games
- M. G. Bellemare, J. Veness, and M. Bowling. Investigating contingency awareness using Atari 2600 games. In the 26th AAAI Conference on Artificial Intelligence, 2012.
- (2012) The 26th AAAI Conference on Artificial Intelligence
- Bellemare, M.G.¹ Veness, J.² Bowling, M.³

3
- 84877748834
- Sketch-based linear value function approximation
- M. G. Bellemare, J. Veness, and M. Bowling. Sketch-based linear value function approximation. In Advances in Neural Information Processing Systems, pages 2222-2230, 2012.
- (2012) Advances in Neural Information Processing Systems , pp. 2222-2230
- Bellemare, M.G.¹ Veness, J.² Bowling, M.³

4
- 84919784622
- Skip context tree switching
- M. G. Bellemare, J. Veness, and E. Talvitie. Skip context tree switching. In Proceedings of the International Conference on Machine Learning, 2014.
- (2014) Proceedings of the International Conference on Machine Learning
- Bellemare, M.G.¹ Veness, J.² Talvitie, E.³

5
- 69349090197
- Learning deep architectures for AI
- Y. Bengio. Learning deep architectures for AI. Foundations and trends in Machine Learning, 2(1): 1-127, 2009.
- (2009) Foundations and Trends in Machine Learning , vol.2 , Issue.1 , pp. 1-127
- Bengio, Y.¹

6
- 84866714584
- Multi-column deep neural networks for image classification
- D. Ciresan, U. Meier, and J. Schmidhuber. Multi-column deep neural networks for image classification. In IEEE Conference on Computer Vision and Pattern Recognition, 2012.
- (2012) IEEE Conference on Computer Vision and Pattern Recognition
- Ciresan, D.¹ Meier, U.² Schmidhuber, J.³

7
- 77949524387
- Visualizing higher-layer features of a deep network
- D. Erhan, Y. Bengio, A. Courville, and P. Vincent. Visualizing higher-layer features of a deep network. Technical report, University of Montreal, 2009.
- (2009) Technical Report, University of Montreal
- Erhan, D.¹ Bengio, Y.² Courville, A.³ Vincent, P.⁴

8
- 84890543083
- Speech recognition with deep recurrent neural networks
- A. Graves, A. Mohamed, and G. Hinton. Speech recognition with deep recurrent neural networks. In IEEE International Conference on Acoustics, Speech and Signal Processing, pages 6645-6649, 2013.
- (2013) IEEE International Conference on Acoustics, Speech and Signal Processing , pp. 6645-6649
- Graves, A.¹ Mohamed, A.² Hinton, G.³

9
- 84864659737
- HyperNEAT-GGP: A hyperNEAT-based atari general game player
- M. Hausknecht, P. Khandelwal, R. Miikkulainen, and P. Stone. HyperNEAT-GGP: A hyperNEAT-based Atari general game player. In Proceedings of the Fourteenth International Conference on Genetic and Evolutionary Computation Conference, pages 217-224, 2012.
- (2012) Proceedings of the Fourteenth International Conference on Genetic and Evolutionary Computation Conference , pp. 217-224
- Hausknecht, M.¹ Khandelwal, P.² Miikkulainen, R.³ Stone, P.⁴

10
- 84911364368
- Large-scale video classification with convolutional neural networks
- A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei. Large-scale video classification with convolutional neural networks. In IEEE Conference on Computer Vision and Pattern Recognition, 2014.
- (2014) IEEE Conference on Computer Vision and Pattern Recognition
- Karpathy, A.¹ Toderici, G.² Shetty, S.³ Leung, T.⁴ Sukthankar, R.⁵ Fei-Fei, L.⁶

11
- 0036832951
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- M. Kearns, Y. Mansour, and A. Y. Ng. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning, 49(2-3): 193-208, 2002.
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 193-208
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

12
- 33750293964
- Bandit based Monte-Carlo planning
- L. Kocsis and C. Szepesvári. Bandit based Monte-Carlo planning. In European Conference on Machine Learning, pages 282-293. 2006.
- (2006) European Conference on Machine Learning , pp. 282-293
- Kocsis, L.¹ Szepesvári, C.²

13
- 84876231242
- ImageNet classification with deep convolutional neural networks
- A. Krizhevsky, I. Sutskever, and G. E. Hinton. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, 2012.
- (2012) Advances in Neural Information Processing Systems
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

14
- 84867135575
- Building high-level features using large scale unsupervised learning
- Q. V. Le, M. Ranzato, R. Monga, M. Devin, K. Chen, G. S. Corrado, J. Dean, and A. Y. Ng. Building high-level features using large scale unsupervised learning. In Proceedings of the 29th International Conference on Machine Learning, 2012.
- (2012) Proceedings of the 29th International Conference on Machine Learning
- Le, Q.V.¹ Ranzato, M.² Monga, R.³ Devin, M.⁴ Chen, K.⁵ Corrado, G.S.⁶ Dean, J.⁷ Ng, A.Y.⁸

15
- 80052874098
- Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis
- Q. V. Le, W. Y. Zou, S. Y. Yeung, and A. Y. Ng. Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In IEEE Conference on Computer Vision and Pattern Recognition, pages 3361-3368, 2011.
- (2011) IEEE Conference on Computer Vision and Pattern Recognition , pp. 3361-3368
- Le, Q.V.¹ Zou, W.Y.² Yeung, S.Y.³ Ng, A.Y.⁴

16
- 0032203257
- Gradient-based learning applied to document recognition
- Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11): 2278-2324, 1998.
- (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- LeCun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

17
- 71149119164
- Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations
- H. Lee, R. Grosse, R. Ranganath, and A. Y. Ng. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In Proceedings of the 26th International Conference on Machine Learning, pages 609-616, 2009.
- (2009) Proceedings of the 26th International Conference on Machine Learning , pp. 609-616
- Lee, H.¹ Grosse, R.² Ranganath, R.³ Ng, A.Y.⁴

18
- 84863380535
- Unsupervised feature learning for audio classification using convolutional deep belief networks
- H. Lee, P. Pham, Y. Largman, and A. Y. Ng. Unsupervised feature learning for audio classification using convolutional deep belief networks. In Advances in Neural Information Processing Systems, pages 1096-1104, 2009.
- (2009) Advances in Neural Information Processing Systems , pp. 1096-1104
- Lee, H.¹ Pham, P.² Largman, Y.³ Ng, A.Y.⁴

19
- 84904867557
- Playing atari with deep reinforcement learning
- V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, and M. Riedmiller. Playing Atari with deep reinforcement learning. In Deep Learning, Neural Information Processing Systems Workshop, 2013.
- (2013) Deep Learning, Neural Information Processing Systems Workshop
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Graves, A.⁴ Antonoglou, I.⁵ Wierstra, D.⁶ Riedmiller, M.⁷

20
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G. E. Dahl, and G. Hinton. Acoustic modeling using deep belief networks. IEEE Transactions on Audio, Speech, and Language Processing, 20(1): 14-22, 2012.
- (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.E.² Hinton, G.³

21
- 84899437369
- A reduction of imitation learning and structured prediction to no-regret online learning
- S. Ross, G. J. Gordon, and J. A. Bagnell. A reduction of imitation learning and structured prediction to no-regret online learning. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics, 2011.
- (2011) Proceedings of the 14th International Conference on Artificial Intelligence and Statistics
- Ross, S.¹ Gordon, G.J.² Bagnell, J.A.³

22
- 84920366580
- arXiv preprint arXiv: 1404.7828
- J. Schmidhuber. Deep learning in neural networks: An overview. arXiv preprint arXiv: 1404.7828, 2014.
- (2014) Deep Learning in Neural Networks: An Overview
- Schmidhuber, J.¹

23
- 0029276036
- Temporal difference learning and TD-gammon
- G. Tesauro. Temporal difference learning and TD-gammon. Communications of the ACM, 38(3): 58-68, 1995.
- (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.