SCOPUS 정보 검색 플랫폼

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volumn , Issue , 2017, Pages

Recurrent environment simulators

(4) Chiappa, Silvia a Racaniere, Sébastien a Wierstra, Daan a Mohamed, Shakir a

a DEEPMIND (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

ENVIRONMENT SIMULATORS; HIGH-DIMENSIONAL; HIGH-DIMENSIONAL IMAGES; IN-DEPTH ANALYSIS; TIME STEP;

RECURRENT NEURAL NETWORKS;

EID: 85088226800 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (114)

References (27)

1
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, and P. Fischer. Finite-time analysis of the multiarmed bandit problem. Machine Learning, 47:235-256, 2002.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

2
- 85031088945
- CoRR, abs/1612.03801
- C. Beattie, J. Z. Leibo, D. Teplyashin, T. Ward, M. Wainwright, H. Küttler, A. Lefrancq, S. Green, V. Valdés, A. Sadik, J. Schrittwieser, K. Anderson, S. York, M. Cant, A. Cain, A. Bolton, S. Gaffney, H. King, D. Hassabis, S. Legg, and S. Petersen. Deepmind lab. CoRR, abs/1612.03801, 2016. URL http://arxiv.org/abs/1612.03801.
- (2016) Deepmind Lab
- Beattie, C.¹ Leibo, J.Z.² Teplyashin, D.³ Ward, T.⁴ Wainwright, M.⁵ Küttler, H.⁶ Lefrancq, A.⁷ Green, S.⁸ Valdés, V.⁹ Sadik, A.¹⁰ Schrittwieser, J.¹¹ Anderson, K.¹² York, S.¹³ Cant, M.¹⁴ Cain, A.¹⁵ Bolton, A.¹⁶ Gaffney, S.¹⁷ King, H.¹⁸ Hassabis, D.¹⁹ Legg, S.²⁰ Petersen, S.²¹ more..

3
- 84879976780
- The arcade learning environment: An evaluation platform for general agents
- M. G. Bellemare, Y. Naddaf, J. Veness, and M. Bowling. The Arcade Learning Environment: An evaluation platform for general agents. Journal of Artificial Intelligence Research, 47:253-279, 2013.
- (2013) Journal of Artificial Intelligence Research , vol.47 , pp. 253-279
- Bellemare, M.G.¹ Naddaf, Y.² Veness, J.³ Bowling, M.⁴

4
- 84965179228
- Scheduled sampling for sequence prediction with recurrent neural networks
- S. Bengio, O. Vinyals, N. Jaitly, and N. Shazeer. Scheduled sampling for sequence prediction with recurrent neural networks. In Advances in Neural Information Processing Systems 28 (NIPS), pp. 1171-1179. 2015.
- (2015) Advances in Neural Information Processing Systems 28 (NIPS) , pp. 1171-1179
- Bengio, S.¹ Vinyals, O.² Jaitly, N.³ Shazeer, N.⁴

5
- 84906979661
- A. Graves. Generating sequences with recurrent neural networks. 2013. URL http://arxiv.org/abs/1308.0850.
- (2013) Generating Sequences with Recurrent Neural Networks
- Graves, A.¹

6
- 0031573117
- Long short-term memory
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural Computation, 9(8):1735-1780, 1997.
- (1997) Neural Computation , vol.9 , Issue.8 , pp. 1735-1780
- Hochreiter, S.¹ Schmidhuber, J.²

7
- 85162020429
- Hippocampal contributions to control: The third way
- M. Lengyel and P. Dayan. Hippocampal contributions to control: The third way. In Advances in Neural Information Processing Systems 20 (NIPS), pp. 889-896, 2008.
- (2008) Advances in Neural Information Processing Systems 20 (NIPS) , pp. 889-896
- Lengyel, M.¹ Dayan, P.²

8
- 84898982129
- Predictive representations of state
- M. L. Littman, R. S. Sutton, and S. Singh. Predictive representations of state. In Advances in Neural Information Processing Systems 14 (NIPS), pp. 1555-1561. 2002.
- (2002) Advances in Neural Information Processing Systems 14 (NIPS) , pp. 1555-1561
- Littman, M.L.¹ Sutton, R.S.² Singh, S.³

9
- 0001596874
- Intuitive physics
- M. McCloskey. Intuitive physics. Scientific American, 248(4):122-130, 1983.
- (1983) Scientific American , vol.248 , Issue.4 , pp. 122-130
- McCloskey, M.¹

10
- 84924051598
- Human-level control through deep reinforcement learning
- 02
- V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, S. Petersen, C. Beattie, A. Sadik, I. Antonoglou, H. King, D. Kumaran, D. Wierstra, S. Legg, and D. Hassabis. Human-level control through deep reinforcement learning. Nature, 518(7540): 529-533, 02 2015. URL http://dx.doi.org/10.1038/nature14236.
- (2015) Nature , vol.518 , Issue.7540 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵ Bellemare, M.G.⁶ Graves, A.⁷ Riedmiller, M.⁸ Fidjeland, A.K.⁹ Ostrovski, G.¹⁰ Petersen, S.¹¹ Beattie, C.¹² Sadik, A.¹³ Antonoglou, I.¹⁴ King, H.¹⁵ Kumaran, D.¹⁶ Wierstra, D.¹⁷ Legg, S.¹⁸ Hassabis, D.¹⁹

11
- 84999036937
- Asynchronous methods for deep reinforcement learning
- V. Mnih, A. Puigdomènech Badia, M. Mirza, A. Graves, T. P Lillicrap, T. Harley, D. Silver, and K. Kavukcuoglu. Asynchronous methods for deep reinforcement learning. In Proceedings of the 33rd International Conference on Machine Learning (ICML), 2016.
- (2016) Proceedings of the 33rd International Conference on Machine Learning (ICML)
- Mnih, V.¹ Puigdomènech Badia, A.² Mirza, M.³ Graves, A.⁴ Lillicrap, T.P.⁵ Harley, T.⁶ Silver, D.⁷ Kavukcuoglu, K.⁸

12
- 67349283062
- Reinforcement learning in the brain
- Y. Niv. Reinforcement learning in the brain. Journal of Mathematical Psychology, 53(3):139-154, 2009.
- (2009) Journal of Mathematical Psychology , vol.53 , Issue.3 , pp. 139-154
- Niv, Y.¹

13
- 84965178314
- Action-conditional video prediction using deep networks in Atari games
- J. Oh, X. Guo, H. Lee, R. L. Lewis, and S. P. Singh. Action-conditional video prediction using deep networks in Atari games. In Advances in Neural Information Processing Systems 28 (NIPS), pp. 2863-2871. 2015. URL http://arxiv.org/abs/1507.08750.
- (2015) Advances in Neural Information Processing Systems 28 (NIPS) , pp. 2863-2871
- Oh, J.¹ Guo, X.² Lee, H.³ Lewis, R.L.⁴ Singh, S.P.⁵

14
- 0035495009
- A sensorimotor account of vision and visual consciousness
- 05
- J. K. O'Regan and A. Noë. A sensorimotor account of vision and visual consciousness. Behavioral and brain sciences, 24(05):939-973, 2001.
- (2001) Behavioral and Brain Sciences , vol.24 , pp. 939-973
- O'Regan, J.K.¹ Noë, A.²

15
- 34047267520
- Intrinsic motivation systems for autonomous mental development
- P.-Y. Oudeyer, F. Kaplan, and V. V. Hafner. Intrinsic motivation systems for autonomous mental development. Evolutionary Computation, IEEE Transactions on, 11(2):265-286, 2007.
- (2007) Evolutionary Computation, IEEE Transactions on , vol.11 , Issue.2 , pp. 265-286
- Oudeyer, P.-Y.¹ Kaplan, F.² Hafner, V.V.³

16
- 85005954105
- CoRR, abs/1511.06309
- V. Patraucean, A. Handa, and R. Cipolla. Spatio-temporal video autoencoder with differentiable memory. CoRR, abs/1511.06309, 2015. URL http://arxiv.org/abs/1511.06309.
- (2015) Spatio-Temporal Video Autoencoder with Differentiable Memory
- Patraucean, V.¹ Handa, A.² Cipolla, R.³

17
- 70349349170
- Cambridge University Press
- J. Pearl. Causality. Cambridge University Press, 2009.
- (2009) Causality
- Pearl, J.¹

18
- 84969544782
- Unsupervised learning of video representations using LSTMs
- N. Srivastava, E. Mansimov, and R. Salakhutdinov. Unsupervised learning of video representations using LSTMs. In Proceedings of the 32nd International Conference on Machine Learning (ICML), pp. 843-852, 2015.
- (2015) Proceedings of the 32nd International Conference on Machine Learning (ICML) , pp. 843-852
- Srivastava, N.¹ Mansimov, E.² Salakhutdinov, R.³

19
- 85006142438
- CoRR, abs/1512.08836
- W. Sun, A. Venkatraman, B. Boots, and J. A. Bagnell. Learning to filter with predictive state inference machines. CoRR, abs/1512.08836, 2015. URL http://arxiv.org/abs/1512.08836.
- (2015) Learning to Filter with Predictive State Inference Machines
- Sun, W.¹ Venkatraman, A.² Boots, B.³ Bagnell, J.A.⁴

20
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto. Reinforcement learning: An introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

21
- 84923276855
- Model regularization for stable sample rollouts
- E. Talvitie. Model regularization for stable sample rollouts. In Proceedings of the Thirtieth Conference Annual Conference on Uncertainty in Artificial Intelligence (UAI-14), pp. 780-789, 2014.
- (2014) Proceedings of the Thirtieth Conference Annual Conference on Uncertainty in Artificial Intelligence (UAI-14) , pp. 780-789
- Talvitie, E.¹

22
- 84965104233
- CoRR, abs/1502.02251
- N. Wahlström, T. B. Schön, and M. P. Deisenroth. From pixels to torques: Policy learning with deep dynamical models. CoRR, abs/1502.02251, 2015. URL http://arxiv.org/abs/1502.02251.
- (2015) From Pixels to Torques: Policy Learning with Deep Dynamical Models
- Wahlström, N.¹ Schön, T.B.² Deisenroth, M.P.³

23
- 84965129327
- Embed to control: A locally linear latent dynamics model for control from raw images
- M. Watter, J. Springenberg, J. Boedecker, and M. Riedmiller. Embed to control: A locally linear latent dynamics model for control from raw images. In Advances in Neural Information Processing Systems 28 (NIPS), pp. 2728-2736, 2015.
- (2015) Advances in Neural Information Processing Systems 28 (NIPS) , pp. 2728-2736
- Watter, M.¹ Springenberg, J.² Boedecker, J.³ Riedmiller, M.⁴

24
- 0001765578
- Gradient-based learning algorithms for recurrent networks and their computational complexity
- R. J. Williams and D. Zipser. Gradient-based learning algorithms for recurrent networks and their computational complexity. Bibliometrics, pp. 433-486, 1995.
- (1995) Bibliometrics , pp. 433-486
- Williams, R.J.¹ Zipser, D.²

25
- 76249122848
- v1.3.5
- B. Wymann, E. Espié, C. Guionneau, C. Dimitrakakis, R. Coulom, and A. Sumner. Torcs: The open racing car simulator, v1.3.5. 2013. URL http://www.torcs.org.
- (2013) Torcs: The Open Racing Car Simulator
- Wymann, B.¹ Espié, E.² Guionneau, C.³ Dimitrakakis, C.⁴ Coulom, R.⁵ Sumner, A.⁶

26
- 84960920723
- B. Xu, N. Wang, T. Chen, and M. Li. Empirical evaluation of rectified activations in convolutional network. 2015.
- (2015) Empirical Evaluation of Rectified Activations in Convolutional Network
- Xu, B.¹ Wang, N.² Chen, T.³ Li, M.⁴

27
- 84944053926
- CoRR, abs/1409.2329
- W. Zaremba, I. Sutskever, and O. Vinyals. Recurrent neural network regularization. CoRR, abs/1409.2329, 2014. URL http://arxiv.org/abs/1409.2329.
- (2014) Recurrent Neural Network Regularization
- Zaremba, W.¹ Sutskever, I.² Vinyals, O.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.