SCOPUS Information Search Platform

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volume , Issue , 2017, Pages

Learning to navigate in complex environments

(12) Mirowski, Piotr a Pascanu, Razvan a Viola, Fabio a Soyer, Hubert a Ballard, Andrew J a Banino, Andrea a Denil, Misha a Goroshin, Ross a Sifre, Laurent a Kavukcuoglu, Koray a Kumaran, Dharshan a Hadsell, Raia a

a DEEPMIND (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

AIR NAVIGATION; COMPLEX NETWORKS; MACHINE LEARNING; NAVIGATION;

CLASSIFICATION TASKS; COMPLEX ENVIRONMENTS; DYNAMIC ELEMENTS; HUMAN-LEVEL PERFORMANCE; LOOP CLOSURE; NETWORK ACTIVITIES; SENSORY INPUT; TASK PERFORMANCE;

REINFORCEMENT LEARNING;

EID: 85041891851 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (390)

References (32)

1
- 85071035389
- Deep reinforcement learning in a 3-d block-world environment
- Trevor Barron, Matthew Whitehead, and Alan Yeung. Deep reinforcement learning in a 3-d block-world environment. In Deep Reinforcement Learning: Frontiers and Challenges, IJCAI, 2016.
- (2016) Deep Reinforcement Learning: Frontiers and Challenges, IJCAI
- Barron, T.¹ Whitehead, M.² Yeung, A.³

2
- 85031088945
- arXiv
- Charles Beattie, Joel Z. Leibo, Denis Teplyashin, Tom Ward, Marcus Wainwright, Heinrich Küttler, Andrew Lefrancq, Simon Green, Victor Valdes, Amir Sadik, Julian Schrittwieser, Keith Anderson, Sarah York, Max Cant, Adam Cain, Adrian Bolton, Stephen Gaffney, Helen King, Demis Hassabis, Shane Legg, and Stig Petersen. Deepmind lab. In arXiv, 2016. URL https://arxiv.org/abs/1612.03801.
- (2016) Deepmind Lab
- Beattie, C.¹ Leibo, J.Z.² Teplyashin, D.³ Ward, T.⁴ Wainwright, M.⁵ Küttler, H.⁶ Lefrancq, A.⁷ Green, S.⁸ Valdes, V.⁹ Sadik, A.¹⁰ Schrittwieser, J.¹¹ Anderson, K.¹² York, S.¹³ Cant, M.¹⁴ Cain, A.¹⁵ Bolton, A.¹⁶ Gaffney, S.¹⁷ King, H.¹⁸ Hassabis, D.¹⁹ Legg, S.²⁰ Petersen, S.²¹ more..

3
- 0035357061
- A solution to the simultaneous localization and map building (slam) problem
- MWM Gamini Dissanayake, Paul Newman, Steve Clark, Hugh F. Durrant-Whyte, and Michael Csorba. A solution to the simultaneous localization and map building (slam) problem. IEEE Transactions on Robotics and Automation, 17(3):229-241, 2001.
- (2001) IEEE Transactions on Robotics and Automation , vol.17 , Issue.3 , pp. 229-241
- Gamini Dissanayake, M.W.M.¹ Newman, P.² Clark, S.³ Durrant-Whyte, H.F.⁴ Csorba, M.⁵

4
- 84937943470
- Depth map prediction from a single image using a multi-scale deep network
- David Eigen, Christian Puhrsch, and Rob Fergus. Depth map prediction from a single image using a multi-scale deep network. In Proc. of Neural Information Processing Systems, NIPS, 2014.
- (2014) Proc. Of Neural Information Processing Systems, NIPS
- Eigen, D.¹ Puhrsch, C.² Fergus, R.³

5
- 84890543083
- Speech recognition with deep recurrent neural networks
- Alex Graves, Mohamed Abdelrahman, and Geoffrey Hinton. Speech recognition with deep recurrent neural networks. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP, 2013.
- (2013) Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP
- Graves, A.¹ Abdelrahman, M.² Hinton, G.³

6
- 84993949467
- Hybrid computing using a neural network with dynamic external memory
- Alex Graves, Greg Wayne, Malcolm Reynolds, Tim Harley, Ivo Danihelka, Agnieszka Grabska-Barwińska, Sergio Gómez Colmenarejo, Edward Grefenstette, Tiago Ramalho, John Agapiou, et al. Hybrid computing using a neural network with dynamic external memory. Nature, 2016.
- (2016) Nature
- Graves, A.¹ Wayne, G.² Reynolds, M.³ Harley, T.⁴ Danihelka, I.⁵ Grabska-Barwinska, A.⁶ Colmenarejo, S.G.⁷ Grefenstette, E.⁸ Ramalho, T.⁹ Agapiou, J.¹⁰

7
- 85065513732
- Deep recurrent q-learning for partially observable mdps
- Matthew J. Hausknecht and Peter Stone. Deep recurrent q-learning for partially observable mdps. Proc. of Conf. on Artificial Intelligence, AAAI, 2015.
- (2015) Proc. Of Conf. Of Artificial Intelligence, AAAI
- Hausknecht, M.J.¹ Stone, P.²

8
- 85088229768
- Reinforcement learning with unsupervised auxiliary tasks
- Max Jaderberg, Volodymyr Mnih, Wojciech Czarnecki, Tom Schaul, Joel Z. Leibo, David Silver, and Koray Kavukcuoglu. Reinforcement learning with unsupervised auxiliary tasks. In Submitted to Int'l Conference on Learning Representations, ICLR, 2017.
- (2017) Submitted to Int'l Conference on Learning Representations, ICLR
- Jaderberg, M.¹ Mnih, V.² Czarnecki, W.³ Schaul, T.⁴ Leibo, J.Z.⁵ Silver, D.⁶ Kavukcuoglu, K.⁷

9
- 84883060087
- Evolving large-scale neural networks for vision-based reinforcement learning
- Jan Koutnik, Giuseppe Cuccu, Jürgen Schmidhuber, and Faustino Gomez. Evolving large-scale neural networks for vision-based reinforcement learning. In Proceedings of the 15th annual conference on Genetic and evolutionary computation, GECCO, 2013.
- (2013) Proceedings of the 15th Annual Conference on Genetic and Evolutionary Computation
- Koutnik, J.¹ Cuccu, G.² Schmidhuber, J.³ Gomez, F.⁴

10
- 85041964859
- CoRR, abs/1606.02396
- Tejas D. Kulkarni, Ardavan Saeedi, Simanta Gautam, and Samuel J. Gershman. Deep successor reinforcement learning. CoRR, abs/1606.02396, 2016. URL http://arxiv.org/abs/1606.02396.
- (2016) Deep Successor Reinforcement Learning
- Kulkarni, T.D.¹ Saeedi, A.² Gautam, S.³ Gershman, S.J.⁴

11
- 85039903894
- CoRR
- Guillaume Lample and Devendra Singh Chaplot. Playing FPS games with deep reinforcement learning. CoRR, 2016. URL http://arxiv.org/abs/1609.05521.
- (2016) Playing FPS Games with Deep Reinforcement Learning
- Lample, G.¹ Chaplot, D.S.²

12
- 85063887900
- Recurrent reinforcement learning: A hybrid approach
- Xiujun Li, Lihong Li, Jianfeng Gao, Xiaodong He, Jianshu Chen, Li Deng, and Ji He. Recurrent reinforcement learning: A hybrid approach. In Proceedings of the International Conference on Learning Representations, ICLR, 2016. URL https://arxiv.org/abs/1509.03044.
- (2016) Proceedings of the International Conference on Learning Representations, ICLR
- Li, X.¹ Li, L.² Gao, J.³ He, X.⁴ Chen, J.⁵ Deng, L.⁶ He, J.⁷

13
- 57249084011
- Visualizing data using t-sne
- Laurens van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of Machine Learning Research, 9(Nov):2579-2605, 2008.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 2579-2605
- Van Der Maaten, L.¹ Hinton, G.²

14
- 80053286337
- Dynamic auto-encoders for semantic indexing
- Piotr Mirowski, Marc'Aurelio Ranzato, and Yann LeCun. Dynamic auto-encoders for semantic indexing. In NIPS Deep Learning and Unsupervised Learning Workshop, 2010.
- (2010) NIPS Deep Learning and Unsupervised Learning Workshop
- Mirowski, P.¹ Ranzato, M.² LeCun, Y.³

15
- 84924051598
- Human-level control through deep reinforcement learning
- Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, et al. Human-level control through deep reinforcement learning. Nature, 518:529-533, 2015.
- (2015) Nature , vol.518 , pp. 529-533
- Mnih, V.¹ Kavukcuoglu, K.² Silver, D.³ Rusu, A.A.⁴ Veness, J.⁵

16
- 84999036937
- Asynchronous methods for deep reinforcement learning
- Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. Asynchronous methods for deep reinforcement learning. In Proc. of Int'l Conf. on Machine Learning, ICML, 2016.
- (2016) Proc. Of Int'l Conf. Of Machine Learning, ICML
- Mnih, V.¹ Adrià² Badia, L.³ Mirza, M.⁴ Graves, A.⁵ Lillicrap, T.P.⁶ Harley, T.⁷ Silver, D.⁸ Kavukcuoglu, K.⁹

17
- 84980007683
- Massively parallel methods for deep reinforcement learning
- Arun Nair, Praveen Srinivasan, Sam Blackwell, Cagdas Alcicek, Rory Fearon, et al. Massively parallel methods for deep reinforcement learning. In Proceedings of the International Conference on Machine Learning Deep Learning Workshop, ICML, 2015.
- (2015) Proceedings of the International Conference on Machine Learning Deep Learning Workshop, ICML
- Nair, A.¹ Srinivasan, P.² Blackwell, S.³ Alcicek, C.⁴ Fearon, R.⁵

18
- 84959861546
- Language understanding for text-based games using deep reinforcement learning
- Karthik Narasimhan, Tejas D. Kulkarni, and Regina Barzilay. Language understanding for text-based games using deep reinforcement learning. In Proc. of Empirical Methods in Natural Language Processing, EMNLP, 2015.
- (2015) Proc. Of Empirical Methods in Natural Language Processing, EMNLP
- Narasimhan, K.¹ Kulkarni, T.D.² Barzilay, R.³

19
- 84999048282
- Control of memory, active perception, and action in minecraft
- Junhyuk Oh, Valliappa Chockalingam, Satinder P. Singh, and Honglak Lee. Control of memory, active perception, and action in minecraft. In Proc. of International Conference on Machine Learning, ICML, 2016.
- (2016) Proc. Of International Conference on Machine Learning, ICML
- Oh, J.¹ Chockalingam, V.² Singh, S.P.³ Lee, H.⁴

20
- 0018633672
- Hippocampus, space, and memory
- David S Olton, James T Becker, and Gail E Handelmann. Hippocampus, space, and memory. Behavioral and Brain Sciences, 2(03):313-322, 1979.
- (1979) Behavioral and Brain Sciences , vol.2 , pp. 313-322
- Olton, D.S.¹ Becker, J.T.² Handelmann, G.E.³

21
- 84907009416
- arXiv preprint
- Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. How to construct deep recurrent neural networks. arXiv preprint arXiv:1312.6026, 2013.
- (2013) How to Construct Deep Recurrent Neural Networks
- Pascanu, R.¹ Gulcehre, C.² Cho, K.³ Bengio, Y.⁴

22
- 84965136229
- Semi-supervised learning with ladder networks
- Antti Rasmus, Mathias Berglund, Mikko Honkala, Harri Valpola, and Tapani Raiko. Semi-supervised learning with ladder networks. In Advances in Neural Information Processing Systems, NIPS, 2015.
- (2015) Advances in Neural Information Processing Systems, NIPS
- Rasmus, A.¹ Berglund, M.² Honkala, M.³ Valpola, H.⁴ Raiko, T.⁵

23
- 84998645243
- Rule-injection hints as a means of improving network performance and learning time
- Steven C Suddarth and YL Kergosien. Rule-injection hints as a means of improving network performance and learning time. In Neural Networks, pp. 120-129. Springer, 1990.
- (1990) Neural Networks , pp. 120-129
- Suddarth, S.C.¹ Kergosien, Y.L.²

24
- 0033170372
- Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning
- Richard S Sutton, Doina Precup, and Satinder Singh. Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning. Artificial intelligence, 112(1):181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

25
- 85041957477
- arXiv
- Lei Tai and Ming Liu. Towards cognitive exploration through deep reinforcement learning for mobile robots. In arXiv, 2016. URL https://arxiv.org/abs/1610.01733.
- (2016) Towards Cognitive Exploration through Deep Reinforcement Learning for Mobile Robots
- Tai, L.¹ Liu, M.²

26
- 85019201204
- CoRR, abs/1604.07255
- Chen Tessler, Shahar Givony, Tom Zahavy, Daniel J. Mankowitz, and Shie Mannor. A deep hierarchical approach to lifelong learning in minecraft. CoRR, abs/1604.07255, 2016. URL http://arxiv.org/abs/1604.07255.
- (2016) A Deep Hierarchical Approach to Lifelong Learning in Minecraft
- Tessler, C.¹ Givony, S.² Zahavy, T.³ Mankowitz, D.J.⁴ Mannor, S.⁵

27
- 84893343292
- Lecture 6.5 - RMSprop: Divide the gradient by a running average of its recent magnitude
- Tijmen Tieleman and Geoffrey Hinton. Lecture 6.5 - rmsprop: Divide the gradient by a running average of its recent magnitude. In Coursera: Neural Networks for Machine Learning, volume 4, 2012.
- (2012) Coursera: Neural Networks for Machine Learning , vol.4
- Tieleman, T.¹ Hinton, G.²

28
- 84989286061
- A. van den Oord, N. Kalchbrenner, and K. Kavukcuoglu. Pixel recurrent neural networks. 2016.
- (2016) Pixel Recurrent Neural Networks
- Van Den Oord, A.¹ Kalchbrenner, N.² Kavukcuoglu, K.³

29
- 84930635225
- arXiv preprint
- Jason Weston, Sumit Chopra, and Antoine Bordes. Memory networks. arXiv preprint arXiv:1410.3916, 2014.
- (2014) Memory Networks
- Weston, J.¹ Chopra, S.² Bordes, A.³

30
- 84998996019
- Augmenting supervised neural networks with unsupervised objectives for large-scale image classification
- Yuting Zhang, Kibok Lee, and Honglak Lee. Augmenting supervised neural networks with unsupervised objectives for large-scale image classification. In Proc. of International Conference on Machine Learning, ICML, 2016.
- (2016) Proc. Of International Conference on Machine Learning, ICML
- Zhang, Y.¹ Lee, K.² Lee, H.³

31
- 84965095288
- Stacked what-where auto-encoders
- Junbo Zhao, Michaël Mathieu, Ross Goroshin, and Yann LeCun. Stacked what-where auto-encoders. Int'l Conf. on Learning Representations (Workshop), ICLR, 2015. URL http://arxiv.org/abs/1506.02351.
- (2015) Int'l Conf. On Learning Representations (Workshop), ICLR
- Zhao, J.¹ Mathieu, M.² Goroshin, R.³ LeCun, Y.⁴

32
- 85027704536
- CoRR, abs/1609.05143
- Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, Joseph J. Lim, Abhinav Gupta, Li Fei-Fei, and Ali Farhadi. Target-driven visual navigation in indoor scenes using deep reinforcement learning. CoRR, abs/1609.05143, 2016. URL http://arxiv.org/abs/1609.05143.
- (2016) Target-Driven Visual Navigation in Indoor Scenes Using Deep Reinforcement Learning
- Zhu, Y.¹ Mottaghi, R.² Kolve, E.³ Lim, J.J.⁴ Gupta, A.⁵ Fei-Fei, L.⁶ Farhadi, A.⁷

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.