Volume , Issue , 2017, Pages

Loss is its own reward: Self-supervision for reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

End to end; Expected return; Optimisation; Optimizing policies; Pre-training; Reinforcement learning

EID: 85144242642    PISSN: None    EISSN: None    Source Type: Conference Proceeding
DOI: None    Document Type: Conference Paper
Times cited: 70

References (11)
  • 3
    • A. Dosovitskiy and V. Koltun. Learning to act by predicting the future. In ICLR, 2017.
  • 6
    • R. Jonschkowski and O. Brock. Learning state representations with robotic priors. Autonomous Robots, 39(3):407-428, 2015.
  • 7
    • P. Krähenbühl, C. Doersch, J. Donahue, and T. Darrell. Data-dependent initializations of convolutional neural networks. In ICLR, 2016.
  • 11
    • M. Watter, J. Springenberg, J. Boedecker, and M. Riedmiller. Embed to control: A locally linear latent dynamics model for control from raw images. In NIPS, pp. 2746-2754, 2015.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.