SCOPUS Information Retrieval Platform

5th International Conference on Learning Representations, ICLR 2017 - Conference Track Proceedings

Volume , Issue , 2017, Pages

Learning invariant feature spaces to transfer skills with reinforcement learning

(5) Gupta, Abhishek (a); Devin, Coline (a); Liu, YuXuan (a); Abbeel, Pieter (a, b); Levine, Sergey (a)

a UNIVERSITY OF CALIFORNIA (United States)

b OpenAI LLC (United States)

Author keywords

[No Author keywords available]

Indexed keywords

MACHINE LEARNING; REINFORCEMENT LEARNING; ROBOTICS;

ACTUATION MECHANISM; IMPLICIT LEARNING; INVARIANT FEATURES; PARTIAL CORRESPONDENCES; PROBLEM FORMULATION; PROCESS OF LEARNING; ROBOTIC MANIPULATION; SHARING INFORMATION;

LEARNING ALGORITHMS;

EID: 85048435317
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (305)

References (35)

1
- 85060433568
- Reinforcement learning transfer via common subspaces
- Haitham Bou Ammar and Matthew E. Taylor. Reinforcement learning transfer via common subspaces. In Adaptive and Learning Agents: International Workshop, 2012.
- (2012) Adaptive and Learning Agents: International Workshop
- Ammar, H.B.¹ Taylor, M.E.²

2
- 84960193691
- Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment
- Haitham Bou Ammar, Eric Eaton, Paul Ruvolo, and Matthew Taylor. Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment. In AAAI Conference on Artificial Intelligence, 2015a.
- (2015) AAAI Conference on Artificial Intelligence
- Ammar, H.B.¹ Eaton, E.² Ruvolo, P.³ Taylor, M.⁴

3
- 85007212130
- Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment
- Haitham Bou Ammar, Eric Eaton, Paul Ruvolo, and Matthew E Taylor. Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment. In Proc. of AAAI, 2015b.
- (2015) Proc. Of AAAI
- Ammar, H.B.¹ Eaton, E.² Ruvolo, P.³ Taylor, M.E.⁴

4
- 9444270330
- Exploiting task relatedness for multiple task learning
- Shai Ben-David and Reba Schuller. Exploiting task relatedness for multiple task learning. In Learning Theory and Kernel Machines, pp. 567-580. Springer, 2003.
- (2003) Learning Theory and Kernel Machines , pp. 567-580
- Ben-David, S.¹ Schuller, R.²

5
- 85027999148
- Reuse of neural modules for general video game playing
- Alexander Braylan, Mark Hollenbeck, Elliot Meyerson, and Risto Miikkulainen. Reuse of neural modules for general video game playing. CoRR, abs/1512.01537, 2015.
- (2015) CoRR
- Braylan, A.¹ Hollenbeck, M.² Meyerson, E.³ Miikkulainen, R.⁴

6
- 0031189914
- Multitask learning
- Rich Caruana. Multitask learning. Machine Learning, 1997.
- (1997) Machine Learning
- Caruana, R.¹

7
- 24644436425
- Learning a similarity metric discriminatively, with application to face verification
- Sumit Chopra, Raia Hadsell, and Yann LeCun. Learning a similarity metric discriminatively, with application to face verification. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, volume 1, pp. 539-546. IEEE, 2005.
- (2005) Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on , vol.1 , pp. 539-546
- Chopra, S.¹ Hadsell, R.² LeCun, Y.³

8
- 85014842865
- Learning transferable policies for monocular reactive MAV control
- Shreyansh Daftry, J. Andrew Bagnell, and Martial Hebert. Learning transferable policies for monocular reactive MAV control. In International Symposium on Experimental Robotics (ISER), 2016.
- (2016) International Symposium on Experimental Robotics (ISER)
- Daftry, S.¹ Bagnell, J.A.² Hebert, M.³

9
- 85041944275
- Learning modular neural network policies for multi-task and multi-robot transfer
- Coline Devin, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, and Sergey Levine. Learning modular neural network policies for multi-task and multi-robot transfer. arXiv preprint arXiv:1609.07088, 2016.
- (2016) Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer
- Devin, C.¹ Gupta, A.² Darrell, T.³ Abbeel, P.⁴ Levine, S.⁵

10
- 13244267151
- Mirror neurons responding to observation of actions made with tools in monkey ventral premotor cortex
- P. F. Ferrari, S. Rozzi, and L. Fogassi. Mirror neurons responding to observation of actions made with tools in monkey ventral premotor cortex. Journal of Cognitive Neuroscience, 17(2), 2005.
- (2005) Journal of Cognitive Neuroscience , vol.17 , Issue.2
- Ferrari, P.F.¹ Rozzi, S.² Fogassi, L.³

11
- 84979887690
- Domain-adversarial training of neural networks
- Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, Francois Laviolette, Mario Marchand, and Victor Lempitsky. Domain-adversarial training of neural networks. Journal of Machine Learning Research, 17, 2016.
- (2016) Journal of Machine Learning Research , vol.17
- Ganin, Y.¹ Ustinova, E.² Ajakan, H.³ Germain, P.⁴ Larochelle, H.⁵ Laviolette, F.⁶ Marchand, M.⁷ Lempitsky, V.⁸

12
- 85162042191
- Random projections for manifold learning
- Chinmay Hegde, Michael Wakin, and Richard Baraniuk. Random projections for manifold learning. In Advances in neural information processing systems, pp. 641-648, 2008.
- (2008) Advances in Neural Information Processing Systems , pp. 641-648
- Hegde, C.¹ Wakin, M.² Baraniuk, R.³

13
- 0000107975
- Relations between two sets of variates
- Harold Hotelling. Relations between two sets of variates. Biometrika, 28, 1936.
- (1936) Biometrika , vol.28
- Hotelling, H.¹

14
- 85083951076
- Adam: A method for stochastic optimization
- D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. In International Conference on Learning Representations, 2015.
- (2015) International Conference on Learning Representations
- Kingma, D.P.¹ Ba, J.²

15
- 37349036852
- A framework for transfer in reinforcement learning
- George Konidaris. A framework for transfer in reinforcement learning. In ICML-06 Workshop on Structural Knowledge Transfer for Machine Learning, 2006.
- (2006) ICML-06 Workshop on Structural Knowledge Transfer for Machine Learning
- Konidaris, G.¹

16
- 33749243349
- Autonomous shaping: Knowledge transfer in reinforcement learning
- George Konidaris and Andrew Barto. Autonomous shaping: knowledge transfer in reinforcement learning. In International Conference on Machine Learning (ICML), pp. 489-496, 2006.
- (2006) International Conference on Machine Learning (ICML) , pp. 489-496
- Konidaris, G.¹ Barto, A.²

17
- 84937822296
- Learning neural network policies with guided policy search under unknown dynamics
- Sergey Levine and Pieter Abbeel. Learning neural network policies with guided policy search under unknown dynamics. In Advances in Neural Information Processing Systems, 2014.
- (2014) Advances in Neural Information Processing Systems
- Levine, S.¹ Abbeel, P.²

18
- 84979924150
- End-to-end training of deep visuo-motor policies
- Sergey Levine, Chelsea Finn, Trevor Darrell, and Pieter Abbeel. End-to-end training of deep visuo-motor policies. Journal of Machine Learning Research, 17:1-40, 2016.
- (2016) Journal of Machine Learning Research , vol.17 , pp. 1-40
- Levine, S.¹ Finn, C.² Darrell, T.³ Abbeel, P.⁴

19
- 17444424051
- Iterative linear quadratic regulator design for nonlinear biological movement systems
- Weiwei Li and Emanuel Todorov. Iterative linear quadratic regulator design for nonlinear biological movement systems. In ICINCO (1), 2004.
- (2004) ICINCO , Issue.1
- Li, W.¹ Todorov, E.²

20
- 85007167143
- Continuous control with deep reinforcement learning
- Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. Continuous control with deep reinforcement learning. CoRR, abs/1509.02971, 2015.
- (2015) CoRR
- Lillicrap, T.P.¹ Hunt, J.J.² Pritzel, A.³ Heess, N.⁴ Erez, T.⁵ Tassa, Y.⁶ Silver, D.⁷ Wierstra, D.⁸

21
- 1442270756
- Born to learn: What infants learn from watching us
- Andrew Meltzoff. Born to learn: What infants learn from watching us. Skillman, NJ: Pediatric Institute Publication, 1999.
- (1999) Born to Learn: What Infants Learn from Watching Us
- Meltzoff, A.¹

22
- 83755217735
- Dynamic time warping
- Meinard Müller. Dynamic time warping. Information retrieval for music and motion, pp. 69-84, 2007.
- (2007) Information Retrieval for Music and Motion , pp. 69-84
- Müller, M.¹

23
- 77956031473
- A survey on transfer learning
- Sinno Jialin Pan and Qiang Yang. A survey on transfer learning. IEEE Transactions on knowledge and data engineering, 22(10):1345-1359, 2010.
- (2010) IEEE Transactions on Knowledge and Data Engineering , vol.22 , Issue.10 , pp. 1345-1359
- Pan, S.J.¹ Yang, Q.²

24
- 84980002149
- A preliminary study of transfer learning between unicycle robots
- Kaizad V Raimalwala, Bruce A Francis, and Angela P Schoellig. A preliminary study of transfer learning between unicycle robots. In 2016 AAAI Spring Symposium Series, 2016.
- (2016) 2016 AAAI Spring Symposium Series
- Raimalwala, K.V.¹ Francis, B.A.² Schoellig, A.P.³

25
- 3943111670
- The mirror neuron system
- Giacomo Rizzolatti and Laila Craighero. The mirror neuron system. Annual Review of Neuroscience, 27:169-192, 2004.
- (2004) Annual Review of Neuroscience , vol.27 , pp. 169-192
- Rizzolatti, G.¹ Craighero, L.²

26
- 85027984488
- Progressive neural networks
- Andrei A. Rusu, Neil C. Rabinowitz, Guillaume Desjardins, Hubert Soyer, James Kirkpatrick, Koray Kavukcuoglu, Razvan Pascanu, and Raia Hadsell. Progressive neural networks. CoRR, abs/1606.04671, 2016a.
- (2016) CoRR
- Rusu, A.A.¹ Rabinowitz, N.C.² Desjardins, G.³ Soyer, H.⁴ Kirkpatrick, J.⁵ Kavukcuoglu, K.⁶ Pascanu, R.⁷ Hadsell, R.⁸

27
- 85030465352
- Sim-to-real robot learning from pixels with progressive nets
- Andrei A Rusu, Matej Vecerik, Thomas Rothörl, Nicolas Heess, Razvan Pascanu, and Raia Hadsell. Sim-to-real robot learning from pixels with progressive nets. arXiv preprint arXiv:1610.04286, 2016b.
- (2016) Sim-to-Real Robot Learning from Pixels with Progressive Nets
- Rusu, A.A.¹ Vecerik, M.² Rothörl, T.³ Heess, N.⁴ Pascanu, R.⁵ Hadsell, R.⁶

28
- 34848816477
- Transfer learning via inter-task mappings for temporal difference learning
- Matthew Taylor, Peter Stone, and Yaxin Liu. Transfer learning via inter-task mappings for temporal difference learning. Journal of Machine Learning Research, 8(1):2125-2167, 2007.
- (2007) Journal of Machine Learning Research , vol.8 , Issue.1 , pp. 2125-2167
- Taylor, M.¹ Stone, P.² Liu, Y.³

29
- 68949157375
- Transfer learning for reinforcement learning domains: A survey
- Matthew E. Taylor and Peter Stone. Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research, 10:1633-1685, 2009.
- (2009) Journal of Machine Learning Research , vol.10 , pp. 1633-1685
- Taylor, M.E.¹ Stone, P.²

30
- 84922201091
- Transferring instances for model-based reinforcement learning
- Matthew E. Taylor, Nicholas K. Jong, and Peter Stone. Transferring instances for model-based reinforcement learning. In Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 2008.
- (2008) Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD)
- Taylor, M.E.¹ Jong, N.K.² Stone, P.³

31
- 84872292044
- MuJoCo: A physics engine for model-based control
- E. Todorov, T. Erez, and Y. Tassa. MuJoCo: A physics engine for model-based control. In International Conference on Intelligent Robots and Systems (IROS), 2012.
- (2012) International Conference on Intelligent Robots and Systems (IROS)
- Todorov, E.¹ Erez, T.² Tassa, Y.³

32
- 84973897613
- Simultaneous deep transfer across domains and tasks
- Eric Tzeng, Judy Hoffman, Trevor Darrell, and Kate Saenko. Simultaneous deep transfer across domains and tasks. In International Conference in Computer Vision (ICCV), 2015.
- (2015) International Conference in Computer Vision (ICCV)
- Tzeng, E.¹ Hoffman, J.² Darrell, T.³ Saenko, K.⁴

33
- 38949171411
- When pliers become fingers in the monkey motor system
- M. A. Umilta, L. Escola, I. Intskirveli, F. Grammont, M. Rochat, F. Caruana, A. Jezzini, V. Gallese, and G. Rizzolatti. When pliers become fingers in the monkey motor system. Proceedings of the National Academy of Sciences, 105(6):2209-2213, 2008.
- (2008) Proceedings of the National Academy of Sciences , vol.105 , Issue.6 , pp. 2209-2213
- Umilta, M.A.¹ Escola, L.² Intskirveli, I.³ Grammont, F.⁴ Rochat, M.⁵ Caruana, F.⁶ Jezzini, A.⁷ Gallese, V.⁸ Rizzolatti, G.⁹

34
- 78751695885
- Manifold alignment without correspondence
- Chang Wang and Sridhar Mahadevan. Manifold alignment without correspondence. In IJCAI, volume 2, pp. 3, 2009.
- (2009) IJCAI , vol.2 , pp. 3
- Wang, C.¹ Mahadevan, S.²

35
- 85133386144
- Distance metric learning, with application to clustering with side-information
- Eric P. Xing, Andrew Y. Ng, Michael I. Jordan, and Stuart Russell. Distance metric learning, with application to clustering with side-information. In Proceedings of the 15th International Conference on Neural Information Processing Systems, NIPS'02, pp. 521-528, Cambridge, MA, USA, 2002. MIT Press. URL http://dl.acm.org/citation.cfm?id=2968618.2968683.
- (2002) Proceedings of the 15th International Conference on Neural Information Processing Systems, NIPS'02 , pp. 521-528
- Xing, E.P.¹ Ng, A.Y.² Jordan, M.I.³ Russell, S.⁴

* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.