-
3
-
-
85007212130
-
Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment
-
Haitham Bou Ammar, Eric Eaton, Paul Ruvolo, and Matthew E Taylor. Unsupervised cross-domain transfer in policy gradient reinforcement learning via manifold alignment. In Proc. of AAAI, 2015b.
-
(2015)
Proc. Of AAAI
-
-
Ammar, H.B.1
Eaton, E.2
Ruvolo, P.3
Taylor, M.E.4
-
4
-
-
9444270330
-
Exploiting task relatedness for multiple task learning
-
Springer
-
Shai Ben-David and Reba Schuller. Exploiting task relatedness for multiple task learning. In Learning Theory and Kernel Machines, pp. 567-580. Springer, 2003.
-
(2003)
Learning Theory and Kernel Machines
, pp. 567-580
-
-
Ben-David, S.1
Schuller, R.2
-
7
-
-
24644436425
-
Learning a similarity metric discriminatively, with application to face verification
-
Sumit Chopra, Raia Hadsell, and Yann LeCun. Learning a similarity metric discriminatively, with application to face verification. In Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, volume 1, pp. 539-546. IEEE, 2005.
-
(2005)
Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on
, vol.1
, pp. 539-546
-
-
Chopra, S.1
Hadsell, R.2
LeCun, Y.3
-
9
-
-
85041944275
-
-
arXiv preprint
-
Coline Devin, Abhishek Gupta, Trevor Darrell, Pieter Abbeel, and Sergey Levine. Learning modular neural network policies for multi-task and multi-robot transfer. arXiv preprint arXiv:1609.07088, 2016.
-
(2016)
Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer
-
-
Devin, C.1
Gupta, A.2
Darrell, T.3
Abbeel, P.4
Levine, S.5
-
10
-
-
13244267151
-
Mirror neurons responding to observation of actions made with tools in monkey ventral premotor cortex
-
P. F. Ferrari, S. Rozzi, and L. Fogassi. Mirror neurons responding to observation of actions made with tools in monkey ventral premotor cortex. Journal of Cognitive Neuroscience, 17(2), 2005.
-
(2005)
Journal of Cognitive Neuroscience
, vol.17
, Issue.2
-
-
Ferrari, P.F.1
Rozzi, S.2
Fogassi, L.3
-
11
-
-
84979887690
-
Domain-adversarial training of neural networks
-
Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, Francois Laviolette, Mario Marchand, and Victor Lempitsky. Domain-adversarial training of neural networks. Journal of Machine Learning Research, 17, 2016.
-
(2016)
Journal of Machine Learning Research
, vol.17
-
-
Ganin, Y.1
Ustinova, E.2
Ajakan, H.3
Germain, P.4
Larochelle, H.5
Laviolette, F.6
Marchand, M.7
Lempitsky, V.8
-
13
-
-
0000107975
-
Relations between two sets of variates
-
Harold Hotelling. Relations between two sets of variates. Biometrika, 28, 1936.
-
(1936)
Biometrika
, vol.28
-
-
Hotelling, H.1
-
18
-
-
84979924150
-
End-to-end training of deep visuo-motor policies
-
Sergey Levine, Chelsea Finn, Trevor Darrell, and Pieter Abbeel. End-to-end training of deep visuo-motor policies. Journal of Machine Learning Research, 17:1-40, 2016.
-
(2016)
Journal of Machine Learning Research
, vol.17
, pp. 1-40
-
-
Levine, S.1
Finn, C.2
Darrell, T.3
Abbeel, P.4
-
19
-
-
17444424051
-
Iterative linear quadratic regulator design for nonlinear biological movement systems
-
Weiwei Li and Emanuel Todorov. Iterative linear quadratic regulator design for nonlinear biological movement systems. In ICINCO (1), 2004.
-
(2004)
ICINCO
, Issue.1
-
-
Li, W.1
Todorov, E.2
-
20
-
-
85007167143
-
Continuous control with deep reinforcement learning
-
abs
-
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. Continuous control with deep reinforcement learning. CoRR, abs/1509.02971, 2015.
-
(2015)
CoRR
-
-
Lillicrap, T.P.1
Hunt, J.J.2
Pritzel, A.3
Heess, N.4
Erez, T.5
Tassa, Y.6
Silver, D.7
Wierstra, D.8
-
26
-
-
85027984488
-
Progressive neural networks
-
abs
-
Andrei A. Rusu, Neil C. Rabinowitz, Guillaume Desjardins, Hubert Soyer, James Kirkpatrick, Koray Kavukcuoglu, Razvan Pascanu, and Raia Hadsell. Progressive neural networks. CoRR, abs/1606.04671, 2016a.
-
(2016)
CoRR
-
-
Rusu, A.A.1
Rabinowitz, N.C.2
Desjardins, G.3
Soyer, H.4
Kirkpatrick, J.5
Kavukcuoglu, K.6
Pascanu, R.7
Hadsell, R.8
-
27
-
-
85030465352
-
-
arXiv preprint
-
Andrei A Rusu, Matej Vecerik, Thomas Rothörl, Nicolas Heess, Razvan Pascanu, and Raia Hadsell. Sim-to-real robot learning from pixels with progressive nets. arXiv preprint arXiv:1610.04286, 2016b.
-
(2016)
Sim-to-Real Robot Learning from Pixels with Progressive Nets
-
-
Rusu, A.A.1
Vecerik, M.2
Rothörl, T.3
Heess, N.4
Pascanu, R.5
Hadsell, R.6
-
28
-
-
34848816477
-
Transfer learning via inter-task mappings for temporal difference learning
-
Matthew Taylor, Peter Stone, and Yaxin Liu. Transfer learning via inter-task mappings for temporal difference learning. Journal of Machine Learning Research, 8(1):2125-2167, 2007.
-
(2007)
Journal of Machine Learning Research
, vol.8
, Issue.1
, pp. 2125-2167
-
-
Taylor, M.1
Stone, P.2
Liu, Y.3
-
29
-
-
68949157375
-
Transfer learning for reinforcement learning domains: A survey
-
Matthew E. Taylor and Peter Stone. Transfer learning for reinforcement learning domains: A survey. Journal of Machine Learning Research, 10:1633-1685, 2009.
-
(2009)
Journal of Machine Learning Research
, vol.10
, pp. 1633-1685
-
-
Taylor, M.E.1
Stone, P.2
-
33
-
-
38949171411
-
When pliers become fingers in the monkey motor system
-
M. A. Umilta, L. Escola, I. Intskirveli, F. Grammont, M. Rochat, F. Caruana, A. Jezzini, V. Gallese, and G. Rizzolatti. When pliers become fingers in the monkey motor system. Proceedings of the National Academy of Sciences, 105(6):2209-2213, 2008.
-
(2008)
Proceedings of the National Academy of Sciences
, vol.105
, Issue.6
, pp. 2209-2213
-
-
Umilta, M.A.1
Escola, L.2
Intskirveli, I.3
Grammont, F.4
Rochat, M.5
Caruana, F.6
Jezzini, A.7
Gallese, V.8
Rizzolatti, G.9
-
34
-
-
78751695885
-
Manifold alignment without correspondence
-
Chang Wang and Sridhar Mahadevan. Manifold alignment without correspondence. In IJCAI, volume 2, pp. 3, 2009.
-
(2009)
IJCAI
, vol.2
, pp. 3
-
-
Wang, C.1
Mahadevan, S.2
-
35
-
-
85133386144
-
Distance metric learning, with application to clustering with side-information
-
Cambridge, MA, USA, MIT Press. URL
-
Eric P. Xing, Andrew Y. Ng, Michael I. Jordan, and Stuart Russell. Distance metric learning, with application to clustering with side-information. In Proceedings of the 15th International Conference on Neural Information Processing Systems, NIPS'02, pp. 521-528, Cambridge, MA, USA, 2002. MIT Press. URL http://dl.acm.org/citation.cfm?id=2968618.2968683.
-
(2002)
Proceedings of the 15th International Conference on Neural Information Processing Systems, NIPS'02
, pp. 521-528
-
-
Xing, E.P.1
Ng, A.Y.2
Jordan, M.I.3
Russell, S.4
|