-
1
-
-
0033151712
-
Is imitation learning the route to humanoid robots
-
S. Schaal, "Is imitation learning the route to humanoid robots?" Trends in cognitive sciences, vol. 3, no. 6, pp. 233-242, 1999.
-
(1999)
Trends in Cognitive Sciences
, vol.3
, Issue.6
, pp. 233-242
-
-
Schaal, S.1
-
3
-
-
63149159130
-
A survey of robot learning from demonstration
-
B. D. Argall, S. Chernova, M. Veloso, and B. Browning, "A survey of robot learning from demonstration," Robotics and autonomous systems, vol. 57, no. 5, pp. 469-483, 2009.
-
(2009)
Robotics and Autonomous Systems
, vol.57
, Issue.5
, pp. 469-483
-
-
Argall, B.D.1
Chernova, S.2
Veloso, M.3
Browning, B.4
-
5
-
-
85010075270
-
-
M. Bojarski, D. Del Testa, D. Dworakowski, B. Firner, B. Flepp, P. Goyal, L. D. Jackel, M. Monfort, U. Muller, J. Zhang, et al., "End to end learning for self-driving cars," arXiv preprint arXiv:1604.07316, 2016.
-
(2016)
End to End Learning for Self-driving Cars
-
-
Bojarski, M.1
Del Testa, D.2
Dworakowski, D.3
Firner, B.4
Flepp, B.5
Goyal, P.6
Jackel, L.D.7
Monfort, M.8
Muller, U.9
Zhang, J.10
-
6
-
-
85058585233
-
A machine learning approach to visual perception of forest trails for mobile robots
-
A. Giusti, J. Guzzi, D. C. Cires?an, F.-L. He, J. P. Rodríguez, F. Fontana, M. Faessler, C. Forster, J. Schmidhuber, G. Di Caro, et al., "A machine learning approach to visual perception of forest trails for mobile robots," IEEE Robotics and Automation Letters, vol. 1, no. 2, pp. 661-667, 2016.
-
(2016)
IEEE Robotics and Automation Letters
, vol.1
, Issue.2
, pp. 661-667
-
-
Giusti, A.1
Guzzi, J.2
Ciresan, D.C.3
He, F.-L.4
Rodríguez, J.P.5
Fontana, F.6
Faessler, M.7
Forster, C.8
Schmidhuber, J.9
Di Caro, G.10
-
8
-
-
77953328215
-
Learning and reproduction of gestures by imitation
-
S. Calinon, F. D'halluin, E. L. Sauser, D. G. Caldwell, and A. G. Billard, "Learning and reproduction of gestures by imitation," IEEE Robotics & Automation Magazine, vol. 17, no. 2, pp. 44-54, 2010.
-
(2010)
IEEE Robotics & Automation Magazine
, vol.17
, Issue.2
, pp. 44-54
-
-
Calinon, S.1
D'Halluin, F.2
Sauser, E.L.3
Caldwell, D.G.4
Billard, A.G.5
-
9
-
-
84868033442
-
Keyframe-based learning from demonstration
-
B. Akgun, M. Cakmak, K. Jiang, and A. L. Thomaz, "Keyframe-based learning from demonstration," International Journal of Social Robotics, vol. 4, no. 4, pp. 343-355, 2012.
-
(2012)
International Journal of Social Robotics
, vol.4
, Issue.4
, pp. 343-355
-
-
Akgun, B.1
Cakmak, M.2
Jiang, K.3
Thomaz, A.L.4
-
10
-
-
84911495345
-
Learning from demonstrations through the use of non-rigid registration
-
J. Schulman, J. Ho, C. Lee, and P. Abbeel, "Learning from demonstrations through the use of non-rigid registration," in Proceedings of the 16th International Symposium on Robotics Research (ISRR), 2013.
-
(2013)
Proceedings of the 16th International Symposium on Robotics Research (ISRR)
-
-
Schulman, J.1
Ho, J.2
Lee, C.3
Abbeel, P.4
-
11
-
-
84906702595
-
Teleoperation with intelligent and customizable interfaces
-
A. Dragan, K. C. Lee, and S. Srinivasa, "Teleoperation with intelligent and customizable interfaces," Journal of Human-Robot Interaction, vol. 1, no. 3, 2013.
-
(2013)
Journal of Human-Robot Interaction
, vol.1
, Issue.3
-
-
Dragan, A.1
Lee, K.C.2
Srinivasa, S.3
-
13
-
-
78650930204
-
Practical methods for optimal control and estimation using nonlinear programming
-
J. T. Betts, Practical methods for optimal control and estimation using nonlinear programming. SIAM, 2010.
-
(2010)
SIAM
-
-
Betts, J.T.1
-
14
-
-
84892720625
-
A direct method for trajectory optimization of rigid bodies through contact
-
M. Posa, C. Cantu, and R. Tedrake, "A direct method for trajectory optimization of rigid bodies through contact," The International Journal of Robotics Research, vol. 33, no. 1, pp. 69-81, 2014.
-
(2014)
The International Journal of Robotics Research
, vol.33
, Issue.1
, pp. 69-81
-
-
Posa, M.1
Cantu, C.2
Tedrake, R.3
-
15
-
-
84937822296
-
Learning neural network policies with guided policy search under unknown dynamics
-
S. Levine and P. Abbeel, "Learning neural network policies with guided policy search under unknown dynamics," in Advances in Neural Information Processing Systems, 2014, pp. 1071-1079.
-
(2014)
Advances in Neural Information Processing Systems
, pp. 1071-1079
-
-
Levine, S.1
Abbeel, P.2
-
16
-
-
40649106649
-
Natural actor-critic
-
J. Peters and S. Schaal, "Natural actor-critic," Neurocomputing, vol. 71, no. 7, pp. 1180-1190, 2008.
-
(2008)
Neurocomputing
, vol.71
, Issue.7
, pp. 1180-1190
-
-
Peters, J.1
Schaal, S.2
-
17
-
-
84924051598
-
Human-level control through deep reinforcement learning
-
V. Mnih, K. Kavukcuoglu, D. Silver, A. A. Rusu, J. Veness, M. G. Bellemare, A. Graves, M. Riedmiller, A. K. Fidjeland, G. Ostrovski, et al., "Human-level control through deep reinforcement learning," Nature, vol. 518, no. 7540, pp. 529-533, 2015.
-
(2015)
Nature
, vol.518
, Issue.7540
, pp. 529-533
-
-
Mnih, V.1
Kavukcuoglu, K.2
Silver, D.3
Rusu, A.A.4
Veness, J.5
Bellemare, M.G.6
Graves, A.7
Riedmiller, M.8
Fidjeland, A.K.9
Ostrovski, G.10
-
18
-
-
84969963490
-
Trust region policy optimization
-
J. Schulman, S. Levine, P. Abbeel, M. Jordan, and P. Moritz, "Trust region policy optimization," in Proceedings of the 32nd International Conference on Machine Learning (ICML-15), 2015, pp. 1889-1897.
-
(2015)
Proceedings of the 32nd International Conference on Machine Learning (ICML-15)
, pp. 1889-1897
-
-
Schulman, J.1
Levine, S.2
Abbeel, P.3
Jordan, M.4
Moritz, P.5
-
19
-
-
84965135289
-
-
T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning," arXiv preprint arXiv:1509.02971, 2015.
-
(2015)
Continuous Control with Deep Reinforcement Learning
-
-
Lillicrap, T.P.1
Hunt, J.J.2
Pritzel, A.3
Heess, N.4
Erez, T.5
Tassa, Y.6
Silver, D.7
Wierstra, D.8
-
20
-
-
84979924150
-
End-to-end training of deep visuomotor policies
-
S. Levine, C. Finn, T. Darrell, and P. Abbeel, "End-to-end training of deep visuomotor policies," Journal of Machine Learning Research, vol. 17, no. 39, pp. 1-40, 2016.
-
(2016)
Journal of Machine Learning Research
, vol.17
, Issue.39
, pp. 1-40
-
-
Levine, S.1
Finn, C.2
Darrell, T.3
Abbeel, P.4
-
21
-
-
84971448181
-
Asynchronous methods for deep reinforcement learning
-
V. Mnih, A. P. Badia, M. Mirza, A. Graves, T. Lillicrap, T. Harley, D. Silver, and K. Kavukcuoglu, "Asynchronous methods for deep reinforcement learning," in International Conference on Machine Learning, 2016, pp. 1928-1937.
-
(2016)
International Conference on Machine Learning
, pp. 1928-1937
-
-
Mnih, V.1
Badia, A.P.2
Mirza, M.3
Graves, A.4
Lillicrap, T.5
Harley, T.6
Silver, D.7
Kavukcuoglu, K.8
-
22
-
-
85041194636
-
-
J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, "Proximal policy optimization algorithms," arXiv preprint arXiv:1707.06347, 2017.
-
(2017)
Proximal Policy Optimization Algorithms
-
-
Schulman, J.1
Wolski, F.2
Dhariwal, P.3
Radford, A.4
Klimov, O.5
-
23
-
-
0036675366
-
Robotic gastrointestinal surgery: Early experience and system description
-
M. Talamini, K. Campbell, and C. Stanfield, "Robotic gastrointestinal surgery: early experience and system description," Journal of laparoendoscopic & advanced surgical techniques, vol. 12, no. 4, pp. 225-232, 2002.
-
(2002)
Journal of Laparoendoscopic & Advanced Surgical Techniques
, vol.12
, Issue.4
, pp. 225-232
-
-
Talamini, M.1
Campbell, K.2
Stanfield, C.3
-
24
-
-
84867135104
-
A reduction of imitation learning and structured prediction to no-regret online learning
-
S. Ross, G. J. Gordon, and D. Bagnell, "A reduction of imitation learning and structured prediction to no-regret online learning." in AISTATS, vol. 1, no. 2, 2011, p. 6.
-
(2011)
AISTATS
, vol.1
, Issue.2
, pp. 6
-
-
Ross, S.1
Gordon, G.J.2
Bagnell, D.3
-
25
-
-
0042547347
-
Algorithms for inverse reinforcement learning
-
A. Y. Ng, S. J. Russell, et al., "Algorithms for inverse reinforcement learning." in Icml, 2000, pp. 663-670.
-
(2000)
ICML
, pp. 663-670
-
-
Ng, A.Y.1
Russell, S.J.2
-
27
-
-
57749097473
-
Maximum entropy inverse reinforcement learning
-
B. Ziebart, A. Maas, J. A. Bagnell, and A. K. Dey, "Maximum entropy inverse reinforcement learning," in AAAI Conference on Artificial Intelligence, 2008.
-
(2008)
AAAI Conference on Artificial Intelligence
-
-
Ziebart, B.1
Maas, A.2
Bagnell, J.A.3
Dey, A.K.4
-
29
-
-
84989292118
-
Guided cost learning: Deep inverse optimal control via policy optimization
-
C. Finn, S. Levine, and P. Abbeel, "Guided cost learning: Deep inverse optimal control via policy optimization," in Proceedings of the 33rd International Conference on Machine Learning, vol. 48, 2016.
-
(2016)
Proceedings of the 33rd International Conference on Machine Learning
, vol.48
-
-
Finn, C.1
Levine, S.2
Abbeel, P.3
-
31
-
-
2942523202
-
Discovering optimal imitation strategies
-
A. Billard, Y. Epars, S. Calinon, S. Schaal, and G. Cheng, "Discovering optimal imitation strategies," Robotics and autonomous systems, vol. 47, no. 2, pp. 69-77, 2004.
-
(2004)
Robotics and Autonomous Systems
, vol.47
, Issue.2
, pp. 69-77
-
-
Billard, A.1
Epars, Y.2
Calinon, S.3
Schaal, S.4
Cheng, G.5
-
32
-
-
84885081372
-
Learning movement primitives
-
S. Schaal, J. Peters, J. Nakanishi, and A. Ijspeert, "Learning movement primitives," Robotics Research, pp. 561-572, 2005.
-
(2005)
Robotics Research
, pp. 561-572
-
-
Schaal, S.1
Peters, J.2
Nakanishi, J.3
Ijspeert, A.4
-
33
-
-
85105191314
-
Learning and generalization of motor skills by learning from demonstration
-
P. Pastor, H. Hoffmann, T. Asfour, and S. Schaal, "Learning and generalization of motor skills by learning from demonstration," in Robotics and Automation, 2009. ICRA'09. IEEE International Conference on. IEEE, 2009, pp. 763-768.
-
(2009)
Robotics and Automation, 2009. ICRA'09. IEEE International Conference On. IEEE
, pp. 763-768
-
-
Pastor, P.1
Hoffmann, H.2
Asfour, T.3
Schaal, S.4
-
34
-
-
67649736232
-
Imitation learning for locomotion and manipulation
-
N. Ratliff, J. A. Bagnell, and S. S. Srinivasa, "Imitation learning for locomotion and manipulation," in Humanoid Robots, 2007 7th IEEE-RAS International Conference on. IEEE, 2007, pp. 392-397.
-
(2007)
Humanoid Robots, 2007 7th IEEE-RAS International Conference On. IEEE
, pp. 392-397
-
-
Ratliff, N.1
Bagnell, J.A.2
Srinivasa, S.S.3
-
35
-
-
85040313507
-
-
T. Hester, M. Vecerik, O. Pietquin, M. Lanctot, T. Schaul, B. Piot, A. Sendonaris, G. Dulac-Arnold, I. Osband, J. Agapiou, et al., "Learning from demonstrations for real world reinforcement learning," arXiv preprint arXiv:1704.03732, 2017.
-
(2017)
Learning from Demonstrations for Real World Reinforcement Learning
-
-
Hester, T.1
Vecerik, M.2
Pietquin, O.3
Lanctot, M.4
Schaul, T.5
Piot, B.6
Sendonaris, A.7
Dulac-Arnold, G.8
Osband, I.9
Agapiou, J.10
-
36
-
-
84887290913
-
Learning monocular reactive uav control in cluttered natural environments
-
S. Ross, N. Melik-Barkhudarov, K. S. Shankar, A. Wendel, D. Dey, J. A. Bagnell, and M. Hebert, "Learning monocular reactive uav control in cluttered natural environments," in Robotics and Automation (ICRA), 2013 IEEE International Conference on. IEEE, 2013, pp. 1765-1772.
-
(2013)
Robotics and Automation (ICRA), 2013 IEEE International Conference On. IEEE
, pp. 1765-1772
-
-
Ross, S.1
Melik-Barkhudarov, N.2
Shankar, K.S.3
Wendel, A.4
Dey, D.5
Bagnell, J.A.6
Hebert, M.7
-
37
-
-
85041675928
-
-
R. Rahmatizadeh, P. Abolghasemi, L. Bölöni, and S. Levine, "Visionbased multi-task manipulation for inexpensive robots using end-to-end learning from demonstration," arXiv preprint arXiv:1707.02920, 2017.
-
(2017)
Visionbased Multi-task Manipulation for Inexpensive Robots Using End-to-end Learning from Demonstration
-
-
Rahmatizadeh, R.1
Abolghasemi, P.2
Bölöni, L.3
Levine, S.4
-
39
-
-
85046997736
-
-
Y. Liu, A. Gupta, P. Abbeel, and S. Levine, "Imitation from observation: Learning to imitate behaviors from raw video via context translation," arXiv preprint arXiv:1707.03374, 2017.
-
(2017)
Imitation from Observation: Learning to Imitate Behaviors from Raw Video Via Context Translation
-
-
Liu, Y.1
Gupta, A.2
Abbeel, P.3
Levine, S.4
-
40
-
-
84879991767
-
Teleoperation of a humanoid robot using full-body motion capture, example movements, and machine learning
-
C. Stanton, A. Bogdanovych, and E. Ratanasena, "Teleoperation of a humanoid robot using full-body motion capture, example movements, and machine learning," in Proc. Australasian Conference on Robotics and Automation, 2012.
-
(2012)
Proc. Australasian Conference on Robotics and Automation
-
-
Stanton, C.1
Bogdanovych, A.2
Ratanasena, E.3
-
41
-
-
84962300132
-
First-person teleoperation of a humanoid robot
-
L. Fritsche, F. Unverzag, J. Peters, and R. Calandra, "First-person teleoperation of a humanoid robot," in Humanoid Robots (Humanoids), 2015 IEEE-RAS 15th International Conference on. IEEE, 2015, pp. 997-1002.
-
(2015)
Humanoid Robots (Humanoids), 2015 IEEE-RAS 15th International Conference On. IEEE
, pp. 997-1002
-
-
Fritsche, L.1
Unverzag, F.2
Peters, J.3
Calandra, R.4
-
42
-
-
85045154957
-
Baxter's homunculus: Virtual reality spaces for teleoperation in manufacturing
-
J. I. Lipton, A. J. Fay, and D. Rus, "Baxter's homunculus: Virtual reality spaces for teleoperation in manufacturing," IEEE Robotics and Automation Letters, vol. 3, no. 1, pp. 179-186, 2018.
-
(2018)
IEEE Robotics and Automation Letters
, vol.3
, Issue.1
, pp. 179-186
-
-
Lipton, J.I.1
Fay, A.J.2
Rus, D.3
-
44
-
-
85048682222
-
-
E. Rosen, D. Whitney, E. Phillips, G. Chien, J. Tompkin, G. Konidaris, and S. Tellex, "Communicating robot arm motion intent through mixed reality head-mounted displays," arXiv preprint arXiv:1708.03655, 2017.
-
(2017)
Communicating Robot Arm Motion Intent Through Mixed Reality Head-mounted Displays
-
-
Rosen, E.1
Whitney, D.2
Phillips, E.3
Chien, G.4
Tompkin, J.5
Konidaris, G.6
Tellex, S.7
-
45
-
-
85053845526
-
-
X. Yan, M. Khansari, Y. Bai, J. Hsu, A. Pathak, A. Gupta, J. Davidson, and H. Lee, "Learning grasping interaction with geometry-aware 3d representations," arXiv preprint arXiv:1708.07303, 2017.
-
(2017)
Learning Grasping Interaction with Geometry-aware 3d Representations
-
-
Yan, X.1
Khansari, M.2
Bai, Y.3
Hsu, J.4
Pathak, A.5
Gupta, A.6
Davidson, J.7
Lee, H.8
-
46
-
-
85063130521
-
Learning dexterous manipulation policies from experience and imitation
-
V. Kumar, A. Gupta, E. Todorov, and S. Levine, "Learning dexterous manipulation policies from experience and imitation," in ICRA, 2016.
-
(2016)
ICRA
-
-
Kumar, V.1
Gupta, A.2
Todorov, E.3
Levine, S.4
-
47
-
-
33845397555
-
Autonomous inverted helicopter flight via reinforcement learning
-
Springer Berlin Heidelberg
-
A. Y. Ng, A. Coates, M. Diel, V. Ganapathi, J. Schulte, B. Tse, E. Berger, and E. Liang, "Autonomous inverted helicopter flight via reinforcement learning," in Experimental Robotics IX. Springer Berlin Heidelberg, 2006, pp. 363-372.
-
(2006)
Experimental Robotics IX
, pp. 363-372
-
-
Ng, A.Y.1
Coates, A.2
Diel, M.3
Ganapathi, V.4
Schulte, J.5
Tse, B.6
Berger, E.7
Liang, E.8
-
48
-
-
44949241322
-
Reinforcement learning of motor skills with policy gradients
-
J. Peters and S. Schaal, "Reinforcement learning of motor skills with policy gradients," Neural networks, vol. 21, no. 4, pp. 682-697, 2008.
-
(2008)
Neural Networks
, vol.21
, Issue.4
, pp. 682-697
-
-
Peters, J.1
Schaal, S.2
-
49
-
-
14044262287
-
Stochastic policy gradient reinforcement learning on a simple 3d biped
-
IEEE
-
R. Tedrake, T. W. Zhang, and H. S. Seung, "Stochastic policy gradient reinforcement learning on a simple 3d biped," in Intelligent Robots and Systems, 2004.(IROS 2004). Proceedings. 2004 IEEE/RSJ International Conference on, vol. 3. IEEE, 2004, pp. 2849-2854.
-
(2004)
Intelligent Robots and Systems, 2004. (IROS 2004). Proceedings. 2004 IEEE/RSJ International Conference on
, vol.3
, pp. 2849-2854
-
-
Tedrake, R.1
Zhang, T.W.2
Seung, H.S.3
-
50
-
-
0042547347
-
Algorithms for inverse reinforcement learning
-
A. Y. Ng and S. J. Russell, "Algorithms for inverse reinforcement learning." in Icml, 2000, pp. 663-670.
-
(2000)
Icml
, pp. 663-670
-
-
Ng, A.Y.1
Russell, S.J.2
-
51
-
-
85025120617
-
-
M. Jaderberg, V. Mnih, W. M. Czarnecki, T. Schaul, J. Z. Leibo, D. Silver, and K. Kavukcuoglu, "Reinforcement learning with unsupervised auxiliary tasks," arXiv preprint arXiv:1611.05397, 2016.
-
(2016)
Reinforcement Learning with Unsupervised Auxiliary Tasks
-
-
Jaderberg, M.1
Mnih, V.2
Czarnecki, W.M.3
Schaul, T.4
Leibo, J.Z.5
Silver, D.6
Kavukcuoglu, K.7
|