-
2
-
-
77955809093
-
Autonomous helicopter aerobatics through apprenticeship learning
-
Abbeel, P., Coates, A. and Ng, A. (2010), "Autonomous helicopter aerobatics through apprenticeship learning" in International Journal of Robotics Research, Vol. 29, No. 13, pp. 1608-39.
-
(2010)
International Journal of Robotics Research
, vol.29
, Issue.13
, pp. 1608-1639
-
-
Abbeel, P.1
Coates, A.2
Ng, A.3
-
3
-
-
84883027643
-
Autonomous autorotation of an RC helicopter
-
Abbeel, P., Coates, A., Hunter, T. and Ng, A. (2009), "Autonomous autorotation of an RC helicopter" in Proceedings of the International Symposium on Experimental Robotics, pp. 385-94.
-
(2009)
Proceedings of the International Symposium on Experimental Robotics
, pp. 385-394
-
-
Abbeel, P.1
Coates, A.2
Hunter, T.3
Ng, A.4
-
4
-
-
84864030941
-
An application of reinforcement learning to aerobatic helicopter flight
-
Abbeel, P., Coates, A., Quigley, M. and Ng, A. (2007), "An application of reinforcement learning to aerobatic helicopter flight" in Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference, p. 1.
-
(2007)
Advances in Neural Information Processing Systems 19: Proceedings of the 2006 Conference
, pp. 1
-
-
Abbeel, P.1
Coates, A.2
Quigley, M.3
Ng, A.4
-
5
-
-
67650136522
-
Apprenticeship learning for motion planning with application to parking lot navigation
-
Abbeel, P., Dolgov, D., Ng, A. and Thrun, S. (2008), "Apprenticeship learning for motion planning with application to parking lot navigation" in Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on, pp. 1083-90.
-
(2008)
Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
, pp. 1083-1090
-
-
Abbeel, P.1
Dolgov, D.2
Ng, A.3
Thrun, S.4
-
6
-
-
0000396062
-
Natural gradient works efficiently in learning
-
Amari, S. (1998), "Natural gradient works efficiently in learning" in Neural Computation, Vol. 10, No. 2, pp. 251-76.
-
(1998)
Neural Computation
, vol.10
, Issue.2
, pp. 251-276
-
-
Amari, S.1
-
7
-
-
63149159130
-
A survey of robot learning from demonstration
-
Argall, B., Chernova, S., Veloso, M. and Browning, B. (2009), "A survey of robot learning from demonstration" in Robotics and Autonomous Systems, Vol. 57, No. 5, pp. 469-83.
-
(2009)
Robotics and Autonomous Systems
, vol.57
, Issue.5
, pp. 469-483
-
-
Argall, B.1
Chernova, S.2
Veloso, M.3
Browning, B.4
-
8
-
-
0002130986
-
Robot learning from demonstration
-
Morgan Kaufmann, Burlington, MA
-
Atkeson, C. and Schaal, S. (1997), "Robot learning from demonstration" in Proceedings of the International Conference on Machine Learning (ICML'97), Morgan Kaufmann, Burlington, MA, pp. 12-20.
-
(1997)
Proceedings of the International Conference on Machine Learning (ICML'97)
, pp. 12-20
-
-
Atkeson, C.1
Schaal, S.2
-
9
-
-
80053440459
-
Apprenticeship learning about multiple intentions
-
Babes, M., Marivate, V., Littman, M. and Subramanian, K. (2011), "Apprenticeship learning about multiple intentions", Proceedings of International Conference on Machine Learning (ICML 2011).
-
(2011)
Proceedings of International Conference on Machine Learning (ICML 2011)
-
-
Babes, M.1
Marivate, V.2
Littman, M.3
Subramanian, K.4
-
11
-
-
84862293297
-
Relative entropy inverse reinforcement learning
-
Boularias, A., Kober, J. and Peters, J. (2011), "Relative entropy inverse reinforcement learning" in Proceedings of Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), JMLR WC&P, Vol. 15, pp. 182-9.
-
(2011)
Proceedings of Fourteenth International Conference on Artificial Intelligence and Statistics (AISTATS 2011), JMLR WC&P
, vol.15
, pp. 182-189
-
-
Boularias, A.1
Kober, J.2
Peters, J.3
-
12
-
-
84865717612
-
User simulation in dialogue systems using inverse reinforcement learning
-
Chandramohan, S., Geist, M., Lefevre, F. and Pietquin, O. (2011), "User simulation in dialogue systems using inverse reinforcement learning", Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence (Italy), August.
-
(2011)
Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech 2011), Florence (Italy), August
-
-
Chandramohan, S.1
Geist, M.2
Lefevre, F.3
Pietquin, O.4
-
15
-
-
56449129785
-
Learning for control from multiple demonstrations
-
Coates, A., Abbeel, P. and Ng, A. (2008), "Learning for control from multiple demonstrations" in Proceedings of the 25th International Conference on Machine Learning, pp. 144-51.
-
(2008)
Proceedings of the 25th International Conference on Machine Learning
, pp. 144-151
-
-
Coates, A.1
Abbeel, P.2
Ng, A.3
-
16
-
-
78649831352
-
Selecting operator queries using expected myopic gain
-
Cohn, R., Maxim, M., Durfee, E. and Singh, S. (2010), "Selecting operator queries using expected myopic gain" in IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, pp. 40-7.
-
(2010)
IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
, pp. 40-47
-
-
Cohn, R.1
Maxim, M.2
Durfee, E.3
Singh, S.4
-
17
-
-
84877776997
-
Bayesian multitask inverse reinforcement learning
-
Dimitrakakis, C. and Rothkopf, C. (2011), "Bayesian multitask inverse reinforcement learning", paper presented at the 9th European Workshop on Reinforcement Learning (EWRL 2011), Athens, Greece, 9-11 September.
-
(2011)
Paper Presented at the 9th European Workshop on Reinforcement Learning (EWRL 2011), Athens, Greece, 9-11 September
-
-
Dimitrakakis, C.1
Rothkopf, C.2
-
18
-
-
0000030684
-
The expected-utility hypothesis and the measurability of utility
-
Friedman, M. and Savage, L. (1952), "The expected-utility hypothesis and the measurability of utility" in The Journal of Political Economy, No. 6, pp. 463-74.
-
(1952)
The Journal of Political Economy
, Issue.6
, pp. 463-474
-
-
Friedman, M.1
Savage, L.2
-
19
-
-
84871697922
-
Donut as I do: Learning from failed demonstrations
-
Grollman, D. and Billard, A. (2011), "Donut as I do: learning from failed demonstrations" in IEEE International Conference on Robotics and Automation, Shanghai, 9-13 May, pp. 9-13.
-
(2011)
IEEE International Conference on Robotics and Automation, Shanghai, 9-13 May
, pp. 9-13
-
-
Grollman, D.1
Billard, A.2
-
20
-
-
0029720233
-
Human-to-robot skill transfer using the spore approximation
-
Grudic, G. and Lawrence, P. (1996), "Human-to-robot skill transfer using the spore approximation" in Robotics and Automation, Proceedings, 1996 IEEE International Conference on, Vol. 4, pp. 2962-7.
-
(1996)
Robotics and Automation, Proceedings, 1996 IEEE International Conference on
, vol.4
, pp. 2962-2967
-
-
Grudic, G.1
Lawrence, P.2
-
21
-
-
77955814312
-
Learning to navigate through crowded environments
-
Henry, P., Vollmer, C., Ferris, B. and Fox, D. (2010), "Learning to navigate through crowded environments" in Robotics and Automation (ICRA), 2010 IEEE International Conference on, pp. 981-6.
-
(2010)
Robotics and Automation (ICRA), 2010 IEEE International Conference on
, pp. 981-986
-
-
Henry, P.1
Vollmer, C.2
Ferris, B.3
Fox, D.4
-
22
-
-
2342632212
-
Solving a huge number of similar tasks: A combination of multi-task learning and a hierarchical Bayesian approach
-
Heskes, T. (1998), "Solving a huge number of similar tasks: a combination of multi-task learning and a hierarchical Bayesian approach" in Proceedings of the 15th International Conference on Machine Learning (ICML'98), pp. 233-41.
-
(1998)
Proceedings of the 15th International Conference on Machine Learning (ICML'98)
, pp. 233-241
-
-
Heskes, T.1
-
23
-
-
11944275853
-
Information theory and statistical mechanics
-
Jaynes, E. (1957), "Information theory and statistical mechanics" in Physical Review, Vol. 108, No. 2, p. 171.
-
(1957)
Physical Review
, vol.108
, Issue.2
, pp. 171
-
-
Jaynes, E.1
-
24
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L., Littman, M. and Moore, A. (1996), "Reinforcement learning: a survey" in Journal of Artificial Intelligence Research, Vol. 4, pp. 237-85.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.1
Littman, M.2
Moore, A.3
-
27
-
-
77953327625
-
Imitation and reinforcement learning, practical algorithms for motor primitives in robotics
-
Kober, J. and Peters, J. (2010), "Imitation and reinforcement learning, practical algorithms for motor primitives in robotics" in Robotics and Automation Magazine, IEEE, Vol. 17, No. 2, pp. 55-62.
-
(2010)
Robotics and Automation Magazine, IEEE
, vol.17
, Issue.2
, pp. 55-62
-
-
Kober, J.1
Peters, J.2
-
28
-
-
85162069513
-
Hierarchical apprenticeship learning with application to quadruped locomotion
-
MIT Press, Cambridge, MA
-
Kolter, J., Abbeel, P. and Ng, A. (2008), "Hierarchical apprenticeship learning with application to quadruped locomotion", Advances in Neural Information Processing Systems, MIT Press, Cambridge, MA.
-
(2008)
Advances in Neural Information Processing Systems
-
-
Kolter, J.1
Abbeel, P.2
Ng, A.3
-
29
-
-
0142192295
-
Conditional random fields: Probabilistic models for segmenting and labeling sequence data
-
Morgan Kaufmann, Burlington, MA
-
Lafferty, J., McCallum, A. and Pereira, F. (2001), "Conditional random fields: probabilistic models for segmenting and labeling sequence data" in Proceedings of the International Conference on Machine Learning (ICML 2001), Morgan Kaufmann, Burlington, MA, pp. 282-9.
-
(2001)
Proceedings of the International Conference on Machine Learning (ICML 2001)
, pp. 282-289
-
-
Lafferty, J.1
McCallum, A.2
Pereira, F.3
-
30
-
-
84865112972
-
-
ACM SIGGRAPH 2010 papers, New York, NY
-
Lee, S. and Popović, Z. (2010), "Learning behavior styles with inverse reinforcement learning" in ACM SIGGRAPH 2010 papers, ACM, New York, NY, pp. 1-7.
-
(2010)
Learning behavior styles with inverse reinforcement learning
, pp. 1-7
-
-
Lee, S.1
Popović, Z.2
-
31
-
-
70349966131
-
Active learning for reward estimation in inverse reinforcement learning
-
Lopes, M., Melo, F. and Montesano, L. (2009), "Active learning for reward estimation in inverse reinforcement learning" in Machine Learning and Knowledge Discovery in Databases, Vol. 5782, No. 1, pp. 31-46.
-
(2009)
Machine Learning and Knowledge Discovery in Databases
, vol.5782
, Issue.1
, pp. 31-46
-
-
Lopes, M.1
Melo, F.2
Montesano, L.3
-
33
-
-
78049399307
-
Learning from demonstration using MDP induced metrics
-
Springer, Berlin
-
Melo, F. and Lopes, M. (2010), "Learning from demonstration using MDP induced metrics" in Machine Learning and Knowledge Discovery in Databases, Springer, Berlin, pp. 385-401.
-
(2010)
Machine Learning and Knowledge Discovery in Databases
, pp. 385-401
-
-
Melo, F.1
Lopes, M.2
-
34
-
-
84865148144
-
A survey of POMDP solution techniques
-
Murphy, K. (2000), "A survey of POMDP solution techniques" in Environment, Vol. 2, p. X3.
-
(2000)
Environment
, vol.2
-
-
Murphy, K.1
-
37
-
-
0003212629
-
Efficient training of artificial neural networks for autonomous navigation
-
Pomerleau, D. (1991), "Efficient training of artificial neural networks for autonomous navigation" in Neural Computation, Vol. 3, No. 1, pp. 88-97.
-
(1991)
Neural Computation
, vol.3
, Issue.1
, pp. 88-97
-
-
Pomerleau, D.1
-
39
-
-
80053156567
-
Inverse reinforcement learning with Gaussian process
-
Qiao, Q. and Beling, P. (2011), "Inverse reinforcement learning with Gaussian process" in American Control Conference (ACC), pp. 113-18.
-
(2011)
American Control Conference (ACC)
, pp. 113-118
-
-
Qiao, Q.1
Beling, P.2
-
41
-
-
33749252753
-
Maximum margin planning
-
Ratliff, N., Bagnell, J. and Zinkevich, M. (2006), "Maximum margin planning" in Proceedings of the 23rd International Conference on Machine Learning, pp. 729-36.
-
(2006)
Proceedings of the 23rd International Conference on Machine Learning
, pp. 729-736
-
-
Ratliff, N.1
Bagnell, J.2
Zinkevich, M.3
-
42
-
-
67650957592
-
Learning to search: Functional gradient techniques for imitation learning
-
Ratliff, N., Silver, D. and Bagnell, J. (2009), "Learning to search: functional gradient techniques for imitation learning" in Autonomous Robots, No. 1, pp. 25-53.
-
(2009)
Autonomous Robots
, Issue.1
, pp. 25-53
-
-
Ratliff, N.1
Silver, D.2
Bagnell, J.3
-
44
-
-
80052420104
-
Preference elicitation and inverse reinforcement learning
-
Rothkopf, C. and Dimitrakakis, C. (2011), "Preference elicitation and inverse reinforcement learning" in Proceedings of 22nd European Conference on Machine Learning ECML, Part III, LNAI 6913, pp. 34-48.
-
(2011)
Proceedings of 22nd European Conference on Machine Learning ECML, Part III, LNAI 6913
, pp. 34-48
-
-
Rothkopf, C.1
Dimitrakakis, C.2
-
46
-
-
0033151712
-
Is imitation learning the route to humanoid robots?
-
Schaal, S. (1999), "Is imitation learning the route to humanoid robots?" in Trends in Cognitive Sciences, Vol. 3, No. 6, pp. 233-42.
-
(1999)
Trends in Cognitive Sciences
, vol.3
, Issue.6
, pp. 233-242
-
-
Schaal, S.1
-
47
-
-
78650179844
-
Modified reward function on abstract features in inverse reinforcement learning
-
Springer
-
Shen-yi, C., Hui, Q., Jia, F., Zhuo-jun, J. and Miao-liang, Z. (2010), "Modified reward function on abstract features in inverse reinforcement learning" in Journal of Zhejiang University - Science C, Springer, Vol. 11, No. 9, pp. 718-23.
-
(2010)
Journal of Zhejiang University - Science C
, vol.11
, Issue.9
, pp. 718-723
-
-
Shen-yi, C.1
Hui, Q.2
Jia, F.3
Zhuo-jun, J.4
Miao-liang, Z.5
-
48
-
-
33845622083
-
Inverse reinforcement learning with evaluation
-
Silva, V., Costa, A. and Lima, P. (2006), "Inverse reinforcement learning with evaluation" in IEEE International Conference on Robotics and Automation (ICRA06), Orlando, FL, USA, pp. 4246-51.
-
(2006)
IEEE International Conference on Robotics and Automation (ICRA06), Orlando, FL, USA
, pp. 4246-4251
-
-
Silva, V.1
Costa, A.2
Lima, P.3
-
49
-
-
77957947591
-
Learning from demonstration for autonomous navigation in complex unstructured terrain
-
Silver, D., Bagnell, J. and Stentz, A. (2010), "Learning from demonstration for autonomous navigation in complex unstructured terrain" in The International Journal of Robotics Research, Vol. 29, No. 12, p. 1565.
-
(2010)
The International Journal of Robotics Research
, vol.29
, Issue.12
, pp. 1565
-
-
Silver, D.1
Bagnell, J.2
Stentz, A.3
-
50
-
-
79957999943
-
Perceptual interpretation for autonomous navigation through dynamic imitation learning
-
Silver, D., Bagnell, J. and Stentz, A. (2011), "Perceptual interpretation for autonomous navigation through dynamic imitation learning" in International Symposium on Robotics Research, pp. 433-49.
-
(2011)
International Symposium on Robotics Research
, pp. 433-449
-
-
Silver, D.1
Bagnell, J.2
Stentz, A.3
-
51
-
-
0003871607
-
-
PhD thesis, Stanford University, Stanford, CA
-
Sondik, E. (1971), "The optimal control of partially observable Markov processes", PhD thesis, Stanford University, Stanford, CA.
-
(1971)
The optimal control of partially observable Markov processes
-
-
Sondik, E.1
-
52
-
-
0003420416
-
-
MIT Press, Cambridge, MA
-
Sutton, R. and Barto, A. (1998), Introduction to Reinforcement Learning, MIT Press, Cambridge, MA.
-
(1998)
Introduction to Reinforcement Learning
-
-
Sutton, R.1
Barto, A.2
-
53
-
-
85162012324
-
A game-theoretic approach to apprenticeship learning
-
MIT Press, Cambridge, MA
-
Syed, U. and Schapire, R. (2008), "A game-theoretic approach to apprenticeship learning" in Advances in Neural Information Processing Systems, MIT Press, Cambridge, MA, pp. 1449-56.
-
(2008)
Advances in Neural Information Processing Systems
, pp. 1449-1456
-
-
Syed, U.1
Schapire, R.2
-
54
-
-
77955839705
-
Parameterized maneuver learning for autonomous helicopter flight
-
Tang, J., Singh, A., Goehausen, N. and Abbeel, P. (2010), "Parameterized maneuver learning for autonomous helicopter flight" in Robotics and Automation (ICRA), 2010 IEEE International Conference on, pp. 1142-8.
-
(2010)
Robotics and Automation (ICRA), 2010 IEEE International Conference on
, pp. 1142-1148
-
-
Tang, J.1
Singh, A.2
Goehausen, N.3
Abbeel, P.4
-
55
-
-
77955836276
-
Reinforcement learning of motor skills in high dimensions: A path integral approach
-
Theodorou, E., Buchli, J. and Schaal, S. (2010), "Reinforcement learning of motor skills in high dimensions: a path integral approach" in Robotics and Automation (ICRA), 2010 IEEE International Conference on, pp. 2397-403.
-
(2010)
Robotics and Automation (ICRA), 2010 IEEE International Conference on
, pp. 2397-2403
-
-
Theodorou, E.1
Buchli, J.2
Schaal, S.3
-
56
-
-
84866840265
-
Enabling environment design via active indirect elicitation
-
Zhang, H. and Parkes, D. (2008), "Enabling environment design via active indirect elicitation", Proceedings Workshop on Preference Handling, Chicago, IL.
-
(2008)
Proceedings Workshop on Preference Handling, Chicago, IL
-
-
Zhang, H.1
Parkes, D.2
-
57
-
-
57749097473
-
Maximum entropy inverse reinforcement learning
-
Ziebart, B., Maas, A., Bagnell, J. and Dey, A. (2008), "Maximum entropy inverse reinforcement learning" in Proceedings 23rd AAAI Conference Artificial Intelligence, pp. 1433-8.
-
(2008)
Proceedings 23rd AAAI Conference Artificial Intelligence
, pp. 1433-1438
-
-
Ziebart, B.1
Maas, A.2
Bagnell, J.3
Dey, A.4