-
1
-
-
84884276459
-
Reinforcement learning in robotics: A survey
-
J. Kober, J. A. Bagnell, and J. Peters, "Reinforcement learning in robotics: A survey," International Journal of Robotic Research, vol. 32, no. 11, pp. 1238-1274, 2013.
-
(2013)
International Journal of Robotic Research
, vol.32
, Issue.11
, pp. 1238-1274
-
-
Kober, J.1
Bagnell, J.A.2
Peters, J.3
-
2
-
-
0004255876
-
-
Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc.
-
K. Astrom and B. Wittenmark, Adaptive control. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc., 1994.
-
(1994)
Adaptive Control
-
-
Astrom, K.1
Wittenmark, B.2
-
3
-
-
80053441894
-
PILCO: A model-based and data-efficient approach to policy search
-
L. Getoor and T. Scheffer, Eds. Bellevue, Washington, USA: Omnipress
-
M. P. Deisenroth and C. E. Rasmussen, "PILCO: A model-based and data-efficient approach to policy search," in Proceedings of the 28th International Conference on Machine Learning, L. Getoor and T. Scheffer, Eds. Bellevue, Washington, USA: Omnipress, 2011, pp. 465-472.
-
(2011)
Proceedings of the 28th International Conference on Machine Learning
, pp. 465-472
-
-
Deisenroth, M.P.1
Rasmussen, C.E.2
-
4
-
-
84877783832
-
Regret bounds for the adaptive control of linear quadratic systems
-
Y. Abbasi-Yadkori, C. Szepesvári, S. Kakade, and U. V. Luxburg, "Regret bounds for the adaptive control of linear quadratic systems," in Proceedings of the 24th Annual Conference on Learning Theory, 2011.
-
(2011)
Proceedings of the 24th Annual Conference on Learning Theory
-
-
Abbasi-Yadkori, Y.1
Szepesvári, C.2
Kakade, S.3
Luxburg, U.V.4
-
5
-
-
84919821062
-
Variational Bayesian optimization for runtime risk-sensitive control
-
Sydney, Australia, July
-
S. Kuindersma, R. Grupen, and A. Barto, "Variational Bayesian optimization for runtime risk-sensitive control," in Robotics: Science and Systems VIII (RSS), Sydney, Australia, July 2012.
-
(2012)
Robotics: Science and Systems VIII (RSS)
-
-
Kuindersma, S.1
Grupen, R.2
Barto, A.3
-
6
-
-
84867138336
-
Near-optimal BRL using optimistic local transitions
-
J. Langford and J. Pineau, Eds. New York, NY, USA: Omnipress, Jul.
-
M. Araya, O. Buffet, and V. Thomas, "Near-optimal BRL using optimistic local transitions," in Proceedings of the 29th International Conference on Machine Learning (ICML-12), ser. ICML '12, J. Langford and J. Pineau, Eds. New York, NY, USA: Omnipress, Jul. 2012, pp. 97-104.
-
(2012)
Proceedings of the 29th International Conference on Machine Learning (ICML-12), Ser. ICML '12
, pp. 97-104
-
-
Araya, M.1
Buffet, O.2
Thomas, V.3
-
8
-
-
84867186048
-
Variational inference for Dirichlet process mixtures
-
Mar
-
D. M. Blei and M. I. Jordan, "Variational inference for Dirichlet process mixtures," Bayesian Analysis, vol. 1, no. 1, pp. 121-143, Mar. 2006.
-
(2006)
Bayesian Analysis
, vol.1
, Issue.1
, pp. 121-143
-
-
Blei, D.M.1
Jordan, M.I.2
-
9
-
-
33845488897
-
Direct trajectory optimization and costate estimation via an orthogonal collocation method
-
Nov
-
D. A. Benson, G. T. Huntington, T. P. Thorvaldsen, and A. V. Rao, "Direct trajectory optimization and costate estimation via an orthogonal collocation method," Journal of Guidance, Control, and Dynamics, vol. 29, no. 6, pp. 1435-1440, Nov. 2006.
-
(2006)
Journal of Guidance, Control, and Dynamics
, vol.29
, Issue.6
, pp. 1435-1440
-
-
Benson, D.A.1
Huntington, G.T.2
Thorvaldsen, T.P.3
Rao, A.V.4
-
10
-
-
84903590417
-
A survey on policy search for robotics
-
M. Deisenroth, G. Neumann, and J. Peters, "A survey on policy search for robotics," Foundations and trends in robotics, vol. 2, no. 1-2, pp. 1-142, 2013.
-
(2013)
Foundations and Trends in Robotics
, vol.2
, Issue.1-2
, pp. 1-142
-
-
Deisenroth, M.1
Neumann, G.2
Peters, J.3
-
11
-
-
84899013244
-
Streaming variational Bayes
-
T. Broderick, N. Boyd, A. Wibisono, A. C. Wilson, and M. I. Jordan, "Streaming variational Bayes," in Advances in Neural Information Processing Systems 26 (NIPS 2013), 2013.
-
(2013)
Advances in Neural Information Processing Systems 26 (NIPS 2013)
-
-
Broderick, T.1
Boyd, N.2
Wibisono, A.3
Wilson, A.C.4
Jordan, M.I.5
-
12
-
-
62949181077
-
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
-
Apr
-
J.-Y. Audibert, R. Munos, and C. Szepesvári, "Exploration-exploitation tradeoff using variance estimates in multi-armed bandits," Theoretical Computer Science, vol. 410, no. 19, pp. 1876-1902, Apr. 2009.
-
(2009)
Theoretical Computer Science
, vol.410
, Issue.19
, pp. 1876-1902
-
-
Audibert, J.-Y.1
Munos, R.2
Szepesvári, C.3
-
13
-
-
77956217715
-
Dirichlet process
-
C. Sammut and G. I. Webb, Eds. Springer
-
Y. W. Teh, "Dirichlet process," in Encyclopedia of Machine Learning, C. Sammut and G. I. Webb, Eds. Springer, 2010, pp. 280-287.
-
(2010)
Encyclopedia of Machine Learning
, pp. 280-287
-
-
Teh, Y.W.1
-
14
-
-
0033135677
-
Model predictive control: Past, present and future
-
M. Morari and J. H Lee, "Model predictive control: past, present and future," Computers & Chemical Engineering, vol. 23, no. 4, pp. 667-682, 1999.
-
(1999)
Computers & Chemical Engineering
, vol.23
, Issue.4
, pp. 667-682
-
-
Morari, M.1
Lee, J.H.2
-
15
-
-
81855173186
-
Practical methods for optimal control and estimation using nonlinear programming
-
J. T. Betts, Practical methods for optimal control and estimation using nonlinear programming. Society for Industrial & Applied Mathematics, 2010, vol. 19.
-
(2010)
Society for Industrial & Applied Mathematics
, vol.19
-
-
Betts, J.T.1
-
16
-
-
0024668467
-
Model predictive control: Theory and practicea survey
-
C. E. Garcia, D. M. Prett, and M. Morari, "Model predictive control: theory and practicea survey," Automatica, vol. 25, no. 3, pp. 335-348, 1989.
-
(1989)
Automatica
, vol.25
, Issue.3
, pp. 335-348
-
-
Garcia, C.E.1
Prett, D.M.2
Morari, M.3
-
18
-
-
84869387329
-
Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter
-
A. Aswani and P. Bouffard, "Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter," in Proc. American Control Conference (ACC), 2012.
-
(2012)
Proc. American Control Conference (ACC)
-
-
Aswani, A.1
Bouffard, P.2
-
19
-
-
0020190760
-
Nonlinear optimization by successive linear programming
-
F. Palacios-Gomez, L. Lasdon, and M. Engquist, "Nonlinear optimization by successive linear programming," Management Science, vol. 28, no. 10, pp. 1106-1120, 1982.
-
(1982)
Management Science
, vol.28
, Issue.10
, pp. 1106-1120
-
-
Palacios-Gomez, F.1
Lasdon, L.2
Engquist, M.3
-
21
-
-
84859432221
-
A nonparametric Bayesian approach toward robot learning by demonstration
-
Jun
-
S. P. Chatzis, D. Korkinof, and Y. Demiris, "A nonparametric Bayesian approach toward robot learning by demonstration," Robotics and Autonomous Systems, vol. 60, no. 6, pp. 789-802, Jun. 2012.
-
(2012)
Robotics and Autonomous Systems
, vol.60
, Issue.6
, pp. 789-802
-
-
Chatzis, S.P.1
Korkinof, D.2
Demiris, Y.3
-
23
-
-
84907389672
-
Gaussian processes for data-efficient learning in robotics and control
-
M. P. Deisenroth, D. Fox, and C. E. Rasmussen, "Gaussian processes for data-efficient learning in robotics and control," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 99, 2013.
-
(2013)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.99
-
-
Deisenroth, M.P.1
Fox, D.2
Rasmussen, C.E.3
-
24
-
-
72749107057
-
Neuroevolutionary reinforcement learning for generalized helicopter control
-
New York, NY, USA: ACM
-
R. Koppejan and S. Whiteson, "Neuroevolutionary reinforcement learning for generalized helicopter control," in Proceedings of the 11th Annual conference on Genetic and evolutionary computation, ser. GECCO '09. New York, NY, USA: ACM, 2009, pp. 145-152.
-
(2009)
Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, Ser. GECCO '09
, pp. 145-152
-
-
Koppejan, R.1
Whiteson, S.2
-
26
-
-
84883027643
-
Autonomous autorotation of an RC helicopter
-
P. Abbeel, A. Coates, T. Hunter, and A. Ng, "Autonomous autorotation of an RC helicopter," Experimental Robotics, pp. 385-394, 2009.
-
(2009)
Experimental Robotics
, pp. 385-394
-
-
Abbeel, P.1
Coates, A.2
Hunter, T.3
Ng, A.4
-
27
-
-
79952175443
-
Stable dynamic walking over uneven terrain
-
I. Manchester, U. Mettin, F. Iida, and R. Tedrake, "Stable dynamic walking over uneven terrain," International Journal of Robotics Research, vol. 30, no. 3, pp. 265-279, 2011.
-
(2011)
International Journal of Robotics Research
, vol.30
, Issue.3
, pp. 265-279
-
-
Manchester, I.1
Mettin, U.2
Iida, F.3
Tedrake, R.4
-
28
-
-
84911472718
-
An integrated system for real-time model-predictive control of humanoid robots
-
T. Erez, K. Lowrey, Y. Tassa, V. Kumar, S. Kolev, and E. Todorov, "An integrated system for real-time model-predictive control of humanoid robots," in IEEE/RAS International Conference on Humanoid Robots (Humanoids), 2013.
-
(2013)
IEEE/RAS International Conference on Humanoid Robots (Humanoids)
-
-
Erez, T.1
Lowrey, K.2
Tassa, Y.3
Kumar, V.4
Kolev, S.5
Todorov, E.6
-
30
-
-
0031103152
-
Multivariable adaptive algorithms for reconfigurable flight control
-
M. Bodson and J. E. Groszkiewicz, "Multivariable adaptive algorithms for reconfigurable flight control," IEEE Transactions on Control Systems Technology, vol. 5, no. 2, pp. 217-229, 1997.
-
(1997)
IEEE Transactions on Control Systems Technology
, vol.5
, Issue.2
, pp. 217-229
-
-
Bodson, M.1
Groszkiewicz, J.E.2
|