SCOPUS Information Search Platform

Proceedings - IEEE International Conference on Robotics and Automation

Volume 2015-June, Issue June, 2015, Pages 3239-3246

Optimism-driven exploration for nonlinear systems

(4) Moldovan, Teodor Mihai (a); Levine, Sergey (a); Jordan, Michael I. (a); Abbeel, Pieter (a)

(a) UNIVERSITY OF CALIFORNIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

AGRICULTURAL ROBOTS; NONLINEAR SYSTEMS; REINFORCEMENT LEARNING; ROBOTICS;

BENCH-MARK PROBLEMS; COMPUTATIONAL REQUIREMENTS; DIRICHLET PROCESS MIXTURE; EFFICIENT LEARNING; EXPLORATION STRATEGIES; SAMPLE COMPLEXITY; SYSTEM INTERACTIONS; TRAJECTORY OPTIMIZATION;

MODEL PREDICTIVE CONTROL;

EID: 84938265627 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICRA.2015.7139645 Document Type: Conference Paper

Times cited : (33)

References (30)

1
- 84884276459
- Reinforcement learning in robotics: A survey
- J. Kober, J. A. Bagnell, and J. Peters, "Reinforcement learning in robotics: A survey," International Journal of Robotics Research, vol. 32, no. 11, pp. 1238-1274, 2013.
- (2013) International Journal of Robotics Research , vol.32 , Issue.11 , pp. 1238-1274
- Kober, J.¹ Bagnell, J.A.² Peters, J.³

2
- 0004255876
- Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc.
- K. Astrom and B. Wittenmark, Adaptive control. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc., 1994.
- (1994) Adaptive Control
- Astrom, K.¹ Wittenmark, B.²

3
- 80053441894
- PILCO: A model-based and data-efficient approach to policy search
- L. Getoor and T. Scheffer, Eds. Bellevue, Washington, USA: Omnipress
- M. P. Deisenroth and C. E. Rasmussen, "PILCO: A model-based and data-efficient approach to policy search," in Proceedings of the 28th International Conference on Machine Learning, L. Getoor and T. Scheffer, Eds. Bellevue, Washington, USA: Omnipress, 2011, pp. 465-472.
- (2011) Proceedings of the 28th International Conference on Machine Learning , pp. 465-472
- Deisenroth, M.P.¹ Rasmussen, C.E.²

4
- 84877783832
- Regret bounds for the adaptive control of linear quadratic systems
- Y. Abbasi-Yadkori, C. Szepesvári, S. Kakade, and U. V. Luxburg, "Regret bounds for the adaptive control of linear quadratic systems," in Proceedings of the 24th Annual Conference on Learning Theory, 2011.
- (2011) Proceedings of the 24th Annual Conference on Learning Theory
- Abbasi-Yadkori, Y.¹ Szepesvári, C.² Kakade, S.³ Luxburg, U.V.⁴

5
- 84919821062
- Variational Bayesian optimization for runtime risk-sensitive control
- Sydney, Australia, July
- S. Kuindersma, R. Grupen, and A. Barto, "Variational Bayesian optimization for runtime risk-sensitive control," in Robotics: Science and Systems VIII (RSS), Sydney, Australia, July 2012.
- (2012) Robotics: Science and Systems VIII (RSS)
- Kuindersma, S.¹ Grupen, R.² Barto, A.³

6
- 84867138336
- Near-optimal BRL using optimistic local transitions
- J. Langford and J. Pineau, Eds. New York, NY, USA: Omnipress, Jul.
- M. Araya, O. Buffet, and V. Thomas, "Near-optimal BRL using optimistic local transitions," in Proceedings of the 29th International Conference on Machine Learning (ICML-12), ser. ICML '12, J. Langford and J. Pineau, Eds. New York, NY, USA: Omnipress, Jul. 2012, pp. 97-104.
- (2012) Proceedings of the 29th International Conference on Machine Learning (ICML-12), Ser. ICML '12 , pp. 97-104
- Araya, M.¹ Buffet, O.² Thomas, V.³

7
- 84977482296
- Optimistic linear programming gives logarithmic regret for irreducible MDPs
- A. Tewari and P. L. Bartlett, "Optimistic linear programming gives logarithmic regret for irreducible MDPs," in Proceedings of Neural Information Processing Systems Conference, 2007.
- (2007) Proceedings of Neural Information Processing Systems Conference
- Tewari, A.¹ Bartlett, P.L.²

8
- 84867186048
- Variational inference for Dirichlet process mixtures
- Mar
- D. M. Blei and M. I. Jordan, "Variational inference for Dirichlet process mixtures," Bayesian Analysis, vol. 1, no. 1, pp. 121-143, Mar. 2006.
- (2006) Bayesian Analysis , vol.1 , Issue.1 , pp. 121-143
- Blei, D.M.¹ Jordan, M.I.²

9
- 33845488897
- Direct trajectory optimization and costate estimation via an orthogonal collocation method
- Nov
- D. A. Benson, G. T. Huntington, T. P. Thorvaldsen, and A. V. Rao, "Direct trajectory optimization and costate estimation via an orthogonal collocation method," Journal of Guidance, Control, and Dynamics, vol. 29, no. 6, pp. 1435-1440, Nov. 2006.
- (2006) Journal of Guidance, Control, and Dynamics , vol.29 , Issue.6 , pp. 1435-1440
- Benson, D.A.¹ Huntington, G.T.² Thorvaldsen, T.P.³ Rao, A.V.⁴

10
- 84903590417
- A survey on policy search for robotics
- M. Deisenroth, G. Neumann, and J. Peters, "A survey on policy search for robotics," Foundations and Trends in Robotics, vol. 2, no. 1-2, pp. 1-142, 2013.
- (2013) Foundations and Trends in Robotics , vol.2 , Issue.1-2 , pp. 1-142
- Deisenroth, M.¹ Neumann, G.² Peters, J.³

11
- 84899013244
- Streaming variational Bayes
- T. Broderick, N. Boyd, A. Wibisono, A. C. Wilson, and M. I. Jordan, "Streaming variational Bayes," in Advances in Neural Information Processing Systems 26 (NIPS 2013), 2013.
- (2013) Advances in Neural Information Processing Systems 26 (NIPS 2013)
- Broderick, T.¹ Boyd, N.² Wibisono, A.³ Wilson, A.C.⁴ Jordan, M.I.⁵

12
- 62949181077
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Apr
- J.-Y. Audibert, R. Munos, and C. Szepesvári, "Exploration-exploitation tradeoff using variance estimates in multi-armed bandits," Theoretical Computer Science, vol. 410, no. 19, pp. 1876-1902, Apr. 2009.
- (2009) Theoretical Computer Science , vol.410 , Issue.19 , pp. 1876-1902
- Audibert, J.-Y.¹ Munos, R.² Szepesvári, C.³

13
- 77956217715
- Dirichlet process
- C. Sammut and G. I. Webb, Eds. Springer
- Y. W. Teh, "Dirichlet process," in Encyclopedia of Machine Learning, C. Sammut and G. I. Webb, Eds. Springer, 2010, pp. 280-287.
- (2010) Encyclopedia of Machine Learning , pp. 280-287
- Teh, Y.W.¹

14
- 0033135677
- Model predictive control: Past, present and future
- M. Morari and J. H. Lee, "Model predictive control: past, present and future," Computers & Chemical Engineering, vol. 23, no. 4, pp. 667-682, 1999.
- (1999) Computers & Chemical Engineering , vol.23 , Issue.4 , pp. 667-682
- Morari, M.¹ Lee, J.H.²

15
- 81855173186
- Practical methods for optimal control and estimation using nonlinear programming
- J. T. Betts, Practical methods for optimal control and estimation using nonlinear programming. Society for Industrial & Applied Mathematics, 2010, vol. 19.
- (2010) Society for Industrial & Applied Mathematics , vol.19
- Betts, J.T.¹

16
- 0024668467
- Model predictive control: Theory and practice - a survey
- C. E. Garcia, D. M. Prett, and M. Morari, "Model predictive control: theory and practice - a survey," Automatica, vol. 25, no. 3, pp. 335-348, 1989.
- (1989) Automatica , vol.25 , Issue.3 , pp. 335-348
- Garcia, C.E.¹ Prett, D.M.² Morari, M.³

17
- 0003517858
- Springer Berlin
- E. F. Camacho and C. Bordons, Model predictive control. Springer Berlin, 1999, vol. 303.
- (1999) Model Predictive Control , vol.303
- Camacho, E.F.¹ Bordons, C.²

18
- 84869387329
- Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter
- A. Aswani and P. Bouffard, "Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter," in Proc. American Control Conference (ACC), 2012.
- (2012) Proc. American Control Conference (ACC)
- Aswani, A.¹ Bouffard, P.²

19
- 0020190760
- Nonlinear optimization by successive linear programming
- F. Palacios-Gomez, L. Lasdon, and M. Engquist, "Nonlinear optimization by successive linear programming," Management Science, vol. 28, no. 10, pp. 1106-1120, 1982.
- (1982) Management Science , vol.28 , Issue.10 , pp. 1106-1120
- Palacios-Gomez, F.¹ Lasdon, L.² Engquist, M.³

20
- 77955827753
- BM: An iterative algorithm to learn stable non-linear dynamical systems with Gaussian mixture models
- S. M. Khansari-Zadeh and A. Billard, "BM: An iterative algorithm to learn stable non-linear dynamical systems with Gaussian mixture models," in International Conference on Robotics and Automation (ICRA), 2010.
- (2010) International Conference on Robotics and Automation (ICRA)
- Khansari-Zadeh, S.M.¹ Billard, A.²

21
- 84859432221
- A nonparametric Bayesian approach toward robot learning by demonstration
- Jun
- S. P. Chatzis, D. Korkinof, and Y. Demiris, "A nonparametric Bayesian approach toward robot learning by demonstration," Robotics and Autonomous Systems, vol. 60, no. 6, pp. 789-802, Jun. 2012.
- (2012) Robotics and Autonomous Systems , vol.60 , Issue.6 , pp. 789-802
- Chatzis, S.P.¹ Korkinof, D.² Demiris, Y.³

22
- 84855374561
- Vieweg+Teubner Verlag
- T. Strutz, Data Fitting and Uncertainty: A practical introduction to weighted least squares and beyond. Vieweg+Teubner Verlag, 2010.
- (2010) Data Fitting and Uncertainty: A Practical Introduction to Weighted Least Squares and beyond
- Strutz, T.¹

23
- 84907389672
- Gaussian processes for data-efficient learning in robotics and control
- M. P. Deisenroth, D. Fox, and C. E. Rasmussen, "Gaussian processes for data-efficient learning in robotics and control," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 99, 2013.
- (2013) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.99
- Deisenroth, M.P.¹ Fox, D.² Rasmussen, C.E.³

24
- 72749107057
- Neuroevolutionary reinforcement learning for generalized helicopter control
- New York, NY, USA: ACM
- R. Koppejan and S. Whiteson, "Neuroevolutionary reinforcement learning for generalized helicopter control," in Proceedings of the 11th Annual conference on Genetic and evolutionary computation, ser. GECCO '09. New York, NY, USA: ACM, 2009, pp. 145-152.
- (2009) Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, Ser. GECCO '09 , pp. 145-152
- Koppejan, R.¹ Whiteson, S.²

25
- 77955809093
- Autonomous helicopter aerobatics through apprenticeship learning
- Jun
- P. Abbeel, A. Coates, and A. Y. Ng, "Autonomous helicopter aerobatics through apprenticeship learning," The International Journal of Robotics Research, Jun. 2010.
- (2010) The International Journal of Robotics Research
- Abbeel, P.¹ Coates, A.² Ng, A.Y.³

26
- 84883027643
- Autonomous autorotation of an RC helicopter
- P. Abbeel, A. Coates, T. Hunter, and A. Ng, "Autonomous autorotation of an RC helicopter," Experimental Robotics, pp. 385-394, 2009.
- (2009) Experimental Robotics , pp. 385-394
- Abbeel, P.¹ Coates, A.² Hunter, T.³ Ng, A.⁴

27
- 79952175443
- Stable dynamic walking over uneven terrain
- I. Manchester, U. Mettin, F. Iida, and R. Tedrake, "Stable dynamic walking over uneven terrain," International Journal of Robotics Research, vol. 30, no. 3, pp. 265-279, 2011.
- (2011) International Journal of Robotics Research , vol.30 , Issue.3 , pp. 265-279
- Manchester, I.¹ Mettin, U.² Iida, F.³ Tedrake, R.⁴

28
- 84911472718
- An integrated system for real-time model-predictive control of humanoid robots
- T. Erez, K. Lowrey, Y. Tassa, V. Kumar, S. Kolev, and E. Todorov, "An integrated system for real-time model-predictive control of humanoid robots," in IEEE/RAS International Conference on Humanoid Robots (Humanoids), 2013.
- (2013) IEEE/RAS International Conference on Humanoid Robots (Humanoids)
- Erez, T.¹ Lowrey, K.² Tassa, Y.³ Kumar, V.⁴ Kolev, S.⁵ Todorov, E.⁶

29
- 84907088263
- Solving linear and quadratic programs with an analog circuit
- S. Vichik and F. Borrelli, "Solving linear and quadratic programs with an analog circuit," Computers and Chemical Engineering, 2014.
- (2014) Computers and Chemical Engineering
- Vichik, S.¹ Borrelli, F.²

30
- 0031103152
- Multivariable adaptive algorithms for reconfigurable flight control
- M. Bodson and J. E. Groszkiewicz, "Multivariable adaptive algorithms for reconfigurable flight control," IEEE Transactions on Control Systems Technology, vol. 5, no. 2, pp. 217-229, 1997.
- (1997) IEEE Transactions on Control Systems Technology , vol.5 , Issue.2 , pp. 217-229
- Bodson, M.¹ Groszkiewicz, J.E.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.