Volume 2015-June, Issue June, 2015, Pages 3239-3246

Optimism-driven exploration for nonlinear systems

Author keywords

[No Author keywords available]

Indexed keywords

AGRICULTURAL ROBOTS; NONLINEAR SYSTEMS; REINFORCEMENT LEARNING; ROBOTICS

EID: 84938265627     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICRA.2015.7139645     Document Type: Conference Paper
Times cited: 33

References (30)
  • 2
    • 0004255876
    • Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc.
    • K. Astrom and B. Wittenmark, Adaptive control. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc., 1994.
    • (1994) Adaptive Control
    • Astrom, K.1    Wittenmark, B.2
  • 3
    • 80053441894
    • PILCO: A model-based and data-efficient approach to policy search
    • L. Getoor and T. Scheffer, Eds. Bellevue, Washington, USA: Omnipress
    • M. P. Deisenroth and C. E. Rasmussen, "PILCO: A model-based and data-efficient approach to policy search," in Proceedings of the 28th International Conference on Machine Learning, L. Getoor and T. Scheffer, Eds. Bellevue, Washington, USA: Omnipress, 2011, pp. 465-472.
    • (2011) Proceedings of the 28th International Conference on Machine Learning , pp. 465-472
    • Deisenroth, M.P.1    Rasmussen, C.E.2
  • 5
    • 84919821062
    • Variational Bayesian optimization for runtime risk-sensitive control
    • Sydney, Australia, July
    • S. Kuindersma, R. Grupen, and A. Barto, "Variational Bayesian optimization for runtime risk-sensitive control," in Robotics: Science and Systems VIII (RSS), Sydney, Australia, July 2012.
    • (2012) Robotics: Science and Systems VIII (RSS)
    • Kuindersma, S.1    Grupen, R.2    Barto, A.3
  • 8
    • 84867186048
    • Variational inference for Dirichlet process mixtures
    • Mar
    • D. M. Blei and M. I. Jordan, "Variational inference for Dirichlet process mixtures," Bayesian Analysis, vol. 1, no. 1, pp. 121-143, Mar. 2006.
    • (2006) Bayesian Analysis , vol.1 , Issue.1 , pp. 121-143
    • Blei, D.M.1    Jordan, M.I.2
  • 9
    • 33845488897
    • Direct trajectory optimization and costate estimation via an orthogonal collocation method
    • Nov
    • D. A. Benson, G. T. Huntington, T. P. Thorvaldsen, and A. V. Rao, "Direct trajectory optimization and costate estimation via an orthogonal collocation method," Journal of Guidance, Control, and Dynamics, vol. 29, no. 6, pp. 1435-1440, Nov. 2006.
    • (2006) Journal of Guidance, Control, and Dynamics , vol.29 , Issue.6 , pp. 1435-1440
    • Benson, D.A.1    Huntington, G.T.2    Thorvaldsen, T.P.3    Rao, A.V.4
  • 12
    • 62949181077
    • Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
    • Apr
    • J.-Y. Audibert, R. Munos, and C. Szepesvári, "Exploration-exploitation tradeoff using variance estimates in multi-armed bandits," Theoretical Computer Science, vol. 410, no. 19, pp. 1876-1902, Apr. 2009.
    • (2009) Theoretical Computer Science , vol.410 , Issue.19 , pp. 1876-1902
    • Audibert, J.-Y.1    Munos, R.2    Szepesvári, C.3
  • 13
    • 77956217715
    • Dirichlet process
    • C. Sammut and G. I. Webb, Eds. Springer
    • Y. W. Teh, "Dirichlet process," in Encyclopedia of Machine Learning, C. Sammut and G. I. Webb, Eds. Springer, 2010, pp. 280-287.
    • (2010) Encyclopedia of Machine Learning , pp. 280-287
    • Teh, Y.W.1
  • 14
    • 0033135677
    • Model predictive control: Past, present and future
    • M. Morari and J. H. Lee, "Model predictive control: past, present and future," Computers & Chemical Engineering, vol. 23, no. 4, pp. 667-682, 1999.
    • (1999) Computers & Chemical Engineering , vol.23 , Issue.4 , pp. 667-682
    • Morari, M.1    Lee, J.H.2
  • 15
    • 81855173186
    • Practical methods for optimal control and estimation using nonlinear programming
    • J. T. Betts, Practical methods for optimal control and estimation using nonlinear programming. Society for Industrial & Applied Mathematics, 2010, vol. 19.
    • (2010) Practical Methods for Optimal Control and Estimation Using Nonlinear Programming , vol.19
    • Betts, J.T.1
  • 16
    • 0024668467
    • Model predictive control: Theory and practice - a survey
    • C. E. Garcia, D. M. Prett, and M. Morari, "Model predictive control: theory and practice - a survey," Automatica, vol. 25, no. 3, pp. 335-348, 1989.
    • (1989) Automatica , vol.25 , Issue.3 , pp. 335-348
    • Garcia, C.E.1    Prett, D.M.2    Morari, M.3
  • 18
    • 84869387329
    • Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter
    • A. Aswani and P. Bouffard, "Extensions of learning-based model predictive control for real-time application to a quadrotor helicopter," in Proc. American Control Conference (ACC), 2012.
    • (2012) Proc. American Control Conference (ACC)
    • Aswani, A.1    Bouffard, P.2
  • 19
    • 0020190760
    • Nonlinear optimization by successive linear programming
    • F. Palacios-Gomez, L. Lasdon, and M. Engquist, "Nonlinear optimization by successive linear programming," Management Science, vol. 28, no. 10, pp. 1106-1120, 1982.
    • (1982) Management Science , vol.28 , Issue.10 , pp. 1106-1120
    • Palacios-Gomez, F.1    Lasdon, L.2    Engquist, M.3
  • 21
    • 84859432221
    • A nonparametric Bayesian approach toward robot learning by demonstration
    • Jun
    • S. P. Chatzis, D. Korkinof, and Y. Demiris, "A nonparametric Bayesian approach toward robot learning by demonstration," Robotics and Autonomous Systems, vol. 60, no. 6, pp. 789-802, Jun. 2012.
    • (2012) Robotics and Autonomous Systems , vol.60 , Issue.6 , pp. 789-802
    • Chatzis, S.P.1    Korkinof, D.2    Demiris, Y.3
  • 30
    • 0031103152
    • Multivariable adaptive algorithms for reconfigurable flight control
    • M. Bodson and J. E. Groszkiewicz, "Multivariable adaptive algorithms for reconfigurable flight control," IEEE Transactions on Control Systems Technology, vol. 5, no. 2, pp. 217-229, 1997.
    • (1997) IEEE Transactions on Control Systems Technology , vol.5 , Issue.2 , pp. 217-229
    • Bodson, M.1    Groszkiewicz, J.E.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.