메뉴 건너뛰기




Volumn 62, Issue 6, 2017, Pages 2807-2822

Online Learning of Feasible Strategies in Unknown Environments

Author keywords

Offline; primal dual methods

Indexed keywords

CONTROLLERS; COST FUNCTIONS; E-LEARNING;

EID: 85028821757     PISSN: 00189286     EISSN: None     Source Type: Journal    
DOI: 10.1109/TAC.2016.2627401     Document Type: Article
Times cited : (51)

References (40)
  • 3
    • 0033906528 scopus 로고    scopus 로고
    • Stability of extremum seeking feedback for general nonlinear dynamic systems
    • M. Krstíc and H.-H. Wang, "Stability of extremum seeking feedback for general nonlinear dynamic systems, " Automatica, vol. 36, no. 4, pp. 595-601, 2000.
    • (2000) Automatica , vol.36 , Issue.4 , pp. 595-601
    • Krstíc, M.1    Wang, H.-H.2
  • 5
    • 33646151136 scopus 로고    scopus 로고
    • On non-local stability properties of extremum seeking control
    • Y. Tan, D. Nesíc, and I. Mareels, "On non-local stability properties of extremum seeking control, " Automatica, vol. 42, no. 6, pp. 889-903, 2006.
    • (2006) Automatica , vol.42 , Issue.6 , pp. 889-903
    • Tan, Y.1    Nesíc, D.2    Mareels, I.3
  • 6
    • 70349687250 scopus 로고    scopus 로고
    • Subgradient methods for saddle-point problems
    • A. Nedíc and A. Ozdaglar, "Subgradient methods for saddle-point problems, " J. Optim. Theory Appl., vol. 142, no. 1, pp. 205-228, 2009.
    • (2009) J. Optim. Theory Appl. , vol.142 , Issue.1 , pp. 205-228
    • Nedíc, A.1    Ozdaglar, A.2
  • 8
    • 77949408829 scopus 로고
    • Gradient methods for finding saddle points
    • D. Maistroskii, "Gradient methods for finding saddle points, " Matekon, vol. 14, no. 1, pp. 3-22, 1977.
    • (1977) Matekon , vol.14 , Issue.1 , pp. 3-22
    • Maistroskii, D.1
  • 9
    • 78249268463 scopus 로고    scopus 로고
    • Stability of primal-dual gradient dynamics and applications to network optimization
    • D. Feijer and F. Paganini, "Stability of primal-dual gradient dynamics and applications to network optimization, " Automatica, vol. 46, no. 12, pp. 1974-1981, 2010.
    • (2010) Automatica , vol.46 , Issue.12 , pp. 1974-1981
    • Feijer, D.1    Paganini, F.2
  • 12
  • 13
    • 77956180543 scopus 로고    scopus 로고
    • Stochastic source seeking for nonholonomic unicycle
    • S.-J. Liu and M. Krstic, "Stochastic source seeking for nonholonomic unicycle, " Automatica, vol. 46, no. 9, pp. 1443-1453, 2010.
    • (2010) Automatica , vol.46 , Issue.9 , pp. 1443-1453
    • Liu, S.-J.1    Krstic, M.2
  • 17
    • 84972545864 scopus 로고
    • An analog of the minimax theorem for vector payoffs
    • D. Blackwell, "An analog of the minimax theorem for vector payoffs, " Pacific J. Mathematics, vol. 6, no. 1, pp. 1-8, 1956.
    • (1956) Pacific J. Mathematics , vol.6 , Issue.1 , pp. 1-8
    • Blackwell, D.1
  • 19
    • 84859418371 scopus 로고    scopus 로고
    • Online learning and online convex optimization
    • S. Shalev-Shwartz, "Online learning and online convex optimization, " Foundations and Trends in Machine Learning, vol. 4, no. 2, pp. 107-194, 2011.
    • (2011) Foundations and Trends in Machine Learning , vol.4 , Issue.2 , pp. 107-194
    • Shalev-Shwartz, S.1
  • 20
    • 1942484421 scopus 로고    scopus 로고
    • Online convex programming and generalized infinitesimal gradient ascent
    • M. Zinkevich, "Online convex programming and generalized infinitesimal gradient ascent, " in ICML, pp. 928-936, 2003.
    • (2003) ICML , pp. 928-936
    • Zinkevich, M.1
  • 21
    • 35348918820 scopus 로고    scopus 로고
    • Logarithmic regret algorithms for online convex optimization
    • E. Hazan, A. Agarwal, and S. Kale, "Logarithmic regret algorithms for online convex optimization, " Machine Learning, vol. 69, no. 2-3, pp. 169-192, 2007.
    • (2007) Machine Learning , vol.69 , Issue.2-3 , pp. 169-192
    • Hazan, E.1    Agarwal, A.2    Kale, S.3
  • 22
    • 0033355765 scopus 로고    scopus 로고
    • Optimization flow control-i: Basic algorithm and convergence
    • S. H. Low and D. E. Lapsley, "Optimization flow control-i: Basic algorithm and convergence, " IEEE/ACM Trans. Networking (TON), vol. 7, no. 6, pp. 861-874, 1999.
    • (1999) IEEE/ACM Trans. Networking (TON) , vol.7 , Issue.6 , pp. 861-874
    • Low, S.H.1    Lapsley, D.E.2
  • 23
    • 64149123172 scopus 로고    scopus 로고
    • Layering as optimization decomposition: A mathematical theory of network architectures
    • M. Chiang, S. H. Low, A. R. Calderbank, and J. C. Doyle, "Layering as optimization decomposition: A mathematical theory of network architectures, " Proc. IEEE, vol. 95, no. 1, pp. 255-312, 2007.
    • (2007) Proc. IEEE , vol.95 , Issue.1 , pp. 255-312
    • Chiang, M.1    Low, S.H.2    Calderbank, A.R.3    Doyle, J.C.4
  • 24
    • 79953201848 scopus 로고    scopus 로고
    • A first-order primal-dual algorithm for convex problems with applications to imaging
    • A. Chambolle and T. Pock, "A first-order primal-dual algorithm for convex problems with applications to imaging, " J. Math. Imaging Vision, vol. 40, no. 1, pp. 120-145, 2011.
    • (2011) J. Math. Imaging Vision , vol.40 , Issue.1 , pp. 120-145
    • Chambolle, A.1    Pock, T.2
  • 25
    • 84869152925 scopus 로고    scopus 로고
    • Trading regret for efficiency: Online convex optimization with long term constraints
    • Sep
    • M. Mahdavi, R. Jin, and T. Yang, "Trading regret for efficiency: Online convex optimization with long term constraints, " J. Machine Learning Research, vol. 13, no. Sep, pp. 2503-2528, 2012.
    • (2012) J. Machine Learning Research , vol.13 , pp. 2503-2528
    • Mahdavi, M.1    Jin, R.2    Yang, T.3
  • 26
    • 84875375293 scopus 로고    scopus 로고
    • No-regret dynamics and fictitious play
    • Y. Viossat and A. Zapechelnyuk, "No-regret dynamics and fictitious play, " J. Economic Theory, vol. 148, no. 2, pp. 825-842, 2013.
    • (2013) J. Economic Theory , vol.148 , Issue.2 , pp. 825-842
    • Viossat, Y.1    Zapechelnyuk, A.2
  • 27
    • 45749117602 scopus 로고    scopus 로고
    • Exponential weight algorithm in continuous time
    • S. Sorin, "Exponential weight algorithm in continuous time, " Mathematical Programming, vol. 116, no. 1-2, pp. 513-528, 2009.
    • (2009) Mathematical Programming , vol.116 , Issue.1-2 , pp. 513-528
    • Sorin, S.1
  • 29
    • 21644486833 scopus 로고    scopus 로고
    • Gradient methods for nonstationary unconstrained optimization problems
    • A. Y. Popkov, "Gradient methods for nonstationary unconstrained optimization problems, " Automat. Remote Control, vol. 66, no. 6, pp. 883-891, 2005.
    • (2005) Automat. Remote Control , vol.66 , Issue.6 , pp. 883-891
    • Popkov, A.Y.1
  • 31
    • 79251514978 scopus 로고    scopus 로고
    • Real-time nonlinear optimization as a generalized equation
    • V. M. Zavala and M. Anitescu, "Real-time nonlinear optimization as a generalized equation, " SIAM J. Control Optim., vol. 48, no. 8, pp. 5444-5467, 2010.
    • (2010) SIAM J. Control Optim. , vol.48 , Issue.8 , pp. 5444-5467
    • Zavala, V.M.1    Anitescu, M.2
  • 32
  • 33
    • 77955660815 scopus 로고    scopus 로고
    • Regret bounds for sleeping experts and bandits
    • R. Kleinberg, A. Niculescu-Mizil, and Y. Sharma, "Regret bounds for sleeping experts and bandits, " Machine Learning, vol. 80, no. 2-3, pp. 245-272, 2010.
    • (2010) Machine Learning , vol.80 , Issue.2-3 , pp. 245-272
    • Kleinberg, R.1    Niculescu-Mizil, A.2    Sharma, Y.3
  • 35
    • 84862274651 scopus 로고    scopus 로고
    • Sleeping experts and bandits with stochastic action availability and adversarial rewards
    • V. Kanade, H. B. McMahan, and B. Bryan, "Sleeping experts and bandits with stochastic action availability and adversarial rewards, " in Proc. Int. Conf. Artificial Intell. Statistics, pp. 272-279, 2009.
    • (2009) Proc. Int. Conf. Artificial Intell. Statistics , pp. 272-279
    • Kanade, V.1    McMahan, H.B.2    Bryan, B.3
  • 36
    • 84937931658 scopus 로고    scopus 로고
    • Online combinatorial optimization with stochastic decision sets and adversarial losses
    • G. Neu and M. Valko, "Online combinatorial optimization with stochastic decision sets and adversarial losses, " in Adv. Neural Information Processing Syst., pp. 2780-2788, 2014.
    • (2014) Adv. Neural Information Processing Syst. , pp. 2780-2788
    • Neu, G.1    Valko, M.2
  • 37
    • 21844519677 scopus 로고
    • On the stability of projected dynamical systems
    • Apr.
    • D. Zhang and A. Nagurney, "On the stability of projected dynamical systems, " J. Optim. Theory Appl., vol. 85, pp. 97-124, Apr. 1995.
    • (1995) J. Optim. Theory Appl. , vol.85 , pp. 97-124
    • Zhang, D.1    Nagurney, A.2
  • 38
    • 0013327463 scopus 로고    scopus 로고
    • A general class of adaptive strategies
    • S. Hart and A. Mas-Colell, "A general class of adaptive strategies, " J. Economic Theory, vol. 98, no. 1, pp. 26-54, 2001.
    • (2001) J. Economic Theory , vol.98 , Issue.1 , pp. 26-54
    • Hart, S.1    Mas-Colell, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.