SCOPUS 정보 검색 플랫폼

IEEE Transactions on Automatic Control

Volumn 62, Issue 6, 2017, Pages 2807-2822

Online Learning of Feasible Strategies in Unknown Environments

(2) Paternain, Santiago a Ribeiro, Alejandro a

a UNIVERSITY OF PENNSYLVANIA (United States)

Author keywords

Offline; primal dual methods

Indexed keywords

CONTROLLERS; COST FUNCTIONS; E-LEARNING;

CONSTRAINT VIOLATION; CONVEX CONSTRAINTS; COST DIFFERENCES; NUMERICAL EXPERIMENTS; OFFLINE; ONLINE LEARNING; OPTIMAL ACTIONS; PRIMAL-DUAL METHODS;

COSTS;

EID: 85028821757 PISSN: 00189286 EISSN: None Source Type: Journal
DOI: 10.1109/TAC.2016.2627401 Document Type: Article

Times cited : (51)

References (40)

1
- 0039041998
- CA: Stanford University Press
- K. J. Arrow and L. Hurwicz, Studies in Linear and Nonlinear Programming. CA: Stanford University Press, 1958.
- (1958) Studies in Linear and Nonlinear Programming
- Arrow, K.J.¹ Hurwicz, L.²

2
- 22144491187
- Academic Press
- M. W. Hirsch, S. Smale, and R. L. Devaney, Differential Equations, Dynamical Systems, and An Introduction To Chaos, vol. 60. Academic Press, 2004.
- (2004) Differential Equations, Dynamical Systems, and An Introduction to Chaos , vol.60
- Hirsch, M.W.¹ Smale, S.² Devaney, R.L.³

3
- 0033906528
- Stability of extremum seeking feedback for general nonlinear dynamic systems
- M. Krstíc and H.-H. Wang, "Stability of extremum seeking feedback for general nonlinear dynamic systems, " Automatica, vol. 36, no. 4, pp. 595-601, 2000.
- (2000) Automatica , vol.36 , Issue.4 , pp. 595-601
- Krstíc, M.¹ Wang, H.-H.²

4
- 14244260540
- Wiley
- K. B. Ariyur andM. Krstic, Real-TimeOptimization By Extremum-Seeking Control. Wiley, 2003.
- (2003) Real-TimeOptimization by Extremum-Seeking Control
- Ariyur, K.B.¹ Krstic, M.²

5
- 33646151136
- On non-local stability properties of extremum seeking control
- Y. Tan, D. Nesíc, and I. Mareels, "On non-local stability properties of extremum seeking control, " Automatica, vol. 42, no. 6, pp. 889-903, 2006.
- (2006) Automatica , vol.42 , Issue.6 , pp. 889-903
- Tan, Y.¹ Nesíc, D.² Mareels, I.³

6
- 70349687250
- Subgradient methods for saddle-point problems
- A. Nedíc and A. Ozdaglar, "Subgradient methods for saddle-point problems, " J. Optim. Theory Appl., vol. 142, no. 1, pp. 205-228, 2009.
- (2009) J. Optim. Theory Appl. , vol.142 , Issue.1 , pp. 205-228
- Nedíc, A.¹ Ozdaglar, A.²

7
- 0002513916
- Iterative methods for concave programming
- H. Uzawa, "Iterative methods for concave programming, " Studies in Linear and Nonlinear Programming, vol. 6, 1958.
- (1958) Studies in Linear and Nonlinear Programming , vol.6
- Uzawa, H.¹

8
- 77949408829
- Gradient methods for finding saddle points
- D. Maistroskii, "Gradient methods for finding saddle points, " Matekon, vol. 14, no. 1, pp. 3-22, 1977.
- (1977) Matekon , vol.14 , Issue.1 , pp. 3-22
- Maistroskii, D.¹

9
- 78249268463
- Stability of primal-dual gradient dynamics and applications to network optimization
- D. Feijer and F. Paganini, "Stability of primal-dual gradient dynamics and applications to network optimization, " Automatica, vol. 46, no. 12, pp. 1974-1981, 2010.
- (2010) Automatica , vol.46 , Issue.12 , pp. 1974-1981
- Feijer, D.¹ Paganini, F.²

10
- 0004055894
- Cambridge University press
- S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge University press, 2004.
- (2004) Convex Optimization
- Boyd, S.¹ Vandenberghe, L.²

11
- 84864438962
- Stochastic source seeking in complex environments
- IEEE
- N. Atanasov, J. Le Ny, N. Michael, and G. J. Pappas, "Stochastic source seeking in complex environments, " in Proc. IEEE Int. Conf. Robotics Automation (ICRA), pp. 3013-3018, IEEE, 2012.
- (2012) Proc. IEEE Int. Conf. Robotics Automation (ICRA) , pp. 3013-3018
- Atanasov, N.¹ Le Ny, J.² Michael, N.³ Pappas, G.J.⁴

12
- 84865693343
- Stochastic source seeking by mobile robots
- S.-I. Azuma, M. S. Sakar, and G. J. Pappas, "Stochastic source seeking by mobile robots, " IEEE Trans. Automatic Control, vol. 57, no. 9, pp. 2308-2321, 2012.
- (2012) IEEE Trans. Automatic Control , vol.57 , Issue.9 , pp. 2308-2321
- Azuma, S.-I.¹ Sakar, M.S.² Pappas, G.J.³

13
- 77956180543
- Stochastic source seeking for nonholonomic unicycle
- S.-J. Liu and M. Krstic, "Stochastic source seeking for nonholonomic unicycle, " Automatica, vol. 46, no. 9, pp. 1443-1453, 2010.
- (2010) Automatica , vol.46 , Issue.9 , pp. 1443-1453
- Liu, S.-J.¹ Krstic, M.²

14
- 0000016172
- A stochastic approximation method
- H. Robbins and S. Monro, "A stochastic approximation method, " Annals Mathematical Statistics, pp. 400-407, 1951.
- (1951) Annals Mathematical Statistics , pp. 400-407
- Robbins, H.¹ Monro, S.²

15
- 84899025130
- arXiv Preprint arXiv: 1309. 2388
- M. Schmidt, N. L. Roux, and F. Bach, "Minimizing finite sums with the stochastic average gradient, " arXiv Preprint arXiv: 1309. 2388, 2013.
- (2013) Minimizing Finite Sums with the Stochastic Average Gradient
- Schmidt, M.¹ Roux, N.L.² Bach, F.³

16
- 84910045630
- arXiv Preprint arXiv: 1312. 1666
- J. Konecnỳ and P. Richtárik, "Semi-stochastic gradient descent methods, " arXiv Preprint arXiv: 1312. 1666, 2013.
- (2013) Semi-stochastic Gradient Descent Methods
- Konecnỳ, J.¹ Richtárik, P.²

17
- 84972545864
- An analog of the minimax theorem for vector payoffs
- D. Blackwell, "An analog of the minimax theorem for vector payoffs, " Pacific J. Mathematics, vol. 6, no. 1, pp. 1-8, 1956.
- (1956) Pacific J. Mathematics , vol.6 , Issue.1 , pp. 1-8
- Blackwell, D.¹

18
- 0003450542
- Springer
- V. Vapnik, The Nature of Statistical Learning Theory. Springer, 2000.
- (2000) The Nature of Statistical Learning Theory
- Vapnik, V.¹

19
- 84859418371
- Online learning and online convex optimization
- S. Shalev-Shwartz, "Online learning and online convex optimization, " Foundations and Trends in Machine Learning, vol. 4, no. 2, pp. 107-194, 2011.
- (2011) Foundations and Trends in Machine Learning , vol.4 , Issue.2 , pp. 107-194
- Shalev-Shwartz, S.¹

20
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- M. Zinkevich, "Online convex programming and generalized infinitesimal gradient ascent, " in ICML, pp. 928-936, 2003.
- (2003) ICML , pp. 928-936
- Zinkevich, M.¹

21
- 35348918820
- Logarithmic regret algorithms for online convex optimization
- E. Hazan, A. Agarwal, and S. Kale, "Logarithmic regret algorithms for online convex optimization, " Machine Learning, vol. 69, no. 2-3, pp. 169-192, 2007.
- (2007) Machine Learning , vol.69 , Issue.2-3 , pp. 169-192
- Hazan, E.¹ Agarwal, A.² Kale, S.³

22
- 0033355765
- Optimization flow control-i: Basic algorithm and convergence
- S. H. Low and D. E. Lapsley, "Optimization flow control-i: Basic algorithm and convergence, " IEEE/ACM Trans. Networking (TON), vol. 7, no. 6, pp. 861-874, 1999.
- (1999) IEEE/ACM Trans. Networking (TON) , vol.7 , Issue.6 , pp. 861-874
- Low, S.H.¹ Lapsley, D.E.²

23
- 64149123172
- Layering as optimization decomposition: A mathematical theory of network architectures
- M. Chiang, S. H. Low, A. R. Calderbank, and J. C. Doyle, "Layering as optimization decomposition: A mathematical theory of network architectures, " Proc. IEEE, vol. 95, no. 1, pp. 255-312, 2007.
- (2007) Proc. IEEE , vol.95 , Issue.1 , pp. 255-312
- Chiang, M.¹ Low, S.H.² Calderbank, A.R.³ Doyle, J.C.⁴

24
- 79953201848
- A first-order primal-dual algorithm for convex problems with applications to imaging
- A. Chambolle and T. Pock, "A first-order primal-dual algorithm for convex problems with applications to imaging, " J. Math. Imaging Vision, vol. 40, no. 1, pp. 120-145, 2011.
- (2011) J. Math. Imaging Vision , vol.40 , Issue.1 , pp. 120-145
- Chambolle, A.¹ Pock, T.²

25
- 84869152925
- Trading regret for efficiency: Online convex optimization with long term constraints
- Sep
- M. Mahdavi, R. Jin, and T. Yang, "Trading regret for efficiency: Online convex optimization with long term constraints, " J. Machine Learning Research, vol. 13, no. Sep, pp. 2503-2528, 2012.
- (2012) J. Machine Learning Research , vol.13 , pp. 2503-2528
- Mahdavi, M.¹ Jin, R.² Yang, T.³

26
- 84875375293
- No-regret dynamics and fictitious play
- Y. Viossat and A. Zapechelnyuk, "No-regret dynamics and fictitious play, " J. Economic Theory, vol. 148, no. 2, pp. 825-842, 2013.
- (2013) J. Economic Theory , vol.148 , Issue.2 , pp. 825-842
- Viossat, Y.¹ Zapechelnyuk, A.²

27
- 45749117602
- Exponential weight algorithm in continuous time
- S. Sorin, "Exponential weight algorithm in continuous time, " Mathematical Programming, vol. 116, no. 1-2, pp. 513-528, 2009.
- (2009) Mathematical Programming , vol.116 , Issue.1-2 , pp. 513-528
- Sorin, S.¹

28
- 84919599974
- arXiv Preprint arXiv: 1401. 6956
- J. Kwon and P. Mertikopoulos, "A continuous-time approach to online optimization, " arXiv Preprint arXiv: 1401. 6956, 2014.
- (2014) A Continuous-time Approach to Online Optimization
- Kwon, J.¹ Mertikopoulos, P.²

29
- 21644486833
- Gradient methods for nonstationary unconstrained optimization problems
- A. Y. Popkov, "Gradient methods for nonstationary unconstrained optimization problems, " Automat. Remote Control, vol. 66, no. 6, pp. 883-891, 2005.
- (2005) Automat. Remote Control , vol.66 , Issue.6 , pp. 883-891
- Popkov, A.Y.¹

30
- 84982862627
- arXiv Preprint arXiv: 1510. 01396
- M. Fazlyab, S. Paternain, V. M. Preciado, and A. Ribeiro, "Interior point method for dynamic constrained optimization in continuous time, " arXiv Preprint arXiv: 1510. 01396, 2015.
- (2015) Interior Point Method for Dynamic Constrained Optimization in Continuous Time
- Fazlyab, M.¹ Paternain, S.² Preciado, V.M.³ Ribeiro, A.⁴

31
- 79251514978
- Real-time nonlinear optimization as a generalized equation
- V. M. Zavala and M. Anitescu, "Real-time nonlinear optimization as a generalized equation, " SIAM J. Control Optim., vol. 48, no. 8, pp. 5444-5467, 2010.
- (2010) SIAM J. Control Optim. , vol.48 , Issue.8 , pp. 5444-5467
- Zavala, V.M.¹ Anitescu, M.²

32
- 84965039936
- Stochastic approximation
- V. S. Borkar, "Stochastic approximation, " Cambridge Books, 2008.
- (2008) Cambridge Books
- Borkar, V.S.¹

33
- 77955660815
- Regret bounds for sleeping experts and bandits
- R. Kleinberg, A. Niculescu-Mizil, and Y. Sharma, "Regret bounds for sleeping experts and bandits, " Machine Learning, vol. 80, no. 2-3, pp. 245-272, 2010.
- (2010) Machine Learning , vol.80 , Issue.2-3 , pp. 245-272
- Kleinberg, R.¹ Niculescu-Mizil, A.² Sharma, Y.³

34
- 85028815320
- arXiv Preprint arXiv: 1509. 03600
- S. Kale, C. Lee, and D. Pál, "Hardness of online sleeping combinatorial optimization problems, " arXiv Preprint arXiv: 1509. 03600, 2015.
- (2015) Hardness of Online Sleeping Combinatorial Optimization Problems
- Kale, S.¹ Lee, C.² Pál, D.³

35
- 84862274651
- Sleeping experts and bandits with stochastic action availability and adversarial rewards
- V. Kanade, H. B. McMahan, and B. Bryan, "Sleeping experts and bandits with stochastic action availability and adversarial rewards, " in Proc. Int. Conf. Artificial Intell. Statistics, pp. 272-279, 2009.
- (2009) Proc. Int. Conf. Artificial Intell. Statistics , pp. 272-279
- Kanade, V.¹ McMahan, H.B.² Bryan, B.³

36
- 84937931658
- Online combinatorial optimization with stochastic decision sets and adversarial losses
- G. Neu and M. Valko, "Online combinatorial optimization with stochastic decision sets and adversarial losses, " in Adv. Neural Information Processing Syst., pp. 2780-2788, 2014.
- (2014) Adv. Neural Information Processing Syst. , pp. 2780-2788
- Neu, G.¹ Valko, M.²

37
- 21844519677
- On the stability of projected dynamical systems
- Apr.
- D. Zhang and A. Nagurney, "On the stability of projected dynamical systems, " J. Optim. Theory Appl., vol. 85, pp. 97-124, Apr. 1995.
- (1995) J. Optim. Theory Appl. , vol.85 , pp. 97-124
- Zhang, D.¹ Nagurney, A.²

38
- 0013327463
- A general class of adaptive strategies
- S. Hart and A. Mas-Colell, "A general class of adaptive strategies, " J. Economic Theory, vol. 98, no. 1, pp. 26-54, 2001.
- (2001) J. Economic Theory , vol.98 , Issue.1 , pp. 26-54
- Hart, S.¹ Mas-Colell, A.²

39
- 0001944917
- The evolution of conventions
- H. P. Young, "The evolution of conventions, " Econometrica: J. Econometric Soc., pp. 57-84, 1993.
- (1993) Econometrica: J. Econometric Soc. , pp. 57-84
- Young, H.P.¹

40
- 84871693138
- Minimum snap trajectory generation and control for quadrotors
- May
- D. Mellinger and V. Kumar, "Minimum snap trajectory generation and control for quadrotors, " in Proc. IEEE Int. Conf. Robotics Autom. (ICRA), May 2011.
- (2011) Proc. IEEE Int. Conf. Robotics Autom. (ICRA)
- Mellinger, D.¹ Kumar, V.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.