SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 9851 LNAI, 2016, Pages 665-680

On the convergence of a family of robust losses for stochastic gradient descent

(3) Han, Bo (a); Tsang, Ivor W. (a); Chen, Ling (a)

(a) University of Technology Sydney (Australia)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; STOCHASTIC SYSTEMS; TIME VARYING NETWORKS;

BASELINE METHODS; CONVERGENCE RATES; FAST CONVERGENCE; LOSS FUNCTIONS; NOISY LABELS; REAL-WORLD DATASETS; ROBUSTNESS ANALYSIS; STOCHASTIC GRADIENT DESCENT;

LEARNING SYSTEMS;

EID: 84988566042 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-319-46128-1_42 Document Type: Conference Paper

Times cited: 23

References (19)

1
- 85162498265
- Better mini-batch algorithms via accelerated gradient methods
- Cotter, A., Shamir, O., Srebro, N., Sridharan, K.: Better mini-batch algorithms via accelerated gradient methods. In: Advances in Neural Information Processing Systems (NIPS), pp. 1647–1655 (2011)
- (2011) Advances in Neural Information Processing Systems (NIPS) , pp. 1647-1655
- Cotter, A.¹ Shamir, O.² Srebro, N.³ Sridharan, K.⁴

2
- 84898955433
- Memory limited, streaming PCA
- Mitliagkas, I., Caramanis, C., Jain, P.: Memory limited, streaming PCA. In: Advances in Neural Information Processing Systems (NIPS), pp. 2886–2894 (2013)
- (2013) Advances in Neural Information Processing Systems (NIPS) , pp. 2886-2894
- Mitliagkas, I.¹ Caramanis, C.² Jain, P.³

3
- 84865692149
- Efficiency of coordinate descent methods on huge-scale optimization problems
- Nesterov, Y.: Efficiency of coordinate descent methods on huge-scale optimization problems. SIAM J. Optim. (SIAM) 22(2), 341–362 (2012)
- (2012) SIAM J. Optim. (SIAM) , vol.22 , Issue.2 , pp. 341-362
- Nesterov, Y.¹

4
- 84877750537
- Stochastic convex optimization with bandit feedback
- Agarwal, A., Foster, D.P., Hsu, D., Kakade, S.M., Rakhlin, A.: Stochastic convex optimization with bandit feedback. SIAM J. Optim. (SIAM) 23(1), 213–240 (2013)
- (2013) SIAM J. Optim. (SIAM) , vol.23 , Issue.1 , pp. 213-240
- Agarwal, A.¹ Foster, D.P.² Hsu, D.³ Kakade, S.M.⁴ Rakhlin, A.⁵

5
- 84904136037
- Large-scale machine learning with stochastic gradient descent
- Bottou, L.: Large-scale machine learning with stochastic gradient descent. In: Proceedings of the 19th International Conference on Computational Statistics (COMPSTAT), pp. 177–187 (2010)
- (2010) Proceedings of the 19th International Conference on Computational Statistics (COMPSTAT) , pp. 177-187
- Bottou, L.¹

6
- 80053437034
- On optimization methods for deep learning
- Le, Q.V., Ngiam, J., Coates, A., Lahiri, A., Prochnow, B., Ng, A.Y.: On optimization methods for deep learning. In: Proceedings of the 28th International Conference on Machine Learning (ICML), pp. 265–272 (2011)
- (2011) Proceedings of the 28th International Conference on Machine Learning (ICML) , pp. 265-272
- Le, Q.V.¹ Ngiam, J.² Coates, A.³ Lahiri, A.⁴ Prochnow, B.⁵ Ng, A.Y.⁶

7
- 79952748054
- Pegasos: Primal estimated sub-gradient solver for SVM
- Shalev-Shwartz, S., Singer, Y., Srebro, N., Cotter, A.: Pegasos: primal estimated sub-gradient solver for SVM. Math. Program. 127(1), 3–30 (2011)
- (2011) Math. Program , vol.127 , Issue.1 , pp. 3-30
- Shalev-Shwartz, S.¹ Singer, Y.² Srebro, N.³ Cotter, A.⁴

8
- 80052400610
- Modeling annotator expertise: Learning when everybody knows a bit of something
- Yan, Y., Rosales, R., Fung, G., Schmidt, M., Hermosillo, G., Bogoni, L., Moy, L., Dy, J.-G.: Modeling annotator expertise: learning when everybody knows a bit of something. In: Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS), pp. 932–939 (2010)
- (2010) Proceedings of the 13th International Conference on Artificial Intelligence and Statistics (AISTATS) , pp. 932-939
- Yan, Y.¹ Rosales, R.² Fung, G.³ Schmidt, M.⁴ Hermosillo, G.⁵ Bogoni, L.⁶ Moy, L.⁷ Dy, J.-G.⁸

9
- 33747105621
- Trading convexity for scalability
- Collobert, R., Sinz, F., Weston, J., Bottou, L.: Trading convexity for scalability. In: Proceedings of the 23rd International Conference on Machine Learning (ICML), pp. 201–208 (2006)
- (2006) Proceedings of the 23rd International Conference on Machine Learning (ICML) , pp. 201-208
- Collobert, R.¹ Sinz, F.² Weston, J.³ Bottou, L.⁴

10
- 85162048991
- Relaxed clipping: A global training method for robust regression and classification
- Yu, Y.-L., Yang, M., Xu, L.-L., White, M., Schuurmans, D.: Relaxed clipping: a global training method for robust regression and classification. In: Advances in Neural Information Processing Systems (NIPS), pp. 2532–2540 (2010)
- (2010) Advances in Neural Information Processing Systems (NIPS) , pp. 2532-2540
- Yu, Y.-L.¹ Yang, M.² Xu, L.-L.³ White, M.⁴ Schuurmans, D.⁵

11
- 84867119104
- Towards optimal one pass large scale learning with averaged stochastic gradient descent
- Xu, W.: Towards optimal one pass large scale learning with averaged stochastic gradient descent. arXiv preprint arXiv:1107.2490 (2011)
- (2011) arXiv preprint arXiv:1107.2490
- Xu, W.¹

12
- 84892854517
- Stochastic first-and zeroth-order methods for nonconvex stochastic programming
- Ghadimi, S., Lan, G.-H.: Stochastic first-and zeroth-order methods for nonconvex stochastic programming. SIAM J. Optim. (SIAM) 23(4), 2341–2368 (2013)
- (2013) SIAM J. Optim. (SIAM) , vol.23 , Issue.4 , pp. 2341-2368
- Ghadimi, S.¹ Lan, G.-H.²

13
- 84958124116
- Accelerated gradient methods for nonconvex nonlinear and stochastic programming
- Ghadimi, S., Lan, G.-H.: Accelerated gradient methods for nonconvex nonlinear and stochastic programming. Math. Program. 156, 59–99 (2015)
- (2015) Math. Program , vol.156 , pp. 59-99
- Ghadimi, S.¹ Lan, G.-H.²

14
- 56449098486
- Training robust support vector machine with smooth ramp loss in the primal space
- Wang, L., Jia, H.-D., Li, J.: Training robust support vector machine with smooth ramp loss in the primal space. Neurocomputing 71(13), 3020–3025 (2008)
- (2008) Neurocomputing , vol.71 , Issue.13 , pp. 3020-3025
- Wang, L.¹ Jia, H.-D.² Li, J.³

15
- 85083950731
- Training convolution networks with noisy labels
- Sukhbaatar, S., Bruna, J., Paluri, M., Bourdev, L., Fergus, R.: Training convolution networks with noisy labels. In: Proceedings of the International Conference on Learning Representations (ICLR) (2015)
- (2015) Proceedings of the International Conference on Learning Representations (ICLR)
- Sukhbaatar, S.¹ Bruna, J.² Paluri, M.³ Bourdev, L.⁴ Fergus, R.⁵

16
- 84898932626
- Learning with noisy labels
- Natarajan, N., Dhillon, I.S., Ravikumar, P.K., Tewari, A.: Learning with noisy labels. In: Advances in Neural Information Processing Systems (NIPS), pp. 1196–1204 (2013)
- (2013) Advances in Neural Information Processing Systems (NIPS) , pp. 1196-1204
- Natarajan, N.¹ Dhillon, I.S.² Ravikumar, P.K.³ Tewari, A.⁴

17
- 33645505792
- Convexity, classification, and risk bounds
- Bartlett, P.L., Jordan, M.I., McAuliffe, J.D.: Convexity, classification, and risk bounds. J. Am. Stat. Assoc. 101(473), 138–156 (2006)
- (2006) J. Am. Stat. Assoc , vol.101 , Issue.473 , pp. 138-156
- Bartlett, P.L.¹ Jordan, M.I.² McAuliffe, J.D.³

18
- 84873371070
- Fast global convergence of gradient methods for high-dimensional statistical recovery
- Agarwal, A., Negahban, S., Wainwright, M.J.: Fast global convergence of gradient methods for high-dimensional statistical recovery. Ann. Stat. 40(5), 2452–2482 (2012)
- (2012) Ann. Stat , vol.40 , Issue.5 , pp. 2452-2482
- Agarwal, A.¹ Negahban, S.² Wainwright, M.J.³

19
- 84930632658
- Regularized M-estimators with nonconvexity: Statistical and algorithmic theory for local optima
- Loh, P.-L., Wainwright, M.J.: Regularized M-estimators with nonconvexity: statistical and algorithmic theory for local optima. J. Mach. Learn. Res. (JMLR) 16, 559–616 (2015)
- (2015) J. Mach. Learn. Res. (JMLR) , vol.16 , pp. 559-616
- Loh, P.-L.¹ Wainwright, M.J.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.