SCOPUS Information Search Platform

31st International Conference on Machine Learning, ICML 2014

Volume 3, 2014, Pages 1937-1945

Least squares revisited: Scalable approaches for multi-class prediction

(5) Agarwal, Alekh a; Kakade, Sham M. b; Karampatziakis, Nikos c; Song, Le d; Valiant, Gregory e

a MICROSOFT RESEARCH (United States)

b MICROSOFT RESEARCH (United Kingdom)

c MICROSOFT (United States)

d Georgia Institute of Technology (United States)

e Stanford University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; ITERATIVE METHODS; LEARNING SYSTEMS;

CLASS PREDICTION; DATA DIMENSIONS; FIRST-ORDER METHODS; HIGH DIMENSIONAL DATASETS; ITERATIVE LEAST SQUARES; OPTIMIZATION PACKAGES; SCALABLE APPROACH; SIMPLE ALGORITHM;

ALGORITHMS;

EID: 84919949624 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (13)

References (31)

1
- 84897528650
- Selective sampling algorithms for cost-sensitive multiclass prediction
- Agarwal, A. Selective sampling algorithms for cost-sensitive multiclass prediction. In ICML, 2013.
- (2013) ICML
- Agarwal, A.¹

2
- 84961742418
- arXiv preprint arXiv:1310.1949
- Agarwal, A., Kakade, S. M., Karampatziakis, N., Song, L., and Valiant, G. Least squares revisited: Scalable approaches for multi-class prediction. arXiv preprint arXiv:1310.1949, 2013.
- (2013) Least Squares Revisited: Scalable Approaches for Multi-class Prediction
- Agarwal, A.¹ Kakade, S.M.² Karampatziakis, N.³ Song, L.⁴ Valiant, G.⁵

3
- 68949096711
- SGD-QN: Careful quasi-Newton stochastic gradient descent
- July
- Bordes, A., Bottou, L., and Gallinari, P. SGD-QN: Careful quasi-Newton stochastic gradient descent. Journal of Machine Learning Research, 10:1737-1754, July 2009.
- (2009) Journal of Machine Learning Research , vol.10 , pp. 1737-1754
- Bordes, A.¹ Bottou, L.² Gallinari, P.³

4
- 85162035281
- The tradeoffs of large scale learning
- Bottou, L. and Bousquet, O. The tradeoffs of large scale learning. In NIPS, 2008.
- (2008) NIPS
- Bottou, L.¹ Bousquet, O.²

5
- 0003795688
- ε-entropy of convex sets and functions
- Bronshtein, E. M. ε-entropy of convex sets and functions. Siberian Mathematical Journal, 17(3):393-398, 1976.
- (1976) Siberian Mathematical Journal , vol.17 , Issue.3 , pp. 393-398
- Bronshtein, E.M.¹

6
- 80054732060
- On the use of stochastic Hessian information in optimization methods for machine learning
- Byrd, R. H., Chin, G. M., Neveitt, W., and Nocedal, J. On the use of stochastic Hessian information in optimization methods for machine learning. SIAM Journal on Optimization, 21(3):977-995, 2011.
- (2011) SIAM Journal on Optimization , vol.21 , Issue.3 , pp. 977-995
- Byrd, R.H.¹ Chin, G.M.² Neveitt, W.³ Nocedal, J.⁴

7
- 34247849152
- Training a support vector machine in the primal
- Chapelle, O. Training a support vector machine in the primal. Neural Comput., 19(5):1155-1178, 2007.
- (2007) Neural Comput. , vol.19 , Issue.5 , pp. 1155-1178
- Chapelle, O.¹

8
- 84862283411
- An analysis of single-layer networks in unsupervised feature learning
- Coates, A., Ng, A. Y., and Lee, H. An analysis of single-layer networks in unsupervised feature learning. Journal of Machine Learning Research - Proceedings Track, 15:215-223, 2011.
- (2011) Journal of Machine Learning Research - Proceedings Track , vol.15 , pp. 215-223
- Coates, A.¹ Ng, A.Y.² Lee, H.³

9
- 26444551655
- Discriminative reranking for natural language parsing
- Collins, M. and Koo, T. Discriminative reranking for natural language parsing. In ICML, 2000.
- (2000) ICML
- Collins, M.¹ Koo, T.²

10
- 56449092085
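- Efficient projections onto the ℓ1-ball for learning in high dimensions
- Duchi, J., Shalev-Shwartz, S., Singer, Y., and Chandra, T. Efficient projections onto the ℓ1-ball for learning in high dimensions. In ICML, 2008.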
- (2008) ICML
- Duchi, J.¹ Shalev-Shwartz, S.² Singer, Y.³ Chandra, T.⁴

11
- 50949133669
- LIBLINEAR: A library for large linear classification
- Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., and Lin, C.-J. LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research, 9:1871-1874, 2008.
- (2008) Journal of Machine Learning Research , vol.9 , pp. 1871-1874
- Fan, R.-E.¹ Chang, K.-W.² Hsieh, C.-J.³ Wang, X.-R.⁴ Lin, C.-J.⁵

12
- 0035470889
- Greedy function approximation: A gradient boosting machine
- Friedman, J. H. Greedy function approximation: A gradient boosting machine. Ann. Statist., 29(5):1189-1232, 2001.
- (2001) Ann. Statist , vol.29 , Issue.5 , pp. 1189-1232
- Friedman, J.H.¹

13
- 84897498659
- Maxout networks
- Goodfellow, I. J., Warde-Farley, D., Mirza, M., Courville, A. C., and Bengio, Y. Maxout networks. CoRR, 2013.
- (2013) CoRR
- Goodfellow, I.J.¹ Warde-Farley, D.² Mirza, M.³ Courville, A.C.⁴ Bengio, Y.⁵

14
- 84867720412
- arXiv preprint arXiv:1207.0580
- Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. R. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580, 2012.
- (2012) Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
- Hinton, G.E.¹ Srivastava, N.² Krizhevsky, A.³ Sutskever, I.⁴ Salakhutdinov, R.R.⁵

15
- 84877743512
- Majorization for CRFs and latent likelihoods
- Jebara, T. and Choromanska, A. Majorization for CRFs and latent likelihoods. In NIPS, 2012.
- (2012) NIPS
- Jebara, T.¹ Choromanska, A.²

16
- 85162537884
- Efficient learning of generalized linear and single index models with isotonic regression
- Kakade, S. M., Kalai, A., Kanade, V., and Shamir, O. Efficient learning of generalized linear and single index models with isotonic regression. In NIPS, 2011.
- (2011) NIPS
- Kakade, S.M.¹ Kalai, A.² Kanade, V.³ Shamir, O.⁴

17
- 84898072863
- The Isotron algorithm: High-dimensional isotonic regression
- Kalai, A. T. and Sastry, R. The Isotron algorithm: High-dimensional isotonic regression. In COLT '09, 2009.
- (2009) COLT '09
- Kalai, A.T.¹ Sastry, R.²

18
- 84876811202
- RCV1: A new benchmark collection for text categorization research
- Lewis, D. D., Yang, Y., Rose, T. G., and Li, F. RCV1: A new benchmark collection for text categorization research. The Journal of Machine Learning Research, 5:361-397, 2004.
- (2004) The Journal of Machine Learning Research , vol.5 , pp. 361-397
- Lewis, D.D.¹ Yang, Y.² Rose, T.G.³ Li, F.⁴

19
- 84866046574
- Linear support vector machines via dual cached loops
- Matsushima, S., Vishwanathan, S. V. N., and Smola, A. J. Linear support vector machines via dual cached loops. In KDD, 2012.
- (2012) KDD
- Matsushima, S.¹ Vishwanathan, S.V.N.² Smola, A.J.³

20
- 0003692801
- New York
- Nemirovsky, A. S. and Yudin, D. B. Problem Complexity and Method Efficiency in Optimization. New York, 1983.
- (1983) Problem Complexity and Method Efficiency in Optimization
- Nemirovsky, A.S.¹ Yudin, D.B.²

21
- 0003696537
- New York
- Nesterov, Y. Introductory Lectures on Convex Optimization. New York, 2004.
- (2004) Introductory Lectures on Convex Optimization
- Nesterov, Y.¹

22
- 84865692149
- Efficiency of coordinate descent methods on huge-scale optimization problems
- Nesterov, Y. Efficiency of coordinate descent methods on huge-scale optimization problems. SIAM Journal on Optimization, 22(2):341-362, 2012.
- (2012) SIAM Journal on Optimization , vol.22 , Issue.2 , pp. 341-362
- Nesterov, Y.¹

23
- 0003243224
- Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
- MIT Press
- Platt, J. C. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Advances in Large Margin Classifiers, pp. 61-74. MIT Press, 1999.
- (1999) Advances in Large Margin Classifiers , pp. 61-74
- Platt, J.C.¹

24
- 85161980201
- Random features for large-scale kernel machines
- Rahimi, A. and Recht, B. Random features for large-scale kernel machines. Advances in neural information processing systems, 20:1177-1184, 2007.
- (2007) Advances in Neural Information Processing Systems , vol.20 , pp. 1177-1184
- Rahimi, A.¹ Recht, B.²

25
- 85162467517
- Hogwild: A lock-free approach to parallelizing stochastic gradient descent
- Recht, B., Re, C., Wright, S. J., and Niu, F. Hogwild: A lock-free approach to parallelizing stochastic gradient descent. In NIPS, pp. 693-701, 2011.
- (2011) NIPS , pp. 693-701
- Recht, B.¹ Re, C.² Wright, S.J.³ Niu, F.⁴

26
- 84882256468
- URL
- Richtarik, P. and Takac, M. Parallel coordinate descent methods for big data optimization. 2012. URL http://arxiv.org/abs/1212.0873.
- (2012) Parallel Coordinate Descent Methods for Big Data Optimization
- Richtarik, P.¹ Takac, M.²

27
- 84972545670
- Characterization of the subdifferentials of convex functions
- Rockafellar, R.T. Characterization of the subdifferentials of convex functions. Pac. J. Math., 17:497-510, 1966.
- (1966) Pac. J. Math. , vol.17 , pp. 497-510
- Rockafellar, R.T.¹

28
- 84877725219
- A stochastic gradient method with an exponential convergence rate for finite training sets
- Roux, N. L., Schmidt, M., and Bach, F. A stochastic gradient method with an exponential convergence rate for finite training sets. In NIPS, pp. 2672-2680, 2012.
- (2012) NIPS , pp. 2672-2680
- Roux, N.L.¹ Schmidt, M.² Bach, F.³

29
- 84859418371
- Online learning and online convex optimization
- Shalev-Shwartz, S. Online learning and online convex optimization. Foundations and Trends in Machine Learning, 4(2), 2012.
- (2012) Foundations and Trends in Machine Learning , vol.4 , Issue.2
- Shalev-Shwartz, S.¹

30
- 84875134236
- Stochastic dual coordinate ascent methods for regularized loss minimization
- Shalev-Shwartz, S. and Zhang, T. Stochastic dual coordinate ascent methods for regularized loss minimization. Journal of Machine Learning Research, 14:567-599, 2013.
- (2013) Journal of Machine Learning Research , vol.14 , pp. 567-599
- Shalev-Shwartz, S.¹ Zhang, T.²

31
- 84863266107
- Large linear classification when data cannot fit in memory
- Yu, H.-F., Hsieh, C.-J., Chang, K.-W., and Lin, C.-J. Large linear classification when data cannot fit in memory. TKDD, 5(4), 2012.
- (2012) TKDD , vol.5 , Issue.4
- Yu, H.-F.¹ Hsieh, C.-J.² Chang, K.-W.³ Lin, C.-J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.