-
1
-
-
84897528650
-
Selective sampling algorithms for cost-sensitive multiclass prediction
-
Agarwal, A. Selective sampling algorithms for cost-sensitive multiclass prediction. In ICML, 2013.
-
(2013)
ICML
-
-
Agarwal, A.1
-
2
-
-
84961742418
-
-
arXiv preprint arXiv.1310.1949
-
Agarwal, A., Kakade, S. M., Karampatziakis, N., Song, L., and Valiant, G. Least squares revisited: Scalable approaches for multi-class prediction. arXiv preprint arXiv.1310.1949, 2013.
-
(2013)
Least Squares Revisited: Scalable Approaches for Multi-class Prediction
-
-
Agarwal, A.1
Kakade, S.M.2
Karampatziakis, N.3
Song, L.4
Valiant, G.5
-
3
-
-
68949096711
-
Sgd-qn: Careful quasi- newton stochastic gradient descent
-
July
-
Bordes, A., Bottou, L., and Gallinari, P. Sgd-qn: Careful quasi- newton stochastic gradient descent. Journal of Machine Learning Research, 10:1737-1754, July 2009.
-
(2009)
Journal of Machine Learning Research
, vol.10
, pp. 1737-1754
-
-
Bordes, A.1
Bottou, L.2
Gallinari, P.3
-
4
-
-
85162035281
-
The tradeoffs of large scale learning
-
Bottou, L. and Bousquet, O. The tradeoffs of large scale learning. In NIPS. 2008.
-
(2008)
NIPS
-
-
Bottou, L.1
Bousquet, O.2
-
5
-
-
0003795688
-
E-entropy of convex sets and functions
-
Bronshtein, E.M. e-entropy of convex sets and functions. Siberian Mathematical Journal, 17(3):393-398, 1976.
-
(1976)
Siberian Mathematical Journal
, vol.17
, Issue.3
, pp. 393-398
-
-
Bronshtein, E.M.1
-
6
-
-
80054732060
-
On the use of stochastic hessian information in optimization methods for machine learning
-
Byrd, R. H., Chin, G. M., Neveitt, W., and Nocedal, J. On the use of stochastic hessian information in optimization methods for machine learning. SIAM Journal on Optimization, 21(3): 977-995, 2011.
-
(2011)
SIAM Journal on Optimization
, vol.21
, Issue.3
, pp. 977-995
-
-
Byrd, R.H.1
Chin, G.M.2
Neveitt, W.3
Nocedal, J.4
-
7
-
-
34247849152
-
Training a support vector machine in the primal
-
Chapelle, O. Training a support vector machine in the primal. Neural Comput., 19(5): 1155-1178, 2007.
-
(2007)
Neural Comput.
, vol.19
, Issue.5
, pp. 1155-1178
-
-
Chapelle, O.1
-
8
-
-
84862283411
-
An analysis of single-layer networks in unsupervised feature learning
-
Coates, A., Ng, A. Y., and Lee, H. An analysis of single-layer networks in unsupervised feature learning. Journal of Machine Learning Research - Proceedings Track, 15:215-223, 2011.
-
(2011)
Journal of Machine Learning Research - Proceedings Track
, vol.15
, pp. 215-223
-
-
Coates, A.1
Ng, A.Y.2
Lee, H.3
-
9
-
-
26444551655
-
Discriminative reranking for natural language parsing
-
Collins, M. and Koo, T. Discriminative reranking for natural language parsing. In ICML, 2000.
-
(2000)
ICML
-
-
Collins, M.1
Koo, T.2
-
11
-
-
50949133669
-
Liblinear: A library for large linear classification
-
Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., and Lin, C.- J. Liblinear: A library for large linear classification. Journal of Machine Learning Research, 9:1871-1874, 2008.
-
(2008)
Journal of Machine Learning Research
, vol.9
, pp. 1871-1874
-
-
Fan, R.-E.1
Chang, K.-W.2
Hsieh, C.-J.3
Wang, X.-R.4
Lin C.-., J.5
-
12
-
-
0035470889
-
Greedy function approximation: A gradient boosting machine
-
english summary
-
Friedman, J. H. Greedy function approximation: A gradient boosting machine.(english summary). Ann. Statist, 29(5): 1189- 1232, 2001.
-
(2001)
Ann. Statist
, vol.29
, Issue.5
, pp. 1189-1232
-
-
Friedman, J.H.1
-
13
-
-
84897498659
-
Maxout networks
-
Goodfellow, I. J., Warde-Farley, D., Mirza, M., Courville, A. C., and Bengio, Y. Maxout networks. CoRR, 2013.
-
(2013)
CoRR
-
-
Goodfellow, I.J.1
Warde-Farley, D.2
Mirza, M.3
Courville, A.C.4
Bengio, Y.5
-
14
-
-
84867720412
-
-
arXiv preprint arXiv:1207.0580
-
Hinton, G. E., Srivastava, N., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R. R. Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580, 2012.
-
(2012)
Improving Neural Networks by Preventing Co-adaptation of Feature Detectors
-
-
Hinton, G.E.1
Srivastava, N.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.R.5
-
15
-
-
84877743512
-
Majorization for crfs and latent likelihoods
-
Jebara, T. and Choromanska, A. Majorization for crfs and latent likelihoods. In NIPS, 2012.
-
(2012)
NIPS
-
-
Jebara, T.1
Choromanska, A.2
-
16
-
-
85162537884
-
Efficient learning of generalized linear and single index models with isotonic regression
-
Kakade, S. M., Kalai, A., Kanade, V., and Shamir, O. Efficient learning of generalized linear and single index models with isotonic regression. In NIPS, 2011.
-
(2011)
NIPS
-
-
Kakade, S.M.1
Kalai, A.2
Kanade, V.3
Shamir, O.4
-
17
-
-
84898072863
-
The isotron algorithm: High- dimensional isotonic regression
-
Kalai, A. T. and Sastry, R. The isotron algorithm: High- dimensional isotonic regression. In COLT '09, 2009.
-
(2009)
COLT '09
-
-
Kalai, A.T.1
Sastry, R.2
-
18
-
-
84876811202
-
Rcvl: A new benchmark collection for text categorization research
-
Lewis, D. D., Yang, Y., Rose, T. G., and Li, F. Rcvl: A new benchmark collection for text categorization research. The Journal of Machine Learning Research, 5:361-397, 2004.
-
(2004)
The Journal of Machine Learning Research
, vol.5
, pp. 361-397
-
-
Lewis, D.D.1
Yang, Y.2
Rose, T.G.3
Li, F.4
-
22
-
-
84865692149
-
Efficiency of coordinate descent methods on huge- scale optimization problems
-
Nesterov, Y. Efficiency of coordinate descent methods on huge- scale optimization problems. SIAM Journal on Optimization, 22(2):341-362, 2012.
-
(2012)
SIAM Journal on Optimization
, vol.22
, Issue.2
, pp. 341-362
-
-
Nesterov, Y.1
-
23
-
-
0003243224
-
Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods
-
MIT Press
-
Piatt, J. C. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Adavances in large margin classifiers, pp. 61-74. MIT Press, 1999.
-
(1999)
Adavances in Large Margin Classifiers
, pp. 61-74
-
-
Piatt, J.C.1
-
25
-
-
85162467517
-
Hogwild: A lock-free approach to parallelizing stochastic gradient descent
-
Recht, B., Re, C., Wright, S. J., and Niu, F. Hogwild: A lock-free approach to parallelizing stochastic gradient descent. In NIPS, pp. 693-701, 2011.
-
(2011)
NIPS
, pp. 693-701
-
-
Recht, B.1
Re, C.2
Wright, S.J.3
Niu, F.4
-
27
-
-
84972545670
-
Characterization of the subdifferentials of convex functions
-
Rockafellar, R.T. Characterization of the subdifferentials of convex functions. Pac. J. Math., 17:497-510, 1966.
-
(1966)
Pac. J. Math.
, vol.17
, pp. 497-510
-
-
Rockafellar, R.T.1
-
28
-
-
84877725219
-
A stochastic gradient method with an exponential convergence rate for finite training sets
-
Roux, N. L., Schmidt, M., and Bach, F. A stochastic gradient method with an exponential convergence rate for finite training sets. In NIPS, pp. 2672-2680. 2012.
-
(2012)
NIPS
, pp. 2672-2680
-
-
Roux, N.L.1
Schmidt, M.2
Bach, F.3
-
30
-
-
84875134236
-
Stochastic dual coordinate ascent methods for regularized loss minimization
-
Shalev-Shwartz, S. and Zhang, T. Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization. Journal of Machine Learning Reearch, 14:567-599, 2013.
-
(2013)
Journal of Machine Learning Reearch
, vol.14
, pp. 567-599
-
-
Shalev-Shwartz, S.1
Zhang, T.2
-
31
-
-
84863266107
-
Large linear classification when data cannot fit in memory
-
Yu, H.-F., Hsieh, C.-J., Chang, K.-W., and Lin, C.-J. Large linear classification when data cannot fit in memory. TKDD, 5(4), 2012.
-
(2012)
TKDD
, vol.5
, Issue.4
-
-
Yu, H.-F.1
Hsieh, C.-J.2
Chang, K.-W.3
Lin, C.-J.4
|