-
1
-
-
0004161991
-
-
Upper Saddle River, NJ, USA: Prentice-Hall, Inc.
-
A. K. Jain and R. C. Dubes, Algorithms for Clustering Data. Upper Saddle River, NJ, USA: Prentice-Hall, Inc., 1988.
-
(1988)
Algorithms for Clustering Data
-
-
Jain, A.K.1
Dubes, R.C.2
-
2
-
-
43049150241
-
Clustering by means of medoids
-
Department of Mathematics and Informatics, Tech. Rep.
-
L. Kaufman and P. Rousseeuw, "Clustering by means of medoids," Technische Hogeschool, Delft (Netherlands). Department of Mathematics and Informatics, Tech. Rep., 1987.
-
(1987)
Technische Hogeschool, Delft (Netherlands)
-
-
Kaufman, L.1
Rousseeuw, P.2
-
3
-
-
84989525001
-
Indexing by latent semantic analysis
-
S. Deerwester, S. Dumais, G. Furnas, T. Landauer, and R. Harshman, "Indexing by latent semantic analysis," Journal of the American Society for Information Science and Technology, vol. 41, no. 6, pp. 391-407, 1990.
-
(1990)
Journal of the American Society for Information Science and Technology
, vol.41
, Issue.6
, pp. 391-407
-
-
Deerwester, S.1
Dumais, S.2
Furnas, G.3
Landauer, T.4
Harshman, R.5
-
4
-
-
70349339356
-
Clustered subset selection and its applications on IT service metrics
-
C. Boutsidis, J. Sun, and N. Anerousis, "Clustered subset selection and its applications on IT service metrics," in Proceedings of the Seventeenth ACM Conference on Information and Knowledge Management (CIKM'08), 2008, pp. 599-608.
-
(2008)
Proceedings of the Seventeenth ACM Conference on Information and Knowledge Management (CIKM'08)
, pp. 599-608
-
-
Boutsidis, C.1
Sun, J.2
Anerousis, N.3
-
5
-
-
70349152160
-
An improved approximation algorithm for the column subset selection problem
-
C. Boutsidis, M. W. Mahoney, and P. Drineas, "An improved approximation algorithm for the column subset selection problem," in Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA'09), 2009, pp. 968-977.
-
(2009)
Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA'09)
, pp. 968-977
-
-
Boutsidis, C.1
Mahoney, M.W.2
Drineas, P.3
-
6
-
-
84863303500
-
Near optimal column-based matrix reconstruction
-
C. Boutsidis, P. Drineas, and M. Magdon-Ismail, "Near optimal column-based matrix reconstruction," in Proceedings of the 52nd Annual IEEE Symposium on Foundations of Computer Science (FOCS'11), 2011, pp. 305-314.
-
(2011)
Proceedings of the 52nd Annual IEEE Symposium on Foundations of Computer Science (FOCS'11)
, pp. 305-314
-
-
Boutsidis, C.1
Drineas, P.2
Magdon-Ismail, M.3
-
7
-
-
37549003336
-
MapReduce: Simplified data processing on large clusters
-
J. Dean and S. Ghemawat, "MapReduce: Simplified data processing on large clusters," Communications of the ACM, vol. 51, no. 1, pp. 107-113, 2008.
-
(2008)
Communications of the ACM
, vol.51
, Issue.1
, pp. 107-113
-
-
Dean, J.1
Ghemawat, S.2
-
9
-
-
0032308232
-
Fast Monte-Carlo algorithms for finding low-rank approximations
-
A. Frieze, R. Kannan, and S. Vempala, "Fast Monte-Carlo algorithms for finding low-rank approximations," in Proceedings of the 39th Annual IEEE Symposium on Foundations of Computer Science (FOCS'98), 1998, pp. 370-378.
-
(1998)
Proceedings of the 39th Annual IEEE Symposium on Foundations of Computer Science (FOCS'98)
, pp. 370-378
-
-
Frieze, A.1
Kannan, R.2
Vempala, S.3
-
10
-
-
3142750484
-
Clustering large graphs via the singular value decomposition
-
P. Drineas, A. Frieze, R. Kannan, S. Vempala, and V. Vinay, "Clustering large graphs via the singular value decomposition," Machine Learning, vol. 56, no. 1-3, pp. 9-33, 2004.
-
(2004)
Machine Learning
, vol.56
, Issue.1-3
, pp. 9-33
-
-
Drineas, P.1
Frieze, A.2
Kannan, R.3
Vempala, S.4
Vinay, V.5
-
11
-
-
33751075906
-
Fast Monte Carlo algorithms for matrices II: Computing a low-rank approximation to a matrix
-
P. Drineas, R. Kannan, and M. Mahoney, "Fast Monte Carlo algorithms for matrices II: Computing a low-rank approximation to a matrix," SIAM Journal on Computing, vol. 36, no. 1, pp. 158-183, 2007.
-
(2007)
SIAM Journal on Computing
, vol.36
, Issue.1
, pp. 158-183
-
-
Drineas, P.1
Kannan, R.2
Mahoney, M.3
-
12
-
-
33750079844
-
Subspace sampling and relative-error matrix approximation: Column-based methods
-
Springer Berlin/Heidelberg
-
P. Drineas, M. Mahoney, and S. Muthukrishnan, "Subspace sampling and relative-error matrix approximation: Column-based methods," in Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques. Springer Berlin/Heidelberg, 2006, pp. 316-326.
-
(2006)
Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques.
, pp. 316-326
-
-
Drineas, P.1
Mahoney, M.2
Muthukrishnan, S.3
-
13
-
-
45849092005
-
Matrix approximation and projective clustering via volume sampling
-
A. Deshpande, L. Rademacher, S. Vempala, and G. Wang, "Matrix approximation and projective clustering via volume sampling," Theory of Computing, vol. 2, no. 1, pp. 225-247, 2006.
-
(2006)
Theory of Computing
, vol.2
, Issue.1
, pp. 225-247
-
-
Deshpande, A.1
Rademacher, L.2
Vempala, S.3
Wang, G.4
-
14
-
-
84856610315
-
Column subset selection via sparse approximation of SVD
-
A. Çivril and M. Magdon-Ismail, "Column subset selection via sparse approximation of SVD," Theoretical Computer Science, vol. 421, no. 0, pp. 1-14, 2012.
-
(2012)
Theoretical Computer Science
, vol.421
, Issue.0
, pp. 1-14
-
-
Çivril, A.1
Magdon-Ismail, M.2
-
15
-
-
84857146334
-
An efficient greedy method for unsupervised feature selection
-
A. K. Farahat, A. Ghodsi, and M. S. Kamel, "An efficient greedy method for unsupervised feature selection," in Proceedings of the Eleventh IEEE International Conference on Data Mining (ICDM'11), 2011, pp. 161-170.
-
(2011)
Proceedings of the Eleventh IEEE International Conference on Data Mining (ICDM'11)
, pp. 161-170
-
-
Farahat, A.K.1
Ghodsi, A.2
Kamel, M.S.3
-
16
-
-
84876024189
-
Efficient greedy feature selection for unsupervised learning
-
A. K. Farahat, A. Ghodsi, and M. S. Kamel, "Efficient greedy feature selection for unsupervised learning," Knowledge and Information Systems, vol. 35, no. 2, pp. 285-310, 2013.
-
(2013)
Knowledge and Information Systems
, vol.35
, Issue.2
, pp. 285-310
-
-
Farahat, A.K.1
Ghodsi, A.2
Kamel, M.S.3
-
17
-
-
84859921422
-
Pairwise document similarity in large collections with MapReduce
-
T. Elsayed, J. Lin, and D. W. Oard, "Pairwise document similarity in large collections with MapReduce," in Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers (HLT'08), 2008, pp. 265-268.
-
(2008)
Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers (HLT'08)
, pp. 265-268
-
-
Elsayed, T.1
Lin, J.2
Oard, D.W.3
-
18
-
-
80052666000
-
Fast clustering using MapReduce
-
A. Ene, S. Im, and B. Moseley, "Fast clustering using MapReduce," in Proceedings of the Seventeenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), 2011, pp. 681-689.
-
(2011)
Proceedings of the Seventeenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11)
, pp. 681-689
-
-
Ene, A.1
Im, S.2
Moseley, B.3
-
19
-
-
77951678492
-
A model of computation for MapReduce
-
H. Karloff, S. Suri, and S. Vassilvitskii, "A model of computation for MapReduce," in Proceedings of the 21st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA'10), 2010, pp. 938-948.
-
(2010)
Proceedings of the 21st Annual ACM-SIAM Symposium on Discrete Algorithms (SODA'10)
, pp. 938-948
-
-
Karloff, H.1
Suri, S.2
Vassilvitskii, S.3
-
20
-
-
0037236821
-
An elementary proof of a theorem of Johnson and Lindenstrauss
-
S. Dasgupta and A. Gupta, "An elementary proof of a theorem of Johnson and Lindenstrauss," Random Structures and Algorithms, vol. 22, no. 1, pp. 60-65, 2003.
-
(2003)
Random Structures and Algorithms
, vol.22
, Issue.1
, pp. 60-65
-
-
Dasgupta, S.1
Gupta, A.2
-
21
-
-
0038166193
-
Database-friendly random projections: Johnson-Lindenstrauss with binary coins
-
D. Achlioptas, "Database-friendly random projections: Johnson-Lindenstrauss with binary coins," Journal of Computer and System Sciences, vol. 66, no. 4, pp. 671-687, 2003.
-
(2003)
Journal of Computer and System Sciences
, vol.66
, Issue.4
, pp. 671-687
-
-
Achlioptas, D.1
-
22
-
-
33749573641
-
Very sparse random projections
-
P. Li, T. J. Hastie, and K. W. Church, "Very sparse random projections," in Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06), 2006, pp. 287-296.
-
(2006)
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06)
, pp. 287-296
-
-
Li, P.1
Hastie, T.J.2
Church, K.W.3
-
26
-
-
84876811202
-
RCV1: A new benchmark collection for text categorization research
-
D. D. Lewis, Y. Yang, T. G. Rose, and F. Li, "RCV1: A new benchmark collection for text categorization research," The Journal of Machine Learning Research, vol. 5, pp. 361-397, 2004.
-
(2004)
The Journal of Machine Learning Research
, vol.5
, pp. 361-397
-
-
Lewis, D.D.1
Yang, Y.2
Rose, T.G.3
Li, F.4
-
27
-
-
79551528802
-
Parallel spectral clustering in distributed systems
-
W.-Y. Chen, Y. Song, H. Bai, C.-J. Lin, and E. Chang, "Parallel spectral clustering in distributed systems," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 33, no. 3, pp. 568-586, 2011.
-
(2011)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.33
, Issue.3
, pp. 568-586
-
-
Chen, W.-Y.1
Song, Y.2
Bai, H.3
Lin, C.-J.4
Chang, E.5
-
28
-
-
54749092170
-
80 million tiny images: A large data set for nonparametric object and scene recognition
-
A. Torralba, R. Fergus, and W. Freeman, "80 million tiny images: A large data set for nonparametric object and scene recognition," Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol. 30, no. 11, pp. 1958-1970, 2008.
-
(2008)
Pattern Analysis and Machine Intelligence, IEEE Transactions on
, vol.30
, Issue.11
, pp. 1958-1970
-
-
Torralba, A.1
Fergus, R.2
Freeman, W.3
-
29
-
-
81555213068
-
An algorithm for the principal component analysis of large data sets
-
N. Halko, P.-G. Martinsson, Y. Shkolnisky, and M. Tygert, "An algorithm for the principal component analysis of large data sets," SIAM Journal on Scientific Computing, vol. 33, no. 5, pp. 2580-2594, 2011.
-
(2011)
SIAM Journal on Scientific Computing
, vol.33
, Issue.5
, pp. 2580-2594
-
-
Halko, N.1
Martinsson, P.-G.2
Shkolnisky, Y.3
Tygert, M.4