[1] D. Achlioptas and F. McSherry. Fast computation of low rank matrix approximations. In Proc. 33rd Annual ACM STOC, pp. 611-618, 2001.
[2] M. W. Berry, S. A. Pulatova, and G. W. Stewart. Algorithm 844: Computing sparse reduced-rank approximations to sparse matrices. ACM Trans. Math. Softw., 31(2):252-269, 2005.
[3] V. Castelli, A. Thomasian, and C.-S. Li. CSVD: Clustering and singular value decomposition for approximate similarity search in high-dimensional spaces. IEEE Trans. Knowledge and Data Engin., 15(3):671-685, 2003.
[5] N. Cristianini, J. Shawe-Taylor, and H. Lodhi. Latent semantic kernels. In C. Brodley and A. Danyluk, editors, Proc. 18th ICMLA, pp. 66-73. Morgan Kaufmann, San Francisco, 2001.
[6] I. S. Dhillon and D. S. Modha. Concept decompositions for large sparse text data using clustering. Machine Learning, 42(1):143-175, Jan 2001.
[7] P. Drineas, R. Kannan, and M. Mahoney. Fast Monte Carlo algorithms for matrices III: Computing a compressed approximate matrix decomposition. Technical Report, TR-1270, Computer Science Dept, Yale Univ., February 2004.
[8] J. Gao and J. Zhang. Clustered SVD strategies in latent semantic indexing. Information Processing and Management, 41(5):1051-1063, 2005.
[9] G. Golub and C. Van Loan. Matrix Computations. The Johns Hopkins University Press, Baltimore, 3rd edition, 1996.
[11] D. Hull. Improving text retrieval for the routing problem using latent semantic indexing. In Proc. 17th ACM SIGIR, pp. 282-291, New York, NY, USA, 1994. Springer-Verlag New York, Inc.
[12] H. Kargupta, W. Huang, K. Sivakumar, and E. Johnson. Distributed clustering using collective principal component analysis. Knowledge and Information Systems, 3(4):422-448, 2001.
[13] D. Littau and D. Boley. Clustering very large data sets with principal direction divisive partitioning. In J. Kogan, C. Nicholas, and M. Teboulle, editors, Grouping Multidimensional Data: Recent Advances in Clustering, pp. 99-126. Springer, Berlin, 2006.
[14] T. Liu, Z. Chen, B. Zhang, W. Ma, and G. Wu. Improving text classification using local latent semantic indexing. In Proc. ICDM'04, pp. 162-169, Washington, 2004. IEEE Computer Society.
[15] C. Park and H. Park. Nonlinear feature extraction based on centroids and kernel functions. Pattern Recognition, 37(4):801-810, 2004.
[16] H. Park, M. Jeon, and J. Rosen. Lower dimensional representation of text data based on centroids and least squares. BIT, 43(2):427-448, 2003.
[17] Y. Qu, G. Ostrouchov, N. F. Samatova, and A. Geist. Principal component analysis for dimension reduction in massive distributed data sets. In Workshop on High Performance Data Mining held with 2nd SIAM Int'l. Conf. Data Mining, pp. 4-9, 2002.
[18] H. Schütze, D. Hull, and J. Pedersen. A comparison of classifiers and document representations for the routing problem. In Proc. 18th ACM SIGIR, pp. 229-237, New York, 1995.
[19] F. Sebastiani. Machine learning in automated text categorization. ACM Comput. Surveys, 34(1):1-47, 2002.
[21] E. Wiener, J. Pedersen, and A. Weigend. A neural network approach to topic spotting. In Proc. SDAIR-95, 4th Annual Symposium on Document Analysis and Information Retrieval, pp. 317-332, Las Vegas, US, 1995.
[22] Y. Yang. Noise reduction in a statistical approach to text categorization. In Proc. 18th ACM SIGIR, pp. 256-263, New York, 1995.
[23] Y. Yang. A study of thresholding strategies for text categorization. In Proc. 24th ACM SIGIR, pp. 137-145, New York, 2001.
[24] Y. Yang and C. Chute. A linear least squares fit mapping method for information retrieval from natural language texts. In Proc. 14th Conf. Computational Linguistics, pp. 447-453, Morristown, NJ, USA, 1992. Association for Computational Linguistics.
[25] Y. Yang and X. Liu. A re-examination of text categorization methods. In Proc. 22nd ACM SIGIR, pp. 42-49, New York, 1999. ACM Press.
[26] Y. Yang, J. Zhang, and B. Kisiel. A scalability analysis of classifiers in text categorization. In Proc. SIGIR '03, pp. 96-103, New York, 2003.
[27] D. Zeimpekis and E. Gallopoulos. CLSI: A flexible approximation scheme from clustered term-document matrices. In Proc. 5th SIAM Int'l. Conf. Data Mining, pp. 631-635, Newport Beach, California, 2005.
[28] D. Zeimpekis and E. Gallopoulos. TMG: A MATLAB toolbox for generating term-document matrices from text collections. In J. Kogan, C. Nicholas, and M. Teboulle, editors, Grouping Multidimensional Data: Recent Advances in Clustering, pp. 187-210. Springer, Berlin, 2006.
[29] S. Zelikovitz and H. Hirsh. Using LSI for text classification in the presence of background text. In Proc. CIKM '01, pp. 113-118, New York, 2001. ACM Press.