메뉴 건너뛰기




Volumn 42, Issue 1-2, 2001, Pages 143-175

Concept decompositions for large sparse text data using clustering

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; DATA MINING; ERROR ANALYSIS; FRACTALS; INFORMATION RETRIEVAL; LEAST SQUARES APPROXIMATIONS; MATHEMATICAL MODELS; STATISTICAL METHODS; VECTORS;

EID: 0034824884     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1007612920971     Document Type: Article
Times cited : (1115)

References (40)
  • 1
    • 0029546874 scopus 로고
    • Using linear algebra for intelligent information retrieval
    • Berry, M. W., Dumais, S. T., & O'Brien, G. W. (1995). Using linear algebra for intelligent information retrieval. SIAM Review 37(4), 573-595.
    • (1995) SIAM Review , vol.37 , Issue.4 , pp. 573-595
    • Berry, M.W.1    Dumais2    T, S.3    O'Brien, G.W.4
  • 2
    • 84966204312 scopus 로고
    • Numerical methods for computing angles between linear subspaces
    • Björck, A. & Golub, G. (1973). Numerical methods for computing angles between linear subspaces. Mathematics of Computation 27(123).
    • (1973) Mathematics of Computation , vol.27 , Issue.123
    • Björck, A.1    Golub, G.2
  • 6
    • 0026961606 scopus 로고
    • Scatter/Gather: A cluster-based approach to browsing large document collections
    • Cutting, D. R., Karger, D. R., Pedersen, J. O., & Tukey, J. W. (1992). Scatter/Gather: A cluster-based approach to browsing large document collections. In: Proc. ACM SIGIR.
    • (1992) Proc. ACM SIGIR
    • Cutting, D.R.1    Karger, D.R.2    Pedersen, J.O.3    Tukey, J.W.4
  • 8
    • 9444247728 scopus 로고    scopus 로고
    • Concept decompositions for large sparse text data using clustering
    • IBM Almaden Research Center
    • Dhillon, I. S. & Modha, D. S. (1999). Concept decompositions for large sparse text data using clustering. Technical Report RJ 10147 (95022), IBM Almaden Research Center.
    • (1999) Technical Report RJ 10147 (95022)
    • Dhillon, I.S.1    Modha, D.S.2
  • 9
    • 0343586025 scopus 로고    scopus 로고
    • A parallel data-clustering algorithm for distributed memory multiprocessors
    • M. J. Zaki and C. T. Ho (eds.): Springer-Verlag, New York, Presented at the 1999 Large-Scale Parallel KDD Systems Workshop, San Diego, CA
    • Dhillon, I. S. & Modha, D. S. (2000). A parallel data-clustering algorithm for distributed memory multiprocessors. In: M. J. Zaki and C. T. Ho (eds.): Large-Scale Parallel Data Mining, Lecture Notes in Artificial Intelligence, Volume 1759. Springer-Verlag, New York, pp. 245-260. Presented at the 1999 Large-Scale Parallel KDD Systems Workshop, San Diego, CA.
    • (2000) Large-Scale Parallel Data Mining, Lecture Notes in Artificial Intelligence , vol.1759 , pp. 245-260
    • Dhillon, I.S.1    Modha, D.S.2
  • 15
    • 0004236492 scopus 로고    scopus 로고
    • Baltimore, MD, USA: The Johns Hopkins University Press
    • Golub, G. H. and Van Loan, C. F. (1996). Matrix computations. Baltimore, MD, USA: The Johns Hopkins University Press.
    • (1996) Matrix Computations
    • Golub, G.H.1    Van Loan, C.F.2
  • 17
    • 0030381274 scopus 로고    scopus 로고
    • Reexamining the cluster hypothesis: Scatter/Gather on retrieval results
    • Hearst, M. A. & Pedersen, J. O. (1996). Reexamining the cluster hypothesis: Scatter/Gather on retrieval results. In: Proc. ACM SIGIR.
    • (1996) Proc. ACM SIGIR
    • Hearst, M.A.1    Pedersen, J.O.2
  • 18
    • 85026972772 scopus 로고    scopus 로고
    • Probabilistic latent semantic indexing
    • Hofmann, T. (1999). Probabilistic latent semantic indexing. In: Proc. ACM SIGIR.
    • (1999) Proc. ACM SIGIR
    • Hofmann, T.1
  • 19
    • 0001827680 scopus 로고    scopus 로고
    • Restructuring sparse high dimensional data for effective retrieval
    • Isbell, C. L. & Viola, P. (1998). Restructuring sparse high dimensional data for effective retrieval. In: Advances in neural information processing (Vol. 11).
    • (1998) Advances in Neural Information Processing , vol.11
    • Isbell, C.L.1    Viola, P.2
  • 24
    • 0020721236 scopus 로고
    • Digital image compression by outer product expansion
    • O'Leary, D. P. & Peleg, S. (1983). Digital image compression by outer product expansion. IEEE Trans. Communications 31, 441-444.
    • (1983) IEEE Trans. Communications , vol.31 , pp. 441-444
    • O'Leary, D.P.1    Peleg, S.2
  • 26
    • 0020101636 scopus 로고
    • Quantization and the method of k-means
    • Pollard, D. (1982). Quantization and the method of k-means. IEEE Trans. Inform. Theory 28, 199-205.
    • (1982) IEEE Trans. Inform. Theory , vol.28 , pp. 199-205
    • Pollard, D.1
  • 27
    • 0000019005 scopus 로고
    • Clustering Algorithms
    • W. B. Frakes & R. Baeza-Yates (eds.): Prentice-Hall
    • Rasmussen, E. (1992). Clustering Algorithms. In: W. B. Frakes & R. Baeza-Yates (eds.): Information retrieval: Data structures and algorithms, pp. 419-442, Prentice-Hall.
    • (1992) Information Retrieval: Data Structures and Algorithms , pp. 419-442
    • Rasmussen, E.1
  • 28
    • 0026838784 scopus 로고
    • Density estimation by stochastic complexity
    • Rissanen, J., Speed, T., & Yu, B. (1992). Density estimation by stochastic complexity. IEEE Trans. Inform. Theory 38, 315-323.
    • (1992) IEEE Trans. Inform. Theory , vol.38 , pp. 315-323
    • Rissanen, J.1    Speed, T.2    Yu3    B4
  • 29
    • 0022683355 scopus 로고
    • Global convergence and empirical consistency of the generalized Lloyd algorithm
    • Sabin, M. J. & Gray, R. M. (1986). Global convergence and empirical consistency of the generalized Lloyd algorithm. IEEE Trans. Inform. Theory 32(2), 148-155.
    • (1986) IEEE Trans. Inform. Theory , vol.32 , Issue.2 , pp. 148-155
    • Sabin, M.J.1    Gray, R.M.2
  • 31
    • 45549117987 scopus 로고
    • Term-weighting approaches in automatic text retrieval
    • Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Inform. proc. & management. pp. 513-523.
    • (1988) Inform. Proc. & Management. , pp. 513-523
    • Salton, G.1    Buckley, C.2
  • 34
    • 0030649811 scopus 로고    scopus 로고
    • Projections for efficient document clustering
    • Schütze, H. & Silverstein, C. (1997). Projections for efficient document clustering. In: Proc. ACM SIGIR.
    • (1997) Proc. ACM SIGIR
    • Schütze, H.1    Silverstein, C.2
  • 35
    • 0030684944 scopus 로고    scopus 로고
    • Almost-constant-time clustering of arbitrary corpus subsets
    • Silverstein, C. & Pedersen, J. O. (1997). Almost-constant-time clustering of arbitrary corpus subsets. In: Proc. ACM SIGIR.
    • (1997) Proc. ACM SIGIR
    • Silverstein, C.1    Pedersen, J.O.2
  • 37
    • 0042220696 scopus 로고    scopus 로고
    • Model selection in unsupervised learning with applications to document clustering
    • Bled, Slovenia
    • Vaithyanathan, S. & Dom, B. (1999). Model selection in unsupervised learning with applications to document clustering. In: Proc. 16th Int. Machine Learning Conf., Bled, Slovenia.
    • (1999) Proc. 16th Int. Machine Learning Conf.
    • Vaithyanathan, S.1    Dom, B.2
  • 38
    • 3543147086 scopus 로고
    • Recent trends in hierarchic document clustering: A critical review
    • Willet, P. (1988). Recent trends in hierarchic document clustering: a critical review. Inform. Proc. & Management pp. 577-597.
    • (1988) Inform. Proc. & Management , pp. 577-597
    • Willet, P.1
  • 39
    • 0032268443 scopus 로고    scopus 로고
    • Web document clustering: A feasibility demonstration
    • Zamir, O. & Etzioni, O. (1998). Web document clustering: A feasibility demonstration. In: Proc. ACM SIGIR.
    • (1998) Proc. ACM SIGIR
    • Zamir, O.1    Etzioni, O.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.