메뉴 건너뛰기




Volumn , Issue 49, 2003, Pages 33-75

Techniques of document clustering: A review

Author keywords

[No Author keywords available]

Indexed keywords


EID: 13444259727     PISSN: 03734447     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Review
Times cited : (5)

References (100)
  • 8
    • 0013183065 scopus 로고
    • Automatic routing and ad-hoc retrieval using SMART; TREC2
    • D. Harman ed. National Institute of Standards and Technology
    • Buckley, C.; Allan, J.; Salton, G. "Automatic routing and ad-hoc retrieval using SMART; TREC2". Proceedings of the Second Text Retrieval Conference (TREC2). D. Harman ed. National Institute of Standards and Technology, 1994, p. 45-55.
    • (1994) Proceedings of the Second Text Retrieval Conference (TREC2) , pp. 45-55
    • Buckley, C.1    Allan, J.2    Salton, G.3
  • 9
    • 0027578653 scopus 로고
    • Incremental clustering for dynamic information processing
    • Can, Fazli. Incremental clustering for dynamic information processing. ACM Transactions on Information Systems, vol. 10, no. 2, 1993, p. 143-164.
    • (1993) ACM Transactions on Information Systems , vol.10 , Issue.2 , pp. 143-164
    • Can, F.1
  • 10
    • 3343022594 scopus 로고
    • Incremental clustering for very large document databases: Initial MARIAN experience
    • Can, Fazli; Fox, Edward A.; Snavely, Cory D.; France, Robert K. Incremental clustering for very large document databases: initial MARIAN experience. Information Sciences. vol.84, 1995, p. 101-114.
    • (1995) Information Sciences , vol.84 , pp. 101-114
    • Can, F.1    Fox, E.A.2    Snavely, C.D.3    France, R.K.4
  • 12
    • 0021851676 scopus 로고
    • Similarity and stability analysis of the two partitioning type clustering algorithms
    • Can, Fazli; Ozkarahan, Esen A. Similarity and stability analysis of the two partitioning type clustering algorithms. Journal of the American Society for Information Science, vol. 36, no. 1, 1985, p. 3-14.
    • (1985) Journal of the American Society for Information Science , vol.36 , Issue.1 , pp. 3-14
    • Can, F.1    Ozkarahan, E.A.2
  • 13
    • 0023347646 scopus 로고
    • Computation of term/document discrimination values by use of the cover coefficient concept
    • Can, Fazli; Ozkarahan, Esen A. Computation of term/document discrimination values by use of the cover coefficient concept. Journal of the American Society for Information Science, vol. 38, no. 3, 1987, p. 171-183.
    • (1987) Journal of the American Society for Information Science , vol.38 , Issue.3 , pp. 171-183
    • Can, F.1    Ozkarahan, E.A.2
  • 14
    • 0025597381 scopus 로고
    • Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases
    • Can, Fazli; Ozkarahan, Esen A. Concepts and effectiveness of the cover-coefficient-based clustering methodology for text databases. ACM Transactions on Database Systems, vol. 15, no. 4, 1990, p. 483-517.
    • (1990) ACM Transactions on Database Systems , vol.15 , Issue.4 , pp. 483-517
    • Can, F.1    Ozkarahan, E.A.2
  • 15
    • 0028516598 scopus 로고
    • Automatic concept classification of text from electronic meetings
    • Chen, H.; Hsu, P.; Orwig, R.; Hoopes, L; Nunamaker, F., Jr. Automatic concept classification of text from electronic meetings. Communications of the ACM. vol. 37, no. 10, 1994, p. 56-73.
    • (1994) Communications of the ACM , vol.37 , Issue.10 , pp. 56-73
    • Chen, H.1    Hsu, P.2    Orwig, R.3    Hoopes, L.4    Nunamaker Jr., F.5
  • 16
    • 0017551537 scopus 로고
    • Clustering large flies of documents using the single-link method
    • Croft, W. Bruce. Clustering large flies of documents using the single-link method. Journal of the American Society for Information Science, vol. 28, no. 6, 1977, p. 341-344.
    • (1977) Journal of the American Society for Information Science , vol.28 , Issue.6 , pp. 341-344
    • Croft, W.B.1
  • 17
    • 0016519169 scopus 로고
    • A file organization and maintenance procedure for dynamic document collections
    • Crouch, Donald B. A file organization and maintenance procedure for dynamic document collections. Information Processing and Management, vol. 11, no. 1/2, 1975, p. 11-21.
    • (1975) Information Processing and Management , vol.11 , Issue.1-2 , pp. 11-21
    • Crouch, D.B.1
  • 18
    • 13444302136 scopus 로고    scopus 로고
    • Document clustering in concept space; the NIST Information Retrieval Visualization Engine (NIRVE)
    • Cugini, John; Laskowski, Sharon; Piatko, Christine. "Document clustering in concept space; The NIST Information Retrieval Visualization Engine (NIRVE)". CODATA Euro-American Workshop, 1997.
    • (1997) CODATA Euro-American Workshop
    • Cugini, J.1    Laskowski, S.2    Piatko, C.3
  • 21
    • 0242652097 scopus 로고    scopus 로고
    • Feature selection and document clustering
    • M. W. Berry, ed., Springer
    • Dhillon, Inderjit; Kogan, Jacob; Nicholas, Charles. "Feature selection and document clustering". Survey of Text Mining. M. W. Berry, ed., Springer, 2004, p. 73-100.
    • (2004) Survey of Text Mining , pp. 73-100
    • Dhillon, I.1    Kogan, J.2    Nicholas, C.3
  • 23
    • 13444252179 scopus 로고    scopus 로고
    • Japanese source
  • 24
    • 84965770524 scopus 로고
    • Techniques for the measurement of clustering tendency in document retrieval systems
    • El-Hamdouchi, Abdelmoula; Willett, Peter. Techniques for the measurement of clustering tendency in document retrieval systems. Journal of Information Science, vol. 13, 1987, p. 361-365.
    • (1987) Journal of Information Science , vol.13 , pp. 361-365
    • El-Hamdouchi, A.1    Willett, P.2
  • 26
    • 13444303569 scopus 로고    scopus 로고
    • Simultaneous clustering and dynamic keyword weighting for text documents
    • M. W. Berry, ed. Springer
    • Frigui, Hichem; Nasraoui, Olfa. "Simultaneous clustering and dynamic keyword weighting for text documents". Survey of Text Mining. M. W. Berry, ed. Springer, 2004, p. 45-72.
    • (2004) Survey of Text Mining , pp. 45-72
    • Frigui, H.1    Nasraoui, O.2
  • 29
    • 0021496227 scopus 로고
    • Hierarchic agglomerative clustering methods for automatic classification
    • Griffith, Alan; Robinson, Lesley A.; Willett, Peter. Hierarchic agglomerative clustering methods for automatic classification. Journal of Documentation, vol. 40, no. 3, 1984, p. 175-205.
    • (1984) Journal of Documentation , vol.40 , Issue.3 , pp. 175-205
    • Griffith, A.1    Robinson, L.A.2    Willett, P.3
  • 35
    • 84880663504 scopus 로고    scopus 로고
    • The cluster-abstraction model: Unsupervised learning of topic hierarchies from text data
    • Hofmann, Thomas. "The cluster-abstraction model: unsupervised learning of topic hierarchies from text data". Proceedings of IJCAI-99, 1999.
    • (1999) Proceedings of IJCAI-99
    • Hofmann, T.1
  • 38
    • 13444249381 scopus 로고    scopus 로고
    • Japanese source
  • 40
    • 49649154957 scopus 로고
    • The use of hierarchic clustering in information retrieval
    • Jardin, N.; van Rijsbergen, C. J. The use of hierarchic clustering in information retrieval. Information Storage and Retrieval, vol. 7, no. 5, 1971, p. 217-240.
    • (1971) Information Storage and Retrieval , vol.7 , Issue.5 , pp. 217-240
    • Jardin, N.1    Van Rijsbergen, C.J.2
  • 41
    • 13444303736 scopus 로고
    • Nonhierarchic document clustering using a genetic algorithm
    • Jones, Gareth; Robertson, Alexander M.; Santimetvivul, Chawchat; Willett, Peter. Nonhierarchic document clustering using a genetic algorithm. Information Research, vol. 1, no. 1, 1995. http://informationr.net/ir/1-1/paper1.html
    • (1995) Information Research , vol.1 , Issue.1
    • Jones, G.1    Robertson, A.M.2    Santimetvivul, C.3    Willett, P.4
  • 42
    • 13444268840 scopus 로고    scopus 로고
    • Japanese source
  • 45
    • 13444267202 scopus 로고    scopus 로고
    • Japanese source
  • 47
    • 10844223124 scopus 로고    scopus 로고
    • Vector space models for search and cluster mining
    • M.W. Berry ed. Springer
    • Kobayashi, Mei; Aono, Masaki. "Vector space models for search and cluster mining". Survey of Text Mining. M.W. Berry ed. Springer, 2004. p. 103-122.
    • (2004) Survey of Text Mining , pp. 103-122
    • Kobayashi, M.1    Aono, M.2
  • 53
    • 13444258644 scopus 로고    scopus 로고
    • Text retrieval using self-organized document maps
    • Helsinki University of Technology, Laboratory of Computer and Information Science
    • Lagus, Krista. Text Retrieval Using Self-organized Document Maps. Technical Report A61. Helsinki University of Technology, Laboratory of Computer and Information Science, 2000.
    • (2000) Technical Report , vol.A61
    • Lagus, K.1
  • 59
    • 0032272403 scopus 로고    scopus 로고
    • TOPIC ISLANDS: A wavelet-based text visualization system
    • Miller, Nancy; Wong, Pak Chung; Brewster, Mary; Foote, Harlan. "TOPIC ISLANDS: a wavelet-based text visualization system". IEEE Visualization'98, 1998, p. 189-196.
    • (1998) IEEE Visualization'98 , pp. 189-196
    • Miller, N.1    Wong, P.C.2    Brewster, M.3    Foote, H.4
  • 60
    • 13444257259 scopus 로고    scopus 로고
    • Japanese source
  • 61
    • 66749176213 scopus 로고    scopus 로고
    • Document clustering and language models for system-mediated information access
    • Panos Constantopoulos and Ingeborg T. Solvberg eds. Springer, LNCS 2163
    • Muresan, Gheorghe; Harper, David J. "Document clustering and language models for system-mediated information access". Research and Advanced Technology for Digital Libraries: 5th European Conference, ECDL 2001. Panos Constantopoulos and Ingeborg T. Solvberg eds. Springer, LNCS 2163, 2001, p. 438-449.
    • (2001) Research and Advanced Technology for Digital Libraries: 5th European Conference, ECDL 2001 , pp. 438-449
    • Muresan, G.1    Harper, D.J.2
  • 62
    • 0021600424 scopus 로고
    • Structure of hierarchic clustering: Implications for information retrieval and for multivariate data analysis
    • Murtagh, F. Structure of hierarchic clustering: implications for information retrieval and for multivariate data analysis. Information Processing and Management, vol. 20, no. 5/6, 1984, p. 611-617.
    • (1984) Information Processing and Management , vol.20 , Issue.5-6 , pp. 611-617
    • Murtagh, F.1
  • 63
    • 11244321552 scopus 로고    scopus 로고
    • Hierarchical clustering using non-greedy principal direction divisive partitioning
    • Nilsson, Martin. Hierarchical clustering using non-greedy principal direction divisive partitioning. Information Retrieval, vol. 5, 2002, p. 311-321.
    • (2002) Information Retrieval , vol.5 , pp. 311-321
    • Nilsson, M.1
  • 69
    • 4944264472 scopus 로고    scopus 로고
    • Topic detection and tracking: Event clustering as a basis for first story detection
    • W. Bruce Croft ed. Kluwer Academic Publishers
    • Papka, Ron; Allan, James. "Topic detection and tracking: event clustering as a basis for first story detection". Advances in Information Retrieval. W. Bruce Croft ed. Kluwer Academic Publishers, 2000. p. 97-126.
    • (2000) Advances in Information Retrieval , pp. 97-126
    • Papka, R.1    Allan, J.2
  • 70
    • 0000019005 scopus 로고
    • Clustering algorithms
    • William B. Frakes and Ricardo Baeza-Yates eds. PTR Prentice Hall
    • Rasmussen, Edie. "Clustering algorithms". Information Retrieval: Data Structure and Algorithms. William B. Frakes and Ricardo Baeza-Yates eds. PTR Prentice Hall, 1992. p. 419-442.
    • (1992) Information Retrieval: Data Structure and Algorithms , pp. 419-442
    • Rasmussen, E.1
  • 71
    • 0035502266 scopus 로고    scopus 로고
    • Information navigation on the web by clustering and summarizing query results
    • Roussinov, Dmitri G.; Chen, Hsinchun. Information navigation on the web by clustering and summarizing query results. Information Processing and Management, vol. 37, 2001, p. 789-816.
    • (2001) Information Processing and Management , vol.37 , pp. 789-816
    • Roussinov, D.G.1    Chen, H.2
  • 75
    • 0002663098 scopus 로고
    • SLINK: An optimally efficient algorithm for the single-link cluster method
    • Sibson, R. SLINK: an optimally efficient algorithm for the single-link cluster method. The Computer Journal, vol. 16, no. 1, 1973, p. 30-34.
    • (1973) The Computer Journal , vol.16 , Issue.1 , pp. 30-34
    • Sibson, R.1
  • 78
    • 0001693049 scopus 로고    scopus 로고
    • Update on science mapping: Creating large documentation space
    • Small, Henry. Update on science mapping: creating large documentation space. Scientometrics. vol. 38, no. 2, 1997, p. 275-293.
    • (1997) Scientometrics , vol.38 , Issue.2 , pp. 275-293
    • Small, H.1
  • 84
    • 0006290526 scopus 로고
    • Further experiments with hierarchic document clustering in document retrieval
    • van Rijsbergen, C. J. Further experiments with hierarchic document clustering in document retrieval. Information Storage and Retrieval, vol. 10, no. 1, 1974, p. 1-14.
    • (1974) Information Storage and Retrieval , vol.10 , Issue.1 , pp. 1-14
    • Van Rijsbergen, C.J.1
  • 86
    • 0016664715 scopus 로고
    • Document clustering: An evaluation of some experiments with the cranfield 1400 collection
    • van Rijsbergen, C. J; Croft, W. B. Document clustering: an evaluation of some experiments with the Cranfield 1400 collection. Information Processing and Management, vol. 11, no. 5/7, 1975, p. 171-182.
    • (1975) Information Processing and Management , vol.11 , Issue.5-7 , pp. 171-182
    • Van Rijsbergen, C.J.1    Croft, W.B.2
  • 87
    • 0039218390 scopus 로고
    • A test for the separation of relevant and non-relevant documents in experimental retrieval collections
    • van Rijsbergen, C. J.; Sparck Jones, K. A test for the separation of relevant and non-relevant documents in experimental retrieval collections. Journal of Documentation, vol. 29, no. 3, 1973, p. 251-257.
    • (1973) Journal of Documentation , vol.29 , Issue.3 , pp. 251-257
    • Van Rijsbergen, C.J.1    Sparck Jones, K.2
  • 88
    • 0022906994 scopus 로고
    • Implementing agglomerative hierarchic clustering algorithms for use in document retrieval
    • Voorhees, Ellen M. Implementing agglomerative hierarchic clustering algorithms for use in document retrieval. Information Processing and Management, vol. 22, no. 6, 1986, p. 465-476.
    • (1986) Information Processing and Management , vol.22 , Issue.6 , pp. 465-476
    • Voorhees, E.M.1
  • 89
    • 0038156234 scopus 로고    scopus 로고
    • Evaluating contents-link coupled web page clustering for web search results
    • Wang, Yitong; Kitsuregawa, Masaru. "Evaluating contents-link coupled web page clustering for web search results". Proceedings of the 2002 ACM CIKM. 2002, p. 499-506.
    • (2002) Proceedings of the 2002 ACM CIKM , pp. 499-506
    • Wang, Y.1    Kitsuregawa, M.2
  • 90
    • 84860091777 scopus 로고    scopus 로고
    • Lightweight document clustering
    • No. RC-21684
    • Weiss, Sholom; White, Brian F.; Apte, Chidanand V. "Lightweight document clustering". IBM Research Report. No. RC-21684, 2000. http://www.research.ibm.com/dar/papers/pdf/weiss_ldc_with-cover.pdf
    • (2000) IBM Research Report
    • Weiss, S.1    White, B.F.2    Apte, C.V.3
  • 91
    • 0013293386 scopus 로고
    • Document clustering using an inverted file approach
    • Willett, Peter. Document clustering using an inverted file approach. Journal of Information Science, vol. 2, no. 5, 1980, p. 223-231.
    • (1980) Journal of Information Science , vol.2 , Issue.5 , pp. 223-231
    • Willett, P.1
  • 92
    • 0019663477 scopus 로고
    • A fast procedure for the calculation of similarity coefficients in automatic classification
    • Willett, Peter. A fast procedure for the calculation of similarity coefficients in automatic classification. Information Processing and Management, vol. 17, 1981, p. 53-60.
    • (1981) Information Processing and Management , vol.17 , pp. 53-60
    • Willett, P.1
  • 93
    • 3543147086 scopus 로고
    • Recent trends in hierarchic document clustering: A critical review
    • Willett, Peter. Recent trends in hierarchic document clustering: a critical review. Information Processing and Management, vol. 24, no. 5, 1988, p. 577-597.
    • (1988) Information Processing and Management , vol.24 , Issue.5 , pp. 577-597
    • Willett, P.1
  • 100
    • 0038156237 scopus 로고    scopus 로고
    • Evaluation of hierarchical clustering algorithms for document databases
    • Zhao, Ying; Karypis, George. "Evaluation of hierarchical clustering algorithms for document databases". Proceeding of the 2002 ACM CIKM. 2002, p. 515-524.
    • (2002) Proceeding of the 2002 ACM CIKM , pp. 515-524
    • Zhao, Y.1    Karypis, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.