메뉴 건너뛰기




Volumn 67, Issue 1, 2008, Pages 30-50

Improving density-based methods for hierarchical clustering of web pages

Author keywords

Average linkage; Density based approaches; Hierarchical clustering; Single linkage; Web clustering

Indexed keywords

CHLORINE COMPOUNDS; CLUSTER ANALYSIS; FLOW OF SOLIDS; INDUSTRIAL MANAGEMENT; INFORMATION MANAGEMENT; MANAGEMENT INFORMATION SYSTEMS;

EID: 50049106015     PISSN: 0169023X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.datak.2008.06.006     Document Type: Article
Times cited : (42)

References (44)
  • 1
    • 18844383409 scopus 로고    scopus 로고
    • Clustering documents into a web directory for bootstrapping a supervised classification
    • Adami G., Avesani P., and Sona D. Clustering documents into a web directory for bootstrapping a supervised classification. Data & Knowledge Engineering 54 (2005) 301-325
    • (2005) Data & Knowledge Engineering , vol.54 , pp. 301-325
    • Adami, G.1    Avesani, P.2    Sona, D.3
  • 3
    • 0025447750 scopus 로고    scopus 로고
    • *-tree: an efficient and robust access method for points and rectangles, International Conference on Management of Aata ACM SIGMOD'90, 19(2) (1990) 322-331.
    • *-tree: an efficient and robust access method for points and rectangles, International Conference on Management of Aata ACM SIGMOD'90, 19(2) (1990) 322-331.
  • 4
    • 50049086770 scopus 로고    scopus 로고
    • S. Brin, L. Page, The anatomy of a large scale hypertextual web search engine, in: Proceedings of WWW7, Brisbane, Australia, 1998.
    • S. Brin, L. Page, The anatomy of a large scale hypertextual web search engine, in: Proceedings of WWW7, Brisbane, Australia, 1998.
  • 5
    • 0032090684 scopus 로고    scopus 로고
    • S. Chakrabarti, B. Dom, P. Indyk, Enhanced Hypertext Categorization Using Hyperlinks, in: Proceedings of SIGMOD98, 1998, pp. 307-318.
    • S. Chakrabarti, B. Dom, P. Indyk, Enhanced Hypertext Categorization Using Hyperlinks, in: Proceedings of SIGMOD98, 1998, pp. 307-318.
  • 6
    • 84993661659 scopus 로고    scopus 로고
    • P. Ciaccia, M. Patella, P. Zezula, M-tree: an efficient access method for similarity search in metric spaces, in: Proceedings of the 23rd VLDB, 1997.
    • P. Ciaccia, M. Patella, P. Zezula, M-tree: an efficient access method for similarity search in metric spaces, in: Proceedings of the 23rd VLDB, 1997.
  • 10
    • 21244481246 scopus 로고    scopus 로고
    • W.H.E. Day, H. Edelsbrunner, Investigation of Proportional Link Linkage Clustering Methods, Journal of Classification, vol. 2, Springer-Verlag, New York Inc., 1985, pp. 239-254.
    • W.H.E. Day, H. Edelsbrunner, Investigation of Proportional Link Linkage Clustering Methods, Journal of Classification, vol. 2, Springer-Verlag, New York Inc., 1985, pp. 239-254.
  • 11
    • 50049089878 scopus 로고    scopus 로고
    • J. Dean, M. Henzinger, Finding related page in the World Wide Web, in: Proceedings of WWW8, 1999.
    • J. Dean, M. Henzinger, Finding related page in the World Wide Web, in: Proceedings of WWW8, 1999.
  • 13
    • 85170282443 scopus 로고    scopus 로고
    • A density-based algorithm for discovering clusters in large spatial databases with noise
    • Ester M., Kriegel H.-P., Sander J., and Xu X. A density-based algorithm for discovering clusters in large spatial databases with noise. KDD'96 (1996) 226-231
    • (1996) KDD'96 , pp. 226-231
    • Ester, M.1    Kriegel, H.-P.2    Sander, J.3    Xu, X.4
  • 14
    • 50049115378 scopus 로고    scopus 로고
    • T. Haveliwala, A. Gionis, D. Klein, P. Indyk, Similarity Search on the Web: Evaluation and Scalability Considerations, Extended Technical Report.
    • T. Haveliwala, A. Gionis, D. Klein, P. Indyk, Similarity Search on the Web: Evaluation and Scalability Considerations, Extended Technical Report.
  • 16
    • 50049101284 scopus 로고    scopus 로고
    • J. Hou, Y. Zhang, J. Cao, Web page clustering: a hyperlink-based similarity and matrix-based hierarchical algorithms, in: Proceedings of APWeb'03, Xian China, LNCS, 2003.
    • J. Hou, Y. Zhang, J. Cao, Web page clustering: a hyperlink-based similarity and matrix-based hierarchical algorithms, in: Proceedings of APWeb'03, Xian China, LNCS, 2003.
  • 22
    • 50049088324 scopus 로고    scopus 로고
    • B. Larsen, C. Aone, Fast and effective text mining using linear-time document clustering, SIGKDD'99, San Diego, CA, 1999, pp. 16-22.
    • B. Larsen, C. Aone, Fast and effective text mining using linear-time document clustering, SIGKDD'99, San Diego, CA, 1999, pp. 16-22.
  • 23
    • 50049109820 scopus 로고    scopus 로고
    • M. Marchiori, The quest for correct information on the web: hyper search engines, in: Proceedings of the 6th International Word Wide Web Conference, 1997.
    • M. Marchiori, The quest for correct information on the web: hyper search engines, in: Proceedings of the 6th International Word Wide Web Conference, 1997.
  • 24
    • 50049098122 scopus 로고    scopus 로고
    • J. McQueen, Some methods for classification and analysis of multivariate observations, in: Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1967, pp. 281-297.
    • J. McQueen, Some methods for classification and analysis of multivariate observations, in: Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1967, pp. 281-297.
  • 25
    • 34250115918 scopus 로고
    • An examination of procedures for detecting the number of clusters in a data set
    • Milligan G.W., and Cooper M.C. An examination of procedures for detecting the number of clusters in a data set. Psychometrika 50 (1985) 159-179
    • (1985) Psychometrika , vol.50 , pp. 159-179
    • Milligan, G.W.1    Cooper, M.C.2
  • 26
    • 0029719632 scopus 로고    scopus 로고
    • P. Pirolli, J. Pitkow, R. Rao, Silk from a Sow's ear: extracting usable structures from the Web, in: Proceedings of ACM SIGCHI Conference on Human Factors in Computing, 1996.
    • P. Pirolli, J. Pitkow, R. Rao, Silk from a Sow's ear: extracting usable structures from the Web, in: Proceedings of ACM SIGCHI Conference on Human Factors in Computing, 1996.
  • 28
    • 0030644761 scopus 로고    scopus 로고
    • J. Pitkow, P. Pirolli, Life, death, and lawfulness on the electronic frontier, in: Proceedings of ACM CHI'97, 1997, pp. 383-390.
    • J. Pitkow, P. Pirolli, Life, death, and lawfulness on the electronic frontier, in: Proceedings of ACM CHI'97, 1997, pp. 383-390.
  • 31
    • 34948906849 scopus 로고    scopus 로고
    • T. Syeda-Mahmood, F. Wang, Unsupervised clustering using multi-resolution perceptual grouping, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007, pp. 1-8.
    • T. Syeda-Mahmood, F. Wang, Unsupervised clustering using multi-resolution perceptual grouping, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007, pp. 1-8.
  • 32
    • 50049111236 scopus 로고    scopus 로고
    • M. Steinbach et al. A Comparison of Document Clustering Techniques, KDD'2000, Technical report of University of Minnesota, 2000.
    • M. Steinbach et al. A Comparison of Document Clustering Techniques, KDD'2000, Technical report of University of Minnesota, 2000.
  • 33
    • 7444269028 scopus 로고    scopus 로고
    • X. Wang, H.J. Hamilton, DBRS: A Density-Based Spatial Clustering Method with Random Sampling, PAKDD, Korea, 2003, pp. 563-575.
    • X. Wang, H.J. Hamilton, DBRS: A Density-Based Spatial Clustering Method with Random Sampling, PAKDD, Korea, 2003, pp. 563-575.
  • 34
    • 84974727323 scopus 로고    scopus 로고
    • Y. Wang, M. Kitsuregawa, Link based clustering of web search results, in: Second International Conference on Advances in Web-Age Information Management (WAIM), 2001, pp. 225-236.
    • Y. Wang, M. Kitsuregawa, Link based clustering of web search results, in: Second International Conference on Advances in Web-Age Information Management (WAIM), 2001, pp. 225-236.
  • 35
    • 84949743789 scopus 로고    scopus 로고
    • Y. Wang, M. Kitsuregawa, On combining link and contents information for web page clustering, in: Proceedings of the 13th International Conference on Database and Expert Systems Applications, 2002, pp. 902-913.
    • Y. Wang, M. Kitsuregawa, On combining link and contents information for web page clustering, in: Proceedings of the 13th International Conference on Database and Expert Systems Applications, 2002, pp. 902-913.
  • 36
    • 51849152371 scopus 로고    scopus 로고
    • Y. Wang, M. Kitsuregawa, Use link-based clustering to improve web search results, in: Second International Conference on Web Information Systems Engineering (WISE), 2001, pp. 119-128.
    • Y. Wang, M. Kitsuregawa, Use link-based clustering to improve web search results, in: Second International Conference on Web Information Systems Engineering (WISE), 2001, pp. 119-128.
  • 38
    • 84974662382 scopus 로고    scopus 로고
    • C.W. Wen, H. Liu, W.X. Wen, J. Zheng, A distributed hierarchical clustering system for web mining, in: Proceedings of the Second International Conference on Web-Age Information Management (WAIM2001), 2001, pp. 103-113.
    • C.W. Wen, H. Liu, W.X. Wen, J. Zheng, A distributed hierarchical clustering system for web mining, in: Proceedings of the Second International Conference on Web-Age Information Management (WAIM2001), 2001, pp. 103-113.
  • 40
    • 0014976008 scopus 로고
    • Graph-theoretical methods for detecting and describing gestalt structures
    • Zahn C.T. Graph-theoretical methods for detecting and describing gestalt structures. IEEE Transactions on Computers C-20 (1971) 68-86
    • (1971) IEEE Transactions on Computers , vol.C-20 , pp. 68-86
    • Zahn, C.T.1
  • 41
    • 0032268443 scopus 로고    scopus 로고
    • O. Zamir, O. Etzioni, Web document clustering: a feasibility demonstration, in: Proceedings of SIGIR' 98 Melbourne, Australia 1998.
    • O. Zamir, O. Etzioni, Web document clustering: a feasibility demonstration, in: Proceedings of SIGIR' 98 Melbourne, Australia 1998.
  • 42
    • 3543085722 scopus 로고    scopus 로고
    • Empirical and theoretical comparisons of selected criterion functions for document clustering
    • Zhao Y., and Karypis G. Empirical and theoretical comparisons of selected criterion functions for document clustering. Machine Learning 55 3 (2004) 311-331
    • (2004) Machine Learning , vol.55 , Issue.3 , pp. 311-331
    • Zhao, Y.1    Karypis, G.2
  • 43
    • 0038156237 scopus 로고    scopus 로고
    • Y. Zhao, G. Karypis, Evaluation of Hierarchical Clustering Algorithms for Document Datasets, CIKM'02, 2002.
    • Y. Zhao, G. Karypis, Evaluation of Hierarchical Clustering Algorithms for Document Datasets, CIKM'02, 2002.
  • 44
    • 24044537630 scopus 로고    scopus 로고
    • Hierarchical clustering algorithms for document datasets
    • Zhao Y., and Karypis G. Hierarchical clustering algorithms for document datasets. Data Mining and Knowledge Discovery 10 (2005) 141-168
    • (2005) Data Mining and Knowledge Discovery , vol.10 , pp. 141-168
    • Zhao, Y.1    Karypis, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.