



Volume , Issue , 2006, Pages 511-518

Improving web page clustering through selecting appropriate term weighting functions

Author keywords

[No Author keywords available]

Indexed keywords

INFORMATION EXTRACTIONS; REDUCTION METHODS; SIMILARITY SEARCHES; TERM WEIGHTING; WEB DOCUMENTS; WEB INFORMATION EXTRACTIONS; WEB MININGS; WEB PAGE CLUSTERING; WEB PAGES;

EID: 51849128804     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDIM.2007.369244     Document Type: Conference Paper
Times cited : (3)

References (30)
  • 1
    • 84871059997
    • Evaluation of Web Page Representations by Content through Clustering. String Processing and Information Retrieval
    • A. Casillas, V. Fresno, M. González de Lena and R. Martínez. "Evaluation of Web Page Representations by Content through Clustering". String Processing and Information Retrieval. LNCS series of Springer-Verlag, 129-130, 2004.
    • (2004) LNCS series of Springer-Verlag , vol.129-130
    • Casillas, A.1    Fresno, V.2    González de Lena, M.3    Martínez, R.4
  • 3
    • 3543096195
    • An Analytical Approach to Concept Extraction in HTML Environments
    • Kluwer Academic Publishers
    • V. Fresno and A. Ribeiro. "An Analytical Approach to Concept Extraction in HTML Environments". Journal of Intelligent Information Systems - JIIS. Kluwer Academic Publishers, 215-235, 2004.
    • (2004) Journal of Intelligent Information Systems - JIIS , pp. 215-235
    • Fresno, V.1    Ribeiro, A.2
  • 5
    • 0345566259
    • THESUS: Organizing Web Document Collections Based on Link Semantics
    • M. Halkidi, B. Nguyen, I. Varlamis and M. Vazirgiannis. "THESUS: Organizing Web Document Collections Based on Link Semantics". In VLDB Journal, special issue on Semantic Web, 2003.
    • (2003) VLDB Journal , Issue.SPEC. ISSUE ON SEMANTIC WEB
    • Halkidi, M.1    Nguyen, B.2    Varlamis, I.3    Vazirgiannis, M.4
  • 9
    • 84953744816
    • A statistical interpretation of term specificity and its application in retrieval
    • S. Jones. "A statistical interpretation of term specificity and its application in retrieval". Journal of Documentation, Vol. 28, N. 1, 11-21, 1972.
    • (1972) Journal of Documentation , vol.28 , Issue.1 , pp. 11-21
    • Jones, S.1
  • 10
    • 0038163983
    • CLUTO: A Clustering Toolkit
    • Technical Report: 02-017. University of Minnesota, Department of Computer Science, Minneapolis, MN 55455
    • G. Karypis. "CLUTO: A Clustering Toolkit". Technical Report: 02-017. University of Minnesota, Department of Computer Science, Minneapolis, MN 55455.
    • Karypis, G.1
  • 11
    • 51849095425
    • Improving interactive retrieval by combining ranked lists and clustering
    • A. Leuski and J. Allan. "Improving interactive retrieval by combining ranked lists and clustering". Proceedings of RIAO2000, 665-681, 2000.
  • 12
    • 0000159640
    • A statistical approach to mechanized encoding and searching of literary information
    • H. P. Luhn. "A statistical approach to mechanized encoding and searching of literary information". IBM Journal of Research and Development, Vol. 1, N. 4, 307-319, 1957.
    • (1957) IBM Journal of Research and Development , vol.1 , Issue.4 , pp. 307-319
    • Luhn, H.P.1
  • 15
    • 0037660997
    • An indexing model of HTML documents
    • A. Molinari, G. Passi and R. A. Marques Pereira. "An indexing model of HTML documents". SAC '03: Proceedings of the 2003 ACM symposium on Applied computing, Melbourne, Florida, 834-840, 2003.
  • 18
    • 0013362290
    • Reprinted in Sparck Jones, Karen, and Peter Willet, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, 1997
    • M.F. Porter. "An algorithm for suffix stripping". Reprinted in Sparck Jones, Karen, and Peter Willet, Readings in Information Retrieval, San Francisco: Morgan Kaufmann, 1997.
    • An algorithm for suffix stripping
    • Porter, M.F.1
  • 19
    • 51849158288
    • A Fuzzy System for the Web Page Representation
    • A. Ribeiro, V. Fresno, M. García-Alegre and D. Guinea. "A Fuzzy System for the Web Page Representation". Intelligent Exploration of the Web, Springer-Verlag Group, 19-38, 2002.
  • 20
    • 84953588425
    • On the specification of term values in automatic indexing
    • G. Salton, C. S. Yang. "On the specification of term values in automatic indexing". Journal of Documentation, Vol. 29, N. 4, 351-372, 1973.
    • (1973) Journal of Documentation , vol.29 , Issue.4 , pp. 351-372
    • Salton, G.1    Yang, C.S.2
  • 23
    • 0002442796
    • Machine Learning in Automated Text Categorization
    • F. Sebastiani. "Machine Learning in Automated Text Categorization". ACM Computing Surveys, Vol. 34, N. 1, 1-47, 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.1 , pp. 1-47
    • Sebastiani, F.1
  • 24
    • 0242602366
    • A Large Benchmark Dataset for Web Document Clustering. Soft Computing Systems: Design, Management and Applications
    • M. P. Sinka and D. W. Corne. "A Large Benchmark Dataset for Web Document Clustering". Soft Computing Systems: Design, Management and Applications, Frontiers in Artificial Intelligence and Applications, Vol. 87, 881-890, 2002.
    • (2002) Frontiers in Artificial Intelligence and Applications , vol.87 , pp. 881-890
    • Sinka, M.P.1    Corne, D.W.2
  • 26
    • 26844550931
    • Text categorization based on Weighted Inverse Document Frequency
    • Technical Report 94 TR0001, Department of Computer Science, Tokyo Institute of Technology
    • T. Tokunaga and M. Iwayama. "Text categorization based on Weighted Inverse Document Frequency". Technical Report 94 TR0001, Department of Computer Science, Tokyo Institute of Technology, 1994.
    • (1994)
    • Tokunaga, T.1    Iwayama, M.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.