Volume , Issue , 2003, Pages 43-48

Web page cleaning for web mining through feature weighting

Author keywords

[No Author keywords available]

Indexed keywords

COMMON STRUCTURES; FEATURE WEIGHTING; LARGE AMOUNTS; NODE IMPORTANCE; WEB PAGE CLASSIFICATION; WEB PAGE CLUSTERING; WEB PAGE NOISE; WEIGHTING METHODS;

EID: 84880811191     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 60

References (21)
  • 4
    • 77953052174
    • Template detection via data mining and its applications
    • Ziv Bar-Yossef and Sridhar Rajagopalan. Template detection via data mining and its applications, WWW-2002, 2002.
    • (2002) WWW-2002
    • Bar-Yossef, Z.1    Rajagopalan, S.2
  • 5
  • 6
    • 0032674505
    • Statistical models for text segmentation
    • Doug Beeferman, Adam Berger and John Lafferty. Statistical models for text segmentation. Machine Learning, 34 (1-3): 177-210, 1999.
    • (1999) Machine Learning , vol.34 , Issue.1-3 , pp. 177-210
    • Beeferman, D.1    Berger, A.2    Lafferty, J.3
  • 7
    • 0034791059
    • Enhanced topic distillation using text, markup tags, and hyperlinks
    • Soumen Chakrabarti, Mukul M. Joshi and Vivek B. Tawde. Enhanced topic distillation using text, markup tags, and hyperlinks. SIGIR-2001.
    • SIGIR-2001
    • Chakrabarti, S.1    Joshi, M.M.2    Tawde, V.B.3
  • 9
    • 0027709747
    • Subtopic structuring for full-length document access
    • Marti A. Hearst and Christian Plaunt. Subtopic structuring for full-length document access. SIGIR-93, 1993.
    • (1993) SIGIR-93
    • Hearst, M.A.1    Plaunt, C.2
  • 10
    • 0000636553
    • Text categorization with support vector machines: Learning with many relevant features
    • Thorsten Joachims. Text categorization with support vector machines: learning with many relevant features. ECML-1997, 1997.
    • (1997) ECML-1997
    • Joachims, T.1
  • 11
    • 0002714543
    • Making large-Scale SVM Learning Practical
    • B. Scholkopf and C. Burges and A. Smola (ed.), MIT-Press
    • Thorsten Joachims. Making large-Scale SVM Learning Practical. Advances in Kernel Methods - Support Vector Learning, B. Scholkopf and C. Burges and A. Smola (ed.), MIT-Press, 1999.
    • (1999) Advances in Kernel Methods - Support Vector Learning
    • Joachims, T.1
  • 12
    • 84880844513
    • Entropy-Based Link Analysis for Mining Web Informative Structures
    • Hung-Yu Kao, Ming-Syan Chen, Shian-Hua Lin, and Jan-Ming Ho. Entropy-Based Link Analysis for Mining Web Informative Structures. CIKM-2002, 2002.
    • (2002) CIKM-2002
    • Kao, H.-Y.1    Chen, M.-S.2    Lin, S.-H.3    Ho, J.-M.4
  • 13
    • 8844283642
    • Cohesion and collocation: Using context vectors in text segmentation
    • Stefan Kaufmann. Cohesion and collocation: Using context vectors in text segmentation. ACL-1999, 1999.
    • (1999) ACL-1999
    • Kaufmann, S.1
  • 15
    • 63249124034
    • Learning to remove Internet advertisements
    • Nicholas Kushmerick. Learning to remove Internet advertisements. Agents-1999, 1999.
    • (1999) Agents-1999
    • Kushmerick, N.1
  • 16
    • 0034592786
    • Intelliclean: A knowledge-based intelligent data cleaner
    • Mong Li Lee, Tok Wang Ling, Wai Lup Low. Intelliclean: A knowledge-based intelligent data cleaner. SIGKDD-2000, 2000.
    • (2000) SIGKDD-2000
    • Lee, M.L.1    Ling, T.W.2    Low, W.L.3
  • 17
    • 0242456776
    • Discovering informative content blocks from Web documents
    • Shian-Hua Lin and Jan-Ming Ho. Discovering informative content blocks from Web documents. SIGKDD-2002, 2002.
    • (2002) SIGKDD-2002
    • Lin, S.-H.1    Ho, J.-M.2
  • 18
    • 85149127550
    • Statistical Models for Topic Segmentation
    • Jeffrey C. Reynar. Statistical Models for Topic Segmentation. ACL-99, 1999.
    • (1999) ACL-99
    • Reynar, J.C.1
  • 20
    • 84856043672
    • A Mathematical Theory of Communication
    • July and October
    • Shannon, C. A Mathematical Theory of Communication. Bell System Technical Journal, Vol. 27, pp. 379-423 and 623-656, July and October, 1948.
    • (1948) Bell System Technical Journal , vol.27
    • Shannon, C.1
  • 21
    • 0003141935
    • A comparative study of feature selection in text categorization
    • Yiming Yang, Jan O. Pedersen. A comparative study of feature selection in text categorization. ICML-97, 1997.
    • (1997) ICML-97
    • Yang, Y.1    Pedersen, J.O.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.