메뉴 건너뛰기




Volumn , Issue , 2007, Pages 450-453

Using XPath to discover informative content blocks of web pages

Author keywords

[No Author keywords available]

Indexed keywords

CLUTTER (INFORMATION THEORY); COMPUTER SOFTWARE; FOOD PROCESSING; INFORMATION RETRIEVAL; INFORMATION SERVICES; INFORMATION THEORY; MARKUP LANGUAGES; SEARCH ENGINES; SEMANTICS; WEBSITES;

EID: 50149098792     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/SKG.2007.106     Document Type: Conference Paper
Times cited : (7)

References (20)
  • 2
    • 77953052174 scopus 로고    scopus 로고
    • Template detection via data mining and its applications
    • Z. Bar-Yossef and S. Rajagopalan. Template detection via data mining and its applications. In WWW, pages 580-591, 2002.
    • (2002) , pp. 580-591
    • Bar-Yossef, Z.1    Rajagopalan, S.2
  • 4
    • 26844469211 scopus 로고    scopus 로고
    • S. Debnath, P. Mitra, and C. L. Giles. Automatic extraction of informative blocks from webpages. In H. Haddad, L. M. Liebrock, A. Omicini, and R. L. Wainwright, editors, SAC, pages 1722-1726. ACM, 2005.
    • S. Debnath, P. Mitra, and C. L. Giles. Automatic extraction of informative blocks from webpages. In H. Haddad, L. M. Liebrock, A. Omicini, and R. L. Wainwright, editors, SAC, pages 1722-1726. ACM, 2005.
  • 5
  • 6
    • 50149086880 scopus 로고    scopus 로고
    • Adaptive filtering of advertisements on web
    • Chiba, Japan, May 10-14
    • B. Esfandiari and R. Nock. Adaptive filtering of advertisements on web pages. In WWW (Special interest tracks and posters), pages 916-917, Chiba, Japan, May 10-14 2005.
    • (2005) WWW (Special interest tracks and posters) , pp. 916-917
    • Esfandiari, B.1    Nock, R.2
  • 8
    • 77953053369 scopus 로고    scopus 로고
    • The volume and evolution of web page templates
    • A. Ellis and T. Hagino, editors, ACM
    • D. Gibson, K. Punera, and A. Tomkins. The volume and evolution of web page templates. In A. Ellis and T. Hagino, editors, WWW (Special interest tracks and posters), pages 830-839. ACM, 2005.
    • (2005) WWW (Special interest tracks and posters) , pp. 830-839
    • Gibson, D.1    Punera, K.2    Tomkins, A.3
  • 9
    • 84880498138 scopus 로고    scopus 로고
    • Dombased content extraction of html documents
    • S. Gupta, G. E. Kaiser, D. Neistadt, and P. Grimm. Dombased content extraction of html documents. In WWW, pages 207-214, 2003.
    • (2003) , pp. 207-214
    • Gupta, S.1    Kaiser, G.E.2    Neistadt, D.3    Grimm, P.4
  • 10
    • 50149112994 scopus 로고    scopus 로고
    • http://adblock.mozdev.org.
  • 11
    • 50149094227 scopus 로고    scopus 로고
    • http://people.apache.org/andyc/neko/doc/html/.
  • 12
    • 50149098339 scopus 로고    scopus 로고
    • http://www.w3.org/DOM.
  • 13
    • 50149089354 scopus 로고    scopus 로고
    • http://www.w3.org/TR/xpath.
  • 15
    • 0242456776 scopus 로고    scopus 로고
    • S.-H. Lin and J.-M. Ho. Discovering informative content blocks from web documents. In KDD, pages 588-593. ACM, 2002.
    • S.-H. Lin and J.-M. Ho. Discovering informative content blocks from web documents. In KDD, pages 588-593. ACM, 2002.
  • 17
    • 0003780986 scopus 로고    scopus 로고
    • The pagerank citation ranking: Bringing order to the web
    • Technical report
    • L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, 1999.
    • (1999)
    • Page, L.1    Brin, S.2    Motwani, R.3    Winograd, T.4
  • 19
    • 34547631600 scopus 로고    scopus 로고
    • A fast and robust method for web page template detection and removal
    • P. S. Yu, V. J. Tsotras, E. A. Fox, and B. Liu, editors, ACM
    • K. Vieira, A. S. da Silva, N. Pinto, E. S. de Moura, J. M. B. Cavalcanti, and J. Freire. A fast and robust method for web page template detection and removal. In P. S. Yu, V. J. Tsotras, E. A. Fox, and B. Liu, editors, CIKM, pages 258-267. ACM, 2006.
    • (2006) CIKM , pp. 258-267
    • Vieira, K.1    da Silva, A.S.2    Pinto, N.3    de Moura, E.S.4    Cavalcanti, J.M.B.5    Freire, J.6
  • 20
    • 77952370025 scopus 로고    scopus 로고
    • L. Yi, B. Liu, and X. Li. Eliminating noisy information in web pages for data mining. In L. Getoor, T. E. Senator, P. Domingos, and C. Faloutsos, editors, KDD, pages 296-305. ACM, 2003.
    • L. Yi, B. Liu, and X. Li. Eliminating noisy information in web pages for data mining. In L. Getoor, T. E. Senator, P. Domingos, and C. Faloutsos, editors, KDD, pages 296-305. ACM, 2003.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.