메뉴 건너뛰기




Volumn , Issue , 2009, Pages 37-41

Efficient web page main text extraction towards online news analysis

Author keywords

Web content analysis; Web information extraction

Indexed keywords

DOM TREE; HTML SOURCES; NEWS CONTENT; ONLINE NEWS; SIMPLE APPROACH; SOLUTION PROCESS; TEXT CONTENT; TEXT EXTRACTION; TEXT STRING; WEB CONTENT ANALYSIS; WEB INFORMATION EXTRACTION; WEB PAGE;

EID: 77951027461     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICEBE.2009.15     Document Type: Conference Paper
Times cited : (9)

References (12)
  • 1
    • 77951050289 scopus 로고    scopus 로고
    • HTML, available at
    • HTML, available at http://www.w3.org/html/.
  • 2
    • 77951067070 scopus 로고    scopus 로고
    • Document Object Model (DOM), available at
    • Document Object Model (DOM), available at http://www.w3.org/DOM/.
  • 4
    • 19944413623 scopus 로고    scopus 로고
    • WISDOM: Web intrapage informative structure mining based on document object model
    • H.-Y. Kao, J.-M. Ho and M.-S. Chen, "WISDOM: Web intrapage informative structure mining based on document object model," IEEE Transactions on Knowledge and Data Engineering, Vol. 17, No. 5, pp. 614-627, 2005.
    • (2005) IEEE Transactions on Knowledge and Data Engineering , vol.17 , Issue.5 , pp. 614-627
    • Kao, H.-Y.1    Ho, J.-M.2    Chen, M.-S.3
  • 9
    • 77951087624 scopus 로고    scopus 로고
    • available at
    • HTML 4 Block-Level Elements, available at http://htmlhelp.com/reference/ html40/block.html.
    • HTML 4 Block-Level Elements
  • 10
    • 77951075795 scopus 로고    scopus 로고
    • available at
    • Yahoo! News, available at http://news.yahoo.com.
  • 11
    • 77951044410 scopus 로고    scopus 로고
    • available at
    • HTML Parser, available at http://htmlparser.sourceforge.net/.
  • 12
    • 77951072548 scopus 로고    scopus 로고
    • available at
    • Wikipedia, available at http://www.wikipedia.org/..


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.