Volume 59, Issue 2, 2006, Pages 213-230

Sampling, information extraction and summarisation of Hidden Web databases

Author keywords

Document sampling; Hidden Web databases; Information extraction

Indexed keywords

INFORMATION RETRIEVAL; QUERY LANGUAGES; SAMPLING; STORAGE ALLOCATION (COMPUTER); WORLD WIDE WEB;

EID: 33748195920     PISSN: 0169023X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.datak.2006.01.009     Document Type: Article
Times cited: 22

References (25)
  • 1
    • 1142303684
    • A. Arasu, H. Garcia-Molina, Extracting structured data from Web pages, in: Proceedings of the ACM International Conference on Management of Data (SIGMOD'03), pp. 337-348.
  • 2
    • 77953052174
    • Z. Bar-Yossef, S. Rajagopalan, Template detection via data mining and its applications, in: Proceedings of the WWW'02, pp. 580-591.
  • 3
    • 0003259187
    • M.K. Bergman, The Deep Web: Surfacing hidden value, The Journal of Electronic Publishing, University of Michigan, 2001 (accessed 10.01.05).
  • 4
    • 8644267730
    • D. Cai, S. Yu, J.R. Wen, W.Y. Ma, Block-based Web Search, in: Proceedings of the 27th Annual International Conference on Research and Development in Information Retrieval (SIGIR'04), pp. 456-463.
  • 6
    • 84944327150
    • V. Crescenzi, G. Mecca, P. Merialdo, ROADRUNNER: Towards automatic data extraction from large Web sites, in: Proceedings of the 27th International Conference on Very Large Data Bases (VLDB'01), pp. 109-118.
  • 7
    • 26844469211
    • S. Debnath, P. Mitra, C.L. Giles, Automatic extraction of informative blocks from Webpages, in: Proceedings of the Special Track on Web Technologies and Applications in the ACM Symposium of Applied Computing, 2005, pp. 1722-1726.
  • 9
    • 33748191316
    • M. Heß, O. Drobnik, Clustering specialised Web-databases by exploiting hyperlinks, in: Proceedings of the Second Asian Digital Library Conference, 1999.
  • 10
    • 35048890098
    • Y.L. Hedley, M. Younas, A. James, A TNATS approach to Hidden Web documents, in: Proceedings of the First International Conference on Distributed Computing & Internet Technology (ICDCIT'04), pp. 158-167.
  • 11
    • 18744389475
    • Y.L. Hedley, M. Younas, A. James, M. Sanderson, A two-phase sampling technique for information extraction from Hidden Web databases, in: Proceedings of the 6th ACM CIKM Workshop on Web Information and Data Management (WIDM'04), pp. 1-8.
  • 12
    • 1842832183
    • J.P. Lage, A.S. da Silva, P.B. Golgher, A.H.F. Laender, Automatic generation of agents for collecting Hidden Web pages for data extraction, Data and Knowledge Engineering 49 (2) (2004) 177-196.
  • 13
    • 84937429789
    • S.W. Liddle, S.H. Yau, D.W. Embley, On the automatic extraction of data from the Hidden Web, in: Proceedings of the 20th International Conference on Conceptual Modeling Workshops (ER'01), pp. 212-226.
  • 14
    • 70349109840
    • K.I. Lin, H. Chen, Automatic information discovery from the Invisible Web, in: Proceedings of the International Conference on Information Technology: Coding and Computing (ITCC'02), pp. 332-337.
  • 16
    • 33748127231
    • Open Directory Project (ODP).
  • 17
    • 84944325093
    • S. Raghavan, H. Garcia-Molina, Crawling the Hidden Web, in: Proceedings of the 27th International Conference on Very Large Data Bases (VLDB'01), pp. 129-138.
  • 18
    • 0034785512
    • B. Rahardjo, R. Yap, Automatic information extraction from Web pages, in: Proceedings of the 24th Annual International ACM Conference (SIGIR'01), pp. 430-431.
  • 19
    • 18744380220
    • L. Ramaswamy, A. Iyengar, L. Liu, F. Douglis, Automatic detection of fragments in dynamically generated Web pages, in: Proceedings of the 13th World Wide Web Conference, 2004, pp. 443-454.
  • 23
    • 33748144381
    • C. Sherman, The Invisible Web, 2001 (accessed 01.03.05).
  • 24
    • 18744381159
    • R. Song, H. Liu, J.R. Wen, W.Y. Ma, Learning block importance models for Web pages, in: Proceedings of the 13th World Wide Web Conference, 2004, pp. 203-211.
  • 25
    • 0033705619
    • A. Sugiura, O. Etzioni, Query routing for Web search engines: architecture and experiments, in: Proceedings of the 9th International World Wide Web Conference: The Web: The Next Generation (WWW9), 2000, pp. 417-430.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.