2008, Pages 550-559

xCrawl: A high-recall crawling method for web mining

Author keywords

[No Author keywords available]

Indexed keywords

APPLICATION SCENARIO; FACT EXTRACTION; FOCUSED CRAWLING; INFORMATION EXTRACTION; NAVIGATIONAL STRUCTURES; PRODUCT AND SERVICES; QUERY GENERATION; REDUNDANT DATA; TECHNIQUES USED; WEB DOCUMENT; WEB MINING; WEB PAGE; WEB SOURCES

EID: 67049169361     PISSN: 15504786     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDM.2008.121     Document Type: Conference Paper
Times cited: 3

References (19)
  • 4
    • 0038589165
    • The anatomy of a large-scale hypertextual web search engine
    • S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117, 1998.
    • (1998) Computer Networks and ISDN Systems , vol.30 , Issue.1-7 , pp. 107-117
    • Brin, S.1    Page, L.2
  • 6
    • 0033294474
    • Focused crawling: A new approach to topic-specific web resource discovery
    • S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: a new approach to topic-specific web resource discovery. Computer Networks, 31(11-16):1623-1640, 1999.
    • (1999) Computer Networks , vol.31 , Issue.11-16 , pp. 1623-1640
    • Chakrabarti, S.1    Van Den Berg, M.2    Dom, B.3
  • 10
    • 36248993305
    • Focused Web crawling: A generic framework for specifying the user interest and for adaptive crawling strategies
    • Roma, Italy
    • M. Ester, M. Grob, and H. Kriegel. Focused Web crawling: A generic framework for specifying the user interest and for adaptive crawling strategies. In Proceedings of 27th International Conference on Very Large Data Bases, pages 321-329, Roma, Italy, 2001.
    • (2001) Proceedings of 27th International Conference on Very Large Data Bases , pp. 321-329
    • Ester, M.1    Grob, M.2    Kriegel, H.3
  • 11
    • 35348900845
    • Towards domain-independent information extraction from web tables
    • DOI 10.1145/1242572.1242583, 16th International World Wide Web Conference, WWW2007
    • W. Gatterbauer, P. Bohunsky, M. Herzog, B. Krüpl, and B. Pollak. Towards domain-independent information extraction from web tables. In Proceedings of the 16th International World Wide Web conference, pages 71-80, Banff, Alberta, Canada, 2007. (Pubitemid 47582240)
    • (2007) 16th International World Wide Web Conference, WWW2007 , pp. 71-80
    • Gatterbauer, W.1    Bohunsky, P.2    Herzog, M.3    Krupl, B.4    Pollak, B.5
  • 12
    • 34250654176
    • To search or to crawl?: Towards a query optimizer for text-centric tasks
    • DOI 10.1145/1142473.1142504, SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data
    • P. G. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano. To search or to crawl? Towards a query optimizer for text-centric tasks. In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pages 265-276, New York, NY, USA, 2006. (Pubitemid 46946519)
    • (2006) Proceedings of the ACM SIGMOD International Conference on Management of Data , pp. 265-276
    • Ipeirotis, P.G.1    Agichtein, E.2    Jain, P.3    Gravano, L.4
  • 13
    • 4243148480
    • Authoritative sources in a hyperlinked environment
    • J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46(5):604-632, 1999.
    • (1999) Journal of the ACM (JACM) , vol.46 , Issue.5 , pp. 604-632
    • Kleinberg, J.1
  • 14
    • 0033297068
    • Trawling the Web for emerging cyber-communities
    • R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Trawling the Web for emerging cyber-communities. Computer Networks, 31(11-16):1481-1493, 1999.
    • (1999) Computer Networks , vol.31 , Issue.11-16 , pp. 1481-1493
    • Kumar, R.1    Raghavan, P.2    Rajagopalan, S.3    Tomkins, A.4
  • 16
    • 34250618783
    • Do not crawl in the DUST: Different URLs with similar text
    • DOI 10.1145/1135777.1135992, Proceedings of the 15th International Conference on World Wide Web
    • U. Schonfeld, Z. Bar-Yossef, and I. Keidar. Do not crawl in the DUST: different URLs with similar text. In Proceedings of the 15th International World Wide Web Conference, pages 1015-1016, New York, NY, USA, 2006. (Pubitemid 46946760)
    • (2006) Proceedings of the 15th International Conference on World Wide Web , pp. 1015-1016
    • Schonfeld, U.1    Bar-Yossef, Z.2    Keidar, I.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.