2011, Pages 87-92

User browsing behavior-driven web crawling

Author keywords

URL pattern discovery; web crawling; web log mining

Indexed keywords

BREADTH-FIRST; IMPORTANCE MEASURE; LINK STRUCTURE; LOG MINING; PAGERANK; PATTERN DISCOVERY; USER INTERESTS; WEB CRAWLERS; WEB CRAWLING; WEB LOG MINING;

EID: 83055181892     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2063576.2063593     Document Type: Conference Paper
Times cited: 9

References (22)
  • 1
    • 83055179321
    • Uniform resource identifier: Generic syntax
    • Uniform resource identifier: Generic syntax. RFC 3986.
    • RFC , vol.3986
  • 2
    • 84880480000
    • Adaptive on-line page importance computation
    • S. Abiteboul, M. Preda, and G. Cobena. Adaptive on-line page importance computation. In Proc. WWW, pages 280-290, 2003.
    • (2003) Proc. WWW , pp. 280-290
    • Abiteboul, S.1    Preda, M.2    Cobena, G.3
  • 3
    • 77953053635
    • Crawling a country: Better strategies than breadth-first for web page ordering
    • R. Baeza-Yates, C. Castillo, M. Marin, and A. Rodriguez. Crawling a country: better strategies than breadth-first for web page ordering. In Proc. WWW, pages 864-872, 2005.
    • (2005) Proc. WWW , pp. 864-872
    • Baeza-Yates, R.1    Castillo, C.2    Marin, M.3    Rodriguez, A.4
  • 4
    • 0038589165
    • The anatomy of a large-scale hypertextual web search engine
    • S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1-7):107-117, 1998.
    • (1998) Computer Networks , vol.30 , Issue.1-7 , pp. 107-117
    • Brin, S.1    Page, L.2
  • 5
    • 57349112859
    • iRobot: An intelligent crawler for web forums
    • R. Cai, J.-M. Yang, W. Lai, Y. Wang, and L. Zhang. iRobot: an intelligent crawler for web forums. In Proc. WWW, pages 447-456, 2008.
    • (2008) Proc. WWW , pp. 447-456
    • Cai, R.1    Yang, J.-M.2    Lai, W.3    Wang, Y.4    Zhang, L.5
  • 6
    • 0033294474
    • Focused crawling: A new approach to topic-specific web resource discovery
    • S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: a new approach to topic-specific web resource discovery. Computer Networks, 31(11-16):1623-1640, 1999.
    • (1999) Computer Networks , vol.31 , Issue.11-16 , pp. 1623-1640
    • Chakrabarti, S.1    Van Den Berg, M.2    Dom, B.3
  • 8
    • 85011105564
    • RankMass crawler: A crawler with high personalized PageRank coverage guarantee
    • J. Cho and U. Schonfeld. RankMass crawler: a crawler with high personalized PageRank coverage guarantee. In Proc. VLDB, pages 375-386, 2007.
    • (2007) Proc. VLDB , pp. 375-386
    • Cho, J.1    Schonfeld, U.2
  • 9
    • 72449205154
    • The impact of crawl policy on web search effectiveness
    • D. Fetterly, N. Craswell, and V. Vinay. The impact of crawl policy on web search effectiveness. In Proc. SIGIR, pages 580-587, 2009.
    • (2009) Proc. SIGIR , pp. 580-587
    • Fetterly, D.1    Craswell, N.2    Vinay, V.3
  • 11
    • 4243148480
    • Authoritative sources in a hyperlinked environment
    • J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999.
    • (1999) Journal of the ACM , vol.46 , Issue.5 , pp. 604-632
    • Kleinberg, J.1
  • 12
    • 77954589296
    • A pattern tree-based approach to learning URL normalization rules
    • T. Lei, R. Cai, J.-M. Yang, Y. Ke, X. Fan, and L. Zhang. A pattern tree-based approach to learning URL normalization rules. In Proc. WWW, pages 611-620, 2010.
    • (2010) Proc. WWW , pp. 611-620
    • Lei, T.1    Cai, R.2    Yang, J.-M.3    Ke, Y.4    Fan, X.5    Zhang, L.6
  • 13
    • 9744257884
    • Topical web crawlers: Evaluating adaptive algorithms
    • DOI 10.1145/1031114.1031117
    • F. Menczer, G. Pant, and P. Srinivasan. Topical web crawlers: evaluating adaptive algorithms. ACM Trans. Internet Techn., 4(4):378-419, 2004. (Pubitemid 40009828)
    • (2004) ACM Transactions on Internet Technology , vol.4 , Issue.4 , pp. 378-419
    • Menczer, F.1    Pant, G.2    Srinivasan, P.3
  • 16
    • 33745753308
    • User-centric web crawling
    • S. Pandey and C. Olston. User-centric web crawling. In Proc. WWW, pages 401-411, 2005.
    • (2005) Proc. WWW , pp. 401-411
    • Pandey, S.1    Olston, C.2
  • 17
    • 42549138928
    • Crawl ordering by search impact
    • S. Pandey and C. Olston. Crawl ordering by search impact. In Proc. WSDM, pages 3-14, 2008.
    • (2008) Proc. WSDM , pp. 3-14
    • Pandey, S.1    Olston, C.2
  • 19
    • 0343374008
    • Finding what people want: Experiences with the web crawler
    • B. Pinkerton. Finding what people want: experiences with the web crawler. In Proc. WWW, pages 3-14, 1994.
    • (1994) Proc. WWW , pp. 3-14
    • Pinkerton, B.1
  • 20
    • 0000318553
    • Stochastic complexity and modeling
    • J. Rissanen. Stochastic complexity and modeling. The Annals of Statistics, 14(3):1080-1100, 1986.
    • (1986) The Annals of Statistics , vol.14 , Issue.3 , pp. 1080-1100
    • Rissanen, J.1
  • 22
    • 57349157726
    • Exploring traversal strategy for efficient web forum crawling
    • Y. Wang, J.-M. Yang, W. Lai, R. Cai, L. Zhang, and W.-Y. Ma. Exploring traversal strategy for efficient web forum crawling. In Proc. SIGIR, pages 459-466, 2008.
    • (2008) Proc. SIGIR , pp. 459-466
    • Wang, Y.1    Yang, J.-M.2    Lai, W.3    Cai, R.4    Zhang, L.5    Ma, W.-Y.6


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.