메뉴 건너뛰기




Volumn , Issue , 2003, Pages 679-689

Efficient URL caching for world wide web crawling

Author keywords

caching; crawling; distributed crawlers; URL caching; web crawlers; web graph models

Indexed keywords

CACHING; CRAWLING; DISTRIBUTED CRAWLER; URL CACHING; WEB CRAWLERS; WEB GRAPHS;

EID: 84880464612     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/775152.775247     Document Type: Conference Paper
Times cited : (55)

References (38)
  • 2
    • 0003003638 scopus 로고
    • A study of replacement algorithms for a virtual-storage computer
    • L. A. Belady. A study of replacement algorithms for a virtual-storage computer. IBM Systems Journal, 5(2):78-101, 1966.
    • (1966) IBM Systems Journal , vol.5 , Issue.2 , pp. 78-101
    • Belady, L.A.1
  • 3
    • 20444387298 scopus 로고    scopus 로고
    • A technique for measuring the relative size and overlap of public web search engines
    • K. Bharat and A. Z. Broder. A technique for measuring the relative size and overlap of public web search engines. In Proceedings of the 7th World Wide Web Conference, pages 379-388, 1998. http://www7.scu.edu.au/programme/ fullpapers/1937/com1937.htm.
    • (1998) Proceedings of the 7th World Wide Web Conference , pp. 379-388
    • Bharat, K.1    Broder, A.Z.2
  • 7
    • 0038589165 scopus 로고    scopus 로고
    • The anatomy of a large-scale hypertextual Web search engine
    • S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. In Proceedings of the 7th World Wide Web Conference, pages 107-117, 1998. http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm.
    • (1998) Proceedings of the 7th World Wide Web Conference , pp. 107-117
    • Brin, S.1    Page, L.2
  • 8
    • 0000369806 scopus 로고
    • Some applications of Rabin's fingerprinting method
    • R. Capocelli, A. De Santis, and U. Vaccaro, editors, Springer-Verlag
    • A. Z. Broder. Some applications of Rabin's fingerprinting method. In R. Capocelli, A. De Santis, and U. Vaccaro, editors, Sequences II: Methods in Communications, Security, and Computer Science, pages 143-152. Springer-Verlag, 1993.
    • (1993) Sequences II: Methods in Communications, Security, and Computer Science , pp. 143-152
    • Broder, A.Z.1
  • 10
    • 0342652248 scopus 로고    scopus 로고
    • Crawling towards eternity: Building an archive of the world wide web
    • May
    • M. Burner. Crawling towards eternity: Building an archive of the world wide web. Web Techniques Magazine, 2(5), May 1997.
    • (1997) Web Techniques Magazine , vol.2 , Issue.5
    • Burner, M.1
  • 20
    • 84880465500 scopus 로고    scopus 로고
    • Google. http://www.google.com.
  • 22
    • 79951675059 scopus 로고    scopus 로고
    • Mercator: A scalable, extensible web crawler
    • A. Heydon and M. Najork. Mercator: A scalable, extensible web crawler. World Wide Web, 2(4):219-229, 1999.
    • (1999) World Wide Web , vol.2 , Issue.4 , pp. 219-229
    • Heydon, A.1    Najork, M.2
  • 23
    • 0033470435 scopus 로고    scopus 로고
    • Asymptotic approximation of the move-to-front search cost distribution and least-recently used caching fault probabilities
    • Available from
    • P. R. Jelenković. Asymptotic approximation of the move-to-front search cost distribution and least-recently used caching fault probabilities. Ann. Appl. Prob., 9(2):430-464, 1999. Available from http://comet.ctr.columbia. edu/∼predrag/mypub/mtfRevised.ps.
    • (1999) Ann. Appl. Prob. , vol.9 , Issue.2 , pp. 430-464
    • Jelenković, P.R.1
  • 24
    • 0004502641 scopus 로고
    • An analysis of optimum caching
    • D. E. Knuth. An analysis of optimum caching. Journal of Algorithms, 6(2):181-199, 1985.
    • (1985) Journal of Algorithms , vol.6 , Issue.2 , pp. 181-199
    • Knuth, D.E.1
  • 25
    • 0003905880 scopus 로고    scopus 로고
    • Reprinted as Chapter 17 of Stanford, California, Center for the Study of Language and Information
    • Reprinted as Chapter 17 of Selected Papers on Analysis of Algorithms by Donald E. Knuth, Stanford, California, Center for the Study of Language and Information, 2000.
    • (2000) Selected Papers on Analysis of Algorithms
    • Knuth, D.E.1
  • 28
    • 0032478628 scopus 로고    scopus 로고
    • Searching the world wide web
    • DOI 10.1126/science.280.5360.98
    • S. Lawrence and C. L. Giles. Searching the World Wide Web. Science, 280(5360):98-100, 1998. (Pubitemid 28169089)
    • (1998) Science , vol.280 , Issue.5360 , pp. 98-100
    • Lawrence, S.1    Giles, C.L.2
  • 29
    • 0033536218 scopus 로고    scopus 로고
    • Accessibility of information on the web
    • DOI 10.1038/21987
    • S. Lawrence and C. L. Giles. Accessibility of information on the web. Nature, 400:107-109, 1999. (Pubitemid 29327520)
    • (1999) Nature , vol.400 , Issue.6740 , pp. 107-109
    • Lawrence, S.1    Giles, C.L.2
  • 30
    • 0013238179 scopus 로고    scopus 로고
    • High-performance web crawling
    • Compaq Systems Research Center, Palo Alto, CA, Sept.
    • M. Najork and A. Heydon. High-performance web crawling. SRC Research Report 173, Compaq Systems Research Center, Palo Alto, CA, Sept. 2001.
    • (2001) SRC Research Report 173
    • Najork, M.1    Heydon, A.2
  • 33
    • 0003676885 scopus 로고
    • Technical Report TR-15-81, Center for Research in Computing Technology, Harvard University
    • M. O. Rabin. Fingerprinting by random polynomials. Technical Report TR-15-81, Center for Research in Computing Technology, Harvard University, 1981.
    • (1981) Fingerprinting by Random Polynomials
    • Rabin, M.O.1
  • 37
    • 84880488915 scopus 로고    scopus 로고
    • Personal communication. Jan.
    • T. Suel. Personal communication. Jan. 2003.
    • (2003)
    • Suel, T.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.