메뉴 건너뛰기




Volumn , Issue , 2004, Pages 10-17

Scheduling algorithms for Web crawling

Author keywords

[No Author keywords available]

Indexed keywords

DOWNLOADING; FRAMEWORKS; SCHEDULING ALGORITHMS; WEB CRAWLERS;

EID: 15844394068     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/WEBMED.2004.1348139     Document Type: Conference Paper
Times cited : (40)

References (44)
  • 1
    • 84860107024 scopus 로고    scopus 로고
    • Robotcop. www.robotcop.org, 2002.
    • (2002)
  • 2
    • 84860107020 scopus 로고    scopus 로고
    • HT://Dig. GPL software
    • HT://Dig. http://www.htdig.org/, 2004. GPL software.
    • (2004)
  • 3
    • 15844418414 scopus 로고    scopus 로고
    • Larbin
    • S. Ailleret. Larbin, http://larbin.sourceforge.net/index-eng.html, 2004. GPL software.
    • (2004) GPL Software
    • Ailleret, S.1
  • 4
    • 84963904043 scopus 로고    scopus 로고
    • Relating web characteristics with link based web page ranking
    • Laguna San Rafael, Chile, November. IEEE Cs. Press
    • R. Baeza-Yates and C. Castillo. Relating web characteristics with link based web page ranking. In Proceedings of String Processing and Information Retrieval, pages 21-32, Laguna San Rafael, Chile, November 2001. IEEE Cs. Press.
    • (2001) Proceedings of String Processing and Information Retrieval , pp. 21-32
    • Baeza-Yates, R.1    Castillo, C.2
  • 8
    • 0038589165 scopus 로고    scopus 로고
    • The anatomy of a large-scale hypertextual Web search engine
    • April
    • S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7): 107-117, April 1998.
    • (1998) Computer Networks and ISDN Systems , vol.30 , Issue.1-7 , pp. 107-117
    • Brin, S.1    Page, L.2
  • 11
    • 0342652248 scopus 로고    scopus 로고
    • Crawling towards eternity - Building an archive of the world wide web
    • May
    • M. Burner. Crawling towards eternity - building an archive of the world wide web. Web Techniques, 2(5), May 1997.
    • (1997) Web Techniques , vol.2 , Issue.5
    • Burner, M.1
  • 13
    • 84877324786 scopus 로고    scopus 로고
    • The evolution of the web and implications for an incremental crawler
    • Cairo, Egypt, September. Morgan Kaufmann
    • J. Cho. The evolution of the web and implications for an incremental crawler. In Proceedings of 26th International Conference on Very Large Databases (VLDB), pages 527-534, Cairo, Egypt, September 2000. Morgan Kaufmann.
    • (2000) Proceedings of 26th International Conference on Very Large Databases (VLDB) , pp. 527-534
    • Cho, J.1
  • 14
    • 15844378516 scopus 로고    scopus 로고
    • Page quality: In search of an unbiased Web ranking
    • UCLA Computer Science
    • J. Cho and R. Adams. Page quality: In search of an unbiased Web ranking. Technical report, UCLA Computer Science, 2004.
    • (2004) Technical Report
    • Cho, J.1    Adams, R.2
  • 21
    • 84860097936 scopus 로고    scopus 로고
    • GPL Software
    • L. Dacharay. WebBase. http://freesoftware.fsf.org/webbase/, 2002. GPL Software.
    • (2002) WebBase
    • Dacharay, L.1
  • 24
    • 0002371171 scopus 로고    scopus 로고
    • Optimal robot scheduling for web search engines
    • R. W. Edward G. Coffman, Z. Liu. Optimal robot scheduling for web search engines. Journal of Scheduling, 1(1): 15-29, 1998.
    • (1998) Journal of Scheduling , vol.1 , Issue.1 , pp. 15-29
    • Edward, R.W.1    Coffman, G.2    Liu, Z.3
  • 25
  • 26
    • 9944234613 scopus 로고
    • The RBSE spider: Balancing effective search against web load
    • Geneva, Switzerland, May
    • D. Eichmann. The RBSE spider: balancing effective search against web load. In Proceedings of the first World Wide Web Conference, Geneva, Switzerland, May 1994.
    • (1994) Proceedings of the First World Wide Web Conference
    • Eichmann, D.1
  • 28
    • 79951675059 scopus 로고    scopus 로고
    • Mercator: A scalable, extensible web crawler
    • April
    • A. Heydon and M. Najork. Mercator: A scalable, extensible web crawler. World Wide Web Conference, 2(4):219-229, April 1999.
    • (1999) World Wide Web Conference , vol.2 , Issue.4 , pp. 219-229
    • Heydon, A.1    Najork, M.2
  • 29
    • 0040511952 scopus 로고
    • Robots in the web: Threat or treat?
    • April
    • M. Koster. Robots in the web: threat or treat? Connexions, 9(4), April 1995.
    • (1995) Connexions , vol.9 , Issue.4
    • Koster, M.1
  • 30
    • 0033297068 scopus 로고    scopus 로고
    • Trawling the Web for emerging cyber-communities
    • R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. Trawling the Web for emerging cyber-communities. Computer Networks, 31(11-16):1481-1493, 1999.
    • (1999) Computer Networks , vol.31 , Issue.11-16 , pp. 1481-1493
    • Kumar, R.1    Raghavan, P.2    Rajagopalan, S.3    Tomkins, A.4
  • 31
    • 0032478628 scopus 로고    scopus 로고
    • Searching the World Wide Web
    • S. Lawrence and C. L. Giles. Searching the World Wide Web. Science, 280(5360):98-100, 1998.
    • (1998) Science , vol.280 , Issue.5360 , pp. 98-100
    • Lawrence, S.1    Giles, C.L.2
  • 32
    • 84974698258 scopus 로고    scopus 로고
    • Characterizing Web document change
    • Proceedings of the Second International Conference on Advances in Web-Age Information Management, London, UK, July. Springer
    • L. Lim, M. Wang, S. Padmanabhan, J. S. Vitter, and R. Agarwal. Characterizing Web document change. In Proceedings of the Second International Conference on Advances in Web-Age Information Management, volume 2118 of Lecture Notes in Computer Science, pages 133-144, London, UK, July 2001. Springer.
    • (2001) Volume 2118 of Lecture Notes in Computer Science , vol.2118 , pp. 133-144
    • Lim, L.1    Wang, M.2    Padmanabhan, S.3    Vitter, J.S.4    Agarwal, R.5
  • 33
    • 0004312089 scopus 로고    scopus 로고
    • Master's thesis, Virginia State University, Blacksburg, Virginia, USA, April
    • B. Liu. Characterizing web response time. Master's thesis, Virginia State University, Blacksburg, Virginia, USA, April 1998.
    • (1998) Characterizing Web Response Time
    • Liu, B.1
  • 34
    • 0003322030 scopus 로고    scopus 로고
    • Web traffic latency: Characteristics and implications
    • B. Liu and E. A. Fox. Web traffic latency: Characteristics and implications. J.UCS: Journal of Universal Computer Science, 4(9):763-778, 1998.
    • (1998) J.UCS: Journal of Universal Computer Science , vol.4 , Issue.9 , pp. 763-778
    • Liu, B.1    Fox, E.A.2
  • 36
    • 0001554832 scopus 로고    scopus 로고
    • Sphinx: A framework for creating personal, site-specific web crawlers
    • Brisbane, Australia, April
    • R. Miller and K. Bharat. Sphinx: A framework for creating personal, site-specific web crawlers. In Proceedings of the seventh conference on World Wide Web, Brisbane, Australia, April 1998.
    • (1998) Proceedings of the Seventh Conference on World Wide Web
    • Miller, R.1    Bharat, K.2
  • 38
    • 15844394231 scopus 로고    scopus 로고
    • What's new on the web?: The evolution of the web from a search engine perspective
    • New York, NY, USA, May. ACM Press
    • A. Ntoulas, J. Cho, and C. Olston. What's new on the web?: the evolution of the web from a search engine perspective. In Proceedings of the 13th conference on World Wide Web, pages 1-12, New York, NY, USA, May 2004. ACM Press.
    • (2004) Proceedings of the 13th Conference on World Wide Web , pp. 1-12
    • Ntoulas, A.1    Cho, J.2    Olston, C.3
  • 39
    • 0343374008 scopus 로고
    • Finding what people want: Experiences with the WebCrawler
    • Geneva, Switzerland, May
    • B. Pinkerton. Finding what people want: Experiences with the WebCrawler. In Proceedings of the first World Wide Web Conference, Geneva, Switzerland, May 1994.
    • (1994) Proceedings of the First World Wide Web Conference
    • Pinkerton, B.1
  • 41
    • 0036204395 scopus 로고    scopus 로고
    • Design and implementation of a high-performance distributed web crawler
    • San Jose, California, February. IEEE Cs. Press
    • V. Shkapenyuk and T. Suel. Design and implementation of a high-performance distributed web crawler. In Proceedings of the 18th International Conference on Data Engineering (ICDE), pages 357 - 368, San Jose, California, February 2002. IEEE Cs. Press.
    • (2002) Proceedings of the 18th International Conference on Data Engineering (ICDE) , pp. 357-368
    • Shkapenyuk, V.1    Suel, T.2
  • 44
    • 0036109905 scopus 로고    scopus 로고
    • Discovery of web robots session based on their navigational patterns
    • P.-N. Tan and V. Kumar. Discovery of web robots session based on their navigational patterns. Data Mining and Knowledge discovery, 6(1):9-35, 2002.
    • (2002) Data Mining and Knowledge Discovery , vol.6 , Issue.1 , pp. 9-35
    • Tan, P.-N.1    Kumar, V.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.