-
1. Uniform resource identifier: Generic syntax. RFC 3986.
-
2. S. Abiteboul, M. Preda, and G. Cobena. Adaptive on-line page importance computation. In Proc. WWW, pages 280-290, 2003.
-
3. R. Baeza-Yates, C. Castillo, M. Marin, and A. Rodriguez. Crawling a country: better strategies than breadth-first for web page ordering. In Proc. WWW, pages 864-872, 2005.
-
4. S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks, 30(1-7):107-117, 1998.
-
5. R. Cai, J.-M. Yang, W. Lai, Y. Wang, and L. Zhang. iRobot: an intelligent crawler for web forums. In Proc. WWW, pages 447-456, 2008.
-
6. S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: a new approach to topic-specific web resource discovery. Computer Networks, 31(11-16):1623-1640, 1999.
-
8. J. Cho and U. Schonfeld. RankMass crawler: a crawler with high personalized PageRank coverage guarantee. In Proc. VLDB, pages 375-386, 2007.
-
9. D. Fetterly, N. Craswell, and V. Vinay. The impact of crawl policy on web search effectiveness. In Proc. SIGIR, pages 580-587, 2009.
-
11. J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604-632, 1999.
-
12. T. Lei, R. Cai, J.-M. Yang, Y. Ke, X. Fan, and L. Zhang. A pattern tree-based approach to learning URL normalization rules. In Proc. WWW, pages 611-620, 2010.
-
13. F. Menczer, G. Pant, and P. Srinivasan. Topical web crawlers: evaluating adaptive algorithms. ACM Trans. Internet Techn., 4(4):378-419, 2004. DOI 10.1145/1031114.1031117
-
15. L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: bringing order to the Web. Technical report, Stanford University, 1999.
-
16. S. Pandey and C. Olston. User-centric web crawling. In Proc. WWW, pages 401-411, 2005.
-
17. S. Pandey and C. Olston. Crawl ordering by search impact. In Proc. WSDM, pages 3-14, 2008.
-
19. B. Pinkerton. Finding what people want: experiences with the WebCrawler. In Proc. WWW, pages 3-14, 1994.
-
20. J. Rissanen. Stochastic complexity and modeling. The Annals of Statistics, 14(3):1080-1100, 1986.
-
21. M. L. A. Vidal, A. S. da Silva, E. S. de Moura, and J. M. B. Cavalcanti. Structure-driven crawler generation by example. In Proc. SIGIR, pages 292-299, 2006.
-
22. Y. Wang, J.-M. Yang, W. Lai, R. Cai, L. Zhang, and W.-Y. Ma. Exploring traversal strategy for efficient web forum crawling. In Proc. SIGIR, pages 459-466, 2008.