[1] Internet Forum. http://en.wikipedia.org/wiki/Internet-forum.
[2] S. Abiteboul, M. Preda, and G. Cobena. Adaptive on-line page importance computation. WWW, pages 280-290, Budapest, Hungary, May 20-24, 2003.
[3] R. Baeza-Yates and C. Castillo. Crawling the infinite Web: Five levels are enough. Workshop on Algorithms and Models for the Web-Graph, LNCS, volume 3243, pages 156-167, Rome, Italy, Oct. 16, 2004.
[4] R. Baeza-Yates, C. Castillo, M. Marin, and A. Rodriguez. Crawling a Country: Better strategies than breadth-first for Web page ordering. WWW, pages 864-872, Chiba, Japan, May 10-14, 2005.
[5] S. Brin and L. Page. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-111, 1998.
[6] A. Z. Broder, S. C. Glassman, M. S. Manasse, and G. Zweig. Syntactic clustering of the Web. WWW, pages 1157-1166, Santa Clara, California, USA, Apr. 1997.
[7] WWW, pages 447-456, Beijing, P.R. China, April 21-25, 2008.
[8] S. Chakrabarti, M. van den Berg, and B. Dom. Focused crawling: a new approach to topic-specific Web resource discovery. Computer Networks, 31(11-16):1623-1640, 1999.
[9] Y. Guo, K. Li, K. Zhang, and G. Zhang. Board forum crawling: a Web crawling method for Web forum. In Proc. 2006 IEEE/WIC/ACM Int. Conf. Web Intelligence, pages 745-748, Hong Kong, Dec. 2006.
[10] M. Henzinger. Finding near-duplicate Web pages: A large-scale evaluation of algorithms. SIGIR, pages 284-291, Seattle, Washington, USA, Aug. 2006.
[11] G. S. Manku, A. Jain, and A. D. Sarma. Detecting near-duplicates for web crawling. WWW, pages 141-150, Banff, Canada, May 8-12, 2007.
[12] F. Menczer, G. Pant, P. Srinivasan, and M. E. Ruiz. Evaluating topic-driven Web crawlers. SIGIR, pages 241-249, New Orleans, LA, USA, Sept. 9-12, 2001.
[13] S. Pandey and C. Olston. User-centric Web crawling. WWW, pages 401-411, Chiba, May 10-14, 2005.
[14] S. Raghavan and H. Garcia-Molina. Crawling the hidden Web. VLDB, pages 129-138, San Francisco, CA, USA, Sept. 11-14, 2001.
[15] M. L. A. Vidal, A. S. da Silva, E. S. de Moura, and J. M. B. Cavalcanti. Structure-driven crawler generation by example. SIGIR, pages 292-299, Seattle, Washington, USA, Aug. 6-11, 2006.