-
1
-
-
0033077324
-
The space complexity of approximating the frequency moments
-
Alon, N., Matias, Y., and Szegedy, M. 1999. The space complexity of approximating the frequency moments. J. Comput. Syst. Sci. 58, 1, 137-147.
-
(1999)
J. Comput. Syst. Sci
, vol.58
, Issue.1
, pp. 137-147
-
-
Alon, N.1
Matias, Y.2
Szegedy, M.3
-
3
-
-
33750372419
-
Generalizing pagerank: Damping functions for link-based ranking algorithms
-
ACM Press. WA
-
Baeza-Yates, R., Boldi, P., and Castillo, C. 2006. Generalizing pagerank: Damping functions for link-based ranking algorithms. In Proceedings of ACM SIGIR. ACM Press. WA, 308-315.
-
(2006)
Proceedings of ACM SIGIR
, pp. 308-315
-
-
Baeza-Yates, R.1
Boldi, P.2
Castillo, C.3
-
6
-
-
84876833271
-
Link-based characterization and detection of Web Spam
-
AIRWeb, Seattle, WA
-
Becchetti, L., Castillo, C., Donato, D., Leonardi, S., and Baeza-Yates, R. 2006a. Link-based characterization and detection of Web Spam. In 2nd International Workshop on Adversarial Information Retrieval on the Web (AIRWeb). Seattle, WA.
-
(2006)
2nd International Workshop on Adversarial Information Retrieval on the Web
-
-
Becchetti, L.1
Castillo, C.2
Donato, D.3
Leonardi, S.4
Baeza-Yates, R.5
-
7
-
-
37149002369
-
Using rank propagation and probabilistic counting for link-based spam detection
-
WebKDD, ACM Press
-
Becchetti, L., Castillo, C., Donato, D., Leonardi, S., and Baeza-Yates, R. 2006b. Using rank propagation and probabilistic counting for link-based spam detection. In Proceedings of the Workshop on Web Mining and Web Usage Analysis (WebKDD). ACM Press.
-
(2006)
Proceedings of the Workshop on Web Mining and Web Usage Analysis
-
-
Becchetti, L.1
Castillo, C.2
Donato, D.3
Leonardi, S.4
Baeza-Yates, R.5
-
8
-
-
34250660925
-
Spamrank: Fully automatic link spam detection
-
Chiba, Japan
-
Benczr, A. A., Csalogny, K., Sarls, T., and Uher, M. 2005. Spamrank: Fully automatic link spam detection. In Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web. Chiba, Japan.
-
(2005)
Proceedings of the 1st International Workshop on Adversarial Information Retrieval on the Web
-
-
Benczr, A.A.1
Csalogny, K.2
Sarls, T.3
Uher, M.4
-
9
-
-
3042680184
-
Ubicrawler: A scalable fully distributed web crawler
-
Boldi, P., Codenotti, B., Santini, M., and Vigna, S. 2004. Ubicrawler: A scalable fully distributed web crawler. Softw. Pract. Exp. 34, 8, 711-726.
-
(2004)
Softw. Pract. Exp
, vol.34
, Issue.8
, pp. 711-726
-
-
Boldi, P.1
Codenotti, B.2
Santini, M.3
Vigna, S.4
-
10
-
-
0030211964
-
Bagging predictors
-
Breiman, L. 1996. Bagging predictors. Machine Learn. 24, 2, 123-140.
-
(1996)
Machine Learn
, vol.24
, Issue.2
, pp. 123-140
-
-
Breiman, L.1
-
11
-
-
70450232823
-
Network applications of Bloom filters: A survey
-
Broder, A. and Mitzenmacher, M. 2003. Network applications of Bloom filters: A survey. Internet Math. 1, 4, 485-509.
-
(2003)
Internet Math
, vol.1
, Issue.4
, pp. 485-509
-
-
Broder, A.1
Mitzenmacher, M.2
-
12
-
-
34547964237
-
A reference collection for web spam
-
Castillo, C., Donato, D., Becchetti, L., Boldi, P., Leonardi, S., Santini, M., and Vigna, S. 2006a. A reference collection for web spam. SIGIR Forum 40, 2, 11-24.
-
(2006)
SIGIR Forum
, vol.40
, Issue.2
, pp. 11-24
-
-
Castillo, C.1
Donato, D.2
Becchetti, L.3
Boldi, P.4
Leonardi, S.5
Santini, M.6
Vigna, S.7
-
13
-
-
34547964237
-
A reference collection for web spam
-
Castillo, C., Donato, D., Becchetti, L., Boldi, P., Leonardi, S., Santini, M., and Vigna, S. 2006b. A reference collection for web spam. SIGIR Forum 40, 2, 11-24.
-
(2006)
SIGIR Forum
, vol.40
, Issue.2
, pp. 11-24
-
-
Castillo, C.1
Donato, D.2
Becchetti, L.3
Boldi, P.4
Leonardi, S.5
Santini, M.6
Vigna, S.7
-
14
-
-
36448992581
-
Know your neighbors: Web spam detection using the web topology
-
ACM Press
-
Castillo, C., Donato, D., Gionis, A., Murdock, V., and Silvestri, F. 2007. Know your neighbors: Web spam detection using the web topology. In Proceedings of the 30th Annual International ACM SIGIR Conference (SIGIR). ACM Press, 423-430.
-
(2007)
Proceedings of the 30th Annual International ACM SIGIR Conference (SIGIR)
, pp. 423-430
-
-
Castillo, C.1
Donato, D.2
Gionis, A.3
Murdock, V.4
Silvestri, F.5
-
15
-
-
0031353179
-
Size-estimation framework with applications to transitive closure and reachability
-
Cohen, E. 1997. Size-estimation framework with applications to transitive closure and reachability. J. Comput. Syst. Sci. 55, 3, 441-453.
-
(1997)
J. Comput. Syst. Sci
, vol.55
, Issue.3
, pp. 441-453
-
-
Cohen, E.1
-
17
-
-
34250686269
-
Site level noise removal for search engines
-
ACM Press
-
da Costa-Carvalho, A. L., Chirita, P.-A., de Moura, E. S., Calado, P., and Nejdl, W. 2006. Site level noise removal for search engines. In Proceedings of the 15th International Conference on World Wide Web (WWW'06), ACM Press, 73-82.
-
(2006)
Proceedings of the 15th International Conference on World Wide Web (WWW'06)
, pp. 73-82
-
-
da Costa-Carvalho, A.L.1
Chirita, P.-A.2
de Moura, E.S.3
Calado, P.4
Nejdl, W.5
-
18
-
-
8644220983
-
Recognizing nepotistic links on the Web
-
AAAI Press, TX
-
Davison, B. D. 2000a. Recognizing nepotistic links on the Web. In Artificial Intelligence for Web Search. AAAI Press, TX, 23-28.
-
(2000)
Artificial Intelligence for Web Search
, pp. 23-28
-
-
Davison, B.D.1
-
21
-
-
33646432218
-
Thwarting the nigritude ultramarine: Learning to identify link spam
-
Proceedings of the 16th European Conference on Machine Learning ECML, Springer
-
Drost, I. and Scheffer, T. 2005. Thwarting the nigritude ultramarine: learning to identify link spam. In Proceedings of the 16th European Conference on Machine Learning (ECML). Lecture Notes in Artificial Intelligence, vol. 3720. Springer, 233-243.
-
(2005)
Lecture Notes in Artificial Intelligence
, vol.3720
, pp. 233-243
-
-
Drost, I.1
Scheffer, T.2
-
22
-
-
0142152716
-
Loglog counting of large cardinalities (extended abstract)
-
Proceedings of 11th Annual European Symposium on Algorithms, Springer
-
Durand, M. and Flajolet, P. 2003. Loglog counting of large cardinalities (extended abstract). In Proceedings of 11th Annual European Symposium on Algorithms. Lecture Notes in Computer Science, vol. 2832. Springer, 605-617.
-
(2003)
Lecture Notes in Computer Science
, vol.2832
, pp. 605-617
-
-
Durand, M.1
Flajolet, P.2
-
23
-
-
19944369101
-
Ranking the web frontier
-
ACM Press
-
Eiron, N., Curley, K. S., and Tomlin, J. A. 2004. Ranking the web frontier. In Proceedings of the 13th International Conference on World Wide Web. ACM Press, 309-318.
-
(2004)
Proceedings of the 13th International Conference on World Wide Web
, pp. 309-318
-
-
Eiron, N.1
Curley, K.S.2
Tomlin, J.A.3
-
24
-
-
84947119654
-
-
Feigenbaum, J., Kannan, S., Gregor, M. A., Suri, S., and Zhang, J. 2004. On graph problems in a semi-streaming model. In 31st International Colloquium on Automata, Languages and Programming.
-
Feigenbaum, J., Kannan, S., Gregor, M. A., Suri, S., and Zhang, J. 2004. On graph problems in a semi-streaming model. In 31st International Colloquium on Automata, Languages and Programming.
-
-
-
-
25
-
-
77954428596
-
Spam, damn spam, and statistics: Using statistical analysis to locate spam web
-
WebDB, Paris, France
-
Fetterly, D., Manasse, M., and Najork, M. 2004. Spam, damn spam, and statistics: Using statistical analysis to locate spam web pages. In Proceedings of the 7th Workshop on the Web and Databases (WebDB). Paris, France. 1-6.
-
(2004)
Proceedings of the 7th Workshop on the Web and Databases
, pp. 1-6
-
-
Fetterly, D.1
Manasse, M.2
Najork, M.3
-
26
-
-
0020828424
-
Probabilistic counting algorithms for data base applications
-
Flajolet, P. and Martin, N. G. 1985. Probabilistic counting algorithms for data base applications. J. Comput. Syst. Sci. 31, 2, 182-209.
-
(1985)
J. Comput. Syst. Sci
, vol.31
, Issue.2
, pp. 182-209
-
-
Flajolet, P.1
Martin, N.G.2
-
28
-
-
84901272818
-
-
Gomes, L. H., Almeida, R. B., Bettencourt, L. M. A., Almeida, V., and Almeida, J. M. 2005. Comparative graph theoretical characterization of networks of spam and legitimate email. URL: http://www.ceas.cc/papers-2005/131.pdf.
-
Gomes, L. H., Almeida, R. B., Bettencourt, L. M. A., Almeida, V., and Almeida, J. M. 2005. Comparative graph theoretical characterization of networks of spam and legitimate email. URL: http://www.ceas.cc/papers-2005/131.pdf.
-
-
-
-
29
-
-
14544277619
-
The bubble of web visibility
-
Gori, M. and Witten, I. 2005. The bubble of web visibility. Comm. ACM 48, 3, 115-117.
-
(2005)
Comm. ACM
, vol.48
, Issue.3
, pp. 115-117
-
-
Gori, M.1
Witten, I.2
-
31
-
-
0024817947
-
Networks of sexual contacts: Implications for the pattern of spread of hiv
-
Gupta, S., Anderson, R. M., and May, R. M. 1989. Networks of sexual contacts: implications for the pattern of spread of hiv. AIDS 3, 12, 807-817.
-
(1989)
AIDS
, vol.3
, Issue.12
, pp. 807-817
-
-
Gupta, S.1
Anderson, R.M.2
May, R.M.3
-
32
-
-
34548764345
-
Link spam detection based on mass estimation
-
ACM
-
Gyngyi, Z., Berkhin, P., Garcia-Molina, H., and Pedersen, J. 2006. Link spam detection based on mass estimation. In Proceedings of the 32nd International Conference on Very Large Data Bases. ACM, 439-450.
-
(2006)
Proceedings of the 32nd International Conference on Very Large Data Bases
, pp. 439-450
-
-
Gyngyi, Z.1
Berkhin, P.2
Garcia-Molina, H.3
Pedersen, J.4
-
34
-
-
85131818719
-
Combating Web spam with TrustRank
-
Morgan Kaufmann
-
Gyngyi, Z., Garcia-Molina, H., and Pedersen, J. 2004. Combating Web spam with TrustRank. In Proceedings of the 30th International Conference on Very Large Data Bases (VLDB). Morgan Kaufmann, 576-587.
-
(2004)
Proceedings of the 30th International Conference on Very Large Data Bases (VLDB)
, pp. 576-587
-
-
Gyngyi, Z.1
Garcia-Molina, H.2
Pedersen, J.3
-
35
-
-
0004255482
-
Efficient computation of pagerank
-
Tech. rep, Stanford University
-
Haveliwala, T. 1999. Efficient computation of pagerank. Tech. rep., Stanford University.
-
(1999)
-
-
Haveliwala, T.1
-
36
-
-
0001284253
-
Computing on data streams
-
Henzinger, M. R., Raghavan, P., and Rajagopalan, S. 1999. Computing on data streams. In Dimacs Series in Discrete Mathematics and Theoretical Computer Science, 107-118.
-
(1999)
Dimacs Series in Discrete Mathematics and Theoretical Computer Science
, pp. 107-118
-
-
Henzinger, M.R.1
Raghavan, P.2
Rajagopalan, S.3
-
37
-
-
32344436210
-
Graphs over time: Densification laws, shrinking diameters and possible explanations
-
ACM Press
-
Leskovec, J., Kleinberg, J., and Faloutsos, C. 2005. Graphs over time: densification laws, shrinking diameters and possible explanations. In Proceeding of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining (KDD'05). ACM Press, 177-187.
-
(2005)
Proceeding of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining (KDD'05)
, pp. 177-187
-
-
Leskovec, J.1
Kleinberg, J.2
Faloutsos, C.3
-
41
-
-
0018021561
-
Counting large numbers of events in small registers
-
Morris, R. 1978. Counting large numbers of events in small registers. Comm. ACM 21, 10, 840-842.
-
(1978)
Comm. ACM
, vol.21
, Issue.10
, pp. 840-842
-
-
Morris, R.1
-
42
-
-
34250653315
-
Detecting spam web pages through content analysis
-
Edinburgh, Scotland
-
Ntoulas, A., Najork, M., Manasse, M., and Fetterly, D. 2006. Detecting spam web pages through content analysis. In Proceedings of the World Wide Web Conference. Edinburgh, Scotland. 83-92.
-
(2006)
Proceedings of the World Wide Web Conference
, pp. 83-92
-
-
Ntoulas, A.1
Najork, M.2
Manasse, M.3
Fetterly, D.4
-
43
-
-
0003780986
-
The PageRank citation ranking: Bringing order to the Web
-
Tech. rep, Stanford Digital Library Technologies Project
-
Page, L., Brin, S., Motwani, R., and Winograd, T. 1998. The PageRank citation ranking: bringing order to the Web. Tech. rep., Stanford Digital Library Technologies Project.
-
(1998)
-
-
Page, L.1
Brin, S.2
Motwani, R.3
Winograd, T.4
-
47
-
-
79955064984
-
Detecting link spam using temporal information
-
Hong Kong
-
Shen, G., Gao, B., Liu, T.-Y., Feng, G., Song, S., and Li, H. 2006. Detecting link spam using temporal information. In Proceedings of the International Conference on Data Mining (ICDM). Hong Kong.
-
(2006)
Proceedings of the International Conference on Data Mining (ICDM)
-
-
Shen, G.1
Gao, B.2
Liu, T.-Y.3
Feng, G.4
Song, S.5
Li, H.6
-
48
-
-
0001321490
-
External memory algorithms and data structures
-
Vitter, J. S. 2001. External memory algorithms and data structures. ACM Comput. Surv. 33, 2, 209-271.
-
(2001)
ACM Comput. Surv
, vol.33
, Issue.2
, pp. 209-271
-
-
Vitter, J.S.1
-
51
-
-
35048868826
-
Making eigenvector-based reputation systems robust to collusion
-
Proceedings of the 3rd Workshop on Web Graphs WAW, Springer
-
Zhang, H., Goel, A., Govindan, R., Mason, K., and Van Roy, B. 2004. Making eigenvector-based reputation systems robust to collusion. In Proceedings of the 3rd Workshop on Web Graphs (WAW). Lecture Notes in Computer Science, vol. 3243. Springer, 92-104.
-
(2004)
Lecture Notes in Computer Science
, vol.3243
, pp. 92-104
-
-
Zhang, H.1
Goel, A.2
Govindan, R.3
Mason, K.4
Van Roy, B.5
-
52
-
-
33749580724
-
Linear prediction models with graph regularization for web-page categorization
-
ACM Press
-
Zhang, T., Popescul, A., and Dom, B. 2006. Linear prediction models with graph regularization for web-page categorization. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06). ACM Press, 821-826.
-
(2006)
Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06)
, pp. 821-826
-
-
Zhang, T.1
Popescul, A.2
Dom, B.3
|