-
1
-
-
0032777414
-
A small approximately min-wise independent family of hash functions
-
Indyk, P.: A small approximately min-wise independent family of hash functions. In: Journal of Algorithms. (1999) 454-456.
-
(1999)
Journal of Algorithms
, pp. 454-456
-
-
Indyk, P.1
-
2
-
-
70349672710
-
Tighter estimation using bottom k sketches
-
Cohen, E., Kaplan, H.: Tighter estimation using bottom k sketches. PVLDB 1(1) (2008) 213-224.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 213-224
-
-
Cohen, E.1
Kaplan, H.2
-
4
-
-
0031353179
-
Size-Estimation Framework with Applications to Transitive Closure and Reachability
-
Cohen, E.: Size-estimation framework with applications to transitive closure and reachability. J. Comput. Syst. Sci. 55(3) (1997) 441-453. (Pubitemid 127432363)
-
(1997)
Journal of Computer and System Sciences
, vol.55
, Issue.3
, pp. 441-453
-
-
Cohen, E.1
-
5
-
-
84938057127
-
Estimating rarity and similarity over data stream windows
-
volume 2461 of Lecture Notes in Computer Science
-
Datar, M., Muthukrishnan, S.: Estimating rarity and similarity over data stream windows. In: Proceedings of 10th Annual European Symposium on Algorithms, volume 2461 of Lecture Notes in Computer Science. (2002) 323-334.
-
(2002)
Proceedings of 10th Annual European Symposium on Algorithms
, pp. 323-334
-
-
Datar, M.1
Muthukrishnan, S.2
-
7
-
-
77953112255
-
Evaluating strategies for similarity search on the web
-
New York, NY, USA, ACM
-
Haveliwala, T.H., Gionis, A., Klein, D., Indyk, P.: Evaluating strategies for similarity search on the web. In: WWW'02: Proceedings of the 11th international conference on World Wide Web, New York, NY, USA, ACM (2002) 432-442.
-
(2002)
WWW'02: Proceedings of the 11th International Conference on World Wide Web
, pp. 432-442
-
-
Haveliwala, T.H.1
Gionis, A.2
Klein, D.3
Indyk, P.4
-
8
-
-
70350625168
-
Sketching algorithms for approximating rank correlations in collaborative filtering systems
-
Karlgren, J. Tarhio, J. Hyyrö, H. eds., Volume 5721 of Lecture Notes in Computer Science. Springer
-
Bachrach, Y., Herbrich, R., Porat, E.: Sketching algorithms for approximating rank correlations in collaborative filtering systems. In Karlgren, J., Tarhio, J., Hyyrö, H., eds.: SPIRE. Volume 5721 of Lecture Notes in Computer Science, Springer (2009) 344-352.
-
(2009)
SPIRE
, pp. 344-352
-
-
Bachrach, Y.1
Herbrich, R.2
Porat, E.3
-
9
-
-
77957061059
-
Sketching techniques for collaborative filtering
-
Pasadena, California
-
Bachrach, Y., Porat, E., Rosenschein, J.S.: Sketching techniques for collaborative filtering. In: The Twenty-First International Joint Conference on Artificial Intelligence (IJCAI 2009), Pasadena, California (2009) 2016-2021.
-
(2009)
Twenty-first International Joint Conference on Artificial Intelligence (IJCAI 2009)
, pp. 2016-2021
-
-
Bachrach, Y.1
Porat, E.2
Rosenschein, J.S.3
-
10
-
-
35348911985
-
Detecting near-duplicates for web crawling
-
DOI 10.1145/1242572.1242592, 16th International World Wide Web Conference, WWW2007
-
Manku, G.S., Jain, A., Das Sarma, A.: Detecting near-duplicates for web crawling. In: WWW'07: Proceedings of the 16th international conference on World Wide Web, New York, NY, USA, ACM (2007) 141-150. (Pubitemid 47582246)
-
(2007)
16th International World Wide Web Conference, WWW2007
, pp. 141-150
-
-
Manku, G.S.1
Jain, A.2
Das Sarma, A.3
-
11
-
-
29844455221
-
What's new: Finding significant differences in network data streams
-
DOI 10.1109/TNET.2005.860096
-
Cormode, G., Muthukrishnan, S.: What's new: finding significant differences in network data streams. IEEE/ACM Trans. Netw. 13(6) (2005) 1219-1232. (Pubitemid 43034131)
-
(2005)
IEEE/ACM Transactions on Networking
, vol.13
, Issue.6
, pp. 1219-1232
-
-
Cormode, G.1
Muthukrishnan, S.2
-
12
-
-
1142303685
-
Processing set expressions over continuous update streams
-
New York, NY, USA, ACM
-
Ganguly, S., Garofalakis, M., Rastogi, R.: Processing set expressions over continuous update streams. In: SIGMOD'03: Proceedings of the 2003 ACM SIGMOD international conference on Management of data, New York, NY, USA, ACM (2003) 265-276.
-
(2003)
SIGMOD'03: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data
, pp. 265-276
-
-
Ganguly, S.1
Garofalakis, M.2
Rastogi, R.3
-
13
-
-
35348914807
-
Google news personalization: Scalable online collaborative filtering
-
DOI 10.1145/1242572.1242610, 16th International World Wide Web Conference, WWW2007
-
Das, A.S., Datar, M., Garg, A., Rajaram, S.: Google news personalization: scalable online collaborative filtering. In: WWW'07: Proceedings of the 16th international conference on World Wide Web, New York, NY, USA, ACM (2007) 271-280. (Pubitemid 47582257)
-
(2007)
16th International World Wide Web Conference, WWW2007
, pp. 271-280
-
-
Das, A.S.1
Datar, M.2
Garg, A.3
Rajaram, S.4
-
14
-
-
0034832662
-
Estimating simple functions on the union of data streams
-
New York, NY, USA, ACM
-
Gibbons, P.B., Tirthapura, S.: Estimating simple functions on the union of data streams. In: SPAA'01: Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures, New York, NY, USA, ACM (2001) 281-291.
-
(2001)
SPAA'01: Proceedings of the Thirteenth Annual ACM Symposium on Parallel Algorithms and Architectures
, pp. 281-291
-
-
Gibbons, P.B.1
Tirthapura, S.2
-
15
-
-
77953112255
-
Evaluating strategies for similarity search on the web
-
New York, NY, USA, ACM
-
Haveliwala, T.H., Gionis, A., Klein, D., Indyk, P.: Evaluating strategies for similarity search on the web. In: WWW'02: Proceedings of the 11th international conference on World Wide Web, New York, NY, USA, ACM (2002) 432-442.
-
(2002)
WWW'02: Proceedings of the 11th International Conference on World Wide Web
, pp. 432-442
-
-
Haveliwala, T.H.1
Gionis, A.2
Klein, D.3
Indyk, P.4
-
16
-
-
33750311279
-
Near-duplicate detection by instance-level constrained clustering
-
New York, NY, USA, ACM
-
Yang, H., Callan, J.: Near-duplicate detection by instance-level constrained clustering. In: SIGIR'06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, New York, NY, USA, ACM (2006) 421-428.
-
(2006)
SIGIR'06: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 421-428
-
-
Yang, H.1
Callan, J.2
-
17
-
-
33750296887
-
Finding near-duplicate web pages: A large-scale evaluation of algorithms
-
New York, NY, USA, ACM
-
Henzinger, M.: Finding near-duplicate web pages: a large-scale evaluation of algorithms. In: SIGIR'06: Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, New York, NY, USA, ACM (2006) 284-291.
-
(2006)
SIGIR'06: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
, pp. 284-291
-
-
Henzinger, M.1
-
18
-
-
52649158132
-
Greedy list intersection
-
Krauthgamer, R., Mehta, A., Raman, V., Rudra, A.: Greedy list intersection. In: IEEE 24th International Conference on Data Engineering, 2008. ICDE 2008. (2008) 1033-1042.
-
(2008)
IEEE 24th International Conference on Data Engineering, 2008. ICDE 2008
, pp. 1033-1042
-
-
Krauthgamer, R.1
Mehta, A.2
Raman, V.3
Rudra, A.4
-
20
-
-
84969151029
-
A near-optimal algorithm for computing the entropy of a stream
-
Bansal, N. Pruhs, K. Stein, C. eds., SIAM
-
Chakrabarti, A., Cormode, G., McGregor, A.: A near-optimal algorithm for computing the entropy of a stream. In Bansal, N., Pruhs, K., Stein, C., eds.: SODA, SIAM (2007) 328-335.
-
(2007)
SODA
, pp. 328-335
-
-
Chakrabarti, A.1
Cormode, G.2
McGregor, A.3
-
21
-
-
52749092341
-
Randomized geometric algorithms and pseudo-random generators
-
Washington, DC, USA, IEEE Computer Society
-
Mulmuley, K.: Randomized geometric algorithms and pseudo-random generators. In: SFCS'92: Proceedings of the 33rd Annual Symposium on Foundations of Computer Science, Washington, DC, USA, IEEE Computer Society (1992) 90-100.
-
(1992)
SFCS'92: Proceedings of the 33rd Annual Symposium on Foundations of Computer Science
, pp. 90-100
-
-
Mulmuley, K.1
-
22
-
-
0031620041
-
Min-wise independent permutations (extended abstract)
-
New York, NY, USA, ACM
-
Broder, A.Z., Charikar, M., Frieze, A.M., Mitzenmacher, M.: Min-wise independent permutations (extended abstract). In: STOC'98: Proceedings of the thirtieth annual ACM symposium on Theory of computing, New York, NY, USA, ACM (1998) 327-336.
-
(1998)
STOC'98: Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing
, pp. 327-336
-
-
Broder, A.Z.1
Charikar, M.2
Frieze, A.M.3
Mitzenmacher, M.4
-
23
-
-
0031346696
-
On the resemblance and containment of documents
-
(SEQUENCES97, IEEE Computer Society)
-
Broder, A.Z.: On the resemblance and containment of documents. In: Compression and Complexity of Sequences (SEQUENCES97), IEEE Computer Society (1997) 21-29.
-
(1997)
Compression and Complexity of Sequences
, pp. 21-29
-
-
Broder, A.Z.1
-
24
-
-
0010362121
-
Syntactic clustering of the web
-
Essex, UK, Elsevier Science Publishers Ltd.
-
Broder, A.Z., Glassman, S.C., Manasse, M.S., Zweig, G.: Syntactic clustering of the web. In: Selected papers from the sixth international conference on World Wide Web, Essex, UK, Elsevier Science Publishers Ltd. (1997) 1157-1166.
-
(1997)
Selected Papers from the Sixth International Conference on World Wide Web
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
25
-
-
79955732417
-
-
Cohen, E., Datar, M., Fujiwara, S., Gionis, A., Indyk, P., Motwani, R., Ullman, J.D., Yang, C.: Finding interesting associations without support pruning (1999).
-
(1999)
Finding Interesting Associations without Support Pruning
-
-
Cohen, E.1
Datar, M.2
Fujiwara, S.3
Gionis, A.4
Indyk, P.5
Motwani, R.6
Ullman, J.D.7
Yang, C.8
-
26
-
-
0033907633
-
Low discrepancy sets yield approximate min-wise independent permutation families
-
DOI 10.1016/S0020-0190(99)00163-5
-
Saks, M., Srinivasan, A., Zhou, S., Zuckerman, D.: Low discrepancy sets yield approximate min-wise independent permutation families. In: Proc. International Workshop on Randomization and Approximation Techniques in Computer Science, Springer (1999) 29-32. (Pubitemid 30563377)
-
(2000)
Information Processing Letters
, vol.73
, Issue.1-2
, pp. 29-32
-
-
Saks, M.1
Srinivasan, A.2
Zhou, S.3
Zuckerman, D.4
-
28
-
-
84958668671
-
A de-randomization using min-wise independent permutations
-
Springer
-
Broder, A.Z., Charikar, M., Mitzenmacher, M.: A de-randomization using min-wise independent permutations. In: Randomization and approximation techniques in computer science, (Springer) 15-24.
-
Randomization and Approximation Techniques in Computer Science
, pp. 15-24
-
-
Broder, A.Z.1
Charikar, M.2
Mitzenmacher, M.3
-
29
-
-
0003415652
-
-
Addison-Wesley Longman Publishing Co. Inc. Boston, MA, USA
-
Aho, A.V., Hopcroft, J.E.: The Design and Analysis of Computer Algorithms. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA (1974).
-
(1974)
The Design and Analysis of Computer Algorithms
-
-
Aho, A.V.1
Hopcroft, J.E.2
-
30
-
-
84959038140
-
Counting distinct elements in a data stream
-
London, UK, Springer-Verlag
-
Bar-Yossef, Z., Jayram, T.S., Kumar, R., Sivakumar, D., Trevisan, L.: Counting distinct elements in a data stream. In: RANDOM'02: Proceedings of the 6th International Workshop on Randomization and Approximation Techniques, London, UK, Springer-Verlag (2002) 1-10.
-
(2002)
RANDOM'02: Proceedings of the 6th International Workshop on Randomization and Approximation Techniques
, pp. 1-10
-
-
Bar-Yossef, Z.1
Jayram, T.S.2
Kumar, R.3
Sivakumar, D.4
Trevisan, L.5
-
31
-
-
77954742764
-
An optimal algorithm for the distinct elements problem
-
New York, NY, USA, ACM
-
Kane, D.M., Nelson, J., Woodruff, D.P.: An optimal algorithm for the distinct elements problem. In: PODS'10: Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, New York, NY, USA, ACM (2010) 41-52.
-
(2010)
PODS'10: Proceedings of the Twenty-ninth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems
, pp. 41-52
-
-
Kane, D.M.1
Nelson, J.2
Woodruff, D.P.3