2009, Pages 402-411

Finding duplicates in a data stream

Author keywords

[No Author keywords available]

Indexed keywords

DATA STREAM; HALF-SPACE; ISOLATION LEMMATA; LINEAR SPACES; ONE-PASS; POSITIVE FREQUENCIES; RANDOMIZED ALGORITHMS; SINGLE PASS

EID: 70349155021     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1137/1.9781611973068.45     Document Type: Conference Paper
Times cited: 22

References (17)
  • 1
    • 0033077324
    • The space complexity of approximating the frequency moments
    • [AMS99] N. Alon, Y. Matias, and M. Szegedy. The space complexity of approximating the frequency moments. JCSS, 58(1):137-147, 1999.
    • (1999) JCSS , vol.58 , Issue.1 , pp. 137-147
    • Alon, N.1    Matias, Y.2    Szegedy, M.3
  • 2
  • 3
    • 84880464612
    • Efficient URL caching for world wide web crawling
    • [BNW03] A. Z. Broder, M. Najork, and J. L. Wiener. Efficient URL caching for world wide web crawling. In WWW, 2003.
    • (2003) WWW
    • Broder, A.Z.1    Najork, M.2    Wiener, J.L.3
  • 4
    • 0002739272
    • On the power of two-point based sampling
    • [CG89] B. Chor and O. Goldreich. On the power of two-point based sampling. J. Complexity, 5(1):96-106, 1989.
    • (1989) J. Complexity , vol.5 , Issue.1 , pp. 96-106
    • Chor, B.1    Goldreich, O.2
  • 5
    • 70349118455
    • Top-k frequent item maintenance over streams
    • [Cha08] M. Charikar. Top-k frequent item maintenance over streams. Survey article, 2008.
    • (2008) Survey Article
    • Charikar, M.1
  • 7
    • 34250668863
    • Approximately detecting duplicates for streaming data using stable Bloom filters
    • DOI 10.1145/1142473.1142477, SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data
    • [DR06] F. Deng and D. Rafiei. Approximately detecting duplicates for streaming data using stable Bloom filters. In SIGMOD Conference, pages 25-36, 2006. (Pubitemid 46946501)
    • (2006) Proceedings of the ACM SIGMOD International Conference on Management of Data , pp. 25-36
    • Deng, F.1    Rafiei, D.2
  • 10
    • 33746851190
    • Stable distributions, pseudorandom generators, embeddings, and data stream computation
    • DOI 10.1145/1147954.1147955
    • [Ind06] P. Indyk. Stable distributions, pseudorandom generators, embeddings and data stream computation. J. ACM, 53(3):307-323, 2006. (Pubitemid 44180740)
    • (2006) Journal of the ACM , vol.53 , Issue.3 , pp. 307-323
    • Indyk, P.1
  • 12
    • 0001190474
    • The probabilistic communication complexity of set intersection
    • [KS92] B. Kalyanasundaram and G. Schnitger. The probabilistic communication complexity of set intersection. SIAM Journal on Discrete Math, 5(5):545-557, 1992.
    • (1992) SIAM Journal on Discrete Math , vol.5 , Issue.5 , pp. 545-557
    • Kalyanasundaram, B.1    Schnitger, G.2
  • 13
    • 34548333306
    • Duplicate detection in click streams
    • [MAA05] A. Metwally, D. Agrawal, and A. El Abbadi. Duplicate detection in click streams. In WWW, pages 12-21, 2005.
    • (2005) WWW , pp. 12-21
    • Metwally, A.1    Agrawal, D.2    El Abbadi, A.3
  • 15
    • 0000947929
    • Pseudorandom generators for space-bounded computation
    • [Nis92] N. Nisan. Pseudorandom generators for space-bounded computation. Combinatorica, 12(4):449-461, 1992.
    • (1992) Combinatorica , vol.12 , Issue.4 , pp. 449-461
    • Nisan, N.1
  • 16
    • 0026990361
    • On the distributional complexity of disjointness
    • [Raz92] A. A. Razborov. On the distributional complexity of disjointness. Theoretical Computer Science, 106(2):385-390, 1992.
    • (1992) Theoretical Computer Science , vol.106 , Issue.2 , pp. 385-390
    • Razborov, A.A.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.