Volume 2, Issue 1, 2009, Pages 658-669

Power-law based estimation of set similarity join size

Author keywords

[No Author keywords available]

Indexed keywords

COUNTING TECHNIQUES; EFFICIENT ESTIMATION; EXPERIMENTAL EVALUATION; INCLUSION-EXCLUSION; NOVEL TECHNIQUES; POWER LAW RELATIONSHIP; SET-SIMILARITY JOINS; SUCCINCT REPRESENTATION;

EID: 77957718350     PISSN: None     EISSN: 21508097     Source Type: Conference Proceeding    
DOI: 10.14778/1687627.1687702     Document Type: Article
Times cited : (29)

References (30)
  • 1
    • 0001882616
    • Fast Algorithms for Mining Association Rules
    • R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules. In Proc. VLDB, pages 487-499, 1994.
    • (1994) Proc. VLDB , pp. 487-499
    • Agrawal, R.1    Srikant, R.2
  • 2
    • 85104914015
    • Efficient Exact Set-Similarity Joins
    • A. Arasu, V. Ganti, and R. Kaushik. Efficient Exact Set-Similarity Joins. In Proc. VLDB, pages 918-929, 2006.
    • (2006) Proc. VLDB , pp. 918-929
    • Arasu, A.1    Ganti, V.2    Kaushik, R.3
  • 3
    • 35348849154
    • Scaling up all pairs similarity search
    • R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In Proc. WWW, pages 131-140, 2007.
    • (2007) Proc. WWW , pp. 131-140
    • Bayardo, R.J.1    Ma, Y.2    Srikant, R.3
  • 4
    • 67049119060
    • A Randomized Approach for Approximating the Number of Frequent Sets
    • M. Boley and H. Grosskreutz. A Randomized Approach for Approximating the Number of Frequent Sets. In Proc. ICDM, pages 43-52, 2008.
    • (2008) Proc. ICDM , pp. 43-52
    • Boley, M.1    Grosskreutz, H.2
  • 5
    • 0031346696
    • On the Resemblance and Containment of Documents
    • A. Z. Broder. On the Resemblance and Containment of Documents. In Proc. SEQUENCES, pages 21-29, 1997.
    • (1997) Proc. SEQUENCES , pp. 21-29
    • Broder, A.Z.1
  • 7
    • 33749597967
    • A Primitive Operator for Similarity Joins in Data Cleaning
    • S. Chaudhuri, V. Ganti, and R. Kaushik. A Primitive Operator for Similarity Joins in Data Cleaning. In Proc. ICDE, page 5, 2006.
    • (2006) Proc. ICDE , p. 5
    • Chaudhuri, S.1    Ganti, V.2    Kaushik, R.3
  • 9
    • 46749083938
    • Power-law relationship and self-similarity in the itemset support distribution: analysis and applications
    • K.-T. Chuang, J.-L. Huang, and M.-S. Chen. Power-law relationship and self-similarity in the itemset support distribution: analysis and applications. The VLDB Journal, 17(5):1121-1141, 2008.
    • (2008) The VLDB Journal , vol.17 , Issue.5 , pp. 1121-1141
    • Chuang, K.-T.1    Huang, J.-L.2    Chen, M.-S.3
  • 10
  • 11
    • 0031353179
    • Size-estimation framework with applications to transitive closure and reachability
    • E. Cohen. Size-estimation framework with applications to transitive closure and reachability. Journal of Computer and System Sciences, 55(3):441-453, 1997.
    • (1997) Journal of Computer and System Sciences , vol.55 , Issue.3 , pp. 441-453
    • Cohen, E.1
  • 12
    • 0036366837
    • Mining database structure; or, how to build a data quality browser
    • T. Dasu, T. Johnson, S. Muthukrishnan, and V. Shkapenyuk. Mining database structure; or, how to build a data quality browser. In Proc. SIGMOD, pages 240-251, 2002.
    • (2002) Proc. SIGMOD , pp. 240-251
    • Dasu, T.1    Johnson, T.2    Muthukrishnan, S.3    Shkapenyuk, V.4
  • 14
    • 77956215528
    • Apriori implementation of Ferenc Bodon
    • F. Bodon. Apriori implementation of Ferenc Bodon. http://www.cs.bme.hu/bodon/en/apriori/.
    • Bodon, F.1
  • 15
    • 29844448776
    • Conquer: Efficient management of inconsistent databases
    • A. Fuxman, E. Fazli, and R. J. Miller. Conquer: Efficient management of inconsistent databases. In Proc. SIGMOD, pages 155-166, 2005.
    • (2005) Proc. SIGMOD , pp. 155-166
    • Fuxman, A.1    Fazli, E.2    Miller, R.J.3
  • 16
    • 52649145249
    • Fast Indexes and Algorithms for Set Similarity Selection Queries
    • M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast Indexes and Algorithms for Set Similarity Selection Queries. In Proc. ICDE, pages 267-276, 2008.
    • (2008) Proc. ICDE , pp. 267-276
    • Hadjieleftheriou, M.1    Chandel, A.2    Koudas, N.3    Srivastava, D.4
  • 17
    • 70349659026
    • Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries
    • M. Hadjieleftheriou, X. Yu, N. Koudas, and D. Srivastava. Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries. In Proc. VLDB, pages 201-212, 2008.
    • (2008) Proc. VLDB , pp. 201-212
    • Hadjieleftheriou, M.1    Yu, X.2    Koudas, N.3    Srivastava, D.4
  • 18
    • 0003768646
    • Decision tree for optimization software
    • H. D. Mittelmann. Decision tree for optimization software. http://plato.asu.edu/sub/nonlsq.html.
    • Mittelmann, H.D.1
  • 19
    • 84976856849
    • The merge/purge problem for large databases
    • M. A. Hernández and S. J. Stolfo. The merge/purge problem for large databases. In Proc. SIGMOD, pages 127-138, 1995.
    • (1995) Proc. SIGMOD , pp. 127-138
    • Hernández, M.A.1    Stolfo, S.J.2
  • 20
    • 0346299592
    • Quest synthetic data generation code
    • IBM
    • IBM. Quest synthetic data generation code. http://www.almaden.ibm.com/cs/disciplines/iis/.
  • 21
    • 34250654176
    • To search or to crawl?: towards a query optimizer for text-centric tasks
    • P. G. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano. To search or to crawl?: towards a query optimizer for text-centric tasks. In Proc. SIGMOD, pages 265-276, 2006.
    • (2006) Proc. SIGMOD , pp. 265-276
    • Ipeirotis, P.G.1    Agichtein, E.2    Jain, P.3    Gravano, L.4
  • 22
    • 70349096277
    • Estimating the Number of Frequent Itemsets in a Large Database
    • R. Jin, S. McCallen, Y. Breitbart, D. Fuhry, and D. Wang. Estimating the Number of Frequent Itemsets in a Large Database. In Proc. EDBT, pages 505-516, 2009.
    • (2009) Proc. EDBT , pp. 505-516
    • Jin, R.1    McCallen, S.2    Breitbart, Y.3    Fuhry, D.4    Wang, D.5
  • 24
    • 85011072445
    • Extending Q-Grams to Estimate Selectivity of String Matching with Edit Distance
    • H. Lee, R. T. Ng, and K. Shim. Extending Q-Grams to Estimate Selectivity of String Matching with Edit Distance. In Proc. VLDB, pages 195-206, 2007.
    • (2007) Proc. VLDB , pp. 195-206
    • Lee, H.1    Ng, R.T.2    Shim, K.3
  • 25
    • 70349136566
    • Approximate Substring Selectivity Estimation
    • H. Lee, R. T. Ng, and K. Shim. Approximate Substring Selectivity Estimation. In Proc. EDBT, pages 827-838, 2009.
    • (2009) Proc. EDBT , pp. 827-838
    • Lee, H.1    Ng, R.T.2    Shim, K.3
  • 26
    • 35348835502
    • DETECTIVES: DETEcting Coalition hiT Inflation attacks in adVertising nEtworks Streams
    • A. Metwally, D. Agrawal, and E. E. Abbadi. DETECTIVES: DETEcting Coalition hiT Inflation attacks in adVertising nEtworks Streams. In Proc. WWW, pages 241-250, 2007.
    • (2007) Proc. WWW , pp. 241-250
    • Metwally, A.1    Agrawal, D.2    Abbadi, E.E.3
  • 27
    • 52649112625
    • Self-join size estimation in large-scale distributed data systems
    • T. Pitoura and P. Triantafillou. Self-join size estimation in large-scale distributed data systems. In Proc. ICDE, pages 764-773, 2008.
    • (2008) Proc. ICDE , pp. 764-773
    • Pitoura, T.1    Triantafillou, P.2
  • 28
    • 34250638291
    • A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets
    • M. Sahami and T. Heilman. A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets. In Proc. WWW, pages 377-386, 2006.
    • (2006) Proc. WWW , pp. 377-386
    • Sahami, M.1    Heilman, T.2
  • 29
    • 3142777876
    • Efficient set joins on similarity predicates
    • S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In Proc. SIGMOD, pages 743-754, 2004.
    • (2004) Proc. SIGMOD , pp. 743-754
    • Sarawagi, S.1    Kirpal, A.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.