SCOPUS 정보 검색 플랫폼

Proceedings of the VLDB Endowment

Volumn 2, Issue 1, 2009, Pages 658-669

Power-law based estimation of set similarity join size

(3) Lee, Hongrae a Ng, Raymond T a Shim, Kyuseok b

a UNIVERSITY OF BRITISH COLUMBIA (Canada)

b Seoul National University (South Korea)

Author keywords

[No Author keywords available]

Indexed keywords

COUNTING TECHNIQUES; EFFICIENT ESTIMATION; EXPERIMENTAL EVALUATION; INCLUSION-EXCLUSION; NOVEL TECHNIQUES; POWER LAW RELATIONSHIP; SET-SIMILARITY JOINS; SUCCINCT REPRESENTATION;

EID: 77957718350 PISSN: None EISSN: 21508097 Source Type: Conference Proceeding
DOI: 10.14778/1687627.1687702 Document Type: Article

Times cited : (29)

References (30)

1
- 0001882616
- Fast Algorithms for Mining Association Rules
- R. Agrawal and R. Srikant. Fast Algorithms for Mining Association Rules. In Proc. VLDB, pages 487-499, 1994.
- (1994) Proc. VLDB , pp. 487-499
- Agrawal, R.¹ Srikant, R.²

2
- 85104914015
- Efficient Exact Set-Similarity Joins
- A. Arasu, V. Ganti, and R. Kaushik. Efficient Exact Set-Similarity Joins. In Proc. VLDB, pages 918-929, 2006.
- (2006) Proc. VLDB , pp. 918-929
- Arasu, A.¹ Ganti, V.² Kaushik, R.³

3
- 35348849154
- Scaling up all pairs similarity search
- R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In Proc. WWW, pages 131-140, 2007.
- (2007) Proc. WWW , pp. 131-140
- Bayardo, R.J.¹ Ma, Y.² Srikant, R.³

4
- 67049119060
- A Randomized Approach for Approximating the Number of Frequent Sets
- M. Boley and H. Grosskreutz. A Randomized Approach for Approximating the Number of Frequent Sets. In Proc. ICDM, pages 43-52, 2008.
- (2008) Proc. ICDM , pp. 43-52
- Boley, M.¹ Grosskreutz, H.²

5
- 0031346696
- On the Resemblance and Containment of Documents
- A. Z. Broder. On the Resemblance and Containment of Documents. In Proc. SEQUENCES, pages 21-29, 1997.
- (1997) Proc. SEQUENCES , pp. 21-29
- Broder, A.Z.¹

6
- 33747680326
- Association mining
- A. Ceglar and J. F. Roddick. Association mining. ACM Computing Surveys, 38(2), 2006.
- (2006) ACM Computing Surveys , vol.38 , Issue.2
- Ceglar, A.¹ Roddick, J.F.²

7
- 33749597967
- A Primitive Operator for Similarity Joins in Data Cleaning
- S. Chaudhuri, V. Ganti, and R. Kaushik. A Primitive Operator for Similarity Joins in Data Cleaning. In Proc. ICDE, page 5, 2006.
- (2006) Proc. ICDE , pp. 5
- Chaudhuri, S.¹ Ganti, V.² Kaushik, R.³

8
- 0033688075
- Selectivity Estimation For Boolean Queries
- Z. Chen, F. Korn, N. Koudas, and S. Muthukrishnan. Selectivity Estimation For Boolean Queries. In Proc. PODS, pages 216-225, 2000.
- (2000) Proc. PODS , pp. 216-225
- Chen, Z.¹ Korn, F.² Koudas, N.³ Muthukrishnan, S.⁴

9
- 46749083938
- Power-law relationship and self-similarity in the itemset support distribution: analysis and applications
- K.-T. Chuang, J.-L. Huang, and M.-S. Chen. Power-law relationship and self-similarity in the itemset support distribution: analysis and applications. The VLDB Journal, 17(5):1121-1141, 2008.
- (2008) The VLDB Journal , vol.17 , Issue.5 , pp. 1121-1141
- Chuang, K.-T.¹ Huang, J.-L.² Chen, M.-S.³

10
- 65549085067
- Power-law distributions in empirical data
- (to appear)
- A. Clauset, C. R. Shalizi, and M. E. J. Newman. Power-law distributions in empirical data. SIAM Review, (to appear) 2009.
- (2009) SIAM Review
- Clauset, A.¹ Shalizi, C.R.² Newman, M.E.J.³

11
- 0031353179
- Size-estimation framework with applications to transitive closure and reachability
- E. Cohen. Size-estimation framework with applications to transitive closure and reachability. Journal of Computer and System Sciences, 55(3):441-453, 1997.
- (1997) Journal of Computer and System Sciences , vol.55 , Issue.3 , pp. 441-453
- Cohen, E.¹

12
- 0036366837
- Mining database structure; or, how to build a data quality browser
- T. Dasu, T. Johnson, S. Muthukrishnan, and V. Shkapenyuk. Mining database structure; or, how to build a data quality browser. In Proc. SIGMOD, pages 240-251, 2002.
- (2002) Proc. SIGMOD
- Dasu, T.¹ Johnson, T.² Muthukrishnan, S.³ Shkapenyuk, V.⁴

13
- 0012904212
- Spatial join selectivity using power laws
- C. Faloutsos, B. Seeger, A. Traina, and J. Caetano Traina. Spatial join selectivity using power laws. In SIGMOD, pages 177-188, 2000.
- (2000) SIGMOD , pp. 177-188
- Faloutsos, C.¹ Seeger, B.² Traina, A.³ Caetano Traina, J.⁴

14
- 77956215528
- Apriori implementation of ferenc bodon
- Ferenc Bodon. Apriori implementation of ferenc bodon. http://www.cs.bme.hu/bodon/en/apriori/.
- Bodon, F.¹

15
- 29844448776
- Conquer: Efficient management of inconsistent databases
- A. Fuxman, E. Fazli, and R. J. Miller. Conquer: Efficient management of inconsistent databases. In Proc. SIGMOD, pages 155-166, 2005.
- (2005) Proc. SIGMOD , pp. 155-166
- Fuxman, A.¹ Fazli, E.² Miller, R.J.³

16
- 52649145249
- Fast Indexes and Algorithms for Set Similarity Selection Queries
- M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast Indexes and Algorithms for Set Similarity Selection Queries. In Proc. ICDE, pages 267-276, 2008.
- (2008) Proc. ICDE , pp. 267-276
- Hadjieleftheriou, M.¹ Chandel, A.² Koudas, N.³ Srivastava, D.⁴

17
- 70349659026
- Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries
- M. Hadjieleftheriou, X. Yu, N. Koudas, and D. Srivastava. Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries. In Proc. VLDB, pages 201-212, 2008.
- (2008) Proc. VLDB , pp. 201-212
- Hadjieleftheriou, M.¹ Yu, X.² Koudas, N.³ Srivastava, D.⁴

18
- 0003768646
- Decision tree for optimization software
- Hans D. Mittelmann. Decision tree for optimization software. http://plato.asu.edu/sub/nonlsq.html.
- Mittelmann, H.D.¹

19
- 84976856849
- The merge/purge problem for large databases
- M. A. Herna'ndez and S. J. Stolfo. The merge/purge problem for large databases. In Proc. SIGMOD, pages 127-138, 1995.
- (1995) Proc. SIGMOD , pp. 127-138
- Herna'ndez, M.A.¹ Stolfo, S.J.²

20
- 0346299592
- Quest synthetic data generation code
- IBM
- IBM. Quest synthetic data generation code. http://www.almaden.ibm.com/cs/disciplines/iis/.

21
- 34250654176
- To search or to crawl?: towards a query optimizer for text-centric tasks
- P. G. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano. To search or to crawl?: towards a query optimizer for text-centric tasks. In Proc. SIGMOD, pages 265-276, 2006.
- (2006) Proc. SIGMOD , pp. 265-276
- Ipeirotis, P.G.¹ Agichtein, E.² Jain, P.³ Gravano, L.⁴

22
- 70349096277
- Estimating the Number of Frequent Itemsets in a Large Database
- R. Jin, S. McCallen, Y. Breitbart, D. Fuhry, and D. Wang. Estimating the Number of Frequent Itemsets in a Large Database. In Proc. EDBT, pages 505-516, 2009.
- (2009) Proc. EDBT , pp. 505-516
- Jin, R.¹ McCallen, S.² Breitbart, Y.³ Fuhry, D.⁴ Wang, D.⁵

23
- 0003476369
- Society for Industrial Mathematics
- C. L. Lawson and R. J. Hanson. Solving Least Squares Problems. Society for Industrial Mathematics, 1987.
- (1987) Solving Least Squares Problems
- Lawson, C.L.¹ Hanson, R.J.²

24
- 85011072445
- Extending Q-Grams to Estimate Selectivity of String Matching with Edit Distance
- H. Lee, R. T. Ng, and K. Shim. Extending Q-Grams to Estimate Selectivity of String Matching with Edit Distance. In Proc. VLDB, pages 195-206, 2007.
- (2007) Proc. VLDB , pp. 195-206
- Lee, H.¹ Ng, R.T.² Shim, K.³

25
- 70349136566
- Approximate Substring Selectivity Estimation
- H. Lee, R. T. Ng, and K. Shim. Approximate Substring Selectivity Estimation. In Proc. EDBT, pages 827-838, 2009.
- (2009) Proc. EDBT , pp. 827-838
- Lee, H.¹ Ng, R.T.² Shim, K.³

26
- 35348835502
- DETECTIVES: DETEcting Coalition hiT Inflation attacks in adVertising nEtworks Streams
- A. Metwally, D. Agrawal, and E. E. Abbadi. DETECTIVES: DETEcting Coalition hiT Inflation attacks in adVertising nEtworks Streams. In Proc. WWW, pages 241-250, 2007.
- (2007) Proc. WWW , pp. 241-250
- Metwally, A.¹ Agrawal, D.² Abbadi, E.E.³

27
- 52649112625
- Self-join size estimation in large-scale distributed data systems
- T. Pitoura and P. Triantafillou. Self-join size estimation in large-scale distributed data systems. In ICDE, pages 764-773, 2008.
- (2008) ICDE , pp. 764-773
- Pitoura, T.¹ Triantafillou, P.²

28
- 34250638291
- A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets
- M. Sahami and T. Heilman. A Web-based Kernel Function for Measuring the Similarity of Short Text Snippets. In Proc. WWW, pages 377-386, 2006.
- (2006) Proc. WWW , pp. 377-386
- Sahami, M.¹ Heilman, T.²

29
- 3142777876
- Efficient set joins on similarity predicates
- S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In Proc. SIGMOD, pages 743-754, 2004.
- (2004) Proc. SIGMOD , pp. 743-754
- Sarawagi, S.¹ Kirpal, A.²

30
- 0003586256
- Addison-Wesley
- G. K. Zipf. Human Behavior and the Principle of Least Effort. Addison-Wesley, 1947.
- (1947) Human Behavior and the Principle of Least Effort
- Zipf, G.K.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.