-
1
-
-
85104914015
-
Efficient exact set-similarity joins
-
A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
-
(2006)
VLDB
, pp. 918-929
-
-
Arasu, A.1
Ganti, V.2
Kaushik, R.3
-
2
-
-
35348849154
-
Scaling up all pairs similarity search
-
R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In WWW, pages 131-140, 2007.
-
(2007)
WWW
, pp. 131-140
-
-
Bayardo, R.J.1
Ma, Y.2
Srikant, R.3
-
3
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In SIGMOD Conference, pages 313-324, 2003.
-
(2003)
SIGMOD Conference
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
4
-
-
33749597967
-
A primitive operator for similarity joins in data cleaning
-
S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, pages 5-16, 2006.
-
(2006)
ICDE
, pp. 5-16
-
-
Chaudhuri, S.1
Ganti, V.2
Kaushik, R.3
-
5
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
L. Gravano, P. G. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava. Approximate string joins in a database (almost) for free. In VLDB, pages 491-500, 2001.
-
(2001)
VLDB
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.G.2
Jagadish, H.V.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
6
-
-
52649145249
-
Fast indexes and algorithms for set similarity selection queries
-
M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast indexes and algorithms for set similarity selection queries. In ICDE, pages 267-276, 2008.
-
(2008)
ICDE
, pp. 267-276
-
-
Hadjieleftheriou, M.1
Chandel, A.2
Koudas, N.3
Srivastava, D.4
-
7
-
-
70349659026
-
Hashed samples: Selectivity estimators for set similarity selection queries
-
M. Hadjieleftheriou, X. Yu, N. Koudas, and D. Srivastava. Hashed samples: selectivity estimators for set similarity selection queries. PVLDB, 1(1):201-212, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 201-212
-
-
Hadjieleftheriou, M.1
Yu, X.2
Koudas, N.3
Srivastava, D.4
-
9
-
-
33745607646
-
Selectivity estimation for fuzzy string predicates in large data sets
-
L. Jin and C. Li. Selectivity estimation for fuzzy string predicates in large data sets. In VLDB, pages 397-408, 2005.
-
(2005)
VLDB
, pp. 397-408
-
-
Jin, L.1
Li, C.2
-
10
-
-
33745621089
-
n-gram/2l: A space and time efficient two-level n-gram inverted index structure
-
M.-S. Kim, K.-Y. Whang, J.-G. Lee, and M.-J. Lee. n-gram/2l: A space and time efficient two-level n-gram inverted index structure. In VLDB, pages 325-336, 2005.
-
(2005)
VLDB
, pp. 325-336
-
-
Kim, M.-S.1
Whang, K.-Y.2
Lee, J.-G.3
Lee, M.-J.4
-
11
-
-
77957718350
-
Power-law based estimation of set similarity join size
-
H. Lee, R. T. Ng, and K. Shim. Power-law based estimation of set similarity join size. PVLDB, 2(1):658-669, 2009.
-
(2009)
PVLDB
, vol.2
, Issue.1
, pp. 658-669
-
-
Lee, H.1
Ng, R.T.2
Shim, K.3
-
12
-
-
81055146159
-
Similarity join size estimation using locality sensitive hashing
-
H. Lee, R. T. Ng, and K. Shim. Similarity join size estimation using locality sensitive hashing. PVLDB, 4(6):338-349, 2011.
-
(2011)
PVLDB
, vol.4
, Issue.6
, pp. 338-349
-
-
Lee, H.1
Ng, R.T.2
Shim, K.3
-
13
-
-
52649086729
-
Efficient merging and filtering algorithms for approximate string searches
-
C. Li, J. Lu, and Y. Lu. Efficient merging and filtering algorithms for approximate string searches. In ICDE, 2008.
-
(2008)
ICDE
-
-
Li, C.1
Lu, J.2
Lu, Y.3
-
14
-
-
85011032600
-
Vgram: Improving performance of approximate queries on string collections using variable-length grams
-
C. Li, B. Wang, and X. Yang. Vgram: Improving performance of approximate queries on string collections using variable-length grams. In VLDB, pages 303-314, 2007.
-
(2007)
VLDB
, pp. 303-314
-
-
Li, C.1
Wang, B.2
Yang, X.3
-
15
-
-
79959922359
-
Faerie: Efficient filtering algorithms for approximate dictionary-based entity extraction
-
G. Li, D. Deng, and J. Feng. Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction. In SIGMOD Conference, pages 529-540, 2011.
-
(2011)
SIGMOD Conference
, pp. 529-540
-
-
Li, G.1
Deng, D.2
Feng, J.3
-
16
-
-
84862671407
-
Pass-join: A partition-based method for similarity joins
-
G. Li, D. Deng, J. Wang, and J. Feng. Pass-join: A partition-based method for similarity joins. PVLDB, 5(3):253-264, 2011.
-
(2011)
PVLDB
, vol.5
, Issue.3
, pp. 253-264
-
-
Li, G.1
Deng, D.2
Wang, J.3
Feng, J.4
-
17
-
-
34547421874
-
Estimating the selectivity of approximate string queries
-
A. Mazeika, M. H. Böhlen, N. Koudas, and D. Srivastava. Estimating the selectivity of approximate string queries. ACM Trans. Database Syst., 32(2):12, 2007.
-
(2007)
ACM Trans. Database Syst.
, vol.32
, Issue.2
, pp. 12
-
-
Mazeika, A.1
Böhlen, M.H.2
Koudas, N.3
Srivastava, D.4
-
18
-
-
0345566149
-
A guided tour to approximate string matching
-
G. Navarro. A guided tour to approximate string matching. ACM Comput. Surv., 33(1):31-88, 2001.
-
(2001)
ACM Comput. Surv.
, vol.33
, Issue.1
, pp. 31-88
-
-
Navarro, G.1
-
19
-
-
79960001806
-
Efficient exact edit similarity query processing with the asymmetric signature scheme
-
J. Qin, W. Wang, Y. Lu, C. Xiao, and X. Lin. Efficient exact edit similarity query processing with the asymmetric signature scheme. In SIGMOD Conference, pages 1033-1044, 2011.
-
(2011)
SIGMOD Conference
, pp. 1033-1044
-
-
Qin, J.1
Wang, W.2
Lu, Y.3
Xiao, C.4
Lin, X.5
-
20
-
-
3142777876
-
Efficient set joins on similarity predicates
-
S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In SIGMOD Conference, pages 743-754, 2004.
-
(2004)
SIGMOD Conference
, pp. 743-754
-
-
Sarawagi, S.1
Kirpal, A.2
-
21
-
-
77952772124
-
The similarity join database operator
-
Y. N. Silva, W. G. Aref, and M. H. Ali. The similarity join database operator. In ICDE, pages 892-903, 2010.
-
(2010)
ICDE
, pp. 892-903
-
-
Silva, Y.N.1
Aref, W.G.2
Ali, M.H.3
-
22
-
-
77954744650
-
Efficient parallel set-similarity joins using MapReduce
-
R. Vernica, M. J. Carey, and C. Li. Efficient parallel set-similarity joins using MapReduce. In SIGMOD Conference, pages 495-506, 2010.
-
(2010)
SIGMOD Conference
, pp. 495-506
-
-
Vernica, R.1
Carey, M.J.2
Li, C.3
-
23
-
-
79957822983
-
Trie-join: Efficient trie-based string similarity joins with edit-distance constraints
-
J. Wang, G. Li, and J. Feng. Trie-join: Efficient trie-based string similarity joins with edit-distance constraints. PVLDB, 3(1):1219-1230, 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
, pp. 1219-1230
-
-
Wang, J.1
Li, G.2
Feng, J.3
-
24
-
-
79957824788
-
Fast-join: An efficient method for fuzzy token matching based string similarity join
-
J. Wang, G. Li, and J. Feng. Fast-join: An efficient method for fuzzy token matching based string similarity join. In ICDE, pages 458-469, 2011.
-
(2011)
ICDE
, pp. 458-469
-
-
Wang, J.1
Li, G.2
Feng, J.3
-
25
-
-
70849105253
-
Ed-join: An efficient algorithm for similarity joins with edit distance constraints
-
C. Xiao, W. Wang, and X. Lin. Ed-join: an efficient algorithm for similarity joins with edit distance constraints. PVLDB, 1(1):933-944, 2008.
-
(2008)
PVLDB
, vol.1
, Issue.1
, pp. 933-944
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
-
26
-
-
67649653766
-
Top-k set similarity joins
-
C. Xiao, W. Wang, X. Lin, and H. Shang. Top-k set similarity joins. In ICDE, pages 916-927, 2009.
-
(2009)
ICDE
, pp. 916-927
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Shang, H.4
-
27
-
-
66249113620
-
Efficient similarity joins for near duplicate detection
-
C. Xiao, W. Wang, X. Lin, and J. X. Yu. Efficient similarity joins for near duplicate detection. In WWW, 2008.
-
(2008)
WWW
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Yu, J.X.4
-
28
-
-
79960002413
-
Atlas: A probabilistic algorithm for high dimensional similarity search
-
J. Zhai, Y. Lou, and J. Gehrke. Atlas: a probabilistic algorithm for high dimensional similarity search. In SIGMOD Conference, pages 997-1008, 2011.
-
(2011)
SIGMOD Conference
, pp. 997-1008
-
-
Zhai, J.1
Lou, Y.2
Gehrke, J.3
-
29
-
-
77954747181
-
Bed-tree: An all-purpose index structure for string similarity search based on edit distance
-
Z. Zhang, M. Hadjieleftheriou, B. C. Ooi, and D. Srivastava. Bed-tree: an all-purpose index structure for string similarity search based on edit distance. In SIGMOD, 2010.
-
(2010)
SIGMOD
-
-
Zhang, Z.1
Hadjieleftheriou, M.2
Ooi, B.C.3
Srivastava, D.4