-
1
-
-
22044441103
-
Inverted index compression using wordaligned binary codes
-
V. N. Anh and A. Moffat. Inverted index compression using wordaligned binary codes. Inf. Retr., 8(1):151-166, 2005.
-
(2005)
Inf. Retr
, vol.8
, Issue.1
, pp. 151-166
-
-
Anh, V.N.1
Moffat, A.2
-
2
-
-
85104914015
-
Efficient exact set-similarity joins
-
A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
-
(2006)
VLDB
, pp. 918-929
-
-
Arasu, A.1
Ganti, V.2
Kaushik, R.3
-
3
-
-
35348849154
-
Scaling up all pairs similarity search
-
R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In WWW, pages 131-140, 2007.
-
(2007)
, pp. 131-140
-
-
Bayardo, R.J.1
Ma, Y.2
Srikant, R.3
-
4
-
-
84945153497
-
Space-constrained gram-based indexing for efficient approximate string search (full version)
-
Technical report, Department of Computer Science, UC Irvine, June
-
A. Behm, S. Ji, C. Li, and J. Lu. Space-constrained gram-based indexing for efficient approximate string search (full version). Technical report, Department of Computer Science, UC Irvine, June 2008.
-
(2008)
-
-
Behm, A.1
Ji, S.2
Li, C.3
Lu, J.4
-
5
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In SIGMOD Conference, pages 313-324, 2003.
-
(2003)
SIGMOD Conference
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
6
-
-
33749597967
-
A primitive operator for similarity joins in data cleaning
-
S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, page 5, 2006.
-
(2006)
ICDE
, pp. 5
-
-
Chaudhuri, S.1
Ganti, V.2
Kaushik, R.3
-
7
-
-
0016486577
-
Universal codeword sets and representations of the integers
-
Mar
-
P. Elias. Universal codeword sets and representations of the integers. Information Theory, IEEE Transactions on, 21(2):194-203, Mar 1975.
-
(1975)
Information Theory, IEEE Transactions on
, vol.21
, Issue.2
, pp. 194-203
-
-
Elias, P.1
-
8
-
-
84976808645
-
Data structures and algorithms for disjoint set union problems
-
Z. Galil and G. F. Italiano. Data structures and algorithms for disjoint set union problems. ACM Comput. Surv., 23(3):319-344, 1991.
-
(1991)
ACM Comput. Surv
, vol.23
, Issue.3
, pp. 319-344
-
-
Galil, Z.1
Italiano, G.F.2
-
10
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
L. Gravano, P. G. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava. Approximate string joins in a database (almost) for free. In VLDB, pages 491-500, 2001.
-
(2001)
VLDB
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.G.2
Jagadish, H.V.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
11
-
-
52649145249
-
Fast indexes and algorithms for set similarity selection queries
-
M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast indexes and algorithms for set similarity selection queries. In ICDE, pages 267-276, 2008.
-
(2008)
ICDE
, pp. 267-276
-
-
Hadjieleftheriou, M.1
Chandel, A.2
Koudas, N.3
Srivastava, D.4
-
12
-
-
70349659026
-
Hashed samples: Selectivity estimators for set similarity selection queries
-
M. Hadjieleftheriou, X. Yu, N. Koudas, and D. Srivastava. Hashed samples: Selectivity estimators for set similarity selection queries. In VLDB, 2008.
-
(2008)
VLDB
-
-
Hadjieleftheriou, M.1
Yu, X.2
Koudas, N.3
Srivastava, D.4
-
13
-
-
18744405413
-
Indexing text data under space constraints
-
B. Hore, H. Hacigümüs, B. R. Iyer, and S. Mehrotra. Indexing text data under space constraints. In CIKM, pages 198-207, 2004.
-
(2004)
CIKM
, pp. 198-207
-
-
Hore, B.1
Hacigümüs, H.2
Iyer, B.R.3
Mehrotra, S.4
-
14
-
-
0001907042
-
Approximate nearest neighbors: Towards removing the curse of dimensionality
-
P. Indyk and R. Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In STOC Conference, 1998.
-
(1998)
STOC Conference
-
-
Indyk, P.1
Motwani, R.2
-
15
-
-
0032681554
-
Substring selectivity estimation
-
H. V. Jagadish, R. T. Ng, and D. Srivastava. Substring selectivity estimation. In PODS, pages 249-260, 1999.
-
(1999)
PODS
, pp. 249-260
-
-
Jagadish, H.V.1
Ng, R.T.2
Srivastava, D.3
-
16
-
-
33745607646
-
Selectivity estimation for fuzzy string predicates in large data sets
-
L. Jin and C. Li. Selectivity estimation for fuzzy string predicates in large data sets. In VLDB, pages 397-408, 2005.
-
(2005)
VLDB
, pp. 397-408
-
-
Jin, L.1
Li, C.2
-
17
-
-
33745621089
-
n-Gram/2L: A space and time efficient two-level n-gram inverted index structure
-
M.-S. Kim, K.-Y. Whang, J.-G. Lee, and M.-J. Lee. n-Gram/2L: A space and time efficient two-level n-gram inverted index structure. In VLDB, pages 325-336, 2005.
-
(2005)
VLDB
, pp. 325-336
-
-
Kim, M.-S.1
Whang, K.-Y.2
Lee, J.-G.3
Lee, M.-J.4
-
18
-
-
0030157287
-
Estimating alphanumeric selectivity in the presence of wildcards
-
P. Krishnan, J. S. Vitter, and B. R. Iyer. Estimating alphanumeric selectivity in the presence of wildcards. In SIGMOD Conference, pages 282-293, 1996.
-
(1996)
SIGMOD Conference
, pp. 282-293
-
-
Krishnan, P.1
Vitter, J.S.2
Iyer, B.R.3
-
19
-
-
85011072445
-
Extending q-grams to estimate selectivity of string matching with low edit distance
-
H. Lee, R. T. Ng, and K. Shim. Extending q-grams to estimate selectivity of string matching with low edit distance. In VLDB, pages 195-206, 2007.
-
(2007)
VLDB
, pp. 195-206
-
-
Lee, H.1
Ng, R.T.2
Shim, K.3
-
20
-
-
52649086729
-
Efficient merging and filtering algorithms for approximate string searches
-
C. Li, J. Lu, and Y. Lu. Efficient merging and filtering algorithms for approximate string searches. In ICDE, pages 257-266, 2008.
-
(2008)
ICDE
, pp. 257-266
-
-
Li, C.1
Lu, J.2
Lu, Y.3
-
21
-
-
85011032600
-
VGRAM: Improving performance of approximate queries on string collections using variable-length grams
-
C. Li, B. Wang, and X. Yang. VGRAM: Improving performance of approximate queries on string collections using variable-length grams. In VLDB, pages 303-314, 2007.
-
(2007)
VLDB
, pp. 303-314
-
-
Li, C.1
Wang, B.2
Yang, X.3
-
22
-
-
34547421874
-
Estimating the selectivity of approximate string queries
-
A. Mazeika, M. H. Böhlen, N. Koudas, and D. Srivastava. Estimating the selectivity of approximate string queries. ACM Trans. Database Syst., 32(2):12, 2007.
-
(2007)
ACM Trans. Database Syst
, vol.32
, Issue.2
, pp. 12
-
-
Mazeika, A.1
Böhlen, M.H.2
Koudas, N.3
Srivastava, D.4
-
24
-
-
0030260359
-
Self-indexing inverted files for fast text retrieval
-
A. Moffat and J. Zobel. Self-indexing inverted files for fast text retrieval. ACM Trans. Inf. Syst., 14(4):349-379, 1996.
-
(1996)
ACM Trans. Inf. Syst
, vol.14
, Issue.4
, pp. 349-379
-
-
Moffat, A.1
Zobel, J.2
-
25
-
-
0345566149
-
A guided tour to approximate string matching
-
G. Navarro. A guided tour to approximate string matching. ACM Comput. Surv., 33(1):31-88, 2001.
-
(2001)
ACM Comput. Surv
, vol.33
, Issue.1
, pp. 31-88
-
-
Navarro, G.1
-
26
-
-
0344065611
-
Distance based indexing for string proximity search
-
S. C. Sahinalp, M. Tasan, J. Macker, and Z. M. Özsoyoglu. Distance based indexing for string proximity search. In ICDE, pages 125-, 2003.
-
(2003)
ICDE
, pp. 125
-
-
Sahinalp, S.C.1
Tasan, M.2
Macker, J.3
Özsoyoglu, Z.M.4
-
27
-
-
3142777876
-
Efficient set joins on similarity predicates
-
S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In SIGMOD Conference, pages 743-754, 2004.
-
(2004)
SIGMOD Conference
, pp. 743-754
-
-
Sarawagi, S.1
Kirpal, A.2
-
28
-
-
57349141410
-
Efficient similarity joins for near duplicate detection
-
C. Xiao, W. Wang, X. Lin, and J. X. Yu. Efficient similarity joins for near duplicate detection. In WWW, pages 131-140, 2008.
-
(2008)
, pp. 131-140
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Yu, J.X.4
-
29
-
-
57149130672
-
Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
-
X. Yang, B. Wang, and C. Li. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently. In SIGMOD Conference, 2008.
-
(2008)
SIGMOD Conference
-
-
Yang, X.1
Wang, B.2
Li, C.3
-
30
-
-
0018019231
-
Compression of individual sequences via variablerate coding
-
J. Ziv and A. Lempel. Compression of individual sequences via variablerate coding. IEEE Transactions on Information Theory, 24(5):530-536,1978.
-
(1978)
IEEE Transactions on Information Theory
, vol.24
, Issue.5
, pp. 530-536
-
-
Ziv, J.1
Lempel, A.2
-
31
-
-
33747729581
-
Inverted files for text search engines
-
J. Zobel and A. Moffat. Inverted files for text search engines. ACM Comput. Surv., 38(2):6, 2006.
-
(2006)
ACM Comput. Surv
, vol.38
, Issue.2
, pp. 6
-
-
Zobel, J.1
Moffat, A.2
|