메뉴 건너뛰기




Volumn , Issue , 2009, Pages 604-615

Space-constrained gram-based indexing for efficient approximate string search

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE QUERY; APPROXIMATE STRING QUERY; COMPRESSION TECHNIQUES; DATA CLEANING; INDEXING STRUCTURES; NEW OPPORTUNITIES; PERFORMANCE PENALTIES; QUERY PERFORMANCE; QUERY RELAXATION; REAL DATA SETS; SPELL-CHECKING; STRING SEARCH; USER QUERY;

EID: 67649641448     PISSN: 10844627     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICDE.2009.32     Document Type: Conference Paper
Times cited : (65)

References (31)
  • 1
    • 22044441103 scopus 로고    scopus 로고
    • Inverted index compression using wordaligned binary codes
    • V. N. Anh and A. Moffat. Inverted index compression using wordaligned binary codes. Inf. Retr., 8(1):151-166, 2005.
    • (2005) Inf. Retr , vol.8 , Issue.1 , pp. 151-166
    • Anh, V.N.1    Moffat, A.2
  • 2
    • 85104914015 scopus 로고    scopus 로고
    • Efficient exact set-similarity joins
    • A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
    • (2006) VLDB , pp. 918-929
    • Arasu, A.1    Ganti, V.2    Kaushik, R.3
  • 3
    • 35348849154 scopus 로고    scopus 로고
    • Scaling up all pairs similarity search
    • R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In WWW, pages 131-140, 2007.
    • (2007) , pp. 131-140
    • Bayardo, R.J.1    Ma, Y.2    Srikant, R.3
  • 4
    • 84945153497 scopus 로고    scopus 로고
    • Space-constrained gram-based indexing for efficient approximate string search (full version)
    • Technical report, Department of Computer Science, UC Irvine, June
    • A. Behm, S. Ji, C. Li, and J. Lu. Space-constrained gram-based indexing for efficient approximate string search (full version). Technical report, Department of Computer Science, UC Irvine, June 2008.
    • (2008)
    • Behm, A.1    Ji, S.2    Li, C.3    Lu, J.4
  • 5
    • 1142279457 scopus 로고    scopus 로고
    • Robust and efficient fuzzy match for online data cleaning
    • S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In SIGMOD Conference, pages 313-324, 2003.
    • (2003) SIGMOD Conference , pp. 313-324
    • Chaudhuri, S.1    Ganjam, K.2    Ganti, V.3    Motwani, R.4
  • 6
    • 33749597967 scopus 로고    scopus 로고
    • A primitive operator for similarity joins in data cleaning
    • S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, page 5, 2006.
    • (2006) ICDE , pp. 5
    • Chaudhuri, S.1    Ganti, V.2    Kaushik, R.3
  • 7
    • 0016486577 scopus 로고
    • Universal codeword sets and representations of the integers
    • Mar
    • P. Elias. Universal codeword sets and representations of the integers. Information Theory, IEEE Transactions on, 21(2):194-203, Mar 1975.
    • (1975) Information Theory, IEEE Transactions on , vol.21 , Issue.2 , pp. 194-203
    • Elias, P.1
  • 8
    • 84976808645 scopus 로고
    • Data structures and algorithms for disjoint set union problems
    • Z. Galil and G. F. Italiano. Data structures and algorithms for disjoint set union problems. ACM Comput. Surv., 23(3):319-344, 1991.
    • (1991) ACM Comput. Surv , vol.23 , Issue.3 , pp. 319-344
    • Galil, Z.1    Italiano, G.F.2
  • 11
    • 52649145249 scopus 로고    scopus 로고
    • Fast indexes and algorithms for set similarity selection queries
    • M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast indexes and algorithms for set similarity selection queries. In ICDE, pages 267-276, 2008.
    • (2008) ICDE , pp. 267-276
    • Hadjieleftheriou, M.1    Chandel, A.2    Koudas, N.3    Srivastava, D.4
  • 12
    • 70349659026 scopus 로고    scopus 로고
    • Hashed samples: Selectivity estimators for set similarity selection queries
    • M. Hadjieleftheriou, X. Yu, N. Koudas, and D. Srivastava. Hashed samples: Selectivity estimators for set similarity selection queries. In VLDB, 2008.
    • (2008) VLDB
    • Hadjieleftheriou, M.1    Yu, X.2    Koudas, N.3    Srivastava, D.4
  • 13
    • 18744405413 scopus 로고    scopus 로고
    • Indexing text data under space constraints
    • B. Hore, H. Hacigümüs, B. R. Iyer, and S. Mehrotra. Indexing text data under space constraints. In CIKM, pages 198-207, 2004.
    • (2004) CIKM , pp. 198-207
    • Hore, B.1    Hacigümüs, H.2    Iyer, B.R.3    Mehrotra, S.4
  • 14
    • 0001907042 scopus 로고    scopus 로고
    • Approximate nearest neighbors: Towards removing the curse of dimensionality
    • P. Indyk and R. Motwani. Approximate nearest neighbors: Towards removing the curse of dimensionality. In STOC Conference, 1998.
    • (1998) STOC Conference
    • Indyk, P.1    Motwani, R.2
  • 15
    • 0032681554 scopus 로고    scopus 로고
    • Substring selectivity estimation
    • H. V. Jagadish, R. T. Ng, and D. Srivastava. Substring selectivity estimation. In PODS, pages 249-260, 1999.
    • (1999) PODS , pp. 249-260
    • Jagadish, H.V.1    Ng, R.T.2    Srivastava, D.3
  • 16
    • 33745607646 scopus 로고    scopus 로고
    • Selectivity estimation for fuzzy string predicates in large data sets
    • L. Jin and C. Li. Selectivity estimation for fuzzy string predicates in large data sets. In VLDB, pages 397-408, 2005.
    • (2005) VLDB , pp. 397-408
    • Jin, L.1    Li, C.2
  • 17
    • 33745621089 scopus 로고    scopus 로고
    • n-Gram/2L: A space and time efficient two-level n-gram inverted index structure
    • M.-S. Kim, K.-Y. Whang, J.-G. Lee, and M.-J. Lee. n-Gram/2L: A space and time efficient two-level n-gram inverted index structure. In VLDB, pages 325-336, 2005.
    • (2005) VLDB , pp. 325-336
    • Kim, M.-S.1    Whang, K.-Y.2    Lee, J.-G.3    Lee, M.-J.4
  • 18
    • 0030157287 scopus 로고    scopus 로고
    • Estimating alphanumeric selectivity in the presence of wildcards
    • P. Krishnan, J. S. Vitter, and B. R. Iyer. Estimating alphanumeric selectivity in the presence of wildcards. In SIGMOD Conference, pages 282-293, 1996.
    • (1996) SIGMOD Conference , pp. 282-293
    • Krishnan, P.1    Vitter, J.S.2    Iyer, B.R.3
  • 19
    • 85011072445 scopus 로고    scopus 로고
    • Extending q-grams to estimate selectivity of string matching with low edit distance
    • H. Lee, R. T. Ng, and K. Shim. Extending q-grams to estimate selectivity of string matching with low edit distance. In VLDB, pages 195-206, 2007.
    • (2007) VLDB , pp. 195-206
    • Lee, H.1    Ng, R.T.2    Shim, K.3
  • 20
    • 52649086729 scopus 로고    scopus 로고
    • Efficient merging and filtering algorithms for approximate string searches
    • C. Li, J. Lu, and Y. Lu. Efficient merging and filtering algorithms for approximate string searches. In ICDE, pages 257-266, 2008.
    • (2008) ICDE , pp. 257-266
    • Li, C.1    Lu, J.2    Lu, Y.3
  • 21
    • 85011032600 scopus 로고    scopus 로고
    • VGRAM: Improving performance of approximate queries on string collections using variable-length grams
    • C. Li, B. Wang, and X. Yang. VGRAM: Improving performance of approximate queries on string collections using variable-length grams. In VLDB, pages 303-314, 2007.
    • (2007) VLDB , pp. 303-314
    • Li, C.1    Wang, B.2    Yang, X.3
  • 24
    • 0030260359 scopus 로고    scopus 로고
    • Self-indexing inverted files for fast text retrieval
    • A. Moffat and J. Zobel. Self-indexing inverted files for fast text retrieval. ACM Trans. Inf. Syst., 14(4):349-379, 1996.
    • (1996) ACM Trans. Inf. Syst , vol.14 , Issue.4 , pp. 349-379
    • Moffat, A.1    Zobel, J.2
  • 25
    • 0345566149 scopus 로고    scopus 로고
    • A guided tour to approximate string matching
    • G. Navarro. A guided tour to approximate string matching. ACM Comput. Surv., 33(1):31-88, 2001.
    • (2001) ACM Comput. Surv , vol.33 , Issue.1 , pp. 31-88
    • Navarro, G.1
  • 26
    • 0344065611 scopus 로고    scopus 로고
    • Distance based indexing for string proximity search
    • S. C. Sahinalp, M. Tasan, J. Macker, and Z. M. Özsoyoglu. Distance based indexing for string proximity search. In ICDE, pages 125-, 2003.
    • (2003) ICDE , pp. 125
    • Sahinalp, S.C.1    Tasan, M.2    Macker, J.3    Özsoyoglu, Z.M.4
  • 27
    • 3142777876 scopus 로고    scopus 로고
    • Efficient set joins on similarity predicates
    • S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In SIGMOD Conference, pages 743-754, 2004.
    • (2004) SIGMOD Conference , pp. 743-754
    • Sarawagi, S.1    Kirpal, A.2
  • 28
    • 57349141410 scopus 로고    scopus 로고
    • Efficient similarity joins for near duplicate detection
    • C. Xiao, W. Wang, X. Lin, and J. X. Yu. Efficient similarity joins for near duplicate detection. In WWW, pages 131-140, 2008.
    • (2008) , pp. 131-140
    • Xiao, C.1    Wang, W.2    Lin, X.3    Yu, J.X.4
  • 29
    • 57149130672 scopus 로고    scopus 로고
    • Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
    • X. Yang, B. Wang, and C. Li. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently. In SIGMOD Conference, 2008.
    • (2008) SIGMOD Conference
    • Yang, X.1    Wang, B.2    Li, C.3
  • 30
    • 0018019231 scopus 로고
    • Compression of individual sequences via variablerate coding
    • J. Ziv and A. Lempel. Compression of individual sequences via variablerate coding. IEEE Transactions on Information Theory, 24(5):530-536,1978.
    • (1978) IEEE Transactions on Information Theory , vol.24 , Issue.5 , pp. 530-536
    • Ziv, J.1    Lempel, A.2
  • 31
    • 33747729581 scopus 로고    scopus 로고
    • Inverted files for text search engines
    • J. Zobel and A. Moffat. Inverted files for text search engines. ACM Comput. Surv., 38(2):6, 2006.
    • (2006) ACM Comput. Surv , vol.38 , Issue.2 , pp. 6
    • Zobel, J.1    Moffat, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.