메뉴 건너뛰기




Volumn , Issue , 2013, Pages 341-348

Efficient parallel partition-based algorithms for similarity search and join with edit distance constraints

Author keywords

content filter; parallel algorithms; similarity join; similarity search

Indexed keywords

CONTENT FILTER; MULTI-CORE PROCESSOR; PARTITION-BASED ALGORITHMS; PERFORMANCE REQUIREMENTS; PRUNING TECHNIQUES; REAL DATA SETS; SIMILARITY JOIN; SIMILARITY SEARCH;

EID: 84876803927     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/2457317.2457382     Document Type: Conference Paper
Times cited : (20)

References (35)
  • 1
    • 85104914015 scopus 로고    scopus 로고
    • Efficient exact set-similarity joins
    • A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
    • (2006) VLDB , pp. 918-929
    • Arasu, A.1    Ganti, V.2    Kaushik, R.3
  • 2
    • 35348849154 scopus 로고    scopus 로고
    • Scaling up all pairs similarity search
    • R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In WWW, pages 131-140, 2007.
    • (2007) WWW , pp. 131-140
    • Bayardo, R.J.1    Ma, Y.2    Srikant, R.3
  • 3
    • 67649641448 scopus 로고    scopus 로고
    • Space-constrained gram-based indexing for efficient approximate string search
    • A. Behm, S. Ji, C. Li, and J. Lu. Space-constrained gram-based indexing for efficient approximate string search. In ICDE, pages 604-615, 2009.
    • (2009) ICDE , pp. 604-615
    • Behm, A.1    Ji, S.2    Li, C.3    Lu, J.4
  • 4
    • 79957816183 scopus 로고    scopus 로고
    • Answering approximate string queries on large data sets using external memory
    • A. Behm, C. Li, and M. J. Carey. Answering approximate string queries on large data sets using external memory. In ICDE, pages 888-899, 2011.
    • (2011) ICDE , pp. 888-899
    • Behm, A.1    Li, C.2    Carey, M.J.3
  • 5
    • 1142279457 scopus 로고    scopus 로고
    • Robust and efficient fuzzy match for online data cleaning
    • S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In SIGMOD Conference, pages 313-324, 2003.
    • (2003) SIGMOD Conference , pp. 313-324
    • Chaudhuri, S.1    Ganjam, K.2    Ganti, V.3    Motwani, R.4
  • 6
    • 33749597967 scopus 로고    scopus 로고
    • A primitive operator for similarity joins in data cleaning
    • S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, pages 5-16, 2006.
    • (2006) ICDE , pp. 5-16
    • Chaudhuri, S.1    Ganti, V.2    Kaushik, R.3
  • 7
    • 84880363223 scopus 로고    scopus 로고
    • Top-k string similarity search with edit-distance constraints
    • D. Deng, G. Li, and J. Feng. Top-k string similarity search with edit-distance constraints. In ICDE, 2013.
    • (2013) ICDE
    • Deng, D.1    Li, G.2    Feng, J.3
  • 8
    • 84859722963 scopus 로고    scopus 로고
    • Efficient fuzzy type-ahead search in xml data
    • J. Feng and G. Li. Efficient fuzzy type-ahead search in xml data. IEEE Trans. Knowl. Data Eng., 24(5):882-895, 2012.
    • (2012) IEEE Trans. Knowl. Data Eng. , vol.24 , Issue.5 , pp. 882-895
    • Feng, J.1    Li, G.2
  • 9
    • 84864286297 scopus 로고    scopus 로고
    • Trie-join: A trie-based method for efficient string similarity joins
    • J. Feng, J. Wang, and G. Li. Trie-join: a trie-based method for efficient string similarity joins. VLDB J., 21(4):437-461, 2012.
    • (2012) VLDB J. , vol.21 , Issue.4 , pp. 437-461
    • Feng, J.1    Wang, J.2    Li, G.3
  • 11
    • 52649145249 scopus 로고    scopus 로고
    • Fast indexes and algorithms for set similarity selection queries
    • M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast indexes and algorithms for set similarity selection queries. In ICDE, pages 267-276, 2008.
    • (2008) ICDE , pp. 267-276
    • Hadjieleftheriou, M.1    Chandel, A.2    Koudas, N.3    Srivastava, D.4
  • 12
    • 70849096574 scopus 로고    scopus 로고
    • Incremental maintenance of length normalized indexes for approximate string matching
    • M. Hadjieleftheriou, N. Koudas, and D. Srivastava. Incremental maintenance of length normalized indexes for approximate string matching. In SIGMOD Conference, pages 429-440, 2009.
    • (2009) SIGMOD Conference , pp. 429-440
    • Hadjieleftheriou, M.1    Koudas, N.2    Srivastava, D.3
  • 13
    • 84865633750 scopus 로고    scopus 로고
    • Efficient interactive fuzzy keyword search
    • S. Ji, G. Li, C. Li, and J. Feng. Efficient interactive fuzzy keyword search. In WWW, pages 433-439, 2009.
    • (2009) WWW , pp. 433-439
    • Ji, S.1    Li, G.2    Li, C.3    Feng, J.4
  • 14
    • 52649086729 scopus 로고    scopus 로고
    • Efficient merging and filtering algorithms for approximate string searches
    • C. Li, J. Lu, and Y. Lu. Efficient merging and filtering algorithms for approximate string searches. In ICDE, pages 257-266, 2008.
    • (2008) ICDE , pp. 257-266
    • Li, C.1    Lu, J.2    Lu, Y.3
  • 15
    • 85011032600 scopus 로고    scopus 로고
    • Vgram: Improving performance of approximate queries on string collections using variable-length grams
    • C. Li, B. Wang, and X. Yang. Vgram: Improving performance of approximate queries on string collections using variable-length grams. In VLDB, pages 303-314, 2007.
    • (2007) VLDB , pp. 303-314
    • Li, C.1    Wang, B.2    Yang, X.3
  • 16
    • 84862671407 scopus 로고    scopus 로고
    • Pass-join: A partition-based method for similarity joins
    • G. Li, D. Deng, J. Wang, and J. Feng. Pass-join: A partition-based method for similarity joins. PVLDB, 5(3):253-264, 2011.
    • (2011) PVLDB , vol.5 , Issue.3 , pp. 253-264
    • Li, G.1    Deng, D.2    Wang, J.3    Feng, J.4
  • 17
    • 84871643308 scopus 로고    scopus 로고
    • Supporting search-as-you-type using sql in databases
    • G. Li, J. Feng, and C. Li. Supporting search-as-you-type using sql in databases. IEEE Trans. Knowl. Data Eng., 25(2):461-475, 2013.
    • (2013) IEEE Trans. Knowl. Data Eng. , vol.25 , Issue.2 , pp. 461-475
    • Li, G.1    Feng, J.2    Li, C.3
  • 18
    • 68349143394 scopus 로고    scopus 로고
    • Efficient type-ahead search on relational data: A tastier approach
    • G. Li, S. Ji, C. Li, and J. Feng. Efficient type-ahead search on relational data: a tastier approach. In SIGMOD Conference, pages 695-706, 2009.
    • (2009) SIGMOD Conference , pp. 695-706
    • Li, G.1    Ji, S.2    Li, C.3    Feng, J.4
  • 19
    • 79960467518 scopus 로고    scopus 로고
    • Efficient fuzzy full-text type-ahead search
    • G. Li, S. Ji, C. Li, and J. Feng. Efficient fuzzy full-text type-ahead search. VLDB J., 20(4):617-640, 2011.
    • (2011) VLDB J. , vol.20 , Issue.4 , pp. 617-640
    • Li, G.1    Ji, S.2    Li, C.3    Feng, J.4
  • 20
    • 77952746572 scopus 로고    scopus 로고
    • Efficient fuzzy type-ahead search in tastier
    • G. Li, S. Ji, C. Li, J. Wang, and J. Feng. Efficient fuzzy type-ahead search in tastier. In ICDE, pages 1105-1108, 2010.
    • (2010) ICDE , pp. 1105-1108
    • Li, G.1    Ji, S.2    Li, C.3    Wang, J.4    Feng, J.5
  • 21
    • 84866606636 scopus 로고    scopus 로고
    • Supporting efficient top-k queries in type-ahead search
    • G. Li, J. Wang, C. Li, and J. Feng. Supporting efficient top-k queries in type-ahead search. In SIGIR, pages 355-364, 2012.
    • (2012) SIGIR , pp. 355-364
    • Li, G.1    Wang, J.2    Li, C.3    Feng, J.4
  • 22
    • 84863758126 scopus 로고    scopus 로고
    • V-smart-join: A scalable mapreduce framework for all-pair similarity joins of multisets and vectors
    • A. Metwally and C. Faloutsos. V-smart-join: A scalable mapreduce framework for all-pair similarity joins of multisets and vectors. PVLDB, 5(8):704-715, 2012.
    • (2012) PVLDB , vol.5 , Issue.8 , pp. 704-715
    • Metwally, A.1    Faloutsos, C.2
  • 23
    • 79960001806 scopus 로고    scopus 로고
    • Efficient exact edit similarity query processing with the asymmetric signature scheme
    • J. Qin, W. Wang, Y. Lu, C. Xiao, and X. Lin. Efficient exact edit similarity query processing with the asymmetric signature scheme. In SIGMOD Conference, pages 1033-1044, 2011.
    • (2011) SIGMOD Conference , pp. 1033-1044
    • Qin, J.1    Wang, W.2    Lu, Y.3    Xiao, C.4    Lin, X.5
  • 24
    • 3142777876 scopus 로고    scopus 로고
    • Efficient set joins on similarity predicates
    • S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In SIGMOD Conference, pages 743-754, 2004.
    • (2004) SIGMOD Conference , pp. 743-754
    • Sarawagi, S.1    Kirpal, A.2
  • 25
    • 72949094783 scopus 로고    scopus 로고
    • Technical Report ifi-2007.02, Department of Informatics, April
    • B. S. T. Bocek, E. Hunt. Fast Similarity Search in Large Dictionaries. Technical Report ifi-2007.02, Department of Informatics, University of Zurich, April 2007. http://fastss.csg.uzh.ch/.
    • (2007) Fast Similarity Search in Large Dictionaries
    • Bocek, B.S.T.1    Hunt, E.2
  • 26
    • 77954744650 scopus 로고    scopus 로고
    • Efficient parallel set-similarity joins using mapreduce
    • R. Vernica, M. J. Carey, and C. Li. Efficient parallel set-similarity joins using mapreduce. In SIGMOD, pages 495-506, 2010.
    • (2010) SIGMOD , pp. 495-506
    • Vernica, R.1    Carey, M.J.2    Li, C.3
  • 27
    • 79957822983 scopus 로고    scopus 로고
    • Trie-join: Efficient trie-based string similarity joins with edit-distance constraints
    • J. Wang, G. Li, and J. Feng. Trie-join: Efficient trie-based string similarity joins with edit-distance constraints. PVLDB, 3(1):1219-1230, 2010.
    • (2010) PVLDB , vol.3 , Issue.1 , pp. 1219-1230
    • Wang, J.1    Li, G.2    Feng, J.3
  • 28
    • 79957824788 scopus 로고    scopus 로고
    • Fast-join: An efficient method for fuzzy token matching based string similarity join
    • J. Wang, G. Li, and J. Feng. Fast-join: An efficient method for fuzzy token matching based string similarity join. In ICDE, pages 458-469, 2011.
    • (2011) ICDE , pp. 458-469
    • Wang, J.1    Li, G.2    Feng, J.3
  • 29
    • 84862702293 scopus 로고    scopus 로고
    • Can we beat the prefix filtering?: An adaptive framework for similarity join and search
    • J. Wang, G. Li, and J. Feng. Can we beat the prefix filtering?: an adaptive framework for similarity join and search. In SIGMOD Conference, pages 85-96, 2012.
    • (2012) SIGMOD Conference , pp. 85-96
    • Wang, J.1    Li, G.2    Feng, J.3
  • 30
    • 70849115286 scopus 로고    scopus 로고
    • Efficient approximate entity extraction with edit distance constraints
    • W. Wang, C. Xiao, X. Lin, and C. Zhang. Efficient approximate entity extraction with edit distance constraints. In SIGMOD Conference, pages 759-770, 2009.
    • (2009) SIGMOD Conference , pp. 759-770
    • Wang, W.1    Xiao, C.2    Lin, X.3    Zhang, C.4
  • 31
    • 70849105253 scopus 로고    scopus 로고
    • Ed-join: An efficient algorithm for similarity joins with edit distance constraints
    • C. Xiao, W. Wang, and X. Lin. Ed-join: an efficient algorithm for similarity joins with edit distance constraints. PVLDB, 1(1):933-944, 2008.
    • (2008) PVLDB , vol.1 , Issue.1 , pp. 933-944
    • Xiao, C.1    Wang, W.2    Lin, X.3
  • 32
    • 67649653766 scopus 로고    scopus 로고
    • Top-k set similarity joins
    • C. Xiao, W. Wang, X. Lin, and H. Shang. Top-k set similarity joins. In ICDE, pages 916-927, 2009.
    • (2009) ICDE , pp. 916-927
    • Xiao, C.1    Wang, W.2    Lin, X.3    Shang, H.4
  • 33
    • 57349141410 scopus 로고    scopus 로고
    • Efficient similarity joins for near duplicate detection
    • C. Xiao, W. Wang, X. Lin, and J. X. Yu. Efficient similarity joins for near duplicate detection. In WWW, pages 131-140, 2008.
    • (2008) WWW , pp. 131-140
    • Xiao, C.1    Wang, W.2    Lin, X.3    Yu, J.X.4
  • 34
    • 57149130672 scopus 로고    scopus 로고
    • Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
    • X. Yang, B. Wang, and C. Li. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently. In SIGMOD Conference, pages 353-364, 2008.
    • (2008) SIGMOD Conference , pp. 353-364
    • Yang, X.1    Wang, B.2    Li, C.3
  • 35
    • 77954747181 scopus 로고    scopus 로고
    • Bed-tree: An all-purpose index structure for string similarity search based on edit distance
    • Z. Zhang, M. Hadjieleftheriou, B. C. Ooi, and D. Srivastava. Bed-tree: an all-purpose index structure for string similarity search based on edit distance. In SIGMOD, pages 915-926, 2010.
    • (2010) SIGMOD , pp. 915-926
    • Zhang, Z.1    Hadjieleftheriou, M.2    Ooi, B.C.3    Srivastava, D.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.