SCOPUS information search platform

ACM International Conference Proceeding Series

Volume , Issue , 2013, Pages 341-348

Efficient parallel partition-based algorithms for similarity search and join with edit distance constraints

(5) Jiang, Yu (a); Deng, Dong (a); Wang, Jiannan (a); Li, Guoliang (a); Feng, Jianhua (a)

(a) Tsinghua University (China)

Author keywords

content filter; parallel algorithms; similarity join; similarity search

Indexed keywords

CONTENT FILTER; MULTI-CORE PROCESSOR; PARTITION-BASED ALGORITHMS; PERFORMANCE REQUIREMENTS; PRUNING TECHNIQUES; REAL DATA SETS; SIMILARITY JOIN; SIMILARITY SEARCH; PARALLEL ALGORITHMS; PARALLEL ARCHITECTURES; PARALLEL PROCESSING SYSTEMS

EID: 84876803927
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: 10.1145/2457317.2457382
Document Type: Conference Paper

Times cited: 20

References (35)

1
- 85104914015
- Efficient exact set-similarity joins
- A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
- (2006) VLDB , pp. 918-929
- Arasu, A.¹ Ganti, V.² Kaushik, R.³

2
- 35348849154
- Scaling up all pairs similarity search
- R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In WWW, pages 131-140, 2007.
- (2007) WWW , pp. 131-140
- Bayardo, R.J.¹ Ma, Y.² Srikant, R.³

3
- 67649641448
- Space-constrained gram-based indexing for efficient approximate string search
- A. Behm, S. Ji, C. Li, and J. Lu. Space-constrained gram-based indexing for efficient approximate string search. In ICDE, pages 604-615, 2009.
- (2009) ICDE , pp. 604-615
- Behm, A.¹ Ji, S.² Li, C.³ Lu, J.⁴

4
- 79957816183
- Answering approximate string queries on large data sets using external memory
- A. Behm, C. Li, and M. J. Carey. Answering approximate string queries on large data sets using external memory. In ICDE, pages 888-899, 2011.
- (2011) ICDE , pp. 888-899
- Behm, A.¹ Li, C.² Carey, M.J.³

5
- 1142279457
- Robust and efficient fuzzy match for online data cleaning
- S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In SIGMOD Conference, pages 313-324, 2003.
- (2003) SIGMOD Conference , pp. 313-324
- Chaudhuri, S.¹ Ganjam, K.² Ganti, V.³ Motwani, R.⁴

6
- 33749597967
- A primitive operator for similarity joins in data cleaning
- S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, pages 5-16, 2006.
- (2006) ICDE , pp. 5-16
- Chaudhuri, S.¹ Ganti, V.² Kaushik, R.³

7
- 84880363223
- Top-k string similarity search with edit-distance constraints
- D. Deng, G. Li, and J. Feng. Top-k string similarity search with edit-distance constraints. In ICDE, 2013.
- (2013) ICDE
- Deng, D.¹ Li, G.² Feng, J.³

8
- 84859722963
- Efficient fuzzy type-ahead search in XML data
- J. Feng and G. Li. Efficient fuzzy type-ahead search in XML data. IEEE Trans. Knowl. Data Eng., 24(5):882-895, 2012.
- (2012) IEEE Trans. Knowl. Data Eng. , vol.24 , Issue.5 , pp. 882-895
- Feng, J.¹ Li, G.²

9
- 84864286297
- Trie-join: A trie-based method for efficient string similarity joins
- J. Feng, J. Wang, and G. Li. Trie-join: a trie-based method for efficient string similarity joins. VLDB J., 21(4):437-461, 2012.
- (2012) VLDB J. , vol.21 , Issue.4 , pp. 437-461
- Feng, J.¹ Wang, J.² Li, G.³

10
- 84944318804
- Approximate string joins in a database (almost) for free
- L. Gravano, P. G. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava. Approximate string joins in a database (almost) for free. In VLDB, pages 491-500, 2001.
- (2001) VLDB , pp. 491-500
- Gravano, L.¹ Ipeirotis, P.G.² Jagadish, H.V.³ Koudas, N.⁴ Muthukrishnan, S.⁵ Srivastava, D.⁶

11
- 52649145249
- Fast indexes and algorithms for set similarity selection queries
- M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast indexes and algorithms for set similarity selection queries. In ICDE, pages 267-276, 2008.
- (2008) ICDE , pp. 267-276
- Hadjieleftheriou, M.¹ Chandel, A.² Koudas, N.³ Srivastava, D.⁴

12
- 70849096574
- Incremental maintenance of length normalized indexes for approximate string matching
- M. Hadjieleftheriou, N. Koudas, and D. Srivastava. Incremental maintenance of length normalized indexes for approximate string matching. In SIGMOD Conference, pages 429-440, 2009.
- (2009) SIGMOD Conference , pp. 429-440
- Hadjieleftheriou, M.¹ Koudas, N.² Srivastava, D.³

13
- 84865633750
- Efficient interactive fuzzy keyword search
- S. Ji, G. Li, C. Li, and J. Feng. Efficient interactive fuzzy keyword search. In WWW, pages 433-439, 2009.
- (2009) WWW , pp. 433-439
- Ji, S.¹ Li, G.² Li, C.³ Feng, J.⁴

14
- 52649086729
- Efficient merging and filtering algorithms for approximate string searches
- C. Li, J. Lu, and Y. Lu. Efficient merging and filtering algorithms for approximate string searches. In ICDE, pages 257-266, 2008.
- (2008) ICDE , pp. 257-266
- Li, C.¹ Lu, J.² Lu, Y.³

15
- 85011032600
- Vgram: Improving performance of approximate queries on string collections using variable-length grams
- C. Li, B. Wang, and X. Yang. Vgram: Improving performance of approximate queries on string collections using variable-length grams. In VLDB, pages 303-314, 2007.
- (2007) VLDB , pp. 303-314
- Li, C.¹ Wang, B.² Yang, X.³

16
- 84862671407
- Pass-join: A partition-based method for similarity joins
- G. Li, D. Deng, J. Wang, and J. Feng. Pass-join: A partition-based method for similarity joins. PVLDB, 5(3):253-264, 2011.
- (2011) PVLDB , vol.5 , Issue.3 , pp. 253-264
- Li, G.¹ Deng, D.² Wang, J.³ Feng, J.⁴

17
- 84871643308
- Supporting search-as-you-type using SQL in databases
- G. Li, J. Feng, and C. Li. Supporting search-as-you-type using SQL in databases. IEEE Trans. Knowl. Data Eng., 25(2):461-475, 2013.
- (2013) IEEE Trans. Knowl. Data Eng. , vol.25 , Issue.2 , pp. 461-475
- Li, G.¹ Feng, J.² Li, C.³

18
- 68349143394
- Efficient type-ahead search on relational data: A TASTIER approach
- G. Li, S. Ji, C. Li, and J. Feng. Efficient type-ahead search on relational data: a TASTIER approach. In SIGMOD Conference, pages 695-706, 2009.
- (2009) SIGMOD Conference , pp. 695-706
- Li, G.¹ Ji, S.² Li, C.³ Feng, J.⁴

19
- 79960467518
- Efficient fuzzy full-text type-ahead search
- G. Li, S. Ji, C. Li, and J. Feng. Efficient fuzzy full-text type-ahead search. VLDB J., 20(4):617-640, 2011.
- (2011) VLDB J. , vol.20 , Issue.4 , pp. 617-640
- Li, G.¹ Ji, S.² Li, C.³ Feng, J.⁴

20
- 77952746572
- Efficient fuzzy type-ahead search in TASTIER
- G. Li, S. Ji, C. Li, J. Wang, and J. Feng. Efficient fuzzy type-ahead search in TASTIER. In ICDE, pages 1105-1108, 2010.
- (2010) ICDE , pp. 1105-1108
- Li, G.¹ Ji, S.² Li, C.³ Wang, J.⁴ Feng, J.⁵

21
- 84866606636
- Supporting efficient top-k queries in type-ahead search
- G. Li, J. Wang, C. Li, and J. Feng. Supporting efficient top-k queries in type-ahead search. In SIGIR, pages 355-364, 2012.
- (2012) SIGIR , pp. 355-364
- Li, G.¹ Wang, J.² Li, C.³ Feng, J.⁴

22
- 84863758126
- V-smart-join: A scalable MapReduce framework for all-pair similarity joins of multisets and vectors
- A. Metwally and C. Faloutsos. V-smart-join: A scalable MapReduce framework for all-pair similarity joins of multisets and vectors. PVLDB, 5(8):704-715, 2012.
- (2012) PVLDB , vol.5 , Issue.8 , pp. 704-715
- Metwally, A.¹ Faloutsos, C.²

23
- 79960001806
- Efficient exact edit similarity query processing with the asymmetric signature scheme
- J. Qin, W. Wang, Y. Lu, C. Xiao, and X. Lin. Efficient exact edit similarity query processing with the asymmetric signature scheme. In SIGMOD Conference, pages 1033-1044, 2011.
- (2011) SIGMOD Conference , pp. 1033-1044
- Qin, J.¹ Wang, W.² Lu, Y.³ Xiao, C.⁴ Lin, X.⁵

24
- 3142777876
- Efficient set joins on similarity predicates
- S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In SIGMOD Conference, pages 743-754, 2004.
- (2004) SIGMOD Conference , pp. 743-754
- Sarawagi, S.¹ Kirpal, A.²

25
- 72949094783
- Fast similarity search in large dictionaries
- B. S. T. Bocek, E. Hunt. Fast Similarity Search in Large Dictionaries. Technical Report ifi-2007.02, Department of Informatics, University of Zurich, April 2007. http://fastss.csg.uzh.ch/.
- (2007) Technical Report ifi-2007.02, Department of Informatics, University of Zurich
- Bocek, B.S.T.¹ Hunt, E.²

26
- 77954744650
- Efficient parallel set-similarity joins using MapReduce
- R. Vernica, M. J. Carey, and C. Li. Efficient parallel set-similarity joins using MapReduce. In SIGMOD, pages 495-506, 2010.
- (2010) SIGMOD , pp. 495-506
- Vernica, R.¹ Carey, M.J.² Li, C.³

27
- 79957822983
- Trie-join: Efficient trie-based string similarity joins with edit-distance constraints
- J. Wang, G. Li, and J. Feng. Trie-join: Efficient trie-based string similarity joins with edit-distance constraints. PVLDB, 3(1):1219-1230, 2010.
- (2010) PVLDB , vol.3 , Issue.1 , pp. 1219-1230
- Wang, J.¹ Li, G.² Feng, J.³

28
- 79957824788
- Fast-join: An efficient method for fuzzy token matching based string similarity join
- J. Wang, G. Li, and J. Feng. Fast-join: An efficient method for fuzzy token matching based string similarity join. In ICDE, pages 458-469, 2011.
- (2011) ICDE , pp. 458-469
- Wang, J.¹ Li, G.² Feng, J.³

29
- 84862702293
- Can we beat the prefix filtering?: An adaptive framework for similarity join and search
- J. Wang, G. Li, and J. Feng. Can we beat the prefix filtering?: an adaptive framework for similarity join and search. In SIGMOD Conference, pages 85-96, 2012.
- (2012) SIGMOD Conference , pp. 85-96
- Wang, J.¹ Li, G.² Feng, J.³

30
- 70849115286
- Efficient approximate entity extraction with edit distance constraints
- W. Wang, C. Xiao, X. Lin, and C. Zhang. Efficient approximate entity extraction with edit distance constraints. In SIGMOD Conference, pages 759-770, 2009.
- (2009) SIGMOD Conference , pp. 759-770
- Wang, W.¹ Xiao, C.² Lin, X.³ Zhang, C.⁴

31
- 70849105253
- Ed-join: An efficient algorithm for similarity joins with edit distance constraints
- C. Xiao, W. Wang, and X. Lin. Ed-join: an efficient algorithm for similarity joins with edit distance constraints. PVLDB, 1(1):933-944, 2008.
- (2008) PVLDB , vol.1 , Issue.1 , pp. 933-944
- Xiao, C.¹ Wang, W.² Lin, X.³

32
- 67649653766
- Top-k set similarity joins
- C. Xiao, W. Wang, X. Lin, and H. Shang. Top-k set similarity joins. In ICDE, pages 916-927, 2009.
- (2009) ICDE , pp. 916-927
- Xiao, C.¹ Wang, W.² Lin, X.³ Shang, H.⁴

33
- 57349141410
- Efficient similarity joins for near duplicate detection
- C. Xiao, W. Wang, X. Lin, and J. X. Yu. Efficient similarity joins for near duplicate detection. In WWW, pages 131-140, 2008.
- (2008) WWW , pp. 131-140
- Xiao, C.¹ Wang, W.² Lin, X.³ Yu, J.X.⁴

34
- 57149130672
- Cost-based variable-length-gram selection for string collections to support approximate queries efficiently
- X. Yang, B. Wang, and C. Li. Cost-based variable-length-gram selection for string collections to support approximate queries efficiently. In SIGMOD Conference, pages 353-364, 2008.
- (2008) SIGMOD Conference , pp. 353-364
- Yang, X.¹ Wang, B.² Li, C.³

35
- 77954747181
- Bed-tree: An all-purpose index structure for string similarity search based on edit distance
- Z. Zhang, M. Hadjieleftheriou, B. C. Ooi, and D. Srivastava. Bed-tree: an all-purpose index structure for string similarity search based on edit distance. In SIGMOD, pages 915-926, 2010.
- (2010) SIGMOD , pp. 915-926
- Zhang, Z.¹ Hadjieleftheriou, M.² Ooi, B.C.³ Srivastava, D.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.