SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Proceedings of the ACM SIGMOD International Conference on Management of Data

Volumn , Issue , 2012, Pages 85-96

Can we beat the prefix filtering? An adaptive framework for similarity join and search

(3) Wang, Jiannan a Li, Guoliang a Feng, Jianhua a

a TSINGHUA UNIVERSITY (China)

Author keywords

adaptive framework; cost model; prefix filtering; similarity join; similarity search

Indexed keywords

ADAPTIVE FRAMEWORK; COST MODELS; DATA CLEANING; SIMILARITY JOIN; SIMILARITY SEARCH;

DATABASE SYSTEMS;

EID: 84862702293 PISSN: 07308078 EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/2213836.2213847 Document Type: Conference Paper

Times cited : (213)

References (29)

1
- 85104914015
- Efficient exact set-similarity joins
- A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
- (2006) VLDB , pp. 918-929
- Arasu, A.¹ Ganti, V.² Kaushik, R.³

2
- 35348849154
- Scaling up all pairs similarity search
- R. J. Bayardo, Y. Ma, and R. Srikant. Scaling up all pairs similarity search. In WWW, pages 131-140, 2007.
- (2007) WWW , pp. 131-140
- Bayardo, R.J.¹ Ma, Y.² Srikant, R.³

3
- 1142279457
- Robust and efficient fuzzy match for online data cleaning
- S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In SIGMOD Conference, pages 313-324, 2003.
- (2003) SIGMOD Conference , pp. 313-324
- Chaudhuri, S.¹ Ganjam, K.² Ganti, V.³ Motwani, R.⁴

4
- 33749597967
- A primitive operator for similarity joins in data cleaning
- S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, pages 5-16, 2006.
- (2006) ICDE , pp. 5-16
- Chaudhuri, S.¹ Ganti, V.² Kaushik, R.³

5
- 84944318804
- Approximate string joins in a database (almost) for free
- L. Gravano, P. G. Ipeirotis, H. V. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava. Approximate string joins in a database (almost) for free. In VLDB, pages 491-500, 2001.
- (2001) VLDB , pp. 491-500
- Gravano, L.¹ Ipeirotis, P.G.² Jagadish, H.V.³ Koudas, N.⁴ Muthukrishnan, S.⁵ Srivastava, D.⁶

6
- 52649145249
- Fast indexes and algorithms for set similarity selection queries
- M. Hadjieleftheriou, A. Chandel, N. Koudas, and D. Srivastava. Fast indexes and algorithms for set similarity selection queries. In ICDE, pages 267-276, 2008.
- (2008) ICDE , pp. 267-276
- Hadjieleftheriou, M.¹ Chandel, A.² Koudas, N.³ Srivastava, D.⁴

7
- 70349659026
- Hashed samples: Selectivity estimators for set similarity selection queries
- M. Hadjieleftheriou, X. Yu, N. Koudas, and D. Srivastava. Hashed samples: selectivity estimators for set similarity selection queries. PVLDB, 1(1):201-212, 2008.
- (2008) PVLDB , vol.1 , Issue.1 , pp. 201-212
- Hadjieleftheriou, M.¹ Yu, X.² Koudas, N.³ Srivastava, D.⁴

8
- 46649104057
- Metric space similarity joins
- E. H. Jacox and H. Samet. Metric space similarity joins. ACM Trans. Database Syst., 33(2), 2008.
- (2008) ACM Trans. Database Syst. , vol.33 , Issue.2
- Jacox, E.H.¹ Samet, H.²

9
- 33745607646
- Selectivity estimation for fuzzy string predicates in large data sets
- L. Jin and C. Li. Selectivity estimation for fuzzy string predicates in large data sets. In VLDB, pages 397-408, 2005.
- (2005) VLDB , pp. 397-408
- Jin, L.¹ Li, C.²

10
- 33745621089
- n-gram/2l: A space and time efficient two-level n-gram inverted index structure
- M.-S. Kim, K.-Y. Whang, J.-G. Lee, and M.-J. Lee. n-gram/2l: A space and time efficient two-level n-gram inverted index structure. In VLDB, pages 325-336, 2005.
- (2005) VLDB , pp. 325-336
- Kim, M.-S.¹ Whang, K.-Y.² Lee, J.-G.³ Lee, M.-J.⁴

11
- 77957718350
- Power-law based estimation of set similarity join size
- H. Lee, R. T. Ng, and K. Shim. Power-law based estimation of set similarity join size. PVLDB, 2(1):658-669, 2009.
- (2009) PVLDB , vol.2 , Issue.1 , pp. 658-669
- Lee, H.¹ Ng, R.T.² Shim, K.³

12
- 81055146159
- Similarity join size estimation using locality sensitive hashing
- H. Lee, R. T. Ng, and K. Shim. Similarity join size estimation using locality sensitive hashing. PVLDB, 4(6):338-349, 2011.
- (2011) PVLDB , vol.4 , Issue.6 , pp. 338-349
- Lee, H.¹ Ng, R.T.² Shim, K.³

13
- 52649086729
- Efficient merging and filtering algorithms for approximate string searches
- C. Li, J. Lu, and Y. Lu. Efficient merging and filtering algorithms for approximate string searches. In ICDE, 2008.
- (2008) ICDE
- Li, C.¹ Lu, J.² Lu, Y.³

14
- 85011032600
- Vgram: Improving performance of approximate queries on string collections using variable-length grams
- C. Li, B. Wang, and X. Yang. Vgram: Improving performance of approximate queries on string collections using variable-length grams. In VLDB, pages 303-314, 2007.
- (2007) VLDB , pp. 303-314
- Li, C.¹ Wang, B.² Yang, X.³

15
- 79959922359
- Faerie: Efficient filtering algorithms for approximate dictionary-based entity extraction
- G. Li, D. Deng, and J. Feng. Faerie: efficient filtering algorithms for approximate dictionary-based entity extraction. In SIGMOD Conference, pages 529-540, 2011.
- (2011) SIGMOD Conference , pp. 529-540
- Li, G.¹ Deng, D.² Feng, J.³

16
- 84862671407
- Pass-join: A partition-based method for similarity joins
- G. Li, D. Deng, J. Wang, and J. Feng. Pass-join: A partition-based method for similarity joins. PVLDB, 5(3):253-264, 2011.
- (2011) PVLDB , vol.5 , Issue.3 , pp. 253-264
- Li, G.¹ Deng, D.² Wang, J.³ Feng, J.⁴

17
- 34547421874
- Estimating the selectivity of approximate string queries
- A. Mazeika, M. H. Böhlen, N. Koudas, and D. Srivastava. Estimating the selectivity of approximate string queries. ACM Trans. Database Syst., 32(2):12, 2007.
- (2007) ACM Trans. Database Syst. , vol.32 , Issue.2 , pp. 12
- Mazeika, A.¹ Böhlen, M.H.² Koudas, N.³ Srivastava, D.⁴

18
- 0345566149
- A guided tour to approximate string matching
- G. Navarro. A guided tour to approximate string matching. ACM Comput. Surv., 33(1):31-88, 2001.
- (2001) ACM Comput. Surv. , vol.33 , Issue.1 , pp. 31-88
- Navarro, G.¹

19
- 79960001806
- Efficient exact edit similarity query processing with the asymmetric signature scheme
- J. Qin, W. Wang, Y. Lu, C. Xiao, and X. Lin. Efficient exact edit similarity query processing with the asymmetric signature scheme. In SIGMOD Conference, pages 1033-1044, 2011.
- (2011) SIGMOD Conference , pp. 1033-1044
- Qin, J.¹ Wang, W.² Lu, Y.³ Xiao, C.⁴ Lin, X.⁵

20
- 3142777876
- Efficient set joins on similarity predicates
- S. Sarawagi and A. Kirpal. Efficient set joins on similarity predicates. In SIGMOD Conference, pages 743-754, 2004.
- (2004) SIGMOD Conference , pp. 743-754
- Sarawagi, S.¹ Kirpal, A.²

21
- 77952772124
- The similarity join database operator
- Y. N. Silva, W. G. Aref, and M. H. Ali. The similarity join database operator. In ICDE, pages 892-903, 2010.
- (2010) ICDE , pp. 892-903
- Silva, Y.N.¹ Aref, W.G.² Ali, M.H.³

22
- 77954744650
- Efficient parallel set-similarity joins using mapreduce
- R. Vernica, M. J. Carey, and C. Li. Efficient parallel set-similarity joins using mapreduce. In SIGMOD Conference, pages 495-506, 2010.
- (2010) SIGMOD Conference , pp. 495-506
- Vernica, R.¹ Carey, M.J.² Li, C.³

23
- 79957822983
- Trie-join: Efficient trie-based string similarity joins with edit-distance constraints
- J. Wang, G. Li, and J. Feng. Trie-join: Efficient trie-based string similarity joins with edit-distance constraints. PVLDB, 3(1):1219-1230, 2010.
- (2010) PVLDB , vol.3 , Issue.1 , pp. 1219-1230
- Wang, J.¹ Li, G.² Feng, J.³

24
- 79957824788
- Fast-join: An efficient method for fuzzy token matching based string similarity join
- J. Wang, G. Li, and J. Feng. Fast-join: An efficient method for fuzzy token matching based string similarity join. In ICDE, pages 458-469, 2011.
- (2011) ICDE , pp. 458-469
- Wang, J.¹ Li, G.² Feng, J.³

25
- 70849105253
- Ed-join: An efficient algorithm for similarity joins with edit distance constraints
- C. Xiao, W. Wang, and X. Lin. Ed-join: an efficient algorithm for similarity joins with edit distance constraints. PVLDB, 1(1):933-944, 2008.
- (2008) PVLDB , vol.1 , Issue.1 , pp. 933-944
- Xiao, C.¹ Wang, W.² Lin, X.³

26
- 67649653766
- Top-k set similarity joins
- C. Xiao, W. Wang, X. Lin, and H. Shang. Top-k set similarity joins. In ICDE, pages 916-927, 2009.
- (2009) ICDE , pp. 916-927
- Xiao, C.¹ Wang, W.² Lin, X.³ Shang, H.⁴

27
- 66249113620
- Efficient similarity joins for near duplicate detection
- C. Xiao, W. Wang, X. Lin, and J. X. Yu. Efficient similarity joins for near duplicate detection. In WWW, 2008.
- (2008) WWW
- Xiao, C.¹ Wang, W.² Lin, X.³ Yu, J.X.⁴

28
- 79960002413
- Atlas: A probabilistic algorithm for high dimensional similarity search
- J. Zhai, Y. Lou, and J. Gehrke. Atlas: a probabilistic algorithm for high dimensional similarity search. In SIGMOD Conference, pages 997-1008, 2011.
- (2011) SIGMOD Conference , pp. 997-1008
- Zhai, J.¹ Lou, Y.² Gehrke, J.³

29
- 77954747181
- Bed-tree: An all-purpose index structure for string similarity search based on edit distance
- Z. Zhang, M. Hadjieleftheriou, B. C. Ooi, and D. Srivastava. Bed-tree: an all-purpose index structure for string similarity search based on edit distance. In SIGMOD, 2010.
- (2010) SIGMOD
- Zhang, Z.¹ Hadjieleftheriou, M.² Ooi, B.C.³ Srivastava, D.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.