-
1
-
-
33646699352
-
Automated ranking of database query results
-
AGRAWAL, S., CHAUDHURI, S., DAS, G., AND GIONIS, A. 2003. Automated ranking of database query results. In Proceedings of the Conference on Innovative Data Systems Research (CIDR'03).
-
(2003)
Proceedings of the Conference on Innovative Data Systems Research (CIDR'03)
-
-
Agrawal, S.1
Chaudhuri, S.2
Das, G.3
Gionis, A.4
-
6
-
-
67649641448
-
Space-Constrained gram-based indexing for efficient approximate string search
-
BEHM, A., JI, S., LI, C., AND LU, J. 2009. Space-Constrained gram-based indexing for efficient approximate string search. In Proceedings of the International Conference on Data Engineering (ICDE'09). 604-615.
-
(2009)
Proceedings of the International Conference on Data Engineering (ICDE'09)
, pp. 604-615
-
-
Behm, A.1
Ji, S.2
Li, C.3
Lu, J.4
-
7
-
-
2342447399
-
Adaptive name matching in information integration
-
BILENKO, M., MOONEY, R. J., COHEN, W. W., RAVIKUMAR, P., AND FIENBERG, S. E. 2003. Adaptive name matching in information integration. IEEE Intell. Syst. 18, 5, 16-23.
-
(2003)
IEEE Intell. Syst.
, vol.18
, Issue.5
, pp. 16-23
-
-
Bilenko, M.1
Mooney, R.J.2
Cohen, W.W.3
Ravikumar, P.4
Fienberg, S.E.5
-
10
-
-
0010362121
-
Syntactic clustering of the web
-
BRODER, A. Z., GLASSMAN, S. C., MANASSE, M. S., AND ZWEIG, G. 1997. Syntactic clustering of the web. Comput. Netw. 29, 8-13, 1157-1166.
-
(1997)
Comput. Netw.
, vol.29
, Issue.8-13
, pp. 1157-1166
-
-
Broder, A.Z.1
Glassman, S.C.2
Manasse, M.S.3
Zweig, G.4
-
14
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
CHAUDHURI, S., GANJAM, K., GANTI, V., AND MOTWANI, R. 2003. Robust and efficient fuzzy match for online data cleaning. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 313-324.
-
(2003)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
17
-
-
0013206133
-
Collection statistics for fast duplicate document detection
-
DOI 10.1145/506309.506311
-
CHOWDHURY, A., FRIEDER, O., GROSSMAN, D. A., AND MCCABE, M. C. 2002. Collection statistics for fast duplicate document detection. ACM Trans. Inf. Syst. 20, 2, 171-191. (Pubitemid 44642301)
-
(2002)
ACM Transactions on Information Systems
, vol.20
, Issue.2
, pp. 171-191
-
-
Chowdhury, A.1
Frieder, O.2
Grossman, D.A.3
McCabe, M.C.4
-
19
-
-
33845667955
-
Duplicate record detection: A survey
-
DOI 10.1109/TKDE.2007.250581
-
ELMAGARMID, A. K., IPEIROTIS, P. G., AND VERYKIOS, V. S. 2007. Duplicate record detection: A survey. IEEE Trans. Knowl. Data Engin. 19, 1, 1-16. (Pubitemid 44955773)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
20
-
-
0033075316
-
Combining fuzzy information from multiple systems
-
FAGIN, R. 1999. Combining fuzzy information from multiple systems. J. Comput. Syst. Sci. 58, 1, 83-99.
-
(1999)
J. Comput. Syst. Sci.
, vol.58
, Issue.1
, pp. 83-99
-
-
Fagin, R.1
-
22
-
-
0038504811
-
Optimal aggregation algorithms for middleware
-
FAGIN, R., LOTEM, A., AND NAOR, M. 2003b. Optimal aggregation algorithms for middleware. J. Comput. Syst. Sci. 66, 4, 614-656.
-
(2003)
J. Comput. Syst. Sci.
, vol.66
, Issue.4
, pp. 614-656
-
-
Fagin, R.1
Lotem, A.2
Naor, M.3
-
26
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
GRAVANO, L., IPEIROTIS, P. G., JAGADISH, H. V., KOUDAS, N., MUTHUKRISHNAN, S., AND SRIVASTAVA, D. 2001. Approximate string joins in a database (almost) for free. In Proceedings of the International Conference on Very Large Databases (VLDB'01).
-
(2001)
Proceedings of the International Conference on Very Large Databases (VLDB'01)
-
-
Gravano, L.1
Ipeirotis, P.G.2
Jagadish, H.V.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
28
-
-
52649145249
-
Fast indexes and algorithms for set similarity selection queries
-
HADJIELEFTHERIOU, M., CHANDEL, A., KOUDAS, N., AND SRIVASTAVA, D. 2008. Fast indexes and algorithms for set similarity selection queries. In Proceedings of the International Conference on Data Engineering (ICDE'08). 267-276.
-
(2008)
Proceedings of the International Conference on Data Engineering (ICDE'08)
, pp. 267-276
-
-
Hadjieleftheriou, M.1
Chandel, A.2
Koudas, N.3
Srivastava, D.4
-
29
-
-
77957931572
-
Detecting the origin of text segments efficiently
-
HAMID, O. A., BEHZADI, B., CHRISTOPH, S., AND HENZINGER, M. R. 2009. Detecting the origin of text segments efficiently. In Proceedings of the International World Wide Web Conference (WWW'09). 61-70.
-
(2009)
Proceedings of the International World Wide Web Conference (WWW'09)
, pp. 61-70
-
-
Hamid, O.A.1
Behzadi, B.2
Christoph, S.3
Henzinger, M.R.4
-
31
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
HERNANDEZ, M. A. AND STOLFO, S. J. 1998. Real-World data is dirty: Data cleansing and the merge/purge problem. Data Min. Knowl. Discov. 2, 1, 9-37. (Pubitemid 128696797)
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, Issue.1
, pp. 9-37
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
32
-
-
0037319544
-
Methods for identifying versioned and plagiarized documents
-
HOAD, T. C. AND ZOBEL, J. 2003. Methods for identifying versioned and plagiarized documents. J. Amer. Soc. Inf. Sci. Technol. 54, 3, 203-215.
-
(2003)
J. Amer. Soc. Inf. Sci. Technol.
, vol.54
, Issue.3
, pp. 203-215
-
-
Hoad, T.C.1
Zobel, J.2
-
37
-
-
35448984017
-
Spark: Top-k keyword query in relational databases
-
DOI 10.1145/1247480.1247495, SIGMOD 2007: Proceedings of the ACM SIGMOD International Conference on Management of Data
-
LUO, Y., LIN, X., WANG, W., AND ZHOU, X. 2007. SPARK: Top-k keyword query in relational databases. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 115-126. (Pubitemid 47630801)
-
(2007)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 115-126
-
-
Luo, Y.1
Lin, X.2
Wang, W.3
Zhou, X.4
-
39
-
-
35348911985
-
Detecting near-duplicates for web crawling
-
DOI 10.1145/1242572.1242592, 16th International World Wide Web Conference, WWW2007
-
MANKU, G. S., JAIN, A., AND SARMA, A. D. 2007. Detecting near-duplicates for web crawling. In Proceedings of the International World Wide Web Conference (WWW'07). 141-150. (Pubitemid 47582246)
-
(2007)
16th International World Wide Web Conference, WWW2007
, pp. 141-150
-
-
Manku, G.S.1
Jain, A.2
Das Sarma, A.3
-
40
-
-
0345566149
-
A guided tour to approximate string matching
-
NAVARRO, G. 2001. A guided tour to approximate string matching. ACM Comput. Surv. 33, 1, 31-88. (Pubitemid 33768480)
-
(2001)
ACM Computing Surveys
, vol.33
, Issue.1
, pp. 31-88
-
-
Navarro, G.1
-
41
-
-
80052344988
-
-
Index
-
RUSSELL, R. C. 1918. Index. U.S. patent 1,261,167.
-
(1918)
U.S. patent 1,261,167
-
-
Russell, R.C.1
-
50
-
-
70849105253
-
Ed-Join: An efficient algorithm for similarity joins with edit distance constraints
-
XIAO, C., WANG, W., AND LIN, X. 2008a. Ed-Join: An efficient algorithm for similarity joins with edit distance constraints. Proc. VLDB 1, 1, 933-944.
-
(2008)
Proc. VLDB
, vol.1
, Issue.1
, pp. 933-944
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
-
51
-
-
67649653766
-
Top-k set similarity joins
-
XIAO, C., WANG, W., LIN, X., AND SHANG, H. 2009. Top-k set similarity joins. In Proceedings of the International Conference on Data Engineering (ICDE'09). 916-927.
-
(2009)
Proceedings of the International Conference on Data Engineering (ICDE'09)
, pp. 916-927
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Shang, H.4
-
52
-
-
66249113620
-
Efficient similarity joins for near duplicate detection
-
XIAO, C., WANG, W., LIN, X., AND YU, J. X. 2008b. Efficient similarity joins for near duplicate detection. In Proceedings of the International World Wide Web Conference (WWW'08).
-
(2008)
Proceedings of the International World Wide Web Conference (WWW'08)
-
-
Xiao, C.1
Wang, W.2
Lin, X.3
Yu, J.X.4