-
1
-
-
33845667955
-
Duplicate record detection: A survey
-
DOI 10.1109/TKDE.2007.250581
-
A.K. Elmagarmid, P.G. Ipeirotis, and V.S. Verykios, "Duplicate Record Detection: A Survey," IEEE Trans. Knowledge Data Eng., vol. 19, no. 1, pp. 1-16, Jan. 2007. (Pubitemid 44955773)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
2
-
-
84893405732
-
Data clustering: A review
-
A.K. Jain, M.N. Murty, and P.J. Flynn, "Data Clustering: A Review," ACM Computing Surveys, vol. 31, no. 3, pp. 264-323, 1999.
-
(1999)
ACM Computing Surveys
, vol.31
, Issue.3
, pp. 264-323
-
-
Jain, A.K.1
Murty, M.N.2
Flynn, P.J.3
-
3
-
-
0001139918
-
Record linkage: Making maximum use of the discriminating power of identifying Information
-
H.B. Newcombe and J.M. Kennedy, "Record Linkage: Making Maximum Use of the Discriminating Power of Identifying Information," Comm. ACM, vol. 5, no. 11, pp. 563-566, 1962.
-
(1962)
Comm ACM
, vol.5
, Issue.11
, pp. 563-566
-
-
Newcombe, H.B.1
Kennedy, J.M.2
-
5
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference Matching
-
A.K. McCallum, K. Nigam, and L. Ungar, "Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching," Proc. ACM Sixth SIGKDD Int'l Conf. Knowledge Discovery and Data Mining, pp. 169-178, 2000.
-
(2000)
Proc ACM Sixth SIGKDD Int'l Conf. Knowledge Discovery and Data Mining
, pp. 169-178
-
-
McCallum, A.K.1
Nigam, K.2
Ungar, L.3
-
6
-
-
0001944742
-
Similarity search in high dimensions via hashing
-
A. Gionis, P. Indyk, and R. Motwani, "Similarity Search in High Dimensions via Hashing," Proc. 25th Int'l Conf. Very Large Databases (VLDB), pp. 518-529, 1999.
-
(1999)
Proc. 25th Int'l Conf. Very Large Databases (VLDB)
, pp. 518-529
-
-
Gionis, A.1
Indyk, P.2
Motwani, R.3
-
7
-
-
84875754821
-
-
Stanford Univ available at
-
S.E. Whang, D. Marmaros, and H. Garcia-Molina, "Pay-As-You-Go Entity Resolution," technical report, Stanford Univ., available at http://ilpubs.stanford.edu:8090/979/, 2012.
-
(2012)
Pay-as-you-go Entity Resolution Technical Report
-
-
Whang, S.E.1
Marmaros, D.2
Garcia-Molina, H.3
-
9
-
-
58149472338
-
Swoosh: A generic approach to entity resolution
-
O. Benjelloun, H. Garcia-Molina, D. Menestrina, Q. Su, S.E. Whang, and J. Widom, "Swoosh: A Generic Approach to Entity Resolution," VLDB J., vol. 18, no. 1, pp. 255-276, 2009.
-
(2009)
VLDB J
, vol.18
, Issue.1
, pp. 255-276
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Menestrina, D.3
Su, Q.4
Whang, S.E.5
Widom, J.6
-
11
-
-
0344612511
-
A small approximately min-wise independent family of hash functions
-
DOI doi:10.1006/jagm.2000.1131
-
P. Indyk, "A Small Approximately Min-Wise Independent Family of Hash Functions," J. Algorithms, vol. 38, no. 1, pp. 84-90, 2001. (Pubitemid 33667201)
-
(2001)
Journal of Algorithms
, vol.38
, Issue.1
, pp. 84-90
-
-
Indyk, P.1
-
12
-
-
85104914015
-
Efficient exact set-similarity joins
-
A. Arasu, V. Ganti, and R. Kaushik, "Efficient Exact Set-Similarity Joins," Proc. 32nd Int'l Conf. Very Large Data Bases (VLDB), pp. 918-929, 2006.
-
(2006)
Proc. 32nd Int'l Conf. Very Large Data Bases (VLDB)
, pp. 918-929
-
-
Arasu, A.1
Ganti, V.2
Kaushik, R.3
-
13
-
-
0000666461
-
Data integration using similarity joins and a word-based information representation language
-
W.W. Cohen, "Data Integration Using Similarity Joins and a Word-Based Information Representation Language," ACM Trans. Information Systems, vol. 18, no. 3, pp. 288-321, 2000.
-
(2000)
ACM Trans. Information Systems
, vol.18
, Issue.3
, pp. 288-321
-
-
Cohen, W.W.1
-
14
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
SIGMOD 2005: Proceedings of the ACM SIGMOD International Conference on Management of Data
-
X. Dong, A.Y. Halevy, and J. Madhavan, "Reference Reconciliation in Complex Information Spaces," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 85-96, 2005. (Pubitemid 43038919)
-
(2005)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 85-96
-
-
Dong, X.1
Halevy, A.2
Madhavan, J.3
-
15
-
-
33749618105
-
Detecting duplicates in complex XML data
-
DOI 10.1109/ICDE.2006.49, 1617477, Proceedings of the 22nd International Conference on Data Engineering, ICDE '06
-
M. Weis and F. Naumann, "Detecting Duplicates in Complex XML Data," Proc. 22nd Int'l Conf. Data Eng. (ICDE), p. 109, 2006. (Pubitemid 44539901)
-
(2006)
Proceedings - International Conference on Data Engineering
, vol.2006
, pp. 109
-
-
Weis, M.1
Naumann, F.2
-
16
-
-
72649086387
-
Framework for evaluating clustering algorithms in duplicate detection
-
O. Hassanzadeh, F. Chiang, R.J. Miller, and H.C. Lee, "Framework for Evaluating Clustering Algorithms in Duplicate Detection," Proc. VLDB Endowment, vol. 2, no. 1, pp. 1282-1293, 2009.
-
(2009)
Proc. VLDB Endowment
, vol.2
, Issue.1
, pp. 1282-1293
-
-
Hassanzadeh, O.1
Chiang, F.2
Miller, R.J.3
Lee, H.C.4
-
17
-
-
84858669792
-
Web-scale data integration: You can afford to pay as you go
-
J. Madhavan, S. Cohen, X.L. Dong, A.Y. Halevy, S.R. Jeffery, D. Ko, and C. Yu, "Web-Scale Data Integration: You Can Afford to Pay As You Go," Proc. Conf. Innovative Data Systems Research (CIDR), pp. 342-350, 2007.
-
(2007)
Proc. Conf. Innovative Data Systems Research (CIDR)
, pp. 342-350
-
-
Madhavan, J.1
Cohen, S.2
Dong, X.L.3
Halevy, A.Y.4
Jeffery, S.R.5
Ko, D.6
Yu, C.7
-
18
-
-
57149131807
-
Pay-As-You-Go user feedback for dataspace systems
-
S.R. Jeffery, M.J. Franklin, and A.Y. Halevy, "Pay-As-You-Go User Feedback for Dataspace Systems," Proc. ACM SIGMOD Int'l Conf. Management of Data, pp. 847-860, 2008.
-
(2008)
Proc ACM SIGMOD Int'l Conf. Management of Data
, pp. 847-860
-
-
Jeffery, S.R.1
Franklin, M.J.2
Halevy, A.Y.3
|