-
1
-
-
85039681411
-
-
http://secondstring.sourceforge.net/.
-
-
-
-
2
-
-
85039681574
-
-
http://www.dcs.shef.ac.uk/~sam/simmetrics.html.
-
-
-
-
3
-
-
85104914015
-
Efficient exact set-similarity joins
-
A. Arasu, V. Ganti, and R. Kaushik. Efficient exact set-similarity joins. In VLDB, pages 918-929, 2006.
-
(2006)
VLDB
, pp. 918-929
-
-
Arasu, A.1
Ganti, V.2
Kaushik, R.3
-
4
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
R. Baxter, P. Christen, and T. Churches. A comparison of fast blocking methods for record linkage. In Proceedings of the 2003 ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation, pages 25-27, 2003.
-
(2003)
Proceedings of the 2003 ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation
, pp. 25-27
-
-
Baxter, R.1
Christen, P.2
Churches, T.3
-
5
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
M. Bilenko and R. J. Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, pages 39-48, 2003.
-
(2003)
KDD
, pp. 39-48
-
-
Bilenko, M.1
Mooney, R.J.2
-
6
-
-
85011029434
-
Example-driven design of efficient record matching queries
-
S. Chaudhuri, B.-C. Chen, V. Ganti, and R. Kaushik. Example-driven design of efficient record matching queries. In VLDB, pages 327-338, 2007.
-
(2007)
VLDB
, pp. 327-338
-
-
Chaudhuri, S.1
Chen, B.-C.2
Ganti, V.3
Kaushik, R.4
-
7
-
-
33749597967
-
A primitive operator for similarity joins in data cleaning
-
S. Chaudhuri, V. Ganti, and R. Kaushik. A primitive operator for similarity joins in data cleaning. In ICDE, pages 5-16, 2006.
-
(2006)
ICDE
, pp. 5-16
-
-
Chaudhuri, S.1
Ganti, V.2
Kaushik, R.3
-
8
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In IIWEB, pages 73-78, 2003.
-
(2003)
IIWEB
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
9
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
W. W. Cohen and J. Richman. Learning to match and cluster large high-dimensional data sets for data integration. In KDD, pages 475-480, 2002.
-
(2002)
KDD
, pp. 475-480
-
-
Cohen, W.W.1
Richman, J.2
-
10
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. TKDE, 19(1):1-16, 2007.
-
(2007)
TKDE
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
11
-
-
84865086832
-
Reasoning about record matching rules
-
W. Fan, X. Jia, J. Li, and S. Ma. Reasoning about record matching rules. PVLDB, 2(1):407-418, 2009.
-
(2009)
PVLDB
, vol.2
, Issue.1
, pp. 407-418
-
-
Fan, W.1
Jia, X.2
Li, J.3
Ma, S.4
-
12
-
-
84947399464
-
A theory for record linkage. Journal of the American Statistical Association
-
I. P. Fellegi and A. B. Sunter. A theory for record linkage. Journal of the American Statistical Association, 64(328):1183-1210, 1969.
-
(1969)
, vol.64
, Issue.328
, pp. 1183-1210
-
-
Fellegi, I.P.1
Sunter, A.B.2
-
13
-
-
0344756845
-
Declarative data cleaning: Language, model, and algorithms
-
H. Galhardas, D. Florescu, D. Shasha, E. Simon, and C.-A. Saita. Declarative data cleaning: Language, model, and algorithms. In VLDB, pages 371-380, 2001.
-
(2001)
VLDB
, pp. 371-380
-
-
Galhardas, H.1
Florescu, D.2
Shasha, D.3
Simon, E.4
Saita, C.-A.5
-
14
-
-
84976856849
-
The merge/purge problem for large databases
-
M. A. Herńandez and S. J. Stolfo. The merge/purge problem for large databases. In SIGMOD, pages 127-138, 1995.
-
(1995)
SIGMOD
, pp. 127-138
-
-
Herńandez, M.A.1
Stolfo, S.J.2
-
16
-
-
84950419860
-
Advances in record-linkage methodology as applied to matching the 1985 census of tampa, florida
-
M. A. Jaro. Advances in record-linkage methodology as applied to matching the 1985 census of tampa, florida. Journal of the American Statistical Association, 84(406):414-420, 1989.
-
(1989)
Journal of the American Statistical Association
, vol.84
, Issue.406
, pp. 414-420
-
-
Jaro, M.A.1
-
17
-
-
34250670467
-
Record linkage: similarity measures and algorithms
-
N. Koudas, S. Sarawagi, and D. Srivastava. Record linkage: similarity measures and algorithms. In SIGMOD, pages 802-803, 2006.
-
(2006)
SIGMOD
, pp. 802-803
-
-
Koudas, N.1
Sarawagi, S.2
Srivastava, D.3
-
18
-
-
0027189241
-
Entity identification in database integration
-
E. Lim, J. Srivastava, S. Prabhakar, and J. Richardson. Entity identification in database integration. In ICDE, pages 294-301, 1993.
-
(1993)
ICDE
, pp. 294-301
-
-
Lim, E.1
Srivastava, J.2
Prabhakar, S.3
Richardson, J.4
-
19
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
A. McCallum, K. Nigam, and L. H. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In KDD, pages 169-178, 2000.
-
(2000)
KDD
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
-
20
-
-
0242456811
-
Interactive deduplication using active learning
-
S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In KDD, pages 269-278, 2002.
-
(2002)
KDD
, pp. 269-278
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
21
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
S. Tejada, C. A. Knoblock, and S. Minton. Learning domain-independent string transformation weights for high accuracy object identification. In KDD, pages 350-359, 2002.
-
(2002)
KDD
, pp. 350-359
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
|