-
1
-
-
33845615644
-
-
Statistical Research Division, U.S. Bureau of the Census, Washington, DC, Tech. Rep.
-
W. Winkler, "Overview of record linkage and current research directions," Statistical Research Division, U.S. Bureau of the Census, Washington, DC, Tech. Rep., 2006.
-
(2006)
Overview of record linkage and current research directions
-
-
Winkler, S.1
-
2
-
-
59249096513
-
Keeping a digital library clean: new solutions to old problems
-
A. H. F. Laender, M. A. Gonçcalves, R. G. Cota, A. A. Ferreira, R. L. T. Santos, and A. J. C. Silva, "Keeping a digital library clean: new solutions to old problems," in ACM Symposium on Document Engineering, 2008, pp. 257-262.
-
(2008)
in ACM Symposium on Document Engineering
, pp. 257-262
-
-
Laender, A.H.F.1
Gonçcalves, M.A.2
Cota, R.G.3
Ferreira, A.A.4
Santos, R.L.T.5
Silva, A.J.C.6
-
4
-
-
84976666835
-
On the complexity of the extended string-to-string correction problem
-
R. A. Wagner, "On the complexity of the extended string-to-string correction problem," in STOC, 1975, pp. 218-223.
-
(1975)
STOC
, pp. 218-223
-
-
Wagner, R.A.1
-
5
-
-
9444274777
-
Comparing clusterings by the variation of information
-
M. Meila, "Comparing clusterings by the variation of information," in COLT, 2003, pp. 173-187.
-
(2003)
COLT
, pp. 173-187
-
-
Meila, M.1
-
6
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston, MA
-
A. K. McCallum, K. Nigam, and L. Ungar, "Efficient clustering of high-dimensional data sets with application to reference matching," in Proc. of KDD, Boston, MA, 2000, pp. 169-178.
-
(2000)
Proc. of KDD
, pp. 169-178
-
-
McCallum, A.K.1
Nigam, K.2
Ungar, L.3
-
7
-
-
33750287715
-
Efficient name disambiguation for large-scale databases
-
J. Huang, S. Ertekin, and C. L. Giles, "Efficient name disambiguation for large-scale databases," in PKDD, 2006, pp. 536-544.
-
(2006)
PKDD
, pp. 536-544
-
-
Huang, J.1
Ertekin, S.2
Giles, C.L.3
-
8
-
-
59249101599
-
A heuristic-based hierarchical clustering method for author name disambiguation in digital libraries
-
R. G. Cota, M. A. Gonçcalves, and A. H. F. Laender, "A heuristic-based hierarchical clustering method for author name disambiguation in digital libraries," in SBBD, 2007, pp. 20-34.
-
(2007)
SBBD
, pp. 20-34
-
-
Cota, R.G.1
Gonçcalves, M.A.2
Laender, A.H.F.3
-
9
-
-
70849099044
-
Swoosh: a generic approach to entity resolution
-
O. Benjelloun, H. Garcia-Molina, D. Menestrina, S. E. Whang, Q. Su, and J. Widom, "Swoosh: a generic approach to entity resolution," VLDB J., 2008.
-
(2008)
VLDB J.
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Menestrina, D.3
Whang, S.E.4
Su, Q.5
Widom, J.6
-
10
-
-
31844440880
-
Comparing clusterings: an axiomatic view
-
M. Meila, "Comparing clusterings: an axiomatic view," in ICML, 2005, pp. 577-584.
-
(2005)
ICML
, pp. 577-584
-
-
Meila, M.1
-
11
-
-
18744416559
-
Grouping search-engine returned citations for person-name queries
-
R. Al-Kamha and D. W. Embley, "Grouping search-engine returned citations for person-name queries," in WIDM, 2004, pp. 96-103.
-
(2004)
WIDM
, pp. 96-103
-
-
Al-Kamha, A.1
Embley, D.W.2
-
12
-
-
85038468908
-
Evaluating Entity Resolution Results (Extended version)
-
available at
-
D. Menestrina, S. E. Whang, and H. Garcia-Molina, "Evaluating Entity Resolution Results (Extended version)," Stanford University, Tech. Rep., available at http://ilpubs.stanford.edu/930/.
-
Stanford University, Tech. Rep.
-
-
Menestrina, D.1
Whang, S.E.2
Garcia-Molina, h.3
-
13
-
-
0042920396
-
On the functional equation f(x+y, z) + f(x, y) = f(x, y+z) + f(y, z)
-
09
-
M. Hosszú, "On the functional equation f(x+y,z) + f(x,y) = f(x,y+z) + f(y,z)," Periodica Mathematica Hungarica, vol. 1, no. 3, pp. 213-216, 09 1971.
-
(1971)
Periodica Mathematica Hungarica
, vol.1
, Issue.3
, pp. 213-216
-
-
Hosszú, M.1
-
14
-
-
0001139918
-
Record linkage: making maximum use of the discriminating power of identifying information
-
H. B. Newcombe and J. M. Kennedy, "Record linkage: making maximum use of the discriminating power of identifying information," Commun. ACM, vol. 5, no. 11, pp. 563-566, 1962.
-
(1962)
Commun. ACM
, vol.5
, Issue.11
, pp. 563-566
-
-
Newcombe, H.B.1
Kennedy, J.M.2
-
15
-
-
70849098813
-
-
SIGMOD Conference
-
S. E. Whang, D. Menestrina, G. Koutrika, M. Theobald, and H. Garcia-Molina, "Entity resolution with iterative blocking," in SIGMOD Conference, 2009, pp. 219-232.
-
(2009)
Entity resolution with iterative blocking
, pp. 219-232
-
-
Whang, S.E.1
Menestrina, D.2
Koutrika, G.3
Theobald, M.4
Garcia-Molina, H.5
-
16
-
-
4944248042
-
An efficient domain-independent algorithm for detecting approximately duplicate database records
-
A. E. Monge and C. Elkan, "An efficient domain-independent algorithm for detecting approximately duplicate database records," in DMKD, 1997, pp. 23-29.
-
(1997)
DMKD
, pp. 23-29
-
-
Monge, A.E.1
Elkan, C.2
-
17
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
Tokyo, Japan
-
S. Chaudhuri, V. Ganti, and R. Motwani, "Robust identification of fuzzy duplicates," in Proc. of ICDE, Tokyo, Japan, 2005.
-
(2005)
Proc. of ICDE
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
18
-
-
84943425383
-
Efficient record linkage in large data sets
-
L. Jin, C. Li, and S. Mehrotra, "Efficient record linkage in large data sets," in DASFAA, 2003, pp. 137-.
-
(2003)
DASFAA
, pp. 137
-
-
Jin, L.1
Li, C.2
Mehrotra, S.3
-
19
-
-
84976856849
-
The merge/purge problem for large databases
-
M. A. Hernández and S. J. Stolfo, "The merge/purge problem for large databases," in Proc. of ACM SIGMOD, 1995, pp. 127-138.
-
(1995)
Proc. of ACM SIGMOD
, pp. 127-138
-
-
Hernández, M.A.1
Stolfo, S.A.2
-
22
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
R. Baxter, P. Christen, and T. Churches, "A comparison of fast blocking methods for record linkage," in Proc. of ACM SIGKDD'03 Workshop on Data Cleaning, Record Linkage, and Object Consolidation, 2003.
-
(2003)
Proc. of ACM SIGKDD'03 Workshop on Data Cleaning, Record Linkage, and Object Consolidation
-
-
Baxter, R.1
Christen, P.2
Churches, T.3
-
23
-
-
72649086387
-
Framework for evaluating clustering algorithms in duplicate detection
-
O. Hassanzadeh, F. Chiang, R. J. Miller, and H. C. Lee, "Framework for evaluating clustering algorithms in duplicate detection," PVLDB, vol. 2, no. 1, pp. 1282-1293, 2009.
-
(2009)
PVLDB
, vol.2
, Issue.1
, pp. 1282-1293
-
-
Hassanzadeh, O.1
Chiang, F.2
Miller, R.J.3
Lee, H.C.4
-
24
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
M. Bilenko and R. Mooney, "Adaptive duplicate detection using learnable string similarity measures," in KDD, 2003.
-
(2003)
KDD
-
-
Bilenko, M.1
Mooney, R.2
|