-
1
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
M. Bilenko, B. Kamath, and R. J. Mooney. Adaptive blocking: Learning to scale up record linkage. In ICDM, pages 87-96, 2006.
-
(2006)
ICDM
, pp. 87-96
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
2
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
M. Bilenko and R. J. Mooney. Adaptive duplicate detection using learnable string similarity measures. In KDD, pages 39-48, 2003.
-
(2003)
KDD
, pp. 39-48
-
-
Bilenko, M.1
Mooney, R.J.2
-
3
-
-
70849083729
-
Exploiting context analysis for combining multiple entity resolution systems
-
Z. Chen, D. V. Kalashnikov, and S. Mehrotra. Exploiting context analysis for combining multiple entity resolution systems. In SIGMOD, pages 207-218, 2009.
-
(2009)
SIGMOD
, pp. 207-218
-
-
Chen, Z.1
Kalashnikov, D.V.2
Mehrotra, S.3
-
4
-
-
84920595044
-
A survey of indexing techniques for scalable record linkage and deduplication
-
P. Christen. A survey of indexing techniques for scalable record linkage and deduplication. TKDE, 24(9):1537-1555, 2012.
-
(2012)
TKDE
, vol.24
, Issue.9
, pp. 1537-1555
-
-
Christen, P.1
-
5
-
-
67650700151
-
Accurate synthetic generation of realistic personal information
-
P. Christen and A. Pudjijono. Accurate synthetic generation of realistic personal information. In PAKDD, pages 507-514, 2009.
-
(2009)
PAKDD
, pp. 507-514
-
-
Christen, P.1
Pudjijono, A.2
-
6
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
W. W. Cohen and J. Richman. Learning to match and cluster large high-dimensional data sets for data integration. In KDD, pages 475-480, 2002.
-
(2002)
KDD
, pp. 475-480
-
-
Cohen, W.W.1
Richman, J.2
-
7
-
-
79959973992
-
Robust record linkage blocking using suffix arrays
-
T. de Vries, H. Ke, S. Chawla, and P. Christen. Robust record linkage blocking using suffix arrays. In CIKM, pages 1565-1568, 2009.
-
(2009)
CIKM
, pp. 1565-1568
-
-
de Vries, T.1
Ke, H.2
Chawla, S.3
Christen, P.4
-
10
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
L. Gravano, P. Ipeirotis, H. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava. Approximate string joins in a database (almost) for free. In VLDB, pages 491-500, 2001.
-
(2001)
VLDB
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.2
Jagadish, H.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
12
-
-
84976856849
-
The merge/purge problem for large databases
-
M. Hernández and S. Stolfo. The merge/purge problem for large databases. In SIGMOD, pages 127-138, 1995.
-
(1995)
SIGMOD
, pp. 127-138
-
-
Hernández, M.1
Stolfo, S.2
-
13
-
-
84892971761
-
Mfiblocks: An effective blocking algorithm for entity resolution
-
B. Kenig and A. Gal. Mfiblocks: An effective blocking algorithm for entity resolution. Inf. Syst., 38(6):908-926, 2013.
-
(2013)
Inf. Syst.
, vol.38
, Issue.6
, pp. 908-926
-
-
Kenig, B.1
Gal, A.2
-
14
-
-
77952280581
-
HARRA: fast iterative hashed record linkage for large-scale data collections
-
H. Kim and D. Lee. HARRA: fast iterative hashed record linkage for large-scale data collections. In EDBT, pages 525-536, 2010.
-
(2010)
EDBT
, pp. 525-536
-
-
Kim, H.1
Lee, D.2
-
16
-
-
84874283130
-
Typimatch: type-specific unsupervised learning of keys and key values for heterogeneous web data integration
-
Y. Ma and T. Tran. Typimatch: type-specific unsupervised learning of keys and key values for heterogeneous web data integration. In WSDM, pages 325-334, 2013.
-
(2013)
WSDM
, pp. 325-334
-
-
Ma, Y.1
Tran, T.2
-
17
-
-
0034592784
-
Efficient clustering of highdimensional data sets with application to reference matching
-
A. McCallum, K. Nigam, and L. Ungar. Efficient clustering of highdimensional data sets with application to reference matching. In KDD, pages 169-178, 2000.
-
(2000)
KDD
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.3
-
18
-
-
33750728911
-
Learning blocking schemes for record linkage
-
M. Michelson and C. A. Knoblock. Learning blocking schemes for record linkage. In AAAI, pages 440-445, 2006.
-
(2006)
AAAI
, pp. 440-445
-
-
Michelson, M.1
Knoblock, C.A.2
-
20
-
-
84858041897
-
Beyond 100 million entities: Large-scale blocking-based resolution for heterogeneous data
-
G. Papadakis, E. Ioannou, C. Niederée, T. Palpanas, and W. Nejdl. Beyond 100 million entities: Large-scale blocking-based resolution for heterogeneous data. In WSDM, pages 53-62, 2012.
-
(2012)
WSDM
, pp. 53-62
-
-
Papadakis, G.1
Ioannou, E.2
Niederée, C.3
Palpanas, T.4
Nejdl, W.5
-
21
-
-
84887673907
-
A blocking framework for entity resolution in highly heterogeneous information spaces
-
G. Papadakis, E. Ioannou, T. Palpanas, C. Niederée, and W. Nejdl. A blocking framework for entity resolution in highly heterogeneous information spaces. IEEE Trans. Knowl. Data Eng., 25(12):2665-2682, 2013.
-
(2013)
IEEE Trans. Knowl. Data Eng.
, vol.25
, Issue.12
, pp. 2665-2682
-
-
Papadakis, G.1
Ioannou, E.2
Palpanas, T.3
Niederée, C.4
Nejdl, W.5
-
22
-
-
84904650785
-
Meta-blocking: Taking entity resolutionto the next level
-
G. Papadakis, G. Koutrika, T. Palpanas, andW. Nejdl. Meta-blocking: Taking entity resolutionto the next level. IEEE Trans. Knowl. Data Eng., 26(8):1946-1960, 2014.
-
(2014)
IEEE Trans. Knowl. Data Eng.
, vol.26
, Issue.8
, pp. 1946-1960
-
-
Papadakis, G.1
Koutrika, G.2
Palpanas, T.3
Nejdl, W.4
-
23
-
-
0242456811
-
Interactive deduplication using active learning
-
S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In KDD, pages 269-278, 2002.
-
(2002)
KDD
, pp. 269-278
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
24
-
-
0242456803
-
Learning domainindependent string transformation weights for high accuracy object identification
-
S. Tejada, C. A. Knoblock, and S. Minton. Learning domainindependent string transformation weights for high accuracy object identification. In KDD, pages 350-359, 2002.
-
(2002)
KDD
, pp. 350-359
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
-
25
-
-
70849098813
-
Entity resolution with iterative blocking
-
S. E. Whang, D. Menestrina, G. Koutrika, M. Theobald, and H. Garcia-Molina. Entity resolution with iterative blocking. In SIGMOD Conference, pages 219-232, 2009.
-
(2009)
SIGMOD Conference
, pp. 219-232
-
-
Whang, S.E.1
Menestrina, D.2
Koutrika, G.3
Theobald, M.4
Garcia-Molina, H.5
|