-
1
-
-
56349095491
-
Aggregating inconsistent information: Ranking and clustering
-
N. Ailon, M. Charikar, and A. Newman. Aggregating inconsistent information: Ranking and clustering. J. ACM, 2008.
-
(2008)
J. ACM
-
-
Ailon, N.1
Charikar, M.2
Newman, A.3
-
2
-
-
67649649597
-
Large-scale deduplication with constraints using dedupalog
-
A. Arasu, C. Rè, and D. Suciu. Large-scale deduplication with constraints using dedupalog. In ICDE, 2009.
-
(2009)
ICDE
-
-
Arasu, A.1
Rè, C.2
Suciu, D.3
-
4
-
-
9444249661
-
On evaluation and training-set construction for duplicate detection
-
M. Bilenko and R. J. Mooney. On evaluation and training-set construction for duplicate detection. In KDD, 2003.
-
(2003)
KDD
-
-
Bilenko, M.1
Mooney, R.J.2
-
5
-
-
84866626604
-
Effective and efficient entity search in rdf data
-
R. Blanco, P. Mika, and S. Vigna. Effective and efficient entity search in rdf data. In ISWC, 2011.
-
(2011)
ISWC
-
-
Blanco, R.1
Mika, P.2
Vigna, S.3
-
6
-
-
36849045251
-
Canonicalization of database records using adaptive similarity measures
-
A. Culotta, M. Wick, R. Hall, M. Marzilli, and A. McCallum. Canonicalization of database records using adaptive similarity measures. In KDD, 2007.
-
(2007)
KDD
-
-
Culotta, A.1
Wick, M.2
Hall, R.3
Marzilli, M.4
McCallum, A.5
-
7
-
-
79959294918
-
A fast approach for parallel deduplication on multicore processors
-
G. Dal Bianco, R. Galante, and C. A. Heuser. A fast approach for parallel deduplication on multicore processors. In SACC, 2011.
-
(2011)
SACC
-
-
Dal Bianco, G.1
Galante, R.2
Heuser, C.A.3
-
8
-
-
77954579212
-
A web of concepts
-
N. Dalvi, R. Kumar, B. Pang, R. Ramakrishnan, A. Tomkins, P. Bohannon, S. Keerthi, and S. Merugu. A web of concepts. In PODS, 2009.
-
(2009)
PODS
-
-
Dalvi, N.1
Kumar, R.2
Pang, B.3
Ramakrishnan, R.4
Tomkins, A.5
Bohannon, P.6
Keerthi, S.7
Merugu, S.8
-
13
-
-
84891084500
-
Improving entity resolution with global constraints
-
J. Gemmell, B. Rubinstein, and A. K. Chandra. Improving entity resolution with global constraints. CoRR, 2011.
-
(2011)
CoRR
-
-
Gemmell, J.1
Rubinstein, B.2
Chandra, A.K.3
-
14
-
-
84872977079
-
Dedoop: Efficient Deduplication with Hadoop
-
L. Kolb, A. Thor, and E. Rahm. Dedoop: Efficient Deduplication with Hadoop. In VLDB, 2012.
-
(2012)
VLDB
-
-
Kolb, L.1
Thor, A.2
Rahm, E.3
-
15
-
-
72649095071
-
Frameworks for entity matching: A comparison
-
H. KÖpcke and E. Rahm. Frameworks for entity matching: A comparison. Data Knowl. Eng., 2010.
-
(2010)
Data Knowl. Eng.
-
-
Köpcke, H.1
Rahm, E.2
-
16
-
-
80455148340
-
Evaluation of entity resolution approaches on real-world match problems
-
H. Kopcke, A. Thor, and E. Rahm. Evaluation of entity resolution approaches on real-world match problems. PVLDB, 2010.
-
(2010)
PVLDB
-
-
Kopcke, H.1
Thor, A.2
Rahm, E.3
-
17
-
-
85039651406
-
Robust runtime optimization and skew-resistant execution of analytical sparql queries on pig
-
S. Kotoulas, J. Urbani, P. Boncz, and P. Mika. Robust runtime optimization and skew-resistant execution of analytical sparql queries on pig. In ISWC, 2012.
-
(2012)
ISWC
-
-
Kotoulas, S.1
Urbani, J.2
Boncz, P.3
Mika, P.4
-
18
-
-
84885862102
-
Dynamic Record Blocking: Efficient Linking of Massive Databases in MapReduce
-
B. McNeill, H. Kardes, and A. Borthwick. Dynamic Record Blocking: Efficient Linking of Massive Databases in MapReduce. In QDB, 2012.
-
(2012)
QDB
-
-
McNeill, B.1
Kardes, H.2
Borthwick, A.3
-
19
-
-
70849114415
-
Prima: archiving and querying historical data with evolving schemas
-
H. J. Moon, C. Curino, M. Ham, and C. Zaniolo. Prima: archiving and querying historical data with evolving schemas. In SIGMOD, 2009.
-
(2009)
SIGMOD
-
-
Moon, H.J.1
Curino, C.2
Ham, M.3
Zaniolo, C.4
-
20
-
-
84860875510
-
Information integration over time in unreliable and uncertain environments
-
A. Pal, V. Rastogi, A. Machanavajjhala, and P. Bohannon. Information integration over time in unreliable and uncertain environments. In WWW, 2012.
-
(2012)
WWW
-
-
Pal, A.1
Rastogi, V.2
Machanavajjhala, A.3
Bohannon, P.4
-
21
-
-
79958070483
-
Efficient entity resolution methods for heterogeneous information spaces
-
G. Papadakis and W. Nejdl. Efficient entity resolution methods for heterogeneous information spaces. In ICDE Workshops, 2011.
-
(2011)
ICDE Workshops
-
-
Papadakis, G.1
Nejdl, W.2
-
22
-
-
84891103393
-
An automatic blocking mechanism for large-scale de-duplication tasks
-
A. D. Sarma, A. Jain, A. Machanavajjhala, and P. Bohannon. An automatic blocking mechanism for large-scale de-duplication tasks. In CIKM, 2012.
-
(2012)
CIKM
-
-
Sarma, A.D.1
Jain, A.2
Machanavajjhala, A.3
Bohannon, P.4
-
23
-
-
73649090431
-
Structural characterizations of schema-mapping languages
-
B. ten Cate and P. G. Kolaitis. Structural characterizations of schema-mapping languages. Commun. ACM, 53(1), 2010.
-
(2010)
Commun. ACM
, vol.53
, Issue.1
-
-
ten Cate, B.1
Kolaitis, P.G.2
-
24
-
-
77954744650
-
Efficient parallel set-similarity joins using mapreduce
-
R. Vernica, M. J. Carey, and C. Li. Efficient parallel set-similarity joins using mapreduce. In SIGMOD, 2010.
-
(2010)
SIGMOD
-
-
Vernica, R.1
Carey, M.J.2
Li, C.3
-
25
-
-
84891077668
-
-
High quality real-time incremental entity resolution in a knowledge base
-
M. J. Welch, C. Drome, and A. Sane. High quality real-time incremental entity resolution in a knowledge base.
-
-
-
Welch, M.J.1
Drome, C.2
Sane, A.3
-
26
-
-
84959315823
-
Fast and accurate incremental entity resolution relative to a batch resolved corpus
-
M. J. Welch, C. Drome, and A. Sane. Fast and accurate incremental entity resolution relative to a batch resolved corpus. In CIKM, 2012.
-
(2012)
CIKM
-
-
Welch, M.J.1
Drome, C.2
Sane, A.3
|