-
2
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
R. Baxter, P. Christen, and T. Churches, "A Comparison of Fast Blocking Methods for Record Linkage," Proc. Workshop Data Cleaning, Record Linkage and Object Consolidation at SIGKDD, pp. 25-27, 2003
-
(2003)
Proc. Workshop Data Cleaning, Record Linkage and Object Consolidation at SIGKDD
, pp. 25-27
-
-
Baxter, R.1
Christen, P.2
Churches, T.3
-
3
-
-
84878049861
-
Adaptive blocking: Learning to scale up record linkage
-
M. Bilenko, B. Kamath, and R.J. Mooney, "Adaptive Blocking: Learning to Scale Up Record Linkage," Proc. Sixth Intl Conf. Data Mining (ICDM), pp. 87-96, 2006
-
(2006)
Proc. Sixth Intl Conf. Data Mining (ICDM)
, pp. 87-96
-
-
Bilenko, M.1
Kamath, B.2
Mooney, R.J.3
-
4
-
-
77952649961
-
Linked data-The story so far
-
C. Bizer, T. Heath, and T. Berners-Lee, "Linked Data-The Story So Far," Intl J. Semantic Web Information Systems, vol. 5, no. 3, pp. 1-22, 2009
-
(2009)
Intl J. Semantic Web Information Systems
, vol.5
, Issue.3
, pp. 1-22
-
-
Bizer, C.1
Heath, T.2
Berners-Lee, T.3
-
5
-
-
84920595044
-
A survey of indexing techniques for scalable record linkage and deduplication
-
Sept.
-
P. Christen, "A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication," IEEE Trans. Knowledge and Data Eng., vol. 24, no. 9, pp. 1537-1555, Sept. 2012
-
(2012)
IEEE Trans. Knowledge and Data Eng
, vol.24
, Issue.9
, pp. 1537-1555
-
-
Christen, P.1
-
6
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
W.W. Cohen, P.D. Ravikumar, and S.E. Fienberg, "A Comparison of String Distance Metrics for Name-Matching Tasks," Proc. Workshop Information Integration Web (IIWeb), pp. 73-78, 2003
-
(2003)
Proc. Workshop Information Integration Web (IIWeb)
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.D.2
Fienberg, S.E.3
-
7
-
-
74549152150
-
Robust record linkage blocking using suffix arrays
-
T. de Vries, H. Ke, S. Chawla, and P. Christen, "Robust Record Linkage Blocking Using Suffix Arrays," Proc. 18th ACM Conf. Information and Knowledge Management (CIKM), pp. 305-314, 2009
-
(2009)
Proc. 18th ACM Conf. Information and Knowledge Management (CIKM)
, pp. 305-314
-
-
De Vries, T.1
Ke, H.2
Chawla, S.3
Christen, P.4
-
8
-
-
0002629270
-
Maximum likelihood from incomplete data via the em algorithm
-
A. Dempster, N. Laird, and D. Rubin, "Maximum Likelihood from Incomplete Data via the EM Algorithm," J. Royal Statistical Soc., vol. 39, pp. 1-38, 1977
-
(1977)
J. Royal Statistical Soc
, vol.39
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
9
-
-
17244380794
-
Semantic-integration research in the database community: A brief survey
-
A. Doan and A. Halevy, "Semantic Integration Research in the Database Community: A Brief Survey," AI Magazine, vol. 26, no. 1, pp. 83-94, 2005 (Pubitemid 40527390)
-
(2005)
AI Magazine
, vol.26
, Issue.1
, pp. 83-94
-
-
Doan, A.H.1
Halevy, A.Y.2
-
10
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
SIGMOD 2005: Proceedings of the ACM SIGMOD International Conference on Management of Data
-
X. Dong, A. Halevy, and J. Madhavan, "Reference Reconciliation in Complex Information Spaces," Proc. ACM SIGMOD Intl Conf. Management of Data (SIGMOD), pp. 85-96, 2005 (Pubitemid 43038919)
-
(2005)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 85-96
-
-
Dong, X.1
Halevy, A.2
Madhavan, J.3
-
11
-
-
33845667955
-
Duplicate record detection: A survey
-
DOI 10.1109/TKDE.2007.250581
-
A. Elmagarmid, P. Ipeirotis, and V. Verykios, "Duplicate Record Detection: A Survey," IEEE Trans. Knowledge and Data Eng., vol. 19, no. 1, pp. 1-16, Jan. 2007 (Pubitemid 44955773)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
13
-
-
55349091998
-
Summarization system evaluation revisited: N-Gram graphs
-
G. Giannakopoulos, V. Karkaletsis, G. Vouros, and P. Stamatopoulos, "Summarization System Evaluation Revisited: N-Gram Graphs," ACM Trans. Speech and Language Processing, vol. 5, pp. 1-39, 2008
-
(2008)
ACM Trans. Speech and Language Processing
, vol.5
, pp. 1-39
-
-
Giannakopoulos, G.1
Karkaletsis, V.2
Vouros, G.3
Stamatopoulos, P.4
-
14
-
-
79951755891
-
Content and type as orthogonal modeling features: A study on user interest awareness in entity subscription services
-
G. Giannakopoulos and T. Palpanas, "Content and Type as Orthogonal Modeling Features: A Study on User Interest Awareness in Entity Subscription Services," Intl J. Advances on Networks and Services, vol. 3, no. 2, pp. 296-309, 2010
-
(2010)
Intl J. Advances on Networks and Services
, vol.3
, Issue.2
, pp. 296-309
-
-
Giannakopoulos, G.1
Palpanas, T.2
-
15
-
-
84944318804
-
Approximate string joins in a database (Almost) for free
-
L. Gravano, P. Ipeirotis, H. Jagadish, N. Koudas, S. Muthukrishnan, and D. Srivastava, "Approximate String Joins in a Database (Almost) for Free," Proc. 27th Intl Conf. Very Large Data Bases (VLDB), pp. 491-500, 2001
-
(2001)
Proc. 27th Intl Conf. Very Large Data Bases (VLDB)
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.2
Jagadish, H.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
16
-
-
79960023714
-
Record linkage with uniqueness constraints and erroneous values
-
S. Guo, X. Dong, D. Srivastava, and R. Zajac, "Record Linkage with Uniqueness Constraints and Erroneous Values," Proc. VLDB Endowment, vol. 3, no. 1, pp. 417-428, 2010
-
(2010)
Proc. VLDB Endowment
, vol.3
, Issue.1
, pp. 417-428
-
-
Guo, S.1
Dong, X.2
Srivastava, D.3
Zajac, R.4
-
17
-
-
34250660624
-
Principles of dataspace systems
-
DOI 10.1145/1142351.1142352, Proceedings of the Twenty-Fifth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, PODS 2006
-
A.Y. Halevy, M.J. Franklin, and D. Maier, "Principles of Dataspace Systems," Proc. 25th ACM SIGMOD-SIGACT-SIGART Symp. Principles of Database Systems (PODS), pp. 1-9, 2006 (Pubitemid 46946461)
-
(2006)
Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems
, pp. 1-9
-
-
Halevy, A.1
Franklin, M.2
Maier, D.3
-
19
-
-
79959927816
-
On-The-Fly entity-aware query processing in the presence of linkage
-
E. Ioannou, W. Nejdl, C. Niederee, and Y. Velegrakis, "On-the-Fly Entity-Aware Query Processing in the Presence of Linkage," Proc. VLDB Endowment, vol. 3, no. 1, pp. 429-438, 2010
-
(2010)
Proc. VLDB Endowment
, vol.3
, Issue.1
, pp. 429-438
-
-
Ioannou, E.1
Nejdl, W.2
Niederee, C.3
Velegrakis, Y.4
-
22
-
-
34250670467
-
Record linkage: Similarity measures and algorithms
-
DOI 10.1145/1142473.1142599, SIGMOD 2006 - Proceedings of the ACM SIGMOD International Conference on Management of Data
-
N. Koudas, S. Sarawagi, and D. Srivastava, "Record Linkage: Similarity Measures and Algorithms," Proc. ACM SIGMOD Intl Conf. Management of Data (SIGMOD), pp. 802-803, 2006 (Pubitemid 46946588)
-
(2006)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 802-803
-
-
Koudas, N.1
Sarawagi, S.2
Srivastava, D.3
-
23
-
-
84858669792
-
Web-Scale data integration: You can afford to pay as you go
-
J. Madhavan, S. Cohen, X. Dong, A. Halevy, S. Jeffery, D. Ko, and C. Yu, "Web-Scale Data Integration: You Can Afford to Pay as You Go," Proc. Conf. Innovative Data Systems Research (CIDR), pp. 342-350, 2007
-
(2007)
Proc. Conf. Innovative Data Systems Research (CIDR)
, pp. 342-350
-
-
Madhavan, J.1
Cohen, S.2
Dong, X.3
Halevy, A.4
Jeffery, S.5
Ko, D.6
Yu, C.7
-
25
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
A. McCallum, K. Nigam, and L. Ungar, "Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching," Proc. Sixth ACM SIGKDD Intl Conf. Knowledge Discovery and Data Mining (KDD), pp. 169-178, 2000
-
(2000)
Proc. Sixth ACM SIGKDD Intl Conf. Knowledge Discovery and Data Mining (KDD)
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.3
-
26
-
-
33750728911
-
Learning blocking schemes for record linkage
-
Proceedings of the 21st National Conference on Artificial Intelligence and the 18th Innovative Applications of Artificial Intelligence Conference, AAAI-06/IAAI-06
-
M. Michelson and C.A. Knoblock, "Learning Blocking Schemes for Record Linkage," Proc. 21st Natl Conf. Artificial Intelligence (AAAI), pp. 440-445, 2006 (Pubitemid 44705323)
-
(2006)
Proceedings of the National Conference on Artificial Intelligence
, vol.1
, pp. 440-445
-
-
Michelson, M.1
Knoblock, C.A.2
-
27
-
-
85009048916
-
Semantic interoperability in global information systems: A brief introduction to the research area and the special section
-
A. Ouksel and A. Sheth, "Semantic Interoperability in Global Information Systems: A Brief Introduction to the Research Area and the Special Section," SIGMOD Record, vol. 28, no. 1, pp. 5-12, 1999
-
(1999)
SIGMOD Record
, vol.28
, Issue.1
, pp. 5-12
-
-
Ouksel, A.1
Sheth, A.2
-
28
-
-
79960545856
-
Detecting and exploiting stability in evolving heterogeneous information spaces
-
G. Papadakis, G. Giannakopoulos, C. Niederee, T. Palpanas, and W. Nejdl, "Detecting and Exploiting Stability in Evolving Heterogeneous Information Spaces," Proc. ACM/IEEE 11th Ann. Intl Joint Conf. Digital Libraries (JCDL), pp. 95-104, 2011
-
(2011)
Proc. ACM/IEEE 11th Ann. Intl Joint Conf. Digital Libraries (JCDL)
, pp. 95-104
-
-
Papadakis, G.1
Giannakopoulos, G.2
Niederee, C.3
Palpanas, T.4
Nejdl, W.5
-
29
-
-
79952386495
-
Efficient entity resolution for large heterogeneous information spaces
-
G. Papadakis, E. Ioannou, C. Niederee, and P. Fankhauser, "Efficient Entity Resolution for Large Heterogeneous Information Spaces," Proc. Fourth ACM Intl Conf. Web Search and Data Mining (WSDM), pp. 535-544, 2011
-
(2011)
Proc. Fourth ACM Intl Conf. Web Search and Data Mining (WSDM)
, pp. 535-544
-
-
Papadakis, G.1
Ioannou, E.2
Niederee, C.3
Fankhauser, P.4
-
30
-
-
79960519872
-
Eliminating the redundancy in blocking-based entity resolution methods
-
G. Papadakis, E. Ioannou, C. Niederee, T. Palpanas, and W. Nejdl, "Eliminating the Redundancy in Blocking-Based Entity Resolution Methods," Proc. 11th Ann. ACM/IEEE Intl Joint Conf. Digital Libraries (JCDL), pp. 85-94, 2011
-
(2011)
Proc. 11th Ann. ACM/IEEE Intl Joint Conf. Digital Libraries (JCDL)
, pp. 85-94
-
-
Papadakis, G.1
Ioannou, E.2
Niederee, C.3
Palpanas, T.4
Nejdl, W.5
-
31
-
-
85143179033
-
To compare or not to compare: Making entity resolution more efficient
-
G. Papadakis, E. Ioannou, C. Niederee, T. Palpanas, and W. Nejdl, "To Compare or Not to Compare: Making Entity Resolution More Efficient," Proc. Intl Workshop Semantic Web Information Management (SWIM), 2011
-
(2011)
Proc. Intl Workshop Semantic Web Information Management (SWIM)
-
-
Papadakis, G.1
Ioannou, E.2
Niederee, C.3
Palpanas, T.4
Nejdl, W.5
-
32
-
-
84858041897
-
Beyond 100 million entities: Large-Scale blocking-based resolution for heterogeneous data
-
G. Papadakis, E. Ioannou, C. Niederee, T. Palpanas, and W. Nejdl, "Beyond 100 Million Entities: Large-Scale Blocking-Based Resolution for Heterogeneous Data," Proc. Fifth ACM Intl Conf. Web Search and Data Mining (WSDM), pp. 53-62, 2012
-
(2012)
Proc. Fifth ACM Intl Conf. Web Search and Data Mining (WSDM)
, pp. 53-62
-
-
Papadakis, G.1
Ioannou, E.2
Niederee, C.3
Palpanas, T.4
Nejdl, W.5
-
33
-
-
70849098813
-
Entity resolution with iterative blocking
-
S. Whang, D. Menestrina, G. Koutrika, M. Theobald, and H. Garcia-Molina, "Entity Resolution with Iterative Blocking," Proc. ACM SIGMOD Intl Conf. Management of Data (SIGMOD), pp. 219-232, 2009
-
(2009)
Proc ACM SIGMOD Intl Conf. Management of Data (SIGMOD)
, pp. 219-232
-
-
Whang, S.1
Menestrina, D.2
Koutrika, G.3
Theobald, M.4
Garcia-Molina, H.5
|