-
3
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
Baxter, R., Christen, P, and Churches, T.: A comparison of fast blocking methods for record linkage. In Proc. of ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation, 2003.
-
(2003)
Proc. of ACM SIGKDD Workshop on Data Cleaning, Record Linkage, and Object Consolidation
-
-
Baxter, R.1
Christen, P.2
Churches, T.3
-
4
-
-
35348814713
-
Generic entity resolution in the SERF project
-
Benjelloun, O, Gracia-Molina, H., Kawai, H., Larson, T. E., Minestrina, D., Su, Q., Thavisomboon, S., and Widom, J.: Generic Entity Resolution in the SERF project. IEEE Data Engineering Bulletin, Vol. 29, Number 2, 2006.
-
(2006)
IEEE Data Engineering Bulletin
, vol.29
, Issue.2
-
-
Benjelloun, O.1
Gracia-Molina, H.2
Kawai, H.3
Larson, T.E.4
Minestrina, D.5
Su, Q.6
Thavisomboon, S.7
Widom, J.8
-
5
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
Bilenko, M. and Mooney, R. J.: Adaptive duplicate detection using learnable string similarity measures. In Proc. of ACM SIGKDD, 2003.
-
(2003)
Proc. of ACM SIGKDD
-
-
Bilenko, M.1
Mooney, R.J.2
-
7
-
-
33749597967
-
A primitive operator for similarity joins in data cleaning
-
Chaudhuri, D., Ganti, V., and Kaushik, R.: A Primitive Operator for Similarity Joins in Data Cleaning. In Proc. of. ICDE, 2006.
-
(2006)
Proc. Of. ICDE
-
-
Chaudhuri, D.1
Ganti, V.2
Kaushik, R.3
-
8
-
-
85011029434
-
Example-driven design of efficient record machting queries
-
Chaudhuri, S., Chen, B.-C., Ganti, V., and Kaushik, R.: Example-driven design of efficient record machting queries. In Proc. of VLDB, 2007.
-
(2007)
Proc. of VLDB
-
-
Chaudhuri, S.1
Chen, B.-C.2
Ganti, V.3
Kaushik, R.4
-
9
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
Cohen, W. W. and Richman, J.: Learning to match and cluster large high-dimensional data sets for data integration. In Proc. of ACM SIGKDD, 2002.
-
(2002)
Proc. of ACM SIGKDD
-
-
Cohen, W.W.1
Richman, J.2
-
13
-
-
33845667955
-
Duplicate record detection: A survey
-
Elmagarmid, A.K., Ipeirotis, P.G., and Verykios, V.S.: Duplicate Record Detection: A Survey. IEEE Transactions on Knowledge and Data Engineering 19(1), 2007.
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
14
-
-
85012212427
-
AJAX: An extensible data cleaning tool
-
Galhardas, H., Florescu, D., Shash, D., and Simon, E.: AJAX: An Extensible Data Cleaning Tool. In Proc. of ACM SIGMOD, 2000.
-
(2000)
Proc. of ACM SIGMOD
-
-
Galhardas, H.1
Florescu, D.2
Shash, D.3
Simon, E.4
-
16
-
-
79953162324
-
Merging the results of approximate match operations
-
Guha, S., Koudas, N., Marathe, A., and Srivastava, D.: Merging the results of approximate match operations. In Proc. of VLDB, 2004.
-
(2004)
Proc. of VLDB
-
-
Guha, S.1
Koudas, N.2
Marathe, A.3
Srivastava, D.4
-
19
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
McCallum, A., Nigam, K., and Ungar, L. H.: Efficient Clustering of High-Dimensional Data Sets with Application to Reference Matching. In Proc. of ACM SIGKDD, 2000.
-
(2000)
Proc. of ACM SIGKDD
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
-
20
-
-
33749558210
-
Yale: Rapid prototyping for complex data mining tasks
-
Mierswa, I., Wurst, M., Klinkenberg, R., Scholz, M, and Euler, T.: Yale: Rapid Prototyping for Complex Data Mining Tasks. In Proc. of ACM SIGKDD, 2006.
-
(2006)
Proc. of ACM SIGKDD
-
-
Mierswa, I.1
Wurst, M.2
Klinkenberg, R.3
Scholz, M.4
Euler, T.5
-
21
-
-
33750728576
-
A heterogeneous field matching method for record linkage
-
Minton, S. N, Nanjo, C., Knobloch, C. A., Michalowski, M., and Michelson, M.: A Heterogeneous Field Matching Method for Record Linkage. In Proc. IEEE International Conference on Data Mining, 2005.
-
(2005)
Proc. IEEE International Conference on Data Mining
-
-
Minton, S.N.1
Nanjo, C.2
Knobloch, C.A.3
Michalowski, M.4
Michelson, M.5
-
22
-
-
0037884935
-
Methods for linking and mining massive heterogeneous datasets
-
Pinheiro, J. C., and Sun, D. X.: Methods for linking and mining massive heterogeneous datasets. In Proc. of ACM SIGKDD, 1998.
-
(1998)
Proc. of ACM SIGKDD
-
-
Pinheiro, J.C.1
Sun, D.X.2
-
25
-
-
34548769261
-
Source-aware entity matching: A compositional approach
-
Shen, W., DeRose, R., Vu, L., Doan, A., and Ramakrishnan, R.: Source-aware Entity Matching: A Compositional Approach. In Proc. of ICDE, 2007.
-
(2007)
Proc. of ICDE
-
-
Shen, W.1
DeRose, R.2
Vu, L.3
Doan, A.4
Ramakrishnan, R.5
-
26
-
-
0035545848
-
Learning object identification rules for information integration
-
Tejada, S., Knoblock, C. A., and Minton, S.: Learning object identification rules for information integration. Information Systems Journal, 26(8), 2001.
-
(2001)
Information Systems Journal
, vol.26
, Issue.8
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
-
27
-
-
40949147166
-
MOMA - A mapping-based object matching system
-
Thor, A., and Rahm, E.: MOMA - A Mapping-based Object Matching System. In Proc. of CIDR, 2007.
-
(2007)
Proc. of CIDR
-
-
Thor, A.1
Rahm, E.2
-
28
-
-
0038208065
-
A Bayesian decision model for cost optimal record matching
-
Verykios, V. S., Moustakides, G.V., and Elfeky, M. G.: A Bayesian decision model for cost optimal record matching. The VLDB Journal, 12(1), 2003.
-
(2003)
The VLDB Journal
, vol.12
, Issue.1
-
-
Verykios, V.S.1
Moustakides, G.V.2
Elfeky, M.G.3
|