-
1
-
-
34848900466
-
D-swoosh: A family of algorithms for generic, distributed entity resolution
-
Benjelloun, O., Garcia-Molina, H., Kawai, H., Larson, T.E., Menestrina, D., Thavisomboon, S.: D-swoosh: A family of algorithms for generic, distributed entity resolution. In: ICDCS (2007)
-
(2007)
ICDCS
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Kawai, H.3
Larson, T.E.4
Menestrina, D.5
Thavisomboon, S.6
-
2
-
-
58149472338
-
Swoosh: A generic approach to entity resolution
-
doi: 10.1007/s00778-008-0098-x
-
Benjelloun, O., Garcia-Molina, H., Menestrina, D., Whang, S.E., Su, Q., Widom, J.: Swoosh: a generic approach to entity resolution. VLDB J. (2008). doi: 10.1007/s00778-008-0098-x
-
(2008)
VLDB J.
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Menestrina, D.3
Whang, S.E.4
Su, Q.5
Widom, J.6
-
4
-
-
33745329531
-
A cost-based model and effective heuristic for repairing constraints by value modification
-
Bohannon, P., Flaster, M., Fan, W., Rastogi, R.: A cost-based model and effective heuristic for repairing constraints by value modification. In: SIGMOD Conference, pp. 143-154 (2005)
-
(2005)
SIGMOD Conference
, pp. 143-154
-
-
Bohannon, P.1
Flaster, M.2
Fan, W.3
Rastogi, R.4
-
5
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
Tokyo, Japan
-
Chaudhuri, S., Ganti, V., Motwani, R.: Robust identification of fuzzy duplicates. In: Proceedings of ICDE. Tokyo, Japan (2005)
-
(2005)
Proceedings of ICDE
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
6
-
-
35448937301
-
Leveraging aggregate constraints for deduplication
-
DOI 10.1145/1247480.1247530, SIGMOD 2007: Proceedings of the ACM SIGMOD International Conference on Management of Data
-
Chaudhuri, S., Sarma, A.D., Ganti, V., Kaushik, R.: Leveraging aggregate constraints for deduplication. In: SIGMOD'07: Proceedings of the 2007 ACM SIGMOD international conference on management of data, pp. 437-448. ACM Press, New York (2007). doi: 10.1145/1247480.1247530 (Pubitemid 47630827)
-
(2007)
Proceedings of the ACM SIGMOD International Conference on Management of Data
, pp. 437-447
-
-
Chaudhuri, S.1
Das Sarma, A.2
Ganti, V.3
Kaushik, R.4
-
7
-
-
36849055542
-
On the computational complexity of minimal-change integrity maintenance in relational databases
-
Chomicki, J., Marcinkowski, J.: On the computational complexity of minimal-change integrity maintenance in relational databases. In: Inconsistency Tolerance, pp. 119-150 (2005)
-
(2005)
Inconsistency Tolerance
, pp. 119-150
-
-
Chomicki, J.1
Marcinkowski, J.2
-
8
-
-
18744368587
-
Object matching for information integration: A profiler-based approach
-
Doan, A., Lu, Y., Lee, Y., Han, J.: Object matching for information integration: A profiler-based approach. In: IIWeb, pp. 53-58 (2003)
-
(2003)
IIWeb
, pp. 53-58
-
-
Doan, A.1
Lu, Y.2
Lee, Y.3
Han, J.4
-
9
-
-
2342615638
-
Profile-based object matching for information integration
-
10.1109/MIS.2003.1234770
-
A. Doan Y. Lu Y. Lee J. Han 2003 Profile-based object matching for information integration IEEE Intell. Syst. 18 5 54 59 10.1109/MIS.2003.1234770
-
(2003)
IEEE Intell. Syst.
, vol.18
, Issue.5
, pp. 54-59
-
-
Doan, A.1
Lu, Y.2
Lee, Y.3
Han, J.4
-
10
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
Dong, X., Halevy, A.Y., Madhavan, J.: Reference reconciliation in complex information spaces. In: SIGMOD Conference, pp. 85-96 (2005)
-
(2005)
SIGMOD Conference
, pp. 85-96
-
-
Dong, X.1
Halevy, A.Y.2
Madhavan, J.3
-
12
-
-
71349085327
-
Functional specifications of subsystem for database integrity
-
Eswaran, K.P., Chamberlin, D.D.: Functional specifications of subsystem for database integrity. In: VLDB, pp. 48-68 (1975)
-
(1975)
VLDB
, pp. 48-68
-
-
Eswaran, K.P.1
Chamberlin, D.D.2
-
13
-
-
1542305821
-
A systematic approach to automatic edit and imputation
-
Fellegi, I.P., Holt, D.: A systematic approach to automatic edit and imputation. J Am. Stat. Assoc. 71(353), 17-35 (1976). http://www.jstor.org/ stable/2285726
-
(1976)
J Am. Stat. Assoc.
, vol.71
, Issue.353
, pp. 17-35
-
-
Fellegi, I.P.1
Holt, D.2
-
14
-
-
84877086968
-
Census data repair: A challenging application of disjunctive logic programming
-
Springer, London
-
Franconi, E., Palma, A.L., Leone, N., Perri, S., Scarcello, F.: Census data repair: A challenging application of disjunctive logic programming. In: LPAR'01: Proceedings of the Artificial Intelligence on Logic for Programming, pp. 561-578. Springer, London (2001)
-
(2001)
LPAR'01: Proceedings of the Artificial Intelligence on Logic for Programming
, pp. 561-578
-
-
Franconi, E.1
Palma, A.L.2
Leone, N.3
Perri, S.4
Scarcello, F.5
-
17
-
-
33845350152
-
Record linkage: Current practice and future directions
-
Gu, L., Baxter, R., Vickers, D., Rainsford, C.: Record linkage: Current practice and future directions. Tech. Rep. 03/83, CSIRO Mathematical and Information Sciences (2003)
-
(2003)
Tech. Rep. 03/83, CSIRO Mathematical and Information Sciences
-
-
Gu, L.1
Baxter, R.2
Vickers, D.3
Rainsford, C.4
-
18
-
-
71349088695
-
Semantic integrity in a relational data base system
-
Hammer, M., McLeod, D.: Semantic integrity in a relational data base system. In: VLDB, pp. 25-47 (1975)
-
(1975)
VLDB
, pp. 25-47
-
-
Hammer, M.1
McLeod, D.2
-
19
-
-
84976856849
-
The merge/purge problem for large databases
-
Hernández, M.A., Stolfo, S.J.: The merge/purge problem for large databases. In: Proceedings of ACM SIGMOD, pp. 127-138 (1995)
-
(1995)
Proceedings of ACM SIGMOD
, pp. 127-138
-
-
Hernández, M.A.1
-
20
-
-
84950419860
-
Advances in record-linkage methodology as applied to matching the 1985 census of tampa, florida
-
10.2307/2289924
-
M.A. Jaro 1989 Advances in record-linkage methodology as applied to matching the 1985 census of tampa, florida J. Am. Stat. Assoc. 84 406 414 420 10.2307/2289924
-
(1989)
J. Am. Stat. Assoc.
, vol.84
, Issue.406
, pp. 414-420
-
-
Jaro, M.A.1
-
21
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston, MA
-
McCallum, A.K., Nigam, K., Ungar, L.: Efficient clustering of high-dimensional data sets with application to reference matching. In: Proceedings of KDD, pp. 169-178. Boston, MA (2000)
-
(2000)
Proceedings of KDD
, pp. 169-178
-
-
McCallum, A.K.1
Nigam, K.2
Ungar, L.3
-
22
-
-
34547960421
-
Generic entity resolution with data confidences
-
Seoul, Korea
-
Menestrina, D., Benjelloun, O., Garcia-Molina, H.: Generic entity resolution with data confidences. In: First International VLDB Workshop on Clean Databases. Seoul, Korea (2006)
-
(2006)
First International VLDB Workshop on Clean Databases
-
-
Menestrina, D.1
Benjelloun, O.2
Garcia-Molina, H.3
-
24
-
-
0001592068
-
Automatic linkage of vital records
-
10.1126/science.130.3381.954
-
H.B. Newcombe J.M. Kennedy S.J. Axford A.P. James 1959 Automatic linkage of vital records Science 130 3381 954 959 10.1126/science.130.3381.954
-
(1959)
Science
, vol.130
, Issue.3381
, pp. 954-959
-
-
Newcombe, H.B.1
Kennedy, J.M.2
Axford, S.J.3
James, A.P.4
-
27
-
-
0242456811
-
Interactive deduplication using active learning
-
Edmonton, Alberta
-
Sarawagi, S., Bhamidipaty, A.: Interactive deduplication using active learning. In: Proceedings of ACM SIGKDD. Edmonton, Alberta (2002)
-
(2002)
Proceedings of ACM SIGKDD
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
28
-
-
29344435802
-
Constraint-based entity matching
-
Shen, W., Li, X., Doan, A.: Constraint-based entity matching. In: AAAI, pp. 862-867 (2005)
-
(2005)
AAAI
, pp. 862-867
-
-
Shen, W.1
Li, X.2
Doan, A.3
-
29
-
-
0035545935
-
Discovering and reconciling value conflicts for numerical data integration
-
DOI 10.1016/S0306-4379(01)00043-6, Data Extraction, Cleaning and Reconciliation
-
S. Tejada C.A. Knoblock S. Minton 2001 Learning object identification rules for information integration Inf. Syst. J. 26 8 635 656 10.1016/S0306-4379(01)00043-6 (Pubitemid 33046274)
-
(2001)
Information Systems
, vol.26
, Issue.8
, pp. 635-656
-
-
Fan, W.1
Lu, H.2
Madnick, S.E.3
Cheung, D.4
-
30
-
-
84864155886
-
Additional experiments on negative rules
-
Stanford University
-
Whang, S.E., Benjelloun, O., Garcia-Molina, H.: Additional experiments on negative rules. Tech. rep., Stanford University. http://dbpubs.stanford.edu/ pub/2005-5
-
Tech. Rep.
-
-
Whang, S.E.1
Benjelloun, O.2
Garcia-Molina, H.3
-
31
-
-
70849106138
-
Entity resolution with iterative blocking
-
Stanford University
-
Whang, S.E., Menestrina, D., Koutrika, G., Theobald, M., Garcia-Molina, H.: Entity resolution with iterative blocking. Tech. rep., Stanford University (2008). http://dbpubs.stanford.edu/pub/2008-19
-
(2008)
Tech. Rep.
-
-
Whang, S.E.1
Menestrina, D.2
Koutrika, G.3
Theobald, M.4
Garcia-Molina, H.5
-
33
-
-
33845615644
-
Overview of record linkage and current research directions
-
US Bureau of the Census, Washington, DC
-
Winkler, W.: Overview of record linkage and current research directions. Tech. rep., Statistical Research Division, US Bureau of the Census, Washington, DC (2006)
-
(2006)
Tech. Rep., Statistical Research Division
-
-
Winkler, W.1
-
34
-
-
71349084662
-
State of statistical data editing and current research problems
-
Working Paper n.29
-
Winkler, W.E.: State of statistical data editing and current research problems. In: UN/ECE Work Session on Statistical Data Editing, Working Paper n.29, pp. 2-4 (1999)
-
(1999)
UN/ECE Work Session on Statistical Data Editing
, pp. 2-4
-
-
Winkler, W.E.1
|