-
1
-
-
84976856849
-
The merge/purge problem for large databases
-
Hernandez, M., Stolfo, S.: The merge/purge problem for large databases. In: SIGMOD (1995)
-
(1995)
SIGMOD
-
-
Hernandez, M.1
Stolfo, S.2
-
2
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
Hernandez, M., Stolfo, S.: Real-world data is dirty: data cleansing and the merge/purge problem for large databases. Data mining and knowledge discovery 2(1), 9-37 (1998) (Pubitemid 128696797)
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, Issue.1
, pp. 9-37
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
3
-
-
0242456811
-
Interactive deduplication using active learning
-
Sarawagi, S., Bhamidipaty, A.: Interactive deduplication using active learning. In: SIGKDD (2002)
-
(2002)
SIGKDD
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
4
-
-
0024863169
-
The inter-database instance identification problem in integrating autonomous systems
-
Wang, Y., Madnick, S.: The inter-database instance identification problem in integrating autonomous systems. In: ICDE (1989)
-
(1989)
ICDE
-
-
Wang, Y.1
Madnick, S.2
-
5
-
-
58149472338
-
Swoosh: A generic approach to entity resolution
-
Benjelloun, O., Garcia-Molina, H., Menestrina, D., Su, Q., Whang, S.E., Widom, J.: Swoosh: A generic approach to entity resolution. The VLDB Journal (2008)
-
(2008)
The VLDB Journal
-
-
Benjelloun, O.1
Garcia-Molina, H.2
Menestrina, D.3
Su, Q.4
Whang, S.E.5
Widom, J.6
-
6
-
-
0014087577
-
Record linking: The design of efficient systems for linking records into individual and family histories
-
Newcombe, H.: Record linking: The design of efficient systems for linking records into individual and family histories. Am. J. Human Genetics 19(3), 335-359 (1967)
-
(1967)
Am. J. Human Genetics
, vol.19
, Issue.3
, pp. 335-359
-
-
Newcombe, H.1
-
7
-
-
0009018963
-
A model for optimum linkage of records
-
Tepping, B.: A model for optimum linkage of records. J. Am. Statistical Assoc. 63(324), 1321-1332 (1968)
-
(1968)
J. Am. Statistical Assoc.
, vol.63
, Issue.324
, pp. 1321-1332
-
-
Tepping, B.1
-
10
-
-
78650446929
-
An efficient domain independent algorithm for detecting approximately duplicate database records
-
Monge, A., Elkan, C.: An efficient domain independent algorithm for detecting approximately duplicate database records. In: SIGKDD (1997)
-
(1997)
SIGKDD
-
-
Monge, A.1
Elkan, C.2
-
11
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
Bilenko, M., Mooney, R.: Adaptive duplicate detection using learnable string similarity measures. In: SIGKDD (2003)
-
(2003)
SIGKDD
-
-
Bilenko, M.1
Mooney, R.2
-
12
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
Cohen, W., Richman, J.: Learning to match and cluster large high-dimensional data sets for data integration. In: SIGKDD (2002)
-
(2002)
SIGKDD
-
-
Cohen, W.1
Richman, J.2
-
13
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
Chaudhuri, S., Ganti, V., Motwani, R.: Robust identification of fuzzy duplicates. In: ICDE (2005)
-
(2005)
ICDE
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
14
-
-
33845667955
-
Duplicate record detection: A survey
-
Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate record detection: A survey. TKDE 19(1), 1-16 (2007)
-
(2007)
TKDE
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
16
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
Tejada, S., Knoblock, C., Minton, S.: Learning domain-independent string transformation weights for high accuracy object identification. In: SIGKDD (2002)
-
(2002)
SIGKDD
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
18
-
-
35448937301
-
Leveraging aggregate constraints for deduplication
-
Chaudhuri, S., Sarma, A.D., Ganti, V., Kaushik, R.: Leveraging aggregate constraints for deduplication. In: SIGMOD (2007)
-
(2007)
SIGMOD
-
-
Chaudhuri, S.1
Sarma, A.D.2
Ganti, V.3
Kaushik, R.4