-
3
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
Hernandez, M.A., Stolfo, S.J.: Real-world data is dirty: Data cleansing and the merge/purge problem. Data Mining and Knowledge Discovery 2 (1998) 9-37
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, pp. 9-37
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
4
-
-
0033891155
-
An extensible framework for data cleaning
-
Galhardas, H., Florescu, D., Shasha, D., Simon, E.: An extensible framework for data cleaning. In: Proceddings of the 16th International Conference on Data Engineering (ICDE '00). (2000) 312
-
(2000)
Proceddings of the 16th International Conference on Data Engineering (ICDE '00)
, pp. 312
-
-
Galhardas, H.1
Florescu, D.2
Shasha, D.3
Simon, E.4
-
5
-
-
38549128513
-
-
Monge, A., Elkan, C.: An efficient domain independent algorithm for detecting approximately duplicate database records. In: In Proceedings of the SIGMOD Workshop on Data Mining and Knowledge Discovery. (1997)
-
Monge, A., Elkan, C.: An efficient domain independent algorithm for detecting approximately duplicate database records. In: In Proceedings of the SIGMOD Workshop on Data Mining and Knowledge Discovery. (1997)
-
-
-
-
6
-
-
0001592068
-
Automatic linkage of vital records
-
Newcombe, H.B., Kennedy, J.M., Axford, S.J., James, A.P.: Automatic linkage of vital records. Science 130 (1959) 954-959
-
(1959)
Science
, vol.130
, pp. 954-959
-
-
Newcombe, H.B.1
Kennedy, J.M.2
Axford, S.J.3
James, A.P.4
-
7
-
-
0002940254
-
Using the em algorithm for weight computation in the fellegisunter model of record linkage
-
American Statistical Association
-
Winkler, W.E.: Using the em algorithm for weight computation in the fellegisunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, American Statistical Association. (1988) 667-671
-
(1988)
Proceedings of the Section on Survey Research Methods
, pp. 667-671
-
-
Winkler, W.E.1
-
8
-
-
0002629270
-
Maximum likelihood from incomplete data via the em algorithm
-
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the em algorithm. Journal of the Royal Statistical Society, Series B 34 (1977) 1-38
-
(1977)
Journal of the Royal Statistical Society, Series B
, vol.34
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
9
-
-
2942702984
-
Improved decision rules in the fellegi-sunter model of record linkage
-
American Statistical Association
-
Winkler, W.E.: Improved decision rules in the fellegi-sunter model of record linkage. In: Proceedings of the Section on Survey Research Methods, American Statistical Association. (1993) 274-279
-
(1993)
Proceedings of the Section on Survey Research Methods
, pp. 274-279
-
-
Winkler, W.E.1
-
12
-
-
0036203458
-
Tailor: A record linkage tool-box
-
Washington, DC, USA, IEEE Computer Society
-
Elfeky, M.G., Verykios, V.S., Elmargarid, A.K.: Tailor: A record linkage tool-box. In: Proceedings of the 18th International Conference on Data Engineering (ICDE'02), Washington, DC, USA, IEEE Computer Society (2002) 17
-
(2002)
Proceedings of the 18th International Conference on Data Engineering (ICDE'02)
, pp. 17
-
-
Elfeky, M.G.1
Verykios, V.S.2
Elmargarid, A.K.3
-
14
-
-
2342566765
-
Learning to combine trained distance metrics for duplicate detection in databases
-
02-296, Artificial Intelligence Laboratory, University of Texas at Austin, Austin, TX
-
Bilenko, M., Mooney, R.J.: Learning to combine trained distance metrics for duplicate detection in databases. Technical Report AI 02-296, Artificial Intelligence Laboratory, University of Texas at Austin, Austin, TX (2002)
-
(2002)
Technical Report AI
-
-
Bilenko, M.1
Mooney, R.J.2
-
15
-
-
0035545848
-
Learning object identification rules for information integration
-
Tejada, S., Knoblock, C.A., Minton, S.: Learning object identification rules for information integration. Information Systems Journal 26 (2001) 635-656
-
(2001)
Information Systems Journal
, vol.26
, pp. 635-656
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
-
19
-
-
29844452555
-
Reference reconciliation in complex information spaces
-
Dong, X., Halevy, A.Y., Madhavan, J.: Reference reconciliation in complex information spaces. In: SIGMOD Conference. (2005) 85-96
-
(2005)
SIGMOD Conference
, pp. 85-96
-
-
Dong, X.1
Halevy, A.Y.2
Madhavan, J.3
-
20
-
-
70449590442
-
Deduplication and group detection using links
-
Workshop on Link Analysis and Group Detection, 2004
-
Bhattacharya, I., Getoor, L.: Deduplication and group detection using links. In: Proceedings of the KDD-2004 Workshop on Link Analysis and Group Detection. (2004)
-
Proceedings of the KDD-2004
-
-
Bhattacharya, I.1
Getoor, L.2
-
21
-
-
84898987614
-
Identity uncertainty and citation matching
-
MIT Press
-
Pasula, H., Marthi, B., Milch, B., Russell, S., Shpitser, I.: Identity uncertainty and citation matching. In: Advances in Neural Information Processing Systems 15, MIT Press (2003)
-
(2003)
Advances in Neural Information Processing Systems
, vol.15
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
Russell, S.4
Shpitser, I.5
-
24
-
-
33646696837
-
A comparison of string metrics for matching names and records
-
Washington, DC
-
Cohen, W.W., Ravikumar, P., Fienberg, S.E.: A comparison of string metrics for matching names and records. In: Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation, Washington, DC (2003) 13-18
-
(2003)
Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation
, pp. 13-18
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
26
-
-
84936824188
-
Word association norms, mutual information and lexicography
-
Church, K.W., Hanks, P.: Word association norms, mutual information and lexicography. Computational Linguistics 16 (1990) 22-29
-
(1990)
Computational Linguistics
, vol.16
, pp. 22-29
-
-
Church, K.W.1
Hanks, P.2
-
27
-
-
0001770868
-
Using statistics in lexical analysis
-
Church, K.W., Gale, W., Hanks, P., Hindle, D.: Using statistics in lexical analysis. Lexical Acquisition: Using On-line Recources to Build a Lexicon (1991) 115-164
-
(1991)
Lexical Acquisition: Using On-line Recources to Build a Lexicon
, pp. 115-164
-
-
Church, K.W.1
Gale, W.2
Hanks, P.3
Hindle, D.4
-
35
-
-
0001116877
-
Binary codes capable of correcting insertions and reversals
-
Levenshtein, V.I.: Binary codes capable of correcting insertions and reversals. Soviet Physics Doklady 10 (1966) 707-710
-
(1966)
Soviet Physics Doklady
, vol.10
, pp. 707-710
-
-
Levenshtein, V.I.1
|