-
3
-
-
67649649597
-
Large-scale deduplication with constraints using Dedupalog
-
A. Arasu, C. Re, and D. Suciu. Large-scale deduplication with constraints using Dedupalog. In ICDE, 2009.
-
(2009)
ICDE
-
-
Arasu, A.1
Re, C.2
Suciu, D.3
-
4
-
-
79960009314
-
Consistent query answers in inconsistent databases
-
M. Arenas, L. E. Bertossi, and J. Chomicki. Consistent query answers in inconsistent databases. TPLP, 3(4-5), 2003.
-
(2003)
TPLP
, vol.3
, Issue.4-5
-
-
Arenas, M.1
Bertossi, L.E.2
Chomicki, J.3
-
6
-
-
29844436973
-
A cost-based model and effective heuristic for repairing constraints by value modification
-
P. Bohannon, W. Fan, M. Flaster, and R. Rastogi. A cost-based model and effective heuristic for repairing constraints by value modification. In SIGMOD, 2005.
-
(2005)
SIGMOD
-
-
Bohannon, P.1
Fan, W.2
Flaster, M.3
Rastogi, R.4
-
8
-
-
74549188261
-
Discovering data quality rules
-
F. Chiang and R. Miller. Discovering data quality rules. In VLDB, 2008.
-
(2008)
VLDB
-
-
Chiang, F.1
Miller, R.2
-
9
-
-
84959912087
-
Improving data quality: Consistency and accuracy
-
G. Cong, W. Fan, F. Geerts, X. Jia, and S. Ma. Improving data quality: Consistency and accuracy. In VLDB, 2007.
-
(2007)
VLDB
-
-
Cong, G.1
Fan, W.2
Geerts, F.3
Jia, X.4
Ma, S.5
-
14
-
-
84859258624
-
Global detection of complex copying relationships between sources
-
X. L. Dong, L. Berti-Equille, Y. Hu, and D. Srivastava. Global detection of complex copying relationships between sources. In VLDB, 2010.
-
(2010)
VLDB
-
-
Dong, X.L.1
Berti-Equille, L.2
Hu, Y.3
Srivastava, D.4
-
16
-
-
33845667955
-
Duplicate record detection: A survey
-
DOI 10.1109/TKDE.2007.250581
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. TKDE, 19(1):1-16, 2007. (Pubitemid 44955773)
-
(2007)
IEEE Transactions on Knowledge and Data Engineering
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
17
-
-
46649106686
-
Conditional functional dependencies for capturing data inconsistencies
-
W. Fan, F. Geerts, X. Jia, and A. Kementsietsidis. Conditional functional dependencies for capturing data inconsistencies. TODS, 33(1), 2008.
-
(2008)
TODS
, vol.33
, Issue.1
-
-
Fan, W.1
Geerts, F.2
Jia, X.3
Kementsietsidis, A.4
-
18
-
-
84865086832
-
Reasoning about record matching rules
-
W. Fan, X. Jia, J. Li, and S. Ma. Reasoning about record matching rules. In VLDB, 2009.
-
(2009)
VLDB
-
-
Fan, W.1
Jia, X.2
Li, J.3
Ma, S.4
-
19
-
-
84858615261
-
Towards certain fixes with editing rules and master data
-
W. Fan, J. Li, S. Ma, N. Tang, and W. Yu. Towards certain fixes with editing rules and master data. In VLDB, 2010.
-
(2010)
VLDB
-
-
Fan, W.1
Li, J.2
Ma, S.3
Tang, N.4
Yu, W.5
-
20
-
-
1542305821
-
A systematic approach to automatic edit and imputation
-
I. Fellegi and D. Holt. A systematic approach to automatic edit and imputation. J. American Statistical Association, 71(353):17-35, 1976.
-
(1976)
J. American Statistical Association
, vol.71
, Issue.353
, pp. 17-35
-
-
Fellegi, I.1
Holt, D.2
-
22
-
-
79960023714
-
Record linkage with uniqueness constraints and erroneous values
-
S. Guo, X. Dong, D. Srivastava, and R. Zajac. Record linkage with uniqueness constraints and erroneous values. PVLDB, 3(1), 2010.
-
(2010)
PVLDB
, vol.3
, Issue.1
-
-
Guo, S.1
Dong, X.2
Srivastava, D.3
Zajac, R.4
-
23
-
-
84943817322
-
Error detecting and error correcting codes
-
R. W. Hamming. Error detecting and error correcting codes. Bell System Technical Journal, 29(2):147-160, 1950.
-
(1950)
Bell System Technical Journal
, vol.29
, Issue.2
, pp. 147-160
-
-
Hamming, R.W.1
-
24
-
-
0013331361
-
Real-world data is dirty: Data cleansing and the merge/purge problem
-
M. A. Hernandez and S. Stolfo. Real-World Data is Dirty: Data Cleansing and the Merge/Purge Problem. Data Mining and Knowledge Discovery, 2(1):9-37, 1998. (Pubitemid 128696797)
-
(1998)
Data Mining and Knowledge Discovery
, vol.2
, Issue.1
, pp. 9-37
-
-
Hernandez, M.A.1
Stolfo, S.J.2
-
28
-
-
77954714416
-
ERACER: A database approach for statistical inference and data cleaning
-
C. Mayfield, J. Neville, and S. Prabhakar. ERACER: a database approach for statistical inference and data cleaning. In SIGMOD, 2010.
-
(2010)
SIGMOD
-
-
Mayfield, C.1
Neville, J.2
Prabhakar, S.3
-
29
-
-
35448996363
-
Data fusion in three steps: Resolving schema, tuple, and value inconsistencies
-
F. Naumann, A. Bilke, J. Bleiholder, and M. Weis. Data fusion in three steps: Resolving schema, tuple, and value inconsistencies. IEEE Data Eng. Bull., 29(2), 2006.
-
(2006)
IEEE Data Eng. Bull.
, vol.29
, Issue.2
-
-
Naumann, F.1
Bilke, A.2
Bleiholder, J.3
Weis, M.4
-
31
-
-
0031988304
-
The impact of poor data quality on the typical enterprise
-
T. Redman. The impact of poor data quality on the typical enterprise. Commun. ACM, 2:79-82, 1998.
-
(1998)
Commun. ACM
, vol.2
, pp. 79-82
-
-
Redman, T.1
-
32
-
-
74549201636
-
Discovering matching dependencies
-
S. Song and L. Chen. Discovering matching dependencies. In CIKM, 2009.
-
(2009)
CIKM
-
-
Song, S.1
Chen, L.2
-
35
-
-
29844441371
-
Dogmatix tracks down duplicates in XML
-
M. Weis and F. Naumann. Dogmatix tracks down duplicates in XML. In SIGMOD, 2005.
-
(2005)
SIGMOD
-
-
Weis, M.1
Naumann, F.2
-
37
-
-
33745206041
-
Database repairing using updates
-
DOI 10.1145/1093382.1093385
-
J. Wijsen. Database repairing using updates. TODS, 30(3):722-768, 2005. (Pubitemid 43906382)
-
(2005)
ACM Transactions on Database Systems
, vol.30
, Issue.3
, pp. 722-768
-
-
Wijsen, J.1
-
39
-
-
0018019231
-
Compression of individual sequences via variable-rate coding
-
J. Ziv and A. Lempel. Compression of individual sequences via variable-rate coding. IEEE TIT, 24(5), 1978.
-
(1978)
IEEE TIT
, vol.24
, Issue.5
-
-
Ziv, J.1
Lempel, A.2
|