-
1
-
-
85039692000
-
-
http://www.sas.com/industry/fsi/fraud/.
-
-
-
-
5
-
-
52649137537
-
Transformation-based framework for record matching
-
A. Arasu, S. Chaudhuri, and R. Kaushik. Transformation-based framework for record matching. In ICDE, 2008.
-
(2008)
ICDE
-
-
Arasu, A.1
Chaudhuri, S.2
Kaushik, R.3
-
6
-
-
67649649597
-
Large-scale deduplication with constraints using Dedupalog
-
A. Arasu, C. Re, and D. Suciu. Large-scale deduplication with constraints using Dedupalog. In ICDE, 2009.
-
(2009)
ICDE
-
-
Arasu, A.1
Re, C.2
Suciu, D.3
-
7
-
-
84865071472
-
Data Quality: Concepts, Methodologies and Techniques
-
C. Batini and M. Scannapieco. Data Quality: Concepts, Methodologies and Techniques. Springer, 2006.
-
(2006)
Springer
-
-
Batini, C.1
Scannapieco, M.2
-
8
-
-
0018442877
-
Computational problems related to the design of normal form relational schemas
-
C. Beeri and P. A. Bernstein. Computational problems related to the design of normal form relational schemas. TODS, 4(1):30-59, 1979.
-
(1979)
TODS
, vol.4
, Issue.1
, pp. 30-59
-
-
Beeri, C.1
Bernstein, P.A.2
-
9
-
-
84864170809
-
Data tables with similarity relations: Functional dependencies, complete rules and non-redundant bases
-
R. Belohlávek and V. Vychodil. Data tables with similarity relations: Functional dependencies, complete rules and non-redundant bases. In DASFAA, 2006.
-
(2006)
DASFAA
-
-
Belohlávek, R.1
Vychodil, V.2
-
10
-
-
85011029434
-
Example-driven design of efficient record matching queries
-
S. Chaudhuri, B.-C. Chen, V. Ganti, and R. Kaushik. Example-driven design of efficient record matching queries. In VLDB, 2007.
-
(2007)
VLDB
-
-
Chaudhuri, S.1
Chen, B.-C.2
Ganti, V.3
Kaushik, R.4
-
12
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
W. W. Cohen and J. Richman. Learning to match and cluster large high-dimensional data sets for data integration. In KDD, 2002.
-
(2002)
KDD
-
-
Cohen, W.W.1
Richman, J.2
-
13
-
-
24644507800
-
iMAP: Discovering complex mappings between database schemas
-
R. Dhamankar, Y. Lee, A. Doan, A. Y. Halevy, and P. Domingos. iMAP: Discovering complex mappings between database schemas. In SIGMOD, 2004.
-
(2004)
SIGMOD
-
-
Dhamankar, R.1
Lee, Y.2
Doan, A.3
Halevy, A.Y.4
Domingos, P.5
-
14
-
-
33845667955
-
Duplicate record detection: A survey
-
A. K. Elmagarmid, P. G. Ipeirotis, and V. S. Verykios. Duplicate record detection: A survey. TKDE, 19(1):1-16, 2007.
-
(2007)
TKDE
, vol.19
, Issue.1
, pp. 1-16
-
-
Elmagarmid, A.K.1
Ipeirotis, P.G.2
Verykios, V.S.3
-
15
-
-
57549084481
-
Dependencies revisited for improving data quality
-
W. Fan. Dependencies revisited for improving data quality. In PODS, 2008.
-
(2008)
PODS
-
-
Fan, W.1
-
16
-
-
1542305821
-
A systematic approach to automatic edit and imputation
-
I. Fellegi and D. Holt. A systematic approach to automatic edit and imputation. J. American Statistical Association, 71(353):17-35, 1976.
-
(1976)
J. American Statistical Association
, vol.71
, Issue.353
, pp. 17-35
-
-
Fellegi, I.1
Holt, D.2
-
18
-
-
0344756845
-
Declarative data cleaning: Language, model and algorithms
-
H. Galhardas, D. Florescu, D. Shasha, E. Simon, and C. Saita. Declarative data cleaning: Language, model and algorithms. In VLDB, 2001.
-
(2001)
VLDB
-
-
Galhardas, H.1
Florescu, D.2
Shasha, D.3
Simon, E.4
Saita, C.5
-
20
-
-
84976856849
-
The merge/purge problem for large databases
-
M. A. Hernndez and S. J. Stolfo. The merge/purge problem for large databases. In SIGMOD, 1995.
-
(1995)
SIGMOD
-
-
Hernndez, M.A.1
Stolfo, S.J.2
-
21
-
-
84950419860
-
Advances in record-linkage methodology as applied to matching the 1985 census of Tampa Florida
-
M. Jaro. Advances in record-linkage methodology as applied to matching the 1985 census of Tampa Florida. J. American Statistical Association, 89:414-420, 1989.
-
(1989)
J. American Statistical Association
, vol.89
, pp. 414-420
-
-
Jaro, M.1
-
23
-
-
0030083481
-
Entity identification in database integration
-
E.-P. Lim, J. Srivastava, S. Prabhakar, and J. Richardson. Entity identification in database integration. Inf. Sci., 89(1-2):1-38, 1996.
-
(1996)
Inf. Sci
, vol.89
, Issue.1-2
, pp. 1-38
-
-
Lim, E.-P.1
Srivastava, J.2
Prabhakar, S.3
Richardson, J.4
-
24
-
-
0001030067
-
Candidate keys for relations
-
C. L. Lucchesi and S. L. Osborn. Candidate keys for relations. JCSS, 17(2):270-279, 1978.
-
(1978)
JCSS
, vol.17
, Issue.2
, pp. 270-279
-
-
Lucchesi, C.L.1
Osborn, S.L.2
-
26
-
-
57549094363
-
Key issues for master data management
-
Technical report, Gartner
-
J. Radcliffe and A. White. Key issues for master data management. Technical report, Gartner, 2008.
-
(2008)
-
-
Radcliffe, J.1
White, A.2
-
27
-
-
0242456811
-
Interactive deduplication using active learning
-
S. Sarawagi and A. Bhamidipaty. Interactive deduplication using active learning. In KDD, 2002.
-
(2002)
KDD
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
28
-
-
34748852251
-
Constraint-based entity matching
-
W. Shen, X. Li, and A. Doan. Constraint-based entity matching. In AAAI, 2005.
-
(2005)
AAAI
-
-
Shen, W.1
Li, X.2
Doan, A.3
-
29
-
-
36349034741
-
Object identification with attributemediated dependences
-
P. Singla and P. Domingos. Object identification with attributemediated dependences. In PKDD, 2005.
-
(2005)
PKDD
-
-
Singla, P.1
Domingos, P.2
-
30
-
-
0034228352
-
Automating the approximate record-matching process
-
V. S. Verykios, A. K. Elmagarmid, and E. Houstis. Automating the approximate record-matching process. Inf. Sci., 126(1-4):83-89, 2002.
-
(2002)
Inf. Sci
, vol.126
, Issue.1-4
, pp. 83-89
-
-
Verykios, V.S.1
Elmagarmid, A.K.2
Houstis, E.3
-
31
-
-
77956549963
-
Industryscale duplicate detection
-
M. Weis, F. Naumann, U. Jehle, J. Lufter, and H. Schuster. Industryscale duplicate detection. In VLDB, 2008.
-
(2008)
VLDB
-
-
Weis, M.1
Naumann, F.2
Jehle, U.3
Lufter, J.4
Schuster, H.5
-
32
-
-
2942741943
-
Methods for record linkage and bayesian networks
-
Technical Report RRS2002/05, U.S. Census Bureau
-
W.Winkler. Methods for record linkage and bayesian networks. Technical Report RRS2002/05, U.S. Census Bureau, 2002.
-
(2002)
-
-
Winkler, W.1
-
33
-
-
2942709772
-
Methods for evaluating and creating data quality
-
W. E. Winkler. Methods for evaluating and creating data quality. Information Systems, 29(7):531-550, 2004.
-
(2004)
Information Systems
, vol.29
, Issue.7
, pp. 531-550
-
-
Winkler, W.E.1
-
34
-
-
79960444264
-
BigMatch: A program for extracting probable matches from a large file
-
Technical Report Computing 2007/01, U.S. Census Bureau
-
W. Yancey. BigMatch: A program for extracting probable matches from a large file. Technical Report Computing 2007/01, U.S. Census Bureau, 2007.
-
(2007)
-
-
Yancey, W.1
|