-
2
-
-
84870523138
-
-
Mondial database: http://www.dbis.informatik.unigoettingen.de/mondial.
-
Mondial Database
-
-
-
3
-
-
2342576574
-
Eliminating fuzzy duplicates in data warehouses
-
ANANTHAKRISHNA, R., CHAUDHURI, S., AND GANTI, V. Eliminating fuzzy duplicates in data warehouses. In Proceedings of the 28th International Conference on Very Large Databases (VLDB-2002) (Hong Kong, China, 2002).
-
Proceedings of the 28th International Conference on Very Large Databases (VLDB-2002) (Hong Kong, China, 2002)
-
-
Ananthakrishna, R.1
Chaudhuri, S.2
Ganti, V.3
-
5
-
-
84958759968
-
String matching with metric trees using an approximate distance
-
BARTOLINI, I., CIACCIA, P., AND PATELLA, M. String matching with metric trees using an approximate distance. In Proceedings of the 9th International Symposium on String Precessing and Information Retrieval (SPIRE-2002) (Belo Horizonte, Brazil, 2002), pp. 271-283.
-
(2002)
Proceedings of the 9th International Symposium on String Precessing and Information Retrieval (SPIRE-2002) Belo Horizonte, Brazil
, pp. 271-283
-
-
Bartolini, I.1
Ciaccia, P.2
Patella, M.3
-
7
-
-
33646136110
-
Employing trainable string similarity metrics for information integration
-
BILENKO, M., AND MOONEY, R. J. Employing trainable string similarity metrics for information integration. In Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web (Acapulco, Mexico, Aug. 2003), pp. 67-72.
-
Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web (Acapulco, Mexico, Aug. 2003)
, pp. 67-72
-
-
Bilenko, M.1
Mooney, R.J.2
-
8
-
-
0344756845
-
Declarative data cleaning: Language, model, and algorithms
-
GALHARDAS, H., FLORESCU, D., SHASHA, D., SIMON, E., AND SAITA, C. Declarative data cleaning: Language, model, and algorithms. In Proceedings of the 27th International Conference on Very Large Databases (VLDB-2001) (Rome, Italy, 2001), pp. 371-380.
-
Proceedings of the 27th International Conference on Very Large Databases (VLDB-2001) (Rome, Italy, 2001)
, pp. 371-380
-
-
Galhardas, H.1
Florescu, D.2
Shasha, D.3
Simon, E.4
Saita, C.5
-
9
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
GRAVANO, L., IPEIROTIS, P., JAGADISH, H., KOUDAS, N., MUTHUKRISHNAN, S., AND SRIVASTAVA, D. Approximate string joins in a database (almost) for free. In Proceedings of the 27th International Conference on Very Large Databases (VLDB-2001) (Roma, Italy, 2001), pp. 491-500.
-
Proceedings of the 27th International Conference on Very Large Databases (VLDB-2001) (Roma, Italy, 2001)
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.2
Jagadish, H.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
10
-
-
84976856849
-
The merge/purge problem for large databases
-
HERNÁNDEZ, M. A., AND STOLFO, S. J. The merge/purge problem for large databases. In Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data (SIGMOD-95) (San Jose, CA, May 1995), pp. 127-138.
-
Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data (SIGMOD-95) (San Jose, CA, May 1995)
, pp. 127-138
-
-
Hernández, M.A.1
Stolfo, S.J.2
-
12
-
-
0028950887
-
Probabilistic linkage of large public health data files
-
JARO, M. A. Probabilistic linkage of large public health data files. Statistics in Medicine 14, 5-7, 491-498.
-
Statistics in Medicine
, vol.14
, Issue.5-7
, pp. 491-498
-
-
Jaro, M.A.1
-
13
-
-
4544289767
-
-
Tech. Rep. TR-DB-02-04 UCI ICS
-
JIN, L., LI, C., AND MEHROTRA, S. Efficient similarity string joins in large data sets. Tech. Rep. TR-DB-02-04, UCI ICS, 2002.
-
(2002)
Efficient Similarity String Joins in Large Data Sets
-
-
Jin, L.1
Li, C.2
Mehrotra, S.3
-
14
-
-
84943425383
-
Efficient record linkage in large data sets
-
JIN, L., LI, C., AND MEHROTRA, S. Efficient record linkage in large data sets. In Procceedings of the 8th International Conference on Database Systems for Advanced Applications (DASFAA-03) (Kyoto, Japan, 2003), pp. 137-.
-
Procceedings of the 8th International Conference on Database Systems for Advanced Applications (DASFAA-03) (Kyoto, Japan, 2003)
, pp. 137
-
-
Jin, L.1
Li, C.2
Mehrotra, S.3
-
15
-
-
29844445531
-
Efficient similarity search for hierarchical data in large databases
-
KAILING, K., KRIEGEL, H.-P., SCHNAUER, S., AND SEIDEL, T. Efficient similarity search for hierarchical data in large databases. In Proceedings of the 9th International Conference on Extending Database Technology (EDBT-2004) (Heraclion, Crete, 2004), pp. 676-693.
-
Proceedings of the 9th International Conference on Extending Database Technology (EDBT-2004) (Heraclion, Crete, 2004)
, pp. 676-693
-
-
Kailing, K.1
Kriegel, H.-P.2
Schnauer, S.3
Seidel, T.4
-
16
-
-
0027189241
-
Entity identification in database integration
-
LIM, E.-P., SRIVASTAVA, J., PRABHAKAR, S., AND RICHARDSON, J. Entity identification in database integration. In Proceedings of the 9th International Conference on Data Engineering (ICDE-93) (Vienna, Austria, April 1993), pp. 294-301.
-
Proceedings of the 9th International Conference on Data Engineering (ICDE-93) (Vienna, Austria, April 1993)
, pp. 294-301
-
-
Lim, E.-P.1
Srivastava, J.2
Prabhakar, S.3
Richardson, J.4
-
17
-
-
85018108837
-
The field matching problem: Algorithms and applications
-
MONGE, A. E., AND ELKAN, C. P. The field matching problem: Algorithms and applications. In Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96) (Portland, OR, August 1996), pp. 267-270.
-
Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining (KDD-96) (Portland, OR, August 1996)
, pp. 267-270
-
-
Monge, A.E.1
Elkan, C.P.2
-
18
-
-
0004043396
-
An efficient domain-independent algorithm for detecting approximately duplicate database records
-
MONGE, A. E., AND ELKAN, C. P. An efficient domain-independent algorithm for detecting approximately duplicate database records. In Proceedings of the SIGMOD 1997 Workshop on Research Issues on Data Mining and Knowledge Discovery (Tuscon, AZ, May 1997), pp. 23-29.
-
Proceedings of the SIGMOD 1997 Workshop on Research Issues on Data Mining and Knowledge Discovery (Tuscon, AZ, May 1997)
, pp. 23-29
-
-
Monge, A.E.1
Elkan, C.P.2
-
19
-
-
0345566149
-
A guided tour to approximate string matching
-
NAVARRO, G. A guided tour to approximate string matching. ACM Computing Surveys, Volume 33, pages 31-88 (2003).
-
(2003)
ACM Computing Surveys
, vol.33
, pp. 31-88
-
-
Navarro, G.1
-
20
-
-
29844443462
-
Record linkage for genealogical databases
-
QUASS, D., AND STARKEY, P. Record linkage for genealogical databases. In Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation (Washington, DC, 2003), pp. 40-42.
-
Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation (Washington, DC, 2003)
, pp. 40-42
-
-
Quass, D.1
Starkey, P.2
-
21
-
-
0002490026
-
Data cleaning: Problems and current approaches
-
RAHM, E., AND DO, H. H. Data cleaning: Problems and current approaches. IEEE Data Engineering Bulletin, Volume 23, pages 3-13 (2000).
-
(2000)
IEEE Data Engineering Bulletin
, vol.23
, pp. 3-13
-
-
Rahm, E.1
Do, H.H.2
-
22
-
-
84944315993
-
Potter's wheel: An interactive data cleaning system
-
RAMAN, V., AND HELLERSTEIN, J. M. Potter's wheel: An interactive data cleaning system. In Proceedings of 27th International Conference on Very Large Databases (VLDB-2001) (Rome, Italy, 2001), pp. 381-390.
-
Proceedings of 27th International Conference on Very Large Databases (VLDB-2001) (Rome, Italy, 2001)
, pp. 381-390
-
-
Raman, V.1
Hellerstein, J.M.2
-
25
-
-
2942747394
-
-
Tech. rep., Statistical Research Division, U.S. Census Bureau, Washington, DC
-
WINKLER, W. E. Advanced methods for record linkage. Tech. rep., Statistical Research Division, U.S. Census Bureau, Washington, DC, 1994.
-
(1994)
Advanced Methods for Record Linkage
-
-
Winkler, W.E.1
-
26
-
-
77049110977
-
Data cleaning methods
-
WINKLER, W. E. Data cleaning methods. In Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation (Washington, DC, 2003), pp. 1-6.
-
Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation (Washington, DC, 2003)
, pp. 1-6
-
-
Winkler, W.E.1
-
27
-
-
1142283602
-
Duplicate removal in information dissemination
-
YAN, T. W., AND GARCIA-MOLINA, H. Duplicate removal in information dissemination. In Proceedings of 21th International Conference on Very Large Data Bases (VLDB-95) (Zurich, Switzerland, 1995), pp. 66-77.
-
Proceedings of 21th International Conference on Very Large Data Bases (VLDB-95) (Zurich, Switzerland, 1995)
, pp. 66-77
-
-
Yan, T.W.1
Garcia-Molina, H.2
|