-
2
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
Baxter, R., Christen, P., Churches, T.: A comparison of fast blocking methods for record linkage. In: ACM SIGKDD 2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation, pp. 25-27 (2003)
-
(2003)
ACM SIGKDD 2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation
, pp. 25-27
-
-
Baxter, R.1
Christen, P.2
Churches, T.3
-
3
-
-
84976721642
-
Data manipulation in heterogeneous databases
-
Chatterjee, A., Segev, A.: Data manipulation in heterogeneous databases. ACM SIGMOD Record 20, 64-68 (1991)
-
(1991)
ACM SIGMOD Record
, vol.20
, pp. 64-68
-
-
Chatterjee, A.1
Segev, A.2
-
4
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
Chaudhuri, S., Ganjam, K., Ganti, V., Motwani, R.: Robust and efficient fuzzy match for online data cleaning. In: SIGMOD 2003, pp. 313-324 (2003)
-
(2003)
SIGMOD 2003
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
5
-
-
78049362549
-
Towards scalable real-time entity resolution using a similarity-aware inverted index approach
-
Christen, P., Gayler, R.: Towards scalable real-time entity resolution using a similarity-aware inverted index approach. Proceedings of AusDM 2008, Glenelg, Adelaide 87, 30-39 (2008)
-
(2008)
Proceedings of AusDM 2008, Glenelg, Adelaide
, vol.87
, pp. 30-39
-
-
Christen, P.1
Gayler, R.2
-
7
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
Cohen, W., Richman, J.: Learning to match and cluster large high-dimensional data sets for data integration. In: SIGKDD 2002 (2002)
-
(2002)
SIGKDD 2002
-
-
Cohen, W.1
Richman, J.2
-
9
-
-
84944318804
-
Approximate string joins in a database (almost) for free
-
Gravano, L., Ipeirotis, P.G., Jagadish, H.V., Koudas, N., Muthukrishnan, S., Srivastava, D.: Approximate string joins in a database (almost) for free. In: VLDB 2001, pp. 491-500 (2001)
-
(2001)
VLDB 2001
, pp. 491-500
-
-
Gravano, L.1
Ipeirotis, P.G.2
Jagadish, H.V.3
Koudas, N.4
Muthukrishnan, S.5
Srivastava, D.6
-
10
-
-
78049381653
-
-
John Wiley and Sons, Chichester
-
Han, J., Kamber, M.: The data warehouse ETL toolkit: Practical techniques for extracting, cleaning, conforming, and delivering data. John Wiley and Sons, Chichester (2004)
-
(2004)
The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
-
-
Han, J.1
Kamber, M.2
-
12
-
-
84950419860
-
Advances in record linkage methodology as applied to matching the 1985 census of tampa, florida
-
Jaro, M.A.: Advances in record linkage methodology as applied to matching the 1985 census of tampa, florida. Journal of the American Statistical Society 84, 414-420 (1989)
-
(1989)
Journal of the American Statistical Society
, vol.84
, pp. 414-420
-
-
Jaro, M.A.1
-
13
-
-
0000390142
-
Binary codes capable of correcting deletions, insertions and reversals
-
Levenshtein, V.I.: Binary codes capable of correcting deletions, insertions and reversals. Doklady Akademii Nauk SSSR 163, 845-848 (1965)
-
(1965)
Doklady Akademii Nauk SSSR
, vol.163
, pp. 845-848
-
-
Levenshtein, V.I.1
-
15
-
-
0027681165
-
Suffix arrays: A new method for on-line string searches
-
Manber, U., Myers, G.: Suffix arrays: a new method for on-line string searches. SIAM Journal on Computing 22, 935-948 (1993)
-
(1993)
SIAM Journal on Computing
, vol.22
, pp. 935-948
-
-
Manber, U.1
Myers, G.2
-
16
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
McCallum, A., Nigam, K., Ungar, L.H.: Efficient clustering of high-dimensional data sets with application to reference matching. In: ACM SIGKDD, pp. 169-178 (2000)
-
(2000)
ACM SIGKDD
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
-
17
-
-
0004043396
-
An efficient domain-independent algorithm for detecting approximately duplicate database records
-
Monge, A.E., Elkan, C.P.: An efficient domain-independent algorithm for detecting approximately duplicate database records. In: Proceedings of DMKD 1997, pp. 23-29 (1997)
-
(1997)
Proceedings of DMKD 1997
, pp. 23-29
-
-
Monge, A.E.1
Elkan, C.P.2
-
20
-
-
84947737449
-
On using q-gram locations in approximate string matching
-
Spirakis, P.G. (ed.) ESA 1995. Springer, Heidelberg
-
Sutinen, E., Tarhio, J.: On using q-gram locations in approximate string matching. In: Spirakis, P.G. (ed.) ESA 1995. LNCS, vol. 979, pp. 327-340. Springer, Heidelberg (1995)
-
(1995)
LNCS
, vol.979
, pp. 327-340
-
-
Sutinen, E.1
Tarhio, J.2
-
21
-
-
0027113212
-
Approximate string matching with q-grams and maximal matches
-
Ukkonen, E.: Approximate string matching with q-grams and maximal matches. Theoretical Computer Science 92, 191-211 (1992)
-
(1992)
Theoretical Computer Science
, vol.92
, pp. 191-211
-
-
Ukkonen, E.1
-
22
-
-
0010111194
-
A binary n-gram technique for automatic correction of substitution, deletion, insertion, and reversal errors in words
-
Ullman, J.: A binary n-gram technique for automatic correction of substitution, deletion, insertion, and reversal errors in words. The Computer Journal 20, 141-147 (1977)
-
(1977)
The Computer Journal
, vol.20
, pp. 141-147
-
-
Ullman, J.1
-
23
-
-
2342503573
-
The state of record linkage and current research problems
-
Winkler, W.E.: The state of record linkage and current research problems. In: Statistics of Income Division (1999)
-
(1999)
Statistics of Income Division
-
-
Winkler, W.E.1
|