-
1
-
-
2342576574
-
Eliminating fuzzy duplicates in data warehouses
-
R. Ananthakrishna, S. Chaudhuri, and V. Ganti. Eliminating fuzzy duplicates in data warehouses. In Proceedings of the 28th International Conference on Very Large Databases (VLDB-2002), Hong Kong, China, 2002.
-
Proceedings of the 28th International Conference on Very Large Databases (VLDB-2002), Hong Kong, China, 2002
-
-
Ananthakrishna, R.1
Chaudhuri, S.2
Ganti, V.3
-
3
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
San Diego, CA
-
S. Chaudhuri, K. Ganjam, V. Ganti, and R. Motwani. Robust and efficient fuzzy match for online data cleaning. In Proceedings of the 2003 ACM SIGMOD international conference on on Management of data, pages 313-324, San Diego, CA, 2003.
-
(2003)
Proceedings of the 2003 ACM SIGMOD International Conference on on Management of Data
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
4
-
-
77954016315
-
-
Powerpoint presentation, available at
-
W. Cohen. Overview of record linkage methods. Powerpoint presentation, available at http://www-2.cs.cmu.edu/wcohen/Matching-1.ppt.
-
Overview of Record Linkage Methods
-
-
Cohen, W.1
-
5
-
-
0000666461
-
Data integration using similarity joins and a word-based information representation language
-
W. Cohen. Data integration using similarity joins and a word-based information representation language. ACM Transactions on Information Systems, 18:288-321, 2000.
-
(2000)
ACM Transactions on Information Systems
, vol.18
, pp. 288-321
-
-
Cohen, W.1
-
7
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
Acapulco, Mexico, Aug.
-
W. W. Cohen, P. Ravikumar, and S. E. Fienberg. A comparison of string distance metrics for name-matching tasks. In Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web, pages 73-78, Acapulco, Mexico, Aug. 2003.
-
(2003)
Proceedings of the IJCAI-2003 Workshop on Information Integration on the Web
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
10
-
-
0031622479
-
CiteSeer: An automatic citation indexing system
-
Pittsburgh, PA, June 23-26
-
C. L. Giles, K. Bollacker, and S. Lawrence. CiteSeer: An automatic citation indexing system. In Proceedings of the Third ACM Conference on Digital Libraries, pages 89-98, Pittsburgh, PA, June 23-26 1998.
-
(1998)
Proceedings of the Third ACM Conference on Digital Libraries
, pp. 89-98
-
-
Giles, C.L.1
Bollacker, K.2
Lawrence, S.3
-
11
-
-
33845350152
-
-
Technical Report 03/83, CSIRO Mathematical and Information Sciences, Canberra, Australia, April
-
L. Gu, R. Baxter, D. Vickers, and C. Rainsford. Record linkage: Current practice and future directions. Technical Report 03/83, CSIRO Mathematical and Information Sciences, Canberra, Australia, April 2003.
-
(2003)
Record Linkage: Current Practice and Future Directions
-
-
Gu, L.1
Baxter, R.2
Vickers, D.3
Rainsford, C.4
-
16
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston, MA, Aug
-
A. K. McCallum, K. Nigam, and L. Ungar. Efficient clustering of high-dimensional data sets with application to reference matching. In Proceedings of the Sixth International Conference On Knowledge Discovery and Data Mining (KDD-2000), pages 169-178, Boston, MA, Aug. 2000.
-
(2000)
Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining (KDD-2000)
, pp. 169-178
-
-
McCallum, A.K.1
Nigam, K.2
Ungar, L.3
-
20
-
-
0345566149
-
A guided tour to approximate string matching
-
G. Navarro. A guided tour to approximate string matching. ACM Computing Surveys, 33(1):31-88, 2001.
-
(2001)
ACM Computing Surveys
, vol.33
, Issue.1
, pp. 31-88
-
-
Navarro, G.1
-
21
-
-
0001592068
-
Automatic linkage of vital records
-
H. Newcombe, J. Kennedy, S. Axford, and A. James. Automatic linkage of vital records. Science, 130:954-959, 1959.
-
(1959)
Science
, vol.130
, pp. 954-959
-
-
Newcombe, H.1
Kennedy, J.2
Axford, S.3
James, A.4
-
22
-
-
84898987614
-
Identity uncertainty and citation matching
-
MIT Press
-
H. Pasula, B. Marthi, B. Milch, S. Russell, and I. Shpitser. Identity uncertainty and citation matching. In Advances in Neural Information Processing Systems 15. MIT Press, 2003.
-
(2003)
Advances in Neural Information Processing Systems 15
-
-
Pasula, H.1
Marthi, B.2
Milch, B.3
Russell, S.4
Shpitser, I.5
-
25
-
-
0035545848
-
Learning object identification rules for information integration
-
S. Tejada, C. A. Knoblock, and S. Minton. Learning object identification rules for information integration. Information Systems Journal, 26(8):635-656, 2001.
-
(2001)
Information Systems Journal
, vol.26
, Issue.8
, pp. 635-656
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
-
27
-
-
38449092618
-
-
Technical report, Statistical Research Division, U.S. Census Bureau, Washington, DC
-
W. E. Winkler. Improved decision rules in the fellegi-sunter model of record linkage. Technical report, Statistical Research Division, U.S. Census Bureau, Washington, DC, 1993.
-
(1993)
Improved Decision Rules in the Fellegi-sunter Model of Record Linkage
-
-
Winkler, W.E.1
-
28
-
-
0012866045
-
-
Technical report, Statistical Research Division, U.S. Census Bureau, Washington, DC
-
W. E. Winkler. The state of record linkage and current research problems. Technical report, Statistical Research Division, U.S. Census Bureau, Washington, DC, 1999.
-
(1999)
The State of Record Linkage and Current Research Problems
-
-
Winkler, W.E.1
-
29
-
-
2942741943
-
-
Technical report, Statistical Research Division, U.S. Census Bureau, Washington, DC
-
W. E. Winkler. Methods for record linkage and Bayesian networks. Technical report, Statistical Research Division, U.S. Census Bureau, Washington, DC, 2002.
-
(2002)
Methods for Record Linkage and Bayesian Networks
-
-
Winkler, W.E.1
|