-
1
-
-
35048828853
-
Automated record matching in cooperative information systems
-
Siena, Italy, January
-
Bertolazzi, P., De Santis, L. and Scannapieco, M.: Automated record matching in cooperative information systems. Proceedings of the international workshop on data quality in cooperative information systems, Siena, Italy, January 2003.
-
(2003)
Proceedings of the International Workshop on Data Quality in Cooperative Information Systems
-
-
Bertolazzi, P.1
De Santis, L.2
Scannapieco, M.3
-
2
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
Washington DC, August
-
Bilenko, M. and Mooney, R.J.: Adaptive duplicate detection using learnable string similarity measures. Proceedings of the 9th ACM SIGKDD conference, Washington DC, August 2003.
-
(2003)
Proceedings of the 9th ACM SIGKDD Conference
-
-
Bilenko, M.1
Mooney, R.J.2
-
3
-
-
9444249661
-
On evaluation and training-set construction for duplicate detection
-
Washington DC, August
-
Bilenko, M. and Mooney, R.J.: On evaluation and training-set construction for duplicate detection. Proceedings of the KDD-2003 workshop on data cleaning, record linkage, and object consolidation, Washington DC, August 2003.
-
(2003)
Proceedings of the KDD-2003 Workshop on Data Cleaning, Record Linkage, and Object Consolidation
-
-
Bilenko, M.1
Mooney, R.J.2
-
4
-
-
0003408496
-
-
University of California, Irvine, Dept. of Information and Computer Sciences
-
Blake, C.L. and Merz, C.J.: UCI Repository of machine learning databases. University of California, Irvine, Dept. of Information and Computer Sciences, http://www.ics.uci.edu/~mlearn/MLRepository.html
-
UCI Repository of Machine Learning Databases
-
-
Blake, C.L.1
Merz, C.J.2
-
5
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
Tokyo, April
-
Chaudhuri, S., Ganti, V. and Motwani, R.: Robust identification of fuzzy duplicates. Proceedings of the 21st international conference on data engineering, Tokyo, April 2005.
-
(2005)
Proceedings of the 21st International Conference on Data Engineering
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
6
-
-
33846458605
-
A parallel open source data linkage system
-
Sydney, May
-
Christen, P., Churches, T. and Hegland, M.: A parallel open source data linkage system. Proceedings of the 8th PAKDD, Sydney, May 2004.
-
(2004)
Proceedings of the 8th PAKDD
-
-
Christen, P.1
Churches, T.2
Hegland, M.3
-
7
-
-
0242540438
-
Learning to match and cluster large high-dimensional data sets for data integration
-
Edmonton, July
-
Cohen, W.W. and Richman, J.: Learning to match and cluster large high-dimensional data sets for data integration. Proceedings of the 8th ACM SIGKDD conference, Edmonton, July 2002.
-
(2002)
Proceedings of the 8th ACM SIGKDD Conference
-
-
Cohen, W.W.1
Richman, J.2
-
8
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
Acapulco, August
-
Cohen, W.W., Ravikumar, P. and Fienberg, S.E.: A comparison of string distance metrics for name-matching tasks. Proceedings of IJCAI-03 workshop on information integration on the Web (IIWeb-03), pp. 73-78, Acapulco, August 2003.
-
(2003)
Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03)
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.E.3
-
9
-
-
84941869105
-
A technique for computer detection and correction of spelling errors
-
March
-
Damerau, F.: A technique for computer detection and correction of spelling errors. Communications of the ACM, vol. 7, no. 3, pp. 171-176, March 1964.
-
(1964)
Communications of the ACM
, vol.7
, Issue.3
, pp. 171-176
-
-
Damerau, F.1
-
10
-
-
0036203458
-
TAILOR: A record linkage toolbox
-
San Jose, USA, March
-
Elfeky, M.G., Verykios, V.S. and Elmagarmid, A.K.: TAILOR: A record linkage toolbox. Proceedings of the ICDE' 2002, San Jose, USA, March 2002.
-
(2002)
Proceedings of the ICDE' 2002
-
-
Elfeky, M.G.1
Verykios, V.S.2
Elmagarmid, A.K.3
-
13
-
-
84976659284
-
Approximate string matching
-
December
-
Hall, P.A.V. and Bowling, G.R.: Approximate string matching. ACM computing surveys, vol. 12, no. 4, pp. 381-402, December 1980.
-
(1980)
ACM Computing Surveys
, vol.12
, Issue.4
, pp. 381-402
-
-
Hall, P.A.V.1
Bowling, G.R.2
-
15
-
-
0026979939
-
Techniques for automatically correcting words in text
-
December
-
Kukich, K.: Techniques for automatically correcting words in text. ACM computing surveys, vol. 24, no. 4, pp. 377-439, December 1992.
-
(1992)
ACM Computing Surveys
, vol.24
, Issue.4
, pp. 377-439
-
-
Kukich, K.1
-
16
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston, August
-
McCallum, A., Nigam, K. and Ungar, L.H.: Efficient clustering of high-dimensional data sets with application to reference matching. Proceedings of the 6th ACM SIGKDD conference, pp. 169-178, Boston, August 2000.
-
(2000)
Proceedings of the 6th ACM SIGKDD Conference
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.H.3
-
17
-
-
0037867900
-
Two approaches to handling noisy variation in text mining
-
Sydney, Australia, July
-
Nahm, U.Y, Bilenko M. and Mooney, R.J.: Two approaches to handling noisy variation in text mining. Proceedings of the ICML-2002 workshop on text learning (TextML'2002), pp. 18-27, Sydney, Australia, July 2002.
-
(2002)
Proceedings of the ICML-2002 Workshop on Text Learning (TextML'2002)
, pp. 18-27
-
-
Nahm, U.Y.1
Bilenko, M.2
Mooney, R.J.3
-
18
-
-
33746091742
-
New South Wales mothers and babies 2001
-
Centre for Epidemiology and Research, NSW Department of Health. New South Wales Mothers and Babies 2001. NSW Public Health Bull 2002; 13(S-4).
-
NSW Public Health Bull 2002
, vol.13
, Issue.S-4
-
-
-
19
-
-
84976776121
-
Automatic spelling correction in scientific and scholarly text
-
April
-
Pollock, J.J. and Zamora, A.: Automatic spelling correction in scientific and scholarly text. Communications of the ACM, vol. 27, no. 4, pp. 358-368, April 1984.
-
(1984)
Communications of the ACM
, vol.27
, Issue.4
, pp. 358-368
-
-
Pollock, J.J.1
Zamora, A.2
-
22
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
Edmonton, July
-
Tejada, S., Knoblock, C.A. and Minton, S.: Learning domain-independent string transformation weights for high accuracy object identification. Proceedings of the 8th ACM SIGKDD conference, Edmonton, July 2002.
-
(2002)
Proceedings of the 8th ACM SIGKDD Conference
-
-
Tejada, S.1
Knoblock, C.A.2
Minton, S.3
-
24
-
-
33846462437
-
String edit analysis for merging databases
-
Boston, August
-
Zhu, J.J., and Ungar, L.H.: String edit analysis for merging databases. KDD-2000 workshop on text mining, held at the 6th ACM SIGKDD conference, Boston, August 2000.
-
(2000)
KDD-2000 Workshop on Text Mining, Held at the 6th ACM SIGKDD Conference
-
-
Zhu, J.J.1
Ungar, L.H.2
|