-
1
-
-
5444258997
-
A comparison of fast blocking methods for record linkage
-
Washington, DC, USA
-
Baxter, R. A., Christen, P. & Churches, T. (2003), A comparison of fast blocking methods for record linkage, in 'ACM SIGKDD'03 Workshop on Data Cleaning, Record Linkage, and Object Consolidation', Washington, DC, USA, pp. 25- 27.
-
(2003)
ACM SIGKDD03 Workshop on Data Cleaning, Record Linkage, and Object Consolidation
, pp. 25-27
-
-
Baxter, R.A.1
Christen, P.2
Churches, T.3
-
2
-
-
77952372966
-
Adaptive duplicate detection using learnable string similarity measures
-
Bilenko, M. & Mooney, R. J. (2003), Adaptive duplicate detection using learnable string similarity measures, in 'Proceedings of ACM SIGKDD', ACM Press, Washington DC, pp. 39-48.
-
(2003)
Proceedings of ACM SIGKDD, ACM Press, Washington DC
, pp. 39-48
-
-
Bilenko, M.1
Mooney, R.J.2
-
3
-
-
33846428768
-
New South Wales mothers and babies
-
Centre for Epidemiology Research NSW Department of Health (2001)
-
Centre for Epidemiology and Research, NSW Department of Health (2001), 'New South Wales mothers and babies 2001', NSW Public Health Bull 13:S-4
-
(2001)
NSW Public Health Bull
, vol.13
-
-
-
4
-
-
1142279457
-
Robust and efficient fuzzy match for online data cleaning
-
Chaudhuri, S., Ganjam, K., Ganti, V. & Motwani, R. (2003), Robust and efficient fuzzy match for online data cleaning, in 'Proceedings of ACM SIGMOD', San Diego, pp. 313-324.
-
(2003)
Proceedings of ACM SIGMOD, San Diego
, pp. 313-324
-
-
Chaudhuri, S.1
Ganjam, K.2
Ganti, V.3
Motwani, R.4
-
5
-
-
26444550791
-
Robust identification of fuzzy duplicates
-
Tokyo
-
Chaudhuri, S., Ganti, V. & Motwani, R. (2005), Robust identification of fuzzy duplicates, in 'Proceedings of the 21st international conference on data engineering (ICDE'05)', Tokyo, pp. 865- 876.
-
(2005)
Proceedings of the 21st international conference on data engineering (ICDE05)
, pp. 865-876
-
-
Chaudhuri, S.1
Ganti, V.2
Motwani, R.3
-
6
-
-
26444478506
-
Probabilistic data generation for deduplication and data linkage
-
IDEAL05, Springer LNCS, Brisbane
-
Christen, P. (2005), Probabilistic data generation for deduplication and data linkage, in 'IDEAL'05', Springer LNCS 3578, Brisbane, pp. 109-116.
-
(2005)
, vol.3578
, pp. 109-116
-
-
Christen, P.1
-
8
-
-
7444251738
-
Febrl - A parallel open source data linkage system
-
Christen, P., Churches, T. & Hegland, M. (2004), Febrl - A parallel open source data linkage system, in 'Proceedings of the 8th PAKDD', pp. 638-647.
-
(2004)
Proceedings of the 8th PAKDD
, pp. 638-647
-
-
Christen, P.1
Churches, T.2
Hegland, M.3
-
10
-
-
17744364120
-
Clustering by compression
-
Cilibrasi, R. & Vitanyi, P. (2005), Clustering by compression, in 'IEEE Trans. Information Theory', Vol. 51, pp. 1523- 1545.
-
(2005)
IEEE Trans. Information Theory
, vol.51
, pp. 1523-1545
-
-
Cilibrasi, R.1
Vitanyi, P.2
-
11
-
-
0032091575
-
Integration of heterogeneous databases without common domains using queries based on textual similarity
-
Cohen, W. W. (1998), Integration of heterogeneous databases without common domains using queries based on textual similarity, in 'Proceedings of ACM SIGMOD', Seattle, pp. 201-212.
-
(1998)
Proceedings of ACM SIGMOD, Seattle
, pp. 201-212
-
-
Cohen, W.W.1
-
12
-
-
11144240583
-
A comparison of string distance metrics for name-matching tasks
-
Cohen, W. W., Ravikumar, P. & Fienberg, S. (2003), A comparison of string distance metrics for name-matching tasks, in 'Proceedings of IJCAI-03 workshop on information integration on the Web (IIWeb-03)', Acapulco, pp. 73-78.
-
(2003)
Proceedings of IJCAI-03 workshop on information integration on the Web (IIWeb-03), Acapulco
, pp. 73-78
-
-
Cohen, W.W.1
Ravikumar, P.2
Fienberg, S.3
-
13
-
-
0036203458
-
TAILOR: A record linkage toolbox
-
San Jose
-
Elfeky, M. G., Verykios, V. S. & Elmagarmid, A. K. (2002), TAILOR: A record linkage toolbox, in 'Proceedings of ICDE', San Jose, pp. 17-28.
-
(2002)
Proceedings of ICDE
, pp. 17-28
-
-
Elfeky, M.G.1
Verykios, V.S.2
Elmagarmid, A.K.3
-
14
-
-
84947399464
-
A theory for record linkage
-
Fellegi, I. P. & Sunter, A. B. (1969), A theory for record linkage, in 'Journal of the American Statistical Association', Vol. 64, pp. 1183-1210.
-
(1969)
Journal of the American Statistical Association
, vol.64
, pp. 1183-1210
-
-
Fellegi, I.P.1
Sunter, A.B.2
-
15
-
-
1642332418
-
Methods for automatic record matching and linking and their use in national statistics
-
Gill, L. (2001), Methods for automatic record matching and linking and their use in national statistics, in 'National Statistics Methodology Series', number 25.
-
(2001)
National Statistics Methodology Series
, Issue.25
-
-
Gill, L.1
-
16
-
-
0021938963
-
Clustering to minimize the maximum intercluster distance
-
Gonzalez, T. F. (1985), Clustering to minimize the maximum intercluster distance, in 'Theoretical Computer Science', Vol. 38, pp. 293-306.
-
(1985)
Theoretical Computer Science
, vol.38
, pp. 293-306
-
-
Gonzalez, T.F.1
-
17
-
-
84870554553
-
-
AusDM 2004, Springer LNAI, 3755, Cairns, Australia
-
Gu, L. & Baxter, R. (2004a), Decision models for record linkage, in 'AusDM 2004, Springer LNAI 3755', Cairns, Australia, pp. 146-160.
-
(2004)
Decision models for record linkage
, pp. 146-160
-
-
Gu, L.1
Baxter, R.2
-
19
-
-
0003585297
-
-
Morgan Kaufmann, San Fransisco, CA
-
Han, J. & Kamber, M. (2001), Data Mining: Concepts and Techniques, Morgan Kaufmann, San Fransisco, CA.
-
(2001)
Data Mining: Concepts and Techniques
-
-
Han, J.1
Kamber, M.2
-
20
-
-
10644281769
-
Towards parameter-free data mining
-
Keogh, E., Lonardi, S. & Ratanamahatana, C. (2004), Towards parameter-free data mining, in '2004 ACM SIGKDD international conference on knowledge discovery and data mining', pp. 206- 215.
-
(2004)
2004 ACM SIGKDD international conference on knowledge discovery and data mining, pp
, pp. 206-215
-
-
Keogh, E.1
Lonardi, S.2
Ratanamahatana, C.3
-
21
-
-
0027657540
-
Computation of normalized edit distance and applications
-
Marzal, A. & Vidal, E. (1993), 'Computation of normalized edit distance and applications.', IEEE Trans. Pattern Anal. Mach. Intell. 15(9), 926- 932.
-
(1993)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.15
, Issue.9
, pp. 926-932
-
-
Marzal, A.1
Vidal, E.2
-
22
-
-
84870484308
-
-
MatchWare Technologies, Kennebunk, Maine
-
MatchWare Technologies (1998), AutoStan and AutoMatch, User's Manuals, Kennebunk, Maine
-
(1998)
AutoStan and AutoMatch Users Manuals
-
-
-
23
-
-
0034592784
-
Efficient clustering of high-dimensional data sets with application to reference matching
-
Boston
-
McCallum, A., Nigam, K. & Ungar, L. (2000), Efficient clustering of high-dimensional data sets with application to reference matching, in 'Proceedings of ACM SIGKDD', Boston, pp. 169- 178.
-
(2000)
Proceedings of ACM SIGKDD
, pp. 169-178
-
-
McCallum, A.1
Nigam, K.2
Ungar, L.3
-
26
-
-
0037867900
-
Two approaches to handling noisy variation in text mining
-
Sydney
-
Nahm, U., Bilenko, M. & Mooney, R. (2002), Two approaches to handling noisy variation in text mining, in 'Proceedings of the ICML-2002 workshop on text learning (TextML'2002)', Sydney, pp. 18-27.
-
(2002)
Proceedings of the ICML-2002 workshop on text learning (TextML2002)
, pp. 18-27
-
-
Nahm, U.1
Bilenko, M.2
Mooney, R.3
-
27
-
-
26844557708
-
A hierarchical graphical model for record linkage
-
Banff, Canada
-
Ravikumar, P. & Cohen, W. W. (2004), A hierarchical graphical model for record linkage, in 'roc. of the 20th Conference on Uncertainty in Artificial Intelligence', Banff, Canada, pp. 454-461.
-
(2004)
roc. of the 20th Conference on Uncertainty in Artificial Intelligence
, pp. 454-461
-
-
Ravikumar, P.1
Cohen, W.W.2
-
28
-
-
0242456811
-
Interactive deduplication using active learning
-
ACM Press, Edmonton
-
Sarawagi, S. & Bhamidipaty, A. (2002), Interactive deduplication using active learning, in 'Proceedings of ACM SIGKDD', ACM Press, Edmonton, pp. 269-278.
-
(2002)
Proceedings of ACM SIGKDD
, pp. 269-278
-
-
Sarawagi, S.1
Bhamidipaty, A.2
-
29
-
-
0242456803
-
Learning domain-independent string transformation weights for high accuracy object identification
-
Edmonton
-
Tejada, S., Knoblock, C. & Minton, S. (2002), Learning domain-independent string transformation weights for high accuracy object identification, in 'Proceedings of ACM SIGKDD', Edmonton, pp. 350-359.
-
(2002)
Proceedings of ACM SIGKDD
, pp. 350-359
-
-
Tejada, S.1
Knoblock, C.2
Minton, S.3
-
31
-
-
0008976521
-
String comparator metrics and enhanced decision rules in the fellegi-sunter model of record linkage
-
American Statistical Association
-
Winkler, W. E. (1990), String comparator metrics and enhanced decision rules in the fellegi-sunter model of record linkage, in 'Section on Survey Research Methods', American Statistical Association, pp. 354-359.
-
(1990)
Section on Survey Research Methods
, pp. 354-359
-
-
Winkler, W.E.1
-
34
-
-
0003957032
-
-
2nd edn, Morgan Kaufmann, San Francisco
-
Witten, I. H. & Frank, E. (2005), Data Mining: Practical machine learning tools and techniques, 2nd edn, Morgan Kaufmann, San Francisco.
-
(2005)
Data Mining: Practical machine learning tools and techniques
-
-
Witten, I.H.1
Frank, E.2
|